
Crowdsourced-Data Normalization with Python and Pandas
Author(s) - Halle Burns
Publication year - 2021
Publication title - The Programming Historian
Language(s) - English
Resource type - Journals
ISSN - 2397-2068
DOI - 10.46430/phen0093
Subject(s) - python (programming language) , normalization (sociology) , crowdsourcing , computer science , pandas , database normalization , data science , artificial intelligence , world wide web , pattern recognition (psychology) , programming language , psychology , sociology , anthropology , psychiatry
Pandas is a popular and powerful package used in Python communities for data handling and analysis. This lesson describes crowdsourcing as a form of data creation and shows how pandas can be used to prepare a crowdsourced dataset for analysis. It covers managing duplicate and missing data and explains the difficulties of dealing with dates.
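
As a rough illustration of the three tasks named in the abstract (duplicates, missing data, dates), here is a minimal sketch. It is not taken from the lesson itself: the DataFrame, its column names (`item`, `place`, `date`), and its values are hypothetical, and the date format is assumed to be `YYYY-MM-DD`.

```python
import pandas as pd

# Hypothetical crowdsourced records; column names and values are invented for illustration.
df = pd.DataFrame({
    "item": ["Menu A", "Menu A", "Menu B", "Menu C"],
    "place": ["New York", "New York", None, "Boston"],
    "date": ["1900-04-15", "1900-04-15", "1901-13-01", None],
})

# Drop rows that exactly repeat an earlier row (the second "Menu A" entry).
df = df.drop_duplicates()

# Count missing values per column before deciding how to handle them.
missing_counts = df.isna().sum()

# Parse dates against an assumed YYYY-MM-DD format; entries that do not fit
# (such as month 13) become NaT instead of raising an error.
df["date"] = pd.to_datetime(df["date"], format="%Y-%m-%d", errors="coerce")
```

Coercing unparseable dates to `NaT` keeps the rest of the column usable, but it also hides the original malformed text, so it may be worth keeping a copy of the raw column when reviewing contributor input.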