Premium
Cleaning data: guess the olympian
Author(s) -
Richards Kate,
Davies Neville
Publication year - 2012
Publication title -
teaching statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.425
H-Index - 13
eISSN - 1467-9639
pISSN - 0141-982X
DOI - 10.1111/j.1467-9639.2011.00495.x
Subject(s) - computer science , information retrieval , data science , data mining
This article tackles the problem of what should be done with real textual data that are contaminated by errors of recording, particularly when the data contain words that are misspelt, unintentionally or otherwise.