Premium
What Kind of Data is it? Situating Sociolinguistic Corpora in Context
Author(s) -
Tagliamonte Sali A.
Publication year - 2014
Publication title -
language and linguistics compass
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 44
ISSN - 1749-818X
DOI - 10.1111/lnc3.12103
Subject(s) - categorization , context (archaeology) , computer science , linguistics , coding (social sciences) , data collection , sociology , data science , history , artificial intelligence , social science , philosophy , archaeology
In this paper, I discuss how sociolinguistic corpora can be compiled so as to document and maximize access to the context of its collection. This is no doubt a murky issue for the coding and categorization enterprise, but it is as critical as demographic information if we are going to be able to compare data sets from different communities, eras, or across research projects. However, how far does the researcher go in documenting this type of information? My goal will be to outline what I have found to be ‘best practice’ in my own research while at the same time highlighting issues and problems I have encountered along the way. I build on the foundations of earlier corpus‐building projects and on data arising from my own fieldwork conducted in the UK and Canada between 1995–2011.