
COVID19.BR: A Dataset of Misinformation about COVID-19 in Brazilian Portuguese WhatsApp Messages
Author(s) -
Aline Maria Araújo Martins,
Lucas Manoel da Silva Cabral,
Pedro Jorge Chaves Mourão,
Ivandro Claudino de Sá,
Ângelo Brayner,
Javam C. Machado
Publication year - 2021
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/dsw.2021.17422
Subject(s) - misinformation , portuguese , context (archaeology) , covid-19 , set (abstract data type) , internet privacy , computer science , social media , brazilian portuguese , data set , world wide web , data science , computer security , artificial intelligence , medicine , geography , philosophy , linguistics , disease , archaeology , pathology , infectious disease (medical specialty) , programming language
Nowadays, our society suffers with a major issue that unfortunately is becoming more and more problematic, once again through social networks, that is the misinformation. The primary source of misinformation in Brazil is the messaging application WhatsApp. However, due to WhatsApp's private messaging nature, there still few misinformation data sets built specifically from this platform. In this context, building a data set of WhatsApp messages about COVID-19 in Brazilian Portuguese and label misinformation messages within it becomes a crucial challenge. In this work, we present the COVID-19.BR, a data set of WhatsApp messages about coronavirus in Brazilian Portuguese, collected from Brazilian public groups and manually labeled.