
Portuguese Vocabulary Profile: uma lista de vocabulário a aprendentes do PL2/PLE, baseada nos corpora de aprendentes e de livros de ensino
Author(s) -
Shintaro Torigoe
Publication year - 2017
Publication title -
revista da associação portuguesa de linguística
Language(s) - English
Resource type - Journals
ISSN - 2183-9077
DOI - 10.26334/2183-9077/rapln3ano2017a20
Subject(s) - vocabulary , portuguese , computer science , brazilian portuguese , word (group theory) , word list , norm (philosophy) , natural language processing , artificial intelligence , linguistics , philosophy , class (philosophy) , epistemology
This paper reports the second pilot study of the Portuguese Vocabulary Profile (PVP) project, a Portuguese vocabulary list for learners in Japan based on the Common European Framework of Reference for Languages. Inspired by the English Vocabulary Profile (Capel, 2010, 2012), the PVP takes a learner-centric approach. For this study, the author modified the first pilot version which was constructed solely from learner corpora (Torigoe, 2016a) by comparing it with a word list based on a corpus of Portuguese textbooks published in Japan. The result is a broadened vocabulary for both the elementary and intermediate levels. The major improvement is that some intuitively basic words, including numbers, months of the year, foods, and facilities, which had been previously categorized as intermediate or advanced level words or which were missing from the first version due to their low frequency were correctly categorized as the elementary level words. However, the norm of word classification remains somewhat arbitrary given that the small size of both the input (learner corpora) and the comparative data (textbook corpus) does not allow for the use of statistical methods with less frequent words.