z-logo
open-access-imgOpen Access
Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts
Author(s) -
An-Vinh Lương,
Diep Nguyen,
Điền Đinh
Publication year - 2020
Publication title -
acta linguistica asiatica
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.114
H-Index - 2
ISSN - 2232-3317
DOI - 10.4312/ala.10.2.127-142
Subject(s) - readability , vietnamese , computer science , natural language processing , artificial intelligence , part of speech , word (group theory) , sentence , word length , linguistics , decision tree , speech recognition , philosophy , programming language
The readability of the text plays a very important role in selecting appropriate materials for the level of the reader. Text readability in Vietnamese language has received a lot of attention in recent years, however, studies have mainly been limited to simple statistics at the level of a sentence length, word length, etc. In this article, we investigate the role of word-level grammatical characteristics in assessing the difficulty of texts in Vietnamese textbooks. We have used machine learning models (for instance, Decision Tree, K-nearest neighbor, Support Vector Machines, etc.) to evaluate the accuracy of classifying texts according to readability, using grammatical features in word level along with other statistical characteristics. Empirical results show that the presence of POS-level characteristics increases the accuracy of the classification by 2-4%.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here