Exploiting Syntactic Similarities for Preposition Error Corrections on Indonesian Sentences Written by Second Language Learner
Author(s) -
Budi Irmawati,
Hiroyuki Shindo,
Yūji Matsumoto
Publication year - 2016
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2016.04.052
Subject(s) - computer science , sentence , indonesian , natural language processing , artificial intelligence , similarity (geometry) , linguistics , speech recognition , philosophy , image (mathematics)
We propose a method to artificially generate training data to correct preposition errors in Indonesian sentences written by second language learners. Basically, we injected large size of native sentences with preposition errors learned from learners’ sentences. Our method copies a preposition error from a learner sentence to a native sentence by firstly calculating a syntactic similarity score between the native sentence and the learners’ sentence. Then, it chooses the preposition error from the learner sentence that has the highest syntactic similarity score to the native sentence, to replace the original preposition in the native sentence.Experimental results show that the preposition error correction model trained on the artificial data resulted from our method outperforms the correction model trained on the similar size of native data
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom