Native Language Identification With Classifier Stacking and Ensembles
Author(s) -
Shervin Malmasi,
Mark Dras
Publication year - 2018
Publication title -
computational linguistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.314
H-Index - 98
eISSN - 1530-9312
pISSN - 0891-2017
DOI - 10.1162/coli_a_00323
Subject(s) - computer science , classifier (uml) , stacking , ensemble learning , artificial intelligence , natural language processing , task (project management) , machine learning , pattern recognition (psychology) , physics , nuclear magnetic resonance , management , economics
Ensemble methods using multiple classifiers have proven to be the among the most successful approaches for the task of Native Language Identification (NLI), achieving the current state of the art. However, a systematic examination of ensemble methods for NLI has yet to be conducted. Additionally, deeper ensemble architectures such as classifier stacking have not been closely evaluated. We present a set of experiments using three ensemble-based models, testing each with multiple configurations and algorithms. This includes a rigorous application of meta-classification models for NLI, achieving state-of-the-art results on several large datasets, evaluated in both intra-corpus and cross-corpus modes.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom