A new training method for sequence data
Author(s) -
Lingfeng Niua,
Yong Shi
Publication year - 2010
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2010.04.269
Subject(s) - computer science , discriminative model , sequence (biology) , margin (machine learning) , task (project management) , artificial intelligence , feature (linguistics) , function (biology) , binary number , data mining , training set , scale (ratio) , machine learning , training (meteorology) , pattern recognition (psychology) , binary classification , support vector machine , linguistics , philosophy , genetics , physics , arithmetic , management , mathematics , quantum mechanics , evolutionary biology , meteorology , economics , biology
Under the framework of max margin method, this work proposes a model for training sequence data, which can be solved as a binary classification. However, there are too many samples in the auxiliary classification problem to make the model efficient enough for median to large scale data sets in practice. Therefore, under the additive assumption for the feature mapping and loss function, a simplified model is introduced in order to speed up training. The major advantage of our method is that the new model does not share slack variable for a sequence. This provides the ability to utilize the discriminate information within the sequence and select the discriminative patterns more precisely. Experiment on the task of named entity recognition validates the effectiveness of the new method
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom