
A transition-based neural framework for Chinese information extraction
Author(s) -
Wenzhang Huang,
Junchi Zhang,
Donghong Ji
Publication year - 2020
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0235796
Subject(s) - computer science , sequence labeling , information extraction , segmentation , named entity recognition , task (project management) , mutual information , artificial intelligence , word (group theory) , artificial neural network , pattern recognition (psychology) , relationship extraction , process (computing) , cascade , character (mathematics) , data mining , natural language processing , linguistics , philosophy , chemistry , management , chromatography , economics , geometry , mathematics , operating system
Chinese information extraction is traditionally performed in the process of word segmentation, entity recognition, relation extraction and event detection. This pipelined approach suffers from two limitations: 1) It is prone to introduce propagated errors from upstream tasks to subsequent applications; 2) Mutual benefits of cross-task dependencies are hard to be introduced in non-overlapping models. To address these two challenges, we propose a novel transition-based model that jointly performs entity recognition, relation extraction and event detection as a single task. In addition, we incorporate subword-level information into character sequence with the use of a hybrid lattice structure, removing the reliance of external word tokenizers. Results on standard ACE benchmarks show the benefits of the proposed joint model and lattice network, which gives the best result in the literature.