HIGH‐PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA | Zendy

Cohen K. Bretonnel | Zendy; Verspoor Karin | Zendy; Johnson             Helen L. | Zendy; Roeder Chris | Zendy; Ogren             Philip V. | Zendy; Baumgartner Jr             William A. | Zendy; White Elizabeth | Zendy; Tipney Hannah | Zendy; Hunter Lawrence | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

HIGH‐PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA

Author(s) -

Cohen K. Bretonnel,

Verspoor Karin,

Johnson Helen L.,

Roeder Chris,

Ogren Philip V.,

Baumgartner Jr William A.,

White Elizabeth,

Tipney Hannah,

Hunter Lawrence

Publication year - 2011

Publication title -

computational intelligence

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.353

H-Index - 52

eISSN - 1467-8640

pISSN - 0824-7935

DOI - 10.1111/j.1467-8640.2011.00405.x

Subject(s) - computer science , event (particle physics) , natural language processing , task (project management) , parsing , set (abstract data type) , false positive paradox , artificial intelligence , biomedical text mining , sentence , negation , data mining , text mining , physics , management , quantum mechanics , economics , programming language

We approached the problems of event detection, argument identification, and negation and speculation detection in the BioNLP'09 information extraction challenge through concept recognition and analysis. Our methodology involved using the OpenDMAP semantic parser with manually written rules. The original OpenDMAP system was updated for this challenge with a broad ontology defined for the events of interest, new linguistic patterns for those events, and specialized coordination handling. We achieved state-of-the-art precision for two of the three tasks, scoring the highest of 24 teams at precision of 71.81 on Task 1 and the highest of 6 teams at precision of 70.97 on Task 2. We provide a detailed analysis of the training data and show that a number of trigger words were ambiguous as to event type, even when their arguments are constrained by semantic class. The data is also shown to have a number of missing annotations. Analysis of a sampling of the comparatively small number of false positives returned by our system shows that major causes of this type of error were failing to recognize second themes in two-theme events, failing to recognize events when they were the arguments to other events, failure to recognize nontheme arguments, and sentence segmentation errors. We show that specifically handling coordination had a small but important impact on the overall performance of the system. The OpenDMAP system and the rule set are available at http://bionlp.sourceforge.net.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore