z-logo
open-access-imgOpen Access
Combining an Expert-Based Medical Entity Recognizer to a Machine-Learning System: Methods and a Case Study
Author(s) -
Pierre Zweigenbaum,
Thomas Lavergne,
Natalia Grabar,
Thierry Hamon,
Sophie Rosset,
Cyril Grouin
Publication year - 2013
Publication title -
biomedical informatics insights
Language(s) - English
Resource type - Journals
ISSN - 1178-2226
DOI - 10.4137/bii.s11770
Subject(s) - conditional random field , overfitting , computer science , expert system , artificial intelligence , machine learning , random forest , named entity recognition , classifier (uml) , data mining , natural language processing , artificial neural network , engineering , systems engineering , task (project management)
Medical entity recognition is currently generally performed by data-driven methods based on supervised machine learning. Expert-based systems, where linguistic and domain expertise are directly provided to the system are often combined with data-driven systems. We present here a case study where an existing expert-based medical entity recognition system, Ogmios, is combined with a data-driven system, Caramba, based on a linear-chain Conditional Random Field (CRF) classifier. Our case study specifically highlights the risk of overfitting incurred by an expert-based system. We observe that it prevents the combination of the 2 systems from obtaining improvements in precision, recall, or F-measure, and analyze the underlying mechanisms through a post-hoc feature-level analysis. Wrapping the expert-based system alone as attributes input to a CRF classifier does boost its F-measure from 0.603 to 0.710, bringing it on par with the data-driven system. The generalization of this method remains to be further investigated.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom