Combining an Expert-Based Medical Entity Recognizer to a Machine-Learning System: Methods and a Case Study | Zendy

Pierre Zweigenbaum | Zendy; Thomas Lavergne | Zendy; Natalia Grabar | Zendy; Thierry Hamon | Zendy; Sophie Rosset | Zendy; Cyril Grouin | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Combining an Expert-Based Medical Entity Recognizer to a Machine-Learning System: Methods and a Case Study

Author(s) -

Pierre Zweigenbaum,

Thomas Lavergne,

Natalia Grabar,

Thierry Hamon,

Sophie Rosset,

Cyril Grouin

Publication year - 2013

Publication title -

biomedical informatics insights

Language(s) - English

Resource type - Journals

ISSN - 1178-2226

DOI - 10.4137/bii.s11770

Subject(s) - conditional random field , overfitting , computer science , expert system , artificial intelligence , machine learning , random forest , named entity recognition , classifier (uml) , data mining , natural language processing , artificial neural network , engineering , systems engineering , task (project management)

Medical entity recognition is currently generally performed by data-driven methods based on supervised machine learning. Expert-based systems, where linguistic and domain expertise are directly provided to the system are often combined with data-driven systems. We present here a case study where an existing expert-based medical entity recognition system, Ogmios, is combined with a data-driven system, Caramba, based on a linear-chain Conditional Random Field (CRF) classifier. Our case study specifically highlights the risk of overfitting incurred by an expert-based system. We observe that it prevents the combination of the 2 systems from obtaining improvements in precision, recall, or F-measure, and analyze the underlying mechanisms through a post-hoc feature-level analysis. Wrapping the expert-based system alone as attributes input to a CRF classifier does boost its F-measure from 0.603 to 0.710, bringing it on par with the data-driven system. The generalization of this method remains to be further investigated.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research