Semi-automated Construction of Decision Rules to Predict Morbidities from Clinical Texts | Zendy

Richárd Farkas | Zendy; György Szarvas | Zendy; Ito Sadayoshi | Zendy; Ali Almasi | Zendy; Veronika Vincze | Zendy; Róbert Ormándi | Zendy; Róbert BusaFekete | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Semi-automated Construction of Decision Rules to Predict Morbidities from Clinical Texts

Author(s) -

Richárd Farkas,

György Szarvas,

Ito Sadayoshi,

Ali Almasi,

Veronika Vincze,

Róbert Ormándi,

Róbert BusaFekete

Publication year - 2009

Publication title -

journal of the american medical informatics association

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.614

H-Index - 150

eISSN - 1527-974X

pISSN - 1067-5027

DOI - 10.1197/jamia.m3097

Subject(s) - spelling , computer science , natural language processing , intuition , outlier , artificial intelligence , set (abstract data type) , medical diagnosis , information retrieval , psychology , medicine , linguistics , pathology , cognitive science , philosophy , programming language

OBJECTIVE In this study the authors describe the system submitted by the team of University of Szeged to the second i2b2 Challenge in Natural Language Processing for Clinical Data. The challenge focused on the development of automatic systems that analyzed clinical discharge summary texts and addressed the following question: "Who's obese and what co-morbidities do they (definitely/most likely) have?". Target diseases included obesity and its 15 most frequent comorbidities exhibited by patients, while the target labels corresponded to expert judgments based on textual evidence and intuition (separately). DESIGN The authors applied statistical methods to preselect the most common and confident terms and evaluated outlier documents by hand to discover infrequent spelling variants. The authors expected a system with dictionaries gathered semi-automatically to have a good performance with moderate development costs (the authors examined just a small proportion of the records manually). MEASUREMENTS Following the standard evaluation method of the second Workshop on challenges in Natural Language Processing for Clinical Data, the authors used both macro- and microaveraged Fbeta=1 measure for evaluation. RESULTS The authors submission achieved a microaverage F(beta=1) score of 97.29% for classification based on textual evidence (macroaverage F(beta=1) = 76.22%) and 96.42% for intuitive judgments (macroaverage F(beta=1) = 67.27%). CONCLUSIONS The results demonstrate the feasibility of the authors approach and show that even very simple systems with a shallow linguistic analysis can achieve remarkable accuracy scores for classifying clinical records on a limited set of concepts.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research