
Event-Concept Pair Series Extraction to Represent Medical Complications from Texts
Author(s) -
Chaveevan Pechsiri,
Sumran Phainoun
Publication year - 2018
Publication title -
indonesian journal of electrical engineering and computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.241
H-Index - 17
eISSN - 2502-4760
pISSN - 2502-4752
DOI - 10.11591/ijeecs.v12.i3.pp1320-1333
Subject(s) - correctness , computer science , artificial intelligence , series (stratigraphy) , natural language processing , relationship extraction , event (particle physics) , phrase , unified medical language system , sentence , set (abstract data type) , relation (database) , machine learning , data mining , information extraction , algorithm , programming language , paleontology , physics , quantum mechanics , biology
This research aims to determine an event-concept pair series as consequent events, particularly a Cause-Effect-concept pair (called ‘CEpair’) series on disease documents downloaded from hospital-web-boards. CEpair series are used for representing medical/disease complications which benefit for Solving system. Each causative/effect event concept is expressed by a verb phrase of an elementary discourse unit (EDU) which is a simple sentence. The research has three problems; how to determine each adjacent-EDU pair having the cause-effect relation, how to determine a CEpair series mingled with non-causeeffect-relation EDUs, and how to identify the complication of several extracted CEpair series from the documents. Therefore, we extract NWordCo-concept set having the causative/effect concepts from EDUs’ verb phrases including a support vector machine to solve each NWordCo size. We apply the Naïve Bayes classifier to learn and extract an NWordCoconcept pair set as a knowledge template having the cause-effect relation from the documents. We then propose using the knowledge template to extract several CEpair series. We also apply the intersection of the NWordCo-concept sets to identify the commoncause/effect for representing the complication-development parts of the extracted-CEpair series. The research results provide the high percent correctness of the CEpair-series determination from the documents.