tmVar: a text mining approach for extracting sequence variants in biomedical literature
Author(s) -
Chih-Hsuan Wei,
Bethany Harris,
HungYu Kao,
Zhiyong Lu
Publication year - 2013
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btt156
Subject(s) - computer science , conditional random field , mutation , sequence (biology) , set (abstract data type) , data mining , biomedical text mining , information retrieval , scope (computer science) , field (mathematics) , computational biology , artificial intelligence , text mining , biology , genetics , gene , mathematics , pure mathematics , programming language
Text-mining mutation information from the literature becomes a critical part of the bioinformatics approach for the analysis and interpretation of sequence variations in complex diseases in the post-genomic era. It has also been used for assisting the creation of disease-related mutation databases. Most of existing approaches are rule-based and focus on limited types of sequence variations, such as protein point mutations. Thus, extending their extraction scope requires significant manual efforts in examining new instances and developing corresponding rules. As such, new automatic approaches are greatly needed for extracting different kinds of mutations with high accuracy.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom