Automatic ontology construction from the literature.
Author(s) - Christian Blaschke, Alfonso Valencia
Publication year - 2002
Publication title - Genome Informatics. International Conference on Genome Informatics
Language(s) - English
DOI - 10.11234/gi1990.13.201
Detailed classifications, controlled vocabularies and organised terminology are widely used in different areas of science and technology. Their relatively recent introduction in molecular biology has been crucial for progress in the analysis of genomics and massive proteomics experiments. Unfortunately, the construction of ontologies, including terminology, classification and entity relations, requires considerable effort, including the analysis of massive amounts of literature. We propose here a method that automatically generates classifications of gene-product functions using bibliographic information. The corresponding classification structures mirror the ones constructed by human experts. The analysis of a large structure built for yeast gene-products and the detailed inspection of various examples show encouraging properties. In particular, the comparison with the well-accepted Gene Ontology (GO) points to different situations in which the automatically derived classification can be useful for assisting human experts in the annotation of ontologies.