Ontology application and use at the ENCODE DCC
Author(s) -
Venkat S. Malladi,
Drew T. Erickson,
Nikhil R. Podduturi,
Laurence D. Rowe,
Esther T. Chan,
Jean M. Davidson,
Benjamin C. Hitz,
Marcus Ho,
Brian T. Lee,
Stuart R. Miyasato,
Gregory R. Roe,
Matt Simison,
Cricket A. Sloan,
J. Seth Strattan,
Forrest Y. Tanaka,
W. James Kent,
J. Michael Cherry,
Eurie L. Hong
Publication year - 2015
Publication title -
database
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.406
H-Index - 62
ISSN - 1758-0463
DOI - 10.1093/database/bav010
Subject(s) - encode , metadata , computer science , ontology , information retrieval , encyclopedia , standardization , world wide web , data science , biology , library science , gene , biochemistry , philosophy , epistemology , operating system
The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory network and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All ENCODE experimental data, metadata and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage and distribution to community resources and the scientific community. As the volume of data increases, the organization of experimental details becomes increasingly complicated and demands careful curation to identify related experiments. Here, we describe the ENCODE DCC's use of ontologies to standardize experimental metadata. We discuss how ontologies, when used to annotate metadata, provide improved searching capabilities and facilitate the ability to find connections within a set of experiments. Additionally, we provide examples of how ontologies are used to annotate ENCODE metadata and how the annotations can be identified via ontology-driven searches at the ENCODE portal. As genomic datasets grow larger and more interconnected, standardization of metadata becomes increasingly vital to allow for exploration and comparison of data between different scientific projects.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom