Protein Ontology (PRO): enhancing and scaling up the representation of protein entities | Zendy

Darren A. Natale | Zendy; Cecilia Arighi | Zendy; Judith A. Blake | Zendy; Jonathan P. Bona | Zendy; Chuming Chen | Zendy; ShengChih Chen | Zendy; Karen Christie | Zendy; Julie Cowart | Zendy; Peter D’Eustachio | Zendy; Alexander D. Diehl | Zendy; Harold Drabkin | Zendy; William D. Duncan | Zendy; Hongzhan Huang | Zendy; Jia Ren | Zendy; Karen Ross | Zendy; Alan Ruttenberg | Zendy; Veronica Shamovsky | Zendy; Barry Smith | Zendy; Qinghua Wang | Zendy; Jian Zhang | Zendy; Abdelrahman Elsayed | Zendy; Cathy Wu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Protein Ontology (PRO): enhancing and scaling up the representation of protein entities

Author(s) -

Darren A. Natale,

Cecilia Arighi,

Judith A. Blake,

Jonathan P. Bona,

Chuming Chen,

ShengChih Chen,

Karen Christie,

Julie Cowart,

Peter D’Eustachio,

Alexander D. Diehl,

Harold Drabkin,

William D. Duncan,

Hongzhan Huang,

Jia Ren,

Karen Ross,

Alan Ruttenberg,

Veronica Shamovsky,

Barry Smith,

Qinghua Wang,

Jian Zhang,

Abdelrahman Elsayed,

Cathy Wu

Publication year - 2016

Publication title -

nucleic acids research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.008

H-Index - 537

eISSN - 1362-4954

pISSN - 0305-1048

DOI - 10.1093/nar/gkw1075

Subject(s) - uniprot , discoverability , sparql , computer science , ontology , scalability , representation (politics) , protein sequencing , computational biology , biology , information retrieval , semantic web , bioinformatics , rdf , world wide web , database , peptide sequence , gene , genetics , philosophy , epistemology , politics , political science , law

The Protein Ontology (PRO; http://purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas: proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB as a sequence reference and PSI-MOD as a post-translational modification reference. We illustrate its use in facilitating an alignment between PRO and Reactome protein entities. We also address issues of scalability, describing our first steps into the use of text mining to identify protein-related entities, the large-scale import of proteoform information from expert curated resources, and our ability to dynamically generate PRO terms. Web views for individual terms are now more informative about closely-related terms, including for example an interactive multiple sequence alignment. Finally, we describe recent improvement in semantic utility, with PRO now represented in OWL and as a SPARQL endpoint. These developments will further support the anticipated growth of PRO and facilitate discoverability of and allow aggregation of data relating to protein entities.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research