z-logo
open-access-imgOpen Access
OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes
Author(s) -
Marie A. Brunet,
Mylène Brunelle,
JeanFrançois Lucier,
Vivian Delcourt,
Maxime Levesque,
Frédéric Grenier,
Sondos Samandi,
Sébastien Leblanc,
Jean-David Aguilar,
Pascal Dufour,
JeanFrançois Jacques,
Isabelle Fournier,
Aïda Ouangraoua,
Michelle S Scott,
FrançoisMichel Boisvert,
Xavier Roucou
Publication year - 2018
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/gky936
Subject(s) - orfs , biology , ribosome profiling , genome , proteome , computational biology , open reading frame , proteomics , genetics , annotation , human proteome project , ribosome , gene , peptide sequence , rna
Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely on protein prior knowledge and on ab initio prediction algorithms. Such algorithms notably enforce an arbitrary criterion of one coding sequence (CDS) per transcript, leading to a substantial underestimation of the coding potential of eukaryotes. Here, we present OpenProt, the first database fully endorsing a polycistronic model of eukaryotic genomes to date. OpenProt contains all possible ORFs longer than 30 codons across 10 species, and cumulates supporting evidence such as protein conservation, translation and expression. OpenProt annotates all known proteins (RefProts), novel predicted isoforms (Isoforms) and novel predicted proteins from alternative ORFs (AltProts). It incorporates cutting-edge algorithms to evaluate protein orthology and re-interrogate publicly available ribosome profiling and mass spectrometry datasets, supporting the annotation of thousands of predicted ORFs. The constantly growing database currently cumulates evidence from 87 ribosome profiling and 114 mass spectrometry studies from several species, tissues and cell lines. All data is freely available and downloadable from a web platform (www.openprot.org) supporting a genome browser and advanced queries for each species. Thus, OpenProt enables a more comprehensive landscape of eukaryotic genomes' coding potential.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom