z-logo
open-access-imgOpen Access
Sequence analysis of the Methanococcus jannaschii genome and the prediction of protein function
Author(s) -
Miguel A. AndradeNavarro,
Georg Casari,
Antoine de Daruvar,
Chris Sander,
Reinhard Schneider,
Javier Tamames,
Alfonso Valencia,
Christos Ouzounis
Publication year - 1997
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/13.4.481
Subject(s) - methanococcus , sequence (biology) , sequence analysis , genome , phylogenetic tree , function (biology) , whole genome sequencing , computer science , computational biology , gene , archaea , biology , genetics
With the completion of the genome sequence of Methanococcus jannaschii (Bult et al., 1996), computational analysis has revealed a number of interesting predicted functions for this organism. Although the success rate of the initial prediction was <40%, due to a conservative attitude towards possible over-interpretation (Venter, 1996), additional efforts to annotate the sequence follow, contributing a significant increase of functional assignments through a combination of different methods (Kyrpides et al., 1996). In our continuing effort to annotate the gene products for each complete genome (Casari et al., 1995), we have analyzed the full genomic sequence of M.jannaschii, and predicted gene function by sequence similarity. We use GeneQuiz, a system for large-scale sequence analysis (Scharf et al., 1994), which exploits the combination of a number of predictive methods with a rule-based engine that increases the success of predictions thanks to a collection of heuristics and a number of benchmarking cycles (Casari et al., 1996). We have maintained a strong interest in the study of Archaea (Ouzounis and Sander, 1992; Ouzounis et al., 1995), and the analysis of the M.jannaschii genome was greatly anticipated. The scope of this communication is 3-fold: (i) to compare the performance of the GeneQuiz system against the laboriously derived (but highly accurate) manual annotation of the previous attempts; (ii) to discuss some of the cases where GeneQuiz has succeeded or failed; (iii) to assure potential users of the results that the quality of the analysis is high, despite the peculiar biochemical and phylogenetic disposition of this organism. From a total of 1682 chromosomal open reading frames (ORFs), GeneQuiz has assigned function to 774 (46%) with high confidence ('clear' cases), and, with decreasing confidence, 118 (7%) with probable function ('tentative' cases), while 482 (29%) ORFs have a clear homolog whose function remains unknown. The remaining 308 cases (18%)

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom