z-logo
Premium
A computer filtering method to drive out tiny genes from the yeast genome
Author(s) -
Barry Caroline,
Fichant Gwennaele,
Kalogeropoulos Angelos,
Quentin Yves
Publication year - 1996
Publication title -
yeast
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.923
H-Index - 102
eISSN - 1097-0061
pISSN - 0749-503X
DOI - 10.1002/(sici)1097-0061(19960915)12:11<1163::aid-yea6>3.0.co;2-7
Subject(s) - orfs , biology , genetics , gene , open reading frame , genome , intergenic region , computational biology , coding region , peptide sequence
The authors of the first yeast chromosome sequence defined a minimum threshold requirement of 100 codons, above which an open reading frame (ORF) is retained as a putative coding sequence. However, at least 58 yeast genes shorter than 100 codons have an assigned protein function. Therefore, the yeast genome may contain other tiny but functionally important genes that are discarded from analyses by this simple filtering rule. We have established discriminant functions from the in‐phase hexamer frequencies of functional genes and of simulated ORFs derived from a stationary Markov chain model. Fifty‐two out of the 58 genes were recognized as coding ORFs by our discriminating method. The test was also applied to all the small ORFs (36 to 100 codons) found in the intergenic regions of published chromosomes. It retained 140 new potential tiny coding sequences, among which we identified seven new genes by similarity searches. Our method, used conjointly with similarity searches, can also highlight sequencing errors resulting from the disruption of the coding frame of longer ORFs. This method, by its ability to detect potential coding ORFs, can be a very useful tool for functional analysis.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here