z-logo
Premium
How many protein‐coding genes are there in the Saccharomyces cerevisiae genome?
Author(s) -
Mackiewicz Pawel,
Kowalczuk Maria,
Mackiewicz Dorota,
Nowicka Aleksandra,
Dudkiewicz Malgorzata,
Laszkiewicz Agnieszka,
Dudek Miroslaw R.,
Cebrat Stanislaw
Publication year - 2002
Publication title -
yeast
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.923
H-Index - 102
eISSN - 1097-0061
pISSN - 0749-503X
DOI - 10.1002/yea.865
Subject(s) - orfs , biology , gene , genome , genetics , saccharomyces cerevisiae , gene prediction , computational biology , coding region , open reading frame , peptide sequence
We have compared the results of estimations of the total number of protein‐coding genes in the Saccharomyces cerevisiae genome, which have been obtained by many laboratories since the yeast genome sequence was published in 1996. We propose that there are 5300–5400 genes in the genome. This makes the first estimation of the number of intronless ORFs longer than 100 codons, based on the features of the set of genes with phenotypes known in 1997 to be correct. This estimation assumed that the set of the first 2300 genes with known phenotypes was representative for the whole set of protein‐coding genes in the genome. The same method used in this paper for the approximation of the total number of protein‐coding sequences among more than 40 000 ORFs longer than 20 codons gives a result that is only slightly higher. This suggests that there are still some non‐coding ORFs in the databases and a few dozen small ORFs, not yet annotated, which probably code for proteins. Copyright © 2002 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here