z-logo
open-access-imgOpen Access
Statistical evaluation of the coding capacity of complementary DNA strands
Author(s) -
Anna Tramontano,
Vincenzo Scarlato,
N. Barni,
Marilena Cipollaro,
Annamaria Franzé,
M. Macchiato,
A. Cascino
Publication year - 1984
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/12.12.5049
Subject(s) - biology , dna , gene , genetics , reading frame , coding region , dna sequencing , computational biology , noncoding dna , open reading frame , rna , peptide sequence
Two independent methods are used to evaluate the protein-coding information content in different classes of DNA sequences. The first method allows to evaluate the statistical relevance of finding unidentified reading frames, longer than 100 codons, on both DNA strands of: a) 117 DNA sequences that code for 142 nuclear proteins; b) 39 stable RNA coding sequences and c) 36 other DNA sequences which include regulatory and as yet unknown function sequences. The finding of 50 reading frames longer than 100 codons (complementary inverted proteins or c.i.p. genes) located on the DNA strand complementary to the protein-coding one is drastically in excess of the number predicted by chance alone. An independent method (testcode) applied to c.i.p. gene sequences, which assigns the probability of coding to a given sequence, predicts that more than 50% of these genes are translated in a functional product. These analyses indicate the existence of a new class of protein-coding genes, located on the DNA sequences complementary to the protein-coding DNA strand.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here