z-logo
Premium
The 32 Codon World of A. dehalogenans
Author(s) -
Duax William L,
Huether Robert,
Pletnev Vladimir,
Kirsch Tyler,
Hogan Dana,
Teysir Jimmitti,
Benatovich Sam,
Bednarz Jane,
Dziak David,
McCauley Maddy
Publication year - 2009
Publication title -
the faseb journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.709
H-Index - 277
eISSN - 1530-6860
pISSN - 0892-6638
DOI - 10.1096/fasebj.23.1_supplement.858.3
Subject(s) - genetic code , open reading frame , stop codon , genetics , genome , reading frame , biology , coding region , gc content , computational biology , dna , nonsense mutation , base pair , codon usage bias , gene , mutation , peptide sequence , missense mutation
The genome of the soil bacteria Anaeromyxobacter dehalogenans (Adhal) is reported to have 4346 coding sequences composed of 1,521,374 codons, a GC content of 74.82% and GC occupancy of the third base position of 97.07%. Over 50% of the proteins in Adhal are annotated as having uncertain or unknown function. Over 1000 have two or less Met residues and Val and Leu residues are often identified as start codes. Examination of the sequences indicates that over 80% of them have 3 open reading frames. The presence of multiple open reading frames and ambiguity of start and stop code identification has resulted in misidentification of hundreds of nonsense sequences as hypothetical proteins. Significant differences in codon use between functionally characterized and hypothetical proteins, errors in coding frame selection, and in start and stop code identifications indicate that most (and possibly all) of the sixteen codons that have A or T bases in the first and third position are nonsense codes (de facto stops). These patterns present in many bacterial genomes suggest that the modern genetic code evolved from a coding system in which just the 32 codons that end in G or C define all 20 amino acids. The DNA in which this code evolved was stabilized by the presence of three hydrogen bonds linking the GC pairs present in every third base pair of the DNA encoding proteins.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here