
Identification of Eukaryotic Open Reading Frames in Metagenomic cDNA Libraries Made from Environmental Samples
Author(s) - Susan Grant, William D. Grant, Don A. Cowan, Brian E. Jones, Yanhe Ma, António Ventosa, Shaun Heaphy
Publication year - 2006
Publication title - Applied and Environmental Microbiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.552
H-Index - 324
eISSN - 1070-6291
pISSN - 0099-2240
DOI - 10.1128/aem.72.1.135-143.2006
Subject(s) - biology, ORFs, cDNA library, RNA, ribosomal RNA, metagenomics, open reading frame, complementary DNA, polyadenylation, computational biology, genetics, gene, peptide sequence
Here we describe the application of metagenomic technologies to construct cDNA libraries from RNA isolated from environmental samples. RNAlater (Ambion) was shown to stabilize RNA in environmental samples for periods of at least 3 months at -20°C. Protocols for library construction were established on total RNA extracted from Acanthamoeba polyphaga trophozoites. The methodology was then used on algal mats from geothermal hot springs in Tengchong county, Yunnan Province, People's Republic of China, and activated sludge from a sewage treatment plant in Leicestershire, United Kingdom. The Tengchong libraries were dominated by RNA from prokaryotes, reflecting the mainly prokaryotic microbial composition of the mats. The majority of these clones were derived from rRNA; only a few appeared to be derived from mRNA. In contrast, many clones from the activated sludge library had significant similarity to eukaryote mRNA-encoded protein sequences. A library was also made using polyadenylated RNA isolated from total RNA from activated sludge; many more clones in this library were related to eukaryotic mRNA sequences and proteins. Open reading frames (ORFs) up to 378 amino acids in size could be identified. Some resembled known proteins over their full length, e.g., 36% match to cystatin, 49% match to ribosomal protein L32, 63% match to ribosomal protein S16, and 70% match to CPC2 protein. The methodology described here permits the polyadenylated transcriptome to be isolated from environmental samples with no knowledge of the identity of the microorganisms in the sample or the necessity to culture them. It has many uses, including the identification of novel eukaryotic ORFs encoding proteins and enzymes.
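
The abstract does not say which software was used to identify ORFs in the cDNA clones. As an illustration only, the general idea of scanning a clone sequence for ORFs can be sketched in Python as a six-frame translation. The function names (`translate`, `find_orfs`), the `min_aa` threshold, and the rule that an ORF must start at a methionine and end at a stop codon are assumptions made for this sketch, not details from the paper; it also assumes the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative genetic codes are handled in this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate DNA codon by codon; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def find_orfs(seq: str, min_aa: int = 50):
    """Return (strand, frame, peptide) for every stop-terminated ORF.

    Hypothetical helper: an ORF here runs from the first 'M' after the
    previous stop codon up to the next stop codon, in any of six frames.
    """
    seq = seq.upper()
    orfs = []
    for strand, dna in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            protein = translate(dna[frame:])
            # The last piece has no stop codon after it, so it is skipped.
            for segment in protein.split("*")[:-1]:
                start = segment.find("M")
                if start != -1 and len(segment) - start >= min_aa:
                    orfs.append((strand, frame, segment[start:]))
    return orfs


if __name__ == "__main__":
    # Toy sequence for illustration, not data from the study.
    for strand, frame, peptide in find_orfs("CCATGGCTAAATGAGG", min_aa=3):
        print(strand, frame, peptide)
```

Because cDNA clones are often truncated, a real analysis would probably also keep ORFs that run off the end of the insert; this sketch drops them to stay short. Candidate peptides would then be compared with protein databases to obtain similarity figures like those quoted in the abstract.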