“COI-like” Sequences Are Becoming Problematic in Molecular Systematic and DNA Barcoding Studies
Author(s) -
Jennifer E. Buhay
Publication year - 2009
Publication title -
journal of crustacean biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.509
H-Index - 54
eISSN - 1937-240X
pISSN - 0278-0372
DOI - 10.1651/08-3020.1
Subject(s) - biology , genbank , dna barcoding , evolutionary biology , mitochondrial dna , phylogenetic tree , pseudogene , indel , genetics , gene , genome , genotype , single nucleotide polymorphism
The cytochrome c oxidase subunit I (COI) gene plays a pivotal role in a global effort to document biodiversity and continues to be a gene of choice in phylogenetic and phylogeographic studies. Due to increased attention on this gene as a species' barcode, quality control and sequence homology issues are re-emerging. Taylor and Knouft (2006) attempted to examine gonopod morphology in light of the subgeneric classification scheme within the freshwater crayfish genus Orconectes using COI sequences. However, their erroneous analyses were not only based on supposed mitochondrial sequences but also incorporated many questionable sequences due to the possible presence of numts and manual editing or sequencing errors. In fact, 22 of the 86 sequences were flagged as “COI-like” by GenBank due to the presence of stop codons and indels in what should be the open reading frame of a conservative protein-coding gene. A subsequent search of “COI-like” accessions in GenBank turned up a multitude of taxa across Crustacea from published and unpublished studies thereby warranting this illustrated discussion about quality control, pseudogenes, and sequence composition.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom