z-logo
open-access-imgOpen Access
Context-dependent individualization of nucleotides and virtual genomic hybridization allow the precise location of human SNPs
Author(s) -
José Luis Reyes,
Laura Gómez-Romero,
Ximena Ibarra-Soria,
Kim Palacios-Flores,
Luis R. Arriola,
Alejandro Wences,
Delfino García,
Margareta Boege,
Guillermo Dávila,
Margarita Avilés Flores,
Rafael Palacios
Publication year - 2011
Publication title -
proceedings of the national academy of sciences of the united states of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 5.011
H-Index - 771
eISSN - 1091-6490
pISSN - 0027-8424
DOI - 10.1073/pnas.1112567108
Subject(s) - context (archaeology) , genome , reference genome , false positive paradox , human genome , computational biology , set (abstract data type) , computer science , genetics , biology , artificial intelligence , gene , paleontology , programming language
We have entered the era of individual genomic sequencing, and can already see exponential progress in the field. It is of utmost importance to exclude false-positive variants from reported datasets. However, because of the nature of the used algorithms, this task has not been optimized to the required level of precision. This study presents a unique strategy for identifying SNPs, called COIN-VGH, that largely minimizes the presence of false-positives in the generated data. The algorithm was developed using the X-chromosome–specific regions from the previously sequenced genomes of Craig Venter and James Watson. The algorithm is based on the concept that a nucleotide can be individualized if it is analyzed in the context of its surrounding genomic sequence. COIN-VGH consists of defining the most comprehensive set of nucleotide strings of a defined length that map with 100% identity to a unique position within the human reference genome (HRG). Such set is used to retrieve sequence reads from a query genome (QG), allowing the production of a genomic landscape that represents a draft HRG-guided assembly of the QG. This landscape is analyzed for specific signatures that indicate the presence of SNPs. The fidelity of the variation signature was assessed using simulation experiments by virtually altering the HRG at defined positions. Finally, the signature regions identified in the HRG and in the QG reads are aligned and the precise nature and position of the corresponding SNPs are detected. The advantages of COIN-VGH over previous algorithms are discussed.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here