Management of High-Throughput DNA Sequencing Projects: Alpheus
Author(s) -
Neil A Miller,
Stephen F. Kingsmore,
Andrew Farmer,
Raymond J. Langley,
Joann Mudge,
John Crow,
Álvaro González,
Faye Schilkey,
Ryan J Kim,
Jennifer C. van Velkinburgh,
Gregory D. May,
C. Forrest Black,
Margaret Myers,
John P. Utsey,
Nicholas S Frost,
David J. Sugarbaker,
Raphael Bueno,
Stephen R Gullans,
Susan M. Baxter,
Steve W Day,
Ernest F. Retzel
Publication year - 2008
Publication title -
journal of computer science and systems biology
Language(s) - English
Resource type - Journals
ISSN - 0974-7230
DOI - 10.4172/jcsb.1000013
Subject(s) - biology , dna sequencing , pyrosequencing , computational biology , massive parallel sequencing , reference genome , sequence assembly , genetics , indel , genomics , false positive paradox , sequence analysis , dna microarray , single nucleotide polymorphism , genome , transcriptome , gene , computer science , genotype , gene expression , machine learning
High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricul- tural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage sig- nificant volumes of high-dimensional and interrelated data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multigigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche- 454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximiz- ing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom