Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling
Author(s) -
Chun-Tien Chang,
ChiNeu Tsai,
Chuan Yi Tang,
ChunHouh Chen,
Jang-Hau Lian,
Chi-Yu Hu,
Chia-Lung Tsai,
Angel Chao,
ChyongHuey Lai,
TzuHao Wang,
YunShien Lee
Publication year - 2012
Publication title -
the scientific world journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.453
H-Index - 93
eISSN - 2356-6140
pISSN - 1537-744X
DOI - 10.1100/2012/365104
Subject(s) - indel , genetics , microsatellite , biology , sequence (biology) , computational biology , dna sequencing , reference genome , indel mutation , dna , gene , genotype , single nucleotide polymorphism , allele
The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available In d elligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β -defensin 4 ( DEFB4 ) and its paralog HSPDP3 .
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom