Inference of Population Mutation Rate and Detection of Segregating Sites from Next-Generation Sequence Data
Author(s) -
Chul Joo Kang,
Paul Marjoram
Publication year - 2011
Publication title -
genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.792
H-Index - 246
eISSN - 1943-2631
pISSN - 0016-6731
DOI - 10.1534/genetics.111.130898
Subject(s) - biology , inference , mutation rate , 1000 genomes project , sequence (biology) , mutation , population , genome , dna sequencing , computational biology , genetics , variation (astronomy) , genetic data , computer science , artificial intelligence , gene , genotype , demography , physics , sociology , single nucleotide polymorphism , astrophysics
We live in an age in which our ability to collect large amounts of genome-wide genetic variation data offers the promise of providing the key to the understanding and treatment of genetic diseases. Over the next few years this effort will be spearheaded by so-called next-generation sequencing technologies, which provide vast amounts of short-read sequence data at relatively low cost. This technology is often used to detect unknown variation in regions that have been linked with a given disease or phenotype. However, error rates are significant, leading to some nontrivial issues when it comes to interpreting the data. In this article, we present a method with which to address questions of widespread interest: calling variants and estimating the population mutation rate. We show performance of the method using simulation studies before applying our approach to an analysis of data from the 1000 Genomes project.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom