Premium
One thousand genomes imputation in the national cancer institute breast and prostate cancer cohort consortium aggressive prostate cancer genome‐wide association study
Author(s) -
Machiela Mitchell J.,
Chen Constance,
Liang Liming,
Diver W. Ryan,
Stevens Victoria L.,
Tsilidis Konstantinos K.,
Haiman Christopher A.,
Chanock Stephen J.,
Hunter David J.,
Kraft Peter
Publication year - 2013
Publication title -
the prostate
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.295
H-Index - 123
eISSN - 1097-0045
pISSN - 0270-4137
DOI - 10.1002/pros.22608
Subject(s) - genome wide association study , imputation (statistics) , prostate cancer , single nucleotide polymorphism , 1000 genomes project , minor allele frequency , genetic association , linkage disequilibrium , breast cancer , oncology , biology , genetics , cancer , medicine , genotype , statistics , missing data , gene , mathematics
BACKGROUND Genotype imputation substantially increases available markers for analysis in genome‐wide association studies (GWAS) by leveraging linkage disequilibrium from a reference panel. We sought to (i) investigate the performance of imputation from the August 2010 release of the 1000 Genomes Project (1000GP) in an existing GWAS of prostate cancer, (ii) look for novel associations with prostate cancer risk, (iii) fine‐map known prostate cancer susceptibility regions using an approximate Bayesian framework and stepwise regression, and (iv) compare power and efficiency of imputation and de novo sequencing. METHODS We used 2,782 aggressive prostate cancer cases and 4,458 controls from the NCI Breast and Prostate Cancer Cohort Consortium aggressive prostate cancer GWAS to infer 5.8 million well‐imputed autosomal single nucleotide polymorphisms (SNPs). RESULTS Imputation quality, as measured by correlation between imputed and true allele counts, was higher among common variants than rare variants. We found no novel prostate cancer associations among a subset of 1.2 million well‐imputed low‐frequency variants. At a genome‐wide sequencing cost of $2,500, imputation from SNP arrays is a more powerful strategy than sequencing for detecting disease associations of SNPs with minor allele frequencies (MAF) above 1%. CONCLUSIONS 1000GP imputation provided dense coverage of previously identified prostate cancer susceptibility regions, highlighting its potential as an inexpensive first‐pass approach to fine mapping in regions such as 5p15 and 8q24. Our study shows 1000GP imputation can accurately identify low‐frequency variants and stresses the importance of large sample size when studying these variants. Prostate 73: 677–689, 2013. © 2012 Wiley Periodicals, Inc.