Multiple Genetic Variant Association Testing by Collapsing and Kernel Methods With Pedigree or Population Structured Data | Zendy

Schaid Daniel J. | Zendy; McDonnell Shan K. | Zendy; Sinnwell Jason P. | Zendy; Thibodeau Stephen N. | Zendy

Premium

Multiple Genetic Variant Association Testing by Collapsing and Kernel Methods With Pedigree or Population Structured Data

Author(s) -

Schaid Daniel J.,

McDonnell Shan K.,

Sinnwell Jason P.,

Thibodeau Stephen N.

Publication year - 2013

Publication title -

genetic epidemiology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.301

H-Index - 98

eISSN - 1098-2272

pISSN - 0741-0395

DOI - 10.1002/gepi.21727

Subject(s) - linkage disequilibrium , statistic , pedigree chart , identity by descent , population , kernel (algebra) , statistics , biology , genetic association , genetics , mathematics , genotype , haplotype , medicine , combinatorics , gene , single nucleotide polymorphism , environmental health

Searching for rare genetic variants associated with complex diseases can be facilitated by enriching for diseased carriers of rare variants by sampling cases from pedigrees enriched for disease, possibly with related or unrelated controls. This strategy, however, complicates analyses because of shared genetic ancestry, as well as linkage disequilibrium among genetic markers. To overcome these problems, we developed broad classes of “burden” statistics and kernel statistics, extending commonly used methods for unrelated case‐control data to allow for known pedigree relationships, for autosomes and the X chromosome. Furthermore, by replacing pedigree‐based genetic correlation matrices with estimates of genetic relationships based on large‐scale genomic data, our methods can be used to account for population‐structured data. By simulations, we show that the type I error rates of our developed methods are near the asymptotic nominal levels, allowing rapid computation of P ‐values. Our simulations also show that a linear weighted kernel statistic is generally more powerful than a weighted “burden” statistic. Because the proposed statistics are rapid to compute, they can be readily used for large‐scale screening of the association of genomic sequence data with disease status.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research