z-logo
Premium
Linkage disequilibrium clustering‐based approach for association mapping with tightly linked genomewide data
Author(s) -
Li Zitong,
Kemppainen Petri,
Rastas Pasi,
Merilä Juha
Publication year - 2018
Publication title -
molecular ecology resources
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.96
H-Index - 136
eISSN - 1755-0998
pISSN - 1755-098X
DOI - 10.1111/1755-0998.12893
Subject(s) - linkage disequilibrium , genome wide association study , biology , computational biology , genetic association , genetics , linkage (software) , tag snp , single nucleotide polymorphism , cluster analysis , snp , association mapping , computer science , genotype , gene , machine learning
Genomewide association studies ( GWAS ) aim to identify genetic markers strongly associated with quantitative traits by utilizing linkage disequilibrium ( LD ) between candidate genes and markers. However, because of LD between nearby genetic markers, the standard GWAS approaches typically detect a number of correlated SNP s covering long genomic regions, making corrections for multiple testing overly conservative. Additionally, the high dimensionality of modern GWAS data poses considerable challenges for GWAS procedures such as permutation tests, which are computationally intensive. We propose a cluster‐based GWAS approach that first divides the genome into many large nonoverlapping windows and uses linkage disequilibrium network analysis in combination with principal component ( PC ) analysis as dimensional reduction tools to summarize the SNP data to independent PC s within clusters of loci connected by high LD . We then introduce single‐ and multilocus models that can efficiently conduct the association tests on such high‐dimensional data. The methods can be adapted to different model structures and used to analyse samples collected from the wild or from biparental F 2 populations, which are commonly used in ecological genetics mapping studies. We demonstrate the performance of our approaches with two publicly available data sets from a plant ( Arabidopsis thaliana ) and a fish ( Pungitius pungitius ), as well as with simulated data.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here