z-logo
Premium
Tag SNP selection using particle swarm optimization
Author(s) -
Chuang LiYeh,
Yang ChengSan,
Ho ChangHsuan,
Yang ChengHong
Publication year - 2009
Publication title -
biotechnology progress
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.572
H-Index - 129
eISSN - 1520-6033
pISSN - 8756-7938
DOI - 10.1002/btpr.350
Subject(s) - single nucleotide polymorphism , snp , snp genotyping , selection (genetic algorithm) , genotyping , tag snp , computer science , haplotype , computational biology , genome , support vector machine , genome wide association study , biology , genetics , artificial intelligence , genotype , gene
Abstract Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variations amongst species. With the genome‐wide SNP discovery, many genome‐wide association studies are likely to identify multiple genetic variants that are associated with complex diseases. However, genotyping all existing SNPs for a large number of samples is still challenging even though SNP arrays have been developed to facilitate the task. Therefore, it is essential to select only informative SNPs representing the original SNP distributions in the genome (tag SNP selection) for genome‐wide association studies. These SNPs are usually chosen from haplotypes and called haplotype tag SNPs (htSNPs). Accordingly, the scale and cost of genotyping are expected to be largely reduced. We introduce binary particle swarm optimization (BPSO) with local search capability to improve the prediction accuracy of STAMPA. The proposed method does not rely on block partitioning of the genomic region, and consistently identified tag SNPs with higher prediction accuracy than either STAMPA or SVM/STSA. We compared the prediction accuracy and time complexity of BPSO to STAMPA and an SVM‐based (SVM/STSA) method using publicly available data sets. For STAMPA and SVM/STSA, BPSO effective improved prediction accuracy for smaller and larger scale data sets. These results demonstrate that the BPSO method selects tag SNP with higher accuracy no matter the scale of data sets is used. © 2009 American Institute of Chemical Engineers Biotechnol. Prog., 2010

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here