z-logo
open-access-imgOpen Access
pyGenClean: efficient tool for genetic data clean up before association testing
Author(s) -
LouisPhilippe Lemieux Perreault,
Sylvie Provost,
MarcAndré Legault,
Amina Barhdadi,
MariePierre Dubé
Publication year - 2013
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btt261
Subject(s) - python (programming language) , computer science , genotyping , data mining , software , source code , pipeline (software) , documentation , data quality , operating system , genotype , biology , engineering , metric (unit) , biochemistry , operations management , gene
Genetic association studies making use of high-throughput genotyping arrays need to process large amounts of data in the order of millions of markers per experiment. The first step of any analysis with genotyping arrays is typically the conduct of a thorough data clean up and quality control to remove poor quality genotypes and generate metrics to inform and select individuals for downstream statistical analysis. We have developed pyGenClean, a bioinformatics tool to facilitate and standardize the genetic data clean up pipeline with genotyping array data. In conjunction with a source batch-queuing system, the tool minimizes data manipulation errors, accelerates the completion of the data clean up process and provides informative plots and metrics to guide decision making for statistical analysis.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom