System for Quality‐Assured Data Analysis: Flexible, reproducible scientific workflows
Author(s) - Fowler Jerry, San Lucas Francis Anthony, Scheet Paul
Publication year - 2019
Publication title - Genetic Epidemiology
Language(s) - Uncategorized
Resource type - Journals
SCImago Journal Rank - 1.301
H-Index - 98
eISSN - 1098-2272
pISSN - 0741-0395
DOI - 10.1002/gepi.22178
Subject(s) - workflow, computer science, quality (philosophy), data science, data mining, database, philosophy, epistemology
The reproducibility of scientific processes is one of the paramount problems of bioinformatics, an engineering problem that must be addressed to perform good research. The System for Quality-Assured Data Analysis (SyQADA), described here, seeks to address reproducibility by managing many of the details of procedural bookkeeping in bioinformatics in as simple and transparent a manner as possible. SyQADA has been used by people with backgrounds ranging from expert programmer to Unix novice to perform and repeat dozens of diverse bioinformatics workflows on tens of thousands of samples, consuming over 80 CPU-months of computing on over 300,000 individual tasks across scores of projects on laptops, computer servers, and computing clusters. SyQADA is especially well suited to the paired-sample analyses found in cancer tumor-normal studies. SyQADA executable source code, documentation, tutorial examples, and workflows used in our lab are available from http://scheet.org/software.html.
