Genomic data imputation with variational auto-encoders
Author(s) -
Yeping Lina Qiu,
Hong Zheng,
Olivier Gevaert
Publication year - 2020
Publication title -
gigascience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.947
H-Index - 54
ISSN - 2047-217X
DOI - 10.1093/gigascience/giaa082
Subject(s) - imputation (statistics) , computer science , autoencoder , computational biology , genetic data , data mining , artificial intelligence , biology , missing data , machine learning , medicine , artificial neural network , population , environmental health
As missing values are frequently present in genomic data, practical methods to handle missing data are necessary for downstream analyses that require complete data sets. State-of-the-art imputation techniques, including methods based on singular value decomposition and K-nearest neighbors, can be computationally expensive for large data sets and it is difficult to modify these algorithms to handle certain cases not missing at random.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom