Premium
Prediction of disease‐associated functional variants in noncoding regions through a comprehensive analysis by integrating datasets and features
Author(s) -
Lu Yu,
Wu Yiming,
Liu Yuan,
Li Yizhou,
Jing Runyu,
Li Menglong
Publication year - 2021
Publication title -
human mutation
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.981
H-Index - 162
eISSN - 1098-1004
pISSN - 1059-7794
DOI - 10.1002/humu.24203
Subject(s) - biology , computational biology , genetics , bioinformatics
One of the greatest challenges in human genetics is deciphering the link between functional variants in noncoding sequences and the pathophysiology of complex diseases. To address this issue, many methods have been developed to sort functional single‐nucleotide variants (SNVs) for neutral SNVs in noncoding regions. In this study, we integrated well‐established features and commonly used datasets and merged them into large‐scale datasets based on a random forest model, which yielded promising performance and outperformed some cutting‐edge approaches. Our analyses of feature importance and data coverage also provide certain clues for future research in enhancing the prediction of functional noncoding SNVs.