Predicting Ovarian/Breast Cancer Pathogenic Risks of Human BRCA1 Gene Variants of Unknown Significance
Author(s) -
Hui-Heng Lin,
Hongyan Xu,
Hongbo Hu,
Zhanzhong Ma,
Jie Zhou,
Qingyun Liang
Publication year - 2021
Publication title -
biomed research international
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.772
H-Index - 126
eISSN - 2314-6141
pISSN - 2314-6133
DOI - 10.1155/2021/6667201
Subject(s) - clinical significance , pathogenicity , random forest , computational biology , dna sequencing , medical diagnosis , benchmarking , computer science , machine learning , gene , biology , genetics , medicine , pathology , microbiology and biotechnology , marketing , business
High-throughput sequencing is gaining popularity in clinical diagnoses, but more and more novel gene variants with unknown clinical significance are being found, giving difficulties to interpretations of people's genetic data, precise disease diagnoses, and the making of therapeutic strategies and decisions. In order to solve these issues, it is of critical importance to figure out ways to analyze and interpret such variants. In this work, BRCA1 gene variants with unknown clinical significance were identified from clinical sequencing data, and then, we developed machine learning models so as to predict the pathogenicity for variants with unknown clinical significance. Through performance benchmarking, we found that the optimized random forest model scored 0.85 in area under receiver operating characteristic curve, which outperformed other models. Finally, we applied the best random forest model to predict the pathogenicity of 6321 BRCA1 variants from both sequencing data and ClinVar database. As a result, we obtained the predictive pathogenic risks of BRCA1 variants of unknown significance.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom