Premium
Improved, ACMG‐compliant, in silico prediction of pathogenicity for missense substitutions encoded by TP53 variants
Author(s) -
Fortuno Cristina,
James Paul A.,
Young Erin L.,
Feng Bing,
Olivier Magali,
Pesaran Tina,
Tavtigian Sean V.,
Spurdle Amanda B.
Publication year - 2018
Publication title -
human mutation
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.981
H-Index - 162
eISSN - 1098-1004
pISSN - 1059-7794
DOI - 10.1002/humu.23553
Subject(s) - missense mutation , in silico , matthews correlation coefficient , biology , computational biology , medical genetics , genetics , computer science , mutation , machine learning , gene , support vector machine
Clinical interpretation of germline missense variants represents a major challenge, including those in the TP53 Li–Fraumeni syndrome gene. Bioinformatic prediction is a key part of variant classification strategies. We aimed to optimize the performance of the Align‐GVGD tool used for p53 missense variant prediction, and compare its performance to other bioinformatic tools (SIFT, PolyPhen‐2) and ensemble methods (REVEL, BayesDel). Reference sets of assumed pathogenic and assumed benign variants were defined using functional and/or clinical data. Area under the curve and Matthews correlation coefficient (MCC) values were used as objective functions to select an optimized protein multisequence alignment with best performance for Align‐GVGD. MCC comparison of tools using binary categories showed optimized Align‐GVGD (C15 cut‐off) combined with BayesDel (0.16 cut‐off), or with REVEL (0.5 cut‐off), to have the best overall performance. Further, a semi‐quantitative approach using multiple tiers of bioinformatic prediction, validated using an independent set of nonfunctional and functional variants, supported use of Align‐GVGD and BayesDel prediction for different strength of evidence levels in ACMG/AMP rules. We provide rationale for bioinformatic tool selection for TP53 variant classification, and have also computed relevant bioinformatic predictions for every possible p53 missense variant to facilitate their use by the scientific and medical community.