Predicting Splicing from Primary Sequence with Deep Learning
Author(s) -
Kishore Jaganathan,
Sofia Kyriazopoulou Panagiotopoulou,
Jeremy F. McRae,
Siavash Fazel Darbandi,
David A. Knowles,
Yang Li,
Jack A. Kosmicki,
Juan David Arbelaez,
Wenwu Cui,
Grace Schwartz,
Eric D. Chow,
Efstathios Kanterakis,
Hong Gao,
Amirali Kia,
Serafim Batzoglou,
Stephan Sanders,
Kyle KaiHow Farh
Publication year - 2019
Publication title -
cell
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 26.304
H-Index - 776
eISSN - 1097-4172
pISSN - 0092-8674
DOI - 10.1016/j.cell.2018.12.015
Subject(s) - biology , rna splicing , splice , genetics , alternative splicing , sequence (biology) , gene , computational biology , rna , mutation , messenger rna
The splicing of pre-mRNAs into mature transcripts is remarkable for its precision, but the mechanisms by which the cellular machinery achieves such specificity are incompletely understood. Here, we describe a deep neural network that accurately predicts splice junctions from an arbitrary pre-mRNA transcript sequence, enabling precise prediction of noncoding genetic variants that cause cryptic splicing. Synonymous and intronic mutations with predicted splice-altering consequence validate at a high rate on RNA-seq and are strongly deleterious in the human population. De novo mutations with predicted splice-altering consequence are significantly enriched in patients with autism and intellectual disability compared to healthy controls and validate against RNA-seq in 21 out of 28 of these patients. We estimate that 9%-11% of pathogenic mutations in patients with rare genetic disorders are caused by this previously underappreciated class of disease variation.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom