Domain-specific introduction to machine learning terminology, pitfalls and opportunities in CRISPR-based gene editing
Author(s) -
Aidan R. O’Brien,
Gaétan Burgio,
Denis C. Bauer
Publication year - 2019
Publication title -
briefings in bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.204
H-Index - 113
eISSN - 1477-4054
pISSN - 1467-5463
DOI - 10.1093/bib/bbz145
Subject(s) - jargon , computer science , terminology , crispr , context (archaeology) , domain (mathematical analysis) , interoperability , genome editing , data science , machine learning , artificial intelligence , gene , world wide web , biology , paleontology , mathematical analysis , philosophy , biochemistry , linguistics , mathematics
The use of machine learning (ML) has become prevalent in the genome engineering space, with applications ranging from predicting target site efficiency to forecasting the outcome of repair events. However, jargon and ML-specific accuracy measures have made it hard to assess the validity of individual approaches, potentially leading to misinterpretation of ML results. This review aims to close the gap by discussing ML approaches and pitfalls in the context of CRISPR gene-editing applications. Specifically, we address common considerations, such as algorithm choice, as well as problems, such as overestimating accuracy and data interoperability, by providing tangible examples from the genome-engineering domain. Equipping researchers with the knowledge to effectively use ML to better design gene-editing experiments and predict experimental outcomes will help advance the field more rapidly.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom