A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
Author(s) -
Vitale Ornella,
Preste Roberto,
Palmisano Donato,
Attimonelli Marcella
Publication year - 2020
Publication title - Molecular Genetics & Genomic Medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.765
H-Index - 29
ISSN - 2324-9269
DOI - 10.1002/mgg3.1085
Subject(s) - pipeline (software) , mitochondrial dna , context (archaeology) , annotation , mitochondrial disease , computer science , human mitochondrial genetics , computational biology , task (project management) , bioinformatics , biology , genetics , artificial intelligence , gene , paleontology , management , economics , programming language
Background: Human mitochondrial DNA plays an important role in cellular energy production through oxidative phosphorylation. This process may therefore be a cause of, and have an effect on, mitochondrial DNA mutability, functional alteration, and disease onset, which are associated with a wide range of clinical expressions and phenotypes. Although a large part of the observed variation is fixed in a population and hence expected to be benign, estimating the degree of pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal.

Methods: This requires standard criteria based on functional studies. To that end, a "data and text mining" pipeline developed in the R programming language is proposed. It extracts information on mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database.

Results: The data mining pipeline produced a list of 1,073 PubMed IDs (PMIDs), from which the text mining pipeline retrieved information on experimental validation and clinical features for 932 human mitochondrial variants.

Conclusions: The pipeline will support the interpretation of the pathogenicity of human mitochondrial variants, facilitating diagnosis for clinicians and researchers faced with this task.
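
To make the text mining idea concrete, the following is a minimal sketch of one possible step: scanning abstract text for mitochondrial variant mentions and recording which PMIDs mention each one. It is not the authors' implementation (their pipeline is written in R and is not shown in the abstract). The regular expression, the function names `extract_variants` and `build_variant_index`, the placeholder identifiers `PMID_A` and `PMID_B`, and the toy sentences are all assumptions made for illustration. The pattern covers only simple HGVS-style substitutions such as m.3243A>G.

```python
import re
from collections import defaultdict

# Assumed pattern: HGVS-style mtDNA substitutions like "m.3243A>G".
# Real literature uses many other notations that this does not cover.
VARIANT_RE = re.compile(r"\bm\.(\d{1,5})([ACGT])>([ACGT])\b")


def extract_variants(text):
    """Return the set of variant strings mentioned in a piece of text."""
    return {match.group(0) for match in VARIANT_RE.finditer(text)}


def build_variant_index(abstracts):
    """Map each variant to the sorted list of IDs whose text mentions it.

    `abstracts` is a dict of {identifier: abstract text}.
    """
    index = defaultdict(list)
    for pmid in sorted(abstracts):
        for variant in sorted(extract_variants(abstracts[pmid])):
            index[variant].append(pmid)
    return dict(index)


# Toy input with placeholder identifiers, not real PubMed records.
toy_abstracts = {
    "PMID_A": "Cybrid assays were performed for m.3243A>G and m.8993T>G.",
    "PMID_B": "Functional data for m.3243A>G were reported.",
}

variant_index = build_variant_index(toy_abstracts)
for variant, pmids in variant_index.items():
    print(variant, pmids)
```

A real pipeline would also need to retrieve the abstracts, normalize alternative variant notations, and classify the kind of evidence reported. None of that is attempted here.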
