
EvoProDom: evolutionary modeling of protein families by assessing translocations of protein domains
Author(s) - Carmi Gon, Gorohovski Alessandro, Frenkel‐Morgenstern Milana
Publication year - 2021
Publication title - FEBS Open Bio
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.718
H-Index - 31
ISSN - 2211-5463
DOI - 10.1002/2211-5463.13245
Subject(s) - biology, protein domain, computational biology, merge (version control), gene, protein evolution, rna splicing, protein family, fusion protein, intein, genetics, computer science, rna, information retrieval, recombinant dna
Here, we introduce a novel ‘evolution of protein domains’ (EvoProDom) model for describing the evolution of proteins based on the ‘mix and merge’ of protein domains. We assembled and integrated genomic and proteomic data comprising protein domain content and orthologous proteins from 109 organisms. In EvoProDom, we characterized evolutionary events, in particular translocations, as reciprocal exchanges of protein domains between orthologous proteins in different organisms. We showed that protein domains that translocate with high frequency are generated by transcripts enriched in trans‐splicing events, that is, the generation of novel transcripts from the fusion of two distinct genes. In EvoProDom, we describe a general method to collate orthologous protein annotation from KEGG, and protein domain content from protein sequences, using tools such as KofamKOALA and Pfam. To summarize, EvoProDom presents a novel model for protein evolution based on the ‘mix and merge’ of protein domains rather than DNA‐based evolution models. This confers the advantage of considering chromosomal alterations as drivers of protein evolutionary events.
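
The idea of a translocation as a reciprocal exchange of a domain between orthologous proteins can be illustrated with a small Python sketch. This is not the EvoProDom implementation: it assumes one simple reading of the abstract, namely that a domain counts as translocated when, between two organisms, it is present in one orthologous group and absent from another in the first organism, with the opposite pattern in the second. The function name `find_translocations`, the table layout, and all organism, group, and domain labels are hypothetical placeholders.

```python
from itertools import combinations

def find_translocations(table):
    """Return candidate translocation events from a toy domain table.

    `table` maps organism -> orthologous group -> set of domain names.
    Each event is (domain, organism_a, group_in_a, organism_b, group_in_b),
    meaning the domain sits in `group_in_a` in organism_a but in
    `group_in_b` in organism_b (assumed definition, see the text above).
    """
    events = []
    for org_a, org_b in combinations(sorted(table), 2):
        # Only groups annotated in both organisms can be compared.
        shared_groups = sorted(set(table[org_a]) & set(table[org_b]))
        for g1, g2 in combinations(shared_groups, 2):
            a1, a2 = table[org_a][g1], table[org_a][g2]
            b1, b2 = table[org_b][g1], table[org_b][g2]
            # Domain in g1 only for org_a, and in g2 only for org_b.
            for domain in sorted((a1 - b1) & (b2 - a2)):
                events.append((domain, org_a, g1, org_b, g2))
            # The mirrored case: g2 only for org_a, g1 only for org_b.
            for domain in sorted((a2 - b2) & (b1 - a1)):
                events.append((domain, org_a, g2, org_b, g1))
    return events

# Hypothetical toy data: DomB sits in group KO_1 in org1 but in KO_2 in org2.
table = {
    "org1": {"KO_1": {"DomA", "DomB"}, "KO_2": {"DomC"}},
    "org2": {"KO_1": {"DomA"}, "KO_2": {"DomB", "DomC"}},
}
events = find_translocations(table)
print(events)
```

In a real analysis, the group labels would come from an orthology annotation step and the domain sets from a domain search over the protein sequences; the nested dictionary is used here only to keep the comparison logic visible.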