Premium
Computational Tools for the Identification and Interpretation of Sequence Motifs in Immunopeptidomes
Author(s) -
Alvarez Bruno,
Barra Carolina,
Nielsen Morten,
Andreatta Massimo
Publication year - 2018
Publication title -
proteomics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.26
H-Index - 167
eISSN - 1615-9861
pISSN - 1615-9853
DOI - 10.1002/pmic.201700252
Subject(s) - computational biology , major histocompatibility complex , computer science , proteomics , deconvolution , identification (biology) , mhc class i , biology , antigen , genetics , algorithm , botany , gene
Recent advances in proteomics and mass‐spectrometry have widely expanded the detectable peptide repertoire presented by major histocompatibility complex (MHC) molecules on the cell surface, collectively known as the immunopeptidome. Finely characterizing the immunopeptidome brings about important basic insights into the mechanisms of antigen presentation, but can also reveal promising targets for vaccine development and cancer immunotherapy. This report describes a number of practical and efficient approaches to analyze immunopeptidomics data, discussing the identification of meaningful sequence motifs in various scenarios and considering current limitations. Guidelines are provided for the filtering of false hits and contaminants, and to address the problem of motif deconvolution in cell lines expressing multiple MHC alleles, both for the MHC class I and class II systems. Finally, it is demonstrated how machine learning can be readily employed by non‐expert users to generate accurate prediction models directly from mass‐spectrometry eluted ligand data sets.