Premium
Assigning protein subcellular distributions of vesicle associated tail‐anchored membrane proteins by image‐based machine learning
Author(s) -
Schormann Wiebke,
Hariharan Santosh,
Andrews David
Publication year - 2021
Publication title -
the faseb journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.709
H-Index - 277
eISSN - 1530-6860
pISSN - 0892-6638
DOI - 10.1096/fasebj.2021.35.s1.04091
Subject(s) - subcellular localization , organelle , endoplasmic reticulum , protein subcellular localization prediction , microbiology and biotechnology , membrane protein , protein sorting signals , green fluorescent protein , fusion protein , biology , protein targeting , vesicle , biochemistry , cytoplasm , membrane , peptide sequence , gene , signal peptide , recombinant dna
Subcellular localization of proteins is a key feature of eukaryotes. Traditionally, co‐staining with known marker proteins (antibody based or fluorescence fusion protein) or organelle‐specific dyes (e.g., MitoTracker TM ) have been used to study subcellular localization of proteins of interest, followed by, a final inspection of the fluorescence microscope images by an experimentalist. However, human visual examination is inclined to bias and membrane proteins are trafficked in vesicles between subcellular locations such that assignment to a specific organelle can be misleading. As an alternative way to assign localization we generated a reference library of confocal micrographs of EGFP fusion proteins localized at key subcellular organelles in murine and human cell lines. Rather than assign the localization of an unknown protein to a specific organelle we consider which reference protein has the most similar subcellular distribution from image sets that are optically validated and comprise 789,011 and 523,319 individual human and murine cell images, respectively. Both morphology and statistical features were computed to enable automated assignment of the subcellular distribution for query proteins by machine learning algorithms with high accuracy. By means of this tool we investigate subcellular transport and distribution for model tail‐anchored proteins with randomly mutated C‐terminal targeting sequences for the endoplasmic reticulum and through out the secretory pathway.