
Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling
Author(s) -
Allen W. Zhang,
Ciara H. O’Flanagan,
Elizabeth A. Chavez,
Jamie Lim,
Nicholas Ceglia,
Andrew McPherson,
Matt Wiens,
Pascale Walters,
Tim Chan,
Brittany Hewitson,
Daniel Lai,
Anja Mottok,
Clémentine Sarkozy,
Lauren C. Chong,
Tomohiro Aoki,
Xuehai Wang,
Andrew P. Weng,
Jessica N. McAlpine,
Samuel Aparicio,
Christian Steidl,
Kieran R Campbell,
Sohrab P. Shah
Publication year - 2019
Publication title -
nature methods
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 19.469
H-Index - 318
eISSN - 1548-7105
pISSN - 1548-7091
DOI - 10.1038/s41592-019-0529-1
Subject(s) - cell type , computational biology , computer science , probabilistic logic , tumor microenvironment , rna seq , cell , biology , transcriptome , gene , artificial intelligence , genetics , cancer , gene expression
Single-cell RNA sequencing has enabled the decomposition of complex tissues into functionally distinct cell types. Often, investigators wish to assign cells to cell types through unsupervised clustering followed by manual annotation or via 'mapping' to existing data. However, manual interpretation scales poorly to large datasets, mapping approaches require purified or pre-annotated data and both are prone to batch effects. To overcome these issues, we present CellAssign, a probabilistic model that leverages prior knowledge of cell-type marker genes to annotate single-cell RNA sequencing data into predefined or de novo cell types. CellAssign automates the process of assigning cells in a highly scalable manner across large datasets while controlling for batch and sample effects. We demonstrate the advantages of CellAssign through extensive simulations and analysis of tumor microenvironment composition in high-grade serous ovarian cancer and follicular lymphoma.