z-logo
Premium
Computationally efficient multidimensional analysis of complex flow cytometry data using second order polynomial histograms
Author(s) -
Zaunders John,
Jing Junmei,
Leipold Michael,
Maecker Holden,
Kelleher Anthony D.,
Koch Inge
Publication year - 2016
Publication title -
cytometry part a
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.316
H-Index - 90
eISSN - 1552-4930
pISSN - 1552-4922
DOI - 10.1002/cyto.a.22704
Subject(s) - histogram , bin , multivariate statistics , computer science , cluster analysis , estimator , dimension (graph theory) , cytometry , polynomial , algorithm , mathematics , flow cytometry , biology , combinatorics , statistics , artificial intelligence , image (mathematics) , mathematical analysis , genetics
Many methods have been described for automated clustering analysis of complex flow cytometry data, but so far the goal to efficiently estimate multivariate densities and their modes for a moderate number of dimensions and potentially millions of data points has not been attained. We have devised a novel approach to describing modes using second order polynomial histogram estimators (SOPHE). The method divides the data into multivariate bins and determines the shape of the data in each bin based on second order polynomials, which is an efficient computation. These calculations yield local maxima and allow joining of adjacent bins to identify clusters. The use of second order polynomials also optimally uses wide bins, such that in most cases each parameter (dimension) need only be divided into 4–8 bins, again reducing computational load. We have validated this method using defined mixtures of up to 17 fluorescent beads in 16 dimensions, correctly identifying all populations in data files of 100,000 beads in <10 s, on a standard laptop. The method also correctly clustered granulocytes, lymphocytes, including standard T, B, and NK cell subsets, and monocytes in 9‐color stained peripheral blood, within seconds. SOPHE successfully clustered up to 36 subsets of memory CD4 T cells using differentiation and trafficking markers, in 14‐color flow analysis, and up to 65 subpopulations of PBMC in 33‐dimensional CyTOF data, showing its usefulness in discovery research. SOPHE has the potential to greatly increase efficiency of analysing complex mixtures of cells in higher dimensions. © 2015 International Society for Advancement of Cytometry

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here