Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types
Vincent van Unen, Thomas Höllt, Nicola Pezzotti, Na Li, Marcel J T Reinders, Elmar Eisemann, Frits Koning, Anna Vilanova, Boudewijn P F Lelieveldt
Abstract
Mass cytometry allows high-resolution dissection of the cellular composition of the immune system. However, the high dimensionality, large size, and non-linear structure of the data pose considerable challenges for data analysis. In particular, dimensionality reduction-based techniques such as t-SNE offer single-cell resolution but are limited in the number of cells that can be analyzed. Here we introduce Hierarchical Stochastic Neighbor Embedding (HSNE) for the analysis of mass cytometry data sets. HSNE constructs a hierarchy of non-linear similarities that can be interactively explored with a stepwise increase in detail up to the single-cell level. We apply HSNE to a study on gastrointestinal disorders and three other available mass cytometry data sets. We find that HSNE efficiently replicates previous observations and identifies rare cell populations that were previously missed due to downsampling. Thus, HSNE removes the scalability limit of conventional t-SNE analysis, a feature that makes it highly suitable for the analysis of massive high-dimensional data sets.
Conflict of interest statement
The authors declare no competing financial interests.
Source: PubMed