TY - JOUR
T1 - Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types
AU - van Unen, Vincent
AU - Höllt, Thomas
AU - Pezzotti, Nicola
AU - Li, Na
AU - Reinders, Marcel J.T.
AU - Eisemann, Elmar
AU - Koning, Frits
AU - Vilanova, Anna
AU - Lelieveldt, Boudewijn P.F.
PY - 2017
Y1 - 2017
N2 - Mass cytometry allows high-resolution dissection of the cellular composition of the immune system. However, the high-dimensionality, large size, and non-linear structure of the data poses considerable challenges for the data analysis. In particular, dimensionality reduction-based techniques like t-SNE offer single-cell resolution but are limited in the number of cells that can be analyzed. Here we introduce Hierarchical Stochastic Neighbor Embedding (HSNE) for the analysis of mass cytometry data sets. HSNE constructs a hierarchy of non-linear similarities that can be interactively explored with a stepwise increase in detail up to the single-cell level. We apply HSNE to a study on gastrointestinal disorders and three other available mass cytometry data sets. We find that HSNE efficiently replicates previous observations and identifies rare cell populations that were previously missed due to downsampling. Thus, HSNE removes the scalability limit of conventional t-SNE analysis, a feature that makes it highly suitable for the analysis of massive high-dimensional data sets.
AB - Mass cytometry allows high-resolution dissection of the cellular composition of the immune system. However, the high-dimensionality, large size, and non-linear structure of the data poses considerable challenges for the data analysis. In particular, dimensionality reduction-based techniques like t-SNE offer single-cell resolution but are limited in the number of cells that can be analyzed. Here we introduce Hierarchical Stochastic Neighbor Embedding (HSNE) for the analysis of mass cytometry data sets. HSNE constructs a hierarchy of non-linear similarities that can be interactively explored with a stepwise increase in detail up to the single-cell level. We apply HSNE to a study on gastrointestinal disorders and three other available mass cytometry data sets. We find that HSNE efficiently replicates previous observations and identifies rare cell populations that were previously missed due to downsampling. Thus, HSNE removes the scalability limit of conventional t-SNE analysis, a feature that makes it highly suitable for the analysis of massive high-dimensional data sets.
KW - Computational biology and bioinformatics
KW - Flow cytometry
KW - Immunology
UR - http://www.scopus.com/inward/record.url?scp=85034844512&partnerID=8YFLogxK
UR - http://resolver.tudelft.nl/uuid:7d5657e0-a8d6-4e2c-9c67-43f6949cc529
U2 - 10.1038/s41467-017-01689-9
DO - 10.1038/s41467-017-01689-9
M3 - Article
AN - SCOPUS:85034844512
VL - 8
SP - 1
EP - 10
JO - Nature Communications
JF - Nature Communications
SN - 2041-1723
M1 - 1740
ER -