Multi-set Pre-processing of Multicolor Flow Cytometry Data.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
16 06 2020
16 06 2020
Historique:
received:
14
05
2019
accepted:
29
04
2020
entrez:
18
6
2020
pubmed:
18
6
2020
medline:
15
12
2020
Statut:
epublish
Résumé
Flow Cytometry is an analytical technology to simultaneously measure multiple markers per single cell. Ten thousands to millions of single cells can be measured per sample and each sample may contain a different number of cells. All samples may be bundled together, leading to a 'multi-set' structure. Many multivariate methods have been developed for Flow Cytometry data but none of them considers this structure in their quantitative handling of the data. The standard pre-processing used by existing multivariate methods provides models mainly influenced by the samples with more cells, while such a model should provide a balanced view of the biomedical information within all measurements. We propose an alternative 'multi-set' preprocessing that corrects for the difference in number of cells measured, balancing the relative importance of each multi-cell sample in the data while using all data collected from these expensive analyses. Moreover, one case example shows how multi-set pre-processing may benefit removal of undesired measurement-to-measurement variability and another where class-based multi-set pre-processing enhances the studied response upon comparison to the control reference samples. Our results show that adjusting data analysis algorithms to consider this multi-set structure may greatly benefit immunological insight and classification performance of Flow Cytometry data.
Identifiants
pubmed: 32546713
doi: 10.1038/s41598-020-66195-3
pii: 10.1038/s41598-020-66195-3
pmc: PMC7297713
doi:
Substances chimiques
Biomarkers
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
9716Références
Robinson, J. P. & Roederer, M. Flow cytometry strikes gold. Science 350, 739–740, https://doi.org/10.1126/science.aad6770 (2015).
doi: 10.1126/science.aad6770
pubmed: 26564833
Theunissen, P. et al. Standardized flow cytometry for highly sensitive MRD measurements in B-cell acute lymphoblastic leukemia. Blood 129, 347–357, https://doi.org/10.1182/blood-2016-07-726307 (2017).
doi: 10.1182/blood-2016-07-726307
pubmed: 27903527
pmcid: 5291958
Macaulay, I. C. et al. Single-Cell RNA-Sequencing Reveals a Continuous Spectrum of Differentiation in Hematopoietic. Cells. Cell Reports 14, 966–977, https://doi.org/10.1016/j.celrep.2015.12.082 (2016).
doi: 10.1016/j.celrep.2015.12.082
pubmed: 26804912
Tauler, R., Maeder, M. & De Juan, A. in Comprehensive Chemometrics: Chemical and Biochemical Data Analysis 2, 473–505 (2009).
doi: 10.1016/B978-044452701-1.00055-7
Smilde, A. K., Westerhuis, J. A. & de Jong, S. A framework for sequential multiblock component methods. Journal of Chemometrics: A Journal of the Chemometrics Society 17, 323–337 (2003).
doi: 10.1002/cem.811
Saeys, Y., Van Gassen, S. & Lambrecht, B. N. Computational flow cytometry: helping to make sense of high-dimensional immunology data. Nature Reviews Immunology 16, 449 (2016).
doi: 10.1038/nri.2016.56
pubmed: 27320317
Bruggner, R. V., Bodenmiller, B., Dill, D. L., Tibshirani, R. J. & Nolan, G. P. Automated identification of stratifying signatures in cellular subpopulations. Proceedings of the National Academy of Sciences 111, E2770–E2777 (2014).
doi: 10.1073/pnas.1408792111
Van Gassen, S. et al. FlowSOM: Using self‐organizing maps for visualization and interpretation of cytometry data. Cytometry Part A 87, 636–645 (2015).
doi: 10.1002/cyto.a.22625
Qiu, P. et al. Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE. Nature biotechnology 29, 886–891 (2011).
doi: 10.1038/nbt.1991
pubmed: 21964415
pmcid: 3196363
Amir, E.-aD. et al. viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nature biotechnology 31, 545–552 (2013).
doi: 10.1038/nbt.2594
pmcid: 4076922
Tinnevelt, G. H. et al. Novel data analysis method for multicolour flow cytometry links variability of multiple markers on single cells to a clinical phenotype. Scientific Reports 7, 5471 (2017).
doi: 10.1038/s41598-017-05714-1
pubmed: 28710472
pmcid: 5511252
Folcarelli, R. et al. Automated flow cytometric identification of disease-specific cells by the ECLIPSE algorithm. Scientific reports 8, 10907 (2018).
doi: 10.1038/s41598-018-29367-w
pubmed: 30026601
pmcid: 6053450
Pillay, J. et al. A subset of neutrophils in human systemic inflammation inhibits T cell responses through Mac-1. J Clin Invest 122, 327–336, https://doi.org/10.1172/jci57990 (2012).
doi: 10.1172/jci57990
pubmed: 22156198
Wouters, K. et al. Circulating classical monocytes are associated with CD11c(+) macrophages in human visceral adipose tissue. Scientific Reports 7, 42665, https://doi.org/10.1038/srep42665 (2017).
doi: 10.1038/srep42665
pubmed: 28198418
pmcid: 5309742
Engel, J. et al. Breaking with trends in pre-processing? TrAC Trends in Analytical Chemistry 50, 96–106, https://doi.org/10.1016/j.trac.2013.04.015 (2013).
doi: 10.1016/j.trac.2013.04.015
Nemes, E. et al. Differential leukocyte counting and immunophenotyping in cryopreserved ex vivo whole blood. Cytometry Part A 87, 157–165, https://doi.org/10.1002/cyto.a.22610 (2015).
doi: 10.1002/cyto.a.22610
de Ruiter, K. et al. A field-applicable method for flow cytometric analysis of granulocyte activation: Cryopreservation of fixed granulocytes. Cytometry Part A 93, 540–547, https://doi.org/10.1002/cyto.a.23354 (2018).
doi: 10.1002/cyto.a.23354
Johnson, N. L. Systems of Frequency Curves Generated by Methods of Translation. Biometrika. 36, 149–176, https://doi.org/10.1093/biomet/36.1-2.149 (1949).
doi: 10.1093/biomet/36.1-2.149
pubmed: 18132090
Roederer, M. Spectral compensation for flow cytometry: visualization artifacts, limitations, and caveats. Cytometry 45, 194–205 (2001).
doi: 10.1002/1097-0320(20011101)45:3<194::AID-CYTO1163>3.0.CO;2-C
pubmed: 11746088
Finak, G., Perez, J. M., Weng, A. & Gottardo, R. Optimizing transformations for automated, high throughput analysis of flow cytometry data. BMC bioinformatics 11, 546, https://doi.org/10.1186/1471-2105-11-546 (2010).
doi: 10.1186/1471-2105-11-546
pubmed: 21050468
pmcid: 3243046
Bro, R. & Smilde, A. K. Centering and scaling in component analysis. Journal of Chemometrics 17, 16–33, https://doi.org/10.1002/cem.773 (2003).
doi: 10.1002/cem.773
Joliffe, I. T. Principal Component Analysis, Second Edition. Vol. 98 (Springer, 1986).
Lugli, E., Roederer, M. & Cossarizza, A. Data analysis in flow cytometry: the future just started. Cytometry. Part A: the journal of the International Society for Analytical Cytology 77, 705–713, https://doi.org/10.1002/cyto.a.20901 (2010).
doi: 10.1002/cyto.a.20901
Timmerman, M. E. & Kiers, H. A. L. Four simultaneous component models for the analysis of multivariate time series from more than one subject to model intraindividual and interindividual differences. Psychometrika 68, 105–121, https://doi.org/10.1007/BF02296656 (2003).
doi: 10.1007/BF02296656
Westerhuis, J. A., Kourti, T. & MacGregor, J. F. Analysis of multiblock and hierarchical PCA and PLS models. Journal of Chemometrics 12, 301-321, 10.1002/(SICI)1099-128X(199809/10)12:5<301::AID-CEM515>3.0.CO;2-S (1998).
Gower, J. C., Lubbe, S. & Le Roux, N. Understanding Biplots. (Wiley, 2011).
Pillay, J. et al. Functional heterogeneity and differential priming of circulating neutrophils in human experimental endotoxemia. Journal of leukocyte biology 88, 211–220, https://doi.org/10.1189/jlb.1209793 (2010).
doi: 10.1189/jlb.1209793
pubmed: 20400675
Hahne, F. et al. Per‐channel basis normalization methods for flow cytometry data. Cytometry Part A: The Journal of the International Society for Advancement of Cytometry 77, 121–131 (2010).
Vesanto, J., Himberg, J., Alhoniemi, E. & Parhankangas, J. Self-organizing map in Matlab: the SOM Toolbox. Proceedings of the Matlab DSP conference 99 (1999).
Trygg, J. & Wold, S. Orthogonal projections to latent structures (O-PLS). Journal of Chemometrics 16, 119–128, https://doi.org/10.1002/cem.695 (2002).
doi: 10.1002/cem.695
Szymańska, E., Saccenti, E., Smilde, A. & Westerhuis, J. Double-check: validation of diagnostic statistics for PLS-DA models in metabolomics studies. Metabolomics 8, 3–16.