Probability binning comparison: a metric for quantitating multivariate distribution differences.

M Roederer, W Moore, A Treister, R R Hardy, L A Herzenberg.

Abstract

BACKGROUND: While several algorithms for the comparison of univariate distributions arising from flow cytometric analyses have been developed and studied for many years, algorithms for comparing multivariate distributions remain elusive. Such algorithms could be useful for comparing differences between samples based on several independent measurements, rather than differences based on any single measurement. It is conceivable that distributions could be completely distinct in multivariate space, but unresolvable in any combination of univariate histograms. Multivariate comparisons could also be useful for providing feedback about instrument stability, when only subtle changes in measurements are occurring.
METHODS: We apply a variant of Probability Binning, described in the accompanying article, to multidimensional data. In this approach, hyper-rectangles of n dimensions (where n is the number of measurements being compared) comprise the bins used for the chi-squared statistic. These hyper-dimensional bins are constructed such that the control sample has the same number of events in each bin; the bins are then applied to the test samples for chi-squared calculations.
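
The binning step described in METHODS can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the splitting dimension or cut point is chosen, so the sketch assumes each bin is split at the median of whichever measurement has the largest variance. The names `build_bins`, `count_in_bins`, and `min_events` are hypothetical.

```python
import numpy as np

def build_bins(control, min_events):
    """Split control events into hyper-rectangles holding equal event counts.

    Assumption (not stated in the abstract): each split is made at the
    median of the dimension with the largest variance. Returns a list of
    (lower, upper) bound arrays, one pair per bin.
    """
    n_dims = control.shape[1]
    bins = []
    stack = [(control, np.full(n_dims, -np.inf), np.full(n_dims, np.inf))]
    while stack:
        events, lo, hi = stack.pop()
        if len(events) < 2 * min_events:
            bins.append((lo, hi))          # too few events to split again
            continue
        dim = int(np.argmax(events.var(axis=0)))
        cut = np.median(events[:, dim])
        left = events[events[:, dim] <= cut]
        right = events[events[:, dim] > cut]
        if len(left) == 0 or len(right) == 0:
            bins.append((lo, hi))          # heavy ties: stop splitting here
            continue
        hi_left = hi.copy()
        hi_left[dim] = cut
        lo_right = lo.copy()
        lo_right[dim] = cut
        stack.append((left, lo, hi_left))
        stack.append((right, lo_right, hi))
    return bins

def count_in_bins(events, bins):
    """Count how many events fall inside each hyper-rectangle (lo, hi]."""
    counts = np.empty(len(bins))
    for i, (lo, hi) in enumerate(bins):
        inside = np.all((events > lo) & (events <= hi), axis=1)
        counts[i] = inside.sum()
    return counts
```

The bins are built once from the control sample and then reused unchanged for every test sample, so the per-bin counts of control and test are directly comparable in the chi-squared calculation.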
RESULTS: Using a Monte-Carlo simulation, we determined the distribution of chi-squared values obtained by comparing sets of events from the same distribution; this distribution of chi-squared values was identical to that obtained with the univariate algorithm. Hence, the same formulae can be used to construct a metric, analogous to a t-score, that estimates the probability with which distributions are distinct. As for univariate comparisons, this metric scales with the difference between two distributions, and can be used to rank samples according to similarity to a control. We apply the algorithm to multivariate immunophenotyping data, and demonstrate that it can be used to discriminate distinct samples and to rank samples according to a biologically meaningful difference.
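
The t-score-like metric mentioned in RESULTS can be illustrated with a short sketch. The abstract does not give the formulae, so the form below is an assumption and should be checked against the accompanying univariate article: a normalized chi-squared computed from per-bin event fractions, compared with an assumed expected value B/N and standard deviation sqrt(B)/N for samples drawn from the same distribution, where B is the number of bins and N the smaller of the two event counts. The function names are hypothetical.

```python
import numpy as np

def normalized_chi2(control_counts, test_counts):
    """Chi-squared computed on per-bin fractions rather than raw counts."""
    c = np.asarray(control_counts, dtype=float)
    s = np.asarray(test_counts, dtype=float)
    c = c / c.sum()
    s = s / s.sum()
    denom = c + s
    mask = denom > 0                      # skip bins empty in both samples
    return float(np.sum((c[mask] - s[mask]) ** 2 / denom[mask]))

def t_chi(control_counts, test_counts):
    """t-score-like distance between two binned samples.

    Assumed form: (chi2 - B/N) / (sqrt(B)/N), floored at zero, with
    B = number of bins and N = the smaller of the two event counts.
    """
    n_bins = len(control_counts)
    n_min = min(np.sum(control_counts), np.sum(test_counts))
    chi2 = normalized_chi2(control_counts, test_counts)
    expected = n_bins / n_min
    std_dev = np.sqrt(n_bins) / n_min
    return max(0.0, (chi2 - expected) / std_dev)
```

Under these assumptions, two samples with identical per-bin counts score 0, and the score grows as the test sample's events concentrate in bins where the control has few, which is what allows samples to be ranked by similarity to a control.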
CONCLUSION: Probability binning, as shown here, provides a useful metric for determining the probability with which two or more multivariate distributions represent distinct sets of data. The metric can be used to identify the similarity or dissimilarity of samples. Finally, as demonstrated in the accompanying paper, the algorithm can be used to gate on events in one sample that are different from a control sample, even if those events cannot be distinguished on the basis of any combination of univariate or bivariate displays. Published 2001 Wiley-Liss, Inc.

Year:  2001        PMID: 11598946     DOI: 10.1002/1097-0320(20010901)45:1<47::aid-cyto1143>3.0.co;2-a

Source DB:  PubMed          Journal:  Cytometry        ISSN: 0196-4763

