L Boddy1, M F Wilkins, C W Morris. 1. Cardiff School of Biosciences, Cardiff University, Cardiff, United Kingdom. boddyl@cf.ac.uk
Abstract
BACKGROUND: Analytical flow cytometry (AFC), by quantifying sometimes more than 10 optical parameters on cells at rates of approximately 10(3) cells/s, rapidly generates vast quantities of multidimensional data, which provides a considerable challenge for data analysis. We review the application of multivariate data analysis and pattern recognition techniques to flow cytometry. METHODS: Approaches were divided into two broad types depending on whether the aim was identification or clustering. Multivariate statistical approaches, supervised artificial neural networks (ANNs), problems of overlapping character distributions, unbounded data sets, missing parameters, scaling up, and estimating proportions of different types of cells comprised the first category. Classic clustering methods, fuzzy clustering, and unsupervised ANNs comprised the second category. We demonstrate the state of the art by using AFC data on marine phytoplankton populations. RESULTS AND CONCLUSIONS: Information held within the large quantities of data generated by AFC was tractable using ANNs, but for field studies the problem of obtaining suitable training data needs to be resolved, and coping with an almost infinite number of cell categories needs further research. Copyright 2001 Wiley-Liss, Inc.
BACKGROUND: Analytical flow cytometry (AFC), by quantifying sometimes more than 10 optical parameters on cells at rates of approximately 10(3) cells/s, rapidly generates vast quantities of multidimensional data, which provides a considerable challenge for data analysis. We review the application of multivariate data analysis and pattern recognition techniques to flow cytometry. METHODS: Approaches were divided into two broad types depending on whether the aim was identification or clustering. Multivariate statistical approaches, supervised artificial neural networks (ANNs), problems of overlapping character distributions, unbounded data sets, missing parameters, scaling up, and estimating proportions of different types of cells comprised the first category. Classic clustering methods, fuzzy clustering, and unsupervised ANNs comprised the second category. We demonstrate the state of the art by using AFC data on marine phytoplankton populations. RESULTS AND CONCLUSIONS: Information held within the large quantities of data generated by AFC was tractable using ANNs, but for field studies the problem of obtaining suitable training data needs to be resolved, and coping with an almost infinite number of cell categories needs further research. Copyright 2001 Wiley-Liss, Inc.
Authors: Eun-Young Kim; Qing Zeng; James Rawn; Matthew Wand; Alan J Young; Edgar Milford; Steven J Mentzer; Robert A Greenes Journal: Proc AMIA Symp Date: 2002
Authors: Josef Spidlen; Robert C Gentleman; Perry D Haaland; Morgan Langille; Nolwenn Le Meur; Michael F Ochs; Charles Schmitt; Clayton A Smith; Adam S Treister; Ryan R Brinkman Journal: OMICS Date: 2006