Literature DB >> 28507238

Clustering: how much bias do we need?

Tom Lorimer1, Jenny Held2, Ruedi Stoop3.   

Abstract

Scientific investigations in medicine and beyond increasingly require observations to be described by more features than can be simultaneously visualized. Simply reducing the dimensionality by projections destroys essential relationships in the data. Similarly, traditional clustering algorithms introduce data bias that prevents detection of natural structures expected from generic nonlinear processes. We examine how these problems can best be addressed, where in particular we focus on two recent clustering approaches, Phenograph and Hebbian learning clustering, applied to synthetic and natural data examples. Our results reveal that already for very basic questions, minimizing clustering bias is essential, but that results can benefit further from biased post-processing.This article is part of the themed issue 'Mathematical methods in medicine: neuroscience, cardiology and pathology'.
© 2017 The Author(s).

Keywords:  dimension reduction; dynamical systems; nonlinear projections; unbiased clustering

Mesh:

Year:  2017        PMID: 28507238      PMCID: PMC5434083          DOI: 10.1098/rsta.2016.0293

Source DB:  PubMed          Journal:  Philos Trans A Math Phys Eng Sci        ISSN: 1364-503X            Impact factor:   4.226


  17 in total

1.  Modeling of spiking-bursting neural behavior using two-dimensional map.

Authors:  Nikolai F Rulkov
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2002-04-10

2.  Sequential superparamagnetic clustering for unbiased classification of high-dimensional chemical data.

Authors:  Thomas Ott; Albert Kern; Ausgar Schuffenhauer; Maxim Popov; Pierre Acklin; Edgar Jacoby; Ruedi Stoop
Journal:  J Chem Inf Comput Sci       Date:  2004 Jul-Aug

3.  Real-world existence and origins of the spiral organization of shrimp-shaped domains.

Authors:  Ruedi Stoop; Philipp Benner; Yoko Uwate
Journal:  Phys Rev Lett       Date:  2010-08-11       Impact factor: 9.161

4.  Periodic orbit analysis demonstrates genetic constraints, variability, and switching in Drosophila courtship behavior.

Authors:  Ruedi Stoop; Benjamin I Arthur
Journal:  Chaos       Date:  2008-06       Impact factor: 3.642

5.  Hebbian self-organizing integrate-and-fire networks for data clustering.

Authors:  Florian Landis; Thomas Ott; Ruedi Stoop
Journal:  Neural Comput       Date:  2010-01       Impact factor: 2.026

6.  Mesocopic comparison of complex networks based on periodic orbits.

Authors:  R Stoop; J Joller
Journal:  Chaos       Date:  2011-03       Impact factor: 3.642

7.  Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data.

Authors:  Lukas M Weber; Mark D Robinson
Journal:  Cytometry A       Date:  2016-12-19       Impact factor: 4.355

8.  Birhythmicity, chaos, and other patterns of temporal self-organization in a multiply regulated biochemical system.

Authors:  O Decroly; A Goldbeter
Journal:  Proc Natl Acad Sci U S A       Date:  1982-11       Impact factor: 11.205

9.  Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis.

Authors:  Jacob H Levine; Erin F Simonds; Sean C Bendall; Kara L Davis; El-ad D Amir; Michelle D Tadmor; Oren Litvin; Harris G Fienberg; Astraea Jager; Eli R Zunder; Rachel Finck; Amanda L Gedman; Ina Radtke; James R Downing; Dana Pe'er; Garry P Nolan
Journal:  Cell       Date:  2015-06-18       Impact factor: 41.582

10.  Critical assessment of automated flow cytometry data analysis techniques.

Authors:  Nima Aghaeepour; Greg Finak; Holger Hoos; Tim R Mosmann; Ryan Brinkman; Raphael Gottardo; Richard H Scheuermann
Journal:  Nat Methods       Date:  2013-02-10       Impact factor: 28.547

View more
  2 in total

Review 1.  Mathematical methods in medicine: neuroscience, cardiology and pathology.

Authors:  José M Amigó; Michael Small
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2017-06-28       Impact factor: 4.226

2.  Assessment of Automated Flow Cytometry Data Analysis Tools within Cell and Gene Therapy Manufacturing.

Authors:  Melissa Cheung; Jonathan J Campbell; Robert J Thomas; Julian Braybrook; Jon Petzing
Journal:  Int J Mol Sci       Date:  2022-03-17       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.