Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 GigaSOM.jl: High-performance clustering and visualization of huge cytometry datasets.

Literature DB >> 33205814

GigaSOM.jl: High-performance clustering and visualization of huge cytometry datasets.

Miroslav Kratochvíl^1,2, Oliver Hunewald³, Laurent Heirendt⁴, Vasco Verissimo⁴, Jiří Vondrášek¹, Venkata P Satagopam^4,5, Reinhard Schneider^4,5, Christophe Trefois^4,5, Markus Ollert^3,6.

Abstract

BACKGROUND: The amount of data generated in large clinical and phenotyping studies that use single-cell cytometry is constantly growing. Recent technological advances allow the easy generation of data with hundreds of millions of single-cell data points with >40 parameters, originating from thousands of individual samples. The analysis of that amount of high-dimensional data becomes demanding in both hardware and software of high-performance computational resources. Current software tools often do not scale to the datasets of such size; users are thus forced to downsample the data to bearable sizes, in turn losing accuracy and ability to detect many underlying complex phenomena.
RESULTS: We present GigaSOM.jl, a fast and scalable implementation of clustering and dimensionality reduction for flow and mass cytometry data. The implementation of GigaSOM.jl in the high-level and high-performance programming language Julia makes it accessible to the scientific community and allows for efficient handling and processing of datasets with billions of data points using distributed computing infrastructures. We describe the design of GigaSOM.jl, measure its performance and horizontal scaling capability, and showcase the functionality on a large dataset from a recent study.
CONCLUSIONS: GigaSOM.jl facilitates the use of commonly available high-performance computing resources to process the largest available datasets within minutes, while producing results of the same quality as the current state-of-art software. Measurements indicate that the performance scales to much larger datasets. The example use on the data from a massive mouse phenotyping effort confirms the applicability of GigaSOM.jl to huge-scale studies.

Entities: Chemical Disease Species

Keywords: Julia; clustering; dimensionality reduction; high-performance computing; self-organizing maps; single-cell cytometry

Year: 2020 PMID： 33205814 PMCID： PMC7672468 DOI： 10.1093/gigascience/giaa127

Source DB: PubMed Journal: Gigascience ISSN： 2047-217X Impact factor: 6.524

15 in total

Review 1. The end of gating? An introduction to automated analysis of high dimensional cytometry data.

Authors: Florian Mair; Felix J Hartmann; Dunja Mrdjen; Vinko Tosevski; Carsten Krieg; Burkhard Becher
Journal: Eur J Immunol Date: 2015-11-30 Impact factor: 5.532

2. FlowSOM: Using self-organizing maps for visualization and interpretation of cytometry data.

Authors: Sofie Van Gassen; Britt Callebaut; Mary J Van Helden; Bart N Lambrecht; Piet Demeester; Tom Dhaene; Yvan Saeys
Journal: Cytometry A Date: 2015-01-08 Impact factor: 4.355

3. Essentials of the self-organizing map.

Authors: Teuvo Kohonen
Journal: Neural Netw Date: 2012-10-04

4. Automated identification of stratifying signatures in cellular subpopulations.

Authors: Robert V Bruggner; Bernd Bodenmiller; David L Dill; Robert J Tibshirani; Garry P Nolan
Journal: Proc Natl Acad Sci U S A Date: 2014-06-16 Impact factor: 11.205

5. Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry.

Authors: Dmitry R Bandura; Vladimir I Baranov; Olga I Ornatsky; Alexei Antonov; Robert Kinach; Xudong Lou; Serguei Pavlov; Sergey Vorobiev; John E Dick; Scott D Tanner
Journal: Anal Chem Date: 2009-08-15 Impact factor: 6.986

6. Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE.

Authors: Peng Qiu; Erin F Simonds; Sean C Bendall; Kenneth D Gibbs; Robert V Bruggner; Michael D Linderman; Karen Sachs; Garry P Nolan; Sylvia K Plevritis
Journal: Nat Biotechnol Date: 2011-10-02 Impact factor: 54.908

7. Sensitive detection of rare disease-associated cell subsets via representation learning.

Authors: Eirini Arvaniti; Manfred Claassen
Journal: Nat Commun Date: 2017-04-06 Impact factor: 14.919

8. Testing for differential abundance in mass cytometry data.

Authors: Aaron T L Lun; Arianne C Richard; John C Marioni
Journal: Nat Methods Date: 2017-05-15 Impact factor: 28.547

9. Spectral Cytometry Has Unique Properties Allowing Multicolor Analysis of Cell Suspensions Isolated from Solid Tissues.

Authors: Sandrine Schmutz; Mariana Valente; Ana Cumano; Sophie Novault
Journal: PLoS One Date: 2016-08-08 Impact factor: 3.240

10. Generalized EmbedSOM on quadtree-structured self-organizing maps.

Authors: Miroslav Kratochvíl; Abhishek Koladiya; Jiří Vondrášek
Journal: F1000Res Date: 2019-12-18

1 in total

1. Evidence of premature lymphocyte aging in people with low anti-spike antibody levels after BNT162b2 vaccination.

Authors: Yapei Huang; Juliana E Shin; Alexander M Xu; Changfu Yao; Sandy Joung; Min Wu; Ruan Zhang; Bongha Shin; Joslyn Foley; Simeon B Mahov; Matthew E Modes; Joseph E Ebinger; Matthew Driver; Jonathan G Braun; Caroline A Jefferies; Tanyalak Parimon; Chelsea Hayes; Kimia Sobhani; Akil Merchant; Sina A Gharib; Stanley C Jordan; Susan Cheng; Helen S Goodridge; Peter Chen
Journal: iScience Date: 2022-09-26

1 in total