Dimensionality reduction by UMAP reinforces sample heterogeneity analysis in bulk transcriptomic data.


Yang Yang¹, Hongjian Sun², Yu Zhang³, Tiefu Zhang⁴, Jialei Gong⁵, Yunbo Wei³, Yong-Gang Duan⁵, Minglei Shu⁶, Yuchen Yang⁷, Di Wu⁸, Di Yu⁹.

Abstract

Transcriptomic analysis plays a key role in biomedical research. Linear dimensionality reduction methods, especially principal-component analysis (PCA), are widely used to detect sample-to-sample heterogeneity, while recently developed non-linear methods, such as t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP), can efficiently cluster heterogeneous samples in single-cell RNA sequencing analysis. Yet the application of t-SNE and UMAP to bulk transcriptomic analysis, and their comparison with conventional methods, have not been reported. We compare four major dimensionality reduction methods (PCA, multidimensional scaling [MDS], t-SNE, and UMAP) in analyzing 71 large bulk transcriptomic datasets. UMAP is superior to PCA and MDS and shows some advantages over t-SNE in differentiating batch effects, identifying pre-defined biological groups, and revealing in-depth clusters in two-dimensional space. Importantly, UMAP generates sample clusters that uncover biological features and clinical meaning. We recommend deploying UMAP for visualizing and analyzing sizable bulk transcriptomic datasets to reinforce sample heterogeneity analysis.
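To make the comparison described in the abstract concrete, the following is a minimal sketch of how one might embed a samples-by-genes matrix in two dimensions with each of the four methods. It is not the authors' pipeline. The synthetic data, the function names (`make_synthetic_expression`, `pca_2d`, `embed_all`, `group_separation`), the parameter values, and the separation score are all assumptions chosen for illustration. MDS and t-SNE are taken from scikit-learn and UMAP from the `umap-learn` package, and each is skipped if its package is not installed.

```python
import numpy as np


def make_synthetic_expression(n_per_group=30, n_genes=200, seed=0):
    """Hypothetical stand-in for a bulk expression matrix (samples x genes).

    Three sample groups each get a mean shift in a distinct block of 20 genes.
    """
    rng = np.random.default_rng(seed)
    blocks, labels = [], []
    for g in range(3):
        shift = np.zeros(n_genes)
        shift[g * 20:(g + 1) * 20] = 3.0
        blocks.append(rng.normal(size=(n_per_group, n_genes)) + shift)
        labels += [g] * n_per_group
    return np.vstack(blocks), np.array(labels)


def pca_2d(X):
    """PCA via SVD of the centered matrix; returns the first two PC scores."""
    Xc = X - X.mean(axis=0)
    U, S, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :2] * S[:2]


def embed_all(X, seed=0):
    """Return a dict of 2-D embeddings; optional methods are skipped if absent."""
    embeddings = {"PCA": pca_2d(X)}
    try:
        from sklearn.manifold import MDS, TSNE
        embeddings["MDS"] = MDS(n_components=2, random_state=seed).fit_transform(X)
        # perplexity must be smaller than the number of samples
        embeddings["t-SNE"] = TSNE(
            n_components=2, perplexity=30, random_state=seed
        ).fit_transform(X)
    except ImportError:
        pass
    try:
        import umap  # provided by the umap-learn package
        embeddings["UMAP"] = umap.UMAP(
            n_components=2, n_neighbors=15, random_state=seed
        ).fit_transform(X)
    except ImportError:
        pass
    return embeddings


def group_separation(emb, labels):
    """Illustrative score (an assumption, not the paper's metric):
    mean distance between group centroids divided by the mean distance
    of samples to their own centroid. Larger means better-separated groups.
    """
    groups = np.unique(labels)
    centroids = np.array([emb[labels == g].mean(axis=0) for g in groups])
    within = np.mean([
        np.linalg.norm(emb[labels == g] - c, axis=1).mean()
        for g, c in zip(groups, centroids)
    ])
    pairwise = np.linalg.norm(centroids[:, None, :] - centroids[None, :, :], axis=-1)
    between = pairwise[np.triu_indices(len(groups), k=1)].mean()
    return between / within
```

Calling `embed_all(X)` on the synthetic matrix and passing each embedding to `group_separation` with the group labels gives one rough number per method. On real bulk data, the known batch or biological labels would replace the synthetic labels, and conclusions should rest on more than a single score.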

Keywords: PCA; UMAP; bulk transcriptomics; clustering structure; dimensionality reduction; heterogeneity analysis; t-SNE

Year: 2021 PMID: 34320340 DOI: 10.1016/j.celrep.2021.109442

Source DB: PubMed Journal: Cell Rep Impact factor: 9.423

Cited

8 in total

Review 1. Targeting T_FH cells in human diseases and vaccination: rationale and practice.

Authors: Di Yu; Lucy S K Walker; Zheng Liu; Michelle A Linterman; Zhanguo Li
Journal: Nat Immunol Date: 2022-07-11 Impact factor: 31.250

2. A machine learning approach utilizing DNA methylation as an accurate classifier of COVID-19 disease severity.

Authors: Scott Bowler; Georgios Papoutsoglou; Aristides Karanikas; Ioannis Tsamardinos; Michael J Corley; Lishomwa C Ndhlovu
Journal: Sci Rep Date: 2022-10-19 Impact factor: 4.996

3. Characterizing partisan political narrative frameworks about COVID-19 on Twitter.

Authors: Elise Jing; Yong-Yeol Ahn
Journal: EPJ Data Sci Date: 2021-10-30 Impact factor: 3.184

4. Blood Transcriptome Analysis of Septic Patients Reveals a Long Non-Coding Alu-RNA in the Complement C5a Receptor 1 Gene.

Authors: Åse Emblem; Erik Knutsen; Tor Erik Jørgensen; Hilde Fure; Steinar Daae Johansen; Ole-Lars Brekke; Tom Eirik Mollnes; Bård Ove Karlsen
Journal: Noncoding RNA Date: 2022-03-29

Review 5. A Toolkit for Profiling the Immune Landscape of Pediatric Central Nervous System Malignancies.

6. Visual Clustering of Transcriptomic Data from Primary and Metastatic Tumors-Dependencies and Novel Pitfalls.

7. Reclassifying tumour cell cycle activity in terms of its tissue of origin.

8. Oncocytoma-Related Gene Signature to Differentiate Chromophobe Renal Cancer and Oncocytoma Using Machine Learning.
