| Literature DB >> 26906061 |
Erdogan Taskesen1, Marcel J T Reinders1.
Abstract
The GTEx Consortium reported that hierarchical clustering of RNA profiles from 25 unique tissue types among 1641 individuals accurately distinguished the tissue types, but a multidimensional scaling failed to generate a 2D projection of the data that separates tissue-subtypes. In this study we show that a projection by t-Distributed Stochastic Neighbor Embedding is in line with the cluster analysis which allows a more detailed examination and visualization of human tissue relationships.Entities:
Mesh:
Year: 2016 PMID: 26906061 PMCID: PMC4764374 DOI: 10.1371/journal.pone.0149853
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The tissue landscape of the gene expression profiles illustrated in a two-dimensional map.
(A) Projection of 1641 GTEx samples in a 2D-map: Each point represents a sample which is coloured according to the (sub)tissue label (45 in total). Samples that cluster outside the matching tissue subtype-cluster are circled in red. Sample clusters are illustrated by the 22 differentially coloured density maps. (B) the projection of 313 brain samples and their associated brain-tissue. (C) Comparison of brain samples clustered by the HC approach versus the t-SNE approach resulted in cophenetic correlation[6] of 0.68. An edge links the sample ID positions between the HC and t-SNE clustering. Edge colours are based on the brain tissue regions. Clusters are labelled if a particular brain region was significantly overrepresented in the cluster (hypergeometric test with P≤0.001).
Fig 2Cluster associations and comparison with the hierarchical clustering approach.
(A) Association of the clusters based on the t-SNE-map with the tissue types. A star indicates significant overrepresentation (P≤0.0001) of samples in a cluster with the respective tissue label. Red-coloured numbers depict samples outside the cluster that matches their tissue label. Coloured squares depict the percentage of samples that overlap. (B) Comparison of the t-SNE-map based clusters versus the HC approach[2]. A star indicates significant over-representation (P≤0.0001) of samples in a cluster of the t-SNE-map with samples in a cluster of the HC approach[2]. Coloured squares depict the percentage of samples that overlap.