| Literature DB >> 20096121 |
Abstract
BACKGROUND: The clustered heat map is the most popular means of visualizing genomic data. It compactly displays a large amount of data in an intuitive format that facilitates the detection of hidden structures and relations in the data. However, it is hampered by its use of cluster analysis which does not always respect the intrinsic relations in the data, often requiring non-standardized reordering of rows/columns to be performed post-clustering. This sometimes leads to uninformative and/or misleading conclusions. Often it is more informative to use dimension-reduction algorithms (such as Principal Component Analysis and Multi-Dimensional Scaling) which respect the topology inherent in the data. Yet, despite their proven utility in the analysis of biological data, they are not as widely used. This is at least partially due to the lack of user-friendly visualization methods with the visceral impact of the heat map.Entities:
Mesh:
Year: 2010 PMID: 20096121 PMCID: PMC3098076 DOI: 10.1186/1471-2105-11-45
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Different ways of representing the cyclic genes for the alpha experiment in Spellman [26]. (a) is the standard heat map using average-linkage hierarchical clustering in MeV, shown here for comparison. (b) is the result of 2D nMDS. The profiles for all the genes in each grid cell in (b) are shown using lineplot in the corresponding grid cell in (c). (d) shows heatmap1 in which the angular positions of genes in (b) is used to reorder the rows in (a). (e) is circularmap using the angular positions of points in (b).
Figure 2Representations of the tissue relations in the human gene atlas data: (a) is the average-linkage hierarchical clustering (using Pearson correlation) result applied to the tissues; (b) shows the superimposition of the clustering result on a 2D nMDS embedding of tissues using draw.dendrogram3d.
Figure 3Representations of the human gene atlas data: (a) shows the expression profiles underlying figure 2(b) using profileplot3d. The different groups of tissues are marked with labels of differing colors. (b) is a stereo plot of the same result created using stereo.profileplot3d. (BP = Brain Proper, ONS = Other Nervous System and R = Rest)