| Literature DB >> 19255640 |
Douglas E Brown1, Amy J Powell, Ignazio Carbone, Ralph A Dean.
Abstract
UNLABELLED: Inexpensive computational power combined with high-throughput experimental platforms has created a wealth of biological information requiring analytical tools and techniques for interpretation. Graph-theoretic concepts and tools have provided an important foundation for information visualization, integration, and analysis of datasets, but they have often been relegated to background analysis tasks. GT-Miner is designed for visual data analysis and mining operations, interacts with other software, including databases, and works with diverse data types. It facilitates a discovery-oriented approach to data mining wherein exploration of alterations of the data and variations of the visualization is encouraged. The user is presented with a basic iterative process, consisting of loading, visualizing, transforming, and then storing the resultant information. Complex analyses are built-up through repeated iterations and user interactions. The iterative process is optimized by automatic layout following transformations and by maintaining a current selection set of interest for elements modified by the transformations. Multiple visualizations are supported including hierarchical, spring, and force-directed self-organizing layouts. Graphs can be transformed with an extensible set of algorithms or manually with an integral visual editor. GT-Miner is intended to allow easier access to visual data mining for the non-expert. AVAILABILITY: The GT-Miner program and supplemental materials, including example uses and a user guide, are freely available from http://www.cifr.ncsu.edu/bioinformatics/downloads/Entities:
Keywords: data mining; graph theory; information visualization; visualization
Year: 2008 PMID: 19255640 PMCID: PMC2646195 DOI: 10.6026/97320630003235
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Application of GT-Miner for visualizing and analyzing gene families. Panel shows an example of the initial relationships for a family of homologous genes from the plant pathogenic fungus Magnaporthe grisea, strain 70-15, determined using the NCBI blastp program. Iterative analysis of the gene family using GT-Miner rapidly reveals that the linkage between MGG_14378.5 and MGG_14423.5 may be erroneously linking two different families.