Literature DB >> 19654113

The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets.

John W Nicol1, Gregg A Helt, Steven G Blanchard, Archana Raja, Ann E Loraine.   

Abstract

UNLABELLED: Experimental techniques that survey an entire genome demand flexible, highly interactive visualization tools that can display new data alongside foundation datasets, such as reference gene annotations. The Integrated Genome Browser (IGB) aims to meet this need. IGB is an open source, desktop graphical display tool implemented in Java that supports real-time zooming and panning through a genome; layout of genomic features and datasets in moveable, adjustable tiers; incremental or genome-scale data loading from remote web servers or local files; and dynamic manipulation of quantitative data via genome graphs. AVAILABILITY: The application and source code are available from http://igb.bioviz.org and http://genoviz.sourceforge.net.

Entities:  

Mesh:

Year:  2009        PMID: 19654113      PMCID: PMC2759552          DOI: 10.1093/bioinformatics/btp472

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Effective use of data from genome-scale assays requires flexible, highly interactive visualization software. To achieve maximum flexibility, genome visualization software should support rapid navigation through multiple zooming scales and across large regions of genomic sequence. Such tools should also enable users to display their data alongside canonical gene annotations, EST alignments and reference datasets harvested from the public domain. Web-based tools, because of their typically tight integration with back-end databases, often make it easy to display one's own data alongside reference datasets, but few match the interactivity and flexibility of desktop software. The Integrated Genome Browser (IGB, pronounced ig-bee) aims to provide the best of both worlds, providing a highly interactive and user-friendly interface, while at the same time offering users the ability to load data from remote databases via web services middleware.

2 IMPLEMENTATION

The IGB is implemented in Java and runs on any computer platform that supports Java version 1.6 or higher.

3 PROGRAM OVERVIEW

The IGB implements a flexible, highly interactive desktop software environment for viewing genome-scale datasets. IGB is the flagship product of the open source Genoviz project, which develops visualization software for bioinformatics and genomics. IGB is based on a library of visualization ‘widgets’ called the Genoviz SDK (Helt et al., 2009). The Genoviz SDK provides a framework for building visualization applications for genomics; it builds on work begun at the Berkeley Drosophila Genome Project (Helt et al., 1998) and continued at Neomorphic Software and then at Affymetrix when the companies merged (Loraine and Helt, 2002). Developers at Affymetrix created the first versions of IGB to support visualization of data from the Affymetrix tiling microarray platform. In 2005, the company moved IGB and the Genoviz SDK to a public version control system at Sourceforge.net and released the software under an open source license. Since then, developers have streamlined the user interface and added new features, such as the ability to handle new data sources. IGB can display data loaded from local files and web servers. IGB loads data from web servers via two protocols: Quickload, an IGB-specific mechanism, and the Distributed Annotation System (DAS), an evolving community standard that supports region-based queries on a genome (Jenkinson et al., 2008). Data providers can also embed links in web pages directing IGB to show a designated region. Examples appear in the web supplement of Cui and Loraine (2006). IGB can load data from multiple sources, allowing users to combine expression, genomic features, methylation, sequence similarity and sequence variation information for a given genome. The DAS and Quickload mechanisms have complementary strengths. Quickload offers a simple way to load an entire data collection at once, such as the set of curated gene models from the Arabidopsis Information Resource (TAIR). Quickload servers are easy to establish, consisting of web accessible or local directories with simple genome descriptor and annotation files. The DAS method works well for data collections that are too large to be viewed productively in their entirety, such as the set of all human ESTs. Data types IGB can display include gene structure annotations, shown as linked blocks with taller blocks indicating translated regions; genomic alignments of expression array target sequences and probes, shown as linked blocks bearing smaller blocks representing probes; and EST/cDNA genomic alignments, shown as linked spans. IGB displays numerical data associated with base pair positions as highly customizable graphs. Users can also use IGB to display data saved to local files on their desktop. IGB supports multiple file formats, including BED and PSL formats developed by UCSC Genome Bioinformatics for scored gene models and genomic alignments, respectively, and wig, bar and sgr formats for genome graphs. IGB informatics harmonizes with UCSC tools; users can populate a Quickload server using data from the UCSC Table Browser. When users load a new dataset or open a file, the new data appear in labeled tracks. Users can click-drag track labels to move tracks to new locations. Right- or control-clicking a track label activates a popup menu with multiple options. One option (Make Annotation Depth Graph) creates a new genome graph summarizing the number of annotations covering each base position, which users can save to a file (Fig. 1).
Fig. 1.

Visualizing ESTs and tiling array data. ESTs (blue) are from a 454 sequencing experiment (Weber et al., 2007). An Annotation Depth Graph (red) summarizes ESTs covering Arabidopsis gene model AT4G37300.1 (dark blue). An expression tiling array genome graph is shown in blue/yellow heatmap style (Yamada et al., 2003). Data are from Arabidopsis seedlings.

Visualizing ESTs and tiling array data. ESTs (blue) are from a 454 sequencing experiment (Weber et al., 2007). An Annotation Depth Graph (red) summarizes ESTs covering Arabidopsis gene model AT4G37300.1 (dark blue). An expression tiling array genome graph is shown in blue/yellow heatmap style (Yamada et al., 2003). Data are from Arabidopsis seedlings. IGB supports dynamic zooming and panning through a genome, allowing users to navigate easily through a genome at multiple scales. Zooming focuses on the user's last click, indicated by a vertical stripe in the display. During zooming, the zoom stripe remains stationary as flanking regions expand or contract in an animated fashion as users operate the zoom controls. The zoom stripe provides a base pair pointer in close-up views for inspecting residues at feature boundaries. The display contains several tabbed control panels and users can move into new windows using the View menu. The Graph Adjuster panel lets the users to fine-tune a graph's appearance and adjust the range of values it displays. It also offers options to add or subtract graphs from each other, providing a first-pass visual assessment of differential expression across sample types. A literature survey identified 70 articles that used IGB in diverse applications, including transcription factor binding site discovery (Kim et al., 2008; Morohashi and Grotewold, 2009; Zheng et al., 2007), chromatin structure or modification assays (He et al., 2008; Lee et al., 2007; Yagi et al., 2008), statistical methods development (Cui and Loraine, 2009; Xing et al., 2006) and gene expression studies (Lang et al., 2009). Based on users' comments (Gresham et al., 2008) and publications, we conclude that IGB's main appeal is flexibility: it provides a highly interactive environment for viewing large amounts of data and can handle diverse data sources and formats.
  17 in total

1.  Dispersed mutations in histone H3 that affect transcriptional repression and chromatin structure of the CHA1 promoter in Saccharomyces cerevisiae.

Authors:  Qiye He; Cailin Yu; Randall H Morse
Journal:  Eukaryot Cell       Date:  2008-07-25

2.  The cost of gene expression underlies a fitness trade-off in yeast.

Authors:  Gregory I Lang; Andrew W Murray; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-19       Impact factor: 11.205

3.  DNA methylation profile of tissue-dependent and differentially methylated regions (T-DMRs) in mouse promoter regions demonstrating tissue-specific gene expression.

Authors:  Shintaro Yagi; Keiji Hirabayashi; Shinya Sato; Wei Li; Yoko Takahashi; Tsutomu Hirakawa; Guoying Wu; Naoko Hattori; Naka Hattori; Jun Ohgane; Satoshi Tanaka; X Shirley Liu; Kunio Shiota
Journal:  Genome Res       Date:  2008-10-29       Impact factor: 9.043

4.  An extended transcriptional network for pluripotency of embryonic stem cells.

Authors:  Jonghwan Kim; Jianlin Chu; Xiaohua Shen; Jianlong Wang; Stuart H Orkin
Journal:  Cell       Date:  2008-03-21       Impact factor: 41.582

5.  Integrating biological data--the Distributed Annotation System.

Authors:  Andrew M Jenkinson; Mario Albrecht; Ewan Birney; Hagen Blankenburg; Thomas Down; Robert D Finn; Henning Hermjakob; Tim J P Hubbard; Rafael C Jimenez; Philip Jones; Andreas Kähäri; Eugene Kulesha; José R Macías; Gabrielle A Reeves; Andreas Prlić
Journal:  BMC Bioinformatics       Date:  2008-07-22       Impact factor: 3.169

6.  Genoviz Software Development Kit: Java tool kit for building genomics visualization applications.

Authors:  Gregg A Helt; John W Nicol; Ed Erwin; Eric Blossom; Steven G Blanchard; Stephen A Chervitz; Cyrus Harmon; Ann E Loraine
Journal:  BMC Bioinformatics       Date:  2009-08-25       Impact factor: 3.169

7.  Consistency analysis of redundant probe sets on affymetrix three-prime expression arrays and applications to differential mRNA processing.

Authors:  Xiangqin Cui; Ann E Loraine
Journal:  PLoS One       Date:  2009-01-23       Impact factor: 3.240

8.  A systems approach reveals regulatory circuitry for Arabidopsis trichome initiation by the GL3 and GL1 selectors.

Authors:  Kengo Morohashi; Erich Grotewold
Journal:  PLoS Genet       Date:  2009-02-27       Impact factor: 5.917

9.  Visualizing the genome: techniques for presenting human genome data and annotations.

Authors:  Ann E Loraine; Gregg A Helt
Journal:  BMC Bioinformatics       Date:  2002-07-30       Impact factor: 3.169

Review 10.  Comparing whole genomes using DNA microarrays.

Authors:  David Gresham; Maitreya J Dunham; David Botstein
Journal:  Nat Rev Genet       Date:  2008-04       Impact factor: 53.242

View more
  371 in total

1.  Using MACS to identify peaks from ChIP-Seq data.

Authors:  Jianxing Feng; Tao Liu; Yong Zhang
Journal:  Curr Protoc Bioinformatics       Date:  2011-06

2.  Genome-wide antisense transcription drives mRNA processing in bacteria.

Authors:  Iñigo Lasa; Alejandro Toledo-Arana; Alexander Dobin; Maite Villanueva; Igor Ruiz de los Mozos; Marta Vergara-Irigaray; Víctor Segura; Delphine Fagegaltier; José R Penadés; Jaione Valle; Cristina Solano; Thomas R Gingeras
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-28       Impact factor: 11.205

3.  Alternative splicing of CD44 mRNA by ESRP1 enhances lung colonization of metastatic cancer cell.

Authors:  Toshifumi Yae; Kenji Tsuchihashi; Takatsugu Ishimoto; Takeshi Motohara; Momoko Yoshikawa; Go J Yoshida; Takeyuki Wada; Takashi Masuko; Kaoru Mogushi; Hiroshi Tanaka; Tsuyoshi Osawa; Yasuharu Kanki; Takashi Minami; Hiroyuki Aburatani; Mitsuyo Ohmura; Akiko Kubo; Makoto Suematsu; Kazuhisa Takahashi; Hideyuki Saya; Osamu Nagano
Journal:  Nat Commun       Date:  2012-06-06       Impact factor: 14.919

4.  Genomic variation in natural populations of Drosophila melanogaster.

Authors:  Charles H Langley; Kristian Stevens; Charis Cardeno; Yuh Chwen G Lee; Daniel R Schrider; John E Pool; Sasha A Langley; Charlyn Suarez; Russell B Corbett-Detig; Bryan Kolaczkowski; Shu Fang; Phillip M Nista; Alisha K Holloway; Andrew D Kern; Colin N Dewey; Yun S Song; Matthew W Hahn; David J Begun
Journal:  Genetics       Date:  2012-06-05       Impact factor: 4.562

5.  Taking the next step: building an Arabidopsis information portal.

Authors: 
Journal:  Plant Cell       Date:  2012-06-29       Impact factor: 11.277

6.  Unraveling cell type-specific and reprogrammable human replication origin signatures associated with G-quadruplex consensus motifs.

Authors:  Emilie Besnard; Amélie Babled; Laure Lapasset; Ollivier Milhavet; Hugues Parrinello; Christelle Dantec; Jean-Michel Marin; Jean-Marc Lemaitre
Journal:  Nat Struct Mol Biol       Date:  2012-07-01       Impact factor: 15.369

7.  Chromatin remodeling around nucleosome-free regions leads to repression of noncoding RNA transcription.

Authors:  Adam N Yadon; Daniel Van de Mark; Ryan Basom; Jeffrey Delrow; Iestyn Whitehouse; Toshio Tsukiyama
Journal:  Mol Cell Biol       Date:  2010-08-30       Impact factor: 4.272

8.  Metagenomic detection of phage-encoded platelet-binding factors in the human oral cavity.

Authors:  Dana Willner; Mike Furlan; Robert Schmieder; Juris A Grasis; David T Pride; David A Relman; Florent E Angly; Tracey McDole; Ray P Mariella; Forest Rohwer; Matthew Haynes
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-14       Impact factor: 11.205

9.  CisGenome Browser: a flexible tool for genomic data visualization.

Authors:  Hui Jiang; Fan Wang; Nigel P Dyer; Wing Hung Wong
Journal:  Bioinformatics       Date:  2010-05-30       Impact factor: 6.937

10.  A quartet of PIF bHLH factors provides a transcriptionally centered signaling hub that regulates seedling morphogenesis through differential expression-patterning of shared target genes in Arabidopsis.

Authors:  Yu Zhang; Oleg Mayba; Anne Pfeiffer; Hui Shi; James M Tepperman; Terence P Speed; Peter H Quail
Journal:  PLoS Genet       Date:  2013-01-31       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.