Literature DB >> 19846439

G-compass: a web-based comparative genome browser between human and other vertebrate genomes.

Yoshihiro Kawahara1, Ryuichi Sakate, Akihiro Matsuya, Katsuhiko Murakami, Yoshiharu Sato, Hao Zhang, Takashi Gojobori, Takeshi Itoh, Tadashi Imanishi.   

Abstract

SUMMARY: G-compass is designed for efficient comparative genome analysis between human and other vertebrate genomes. The current version of G-compass allows us to browse two corresponding genomic regions between human and another species in parallel. One-to-one evolutionarily conserved regions (i.e. orthologous regions) between species are highlighted along the genomes. Information such as locations of duplicated regions, copy number variations and mammalian ultra-conserved elements is also provided. These features of G-compass enable us to easily determine patterns of genomic rearrangements and changes in gene orders through evolutionary time. Since G-compass is a satellite database of H-InvDB, which is a comprehensive annotation resource for human genes and transcripts, users can easily refer to manually curated functional annotations and other abundant biological information for each human transcript. G-compass is expected to be a valuable tool for comparing human and model organisms and promoting the exchange of functional information. AVAILABILITY: G-compass is freely available at http://www.h-invitational.jp/g-compass/. CONTACT: t.imanishi@aist.go.jp

Entities:  

Mesh:

Year:  2009        PMID: 19846439      PMCID: PMC2788932          DOI: 10.1093/bioinformatics/btp594

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Information regarding evolutionary sequence conservation enables us to exchange biological knowledge between human and model organisms, since conserved sequences generally have important and similar functions across species. There are various genome browsers that show gene structures with patterns of genomic conservation, such as the UCSC Genome Browser (Karolchik et al., 2008), the Ensembl Genome Browser (Hubbard et al., 2009) and the VISTA Genome Browser (Visel et al., 2007). However, these browsers were designed to consider one genome as a reference, showing the patterns of genomic conservation of other genomes along the reference genome. The previous version of G-compass (Fujii et al., 2005), which only provided humanmouse genome comparisons, also considered the human genome as a reference. These browsers do not provide a quick overview of the genomic conservations and synteny information simultaneously in parallel. We found that only Combo (Engels et al., 2006) displays two genomic regions for different species in parallel and allows us to easily determine genomic rearrangements and changes in gene orders through evolutionary time. However, because Combo is provided as a Java program, it requires installation and configuration of a Java client. Furthermore, users have to prepare data in an appropriate format to be shown, making it difficult to use Combo, especially for bench biologists who are not familiar with computer-based analysis. Here we present a new version of G-compass, which is freely available for all researchers via commonly used web browsers in any operating system, such as Windows, Mac or Linux, and has greatly improved user-friendly interfaces. Information on genomic conservations between human and another species is visually displayed simultaneously in parallel. G-compass also allow in silico biologists to download all information of one-to-one conserved genomic regions between human and other species for further analyses.

2 DATA CONSTRUCTION AND USAGE

Whole genome sequences were obtained from the UCSC web site (http://genome.ucsc.edu/): human (hg18), chimpanzee (panTro2), rhesus monkey (rheMac2), mouse (mm8), rat (rn4), dog (canFam2), cow (bosTau3), horse (equCab1), opossum (monDom4), chicken (galGal3), zebrafish (danRer4), medaka (oryLat1) and tetraodon (tetNig1). To identify evolutionary-conserved regions with duplicated regions, BLASTZ sequence similarity searches (Schwartz et al., 2003) were performed. Whole genome sequences were cut into overlapping fragments (‘10 Mb + 10 kb overlap’ for the reference genome and ‘50 Mb + 10 Mb overlap’ for the query genome). Round-robin BLASTZ searches were performed for genome fragments with the following parameters: ‘C=0 H=2000 Y=3400’ for primates, ‘C=0 H=2000’ for other mammals and ‘C=0 H=2000 Q=HoxD55.q’ for non-mammalian species. Subsequently, ‘reversed’ round-robin BLASTZ searches were performed by replacing the reference with the query and vice versa. Results of these searches were merged to produce genome alignments. Because the BLASTZ parameters used in these searches were more relaxed than those used for comparative genomics tracks in the UCSC Genome Browser, more conserved-regions are provided in G-compass; for example, numbers of conserved regions between the human and mouse genomes were 4 462 310 and 1 769 239, respectively, and coverage of the human genome was 36.3% and 35.9% for G-compass and axtNet data of UCSC, respectively. In addition to the conserved regions with duplicated regions, one-to-one (1:1) orthologous genomic regions were identified by removing redundancy in the genome alignments. In the top page of G-compass, three entrances to the Main view are available: (i) keyword search for all transcripts and one-to-one orthologous genomic regions; (ii) the blast-like alignment tool (BLAT; Kent, 2002) for sequence similarity searches against the human, chimpanzee and mouse genomes; and (iii) clickable human chromosome maps with patterns of conserved regions with other species painted (Fig. 1A). In the Main view, two corresponding genomic regions in different species are displayed in parallel. For both genomes, mapped transcripts are shown and one-to-one orthologous genomic regions are highlighted as bands (light blue) (Fig. 1B). Copy number variations (CNVs) among human individuals and ultra-conserved elements (UCEs), which are genomic regions of at least 200 bp with 100% nucleotide identity between human and other mammalian organisms, are provided in genomic feature tracks. Clicking on each transcript or one-to-one orthologous genomic region, a pop-up window will display information. Users can get more detailed functional annotation and orthologous relationships from the links to H-InvDB (Genome Information Integration Project and H-Invitational 2, 2008) and Evola (Matsuya et al., 2008), respectively, in the pop-up window. The genome alignment viewer, also available from the link in the pop-up windows, provides detailed nucleotide sequence alignments and a user-interactive sliding window analysis tool for calculating the rate of nucleotide substitutions, gaps and GC contents (Fig. 1C). CGPLOT is a dot-plot viewer used to effectively investigate genome rearrangements between two genomic regions by comparing mapped gene structures (Fig. 1D).
Fig. 1.

Screenshots of G-compass. (A) Top page providing three entrances for users. (B) Main view showing conserved genomic regions between human and mouse in parallel. (C) Genome alignment viewer showing the levels of sequence conservation by sliding window analysis and the nucleotide sequence alignment with gene structures. (D) CGPLOT showing a dot-plot graph of two genomic regions with gene structures.

Screenshots of G-compass. (A) Top page providing three entrances for users. (B) Main view showing conserved genomic regions between human and mouse in parallel. (C) Genome alignment viewer showing the levels of sequence conservation by sliding window analysis and the nucleotide sequence alignment with gene structures. (D) CGPLOT showing a dot-plot graph of two genomic regions with gene structures.

3 CONCLUSION

G-compass has been designed to serve as a comparative genome browser aiding general biologists in obtaining information by intuitive manipulation. It displays evolutionary conserved regions for human and another species in parallel. It also displays genomic features, such as UCEs, CNVs and conserved cis-regulatory elements. Close relationships with the annotated human gene database H-InvDB and its ortholog database Evola form the background of G-compass by adding standardized annotation resource of genes and transcript. Therefore, G-compass provides valuable information needed to investigate the patterns of vertebrate genome evolution and the function of conserved genomic regions.
  9 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  A web tool for comparative genomics: G-compass.

Authors:  Yasuyuki Fujii; Takeshi Itoh; Ryuichi Sakate; Kanako O Koyanagi; Akihiro Matsuya; Takuya Habara; Kaori Yamaguchi; Yayoi Kaneko; Takashi Gojobori; Tadashi Imanishi
Journal:  Gene       Date:  2005-10-05       Impact factor: 3.688

3.  Combo: a whole genome comparative browser.

Authors:  Reinhard Engels; Tamara Yu; Chris Burge; Jill P Mesirov; David DeCaprio; James E Galagan
Journal:  Bioinformatics       Date:  2006-05-18       Impact factor: 6.937

4.  Human-mouse alignments with BLASTZ.

Authors:  Scott Schwartz; W James Kent; Arian Smit; Zheng Zhang; Robert Baertsch; Ross C Hardison; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

5.  VISTA Enhancer Browser--a database of tissue-specific human enhancers.

Authors:  Axel Visel; Simon Minovitsky; Inna Dubchak; Len A Pennacchio
Journal:  Nucleic Acids Res       Date:  2006-11-27       Impact factor: 16.971

6.  Ensembl 2009.

Authors:  T J P Hubbard; B L Aken; S Ayling; B Ballester; K Beal; E Bragin; S Brent; Y Chen; P Clapham; L Clarke; G Coates; S Fairley; S Fitzgerald; J Fernandez-Banet; L Gordon; S Graf; S Haider; M Hammond; R Holland; K Howe; A Jenkinson; N Johnson; A Kahari; D Keefe; S Keenan; R Kinsella; F Kokocinski; E Kulesha; D Lawson; I Longden; K Megy; P Meidl; B Overduin; A Parker; B Pritchard; D Rios; M Schuster; G Slater; D Smedley; W Spooner; G Spudich; S Trevanion; A Vilella; J Vogel; S White; S Wilder; A Zadissa; E Birney; F Cunningham; V Curwen; R Durbin; X M Fernandez-Suarez; J Herrero; A Kasprzyk; G Proctor; J Smith; S Searle; P Flicek
Journal:  Nucleic Acids Res       Date:  2008-11-25       Impact factor: 16.971

7.  The UCSC Genome Browser Database: 2008 update.

Authors:  D Karolchik; R M Kuhn; R Baertsch; G P Barber; H Clawson; M Diekhans; B Giardine; R A Harte; A S Hinrichs; F Hsu; K M Kober; W Miller; J S Pedersen; A Pohl; B J Raney; B Rhead; K R Rosenbloom; K E Smith; M Stanke; A Thakkapallayil; H Trumbower; T Wang; A S Zweig; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2007-12-17       Impact factor: 16.971

8.  The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts.

Authors:  Chisato Yamasaki; Katsuhiko Murakami; Yasuyuki Fujii; Yoshiharu Sato; Erimi Harada; Jun-ichi Takeda; Takayuki Taniya; Ryuichi Sakate; Shingo Kikugawa; Makoto Shimada; Motohiko Tanino; Kanako O Koyanagi; Roberto A Barrero; Craig Gough; Hong-Woo Chun; Takuya Habara; Hideki Hanaoka; Yosuke Hayakawa; Phillip B Hilton; Yayoi Kaneko; Masako Kanno; Yoshihiro Kawahara; Toshiyuki Kawamura; Akihiro Matsuya; Naoki Nagata; Kensaku Nishikata; Akiko Ogura Noda; Shin Nurimoto; Naomi Saichi; Hiroaki Sakai; Ryoko Sanbonmatsu; Rie Shiba; Mami Suzuki; Kazuhiko Takabayashi; Aiko Takahashi; Takuro Tamura; Masayuki Tanaka; Susumu Tanaka; Fusano Todokoro; Kaori Yamaguchi; Naoyuki Yamamoto; Toshihisa Okido; Jun Mashima; Aki Hashizume; Lihua Jin; Kyung-Bum Lee; Yi-Chueh Lin; Asami Nozaki; Katsunaga Sakai; Masahito Tada; Satoru Miyazaki; Takashi Makino; Hajime Ohyanagi; Naoki Osato; Nobuhiko Tanaka; Yoshiyuki Suzuki; Kazuho Ikeo; Naruya Saitou; Hideaki Sugawara; Claire O'Donovan; Tamara Kulikova; Eleanor Whitfield; Brian Halligan; Mary Shimoyama; Simon Twigger; Kei Yura; Kouichi Kimura; Tomohiro Yasuda; Tetsuo Nishikawa; Yutaka Akiyama; Chie Motono; Yuri Mukai; Hideki Nagasaki; Makiko Suwa; Paul Horton; Reiko Kikuno; Osamu Ohara; Doron Lancet; Eric Eveno; Esther Graudens; Sandrine Imbeaud; Marie Anne Debily; Yoshihide Hayashizaki; Clara Amid; Michael Han; Andreas Osanger; Toshinori Endo; Michael A Thomas; Mika Hirakawa; Wojciech Makalowski; Mitsuteru Nakao; Nam-Soon Kim; Hyang-Sook Yoo; Sandro J De Souza; Maria de Fatima Bonaldo; Yoshihito Niimura; Vladimir Kuryshev; Ingo Schupp; Stefan Wiemann; Matthew Bellgard; Masafumi Shionyu; Libin Jia; Danielle Thierry-Mieg; Jean Thierry-Mieg; Lukas Wagner; Qinghua Zhang; Mitiko Go; Shinsei Minoshima; Masafumi Ohtsubo; Kousuke Hanada; Peter Tonellato; Takao Isogai; Ji Zhang; Boris Lenhard; Sangsoo Kim; Zhu Chen; Ursula Hinz; Anne Estreicher; Kenta Nakai; Izabela Makalowska; Winston Hide; Nicola Tiffin; Laurens Wilming; Ranajit Chakraborty; Marcelo Bento Soares; Maria Luisa Chiusano; Yutaka Suzuki; Charles Auffray; Yumi Yamaguchi-Kabata; Takeshi Itoh; Teruyoshi Hishiki; Satoshi Fukuchi; Ken Nishikawa; Sumio Sugano; Nobuo Nomura; Yoshio Tateno; Tadashi Imanishi; Takashi Gojobori
Journal:  Nucleic Acids Res       Date:  2007-12-18       Impact factor: 16.971

9.  Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees.

Authors:  Akihiro Matsuya; Ryuichi Sakate; Yoshihiro Kawahara; Kanako O Koyanagi; Yoshiharu Sato; Yasuyuki Fujii; Chisato Yamasaki; Takuya Habara; Hajime Nakaoka; Fusano Todokoro; Kaori Yamaguchi; Toshinori Endo; Satoshi Oota; Wojciech Makalowski; Kazuho Ikeo; Yoshiyuki Suzuki; Kousuke Hanada; Katsuyuki Hashimoto; Momoki Hirai; Hisakazu Iwama; Naruya Saitou; Aiko T Hiraki; Lihua Jin; Yayoi Kaneko; Masako Kanno; Katsuhiko Murakami; Akiko Ogura Noda; Naomi Saichi; Ryoko Sanbonmatsu; Mami Suzuki; Jun-ichi Takeda; Masayuki Tanaka; Takashi Gojobori; Tadashi Imanishi; Takeshi Itoh
Journal:  Nucleic Acids Res       Date:  2007-11-03       Impact factor: 16.971

  9 in total
  4 in total

1.  Abundance of ultramicro inversions within local alignments between human and chimpanzee genomes.

Authors:  Yuichiro Hara; Tadashi Imanishi
Journal:  BMC Evol Biol       Date:  2011-10-19       Impact factor: 3.260

2.  Biological database of images and genomes: tools for community annotations linking image and genomic information.

Authors:  Andrew T Oberlin; Dominika A Jurkovic; Mitchell F Balish; Iddo Friedberg
Journal:  Database (Oxford)       Date:  2013-04-02       Impact factor: 3.451

3.  Reconstructing the demographic history of the human lineage using whole-genome sequences from human and three great apes.

Authors:  Yuichiro Hara; Tadashi Imanishi; Yoko Satta
Journal:  Genome Biol Evol       Date:  2012       Impact factor: 3.416

4.  PBrowse: a web-based platform for real-time collaborative exploration of genomic data.

Authors:  Peter S Szot; Andrian Yang; Xin Wang; Chirag Parsania; Uwe Röhm; Koon Ho Wong; Joshua W K Ho
Journal:  Nucleic Acids Res       Date:  2017-05-19       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.