
DBTGR: a database of tunicate promoters and their regulatory elements.

Nicolas Sierro, Takehiro Kusakabe, Keun-Joon Park, Riu Yamashita, Kengo Kinoshita, Kenta Nakai.

Abstract

The high similarity of tunicates and vertebrates during their development, coupled with the transparency of tunicate larvae, their well-studied cell lineages and the availability of simple and efficient transgenesis methods, makes this subphylum an ideal system for the investigation of vertebrate physiological and developmental processes. Recently, the sequencing of two different Ciona genomes has led to the identification of numerous genes. In order to better understand the regulation of these genes, a database was created containing information on the regulation of tunicate genes collected from the literature. It includes, for instance, information regarding the minimal promoter length, the transcription factors involved and their binding sites, as well as the localization of the gene expression. Additionally, binding sites for characterized transcription factors were predicted based on published in vitro recognition sites. Comparison of the promoters of homologous genes in different species is also provided to allow identification of conserved cis elements. At the time of writing, information about 184 promoters, containing 73 identified binding sites and >2000 newly predicted binding sites, is available. This database is accessible at http://dbtgr.hgc.jp.

Year:  2006        PMID: 16381930      PMCID: PMC1347427          DOI: 10.1093/nar/gkj064

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Tunicates, which include larvaceans, thaliaceans and sedentary ascidians, or sea squirts, such as Ciona intestinalis and Ciona savignyi, are lower chordates and share basic gene repertoires and many characteristics, both developmental and physiological, with vertebrates (1–3). Although the adult ascidian shows no resemblance to vertebrate animals, its fertilized egg develops within a day into a tadpole-like larva consisting of ∼2600 cells which shares a basic body-plan with vertebrates (2). The well-established cell lineage (4,5) and the transparency of the larva allow the visualization of the spatial and temporal gene expression pattern in detail during development, making tunicates an efficient tool to elucidate the genetic regulatory systems underlying the developmental and physiological processes of vertebrates. Furthermore, simple electroporation methods permit the simultaneous transformation of several hundred synchronously developing embryos (6) and transient transgenesis has been applied successfully for efficient expression of exogenous genes (6–8), thus providing researchers with simple and reliable tools to carry out large-scale investigations of the different tunicate gene expression networks. Although the regulation of specific genes has been investigated for several years (9), the recent availability of the draft genomes of C.intestinalis and C.savignyi, coupled with the results of systematic in situ hybridization experiments, offers new possibilities with regard to the identification of cis-elements involved in both the spatial and temporal gene regulation (10). DBTGR, the DataBase of Tunicate Gene Regulation, was constructed in order to provide comprehensive access to published information regarding regulation of gene expression in tunicates. It further offers information on putative binding sites for the identified transcription factors. Alignments of orthologous promoter sequences are also provided for comparative analysis of regulatory elements. Finally, user-defined motif searches can be carried out on the complete set of available promoter sequences.
The web pages provided by DBTGR are generated by PHP scripts from the information stored in a MySQL database. The database itself consists of several cross-linked tables containing data about the promoters, the transcription factors and the binding sites, as well as the relations between them. The consensus and weight matrix searches are performed by external C programs and their results are cached to provide fast access to frequent queries.
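
As a rough illustration of the cross-linked table layout described above, the following sketch uses Python's built-in sqlite3 module in place of MySQL. All table and column names are hypothetical and are not taken from the actual DBTGR schema, and the inserted rows are placeholders rather than real database entries.

```python
import sqlite3


def build_schema(conn):
    """Create three linked tables: promoters, factors and binding sites.

    The names used here are illustrative assumptions only.
    """
    conn.executescript("""
        CREATE TABLE promoter (
            id INTEGER PRIMARY KEY,
            gene_name TEXT NOT NULL,
            species TEXT NOT NULL
        );
        CREATE TABLE transcription_factor (
            id INTEGER PRIMARY KEY,
            name TEXT NOT NULL
        );
        -- Each binding site links one promoter to one transcription factor.
        CREATE TABLE binding_site (
            id INTEGER PRIMARY KEY,
            promoter_id INTEGER NOT NULL REFERENCES promoter(id),
            factor_id INTEGER NOT NULL REFERENCES transcription_factor(id),
            position INTEGER NOT NULL,
            strand TEXT NOT NULL CHECK (strand IN ('+', '-')),
            predicted INTEGER NOT NULL  -- 0 = experimental, 1 = predicted
        );
    """)


def sites_for_promoter(conn, promoter_id):
    """Return (factor name, position, strand, predicted) rows for a promoter."""
    return conn.execute(
        """SELECT tf.name, bs.position, bs.strand, bs.predicted
           FROM binding_site AS bs
           JOIN transcription_factor AS tf ON tf.id = bs.factor_id
           WHERE bs.promoter_id = ?
           ORDER BY bs.position""",
        (promoter_id,),
    ).fetchall()


# Placeholder data, used only to show how the tables relate to each other.
conn = sqlite3.connect(":memory:")
build_schema(conn)
conn.execute("INSERT INTO promoter VALUES (1, 'example-gene', 'Ciona intestinalis')")
conn.execute("INSERT INTO transcription_factor VALUES (1, 'example-factor')")
conn.execute("INSERT INTO binding_site VALUES (1, 1, 1, -120, '+', 0)")
conn.execute("INSERT INTO binding_site VALUES (2, 1, 1, -450, '-', 1)")
```

Keeping the binding sites in their own table, with foreign keys to both promoters and transcription factors, is one simple way to answer both "which sites lie in this promoter" and "which promoters does this factor bind" with a single join.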

OVERVIEW

DBTGR mainly consists of a collection of experimentally characterized promoters for which the location of gene expression is given along with the transcription factor driving it. In addition, when available, the positions of known transcription factor binding sites and of other regulatory elements are given for each promoter. Information about the gene product and the localization of its expression during the development of the organism is given both as a graphical overview and as detailed text (Figure 1). The graphical overview represents a larva consisting of six domains: the epidermis, the nervous system, the endoderm, the brain, the notochord and the muscles. Each domain in which expression is found is filled with a different color, thus providing an easy way to identify promoters targeting expression to specific areas. The text entry gives detailed information about the localization of the gene expression by enumerating the different tissues and cells where it was experimentally observed.
Figure 1

Graphical representation of gene expression. The schematic larva is divided into six domains, which are highlighted with different colors depending on the localization of gene expression.

Promoter regions critical for gene expression, as well as the size of the minimal functional promoter, are given provided they were determined during the promoter characterization. Similarly, information regarding the recognition sequences of the involved transcription factors is given when available. A list of the identified binding sites, separated into two categories to distinguish between binding sites for which experiments have shown involvement in gene expression and predicted binding sites, is then presented, indicating among other details the transcription factor involved, the position of the binding site and the strand on which it was identified. Selected binding sites can easily be added to or removed from the sequence displayed at the bottom of the page by ticking the corresponding checkbox and reloading the page via the provided button. Cross-references to the corresponding gene location in both version 1.00 [available from the US Department of Energy Joint Genome Institute (JGI)] and version 1.95 [available from Ensembl (11)] of the draft genome of C.intestinalis are also given, and links are provided to the respective genome browsers and gene entry pages. Furthermore, when predicted promoters for homologous genes could be obtained from the C.savignyi genome, a link to the alignment of both promoter sequences, in which conserved regions are highlighted, is provided. The promoter sequence presented at the bottom of the page consists of at most the 3000 bases upstream of the protein-coding region and was originally obtained from the JGI version 1.00 genome draft. When possible, the corresponding sequence in the JGI version 2.00 genome draft was extracted by BLAST search (12). Although only the latest available promoter sequence is given as an online image, both sequence versions, when present, are available as FASTA files for download.
In the online image, binding sites are shown as arrows located above or under the nucleotide sequence depending on the strand on which they are located. Their color corresponds to that of the previously listed binding site description, and a question mark is added next to predicted binding sites. The information available in the binding sites list can also be obtained by moving the mouse over the arrow representing it. Both the listed and the image binding sites are linked to a motif-specific page providing a list of all the genes where that motif was found, the sequence of the corresponding binding sites with the bases matching the consensus sequence highlighted, and both a position-specific weight matrix and a consensus sequence for that motif, computed from the binding sites for which experimental evidence of involvement in regulation exists. Similarly, the transcription factors listed are linked to a page providing information on the genes they regulate and the binding sites they recognize.
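
The derivation of a weight matrix and a consensus sequence from a set of experimentally supported binding sites can be sketched as follows. This is a minimal illustration, not the procedure used by DBTGR: the pseudocount, the majority rule for the consensus and the use of 'N' for tied columns are assumptions, and the example sites are hypothetical.

```python
from collections import Counter

BASES = "ACGT"


def count_matrix(sites):
    """Return one {base: count} dict per column of equally long sites."""
    if not sites:
        raise ValueError("at least one site is required")
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for column in zip(*(site.upper() for site in sites)):
        counts = Counter(column)
        # Characters other than A, C, G and T are ignored in the counts.
        matrix.append({base: counts.get(base, 0) for base in BASES})
    return matrix


def frequency_matrix(sites, pseudocount=1.0):
    """Per-column base frequencies, smoothed with a pseudocount (assumed)."""
    total = len(sites) + pseudocount * len(BASES)
    return [
        {base: (column[base] + pseudocount) / total for base in BASES}
        for column in count_matrix(sites)
    ]


def majority_consensus(sites):
    """Most frequent base per column; 'N' when the top count is tied."""
    consensus = []
    for column in count_matrix(sites):
        ranked = sorted(column.items(), key=lambda item: item[1], reverse=True)
        best, runner_up = ranked[0], ranked[1]
        consensus.append(best[0] if best[1] > runner_up[1] else "N")
    return "".join(consensus)


# Hypothetical binding sites, used only to exercise the functions.
example_sites = ["TCACAC", "TCGCAC", "TCACAT"]
example_consensus = majority_consensus(example_sites)
example_frequencies = frequency_matrix(example_sites)
```

With only a handful of characterized sites per factor, a pseudocount keeps unseen bases from receiving a frequency of exactly zero, which matters if the frequencies are later converted to log-odds scores.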

FEATURES

In order to provide the user with information about promoter regions conserved between several species, predicted promoter sequences for C.savignyi genes were extracted from the C.savignyi genome draft based on the whole genome alignment made with the JGI version 1.00 C.intestinalis genome draft available from VISTA (13). As for C.intestinalis promoter sequences, the C.savignyi sequences are available as FASTA files for download, and as online images displaying predicted binding sites. Furthermore, the latest versions of the C.intestinalis promoter sequences were aligned using ClustalW (14) with the predicted C.savignyi promoter sequences. Links to these alignments are available from the detailed promoter pages. Each alignment is shown as a dynamically generated online image with the regions conserved in both promoter sequences highlighted (Figure 2). The extent of the highlighted conserved regions can be modified by adjusting two parameters: the minimum size of a conserved block and the maximum length of the non-conserved region between two such blocks. In addition to the highlighted conserved regions, the binding sites identified in either sequence are also shown, so that a correlation between the predicted or proven binding sites and sequence conservation can easily be visualized.
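
One way to read the two highlighting parameters is sketched below for a pair of already aligned sequences. The function name, the default values and the order of operations (filter runs by minimum size first, then merge neighbouring blocks) are assumptions made for illustration; the text does not specify how DBTGR combines the two parameters.

```python
def conserved_regions(aligned_a, aligned_b, min_block=6, max_gap=2):
    """Return conserved regions as (start, end) pairs, end exclusive.

    aligned_a and aligned_b are two rows of a pairwise alignment, with '-'
    marking gaps. A column is conserved when both rows carry the same base.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Step 1: collect maximal runs of identical, non-gap columns.
    runs = []
    start = None
    for i, (a, b) in enumerate(zip(aligned_a.upper(), aligned_b.upper())):
        match = a == b and a != "-"
        if match and start is None:
            start = i
        elif not match and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(aligned_a)))

    # Step 2: keep only runs that reach the minimum block size.
    blocks = [(s, e) for s, e in runs if e - s >= min_block]

    # Step 3: merge blocks separated by at most max_gap columns.
    merged = []
    for s, e in blocks:
        if merged and s - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], e)
        else:
            merged.append((s, e))
    return merged


# Hypothetical alignment rows: two identical 8-base blocks around 2 mismatches.
row_a = "ACGTACGT" + "AA" + "ACGTACGT"
row_b = "ACGTACGT" + "CC" + "ACGTACGT"
```

For these rows, `conserved_regions(row_a, row_b, min_block=6, max_gap=2)` merges the two blocks into one region, whereas lowering `max_gap` to 1 keeps them separate, which mirrors the effect of widening or narrowing the highlighted regions.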
Figure 2

Alignment of the C.intestinalis and C.savignyi snail promoter sequences. The two promoter sequences were aligned with ClustalW. Arrows above the alignment represent binding sites identified in the upper sequence, while arrows under the alignment refer to binding sites in the lower sequence. The direction of the arrows indicates on which strand the specific binding site was found. Binding sites with the same color are bound by the same transcription factor. The two overlapping red arrows indicate that this specific site was both reported in publications and predicted by consensus searches, while overlapping arrows with different colors identify a binding site recognized by two or more transcription factors. Clicking on an arrow leads to a motif-specific page listing all occurrences of that motif, the concerned promoters being linked back to their detailed page. There, all identified binding sites are given, with the possibility to toggle their display on the single promoter sequence, or to obtain the pre-aligned sequences again.

Searches for particular motifs can be performed in the complete dataset by using either a consensus sequence or a position-specific weight matrix. The extent of the returned results can be modified by adjusting the number of mismatches allowed in the case of a consensus search, and the cutoff threshold in the case of a weight matrix search. As an alternative to user-defined matrices, the provided JASPAR weight matrices (15) can also be used. The C.intestinalis promoter entries are linked to the JGI and Ensembl genome browsers and gene information pages, as well as to the corresponding ANISEED and Ghost entries.
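
The two search modes described above can be sketched in a few lines. This is an illustrative Python version, not the external C programs used by DBTGR: support for IUPAC ambiguity codes, the uniform background of 0.25, the log-odds scoring and the restriction of the matrix scan to the forward strand are all assumptions, as are the function names.

```python
import math

# IUPAC nucleotide codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def consensus_hits(sequence, consensus, max_mismatches=0):
    """Return (position, strand, mismatches) for both strands.

    Positions are 0-based offsets of the leftmost base on the forward strand.
    """
    sequence = sequence.upper()
    consensus = consensus.upper()
    width = len(consensus)
    hits = []
    for strand, target in (("+", sequence), ("-", reverse_complement(sequence))):
        for i in range(len(target) - width + 1):
            window = target[i:i + width]
            mismatches = sum(
                base not in IUPAC[symbol] for base, symbol in zip(window, consensus)
            )
            if mismatches <= max_mismatches:
                position = i if strand == "+" else len(sequence) - i - width
                hits.append((position, strand, mismatches))
    return hits


def log_odds_matrix(frequencies, background=0.25):
    """Convert per-column base frequencies (all > 0) to log2-odds scores."""
    return [
        {base: math.log2(column[base] / background) for base in "ACGT"}
        for column in frequencies
    ]


def matrix_hits(sequence, log_odds, cutoff):
    """Return (position, score) for forward-strand windows scoring >= cutoff."""
    sequence = sequence.upper()
    width = len(log_odds)
    hits = []
    for i in range(len(sequence) - width + 1):
        window = sequence[i:i + width]
        if any(base not in "ACGT" for base in window):
            continue  # skip windows containing ambiguous bases
        score = sum(column[base] for column, base in zip(log_odds, window))
        if score >= cutoff:
            hits.append((i, round(score, 3)))
    return hits


# Hypothetical two-column matrix favouring the dinucleotide "AC".
example_matrix = log_odds_matrix([
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
])
```

Raising `max_mismatches` or lowering `cutoff` widens the result set in the same way as the two adjustable search parameters mentioned in the text; because such scans are deterministic for a given query, their results are natural candidates for caching.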

FUTURE PROSPECTS

The available promoter sequences will be updated when new genome versions are released, using a method similar to that described here for the extraction of the JGI version 2.0 sequences. Further improvements such as the inclusion of information on the regulation of Halocynthia roretzi, C.savignyi and Oikopleura dioica genes extracted from the literature are planned. In addition, contributions from researchers are welcomed, either in the form of links to recently published work containing relevant data, or by direct submission of newly characterized promoter sequences and their features. Furthermore, recent discussions showed interest in centralized contact information regarding the various promoter constructs available from the different groups of the tunicate community. This information could similarly be added to DBTGR by extraction from the literature or direct submission from the community. Close cooperation with the recently formed ‘Model Organism Database Working Group’ of the tunicate community will ensure that DBTGR grows in accordance with and in response to the community's needs, and that it is integrated with the several other complementary databases provided by the community.

Review 1.  The ascidian as a model organism in developmental and evolutionary biology.

Authors:  J C Corbo; A Di Gregorio; M Levine
Journal:  Cell       Date:  2001-09-07       Impact factor: 41.582

2.  The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins.

Authors:  Paramvir Dehal; Yutaka Satou; Robert K Campbell; Jarrod Chapman; Bernard Degnan; Anthony De Tomaso; Brad Davidson; Anna Di Gregorio; Maarten Gelpke; David M Goodstein; Naoe Harafuji; Kenneth E M Hastings; Isaac Ho; Kohji Hotta; Wayne Huang; Takeshi Kawashima; Patrick Lemaire; Diego Martinez; Ian A Meinertzhagen; Simona Necula; Masaru Nonaka; Nik Putnam; Sam Rash; Hidetoshi Saiga; Masanobu Satake; Astrid Terry; Lixy Yamada; Hong-Gang Wang; Satoko Awazu; Kaoru Azumi; Jeffrey Boore; Margherita Branno; Stephen Chin-Bow; Rosaria DeSantis; Sharon Doyle; Pilar Francino; David N Keys; Shinobu Haga; Hiroko Hayashi; Kyosuke Hino; Kaoru S Imai; Kazuo Inaba; Shungo Kano; Kenji Kobayashi; Mari Kobayashi; Byung-In Lee; Kazuhiro W Makabe; Chitra Manohar; Giorgio Matassi; Monica Medina; Yasuaki Mochizuki; Steve Mount; Tomomi Morishita; Sachiko Miura; Akie Nakayama; Satoko Nishizaka; Hisayo Nomoto; Fumiko Ohta; Kazuko Oishi; Isidore Rigoutsos; Masako Sano; Akane Sasaki; Yasunori Sasakura; Eiichi Shoguchi; Tadasu Shin-i; Antoinetta Spagnuolo; Didier Stainier; Miho M Suzuki; Olivier Tassy; Naohito Takatori; Miki Tokuoka; Kasumi Yagi; Fumiko Yoshizaki; Shuichi Wada; Cindy Zhang; P Douglas Hyatt; Frank Larimer; Chris Detter; Norman Doggett; Tijana Glavina; Trevor Hawkins; Paul Richardson; Susan Lucas; Yuji Kohara; Michael Levine; Nori Satoh; Daniel S Rokhsar
Journal:  Science       Date:  2002-12-13       Impact factor: 47.728

3.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Authors:  Albin Sandelin; Wynand Alkema; Pär Engström; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Hox cluster disintegration with persistent anteroposterior order of expression in Oikopleura dioica.

Authors:  Hee-Chan Seo; Rolf Brudvik Edvardsen; Anne Dorthea Maeland; Marianne Bjordal; Marit Flo Jensen; Anette Hansen; Mette Flaat; Jean Weissenbach; Hans Lehrach; Patrick Wincker; Richard Reinhardt; Daniel Chourrout
Journal:  Nature       Date:  2004-09-02       Impact factor: 49.962

Review 5.  Generation and use of transgenic ascidian embryos.

Authors:  Robert W Zeller
Journal:  Methods Cell Biol       Date:  2004       Impact factor: 1.441

Review 6.  Decoding cis-regulatory systems in ascidians.

Authors:  Takehiro Kusakabe
Journal:  Zoolog Sci       Date:  2005-02       Impact factor: 0.931

Review 7.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

8.  Characterization of a notochord-specific enhancer from the Brachyury promoter region of the ascidian, Ciona intestinalis.

Authors:  J C Corbo; M Levine; R W Zeller
Journal:  Development       Date:  1997-02       Impact factor: 6.868

9.  VISTA: computational tools for comparative genomics.

Authors:  Kelly A Frazer; Lior Pachter; Alexander Poliakov; Edward M Rubin; Inna Dubchak
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

10.  Ensembl 2005.

Authors:  T Hubbard; D Andrews; M Caccamo; G Cameron; Y Chen; M Clamp; L Clarke; G Coates; T Cox; F Cunningham; V Curwen; T Cutts; T Down; R Durbin; X M Fernandez-Suarez; J Gilbert; M Hammond; J Herrero; H Hotz; K Howe; V Iyer; K Jekosch; A Kahari; A Kasprzyk; D Keefe; S Keenan; F Kokocinsci; D London; I Longden; G McVicker; C Melsopp; P Meidl; S Potter; G Proctor; M Rae; D Rios; M Schuster; S Searle; J Severin; G Slater; D Smedley; J Smith; W Spooner; A Stabenau; J Stalker; R Storey; S Trevanion; A Ureta-Vidal; J Vogel; S White; C Woodwark; E Birney
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

