Literature DB >> 29036684

YEASTRACT: an upgraded database for the analysis of transcription regulatory networks in Saccharomyces cerevisiae.

Miguel C Teixeira1,2, Pedro T Monteiro3,4, Margarida Palma1,2, Catarina Costa1,2, Cláudia P Godinho1,2, Pedro Pais1,2, Mafalda Cavalheiro1,2, Miguel Antunes1,2, Alexandre Lemos3,4, Tiago Pedreira5, Isabel Sá-Correia1,2.   

Abstract

The YEAst Search for Transcriptional Regulators And Consensus Tracking (YEASTRACT-www.yeastract.com) information system has been, for 11 years, a key tool for the analysis and prediction of transcription regulatory associations at the gene and genomic levels in Saccharomyces cerevisiae. Since its last update in June 2017, YEASTRACT includes approximately 163000 regulatory associations between transcription factors (TF) and target genes in S. cerevisiae, based on more than 1600 bibliographic references; it also includes 247 specific DNA binding consensus recognized by 113 TFs. This release of the YEASTRACT database provides new visualization tools to visualize each regulatory network in an interactive fashion, enabling the user to select and observe subsets of the network such as: (i) considering only DNA binding evidence or both DNA binding and expression evidence; (ii) considering only either positive or negative regulatory associations; or (iii) considering only one set of related environmental conditions. A further tool to observe TF regulons is also offered, enabling a clear-cut understanding of the exact meaning of the available data. We believe that with this new version, YEASTRACT will improve its role as an open web resource instrumental for Yeast Biologists and Systems Biology researchers.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29036684      PMCID: PMC5753369          DOI: 10.1093/nar/gkx842

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The YEASTRACT (YEAst Search for Transcriptional Regulators And Consensus Tracking; www.yeastract.com) database was released in 2006 (1) to provide free access to all published information on transcriptional regulation in the model eukaryote Saccharomyces cerevisiae, curated by experts in the field. Since then, the database has been maintained up-to-date, and up-graded in terms of the data enclosed in it and of the tools that have been developed to increase the ability to fully exploit the increasingly available data (2–4). Databases such as MYBS (5), TRANSFAC (6), RSAT (7), YPA (8) and YeTFaSCo (9) focus most of their analysis and predictive power on the understanding of promoter regions, considering information that includes the occurrence of TF binding sites, but also the accessibility of those putative binding sites, determined among other aspects by nucleosome occupancy data. More recently, SGD (10) provided some information on transcription factor – target gene associations based on published information. YEASTRACT, besides providing tools for promoter analysis in yeast, is, to our knowledge, the single information system that offers a complete integration of all the experimentally curated transcriptional regulatory data ever published for S. cerevisiae. Furthermore, YEASTRACT provides a number of queries that enable the user to scrutinize the available data, to compare it with his own data and to run predictive analysis on the regulators of a gene or genome-wide response. In this new release, YEASTRACT was empowered with new tools, especially focusing on improved and interactive image analysis, as described below. The information in YEASTRACT was further updated and revised, leading to the integration of more reliable data. It currently includes nearly 10-fold more regulatory associations than in its first release, based on the scientific community published work in the field, especially supported by transcriptomics or ChIP-chip or ChIP-seq data, and includes more information associated to each transcriptional association, including the environmental condition and the experimental setup in which the associations were registered, and the direction of the interaction. All the information deposited in the database can now be used to filter query results interactively through a strong visual support, contributing to obtain predictions with an expected higher biological relevance.

Data Upgrade

In YEASTRACT’s first release, the database gathered ∼12,000 regulatory associations between transcription factors (TF) and target genes and 257 transcription factor binding sites (TFBS) (1). Since then, a stepwise increase in the number of regulatory associations was registered, following the rate of published data in the field, mostly based on transcriptomic analyses (2–4). With this new release, another 20,000 regulatory associations were included in YEASTRACT, reaching a total of ∼163,000 unique regulatory associations, obtained from >1,600 bibliographic references. With this release 35 new transcription factor binding sites, demonstrated in the literature, were also added to the database, that includes today 268 TFBS. All the data gathered in YEASTRACT since its first release has been manually curated by experts in the field, to assure its users high reliability. In this release, the data deposited in YEASTRACT was reviewed and a few proteins previously considered TFs, together with its associated regulatory associations, were withdrawn from the database based on the fact that, given a more accurate description, they are not really TFs but rather co-factors or chromatin remodeling proteins. Furthermore, emphasis was given to associate to each regulatory association all available information regarding: (i) the experimental evidences used to define this association; (ii) the environmental conditions in which this association takes place (iii) the directionality of the interaction, e.g. is the transcription factor acting as an activator or as a repressor of the target gene, and (iv) to begin to unveil what could be strain specific differences in terms of transcriptional regulation. Additionally, data on the exact S. cerevisiae strain in which the regulatory association was observed was gathered for the first time. This is so far only possible to a limited extent, given that this information was not yet gathered for all the regulatory associations obtained in previous releases and because it was found that specific strain information is surprisingly missing in some of the published datasets. The information gathered for each regulatory association has been coupled to existing tools, enabling the user to filter its queries to find regulatory associations that are exclusively based on DNA-binding evidence (including EMSA, DNA footprinting, ChIP, ChIP-chip, ChIP-seq or more recently ChIP-exo data), or exclusively based on expression evidence (including, mostly, that obtained by comparing gene expression in wild-type strains with that observed in TF mutant strains, registered through northern blotting, RT-PCR, DNA microarrays and RNA-seq experiments), or eventually exclusively based on both DNA binding and expression evidence, which increases the degree of confidence of the query predictive analysis. Since TF activation as well as transcriptional activation or repression is highly dependent on environmental cues, the knowledge of the environmental conditions in which every regulatory association was observed enables the possibility to search for the transcription factors that regulate the user's gene or gene-list of interest, filtering for only regulatory associations that are known to occur in specific environmental conditions. The up-to-date high-quality data deposited in YEASTRACT, as done regularly for the past 11 years, is expected to continue to benefit the community working on the Biology of S. cerevisiae, a key organism both as cell factory and eukaryotic model, and on the development of Systems Biology, a growing field of research which relies on the existence of massive amounts of reliable data.

Interactive visualization tools

A key feature of this new release of the YEASTRACT database is the significant expansion of the visualization tools offered therein, which we believe follows in line with the requests of many of our users. Visualization tools are indeed crucial for a more intuitive interpretation of results, as exemplified in the following examples, that illustrate how to use these new tools. The query ‘Rank by TF’ is particularly useful for the analysis of genome-wide expression data, particularly those coming from transcriptomic approaches, offers now three options for the visualization of the results. The new visualization tool can be selected by pressing the ‘Interactive image: force-directed layout’ option in the results page. As an example, Figure 1A depicts the outcome of using the query ‘Rank by TF’ to search for the regulatory network controlling the genes whose expression is up-regulated in response to acetic acid stress in Saccharomyces cerevisiae and obtained through a transcriptomic analysis (11). Only enriched TFs, targeting more than 25% of the dataset were considered, an option made to contemplate only the TFs which apparently play the most important role. The visual depiction of the network, which uses Data Driven Documents (D3.js: https://d3js.org/) to distribute the elements of the network, highlights the enormous complexity of this predicted network, including 110 target genes and 50 TFs. As it is, the network is very complex, which on one hand makes it more difficult to extract meaningful knowledge, and on the other may be misleading as we cannot be sure that all these TFs and corresponding regulatory associations are active under acetic acid stress. To deal with this issue, the user can filter the query to focus only on regulatory associations taking place under a more restricted set of environmental conditions. Selecting the ‘Environmental Condition Group’ ‘Stress’ a much more simplified and accurate prediction of the network underlying this transcriptional response is observed (Figure 1B). This simplified network predicts that 42 TFs work together to provide the observed transcriptional response, setting aside eight TFs whose activity has not been registered under stress. Within the 42 TFs remaining, some may be active under stress, but not specifically under acetic acid stress. Thus, an even more specific network can be obtained, by narrowing further the search, using as filter the ‘Environmental Condition Sub-Group’ ‘Weak Acid Stress’. In this case, immediately in the center the TFs which are responsible for the highest number of regulated genes appears Haa1, working as the hub of this system (Figure 1C). This is in agreement with the notion that Haa1 controls the expression of around 70% of the acetic acid response in yeast (11). Interestingly, even in this highly filtered network, it is still possible to detect the contribution of eight other TFs, including the general stress response regulators Msn2 and Msn4, and the major regulators of multidrug resistance Pdr1, Pdr3. Interestingly, both Msn2 and Msn4 have been reported to be Haa1 targets and participate in the control of the yeast response to weak acids, both in the presence of short chain (12) and long chain weak acids (13,14). For Pdr1 and Pdr3, on the other hand, there is only evidence to support its involvement in the control of the highly lipophilic weak acid herbicide 2–4-dichlorophenoxyacetic acid (2,4-D) (15) and the antimalarial drug artesunate (16). This case-study exemplifies the potential of using the new interactive visualization tools offered in YEASTRACT as an approach to conduct higher-confidence predictions of what are the regulators of a given gene or a given transcriptome-wide response.
Figure 1.

Regulatory network controlling, directly or indirectly, the Haa1 target genes identified in a single transcriptomic experiment (11). In this analysis only top enriched TFs, targeting more than 25% of the gene list were included. The whole TF network predicted to control the expression of these genes is depicted in (A), while the subnetwork of stress activated regulatory associations within the network is shown in (B) and the sub-subnetwork of weak acid stress activated regulatory associations within the network is shown in (C). The experimental evidences underlying each regulatory association (in full lines, in the case of DNA-binding evidence, or in dashed lines in the case of expression evidence), as well as the sign of the interaction—positive (green), negative (red), positive and negative (brown) or undefined (black) are highlighted.

Regulatory network controlling, directly or indirectly, the Haa1 target genes identified in a single transcriptomic experiment (11). In this analysis only top enriched TFs, targeting more than 25% of the gene list were included. The whole TF network predicted to control the expression of these genes is depicted in (A), while the subnetwork of stress activated regulatory associations within the network is shown in (B) and the sub-subnetwork of weak acid stress activated regulatory associations within the network is shown in (C). The experimental evidences underlying each regulatory association (in full lines, in the case of DNA-binding evidence, or in dashed lines in the case of expression evidence), as well as the sign of the interaction—positive (green), negative (red), positive and negative (brown) or undefined (black) are highlighted. In the specific case of analyzing a TF regulon (e.g. the complete set of targets of a transcription factor, resulting from all available published data on this TF), two new interactive visualization tools are further offered in this new release of the YEASTRACT database. First, the user may select the ‘Search for genes’ query to find the full network of genes controlled by a single transcription factor, and visualize it in the ‘Interactive image: force-directed layout’ option. This option is now available in every query offered in YEASTRACT, by selecting the ‘MENU’ icon that appears in the top of each generated table of results. As an example, Figure 2A displays the image obtained for the (directly or indirectly) Haa1-regulated genes, showing the TF in the centre of those 650 target genes, together with many other TFs that are regulated by Haa1, and that also regulate its targets. Since the Haa1 transcription factor is activated under acetic acid, but this is not the case, as far as current knowledge goes, for many of its targeted TF encoding genes, it is possible to get a more accurate depiction of the Haa1 regulon by filtering by ‘Environmental Condition Sub-Group’ ‘Weak Acid Stress’. The result shows a smaller network of Haa1 targets, including 76 genes which are regulated by Haa1 (Figure 2B), 14 of which are indirectly regulated via Msn4, and are more likely to represent the core of the Haa1 regulon relevant for the weak acid stress response.
Figure 2.

The Haa1 regulon. The whole network of Haa1 target genes is depicted in (A), while the subnetwork of Haa1-dependent regulatory associations known to take place under weak acid stress is shown in (B). The experimental evidences underlying each regulatory association (in full lines, in the case of DNA-binding evidence, or in dashed lines in the case of expression evidence), as well as the sign of the interaction—positive (green), negative (red), positive and negative (brown) or undefined (black) are highlighted.

The Haa1 regulon. The whole network of Haa1 target genes is depicted in (A), while the subnetwork of Haa1-dependent regulatory associations known to take place under weak acid stress is shown in (B). The experimental evidences underlying each regulatory association (in full lines, in the case of DNA-binding evidence, or in dashed lines in the case of expression evidence), as well as the sign of the interaction—positive (green), negative (red), positive and negative (brown) or undefined (black) are highlighted. To further provide a clear idea on what are the most reliable targets of a given transcription factor, based on the existing information, a second format of regulon depiction is also offered. This can be accessed by searching (through the ‘Quick Search’ option) for a specific gene and selecting the corresponding ‘Protein info’ tab. Then selecting the ‘See regulon’ option, a Venn diagram of the TF regulon is obtained. As a case-study, the Yrr1 regulon is shown in Figure 3. According to the information deposited in YEASTRACT, there are a total of 1443 documented targets of Yrr1, an important regulator of multidrug resistance in yeast (17). If the user considers the potential targets of Yrr1, that are those genes in whose promoter region it is possible to find the Yrr1 recognition sequence WCCGYKKWW (17), the number increases a lot. Among these, there is a surprising lack of overlap between the genes whose promoters are bound by Yrr1, the promoters which have a perfect Yrr1 binding site and the genes whose expression is indeed controlled by Yrr1. Only 79 genes meet both criteria. Surprisingly, this lack of overlap is observed for many of the TF regulons deposited in YEASTRACT. As such, this visualization tool enables a more intuitive analysis of the TF regulons, highlighting the regulatory subnetwork for which there is the highest degree of confidence. It is not surprising to see that within the 46 most verified targets of Yrr1 are a number of genes encoding multidrug transcription factors (e.g. PDR3 and YRR1), drug:H+ antiporters (FLR1, AZR1 and TPO1) and ABC drug efflux pumps (PDR10, PDR15, SNQ2 and YOR1). A few questions arise from the analysis of this figure as well: are the genes whose promoter is bound by Yrr1, but whose expression was not found to be controlled by Yrr1, controlled by it in yet unknown environmental conditions? And are the genes whose expression is affected by Yrr1, but for which there is no evidence of direct promoter targeting by the TF, indirect targets of Yrr1? Is the search for TF consensus sequences in a promoter region a reliable way of finding a putative target gene?
Figure 3.

The Yrr1 regulon. The distribution of Yrr1 target genes in the dependency of the underlying experimental evidence or computational prediction is highlighted as a Venn diagram.

The Yrr1 regulon. The distribution of Yrr1 target genes in the dependency of the underlying experimental evidence or computational prediction is highlighted as a Venn diagram.

Further available resources

In this release of the YEASTRACT database, a new data submission tool is offered. This tool comes as the answer to many users′ requests, turning it possible for individual researchers to upload their recently published data in the database. To do that, the user has to select the ‘Data Submission Form’, where an excel document can be downloaded, with filling instructions. Afterwards, the document can be easily uploaded in our portal, together with a few details on author information. For each submitted regulatory association, additional information is requested, so that the data can fit the standards of the YEASTRACT curation process, including the strain and the environmental conditions in which the association was found to take place, as well as the nature of the experimental setup used to uncover it. Once the users′ data is submitted, it will be made available in a short term, after verification by our curation staff. As always, only published data will be considered. This new tool is expected to increase the interaction between the large community generating regulatory data in S. cerevisiae and the YEASTRACT curation team, which will inevitably increase the rate of database updating and the quality of the collected information.

FUTURE DIRECTIONS

The YEASTRACT team is committed to continue to offer updated, reliable and complete information on the field of transcriptional regulation in yeast to the international community of yeast and systems biologists. Furthermore, continuous improvements of the provided tools will be made available, in response to the requests and needs of its users. Focus will be given in the future to the extension of the database to the S. cerevisiae pan-genome and to other yeasts of biotechnological interest, together with a close interconnection with the PathoYeastract database (18), in a comparative genomics approach.
  18 in total

1.  TRANSFAC: an integrated system for gene expression regulation.

Authors:  E Wingender; X Chen; R Hehl; H Karas; I Liebich; V Matys; T Meinhardt; M Prüss; I Reuter; F Schacherer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Saccharomyces cerevisiae resistance to chlorinated phenoxyacetic acid herbicides involves Pdr1p-mediated transcriptional activation of TPO1 and PDR5 genes.

Authors:  Miguel Cacho Teixeira; Isabel Sá-Correia
Journal:  Biochem Biophys Res Commun       Date:  2002-03-29       Impact factor: 3.575

3.  Genomic expression program involving the Haa1p-regulon in Saccharomyces cerevisiae response to acetic acid.

Authors:  Nuno P Mira; Jorg D Becker; Isabel Sá-Correia
Journal:  OMICS       Date:  2010-10

4.  Adaptation of Saccharomyces cerevisiae to the herbicide 2,4-dichlorophenoxyacetic acid, mediated by Msn2p- and Msn4p-regulated genes: important role of SPI1.

Authors:  T Simões; M C Teixeira; A R Fernandes; Isabel Sá-Correia
Journal:  Appl Environ Microbiol       Date:  2003-07       Impact factor: 4.792

5.  Global phenotypic analysis and transcriptional profiling defines the weak acid stress response regulon in Saccharomyces cerevisiae.

Authors:  Christoph Schüller; Yasmine M Mamnun; Mehdi Mollapour; Gerd Krapf; Michael Schuster; Bettina E Bauer; Peter W Piper; Karl Kuchler
Journal:  Mol Biol Cell       Date:  2003-11-14       Impact factor: 4.138

6.  YPA: an integrated repository of promoter features in Saccharomyces cerevisiae.

Authors:  Darby Tien-Hao Chang; Cheng-Yi Huang; Chi-Yeh Wu; Wei-Sheng Wu
Journal:  Nucleic Acids Res       Date:  2010-11-02       Impact factor: 16.971

7.  YeTFaSCo: a database of evaluated yeast transcription factor sequence specificities.

Authors:  Carl G de Boer; Timothy R Hughes
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

8.  Saccharomyces Genome Database: the genomics resource of budding yeast.

Authors:  J Michael Cherry; Eurie L Hong; Craig Amundsen; Rama Balakrishnan; Gail Binkley; Esther T Chan; Karen R Christie; Maria C Costanzo; Selina S Dwight; Stacia R Engel; Dianna G Fisk; Jodi E Hirschman; Benjamin C Hitz; Kalpana Karra; Cynthia J Krieger; Stuart R Miyasato; Rob S Nash; Julie Park; Marek S Skrzypek; Matt Simison; Shuai Weng; Edith D Wong
Journal:  Nucleic Acids Res       Date:  2011-11-21       Impact factor: 16.971

9.  The YEASTRACT database: an upgraded information system for the analysis of gene and genomic transcription regulation in Saccharomyces cerevisiae.

Authors:  Miguel Cacho Teixeira; Pedro Tiago Monteiro; Joana Fernandes Guerreiro; Joana Pinho Gonçalves; Nuno Pereira Mira; Sandra Costa dos Santos; Tânia Rodrigues Cabrito; Margarida Palma; Catarina Costa; Alexandre Paulo Francisco; Sara Cordeiro Madeira; Arlindo Limede Oliveira; Ana Teresa Freitas; Isabel Sá-Correia
Journal:  Nucleic Acids Res       Date:  2013-10-28       Impact factor: 16.971

10.  MYBS: a comprehensive web server for mining transcription factor binding sites in yeast.

Authors:  Huai-Kuang Tsai; Meng-Yuan Chou; Ching-Hua Shih; Grace Tzu-Wei Huang; Tien-Hsien Chang; Wen-Hsiung Li
Journal:  Nucleic Acids Res       Date:  2007-05-30       Impact factor: 16.971

View more
  57 in total

1.  Zinc uptake in the Basidiomycota: Characterization of zinc transporters in Ustilago maydis.

Authors:  Adriana M Martha-Paz; David Eide; David Mendoza-Cózatl; Norma A Castro-Guerrero; Elva T Aréchiga-Carvajal
Journal:  Mol Membr Biol       Date:  2019-12       Impact factor: 2.857

2.  Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments.

Authors:  Christopher A Jackson; Dayanne M Castro; Richard Bonneau; David Gresham; Giuseppe-Antonio Saldi
Journal:  Elife       Date:  2020-01-27       Impact factor: 8.140

3.  The anti-cancer drug 5-fluorouracil affects cell cycle regulators and potential regulatory long non-coding RNAs in yeast.

Authors:  Bingning Xie; Emmanuelle Becker; Igor Stuparevic; Maxime Wery; Ugo Szachnowski; Antonin Morillon; Michael Primig
Journal:  RNA Biol       Date:  2019-03-20       Impact factor: 4.652

Review 4.  Redefining fundamental concepts of transcription initiation in bacteria.

Authors:  Citlalli Mejía-Almonte; Stephen J W Busby; Joseph T Wade; Jacques van Helden; Adam P Arkin; Gary D Stormo; Karen Eilbeck; Bernhard O Palsson; James E Galagan; Julio Collado-Vides
Journal:  Nat Rev Genet       Date:  2020-07-14       Impact factor: 53.242

5.  Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination.

Authors:  Joshua L Wetzel; Mona Singh
Journal:  Nucleic Acids Res       Date:  2020-01-24       Impact factor: 16.971

6.  Live-cell assays reveal selectivity and sensitivity of the multidrug response in budding yeast.

Authors:  Elena Vanacloig-Pedros; Carlos Lozano-Pérez; Benito Alarcón; Amparo Pascual-Ahuir; Markus Proft
Journal:  J Biol Chem       Date:  2019-07-11       Impact factor: 5.157

7.  Absence of AfuXpot, the yeast Los1 homologue, limits Aspergillus fumigatus growth under amino acid deprived condition.

Authors:  Alireza Azizi; Atefeh SharifiRad; Somayeh Enayati; Mohammad Azizi; Mansour Bayat; Vahid Khalaj
Journal:  World J Microbiol Biotechnol       Date:  2020-01-30       Impact factor: 3.312

8.  Spt6 Is Required for the Fidelity of Promoter Selection.

Authors:  Stephen M Doris; James Chuang; Olga Viktorovskaya; Magdalena Murawska; Dan Spatt; L Stirling Churchman; Fred Winston
Journal:  Mol Cell       Date:  2018-10-11       Impact factor: 17.970

9.  Mip6 binds directly to the Mex67 UBA domain to maintain low levels of Msn2/4 stress-dependent mRNAs.

Authors:  Manuel Martín-Expósito; Maria-Eugenia Gas; Nada Mohamad; Carme Nuño-Cabanes; Ana Tejada-Colón; Pau Pascual-García; Lorena de la Fuente; Belén Chaves-Arquero; Jonathan Merran; Jeffry Corden; Ana Conesa; José Manuel Pérez-Cañadillas; Jerónimo Bravo; Susana Rodríguez-Navarro
Journal:  EMBO Rep       Date:  2019-11-03       Impact factor: 8.807

10.  Assessing regulatory features of the current transcriptional network of Saccharomyces cerevisiae.

Authors:  Pedro T Monteiro; Tiago Pedreira; Monica Galocha; Miguel C Teixeira; Claudine Chaouiya
Journal:  Sci Rep       Date:  2020-10-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.