| Literature DB >> 30115066 |
Alberto Santos-Zavaleta1, Mishael Sánchez-Pérez1, Heladia Salgado1, David A Velázquez-Ramírez1, Socorro Gama-Castro1, Víctor H Tierrafría1, Stephen J W Busby2, Patricia Aquino3, Xin Fang4, Bernhard O Palsson4,5, James E Galagan3, Julio Collado-Vides6,7.
Abstract
BACKGROUND: Our understanding of the regulation of gene expression has benefited from the availability of high-throughput technologies that interrogate the whole genome for the binding of specific transcription factors and gene expression profiles. In the case of widely used model organisms, such as Escherichia coli K-12, the new knowledge gained from these approaches needs to be integrated with the legacy of accumulated knowledge from genetic and molecular biology experiments conducted in the pre-genomic era in order to attain the deepest level of understanding possible based on the available data.Entities:
Keywords: ChIP-seq; Integrative analyses; Systems biology; Transcriptional regulation; Transcriptomics; gSELEX
Mesh:
Year: 2018 PMID: 30115066 PMCID: PMC6094552 DOI: 10.1186/s12915-018-0555-y
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Number of publications studying transcriptional regulation in E. coli K-12, using either classic molecular biology or HT technologies through the years
Search results by methods
| Method | PubMed search | Datasets |
|---|---|---|
| ChIP-chip | 30 | 34 |
| ChIP-seq | 15 | 34 |
| ChIP-exo | 8 | 12 |
| Selex | 35 | 23 |
| RNA-seq | 160 | 102 |
| Microarrays | 1188 | 749 |
| Hi-C | 8 | 1 |
| IPOD | 1 | 1 |
| NET-seq | 2 | 2 |
| TraDIS | 3 | 0 |
| ChAP-seq | 2 | 0 |
| CLIP-seq | 1 | 1 |
| Bisulfite-seq | 1 | 0 |
| Genotyping | n.d. | 10 |
| RIP-seq | n.d. | 2 |
| Others | 0 | 157 |
| Total | 1454 | 1128 |
Resources for gene regulation in E. coli K-12
| Source | Type of knowledge | URL | Updated | Reference |
|---|---|---|---|---|
| RegulonDB | Transcriptional regulation, operons, regulons, gensor units | http://regulondb.ccg.unam.mx | Yes | [ |
| EcoCyc | Regulation, transport, metabolism | https:// | Yes | [ |
| COLOMBOS | Expression compendia of bacterial organisms | http:// | Yes | [ |
| STRING | Protein-protein interaction network | http:// | Yes | [ |
| GEO | Genomics HT data repository |
| Yes | [ |
| ArrayExpress | Repository of HT functional genomics experimental results | https://www.ebi.ac.uk/arrayexpress/ | Yes | [ |
| PortEco | Next-generation data for | http:// | No | [ |
| GenExpDB | Expression compendia | https://genexpdb.okstate.edu | No | – |
| EcoGene | http:// | No | [ | |
| GenProtEC | Functions encoded by the | http://genprotec.mbl.edu | No | [ |
| EchoBASE | Information from postgenomic experiments |
| No | [ |
| Bacteriome | Integrates physical (protein-protein) and functional interactions | http://www. | – | [ |
| EcoProDB | Integrates protein information | http://eecoli.kaist.ac.kr/main.html | – | [ |
| M3D | Resource for microbial gene expression data |
| No | [ |
Fig. 2Diagram of the annotation process. We collect publications in PubMed involving the HT methodologies. Searches were made for the term “coli” in the title or in the abstract and the name of the method or different synonyms or keywords related to the method in all fields of publications. The results were filtered to get unique results. We read the abstracts and eliminate all those papers not reporting experiments performed in E. coli. Frequently, the papers include additional experimental characterization for a subset of the sites based on classic methods. Metadata are extracted from each publication. For more detail see main text. The growth condition and strain are described using the controlled vocabulary defined by Tierrafría et al. [7]. The products of curation are added to RegulonDB either together with the classic curation or as a separate dataset. Image from RegulonDB [8]
Summary of all curated knowledge available in RegulonDB that was obtained via HT methodologies
| Methodologies | Number of articles | Number of TFs | Name of the TFs |
| gSELEX | 2, previous work | 3 | CRP, H-NS, and LeuO |
| 19, this work | 18 | AscG, BasR, CitB, Cra, CsgD, Dan, DpiA, LeuO, Lrp, NemR, OmpR, PdhR, PgrR, RcdA, RstA, RutR, SdiA, and SutR | |
| ChIP-chip | 1, previous work | 1 | PurR |
| 17, this work | 15 | ArcA, ArgR, CRP, Fis, FNR, H-NS, IHF, LexA, Lrp, NsrR, RpoD (Sigma70), RpoH (Sigma32), RutR, Rho, and TrpR | |
| ChIP-exo | 2, previous work | 6 | GadE, GadW, GadX, OxyR, SoxS, and SoxR |
| 7, this work | 4 | ArgR, Fur, OmpR, and UvrY | |
| ChIP-seq | 8, this work | 8 | CsiR, FNR, Fur, H-NS, Nac, OmpR, RpoD (Sigma70), and RpoS (Sigma38) |
| Methodologies | Number of articles | Number of TSSs | Dataset in RegulonDB |
| TSS determination | 2, previous work | 5197 | |
| 1806 | |||
| 1, previous work | 14000 |
Summary of curated HT-generated regulatory interactions. The total of new RIs is 1048 and those RIs already existing are 107
| Complete data uploaded in RegulonDB | |||||||
|---|---|---|---|---|---|---|---|
| Datasets | Regulatory interactions | ||||||
| TF | Total number of peaks | Sites with missing information | New | Known (added in evidence) | PMID | HT methodology | Reference |
| ArgR | 122 | 37 | 67 | 18 | 25735747 | ChIP-exo, qPCR, and microarray | [ |
| ArgR | 48 | 34 | 10 | 4 | 22082910, this work | ChIP-chip and microarray | [ |
| ArcA | 278 | 143 | 115 | 20 | 24699140 | ChIP-chip, qPCR, and microarray | [ |
| CsiR | 126 | 0 | 126 | 0 | 28061857 | ChIP-seq and RNA-seq | [ |
| FNR | 224 | 186 | 29 | 9 | 24699140 | ChIP-chip, qPCR, and microarray | [ |
| FNR | 53 | 0 | 29 | 24 | 23818864 | ChIP-seq and microarray | [ |
| Fur | 144 | 87 | 39 | 18 | 25222563 | ChIP-exo and RNA-seq | [ |
| Fur | 134 | 119 | 12 | 3 | 26670385, this work | ChIP-seq and microarray | [ |
| Lrp | 143 | 67 | 68 | 8 | 19052235 | ChIP-chip and microarray | [ |
| Nac | 534 | 0 | 531 | 3 | 28061857 | ChIP-seq and RNA-seq | [ |
| OmpR | 41 | 31 | 10 | 0 | 26332955 | gSELEX | [ |
| OmpR | 41 | 30 | 11 | 0 | 28526842 | ChIP-exo and RNA-seq | [ |
| TrpR | 8 | 7 | 1 | 0 | 22082910, this work | ChIP-chip and microarray | [ |
Summary of interactions curated in datasets
| TF interactions within datasets | ||||
| TF | Number of interactions | PMID | HT methodology | Reference |
| ArcA | 143 | 24699140 | ChIP-chip | [ |
| ArgR | 426 | 22082910 | ChIP-chip | [ |
| ArgR | 38 | 25735747 | ChIP-exo | [ |
| AscG | 9 | 19633077 | gSELEX | [ |
| BasR | 99 | 22442305 | gSELEX | [ |
| CitB | 15 | 18997424 | gSELEX | [ |
| Cra | 14 | 16115199 | gSELEX | [ |
| Cra | 234 | 21115656 | gSELEX | [ |
| CRP | 39 | 16301522 | ChIP-chip | [ |
| CsgD | 31 | 21421764 | gSELEX | [ |
| CsiR | 126 | 28061857 | ChIP-seq | [ |
| Dan | 176 | 20156994 | gSELEX | [ |
| DpiA | 15 | 18997424 | gSELEX | [ |
| Fis | 228 | 16963779 | ChIP-chip | [ |
| FNR | 137 | 17164287 | ChIP-chip | [ |
| FNR | 796 | 23818864 | ChIP-seq and ChIp-chip | [ |
| FNR | 186 | 24699140 | ChIP-chip | [ |
| Fur | 473 | 26670385 | ChIP-seq | [ |
| Fur | 91 | 25222563 | ChIP-exo | [ |
| H-NS | 1501 | 23818864 | ChIP-chip | [ |
| H-NS | 101 | 16963779 | ChIP-chip | [ |
| H-NS | 53 | 21097887 | ChIP-seq | [ |
| IHF | 1020 | 23818864 | ChIP-chip | [ |
| IHF | 155 | 16963779 | ChIP-chip | [ |
| LeuO | 17 | 19429622 | gSELEX | [ |
| LexA | 69 | 16264194 | ChIP-chip | [ |
| Lrp | 67 | 19052235 | ChIP-chip | [ |
| Lrp | 296 | 28348809 | gSELEX | [ |
| Nac | 537 | 28061857 | ChIP-seq | [ |
| NemR | 6 | 18567656 | gSELEX | [ |
| NsrR | 83 | 19656291 | ChIP-chip | [ |
| OmpR | 68 | 28061857 | ChIP-seq | [ |
| OmpR | 30 | 28526842 | ChIP-exo | [ |
| OmpR | 31 | 26332955 | gSELEX | [ |
| PdhR | 14 | 17513468 | gSELEX | [ |
| PgrR | 82 | 23301696 | gSELEX | [ |
| RcdA | 39 | 23233451 | gSELEX | [ |
| RstA | 34 | 17468243 | gSELEX | [ |
| RutR | 20 | 18515344 | ChIP-chip | [ |
| RutR | 9 | 17919280 | gSELEX | [ |
| SdiA | 212 | 24645791 | gSELEX | [ |
| SutR | 15 | 25406449 | gSELEX | [ |
| TrpR | 17 | 22082910 | ChIP-chip | [ |
| UvrY | 288 | 26673755 | CHIP-exo | [ |
| Sigma factors and Rho interactions within datasets | ||||
| Sigma factors and Rho | Number of interactions | PMID | HT methodology | Reference |
| RpoD (Sigma70) | 1214 | 16109958 | ChIP-chip | [ |
| RpoD (Sigma70) | 528 | 16301522 | ChIP-chip | [ |
| Rho | 260 | 19706412 | ChIP-chip | [ |
| RpoD (Sigma70) | 6350 | 23818864 | ChiP-seq | [ |
| RpoH (Sigma32) | 82 | 16892065 | ChIP-chip | [ |
| RpoH (Sigma32) | 44 | 20602746 | ChIP-chip | [ |
| RpoS (Sigma38) | 91 | 26020590 | ChiP-seq | [ |
Fig. 3Elementary pieces of curation. As new methods emerge, we need to separately curate evidence and references for each elementary piece of knowledge that, when combined, supports our understanding. Here, we have separated evidence for binding of TFs and evidence for an effect on transcription either of a known promoter or on a target gene or TU for which the promoter is not known