| Literature DB >> 25101984 |
Peyman Zarrineh1, Aminael Sánchez-Rodríguez2, Nazanin Hosseinkhan3, Zahra Narimani3, Kathleen Marchal4, Ali Masoudi-Nejad3.
Abstract
Availability of genome-wide gene expression datasets provides the opportunity to study gene expression across different organisms under a plethora of experimental conditions. In our previous work, we developed an algorithm called COMODO (COnserved MODules across Organisms) that identifies conserved expression modules between two species. In the present study, we expanded COMODO to detect the co-expression conservation across three organisms by adapting the statistics behind it. We applied COMODO to study expression conservation/divergence between Escherichia coli, Salmonella enterica, and Bacillus subtilis. We observed that some parts of the regulatory interaction networks were conserved between E. coli and S. enterica especially in the regulon of local regulators. However, such conservation was not observed between the regulatory interaction networks of B. subtilis and the two other species. We found co-expression conservation on a number of genes involved in quorum sensing, but almost no conservation for genes involved in pathogenicity across E. coli and S. enterica which could partially explain their different lifestyles. We concluded that despite their different lifestyles, no significant rewiring have occurred at the level of local regulons involved for instance, and notable conservation can be detected in signaling pathways and stress sensing in the phylogenetically close species S. enterica and E. coli. Moreover, conservation of local regulons seems to depend on the evolutionary time of divergence across species disappearing at larger distances as shown by the comparison with B. subtilis. Global regulons follow a different trend and show major rewiring even at the limited evolutionary distance that separates E. coli and S. enterica.Entities:
Mesh:
Year: 2014 PMID: 25101984 PMCID: PMC4125155 DOI: 10.1371/journal.pone.0102871
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic representation of COMODO output for the first detected module across E. coli, B. subtilis, and S. enterica.
Modules in conserved co-expressed triplets are composed of homologous triplets between three organisms (core part). In addition, homologous pairs can be detected which are conserved only between two organisms, that share a mutual co-expression in each of the species. Furthermore, additional genes can also be detected for which the co-expression with the homologous linker genes was found to be species-specific.
Overview of evolutionary co-expressed conserved modules across three organisms.
| Biological process | Module number in |
| Nucleobase-containing compound metabolic process | 65-66-67-68-75-103-105-106 |
| Amino acid metabolic process | 40-41-42-43-44-45-46-47-48-49-71-72-76-102-104 |
| Metabolism of co-factors and vitamins | 73-74 |
| Carbohydrate metabolic process | 1-2-97-98-107 |
| Transport | 10-11-13-14-15-16-17-18-19-20-21-22-23-24-25-26-27-28-29-30-31-32-33-34-35-36-37-38-39-55-76-77-78-79-80-81-82-109 |
| Aerobic respiration | 62-63-64-99-100-101 |
| Anaerobic respiration | 50-51-52-53 |
| Chaperoning, repair (refolding) | 54 |
| Ribosomal metabolism and translation | 57-58- |
| Motility and flagella synthesis |
|
| Iron acquisition | 91-92-93-94-95 |
| Cellular response to DNA damage | 108 |
| Unknown function | 56 |
The most enriched GO term from the biological process subtree amongst the genes in each module is shown (left column). The numbers of co-expressed modules showing enrichment in the same term are grouped (right column). Conserved co-expressed modules across E. coli, B. subtilis, and S. enterica are their corresponding module numbers as in . The module numbers related to large evolutionary conserved co-expressed module, which contain at least 16 genes in their core part, are highlighted by bold characters.
Overview of evolutionary co-expressed conserved modules across E. coli and S. enterica.
| Biological process | Module number in |
| Nucleobase-containing compound metabolic process | 77-78-79-84-86-87-88-93-98-99-100-104-111-127-131-141-142-144-145-171-205-206 |
| Amino acid metabolic process | 1-13-14-31-60-61-113-132-133-134-135-136-137-143-152-176-178-179-180-181-200-210 |
| Metabolism of co-factors and vitamins | 66-67-140-155-156-157-190-203 |
| Carbohydrate metabolic process | 7-9-22-23-24-30-33-63-92-106-107-117-128-146-147-148-149-150-151-197-199-211 |
| Lipid metabolic process | 2-8-15-85-92-182 |
| Transport | 3-4-5-6-10-12-32-53-55-56-57-95-109-110-125-138-154-167-191 |
| Aerobic respiration | 30-39-40-42-155-158- |
| Anaerobic respiration | 34-35-36-37-41-43- |
| Chaperoning, repair (refolding) | 73-74-75-124 |
| Ribosomal metabolism and translation | 68-80- |
| Motility and flagella synthesis |
|
| Iron acquisition and Iron-sulfur metabolism | 168-169-170- |
| Cell shape and cell division | 72-130-192-207 |
| Response to stress | 20-21-70-71-76-112-118-120-193 |
| Response to external stimulus | 11-59 |
| Response to chemical stimulus | 121 |
| Response to abiotic stimulus | 204 |
| Cellular response to DNA damage | 202 |
| Signal transduction | 25-26-27-126-184-185 |
| Biofilm formation | 166 |
| Unknown function | 16-17-103-105-115-129-163-164-165 |
The most enriched GO term from the biological process subtree amongst the genes in each module is shown (left column). The numbers of co-expressed modules showing enrichment in the same term are grouped (right column). Conserved co-expressed modules across E. coli and S. enterica are their corresponding module numbers as in . The module numbers related to large evolutionary conserved co-expressed module, which contain at least 16 genes in their core part, are highlighted by bold characters.
Overview of evolutionary conserved regulators across three organisms.
| Regulator | Module number | Targets' co-expression conservation | Regulator co-expression conservation |
| NrdR | 65-66-67-68 | Yes | No |
| Fur | 91-92-93-94-95 | Yes | No |
| LexA | 108 | Yes | Yes |
| ArgR/AhrC | 69-70 | Yes | No |
| FliA/SigD | 83-84-85-86-87-88-89-90 | Yes | Yes |
| FlgM | 83-89 | Yes | Yes |
|
| 75-105-106 | Yes | No |
|
| 10 | Yes | No |
|
| 1-2 | Yes | No |
Conserved regulators between E. coli, B. subtilis, and S. enterica and the corresponding number of the modules which are enriched as the targets of these regulators. The same module numbers are used as in . Targets' co-expression conservation: refers to whether the known targets of the corresponding regulator showed co-expression conservation across the studies species (i.e. they were detected on the core part of the co-expressed module). Regulator co-expression conservation: refers to whether the corresponding regulator itself showed co-.expression conservation across the studied species. The non-orthologous regulators between E. coli and B. subtilis predicted as being functional counterparts i.e. they are responsible for co-expression conserved target genes are highlighted by bold characters.
Figure 2Selected co-expressed conserved modules across E. coli and S. enterica.
A. Core part of co-expressed conserved module regulated by transcription factor CysB in E. coli. Existence of orthologous transcription factors CysB in S. enterica makes it highly probable that CysB is responsible for observed co-expression in module 181 of S. enterica. In addition, co-expression conservation of ydjN in both organisms may imply that this gene is also a target of CysB, and ydjN is involved in the same biological process as the other genes (cysteine metabolism). B. Co-expression conservation of motility and flagerlla synthesis (module 162). Transcription factor FlhCD and sigma factor FliA is known to be responsible for the co-expression of genes involved in this biological process in both organisms. Co-expression conservation of sigma factors FliA and FliZ, anti-sgima factor FlgM, and transcription factor YcgR may also imply the similarity in regulatory interaction conservation. From 20 genes detected as variable part in S. enterica just four genes (srfB, srfC, STM1300, STM2314) has not previously been identified as motility and flagerlla synthesis in E. coli. The other 16 genes could be detected in E. coli if the lower threshold would be used, but using lower threshold could also introduce many new non-linking genes this time in the variable part of E. coli. C. Co-expression conservation of two anti-sigma factor RseA and RseB in module 193. We expect that sigma factor RpoE is also conserved in co-expression as all these genes are in one operon in both E. coli and S. enterica (see also ). D. Homologous transcription factors CsgD and STM0347 are co-expressed in linked co-expressed module 166. CsgD also exist in S. enterica and probably not detected as co-expressed gene in S. enterica because of available condition set in this organism (see also ).
Figure 3Expression behavior of genes in co-expressed modules 166 (Panel A) and 193 (Panel B) of Table S1 in S. enterica.
Genes in black are the genes which are found as the co-expressed modules by COMODO. While genes in red (csgD and rpoE) are the ones which are not found in the co-expressed modules, but their ortholgous pair are co-expressed with the E. coli counterpart modules. We expect that genes in red (csgD and rpoE) should also be part of their modules as they are in the same operon with some genes of their modules. Shaded areas correspond to conditions not shared for the genes which were not detected as co-expressed in S. enterica (red genes). The fact that these conditions are much smaller in number than the conditions genes in red (csgD and rpoE) show co-expression with the rest of the modules genes increases the probability that these genes are actually in those modules.
Overview of evolutionary conserved regulators across E. coli and S. enterica.
| Regulator | Module number | Targets' co-expression conservation | Regulator co-expression conservation |
| TrpR | 210 | Yes | No |
| CysB | 180-181 | Yes | No |
| NtrC(glnG) | 13-110-200 | Yes | Yes |
| ArgR | 13-132-133 | Yes | No |
| Fis | 84-96 | No | Yes |
| PhdR | 96 | No | Yes |
| PurR | 99-141-142-144-145-211 | Yes | No |
| PepA | 143 | Yes | No |
| LsrR | 3-5-6-7-9 | Yes | No |
| GalS | 7-10-197 | Yes | No |
| GalR | 7-10-197 | Yes | No |
| FadR | 7-8 | Yes | No |
| MelR | 22 | Yes | No |
| MalT | 23-24 | Yes | No |
| SrlR | 28 | Yes | No |
| MtlR | 63 | Yes | Yes |
| GcvA | 211 | Yes | No |
| CsiR | 15 | Yes | No |
| AccB | 85 | Yes | No |
| PrpR | 182 | Yes | No |
| LldR | 183 | Yes | Yes |
| IclR | 30-42 | Yes | No |
| GlpR | 187-188 | Yes | No |
| LexA | 202 | Yes | Yes |
| RseA | 193 | No | Yes |
| RseB | 193 | No | Yes |
| RpoE | 193 | No | Yes |
| Fur | 168-170-172-174-175 | Yes | No |
| IscR | 169-173-190 | Yes | Yes |
| SdiA | 130 | Yes | No |
| CueR | 203 | Yes | No |
| FliA | 160-162 | Yes | Yes |
| FlgM | 162 | Yes | Yes |
| FliZ | 162 | Yes | Yes |
| YcgR | 162 | No | Yes |
| FlhCD | 162 | Yes | No |
| CsgD | 166 | No | Yes |
Conserved regulators only between E. coli and S. enterica and their corresponding number of the modules which are enriched as the targets of these regulators. The same module numbers are used as in . Targets' co-expression conservation: refers to whether the known targets of the corresponding regulator showed co-expression conservation across the studies species (i.e. they were detected on the core part of the co-expressed module). Regulator co-expression conservation: refers to whether the corresponding regulator itself showed co-.expression conservation across the studied species.
Figure 4Phylogenetic tree of STM0347 and CsgD.
Both proteins were used as queries for BLAST searches to retrieve their closest relatives. Collected sequences were aligned using CLUSTALW [31] and the resulting alignment file used as input for the program ‘neighbor’ of the PHYLIP tree [30] to derive the tree. A total of 100 bootstrap replicates were generated (numbers on the branches). STM0347 and CsgD (Salmonella enterica) are far apart on the tree suggesting they have evolved from each other long time ago and might be involved in different functions.