| Literature DB >> 33171896 |
David E Whitworth1, Allison Zwarycz1.
Abstract
As prokaryotes diverge by evolution, essential 'core' genes required for conserved phenotypes are preferentially retained, while inessential 'accessory' genes are lost or diversify. We used the recently expanded number of myxobacterial genome sequences to investigate the conservation of their signalling proteins, focusing on two sister genera (Myxococcus and Corallococcus), and on a species within each genus (Myxococcus xanthus and Corallococcus exiguus). Four new C. exiguus genome sequences are also described here. Despite accessory genes accounting for substantial proportions of each myxobacterial genome, signalling proteins were found to be enriched in the core genome, with two-component system genes almost exclusively so. We also investigated the conservation of signalling proteins in three myxobacterial behaviours. The linear carotenogenesis pathway was entirely conserved, with no gene gain/loss observed. However, the modular fruiting body formation network was found to be evolutionarily plastic, with dispensable components in all modules (including components required for fruiting in the model myxobacterium M. xanthus DK1622). Quorum signalling (QS) is thought to be absent from most myxobacteria, however, they generally appear to be able to produce CAI-I (cholerae autoinducer-1), to sense other QS molecules, and to disrupt the QS of other organisms, potentially important abilities during predation of other prokaryotes.Entities:
Keywords: Myxococcales; carotenoids; comparative genomics; development; fruiting body formation; myxobacteria; one-component systems; quorum signalling; two-component systems
Year: 2020 PMID: 33171896 PMCID: PMC7694542 DOI: 10.3390/microorganisms8111739
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Genome properties of selected strains. A: M. xanthus strains, B: Myxococcus spp. type strains, C: Corallococcus spp. type strains, D: C. exiguus strains. In each section, the mean ± standard deviation (and as a percentage of the mean) is provided for the genome size, the % GC, and the number of protein-coding sequences. JGI = Joint Genome Institute.
| A: | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| AB022 | 257 | 9.06 | 68.9 | 6995 | VHLD00000000 | [ |
|
|
| AB024B | 365 | 9.06 | 68.9 | 7013 | SRLY00000000 | [ |
|
|
| AB056 | 230 | 9.11 | 69.1 | 7065 | VHLB00000000 | [ |
|
|
| CA005 | 227 | 9.11 | 68.9 | 7175 | SRLV00000000 | [ |
|
|
| CA006 | 360 | 9.05 | 68.9 | 6991 | SRLU00000000 | [ |
|
|
| CA010 | 250 | 9.05 | 68.9 | 6981 | VHLA00000000 | [ |
|
|
| CA018 | 727 | 9.07 | 68.8 | 7102 | JAAEAG000000000 | [ |
|
|
| CA023 | 235 | 9.08 | 68.9 | 7047 | JAAEAH000000000 | [ |
|
|
| CA027 | 252 | 9.05 | 68.9 | 7001 | WBSK00000000 | [ |
|
|
| DK1622 | 1 | 9.14 | 68.9 | 7216 | GCF_000012685 | [ |
| 9.08 ± 0.03 | 68.9 ± 0.1 | 7059 ± 82 | ||||||
| B: | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| DSM 2260 | 57 | 9.24 | 69.2 | 7340 | FNAJ00000000 | JGI |
|
|
| AB053B | 124 | 11.39 | 68.9 | 8751 | JAAIXY01000000000 | [ |
|
|
| DSM 16525 | 42 | 10.82 | 70.0 | 8318 | FOIB00000000 | JGI |
|
|
| Mixupus | 1 | 9.49 | 69.2 | 7069 | GCA_000280925 | [ |
|
|
| AM401 | 1077 | 12.41 | 68.7 | 9508 | VIFM00000000 | [ |
|
|
| DSM 14697 | 1 | 8.97 | 70.6 | 6966 | GCA_002305895 | [ |
|
|
| DSM 14675 | 1 | 10.35 | 69.2 | 7796 | GCA_000331735 | [ |
|
|
| AM301 | 1008 | 8.99 | 69.9 | 7055 | JAAIYB000000000 | [ |
|
|
| CA032A | 177 | 12.67 | 70.2 | 9986 | JAAIYA000000000 | [ |
|
|
| DSM 14698 | 825 | 13.53 | 70.5 | 10513 | JABBJJ000000000 | [ |
|
|
| CA060A | 136 | 12.67 | 70.3 | 9355 | JAAIXZ000000000 | [ |
| 10.96 ± 1.68 | 69.7 ± 0.6 | 8423 ± 1279 | ||||||
| C: | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| AB050A | 625 | 9.98 | 70.0 | 7905 | RAWK00000000 | [ |
|
|
| CA043D | 530 | 10.79 | 69.9 | 8511 | RAWE00000000 | [ |
|
|
| DSM 2259 | 1 | 10.08 | 69.9 | 7893 | GCA_000255295 | [ |
|
|
| AB043A | 961 | 10.15 | 70.3 | 8018 | RAVW00000000 | [ |
|
|
| AB047A | 459 | 9.47 | 70.0 | 7566 | RAWM00000000 | [ |
|
|
| CA051B | 1244 | 10.53 | 70.3 | 8137 | RAWB00000000 | [ |
|
|
| CA031B | 1491 | 10.51 | 69.7 | 8167 | RAWI00000000 | [ |
|
|
| CA040B | 802 | 10.39 | 70.2 | 7877 | RAWG00000000 | [ |
|
|
| CA054A | 863 | 10.35 | 69.5 | 8008 | RAVZ00000000 | [ |
|
|
| DSM 14696 | 36 | 10.41 | 69.6 | 8112 | JAAAPK000000000 | [ |
| 10.27 ± 0.37 | 69.9 ± 0.3 | 8019 ± 245 | ||||||
| D: | ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| AB004 | 735 | 10.60 | 69.4 | 8223 | RAWS00000000 | [ |
|
|
| AB016 | 1212 | 10.75 | 69.6 | 8940 | JABEKY000000000 | This Study |
|
|
| AB018 | 647 | 10.45 | 69.4 | 8185 | RAWR00000000 | [ |
|
|
| AB030 | 552 | 10.63 | 69.6 | 8334 | RAWQ00000000 | [ |
|
|
| AB031 | 611 | 10.43 | 69.7 | 8356 | JABEKZ000000000 | This Study |
|
|
| AB032C | 298 | 10.45 | 69.5 | 8078 | RAWP00000000 | [ |
|
|
| AB038B | 471 | 10.77 | 69.3 | 8409 | RAWO00000000 | [ |
|
|
| AB051 | 1378 | 10.76 | 69.6 | 9025 | JABELA000000000 | This Study |
|
|
| CA041A | 794 | 10.26 | 69.5 | 8071 | RAWF00000000 | [ |
|
|
| CA048 | 723 | 10.35 | 69.6 | 8428 | JABELB000000000 | This Study |
| 10.55 ± 0.18 | 69.5 ± 0.1 | 8405 ± 330 |
Variability in the numbers of regulatory genes per genome. For each class of protein, the mean number (± standard deviation (sd)), and the variability coefficient (standard deviation as a function of the mean) are presented, for four taxonomic groupings (the number of strains in each taxonomic grouping is indicated in parentheses). Variability coefficients greater than 10% are in bold, and values for genome-wide numbers of CDS (protein coding sequences) are provided for comparison. Gray rows represent classes, while white rows are sub-classes of the class above. TCS (two-component system) and TF (transcription factor) proteins are subdivided into different sub-classes based on domain organisation. HK = histidine kinase, P = phosphotransfer protein, RR = response regulator, TR = transcriptional regulator, OCS = one-component system.
| (mean ± sd) | (% Variability) | (mean ± sd) | (% Variability) | (mean ± sd) | (% Variability) | (mean ± sd) | (% Variability) | |
|---|---|---|---|---|---|---|---|---|
| TCS | 280 ± 2 | 0.81 | 329 ± 51 |
| 306 ± 12 | 4.05 | 302 ± 3 | 1.15 |
| HK | 141 ± 2 | 1.62 | 174 ± 35 |
| 160 ± 6 | 3.98 | 154 ± 2 | 1.00 |
| P | 3 ± 1 |
| 3 ± 1 |
| 3 ± 1 |
| 4 ± 2 |
|
| RR | 136 ± 1 | 0.90 | 151 ± 17 |
| 143 ± 7 | 4.56 | 143 ± 2 | 1.05 |
| S/T Kinases | 92 ± 5 | 5.25 | 124 ± 31 |
| 107 ± 8 | 7.00 | 113 ± 5 | 4.43 |
| TF | 263 ± 5 | 1.73 | 363 ± 79 |
| 341 ± 24 | 7.04 | 358 ± 10 | 2.89 |
| TR | 123 ± 2 | 1.33 | 171 ± 38 |
| 165 ± 11 | 6.92 | 178 ± 10 | 5.52 |
| OCS | 36 ± 3 | 7.18 | 70 ± 27 |
| 73 ± 9 |
| 71 ± 7 | 9.71 |
| RR TF | 50 ± 0 | 0.85 | 58 ± 8 |
| 48 ± 4 | 8.43 | 53 ± 1 | 1.96 |
| σ factors | 55 ± 1 | 1.77 | 65 ± 12 |
| 55 ± 3 | 5.75 | 56 ± 2 | 4.04 |
| CDS | 7059 ± 82 | 1.16 | 8423 ± 1279 |
| 8019 ± 245 | 3.06 | 8405 ± 330 | 3.93 |
Figure 1Response regulator (RR) families encoded in myxobacterial genomes. The strains under consideration are in the same order (front to back) as those detailed in Table 1 (top to bottom). Different strains and species exhibit similar profiles of RR families, although conserved differences can be seen in some groups of genomes.
Numbers of illustrative protein family members per genome. Genomes are presented in four taxonomic groupings with the same background shade. Also shown are the inferred numbers of proteins in the core and accessory genomes of M. xanthus strains.
| Protein Class | P | P | RR | RR | RR | RR | RR | RR | RR | RR | TR | TR | TR | TR | TR | TR |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Protein Family | HisKA | Hpt | CheV | CyC-C | LytTR | OmpR | PrrA | TrxB | VieB | Xre | MerR | Fur | HrcA | NrdR | PucR | Rok |
| 3 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 1 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 2 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 4 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 4 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 1 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 1 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 2 | 1 | 1 | 2 | 11 | 2 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 4 | 1 | 2 | 2 | 11 | 2 | 1 | 1 | 4 | 2 | 1 | 1 | 1 | ||||
| Core (per | 2 | 0 | 1 | 1 | 2 | 11 | 2 | 0 | 1 | 1 | 4 | 2 | 1 | 1 | 0 | 1 |
| Accessory (per | 1 | 0 | 0 | 0.6 | 0 | 0 | 0 | 0 | 0 | 0 | 0.9 | 0 | 0 | 0 | 0 | 0 |
| 2 | 1 | 1 | 2 | 11 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 12 | 18 | 2 | 1 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||
| 3 | 1 | 2 | 11 | 13 | 2 | 1 | 1 | 1 | 7 | 3 | 1 | 1 | 1 | |||
| 2 | 1 | 2 | 6 | 14 | 2 | 1 | 4 | 2 | 1 | 1 | 1 | |||||
| 5 | 1 | 2 | 14 | 15 | 2 | 1 | 1 | 1 | 7 | 2 | 1 | 1 | 1 | |||
| 3 | 1 | 2 | 2 | 10 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 12 | 13 | 2 | 1 | 1 | 1 | 5 | 2 | 1 | 1 | ||||
| 5 | 1 | 1 | 2 | 9 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 10 | 14 | 2 | 1 | 1 | 7 | 2 | 1 | 1 | 1 | ||||
| 2 | 1 | 1 | 3 | 8 | 14 | 2 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 | ||
| 3 | 1 | 2 | 10 | 13 | 2 | 1 | 7 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 5 | 9 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 8 | 12 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 2 | 1 | 2 | 5 | 9 | 1 | 1 | 7 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 5 | 11 | 1 | 1 | 8 | 2 | 1 | 1 | 1 | |||||
| 2 | 1 | 2 | 2 | 8 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | |||||
| 4 | 1 | 2 | 7 | 9 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | |||||
| 5 | 1 | 2 | 8 | 11 | 1 | 1 | 7 | 2 | 1 | 1 | 1 | |||||
| 3 | 1 | 2 | 4 | 10 | 1 | 1 | 8 | 2 | 1 | 1 | 1 | |||||
| 3 | 1 | 2 | 7 | 10 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | |||||
| 2 | 1 | 2 | 6 | 10 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | |||||
| 3 | 1 | 2 | 7 | 13 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 8 | 1 | 2 | 7 | 13 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 1 | 2 | 6 | 11 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 | |||
| 3 | 1 | 1 | 2 | 6 | 11 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 | |||
| 3 | 1 | 2 | 7 | 13 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 7 | 13 | 1 | 1 | 7 | 2 | 1 | 1 | 1 | 1 | ||||
| 3 | 1 | 2 | 8 | 12 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 5 | 1 | 2 | 7 | 13 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | ||||
| 2 | 1 | 1 | 2 | 5 | 11 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 | |||
| 3 | 1 | 2 | 6 | 11 | 1 | 1 | 6 | 2 | 1 | 1 | 1 | 1 |
Figure 2The percentage of RRs and TFs + OCSs found in the pan-genome core for the four groups of genomes.
Figure 3Conservation of signalling pathway proteins in Myxococcaceae. Regulatory proteins are shown as ovals. Positive regulation is shown with pointed arrows, and negative regulation with blunt-headed arrows. (A) Carotenoid production. In the dark (top), CarQ is held inactive by CarR, while CarA and CarH repress expression of the constitutively active crt promoter. In the light (bottom), CarH is directly inactivated while CarF inactivates CarR, releasing CarQ to direct transcription of carQRS, producing CarS which binds to CarA, relieving repression of the crt genes, which encode enzymes for the biosynthesis of carotenoids (lycopene shown as an example). (B) Fruiting body formation. Gene products work together in modules (dark grey boxes). Starvation triggers the production of the secondary messengers c-di-GMP and (p)ppGpp and activates the EBP (enhancer binding protein) cascade and the Mrp module. The A-signalling, C-signalling, FruA and Nla24/DmxB modules are stimulated by secondary messengers and regulatory modules. Various ‘development timer’ proteins regulate the timing of fruiting, and the DevTRS/CRISPR module modulates the timing of sporulation. ‘Other’ proteins regulate fruiting, but their relationship to other modules is not clear. (C) Quorum signalling. Four common Gram-negative bacterial quorum signals (AI-I, AI-II, CAI-I and HAI-I), and their corresponding synthase and receptor/regulator proteins, are shown. Also shown is the quorum-quenching AHL acylase, PvdQ. Whether regulatory proteins are core, conserved, or dispensable, is indicated based on their pattern of evolutionary conservation. Core proteins are found at a constant number per genome across the Myxococcaceae and are highlighted in bold text. Conserved proteins are found in all groups of Myxococcaceal genomes, but in variable numbers, and are indicated with a pale grey background and dashed outline. Dispensable proteins are absent from some groups of Myxococcaeal genomes and are shown with a transparent background, and grey text.