| Literature DB >> 35265061 |
Kateřina Burkartová1, Jiří Dresler2, Jakub Rídl3,4, Lukáš Falteisek5.
Abstract
Intrapopulation genetic variability in prokaryotes is receiving increasing attention thanks to improving sequencing methods; however, the ability to distinguish intrapopulation variability from species clusters or initial stages of gene flow barrier development remains insufficient. To overcome this limitation, we took advantage of the lifestyle of Ferrovum myxofaciens, a species that may represent 99% of prokaryotic microbiome of biostalactites growing at acid mine drainage springs. We gained four complete and one draft metagenome-assembled F. myxofaciens genomes using Oxford Nanopore and Illumina sequencing and mapped the reads from each sample on the reference genomes to assess the intrapopulation variability. We observed two phenomena associated with intrapopulation variability: hypervariable regions affected by mobilome expansion called "scrapyards," and variability in gene disruptions caused by transposons within each population. Both phenomena were previously described in prokaryotes. However, we present here for the first time scrapyard regression and the development of a new one. Nearly complete loss of intrapopulation short sequence variability in the old scrapyard and high variability in the new one suggest that localized gene flow suppression is necessary for scrapyard formation. Concerning the variable gene disruptions, up to 9 out of 41 occurrences per sample were located in highly conserved diguanylate cyclases/phosphodiesterases. We propose that microdiversification of life strategies may be an adaptive outcome of random diguanylate cyclase elimination. The mine biostalactites thus proved as a unique model system for describing genomic intrapopulation processes, as they offer easily sampleable units enriched in a single microbial species.Entities:
Keywords: chemolithotrophic bacteria; horizontal gene flow; metagenome-assembled genomes; mobile elements; population genomics
Year: 2022 PMID: 35265061 PMCID: PMC8899612 DOI: 10.3389/fmicb.2022.828531
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Basic characteristics of the five Ferrovum myxofaciens metagenome-derived genomes.
| Ferrovum SSU rDNA in amplicons (%) | Illumina coverage | Nanopore coverage | Contigs | GC content (%) | Length (bp) | Genes | tRNA genes | rRNA arrays | |
| MI1A | 87.4 | 1191 | 201.7 | 1 | 54.1 | 2749106 | 2751 | 42 | 2 |
| MI1I | 99.2 | 1320 | 169.5 | 1 | 54.1 | 2754432 | 2782 | 42 | 2 |
| MI1III | 88.4 | 0 | 401.0 | 1 | 54.1 | 2748450 | 2759 | 42 | 2 |
| S2.4 | 98.9 | 1147 | 384.0 | 1 | 54.6 | 2673932 | 2670 | 42 | 2 |
| OL2a6 | 98.4 | 1188 | 21.6 | 25 | 54.4 | 2695039 | 2722 | 42 | 2 |
FIGURE 1Reference-based whole genome comparisons of the new Ferrovum myxofaciens strains. One of the genomes was used as a reference in each ring: (A) MI1A, (B) MI1I, (C) MI1III, (D) S2.4, and (E) OL2a6. The innermost ring shows the GC skew. Four inner colored rings show the sequence identity between genomes from individual samples. Contig layout is displayed as black and gray sections in OL2a6 genome. Red and blue arcs mark the original and the S2.4 alternative scrapyard in each genome, respectively. The scrapyard is interrupted and GC skew is inconsistent due to random contig orientation in OL2a6. Three outer rings show (from the inside) (1) SSV variability revealed by Illumina read alternatives, (2) “scrapyard-like content” – marked are spots where the ORFs for hypothetical proteins (light blue), transposases or other MEs (red), and phage proteins (black) start, (3) presence/absence variability in insertion sequences (IS) revealed by Nanopore read alternatives: IS outside ORFs or inside hypothetical proteins (red). IS disrupting genes with annotated functions – IS is present in reference genome (green), IS is present in alternative reads (blues). The numbers of labels are described in Supplementary Tables 4a–e. The SSV variability is not available for genome MI1III, because Illumina reads are not available for this sample. The visualization was performed in BRIG (Alikhan et al., 2011) using Blastn method.
FIGURE 2A phylogenetic tree based on 12 conserved housekeeping genes. New strains of Ferrovum myxofaciens described in this study are in bold.
The main differences in content of genes with known function in Ferrovum myxofaciens from various sites.
| Operon/gene cluster | MI group | S2.4 | OL2a6 | Position in genomes (kbp) | Length (kbp) |
| Cellulose biosyntheses/degradation | Yes | No | No | 823–835 (in MI1A) | 12 |
| Assimilatory nitrate reduction | Yes | Yes | No | 1287–1302 (in MI1A) | 15 |
| Carboxysome gene cluster | No | Yes | Yes | 1586–1591 (in S2.4) | 5 |
| Mercury resistance and reduction | No | Yes | No | 467–470 (in S2.4) | 3 |
| CRISPR-Cas system type I | No | Yes | No | 1525–1534 (in S2.4) | 9 |
Basic characteristics of “scrapyard” genome region of five Ferrovum myxofaciens strains.
| Scrapyard | ||||||||||||||||||
| Hypothetical proteins | Phage protein | ME proteins | Rest of ORF | SSV | ||||||||||||||
|
|
|
|
|
|
| |||||||||||||
| Count | % | Per 10 kbp | Count | % | Per 10 kbp | Count | % | Per 10 kbp | Count | % | Per 10 kbp | Count | Per 10 kbp | ORF total | Length (bp) | From bp | To bp | |
| MI1A | 175 | 71.7 | 9.03 | 9 | 3.69 | 0.46 | 22 | 9.02 | 1.13 | 38 | 15.6 | 1.96 | 231 | 11.9 | 244 | 193839 | 32354 | 226193 |
| MI1I | 175 | 70.3 | 9.04 | 10 | 4.02 | 0.52 | 21 | 8.43 | 1.09 | 43 | 17.3 | 2.22 | 162 | 8.37 | 249 | 193534 | 32354 | 225888 |
| MI1III | 162 | 71.1 | 8.79 | 9 | 3.95 | 0.49 | 16 | 7.02 | 0.87 | 41 | 18.0 | 2.22 | N/A | N/A | 228 | 184321 | 32348 | 216669 |
| S2.4 | 95 | 61.7 | 7.05 | 1 | 0.65 | 0.07 | 7 | 4.55 | 0.52 | 51 | 33.1 | 3.78 | 3 | 0.22 | 154 | 134793 | 31234 | 166027 |
| OL2a6 | 142 | 66.1 | 9.07 | 0 | 0 | 0 | 32 | 14.9 | 2.04 | 41 | 19.1 | 2.62 | 61 | 3.89 | 215 | 156636 | 293058 | 449694 |
| S2.4 alt. | 150 | 62.2 | 7.97 | 18 | 7.47 | 0.96 | 12 | 4.98 | 0.64 | 61 | 25.3 | 3.24 | 278 | 14.8 | 241 | 188208 | 456809 | 645017 |
|
| ||||||||||||||||||
|
| ||||||||||||||||||
|
|
|
|
|
| ||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
| ||||||||||||||||||
| MI1A | 1026 | 37.4 | 4.02 | 13 | 0.47 | 0.05 | 116 | 4.22 | 0.45 | 1592 | 58.0 | 6.23 | 452 | 1.77 | 2747 | 2555267 | ||
| MI1I | 1040 | 37.6 | 4.06 | 12 | 0.43 | 0.05 | 127 | 4.59 | 0.50 | 1588 | 57.4 | 6.20 | 413 | 1.61 | 2767 | 2560898 | ||
| MI1III | 1027 | 37.4 | 4.01 | 12 | 0.44 | 0.05 | 118 | 4.30 | 0.46 | 1590 | 57.9 | 6.20 | N/A | N/A | 2747 | 2564129 | ||
| S2.4 | 853 | 34.2 | 3.63 | 12 | 0.48 | 0.05 | 49 | 1.96 | 0.21 | 1522 | 61.0 | 6.47 | 317 | 1.35 | 2497 | 2350931 | ||
| OL2a6 | 1030 | 37.0 | 4.06 | 8 | 0.29 | 0.03 | 169 | 6.07 | 0.67 | 1576 | 56.6 | 6.21 | 355 | 1.40 | 2783 | 2538403 | ||
The “scrapyards” are delimited by conserved positions of tRNA-Ser-CGA and tRNA-Val-CAC genes in each genome.
List of genes encoding diguanylate cyclases (contain GGDEF and EAL domains).
| MI group | S | OL | ||||
| No. | l (bp) | 1I | 1A | 1III | ||
| 1 | 1812 | 1v | 1n | 1n | 0 | 0 |
| a | 2205 | 1v | 1n | 1n | ||
| 2 | 2046 | 0 | 0 | 0 | 0 | 0 |
| 3 | 2100 | 1v | 1v | 1v | 1v1m | 0 |
| b | 2619 | 0f | 0f | 0f | 0 | 0f |
| 4 | 3081 | 2v | 1v | 1m | 0 | 0 |
| c | 1842 | 0 | 0f | |||
| 5 | 2568 | 0 | 0 | 0 | 0 | 0 |
| 6 | 921 | 0 | 0 | 0 | 0 | 0 |
| d | 1896 | 0 | 0 | 0 | 0 | |
| 7 | 2283 | 1v | 1v | 0 | 0 | 0 |
| 8 | 1701 | 1v | 1v | 1v | 0 | 0 |
| 9 | 1194 | 0 | 0 | 0 | 0 | 0 |
| 10 | 1236 | 0 | 0 | 0 | 0 | 0 |
| 11 | 3003 | 0 | 0 | 0 | 0 | 0 |
| 12 | 2760 | 1v | 2v | 1v | 0 | 0 |
| 13 | 2871 | 0 | 0 | 0 | 0 | 0 |
| 14 | 2100 | 0 | 0 | 0 | 0 | 0 |
| e | 2797 | 0 | ||||
| 15 | 2649 | 0 | 1v | 1v | 0 | 0 |
| 16 | 2094 | 0 | 1v | 1v | 0 | 0 |
| 17 | 2145 | 1v | 1v | 1v | 0 | 0 |
| 18 | 1692 | 0 | 0 | 1v | 1m | 1? |
| f | 192 | 0 | 0 | 0 | 0 | |
| g | 2403 | 0 | 0 | 0 | ||
| h | 936 | 0 | ||||
| i | 1038 | 0 | ||||
Genes conserved in all samples are marked by numbers 1-18, sequences missing or fragmentary in a part of samples are marked by letters a-i. Numbering of the genes correspond to their order in MI group genomes (position of genes c and e was inferred from surrounding sequences).
n, possibly non-variable disruption by ME.
v, variable disruption by ME.
m, disrupted by nonsense mutation.
f, fragment of a gene (partially lost).
.
.
†, Duplication of DGC3.
‡, Contig end.
The Illumina reads quantity in eight genes afflicted by IS disruption.
| MI1A | MI1I | S2.4 | OL2a6 | |||||
| Gene annotation | nr. | Disruption ratio | nr. | Disruption ratio | nr. | Disruption ratio | nr. | Disruption ratio |
| Aerotaxis sensor receptor |
| y/n = 0.32 |
| y/n = 0.45 |
| no gene |
| n; contig end |
| Acetate kinase |
| y/n = 0.54 |
| y/n = 0.64 |
| n (0/114) |
| n (0/104) |
| Cellulose biosynthesis regulation protein BcsB |
| y/n = 0.28 |
| y/n = 0.56 |
| no gene |
| no gene |
| Diguanylate cyclase/phosphodiesterase DGC8 |
| y/n = 0.74 |
| y/n = 0.61 |
| n (0/133) |
| n (0/117) |
| Type II/IV secretion system secretin RcpA/CpaC; disruption 1 |
| y/n = 0.94 |
| y/n = 1.48 |
| n (0/45) |
| n (0/75) |
| Type II/IV secretion system secretin RcpA/CpaC; disruption 2 |
| y/n = 0.4 |
| y/n = 0.78 |
| n (0/83) |
| n (0/131) |
| Uptake [NiFe] hydrogenase, small subunit HyaA |
| n (0/88) |
| n (0/102) |
| n (0/78) |
| n (0/73) |
| Diguanylate cyclase/phosphodiesterase DGC3 |
| n (0/129) |
| n (0/150) |
| y/n = 0.78 |
| n (0/124) |
| Diguanylate cyclase/phosphodiesterase DGC4 |
| y/n = 0.52 |
| y/n = 0.44 |
| n (0/86) |
| n (0/126) |
The columns marked “nr.” are marked identically as in