| Literature DB >> 34946030 |
Laura Baxter1,2, Proyash Roy1,3, Emma Picot1, Jess Watts1, Alex Jones1, Helen Wilkinson1, Patrick Schäfer1, Miriam Gifford1, Beatriz Lagunas1.
Abstract
Here, we report an improved and complete genome sequence of Sinorhizobium (Ensifer) meliloti strain WSM1022, a microsymbiont of Medicago species, revealing its tripartite structure. This improved genome sequence was generated combining Illumina and Oxford nanopore sequencing technologies to better understand the symbiotic properties of the bacterium. The 6.75 Mb WSM1022 genome consists of three scaffolds, corresponding to a chromosome (3.70 Mb) and the pSymA (1.38 Mb) and pSymB (1.66 Mb) megaplasmids. The assembly has an average GC content of 62.2% and a mean coverage of 77X. Genome annotation of WSM1022 predicted 6058 protein coding sequences (CDSs), 202 pseudogenes, 9 rRNAs (3 each of 5S, 16S, and 23S), 55 tRNAs, and 4 ncRNAs. We compared the genome of WSM1022 to two other rhizobial strains, closely related Sinorhizobium (Ensifer) meliloti Sm1021 and Sinorhizobium (Ensifer) medicae WSM419. Both WSM1022 and WSM419 species are high-efficiency rhizobial strains when in symbiosis with Medicago truncatula, whereas Sm1021 is ineffective. Our findings report significant genomic differences across the three strains with some similarities between the meliloti strains and some others between the high efficiency strains WSM1022 and WSM419. The addition of this high-quality rhizobial genome sequence in conjunction with comparative analyses will help to unravel the features that make a rhizobial symbiont highly efficient for nitrogen fixation.Entities:
Keywords: Ensifer; Medicago truncatula; Sinorhizobium; Sm1021; WSM1022; WSM419; nitrogen fixation; nodulation; nodule; rhizobial genome; rhizobium
Year: 2021 PMID: 34946030 PMCID: PMC8706082 DOI: 10.3390/microorganisms9122428
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1Sinorhizobium meliloti WSM1022 genome composition. Graphical circular maps of chromosome (A), pA (B) and pB (C), generated using CGView. From outside to centre, rings show CDS (lilac), tRNA (pink) and rRNA (sage green) on the forward and reverse strand; positive and negative GC skew (green and purple, respectively); GC content (black); COG category (colours in key) on forward and reverse strands; Nitrogen fixation genes (orange) and Nod factor biosynthesis genes; (blue); genome position in Mbp.
Comparison of general genome features of the rhizobial strains in this study.
| WSM1022 | Sm1021 | WSM419 | |
|---|---|---|---|
| Species |
|
|
|
| Genome size (bp, total) | 6,751,834 | 6,691,694 | 6,817,576 |
| No. of contigs | 3 | 3 | 4 |
| No. of chromosomes | 1 | 1 | 1 |
| No. of plasmids | 2 | 2 | 3 |
| GC content (%) | 62.22 | 62.17 | 61.15 |
| RefSeq/GenBank assembly accession | GCF_0013315775.1 | GCF_00006965.1 | GCF_000017145.1 |
| Genes (total) | 6328 | 6293 | 6464 |
| CDS (total) | 6260 | 6225 | 6396 |
| CDS (with protein) | 6058 | 5981 | 6068 |
| Genes (RNA) | 68 | 68 | 68 |
| rRNAs (5S, 15S, 23S) | 3, 3, 3 | 3, 3, 3 | 3, 3, 3 |
| tRNAs | 55 | 55 | 55 |
| ncRNAs | 4 | 4 | 4 |
| Pseudo Genes (total, without protein) | 202 | 244 | 328 |
| Pseudo Genes (frameshifted) | 134 of 202 | 173 of 244 | 218 of 328 |
| Pseudo Genes (incomplete) | 97 of 202 | 97 of 244 | 170 of 328 |
| Pseudo Genes (internal stop) | 18 of 202 | 28 of 244 | 41 of 326 |
| Pseudo Genes (multiple problems) | 45 of 202 | 49 of 244 | 94 of 326 |
Figure 2Comparative genomics of Sm1021, WSM1022 and WSM419. (A–C) Circular plots representing sequences of WSM1022 chromosome (A), pA (B) and pB (C). Innermost rings (green) show sequence similarity to corresponding replicon of Sm1021. Middle rings (light blue) show sequence similarity to corresponding replicons of WSM419. Outer rings show locations of IS and prophage sequences.
Figure 3Conserved arrangement of Nod factor biosynthesis genes in plasmids of the three strains. Arrows indicate the genes encoding Nod factor biosynthesis genes present. Numbers in parentheses indicate distances between clusters of genes.
Figure 4Orthovenn results on orthologous protein clusters in WSM1022, Sm1021 and WSM419. (A) Table indicating total number of proteins, number of clusters (highly similar protein sequences) and singletons (cannot be clustered) in each rhizobial species used in this study. (B) Presence of cluster groups in each rhizobial species, (dark green—presence, light green—absence). Cluster count represents the number of protein clusters within each cluster group (darker blue for higher protein cluster counts). Protein counts reflect the number of proteins within each protein cluster, colours refer to the distribution of those protein counts in each species. (C) Venn diagram showing the number of protein clusters identified in each species). Interesting GO Terms that were found enriched in certain cluster groups are also added and affixed to the associated region.
Type III and IV secretion system proteins present in each genome. EffectiveS346 predictions of presence/absence of Type III and Type IV secretion system components in the three genomes. Missing components are indicated by “-“. Chromosome/plasmid locations of the corresponding genes are indicated with background colours for plasmid pA/pSymA/pSMED02 (blue), chromosome (green) and pSMED03 (peach).
| COG ID | COG Symbol | WSM1022 Protein | Sm1021 Protein | WSM419 Protein | |
|---|---|---|---|---|---|
| Type III | COG4669 | EscJ | QKN17985.1 | - | - |
| COG4790 | EscR/YscR | QKN17990.1 | - | - | |
| COG4794 | EscS/YscS | QKN17991.1 | - | - | |
| COG4791 | EscT/YscT | QKN17992.1 | - | - | |
| COG4792 | EscU/YscU | QKN17973.1 | - | - | |
| COG4789 | EscV | QKN17977.1 | - | - | |
| COG1157 | FliI | QKN17988.1 | WP_010968730.1 | WP_011974463.1 | |
| COG1317 | FliH | - | - | - | |
| Type IV | COG3838 | VirB2 | QKN18893.1 | WP_010967695.1 | WP_011971086.1 |
| COG3702 | VirB3 | QKN18440.1 | WP_010967694.1 | WP_011971088.1 | |
| COG3701 | TrbF | - | - | WP_011970261.1 | |
| COG3504 | VirB9 | QKN18446.1 | WP_010967688.1 | WP_011971094.1 | |
| COG3704 | VirB6 | QKN18443.1 | WP_013845459.1 | WP_011971092.1 | |
| COG3736 | VirB8 | QKN18445.1 | WP_010967689.1 | WP_011971093.1 | |
| COG2948 | VirB10 | QKN18447.1 | WP_010967687.1 | WP_011971095.1 | |
| COG3451 | VirB4 | - | - | - | |
| COG0630 | VirB11 | QKN18448.1 | WP_010967686.1 | - | |
| COG3505 | VirD4 | QKN18557.1 | WP_010967483.1 | WP_024325706.1 | |
| COG3157 | Hcp | - | - | - |