| Literature DB >> 29255570 |
Marianna Nagymihály1,2, Bálint M Vásarhelyi3, Quentin Barrière2, Teik-Min Chong4,5, Balázs Bálint3, Péter Bihari3, Kar-Wai Hong4,5, Balázs Horváth3, Jamal Ibijbijen6, Mohammed Amar7, Attila Farkas1, Éva Kondorosi1, Kok-Gan Chan4,5, Véronique Gruber8, Pascal Ratet8, Peter Mergaert2, Attila Kereszt1,3.
Abstract
Strain CCMM B554, also known as FSM-MA, is a soil dwelling and nodule forming, nitrogen-fixing bacterium isolated from the nodules of the legume Medicago arborea L. in the Maamora Forest, Morocco. The strain forms effective nitrogen fixing nodules on species of the Medicago, Melilotus and Trigonella genera and is exceptional because it is a highly effective symbiotic partner of the two most widely used accessions, A17 and R108, of the model legume Medicago truncatula Gaertn. Based on 16S rRNA gene sequence, multilocus sequence and average nucleotide identity analyses, FSM-MA is identified as a new Ensifer meliloti strain. The genome is 6,70 Mbp and is comprised of the chromosome (3,64 Mbp) harboring 3574 predicted genes and two megaplasmids, pSymA (1,42 Mbp) and pSymB (1,64 Mbp) with respectively 1481 and 1595 predicted genes. The average GC content of the genome is 61.93%. The FSM-MA genome structure is highly similar and co-linear to other E. meliloti strains in the chromosome and the pSymB megaplasmid while, in contrast, it shows high variability in the pSymA plasmid. The large number of strain-specific sequences in pSymA as well as strain-specific genes on pSymB involved in the biosynthesis of the lipopolysaccharide and capsular polysaccharide surface polysaccharides may encode novel symbiotic functions explaining the high symbiotic performance of FSM-MA.Entities:
Keywords: Ensifer meliloti; Nitrogen-fixation; Root nodule bacteria; Symbiosis
Year: 2017 PMID: 29255570 PMCID: PMC5729237 DOI: 10.1186/s40793-017-0298-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Colony morphology of E. meliloti strain FSM-MA on solid medium (a,b) at 5× (A) and 50× (b) magnifications as well as scanning electron microscopic images at 4000× (c,e) and 20,000× magnifications (d,f) of free-living cells (c,d) and bacteroids isolated from M. truncatula Jemalong A17 nodules (e,f)
Classification and general features of E. meliloti strain FSM-MA
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain FSM-MA (B554) | |||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Mesophile | NAS | |
| Optimum temperature | 28–37 °C | IDA | |
| pH range | 5.5–9.5 | IDA | |
| Carbon source | Various | TAS [ | |
| GS-6 | Habitat | Soil, root nodule on hosts | IDA |
| MIGS-6.3 | Salinity | Unknown | NAS |
| MIGS-22 | Oxygen requirement | Aerobic | NAS |
| MIGS-15 | Biotic relationship | Free living, Symbiotic | IDA |
| MIGS-14 | Pathogenicity | Non-pathogen | TAS [ |
| Energy source | Chemoorganotroph | NAS | |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | Maamora Forest, Morocco | NAS |
| MIGS-5 | Sample collection | 2004 | NAS |
| MIGS-4.1 | Latitude | Not reported | NAS |
| MIGS-4.2 | Longitude | Not reported | NAS |
| MIGS-4.4 | Altitude | Not reported | NAS |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [46] (http://geneontology.org/page/guide-go-evidence-codes)
Fig. 2Phylogenetic tree showing the relationship of E. meliloti strain FSM-MA. The evolutionary history was inferred by using the Maximum Likelihood method based on the General Time Reversible model [30]. The tree with the highest log likelihood (−2208.71) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. Evolutionary analyses were conducted in MEGA7 [31]
Genome sequencing project information for E. meliloti strain FSM-MA
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina mate-paired library |
| PacBio SMRTbell library | ||
| MIGS-29 | Sequencing platforms | Illumina MiSeq |
| PacBio RS II | ||
| MIGS-31.2 | Fold coverage | 249.2× |
| MIGS-30 | Assemblers | CLC Genomic Worknbench v. 9.5; HGAP v. 3 |
| MIGS-32 | Gene calling methods | Genemark S+, used as part of the NCBI Prokaryotic Genome Annotation Pipeline PGAP |
| Locus Tag | SMB554 | |
| Genbank ID | CP019584-CP019586 | |
| Genbank Date of Release | 2017.07.01 | |
| GOLD ID | Gp0258805 | |
| BIOPROJECT | PRJNA369312 | |
| MIGS-13 | Source Material Identifier | FSM-MA |
| Project relevance | Symbiotic Nitrogen-fixation, agriculture |
Summary of genome: one chromosome and 2 plasmids
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 3.641 | Circular | CP019584 | NZ_CP019584.1 |
| Plasmid 1 | 1.640 | Circular | CP019586 | NZ_CP019586.1 |
| Plasmid 2 | 1.423 | Circular | CP019585 | NZ_CP019585.1 |
Genome statistics for E. meliloti strain FSM-MA
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 6,703,999 | 100.00 |
| chromosome size (bp) | 3,641,423 | 54.32 |
| pSymA size (bp) | 1,422,736 | 21.22 |
| pSymB size (bp) | 1,639,840 | 24.46 |
| DNA coding region (bp) | 5,641,977 | 84.16 |
| DNA G + C content (bp) | 4,152,010 | 61.93 |
| DNA scaffolds | 3 | 100.00 |
| Total genes | 6650 | 100.00 |
| chromosomal genes | 3574 | 53.74 |
| pSymA genes | 1481 | 22.27 |
| pSymB genes | 1595 | 23.98 |
| Protein-coding genes | 6183 | 92.97 |
| RNA genes | 67 | 1.01 |
| Pseudo genes | 400 | 6.01 |
| Genes in internal clusters | 2341 | 35.20 |
| Genes with function prediction | 5032 | 75.67 |
| Genes assigned to COGs | 5801 | 87.23 |
| Genes with Pfam domains | 5167 | 77.70 |
| Genes with signal peptides | 534 | 8.03 |
| Genes with transmembrane helices | 1403 | 21.10 |
| CRISPR repeats | 0 | 0 |
Number of genes of Ensifer meliloti FSM-MA associated with general COG functional categories
| Code | chromosome | pSymA | pSymB | Genome | |||||
|---|---|---|---|---|---|---|---|---|---|
| Value | % age of total (3574) | value | % age of total (1481) | value | % age of total (1595) | value | % age of total (6650) | Description | |
| J | 164 | 4.59 | 7 | 0.47 | 16 | 1.00 | 187 | 2.81 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | 0 | 0.00 | 0 | 0.00 | 0 | 0.00 | RNA processing and modification |
| K | 246 | 6.88 | 132 | 8.91 | 138 | 8.65 | 516 | 7.76 | Transcription |
| L | 140 | 3.92 | 40 | 2.70 | 26 | 1.63 | 206 | 3.10 | Replication, recombination and repair |
| B | 1 | 0.03 | 0 | 0.00 | 0 | 0.00 | 1 | 0.02 | Chromatin structure and dynamics |
| D | 30 | 0.84 | 5 | 0.34 | 10 | 0.63 | 45 | 0.68 | Cell cycle control, cell division, chromosome partitioning |
| V | 34 | 0.95 | 11 | 0.74 | 20 | 1.25 | 65 | 0.98 | Defense mechanisms |
| T | 135 | 3.78 | 77 | 5.20 | 71 | 4.45 | 283 | 4.26 | Signal transduction mechanisms |
| M | 148 | 4.14 | 32 | 2.16 | 104 | 6.52 | 284 | 4.27 | Cell wall/membrane/envelope biogenesis |
| N | 55 | 1.54 | 12 | 0.81 | 6 | 0.38 | 73 | 1.10 | Cell motility |
| U | 70 | 1.96 | 33 | 2.23 | 3 | 0.19 | 106 | 1.59 | Intracellular trafficking, secretion, and vesicular transport |
| O | 127 | 3.55 | 31 | 2.09 | 22 | 1.38 | 180 | 2.71 | Posttranslational modification, protein turnover, chaperones |
| C | 177 | 4.95 | 121 | 8.17 | 75 | 4.70 | 373 | 5.61 | Energy production and conversion |
| G | 236 | 6.60 | 94 | 6.35 | 245 | 15.36 | 575 | 8.65 | Carbohydrate transport and metabolism |
| E | 353 | 9.88 | 137 | 9.25 | 139 | 8.71 | 629 | 9.46 | Amino acid transport and metabolism |
| F | 82 | 2.29 | 7 | 0.47 | 21 | 1.32 | 110 | 1.65 | Nucleotide transport and metabolism |
| H | 133 | 3.72 | 31 | 2.09 | 35 | 2.19 | 199 | 2.99 | Coenzyme transport and metabolism |
| I | 117 | 3.27 | 37 | 2.50 | 53 | 3.32 | 207 | 3.11 | Lipid transport and metabolism |
| P | 140 | 3.92 | 81 | 5.47 | 78 | 4.89 | 299 | 4.50 | Inorganic ion transport and metabolism |
| Q | 76 | 2.13 | 35 | 2.36 | 42 | 2.63 | 153 | 2.30 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 399 | 11.16 | 172 | 11.61 | 169 | 10.60 | 740 | 11.13 | General function prediction only |
| S | 361 | 10.10 | 87 | 5.87 | 121 | 7.59 | 569 | 8.56 | Function unknown |
| W | 1 | 0.03 | 0 | 0.00 | 0 | 0.00 | 1 | 0.02 | Extracellular structures |
| – | 349 | 9.76 | 299 | 20.19 | 201 | 12.60 | 849 | 12.77 | Not in COGs |