Literature DB >> 28975015

High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain.

Wan Adnawani Meor Osman1, Peter van Berkum2, Milagros León-Barrios3, Encarna Velázquez4, Patrick Elia2, Rui Tian1, Julie Ardley1, Margaret Gollagher5, Rekha Seshadri6, T B K Reddy6, Natalia Ivanova6, Tanja Woyke6, Amrita Pati7, Victor Markowitz7, Mohamed N Baeshen8, Naseebh Nabeeh Baeshen8, Nikos Kyrpides6, Wayne Reeve1.   

Abstract

10.1601/nm.1335 Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here the features of 10.1601/nm.1335 Mlalz-1 are described, together with high-quality permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T, 10.1601/nm.1334 A 321T and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T, based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as 10.1601/nm.1335. Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata-nodulating 10.1601/nm.1328 strains, but ≤93% with nodC of 10.1601/nm.1328 strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced 10.1601/nm.1335 strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In 10.1601/nm.1334 strain 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of 10.1601/nm.1334 strains, which suggests genetic recombination between strain Mlalz-1 and 10.1601/nm.1334 and the horizontal gene transfer of lpiA-acvB.

Entities:  

Keywords:  Ensifer; Geba-Rnb; Medicago; Root-nodule bacteria; lpiA-acvB operon

Year:  2017        PMID: 28975015      PMCID: PMC5613336          DOI: 10.1186/s40793-017-0270-2

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Symbiotic nitrogen fixation by pasture legumes and their associated root nodule bacteria provides a critical contribution to sustainable animal and plant production, and the maintenance of soil fertility in agricultural systems [1-3]. As such, it is of direct relevance to maintaining environmentally sustainable high agricultural yields, which significantly contributes to the Sustainable Development Goals adopted in September 2015 as part of the UN’s development agenda ‘Transforming our world: the 2030 Agenda for Sustainable Development’ [4]. Medics ( spp.) are some of the most important and extensively grown pasture legumes and their specific symbiosis with strains of rhizobia belonging to either 10.1601/nm.1328 (synonym 10.1601/nm.1339) meliloti or the closely related species 10.1601/nm.1334 [5, 6] has been the subject of extensive research efforts [7]. (L.) Miller (cut leaf medic), an annual native of southern and eastern Mediterranean and Saharo-Sindian countries, is of importance because of its ability to grow in comparatively arid habitats and marginal cropping areas [8-11]. It is highly specific in its rhizobial requirements, forming a symbiosis only with a restricted subset of 10.1601/nm.1335 and not with strains that nodulate L. (alfalfa) or Gaertn. [12, 13]. This symbiotic specificity has been linked to the rhizobial nod genes, in particular a specific nodC allele [14]. For example, van Berkum and colleagues found that most rhizobial strains isolated from Tunisian M. truncatula and M. laciniata shared chromosomal identity, but differed in their nodC alleles [15]. Based on these and other differing symbiotic traits, Villegas et al. [13] proposed two biovars within 10.1601/nm.1335: bv. medicaginis for 10.1601/nm.1328 strains that are symbiotically efficient on M. laciniata and bv. meliloti for the classical 10.1601/nm.1335 group that efficiently nodulates M. sativa. However, in subsequent studies the diversity observed within bv. medicaginis strains indicate that this group is certainly heterogeneous [16]. M. laciniata is native to the Canary Islands and is present on all of the islands of this archipelago, growing in environments that range from arid to subhumid. 10.1601/nm.1335 strain Mlalz-1 was isolated from a N2-fixing nodule of M. laciniata grown in alkaline soil (pH 9.0) collected in Guatiza, in the arid Northeast of Lanzarote Island, in 2007. This strain was one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 GEBA-RNB project proposal [17, 18]. Here an analysis of the complete genome sequence of 10.1601/nm.1335 Mlalz-1 is provided.

Organism information

Classification and features

10.1601/nm.1335 Mlalz-1 is a motile, non-sporulating, non-encapsulated, Gram-negative strain in the class 10.1601/nm.809. The rod shaped form has dimensions of approximately 0.5 μm in width and 1.0–2.0 μm in length (Fig. 1 Left and Center). It is fast growing, forming colonies after 3–5 days when grown on ½LA, TY, or a modified yeast-mannitol agar [19] at 28 °C. Colonies on ½LA are opaque, slightly domed and moderately mucoid with smooth margins (Fig. 1 Right). Minimum Information about the Genome Sequence (MIGS) for strain Mlalz-1 is provided in Table 1 and Additional file 1: Table S1.
Fig. 1

Images of 10.1601/nm.1335 Mlalz-1 using scanning (Left (a)) and transmission (Center (b)) electron microscopy as well as light microscopy to visualize colony morphology on solid media (Right (c))

Table 1

Classification and general features of 10.1601/nm.1335 Mlalz-1 in accordance with the MIGS recommendations [65] published by the Genomic Standards Consortium [66]

MIGS IDPropertyTermEvidence codea
Current classificationDomain BacteriaTAS [67]
Phylum 10.1601/nm.808 TAS [68]
Class 10.1601/nm.809 TAS [69, 70]
Order 10.1601/nm.1277 TAS [70, 71]
Family 10.1601/nm.1278 TAS [72, 73]
Genus 10.1601/nm.1328 TAS [74]
Species 10.1601/nm.1335 [21]
Strain: Mlalz-1 (= 10.1601/strainfinder?urlappend=%3Fid%3DUSDA+1984)IDA
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature range10–40 °CIDA
Optimum temperature25–30 °CIDA
pH range; Optimum5–9.5; 6.5–8IDA
Carbon sourceVariedIDA
MIGS-6HabitatSoil; root nodule on host Medicago laciniata (L.) MillerIDA
MIGS-6.3SalinityTolerates 0 to 1% (w/v) % NaClTAS
MIGS-22Oxygen requirementAerobicIDA
MIGS-15Biotic relationshipFree living, symbioticIDA
MIGS-14PathogenicityBiosafety level 1TAS [75]
MIGS-4Geographic locationGuatiza, Lanzarote, Canary Islands, SpainIDA
MIGS-5Sample collection date2007IDA
MIGS-4.1Latitude29.074324IDA
MIGS-4.2Longitude−13.479696IDA
MIGS-4.3Depth5–10 cmIDA
MIGS-4.4Altitude102 mIDA

aEvidence codes – IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [76, 77]

Images of 10.1601/nm.1335 Mlalz-1 using scanning (Left (a)) and transmission (Center (b)) electron microscopy as well as light microscopy to visualize colony morphology on solid media (Right (c)) Classification and general features of 10.1601/nm.1335 Mlalz-1 in accordance with the MIGS recommendations [65] published by the Genomic Standards Consortium [66] aEvidence codes – IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [76, 77]

Symbiotaxonomy

M. laciniata is a highly specific host and its microsymbionts also appear to be highly specific since studies of isolates have shown that M. laciniata strains fail to nodulate a range of species [5, 12]. Bailly et al. [20] reported that isolates of M. laciniata nodulated and fixed nitrogen with M. truncatula, but also provided evidence that these were the progeny of horizontal transfer of the nodulation genes. Strain Mlalz-1 nodulates and is effective for nitrogen fixation with M. laciniata. We report here that strain Mlalz-1 is unable to nodulate L., the definitive host for 10.1601/nm.1334 strains [6].

Extended feature descriptions

Previous studies using multilocus sequence typing showed that M. laciniata rhizobia did not form a distinct chromosomal group [15]. Phylogenetic analysis of strain Mlalz-1 was performed by aligning the 16S rRNA sequence (1389 bp from scaffold 84.85) to the 16S rRNA gene sequences of 10.1601/nm.1328 type strains (Fig. 2). Based on four variable sites within this 16S rRNA gene sequence alignment, strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T (= 10.1601/strainfinder?urlappend=%3Fid%3DLMG+6133 T) [21], 10.1601/nm.1334 A 321T (= 10.1601/strainfinder?urlappend=%3Fid%3DLMG+19920 T) [6] and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T [22]. The available IMG 16S rRNA sequence of strain Mlalz-1 gave alignment identities of 100% to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T, 99.7% to 10.1601/nm.1334 A 321T and 99.5% to 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T. In contrast, 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T and 10.1601/nm.1337 10.1601/strainfinder?urlappend=%3Fid%3DLMG+7834 T [23] were only 97.3% similar.
Fig. 2

Comparison of the 16S rRNA gene sequences of 10.1601/nm.1335 Mlalz-1 (shown in bold) and other 10.1601/nm.1328 spp. type strains, based on aligned 16S rRNA gene sequences of 1389 bp. Of the 1389 sites, 1279 were constant and 48 were informative. There were eight gaps overall when 10.1601/nm.1418 bv biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 was included in the analysis. Phylogenetic analysis was done using MEGA, version 6.0 [61] after manually assembling the alignment by using GeneDoc version 2.6.001 [62]. 10.1601/nm.1418 bv biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 was used as an outgroup and the tree was assembled using the UPGMA algorithm based on the number of nucleotide differences. This approach was used since the potential for genetic recombination among the different 16S rRNA genes as reported by van Berkum [63] cannot be ignored. Bootstrap analysis [64] with 2000 permutations of the data set was done to assess support for the branch points. Strains with a genome sequencing project registered in GOLD [24] are 10.1601/nm.1329 Casida AT, 10.1601/nm.1418 bv. biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 and Mlalz-1 and the GOLD ID is provided in place of the GenBank accession number

Comparison of the 16S rRNA gene sequences of 10.1601/nm.1335 Mlalz-1 (shown in bold) and other 10.1601/nm.1328 spp. type strains, based on aligned 16S rRNA gene sequences of 1389 bp. Of the 1389 sites, 1279 were constant and 48 were informative. There were eight gaps overall when 10.1601/nm.1418 bv biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 was included in the analysis. Phylogenetic analysis was done using MEGA, version 6.0 [61] after manually assembling the alignment by using GeneDoc version 2.6.001 [62]. 10.1601/nm.1418 bv biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 was used as an outgroup and the tree was assembled using the UPGMA algorithm based on the number of nucleotide differences. This approach was used since the potential for genetic recombination among the different 16S rRNA genes as reported by van Berkum [63] cannot be ignored. Bootstrap analysis [64] with 2000 permutations of the data set was done to assess support for the branch points. Strains with a genome sequencing project registered in GOLD [24] are 10.1601/nm.1329 Casida AT, 10.1601/nm.1418 bv. biserrulae 10.1601/strainfinder?urlappend=%3Fid%3DWSM+1271 and Mlalz-1 and the GOLD ID is provided in place of the GenBank accession number

Genome sequencing information

Genome project history

10.1601/nm.1335 Mlalz-1 was selected for sequencing at the U.S. Department of Energy funded Joint Genome Institute as part of the GEBA-RNB project [17, 18]. The root nodule bacteria in this project were selected based on environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance. In particular, strain Mlalz-1 was chosen since it has strict host specificity for M. laciniata, which is suited for cultivation in arid environments [11]. The 10.1601/nm.1335 Mlalz-1 genome project is deposited in the Genomes Online Database [24] and a high-quality permanent draft genome sequence (IMG Genome ID 2513237143) is deposited in IMG [25]. Sequencing, finishing and annotation were performed by the JGI. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for 10.1601/nm.1335 Mlalz-1

MIGS IDPropertyTerm
MIGS-31Finishing qualityHigh-quality permanent draft
MIGS-28Libraries usedIllumina Standard PE
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage748×
MIGS-30AssemblersVelvet version 1.1.04; ALLPATHS v. r39750
MIGS-32Gene calling methodsProdigal 1.4
Locus TagA3CA [78]
GenBank ID ATZD00000000
Genbank Date of ReleaseJanuary 30 2012
GOLD IDGp0010229 [79]
BIOPROJECT165,343
MIGS-13Source Material IdentifierMlalz-1 (=10.1601/strainfinder?urlappend=%3Fid%3DUSDA+1984)
Project relevanceSymbiotic N2 fixation, agriculture
Genome sequencing project information for 10.1601/nm.1335 Mlalz-1

Growth conditions and genomic DNA preparation

10.1601/nm.1335 Mlalz-1 (= 10.1601/strainfinder?urlappend=%3Fid%3DUSDA+1984) was cultured on MAG solid media [26] for three days at 28 °C to obtain well grown, well separated colonies, then a single colony was selected from the plate and inoculated into 5 ml MAG broth media. The culture was grown for 48 h on a gyratory shaker (200 rpm) at 28 °C. Subsequently 1 ml was used to inoculate 50 ml of MAG and the cells were incubated on a gyratory shaker (200 rpm) at 28 °C until an OD600nm of 0.6 was reached. DNA was isolated from 50 ml of cells by Peter van Berkum according to the method described by van Berkum [26]. The final concentration of the DNA was set to 0.5 mg ml−1.

Genome sequencing and assembly

The draft genome of 10.1601/nm.1335 Mlalz-1 was generated at the DOE Joint genome Institute (JGI) using Illumina technology [27]. An Illumina standard PE library was constructed and sequenced using the Illumina HiSeq 2000 platform that generated 35,720,836 reads totalling 4983 Mbp. All general aspects of library construction and sequencing were done at the JGI and details can be found on the JGI website [28]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artefacts (Mingkun L, Copeland A, Han J; unpublished). The following steps for assembly were: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [29]; (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim (version 0.3.0) [30]; (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r39750) [31]. Parameters for the assembly steps were 1) Velvet: --v --s 51 --e 71 --i 2 --t 1 --f “-shortPaired -fastq $FASTQ” --o “-ins_length 250 -min_contig_lgth 500” for Velvet and 2) wgsim: -e 0–1 76–2 76 -r 0 -R 0 -X 0. The final draft assembly contained 100 contigs in 99 scaffolds. The total size of the genome is 6.7 Mbp and the final assembly is based on 4983 Mbp of Illumina data, which provides an average of 748× coverage of the genome.

Genome annotation

Genes were identified using Prodigal [32], as part of the DOE-JGI genome annotation pipeline [33, 34]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [35] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [36]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [37]. Additional gene prediction analysis and manual functional annotation was done within the Integrated Microbial Genomes-Expert Review platform [38] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 6,664,116 bp with 62.16% GC content (Table 3) and comprised of 99 scaffolds. From a total of 6388 genes, 6314 were protein encoding and 74 RNA only encoding genes. Most genes (79.52%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome statistics for 10.1601/nm.1335 Mlalz-1

AttributeValue% of Total
Genome size (bp)6,664,116100.00
DNA coding (bp)5,754,33286.35
DNA G + C (bp)4,142,40762.16
DNA scaffolds99100.00
Total genes6388100.00
Protein-coding genes631498.84
RNA genes741.16
Pseudo genes00.00
Genes in internal clusters105416.50
Genes with function prediction508079.52
Genes assigned to COGs465972.93
Genes with Pfam domains531783.23
Genes with signal peptides5558.69
Genes with transmembrane helices144022.54
CRISPR repeats00.00
Table 4

Number of genes of 10.1601/nm.1335 Mlalz-1 associated with the general COG functional categories

CodeValue%ageDescription
J2174.09Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K4668.77Transcription
L1222.3Replication, recombination and repair
B10.02Chromatin structure and dynamics
D390.73Cell cycle control, cell division, chromosome partitioning
Y00.00Nuclear structure
V1172.20Defense mechanisms
T2164.07Signal transduction mechanisms
M3015.67Cell wall/membrane/envelope biogenesis
N721.36Cell motility
Z00.00Cytoskeleton
W330.62Extracellular structures
U741.39Intracellular trafficking, secretion, and vesicular transport
O2063.88Posttranslational modification, protein turnover, chaperones
C3586.74Energy production and conversion
G55510.45Carbohydrate transport and metabolism
E58410.99Amino acid transport and metabolism
F1162.18Nucleotide transport and metabolism
H2424.56Coenzyme transport and metabolism
I2204.14Lipid transport and metabolism
P2795.25Inorganic ion transport and metabolism
Q1592.99Secondary metabolite biosynthesis, transport and catabolism
R55110.37General function prediction only
S3486.55Function unknown
X360.68Mobilome: prophages, transposons
172927.07Not in COGS
Genome statistics for 10.1601/nm.1335 Mlalz-1 Number of genes of 10.1601/nm.1335 Mlalz-1 associated with the general COG functional categories

Insights from the genome sequence

10.1601/nm.1335 Mlalz-1 is one of seven strains of 10.1601/nm.1335 that have been sequenced from the GEBA-RNB genome sequencing projects [17]. On the basis of 16S rRNA sequence identity, strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T (= 10.1601/strainfinder?urlappend=%3Fid%3DLMG+6133 T), 10.1601/nm.1334 A 321T (= 10.1601/strainfinder?urlappend=%3Fid%3DLMG+19920 T) and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T. As the genomes of these type strains have not been sequenced or are not publically available, gANI values [39] had to be compared with other fully sequenced 10.1601/nm.1328 strains (Table 5). 10.1601/nm.1335 Mlalz-1 currently forms a gANI clique with other 10.1601/nm.1335 strains (gANI values ≥98.14%), compared with gANI values of ≤87.9% with the finished genomes of other 10.1601/nm.1328 strains. This supports the classification of strain Mlalz-1 as an 10.1601/nm.1335 strain, in accordance with the defined species affiliation cut-off value of 96.5% gANI [39]. The total genome size of strain Mlalz-1 is 6.6 Mbp, which falls within the expected size range of 6.6–8.9 Mbp for 10.1601/nm.1335. The genome architecture of 10.1601/nm.1335 consists of a chromosome and the two symbiotic megaplasmids pSymA and pSymB [20]. Replication of a plasmid is initiated by the replication protein encoded by repC, which is present as a single copy on 10.1601/nm.1335 pSymA and pSymB. The 10.1601/nm.1335 Mlalz-1 genome carried 2 repC loci (A3CADRAFT_00120 and A3CADRAFT_01676) with highest encoded protein identity to RepC proteins of 10.1601/nm.1335 strains. Mlalz-1 A3CADRAFT_00120 RepC1 had highest identity (98.10%) to the RepC1 protein encoded by SMb20044 on pSymB of 10.1601/nm.1335 1021. 10.1601/nm.1335 Mlalz-1 A3CADRAFT_01676 RepC2 had highest identity (99.00%) to the RepC2 protein encoded by SMa2391 on pSymA of 10.1601/nm.1335 1021. This indicated the presence of two megaplasmids in strain Mlalz-1, and revealed that strain Mlalz-1 has a similar genome architecture to that of 10.1601/nm.1335 1021.
Table 5

Pairwise gANI comparisons of selected finished genomes of sequenced 10.1601/nm.1328 strains

StrainGold ID: GpCasida AUSDA 257WSM 4191021AK83BL225CGR4Mlalz-1Rm41SM11
E.adhaerens Casida A009482410080.579.0680.1280.1180.0680.0180.0880.0380.06
10.1601/nm.1331 USDA 257000516980.510081.8983.2683.2483.2583.2083.1483.3383.22
10.1601/nm.1334 WSM419000011779.0681.9310088.1888.1388.2688.2487.9088.1488.26
10.1601/nm.1335 1021000072680.1283.2688.1910099.3699.6299.41 98.80 99.2499.43
10.1601/nm.1335 AK83000669580.0883.2588.1699.3610099.3399.14 98.60 99.3899.33
10.1601/nm.1335 BL225C000656080.0683.2588.2899.6299.3310099.44 98.81 99.2699.39
10.1601/nm.1335 GR4002050180.0183.2388.2699.4199.1499.43100 98.81 99.0599.25
10.1601/nm.1335 Mlalz-1 0010229 80.1183.1587.91 98.80 98.59 99.81 98.81 100 98.59 98.66
10.1601/nm.1335 Rm41002585380.0583.3688.1199.2699.3999.2599.06 98.59 10099.33
10.1601/nm.1335 SM11000601880.0583.2388.2999.4599.3399.3999.26 98.67 99.32100

For 10.1601/nm.1335 Mlalz-1, gANI values above the microbial species delineation cutoff value of 96.5% [39] are in bold font

Pairwise gANI comparisons of selected finished genomes of sequenced 10.1601/nm.1328 strains For 10.1601/nm.1335 Mlalz-1, gANI values above the microbial species delineation cutoff value of 96.5% [39] are in bold font

Extended insights

All 29 10.1601/nm.1335 strains within the gANI clique share a core set of 4948 orthologous genes, using cut off values of 1e-5 and 30% minimum protein identity. 10.1601/nm.1335 Mlalz-1 contains 176 unique genes, 96 (54.5%) of which encode hypothetical proteins. The unique genes include those encoding the components of a T2SS, located on scaffold A3CADRAFT_scaffold_5.6 (Fig. 3a), as well as genes that encode a DNA methyltransferase and a NitT/TauT family transport system. These T2SS components form part of a unique COG profile generated for Mlaz-1 (Table 6). The T2SS secretion system is used to translocate a wide range of proteins from the periplasm across the outer membrane [40]. Although T2SS genes are not found in other 10.1601/nm.1335 strains or in the 10.1601/nm.1331 strains GR64 and 10.1601/strainfinder?urlappend=%3Fid%3DUSDA+257, they are present in the genomes of the 10.1601/nm.1331 strains HH103 and 10.1601/strainfinder?urlappend=%3Fid%3DNGR+234, in a similar gene arrangement to that observed in 10.1601/nm.1335 Mlalz-1 [41, 42] (Fig. 3b). Generally, the T2SS gene cluster is comprised of 12–15 genes, and strain Mlalz-1 contains the 12 required genes gspDOGLMCKEFHIJ necessary for a functional T2SS, but lacks the gspS gene found only in certain genera [43] (Fig. 3c).
Fig. 3

a Map of 10.1601/nm.1335 Mlalz-1: A3CADRAFT_scaffold_ 5.6. b Genetic organization of the T2SS clusters in 10.1601/nm.1335 Mlalz-1 (Emel Mlalz-1), 10.1601/nm.1331 HH103 (Efre HH103) and 10.1601/nm.1331 10.1601/strainfinder?urlappend=%3Fid%3DNGR+234 (Efre 10.1601/strainfinder?urlappend=%3Fid%3DNGR+234). J, gspJ; I, gspI; H, gspH; F, gspF; E, gspE; K, gspK; C, gspC; M, gspM; L, gspL; G, gspG; O, gspO; D, gspD. c Schematics of the T2SS of Gram-negative bacteria [43]. The secretin, GspD (A3CADRAFT_01569); the polytopic protein, GspF (A3CADRAFT_01561); the cytoplasmic ATPase, GspE (A3CADRAFT_01562); the major pseudopilin component, GspG (A3CADRAFT_01567); the minor pseudopilins, GspH (A3CADRAFT_01560), GspI (A3CADRAFT_01559), GspJ (A3CADRAFT_01558) and GspK (A3CADRAFT_01563); the bitopic proteins, GspL (A3CADRAFT_01566), GspC (A3CADRAFT_01564) and GspM (A3CADRAFT_01565); the peptidase, GspO (A3CADRAFT_01568)(GspS is absent from Mlalz-1); OM, outer membrane; PG, peptidoglycan; IM, inner membrane

Table 6

List of the unique COGs in 10.1601/nm.1335 Mlalz-1

COGNameLocus TagGene symbolProtein function
0393Uncharacterized conserved protein YbjQ, 10.1601/strainfinder?urlappend=%3Fid%3DUPF+0145 familyA3CADRAFT_01446Unknown
4970Tfp pilus assembly protein FimTA3CADRAFT_01560 gspH T2SS
1459Type II secretory pathway, component PulFA3CADRAFT_01561 gspF T2SS
2804Type II secretory pathway ATPase GspE/PulE or T4P pilus assembly pathway ATPase PilBA3CADRAFT_01562 gspE T2SS
3156Type II secretory pathway, component PulKA3CADRAFT_01563 gspK T2SS
3166Tfp pilus assembly protein PilNA3CADRAFT_01566 gspL T2SS
2165Type II secretory pathway, pseudopilin PulGA3CADRAFT_01567 gspG T2SS
1450Type II secretory pathway component GspD/PulD (secretin)A3CADRAFT_01569 gspD T2SS
2189Adenine specific DNA methylase ModA3CADRAFT_02454 yhdJ DNA methyltransferase
4705Uncharacterized membrane-anchored proteinA3CADRAFT_05679Membrane protein
4089Uncharacterized membrane proteinA3CADRAFT_05685Membrane protein
2021Homoserine acetyltransferaseA3CADRAFT_06155Homoserine acetyltransferase
a Map of 10.1601/nm.1335 Mlalz-1: A3CADRAFT_scaffold_ 5.6. b Genetic organization of the T2SS clusters in 10.1601/nm.1335 Mlalz-1 (Emel Mlalz-1), 10.1601/nm.1331 HH103 (Efre HH103) and 10.1601/nm.1331 10.1601/strainfinder?urlappend=%3Fid%3DNGR+234 (Efre 10.1601/strainfinder?urlappend=%3Fid%3DNGR+234). J, gspJ; I, gspI; H, gspH; F, gspF; E, gspE; K, gspK; C, gspC; M, gspM; L, gspL; G, gspG; O, gspO; D, gspD. c Schematics of the T2SS of Gram-negative bacteria [43]. The secretin, GspD (A3CADRAFT_01569); the polytopic protein, GspF (A3CADRAFT_01561); the cytoplasmic ATPase, GspE (A3CADRAFT_01562); the major pseudopilin component, GspG (A3CADRAFT_01567); the minor pseudopilins, GspH (A3CADRAFT_01560), GspI (A3CADRAFT_01559), GspJ (A3CADRAFT_01558) and GspK (A3CADRAFT_01563); the bitopic proteins, GspL (A3CADRAFT_01566), GspC (A3CADRAFT_01564) and GspM (A3CADRAFT_01565); the peptidase, GspO (A3CADRAFT_01568)(GspS is absent from Mlalz-1); OM, outer membrane; PG, peptidoglycan; IM, inner membrane List of the unique COGs in 10.1601/nm.1335 Mlalz-1 In common with some other 10.1601/nm.1335 strains, strain Mlalz-1 contains several genes encoding phage components. The PHASTER algorithm [44] was used to identify two resident prophages, present on scaffold A3CADRAFT_scaffold_4.5: one that was incomplete (Prophage Region 1) and one that was intact (Prophage Region 2) (Fig. 4). The proteins encoded by Prophage Region 1 (11.4 kb) and Prophage Region 2 (55 kb) were most closely related to the phage proteins of PHAGE_Mycoba_Catalina_NC031238 and PHAGE_Sinorh_phiLM21_ NC_029046, respectively.
Fig. 4

Resident prophages present in 10.1601/nm.1335 Mlalz-1 A3CADRAFT_scaffold_4.5, imaged using PHASTER [44]. Reference locus tag for Prophage Region 1 is A3CADRAFT_01439 (phage capsid family); reference locus tag for Prophage Region 2 is A3CADRAFT_01472 (prophage tail length tape measure protein). Prophage maps not drawn to scale. Attachment site (att), coat protein (coa), fiber protein (fib), integrase (int), phage-like protein (plp), tail shaft protein (tsp), and terminase (ter). All other genes encode hypothetical proteins

Resident prophages present in 10.1601/nm.1335 Mlalz-1 A3CADRAFT_scaffold_4.5, imaged using PHASTER [44]. Reference locus tag for Prophage Region 1 is A3CADRAFT_01439 (phage capsid family); reference locus tag for Prophage Region 2 is A3CADRAFT_01472 (prophage tail length tape measure protein). Prophage maps not drawn to scale. Attachment site (att), coat protein (coa), fiber protein (fib), integrase (int), phage-like protein (plp), tail shaft protein (tsp), and terminase (ter). All other genes encode hypothetical proteins The Mlaz-1 genome also contains acid-tolerance or acid-responsive genes that are orthologous to the genes identified in the comparatively acid tolerant strain 10.1601/nm.1334 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419. Acid-tolerance or acid-responsive genes identified in Mlaz-1 include actA (lnt), actP, actR, actS, phrR, exoR, exoH, lpiA, acvB, degP1, mdh3, fbaB, groS, kdpB, kdpC, fixN2 and fixO2 [45-52] (Additional file 2: Table S2). It is notable that strain Mlalz-1 is unique among the sequenced 10.1601/nm.1328 strains since it contains two versions of the highly acid-induced lpiA-acvB operon. One operon (A3CADRAFT_01189-A3CADRAFT_01190) is found on scaffold A3CADRAFT_scaffold_3.4, in a gene region that is conserved in other 10.1601/nm.1335 (sequence similarity >98%) and is located on the chromosome of the fully sequenced 10.1601/nm.1335 1021. The second version of the lpiA-acvB operon (A3CADRAFT_05694-A3CADRAFT_05695) is located on A3CADRAFT_scaffold_47.48, in a gene region that is conserved in 10.1601/nm.1334 genomes (sequence similarity >96%) and is located on the pSMED02 symbiotic plasmid of the fully sequenced 10.1601/nm.1334 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419. The regulatory gene fsrR, required for the acid activated expression of lpiA in 10.1601/nm.1334 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419 [53], is located upstream of A3CADRAFT_05694 in strain Mlalz-1. This regulatory gene is absent from the A3CADRAFT_01190 gene region, and from the lpiA-acvB gene regions of all other 10.1601/nm.1335 sequenced genomes. These findings suggest that 10.1601/nm.1335 Mlalz-1 acquired the plasmid-borne lpiA-acvB operon and associated fsrR regulatory gene by lateral transfer from an 10.1601/nm.1334 strain. Essential symbiotic (nod, nif and fix) genes identified in the 10.1601/nm.1335 Mlalz-1 genome (Additional file 2: Table S3 and S4) are located in several clusters on the following scaffolds: A3CADRAFT_scaffold_54.55 (Fig. 5a), A3CADRAFT_scaffold_61.62 (Fig. 5b), A3CADRAFT_scaffold_63.64 (Fig. 5c), A3CADRAFT_scaffold_71.72 (Fig. 5d) and A3CADRAFT_scaffold_74.75 (Fig. 5e). Nodulation of M. laciniata has been shown to require a specific nodC allele [14]. The nodC gene of strain Mlalz-1 has highest sequence identity (≥ 98%) with nodC of other M. laciniata-nodulating 10.1601/nm.1328 strains in the NCBI database, whereas there is a lower sequence identity (≤ 93%) with nodC of 10.1601/nm.1328 strains that nodulate other species. Nodulation of hosts requires Nod factors that are sulfated at the reducing terminus and acylated at the non-reducing terminus, with a polyunsaturated fatty acyl tail [54, 55]. The NodH sulfotransferase, together with the NodP and NodQ sulfate-activating complex, are required for Nod factor sulfation [56, 57]. Activity of NodL results in O-acetylation of the Nod factor [58], while NodE and NodF produce the specific polyunsaturated fatty acyl tail [55, 59]. Strain Mlalz-1 would appear to be typical of 10.1601/nm.1328 strains that nodulate species since the nodEF, nodL and nodHPQ genes that are required for these specific decorations of the Nod factor are present in the genome. 10.1601/nm.1335 Mlalz-1 also possesses the three nodD genes that mediate host-specific activation of nodABC in the symbiotic interactions of 10.1601/nm.1335 with [60].
Fig. 5

Graphical map of the scaffolds; a A3CADRAFT_scaffold_54.55, b A3CADRAFT_scaffold_61.62, (c) A3CADRAFT_scaffold_63.64, d A3CADRAFT_scaffold_71.72 and e A3CADRAFT_scaffold_74.75 of 10.1601/nm.1335 Mlalz-1 showing the location of common nodulation (nod) and fixation (nif and fix) genes within the symbiotic regions of this strain. From bottom to the top of the scaffold map: Genes on reverse strand (color by COG categories as denoted by the IMG platform), genes on forward strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew

Graphical map of the scaffolds; a A3CADRAFT_scaffold_54.55, b A3CADRAFT_scaffold_61.62, (c) A3CADRAFT_scaffold_63.64, d A3CADRAFT_scaffold_71.72 and e A3CADRAFT_scaffold_74.75 of 10.1601/nm.1335 Mlalz-1 showing the location of common nodulation (nod) and fixation (nif and fix) genes within the symbiotic regions of this strain. From bottom to the top of the scaffold map: Genes on reverse strand (color by COG categories as denoted by the IMG platform), genes on forward strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew

Conclusions

10.1601/nm.1335 Mlalz-1 is a rhizobial strain that is able to nodulate and fix nitrogen with the highly specific host M. laciniata. Although the 16S rRNA gene sequence divergence was insufficient to differentiate strain Mlalz-1 from 10.1601/nm.1335 , 10.1601/nm.1334 or 10.1601/nm.17831 , a gANI value of 98.8% with the genome of 10.1601/nm.1335 1021, compared with 87.9% with the genome of 10.1601/nm.1334 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419 identifies strain Mlalz-1 as 10.1601/nm.1335. Nodulation of M. laciniata has been shown to be dependent on the presence of a specific nodC allele, which also is present in the genome of 10.1601/nm.1335 Mlalz-1, based on a 98% sequence identity with the nodC of other M. laciniata-nodulating 10.1601/nm.1328 strains [14]. However, strain Mlalz-1 is unique among sequenced 10.1601/nm.1335 strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. The second copy of the 10.1601/nm.1335 Mlalz-1 lpiA-acvB operon has highest sequence identity (>96%) with that of sequenced 10.1601/nm.1334 strains, which infers horizontal gene transfer of this region from 10.1601/nm.1334 . Associated MIGS record for Ensifer meliloti Mlalz-1. (DOCX 52 kb) Table S2. Acid responsive gene orthologs present in Ensifer strains. Table S3. The nodulation genes of Ensifer meliloti Mlalz-1. Table S4. The nitrogen fixation genes of Ensifer meliloti Mlalz-1. (DOCX 65 kb)
  53 in total

1.  Regulation of exopolysaccharide production in Rhizobium leguminosarum biovar viciae WSM710 involves exoR.

Authors:  Wayne G Reeve; Michael J Dilworth; Ravi P Tiwari; Andrew R Glenn
Journal:  Microbiology (Reading)       Date:  1997-06       Impact factor: 2.777

2.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

3.  An essential role for actA in acid tolerance of Rhizobium meliloti.

Authors:  Ravi P Tiwari; Wayne G Reeve; Michael J Dilworthan; Andrew R Glenn
Journal:  Microbiology (Reading)       Date:  1996-03       Impact factor: 2.777

4.  In vitro sulfotransferase activity of Rhizobium meliloti NodH protein: lipochitooligosaccharide nodulation signals are sulfated after synthesis of the core structure.

Authors:  M Schultze; C Staehelin; H Röhrig; M John; J Schmidt; E Kondorosi; J Schell; A Kondorosi
Journal:  Proc Natl Acad Sci U S A       Date:  1995-03-28       Impact factor: 11.205

5.  Rhizobium meliloti nodD genes mediate host-specific activation of nodABC.

Authors:  M A Honma; M Asomaning; F M Ausubel
Journal:  J Bacteriol       Date:  1990-02       Impact factor: 3.490

6.  Horizontal gene transfer and homologous recombination drive the evolution of the nitrogen-fixing symbionts of Medicago species.

Authors:  Xavier Bailly; Isabelle Olivieri; Brigitte Brunel; Jean-Claude Cleyet-Marel; Gilles Béna
Journal:  J Bacteriol       Date:  2007-05-11       Impact factor: 3.490

7.  The transcriptional regulator gene phrR in Sinorhizobium meliloti WSM419 is regulated by low pH and other stresses.

Authors:  Wayne G Reeve; Ravi P Tiwari; Cheryl M Wong; Michael J Dilworth; Andrew R Glenn
Journal:  Microbiology (Reading)       Date:  1998-12       Impact factor: 2.777

Review 8.  How rhizobial symbionts invade plants: the Sinorhizobium-Medicago model.

Authors:  Kathryn M Jones; Hajime Kobayashi; Bryan W Davies; Michiko E Taga; Graham C Walker
Journal:  Nat Rev Microbiol       Date:  2007-08       Impact factor: 60.633

9.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

10.  Discovery of Novel Plant Interaction Determinants from the Genomes of 163 Root Nodule Bacteria.

Authors:  Rekha Seshadri; Wayne G Reeve; Julie K Ardley; Kristin Tennessen; Tanja Woyke; Nikos C Kyrpides; Natalia N Ivanova
Journal:  Sci Rep       Date:  2015-11-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.