Literature DB >> 23425606

Comparative genomics of the core and accessory genomes of 48 Sinorhizobium strains comprising five genospecies.

Masayuki Sugawara, Brendan Epstein, Brian D Badgley, Tatsuya Unno, Lei Xu, Jennifer Reese, Prasad Gyaneshwar, Roxanne Denny, Joann Mudge, Arvind K Bharti, Andrew D Farmer, Gregory D May, Jimmy E Woodward, Claudine Médigue, David Vallenet, Aurélie Lajus, Zoé Rouy, Betsy Martinez-Vaz, Peter Tiffin, Nevin D Young, Michael J Sadowsky.   

Abstract

<span class="abstract_title">BACKGROUND: The sinorhizobia are amongst the most well studied members of <span class="Chemical">nitrogen-fixing root nodule bacteria and contribute substantial amounts of fixed nitrogen to the biosphere. While the alfalfa symbiont Sinorhizobium meliloti RM 1021 was one of the first rhizobial strains to be completely sequenced, little information is available about the genomes of this large and diverse species group.
RESULTS: Here we report the draft assembly and annotation of 48 strains of Sinorhizobium comprising five genospecies. While S. meliloti and S. medicae are taxonomically related, they displayed different nodulation patterns on diverse Medicago host plants, and have differences in gene content, including those involved in conjugation and organic sulfur utilization. Genes involved in Nod factor and polysaccharide biosynthesis, denitrification and type III, IV, and VI secretion systems also vary within and between species. Symbiotic phenotyping and mutational analyses indicated that some type IV secretion genes are symbiosis-related and involved in nitrogen fixation efficiency. Moreover, there is a correlation between the presence of type IV secretion systems, heme biosynthesis and microaerobic denitrification genes, and symbiotic efficiency.
CONCLUSIONS: Our results suggest that each Sinorhizobium strain uses a slightly different strategy to obtain maximum compatibility with a host plant. This large genome data set provides useful information to better understand the functional features of five Sinorhizobium species, especially compatibility in legume-Sinorhizobium interactions. The diversity of genes present in the accessory genomes of members of this genus indicates that each bacterium has adopted slightly different strategies to interact with diverse plant genera and soil environments.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23425606      PMCID: PMC4053727          DOI: 10.1186/gb-2013-14-2-r17

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Background

The rhizobia are symbiotic <span class="Chemical">nitrogen-fixing bacteria that form root and/or stem nodules on leguminous plants. Within <span class="Disease">nodules rhizobia convert atmospheric dinitrogen (N2) gas into ammonia, resulting in improved plant growth and productivity, even under N-limiting environmental conditions. These bacteria are among the largest fixers of atmospheric N2 gas in the biosphere and account for the deposition of nearly 100 to 195 teragrams per year. The effective use of biological nitrogen fixation via application of rhizobia leads to sustainable cropping systems with a net positive impact on the environment [1]. Most currently recognized legume-nodulating bacteria belong to the α-proteobacteria and are members of the genera Allorhizobium, Azorhizobium, Mesorhizobium, Rhizobium, Sinorhizobium (renamed Ensifer), or Bradyrhizobium [2,3]. Recently, some members of the β- and γ-proteobacteria have also been shown to nodulate legume plants [4]. Members of the genus <span class="Species">Sinorhizobium are among the most studied and first sequenced rhizobia. <span class="Species">Sinorhizobium meliloti (previously Rhizobium meliloti and now Ensifer meliloti) and its close relative Sinorhizobium medicae induce the formation of root nodules on Medicago species, including Medicago truncatula and Medicago sativa (alfalfa) [5]. In contrast, Sinorhizobium saheli and Sinorhizobium terangae form root and stem nodules with woody leguminous plants, such as Sesbania or Acacia [6], while Sinorhizobium fredii has a very wide host range, nodulating more than 79 plant genera representing all three subfamilies of the family Leguminosae. Although whole genome sequences of some strains of S. meliloti, S. medicae and S. fredii have been published [7-12], and many of their genetic features have been well characterized, only a limited number of strains of each species have been well characterized at the genome level. Recently, Tian et al. [12] reported the comparative genomics of nine strains of S. fredii and Baily et al. [13] reported the population genomics of 12 S. medicae strains analyzed using Roche 454 technology. Moreover, only limited comparative genomics studies among each species exist and there are no reports of genomic feature of other species of Sinorhizobium, including the important symbionts of Sesbania/Acacia. Most rhizobial nodulation genes (nod, noe, and nol) are involved in the synthesis of host-specific <span class="Chemical">lipochitinoligosaccharide (<span class="Chemical">LCO) Nod factors essential for initial infection [14]. Bacterial genes encoding various polysaccharides, cyclic β-glucans, and type III, IV and VI secretion systems are also involved in symbiosis and host specificity [15-17]. Most of the genes involved in symbiosis are located on large self-transmissible megaplasmids (pSym), or within large genomic symbiotic islands [18]. The megaplasmid pSymA, which has the most symbiosis-related genes in S. meliloti, is a more variable replicon than the chromosome or pSymB in this bacterium [10]. Symbiosis-related genes have previously been shown to be highly variable among rhizobial species and strains [10,19] and acquired by via horizontal gene- and plasmid-transfer events. This results in gene replacement and rearrangements leading to genome plasticity [18] and recombination [12] and, ultimately, specificity of symbiotic interactions with their legume hosts. This suggests that gene content in Sinorhizobium strains should vary among strains or species and these alterations could influence their symbiotic phenotype on a host plant. However, few comparative genomic studies have focused on gene content or symbiotic function of multiple strains within or between species of sinorhizobia. Here we describe the assembly and annotation of the whole genomes of 48 strains of <span class="Species">Sinorhizobium described previously [20], with primary focus on <span class="Species">S. meliloti and S. medicae. While we previously examined 44 of these genomes to characterize population diversity at the single nucleotide level and to determine the forces driving adaptive evolution, our overall goal here was to compare gene content among a large number of strains within a single sinorhizobial species. This was done to better understand functional features in each species and to identify symbiosis-associated genes contributing to symbiotic phenotypes as part of large genome-wide association, SNP, and Hapmap studies [20-22]. Here we show: 1) the genomic features of each Sinorhizobium species; 2) the differences in gene content between S. meliloti and the taxonomically and symbiotically related species S. medicae; and 3) the differences among strains and species in genes involved in Nod factor biosynthesis, polysaccharide biosynthesis, protein secretion systems, anaerobic denitrification, and organic sulfur utilization. We also report pair-wise analyses of symbiotic associations of these 46 S. meliloti and S. medicae strains with 27 diverse M. truncatula genotypes to better understand the relationship of symbiotic phenotype with bacterial genome content.

Results and discussion

General features of Sinorhizobium genomes

Annotated draft genome assemblies of 48 Sino<span class="Species">rhizobium strains comprising five genospecies - <span class="Species">S. meliloti, S. medicae, S. fredii, S. saheli and S. terangae - are presented here (Table S1 in Additional file 1). These assemblies were generated from raw reads used previously to call SNPs in a population genetics analysis [20]. A phylogenetic tree based on 645 protein-coding genes (Figure 1) showed that S. meliloti and S. medicae are more closely related to each other than to three other species included in this study. A phylogenetic tree based on the 16S rRNA gene sequence (Figure S1 in Additional file 2) was similar to that shown in Figure 1, but the bootstrap values did not support the nodes to the extent of the tree made from protein coding genes. Genome characteristics are summarized in Table S2 in Additional file 1. Total genome sizes varied between species and strains and ranged from 6.2 to 7.8 Mb. The number of predicted protein coding sequences (CDSs; 6,436 to 8,858), and mean mole percentage G+C content (61.0 to 63.5%) also varied among sequenced genomes (Figure 2; Table S2 in Additional file 1). The mean percentage G+C content of S. meliloti strains (61.8 to 62.2% for all 32 strains) was greater than those seen in S. medicae (60.9 to 61.1% for all 12 strains) (Figure 2). Genome sizes and CDS counts varied greatly among strains in the same species. While S. meliloti M270 had the largest genome size (7.8 Mb) and number of CDSs (8,858) among all the tested strains, the genome of S. saheli USDA 4893 had the smallest genome size (approximately 6.2 Mb) and highest G+C content (63.5%). The genomes of S. fredii and S. terangae were similar to those of S. meliloti or S. medicae, respectively (Figure 2; Table S2 in Additional file 1). Recently, Tian et al. [12] reported a comparative analysis of nine S. fredii genomes and found that the average genome size was approximately 6.6 Mb, and consisted of a large number of accessory genes likely acquired by horizontal gene transfer. This is similar to what we report here. All of the strains examined contained from two to five plasmids as determined by Eckhart gel electrophoresis.
Figure 1

Neighbor-joining tree based on concatenated sequences for 645 protein coding genes. Strains that were sequenced in other studies are in bold font and type strains are in italic font. Support for splits was assessed using 1,000 bootstraps, and splits with less than 60% support were collapsed to polytomies. For clarity, the bootstrap values are only shown for the deep branches. Bar indicates number of substitutions per site.

Figure 2

Three-dimensional plots of genome size, coding sequence number and GC content of the 48 .

Neighbor-joining tree based on concatenated sequences for 645 protein coding genes. Strains that were sequenced in other studies are in bold font and type strains are in italic font. Support for splits was assessed using 1,000 bootstraps, and splits with less than 60% support were collapsed to polytomies. For clarity, the bootstrap values are only shown for the deep branches. Bar indicates number of substitutions per site. Three-dimensional plots of genome size, coding sequence number and GC content of the 48 .

Gene contents in Sinorhizobium strains

To understand the pan-genome of <span class="Species">Sinorhizobium more deeply, 380,371 protein CDSs obtained from the 48 newly sequenced genomes plus two reference strains (<span class="Species">S. meliloti 1021 and S. medicae WSM419) were clustered using the CD-HIT algorithm with a 70% sequence identity cut-off. A total of 34,150 clusters were identified, and of these, 2,751 orthologs (8%) were identified in all 50 strains as the Sinorhizobium core genome (Figure 3a). The remaining variable 31,399 clusters were defined as the Sinorhizobium accessory genome. Species-specific genes were identified among the five tested species (Figure 3a).
Figure 3

The pan-genome of . The flower plots and Venn diagrams illustrate the number of shared and specific (accessory) genes based on clusters of orthologs. (a) Flower plot showing numbers of species-specific genes commonly found in each genome of each species (in the petals), and Sinorhizobium core orthologous gene number (in the center). (b) Flower plots showing numbers of unique orthologous genes in each S. meliloti strain (in the petals), and S. meliloti core orthologous gene number (in the center). (c) Flower plots showing numbers of unique orthologous gene in each S. medicae strain (in the petals), and S. medicae core orthologous gene number (in the center). (d) Venn diagram showing numbers of unique orthologous genes in each S. fredii strain, and S. fredii core orthologous gene number.

The pan-genome of . The flower plots and Venn diagrams illustrate the number of shared and specific (accessory) genes based on clusters of orthologs. (a) Flower plot showing numbers of species-specific genes commonly found in each genome of each species (in the petals), and <span class="Species">Sinorhizobium core orthologous gene number (in the center). (b) Flower plots showing numbers of unique orthologous genes in each <span class="Species">S. meliloti strain (in the petals), and S. meliloti core orthologous gene number (in the center). (c) Flower plots showing numbers of unique orthologous gene in each S. medicae strain (in the petals), and S. medicae core orthologous gene number (in the center). (d) Venn diagram showing numbers of unique orthologous genes in each S. fredii strain, and S. fredii core orthologous gene number. <span class="Chemical">Species core orthologous genes and strain-specific unique genes within a given <span class="Species">Sinorhizobium species were examined in 33, 13, and 2 strains of S. meliloti, S. medicae, and S. fredii, respectively (Figure 3b-d). In the S. meliloti strains, 21,118 orthologous genes were identified from 33 strains, and of these, 4,680 orthologs were present in all tested S. meliloti strains as the species core genome (Figure 3b). The number of unique genes in each S. meliloti strain varied from 25 to 840 (Figure 3b). S. meliloti strain M270 had the largest genome (7.8 Mb) and the largest number (840) of unique genes. The M270 genome uniquely contained well-correlated regions of the nopaline-type plasmid, pTiC58, found in the plant pathogen Agrobacterium tumefaciens C58. This included complete sets of trb genes (encoding type IV secretion system proteins involved in conjugal transfer) and nopaline utilization genes (noc).

Functional features of the core and accessory sinorhizobial genomes

To define possible differences in functions encoded by the core and/or accessory genome in each species group, the proportion of proteins in each <span class="Chemical">COG (Clusters of Orthologous Groups) category was plotted versus <span class="Chemical">COG function. Figure 4 shows that the core-genomes in each Sinorhizobium species group were commonly enriched in COG categories C, F, H, M, J, and V relative to those seen in the accessory genomes. In contrast, accessory genomes were commonly enriched in COG categories Q, D, K, and L relative to those of the core genome. There was no major difference in COG category proportion between S. meliloti and S. medicae, but the abundances of genes in category G (carbohydrate transport and metabolism) in the accessory genomes were greater in both of these species strains compared to those seen in other sinorhizobia.
Figure 4

Distribution of orthologous genes based on COG category in each . The percentages of orthologous genes assigned by COG category in the core genome (black bars) and the accessory genome (white bars) are shown. Only orthologous genes assigned by COGnitor were used for analysis.

Distribution of orthologous genes based on <span class="Chemical">COG category in each . The percentages of orthologous genes assigned by <span class="Chemical">COG category in the core genome (black bars) and the accessory genome (white bars) are shown. Only orthologous genes assigned by COGnitor were used for analysis.

Functional differences between S. meliloti and S. medicae

While <span class="Species">S. meliloti and <span class="Species">S. medicae are taxonomically related (Figure 1) with somewhat similar host ranges [5], 421 out of 4,680 S. meliloti core orthlogous genes were not found in the tested 13 strains of S. medicae. Similarly, 396 out of 5,036 S. medicae core orthologous genes were not found in the 33 tested strains of S. meliloti. Selected S. meliloti- or S. medicae-specific genes in each species are shown in Table 1 and all species-specific genes are presented in Tables S3 and S4 in Additional file 1. These results show that genes involved in conjugation, C1 metabolism, detoxification, and cellular process were specifically identified in the core genomes of each species. In addition, S. meliloti specifically possesses genes encoding a nitrate transporter (nrtABC), a nitrogen regulatory protein (ntrR), and a succinoglycan biosynthetic gene (exoI). In contrast, S. medicae species specifically contain many arylsulfatase genes (Figure S2 in Additional file 2) associated with transporter genes. Of particular interest is the prevalence of genes involved in organic sulfur utilization in S. medicae, which are also present and expressed in Bradyrhizobium japonicum when in symbiosis with soybean [23]. This is likely to be of functional importance as organic sulfur in the form of sulfur esters and sulfonates constitute approximately 95% of the total sulfur in aerobic soils [24].
Table 1

Selected S. meliloti- or S. medicae-specific genes among both speciesa

SpeciesGene IDbGene nameFunction
Conjugation
S. melilotiSMa0929traGConjugal transfer coupling protein TraG
S. melilotiSMa0934traA1Conjugal transfer protein TraA1
S. melilotiSMa1302virB11Type IV secretion protein VirB11
S. melilotiSMa1303virB10Type IV secretion protein VirB10
S. melilotiSMa1306virB9Type IV secretion protein VirB9
S. melilotiSMa1308virB8Type IV secretion protein VirB8
S. melilotiSMa1311virB6Type IV secretion protein VirB6
S. melilotiSMa1313virB5Type IV secretion protein VirB5
S. melilotiSMa1315virB4Type IV secretion protein VirB4
S. melilotiSMa1318virB3Type IV secretion protein VirB3
S. melilotiSMa1319virB2Type IV secretion protein VirB2
S. melilotiSMa1321virB1Type IV secretion protein VirB1
S. melilotiSMa1323rctANegative transcriptional regulator of vir genes
S. medicaeSmed_5050traDConjugal transfer TraD family protein
S. medicaeSmed_5051traCConjugal transfer protein TraC
S. medicaeSmed_5375traIAcyl-homoserine-lactone synthase
S. medicaeSmed_5377trbCConjugal transfer protein TrbC
S. medicaeSmed_5387traRTranscriptional activator protein TraR
S. medicaeSmed_5388traMTranscriptional repressor TraM
S. medicaeSmed_5391traBConjugal transfer protein TraB
Nitrogen metabolism
S. melilotiSMa0228gdhAGlutamate dehydrogenase
S. melilotiSMa0581nrtCNitrate transport ATP binding protein
S. melilotiSMa0583nrtBNitrate ABC transporter permease
S. melilotiSMa0585nrtANitrate ABC transporter substrate-binding protein
S. melilotiSMa0981ntrR2NtrR2 transcription regulator
S. melilotiSMc01521ntrR1Nitrogen regulatory protein
S. medicaeSmed_1742fnrNNitrogen fixation regulatory protein
Organic sulfur utilization
S. medicaeSmed_1128ssuB-likeAliphatic sulfonates import ATP-binding protein
S. medicaeSmed_1129ssuA-likeAliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein
S. medicaeSmed_1130atsA-likeArylsulfatase
S. medicaeSmed_3146atsA-likeArylsulfatase
S. medicaeSmed_3147ssuAAliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein
S. medicaeSmed_3148ssuBSulfonate ABC transporter, ATP-binding protein
S. medicaeSmed_3150ssuCAlkanesulfonate transport protein; membrane component
S. medicaeSmed_3151tauC-likePutative taurine transport system permease protein TauC
S. medicaeSmed_2065atsAArylsulfatase
Detoxification
S. melilotiSMb21552aacC4Aminoglycoside 6'-N-acetyltransferase
S. melilotiSMb20505tfxGTrifolitoxin immunity protein
S. melilotiSMc02649arsCArsenate reductase protein ArsC
S. melilotiSMc02650arsHArsenical resistance protein ArsH
S. medicaeSmed_0125aacAAminoglycoside N(6')-acetyltransferase type 1
S. medicaeSmed_2292aphEStreptomycin 3''-kinase
S. medicaeSmed_5053arsHArsenate resistance protein ArsH
S. medicaeSmed_5054arsBArsenite resistance protein ArsB
S. medicaeSmed_5055arsCArsenate reductase
C1 metabolism
S. melilotiSMa0002fdoGFdoG formate dehydrogenase-O, alpha subunit
S. melilotiSMa0005fdoHFdoH formate dehydrogenase-O, beta subunit
S. melilotiSMa0007fdoIFdoI formate dehydrogenase-O, gamma subunit
S. melilotiSMa0009fdhEFormate dehydrogenase accessory protein FdhE
S. melilotiSMa0011selAL-seryl-tRNA(Sec) selenium transferase
S. melilotiSMa0015selBSelenocysteine-specific elongation factor
S. melilotiSMa0028selDSelenide, water dikinase
S. medicaeSmed_2095folDBi-functional; 5,10-methylene-tetrahydrofolate dehydrogenase and cyclohydrolase
S. medicaeSmed_2096glyASerine hydroxymethyltransferase
Sugars and polysaccharides
S. melilotiSMb20951exoISuccinoglycan biosynthesis protein ExoI
S. melilotiSMb21416ddhAGlucose-1-phosphate cytidylyltransferase
S. melilotiSMb21417ddhBCDP-glucose 4,6-dehydratase
S. melilotiSMb21418NDP-hexose 3-C-methyltransferase
S. medicaeSmed_5910otsBTrehalose-phosphate phosphatase
Cellular processes
S. melilotiSMc03854ftsYPutative cell division protein
S. melilotiSMc03044motDChemotaxis protein (motility protein D)
S. medicaeSmed_1943ftsZCell division protein FtsZ homolog 2
S. medicaeSmed_0273motDChemotaxis protein motD
Others
S. melilotiSMc04203fecIPutative RNA polymerase sigma factor FecI protein
S. melilotiSMc04204fecRPutative IRON transport regulator transmembrane protein
S. melilotiSMc04205Putative IRON/HEME transport protein
S. medicaeSmed_2092dsdAD-serine dehydratase
S. medicaeSmed_3282fbpBFerric transport system permease protein FbpB
S. medicaeSmed_3284fbpCFerric transporter subunit

aAll genes are presented in Tables S3 and S4 in Additional file 1. bID of annotated gene in S. meliloti 1021 or S. medicae WSM419.

Selected <span class="Species">S. meliloti- or <span class="Species">S. medicae-specific genes among both speciesa aAll genes are presented in Tables S3 and S4 in Additional file 1. bID of annotated gene in <span class="Species">S. meliloti 1021 or <span class="Species">S. medicae WSM419.

Nod factor biosynthetic genes

Most nodulation genes (nod, noe, and nol) are involved in the synthesis of host-specific <span class="Chemical">lipo-chito-oligosaccharide (<span class="Chemical">LCOs) Nod factors that are essential for initiation of the symbiosis [14]. Nearly all rhizobia contain the common nod genes [25], which encode Nod factors secreted from rhizobial cells [14,26]. Figure 5 shows a physical map of Nod factor biosynthesis genes in all five Sinorhizobium species. The S. meliloti and S. medicae strains contain a nodABCIJ operon that is closely linked to nodD(encoding positive transcriptional regulator of nod genes), whereas nodDof S. fredii, S. saheli and S. terangae is not closely linked to the common nod genes. S. meliloti and S. medicae had three copies of nodD (nodD) while the other sinorhizobia examined had two copies of nodD. Interestingly, the annotated nodN (encoding a dehydratase enzyme) was found to be fragmented in many strains of S. medicae. The genome of the S. medicae WSM419 contained noeJ, whereas S. meliloti KH46b had two copies of the noeJK genes and a noeLnolK gene cluster involved in the fucosylation of the Nod factors at the C-6 position. Since both WSM419 and KH46b strains did not contain a nodZ homolog, our data suggest that these strains may not fucosylate their Nod factors. In contrast, S. saheli and S. fredii strain USDA 207 possessed a complete set of noeJK-nodZ-noeLK genes. The nodZ in S. fredii is also found in B. japonicum and is involved in host-specific nodulation of soybean [27].
Figure 5

Gene organization and correlation of Nod factor biosynthetic genes in each . Blue arrows indicate the genes encoding enzymes for Nod factor synthesis commonly detected in all tested Sinorhizobium strains. Yellow arrows indicate the genes involved in Nod factor secretion. Green arrows indicate specifically detected genes involved in Nod factor synthesis in an individual species. Red arrows indicate the genes encoding transcriptional regulators of nodulation genes. White arrows indicate genes involved in Nod factor biosynthesis that are not in common.

Gene organization and correlation of Nod factor biosynthetic genes in each . Blue arrows indicate the genes encoding enzymes for Nod factor synthesis commonly detected in all tested Sino<span class="Species">rhizobium strains. Yellow arrows indicate the genes involved in Nod factor secretion. Green arrows indicate specifically detected genes involved in Nod factor synthesis in an individual species. Red arrows indicate the genes encoding transcriptional regulators of nodulation genes. White arrows indicate genes involved in Nod factor biosynthesis that are not in common. The sequenced <span class="Species">S. saheli and <span class="Species">S. terangae strains contained the nodSU genes, which are involved in the N-methylation and 6-O-carbamoylation of Nod factors [28], inserted between nodABC and nodIJ genes. In addition, nolO and noeI, which are involved in 3-O-carbamoylation and 2-O-methylation of Nod factors, respectively, were localized downstream of the nodABCIJ cluster in only the genome of S. fredii strains. This organization was similar to that reported for the broad host range Rhizobium sp. strain NGR234 [29], but the nolO gene was fragmented in the closely related strains USDA 205 and 207. In contrast, the S. meliloti and S. medicae strains contained nodGP, nodM and noeAB, and S. saheli had a noeCHOP gene cluster, and only S. fredii had a noeI gene. Strains of <span class="Species">S. meliloti are known to synthesize <span class="Chemical">sulfated Nod factors via two copies of nodPQ (producing the sulfate donor molecule PAPS) and a nodH sulfotransferase. As PAPS is also a central metabolite for sulfate assimilation, S. meliloti has additional copies of genes for sulfur metabolism and uses nodPQ exclusively for sulfation of Nod factor. In contrast, S. saheli and S. fredii had only one copy of nodPQ and did not contain nodH, consistent with the Nod factor structure of S. saheli reported earlier [30]. While the Acacia symbiont S. terangae strain USDA 4894 had a nodH gene, it contained fewer Nod factor adornment genes than those seen in other species. The nolR gene, which encodes a negative transcriptional regulator of core Nod factor biosynthesis and is a global regulator in rhizobia [31,32], was detected in all species of <span class="Species">Sinorhizobium, although the gene in the reference strain <span class="Species">S. meliloti 1021 is not functional [32]. Taken together, these results indicated Nod factor biosynthetic gene content varied among strains of the same species and suggest that LCOs produced by sinorhizobia might be modified in a strain-specific manner. These results are also the first report of genetic organization of nodulation genes in the woody legume symbionts S. saheli and S. terangae.

Secretion system gene clusters among Sinorhizobium members

Clusters of genes encoding bacterial type III, IV, and VI protein secretion systems (T3SS, T4SS, and T6SS, respectively) play crucial roles in animal- and plant-bacterial interactions [33]. In rhizobia, these secretion systems are involved in host range determination with their cognate effector proteins modulating host defense reactions [17]. A T3SS gene cluster has been characterized in Rhizobium spp. (S. fredii) NGR234, S. fredii USDA 257 and S. fredii HH103 (USDA 207), and T3SS mutants have symbiotic phenotypes [34,35]. However, there are no reports on the roles of T4SS and T6SS systems in sinorhizobial-legume symbioses. Figure 6 shows the structure of the different T3SS, T4SS and T6SS genes found in all the sequenced strains with substantial differences in genomic organization and deduced protein sequences. Notably, the S. saheli genome contained T3SS, T4SS, and T6SS gene clusters, as did one of the two S. fredii strains, while S. medicae strains only contained a T4SS.
Figure 6

Gene clusters for type III, IV, and VI secretion systems identified in . (a) Gene organizations of identified type III, IV, and VI secretion system genes. Colored arrows indicate characterized or named genes involved in the protein secretion systems. (b) Map showing presence (black plot) or absence (grey plot) of each type of type III, IV, and VI secretion system gene cluster. (c) Phylogenetic tree of the virB operon from each type IV secretion system gene cluster. Protein sequences of virBand virBgenes or their orthologs in each type IV secretion system gene cluster were concatenated and used for drawing the tree. Bar indicates number of substitutions per site.

Gene clusters for type III, IV, and VI secretion systems identified in . (a) Gene organizations of identified type III, IV, and VI secretion system genes. Colored arrows indicate characterized or named genes involved in the protein secretion systems. (b) Map showing presence (black plot) or absence (grey plot) of each type of type III, IV, and VI secretion system gene cluster. (c) Phylogenetic tree of the virB operon from each type IV secretion system gene cluster. Protein sequences of virBand virBgenes or their orthologs in each type IV secretion system gene cluster were concatenated and used for drawing the tree. Bar indicates number of substitutions per site. Three types of T3SS clusters (types a, b, and c) were identified from several Sino<span class="Species">rhizobium strains and all clusters contained the canonical rhcJ-nolUV-rhcNQRST gene cassette (Figure 6a). The T3SSa cluster was detected in nine strains of <span class="Species">S. meliloti and S. saheli USDA 4893 and contained rhcC, rhcC, rhcU, and rhcV (Figure 6b). While most of the genes in the main cluster showed 58 to 94% protein identity with the corresponding genes in Rhizobium spp. (S. fredii) strain NGR234, gene organization of the flanking regions were different. The T3SSb cluster contained the effector genes (nop) in S. fredii HH103 strain (USDA 207) and was also identified in S. fredii USDA 205 and S. terangae USDA 4894. Strains having a T3SSc cluster had genes in the main cluster with 40 to 87% protein identity with those of Rhizobium etli CIAT 652 and were only observed in the genomes of S. meliloti M195 and S. terangae USDA 4894. The T3SS types a and c gene clusters found in S. meliloti, S. saheli and S. terangae had a different gene organization from any published Rhizobium T3SS clusters and did not contain the well-characterized nop genes, encoding T3SS-dependent surface appendage or effector proteins. The unique T3SS apparatus found in these strains may encode novel secretion proteins involved in host-specific interactions. <span class="Species">Agrobacterium tumefaciens C58 also uses T4SS for conjugation and DNA transfer [36] and strain C58 possesses three types of T4SS genes: vir, avh, and trb. The virB gene of <span class="Species">S. meliloti 1021 (grouped in T4SSa) is involved in conjugation, but is not required for symbiosis with alfalfa [37]. In contrast, vir genes of Mesorhizobium loti strain R7A are involved in protein translocation and have a host-dependent effect on symbiosis [38]. While seven types of T4SS gene clusters (designated T4SSa-g) were identified in the Sinorhizobium genomes (Figure 6a), they were not present in all strains (Figure 6b), suggesting these genes were likely acquired by horizontal gene or plasmid transfer events. To explore the potential function of each Sinorhizobium T4SS gene cluster, a phylogenetic tree was created using selected T4SS protein sequences from diverse bacteria known to infect plant and mammalian hosts (Figure 6c). A total of five clades were detected in the phylogenetic tree and T4SSb and T4SSc were present in clade I, including the Vir proteins of M. loti R7A and A. tumefaciens C58. In contrast, proteins in T4SSa, T4SSd, and T4SSg were in clades II or V and were similar to conjugation transfer proteins Trb or Avh of A. tumefaciens. Since the Sinorhizobium VirB proteins are similar to the symbiotically effective VirB in M. loti R7A, these results indicate that the T4SSb and T4SSc genes in Sinorhizobium strains may also influence symbiosis. The T4SSb gene cluster was found in 9 and 11 strains of S. meliloti and S. medicae, respectively, and the T4SSc cluster was only found in the Sesbania and Acacia symbionts (S. saheli and S. terangae), suggesting that the cluster plays a role in host-specific interactions. The T6SS locus (referred to as imp) is a determinant of host specificity in <span class="Species">Rhizobium leguminosarum [39]. The S. saheli strain USDA 4893 had two types of T6SS gene clusters, and T6SSb was also present in S. fredii USDA 207. The T6SSa cluster is very similar to that seen in R. leguminosarum at the amino acid level. No T6SS gene cluster was found in the S. meliloti, S. medicae, and S. terangae strains. Taken together, these results suggest that each sinorhizobial species utilizes different protein secretion strategies to modulate host-specific interactions, although further mutational and functional studies are needed to determine the role of these secretion systems in symbiosis.

General regulatory systems of T3SS and T4SS genes in rhizobia

In general, the expression of T3SS genes (rhc and nop) or T4SS genes (vir) is induced by the positive regulators <span class="Gene">TtsI (for T3SS) and VirA (for T4SS). <span class="Gene">TtsI and VirA bind to a tts- or vir-box in the promoter region of T3SS genes (rhc and nop) and T4SS genes (vir), respectively. In addition, the ttsI and virA genes have a nod box in front of them, indicating that these genes are likely induced by the NodD protein. The homologous genes of T3SS effector proteins (NopABCJLMPTX from <span class="Species">S. fredii NGR234) and the <span class="Gene">TtsI transcriptional regulator of T3SS genes were searched by BLAST analysis. Results of this analysis indicated that while the nop genes and ttsI were found in the genome of S. fredii USDA 205 and USDA 207 and in S. terangae strain USDA4894, which have the T3SSb gene cluster (Table S5 in Additional file 1), they were not found in the genomes of any S. meliloti strains. Moreover, a canonical nod box consensus sequence was not identified around any region of T3SS-related genes (rhc, nop and ttsI), although tts boxes were found upstream of some nop genes in the genomes of S. fredii USDA205 and USDA207 and the S. terangae strain USDA4894 (Table S6 in Additional file 1), which have the T3SSb cluster. Blast analyses were used to search the sequenced genomes for genes homologous to those encoding the T4SS effector proteins Msi059 and Msi061 from <span class="Species">M. loti <span class="Species">R7A and a VirA transcriptional regulator of T4SS genes. While the Msi061 homolog was found in the T4SSb and T4SSc gene clusters, Msi059 was not found in the genomes of any of the Sinorhizobium strains (Table S7 in Additional file 1). A VirA homolog was only found in the genomes of S. saheli strain USDA 4893 and S. terangae strain USDA 4894, in the T4SSc cluster (Table 3). In contrast, nod and vir box-like sequences were not identified in the T4SSb and T4SSc clusters of any of the sequenced strains. Taken together, these results suggest that the expression of identified T3SS and T4SS genes might not be regulated by the previously reported nod box inducers. However, further analysis is needed to examine the regulation of these genes.
Table 3

Presence of accessory genes involved in polysaccharide biosynthesis, microaerobic denitrification, lithotrophic growth, and organic sulfur utilization in the genomes of each Sinorhizobium species

Gene present in each Sinorhizobium speciesa

Gene or gene clusterFunctionmeliloti (n = 33)medicae (n = 13)fredii (n = 2)saheli (n = 1)terangae (n = 1)
Polysaccharide biosynthesis
exoF2Succinoglycan biosynthesis70200
exoHSuccinoglycan biosynthesis3313000
exoISuccinoglycan biosynthesis330010
exoI2Succinoglycan biosynthesis110200
exoP2Succinoglycan biosynthesis70200
exoTWVSuccinoglycan biosynthesis3313000
expA1-10-expGCD1D2-expE1-8Galactoglucan biosynthesis3313001
rkp-3; rkpLMNOPQCapsular polysaccharides biosynthesis40201
rkpZ1Capsular polysaccharides biosynthesis3313111
rkpZ2Capsular polysaccharides biosynthesis50211
rkpT2Surface polysaccharide export2913111
cgmBCyclic β-glucan biosynthesis10000
Microaerobic denitrification
napEFDABCNitrate reductase3213211
nirKVNitrite reductase199211
norECBQDNitric oxide reductase219211
nosRZDFYLXNitrous oxide reductase220000
Lithotroph
hupSLCDEFGHJKP-hypABFCDE-hoxXUptake hydrogenase00001
soxYZEF-likeSulfur oxidation70200
soxZSulfur oxidation3313200
Organic sulfur utilizationb
 I: ssuDABCEAlkanesulfonate degradation3313001
 II: tauRABCXDTaurine degradation3313000
 III: ssuCBA-atsA-likeArylsulfatase013000
 IV: tauC-ssuCBA-ats- likeArylsulfatase013000
 V: ssuADCBAlkanesulfonate degradation00200

aValues in a column indicate number of strains possessing a gene or gene cluster in a species. bThe genes in each gene cluster are orthologs of Smed_4212-4216 (I), Smed_4858-4863 (II), Smed_1127-1130 (III), Smed_3146-3151 in S. medicae WSM419, and U205v1_247004-247007 (V) in S. fredii USDA 205.

Symbiotic phenotypes of T4SSb mutants of S. meliloti and S. medicae

To further investigate the role of T4SSb in nodulation, deletion mutants of virBto virB, predicted to encode essential components of the T4SS apparatus in <span class="Species">S. meliloti KH46c and <span class="Species">S. medicae M2, were constructed and inoculated onto nine genotypes of M. truncatula and one genotype each of M. sativa, Medicago tricycla and Medicago littoralis. A few symbiotic differences between the wild-type strains and the KH46c and M2 virBmutants were detected in certain Medicago genotypes (Table 2). M. truncatula cv. A17 and M. tricycla inoculated with the virBmutant of S. meliloti KH46c formed significantly fewer nodules and had lower nodule and plant biomass than that seen in plants inoculated with the wild-type strain. Unexpectedly, however, the virBmutation in S. medicae M2 significantly increased nodule and plant biomass on M. truncatula cv. F83005-5. The KH46c ΔvirBmutant produced about four-fold greater nodule mass on M. sativa cv. Agate than did the wild-type strain (Table 2), but had about three- fold less acetylene reduction activity (432 ± 376 μmol C2H4 produced/h/g nodule dry weight) than the wild-type (1,132 ± 163 μmol C2H4 produced/h/g nodule dry weight), suggesting a less effective symbiotic interaction. While further experiments are needed to better understand the function of T4SSb in symbiosis, these results indicate that the T4SSb in Sinorhizobium may indeed play a role in host specificity. Observations from phenotype tests and gene content differences found in the genome data set suggested that the T4SSb secretion system is likely involved in symbiotic nitrogen fixation with specific M. truncatula genotypes. In particular, VirB proteins were postulated as symbiotic effector proteins in M. loti R7A [38]. However, we cannot rule out the possibility that other genes are important for host-determination and/or symbiotic efficiency.
Table 2

Symbiotic phenotypes of Medicago plants inoculated with virB mutants of S. meliloti KH46c and S. medicae M2

Host plantInoculated strainNodule numberaNodule dry mass (mg)Plant dry mass (mg)Plant height (cm)Chlorophyll content (SPAD unit)
M. truncatula KH46c wild-type796.620812.244
A17KH46c ΔvirB6-938*4.3*145*9.5*43
M2 wild-type1028.422911.041
M2 ΔvirB6-9516.2*20211.244
Uninoculated control00373.317
M. truncatula KH46c wild-type356.117410.342
F83005-5KH46c ΔvirB6-9245.51589.839
M2 wild-type294.91569.543
M2 ΔvirB6-9226.7*243*10.7*41
Uninoculated control00443.316
M. tricycla KH46c wild-type2412.231510.536
R108-C3KH46c ΔvirB6-912*9.923010.334
M2 wild-type112.8334.219
M2 ΔvirB6-9123.1334.221
Uninoculated control00263.516
M. satvia cvKH46c wild-type561.6958.554
AgateKH46c ΔvirB6-9426.8*557.245*
M2 wild-type312.56913.731
M2 ΔvirB6-9282.58514.628*
Uninoculated control007912.521

aValues are per plant. The asterisk indicates a significant difference compared with the wild-type strain by t-test (P < 0.05) of three biological replicates.

Symbiotic phenotypes of <span class="Species">Medicago plants inoculated with virB mutants of <span class="Species">S. meliloti KH46c and S. medicae M2 aValues are per plant. The asterisk indicates a significant difference compared with the wild-type strain by t-test (P < 0.05) of three biological replicates.

Anaerobic denitrification genes

The ability of rhizobia to denitrify depends on the nap, nir, nor, and nos gene clusters that encode <span class="Chemical">nitrate-, nitrite-, nitric oxide-, and <span class="Chemical">nitrous oxide-reductases, respectively [40,41]. Denitrification plays an important role in nitrogen-fixing soybean-Bradyrhizobium japonicum symbiosis and S. meliloti has been shown to denitrify under free-living and symbiotic conditions [41]. Genomic data presented here show that while the genomes of S. fredii, S. saheli, and S. terangae strains contained napEFDABC, nirKV, and norECBQD, they did not have the nosRZDFYLX genes that are involved in the terminal step of converting nitrous oxide to N2. In contrast, the nosRZDFYLX gene cluster was identified in 22 S. meliloti strains (Table 3), 19 of which had a complete gene set allowing for the production of N2 gas from nitrate. Presence of accessory genes involved in <span class="Chemical">polysaccharide biosynthesis, microaerobic denitrification, lithotrophic growth, and organic <span class="Chemical">sulfur utilization in the genomes of each Sinorhizobium species aValues in a column indicate number of strains possessing a gene or gene cluster in a species. bThe genes in each gene cluster are orthologs of Smed_4212-4216 (I), Smed_4858-4863 (II), Smed_1127-1130 (III), Smed_3146-3151 in <span class="Species">S. medicae WSM419, and U205v1_247004-247007 (V) in <span class="Species">S. fredii USDA 205.

Species differences in organic sulfur utilization genes

The majority of <span class="Chemical">sulfur in agricultural soils is in organic form, such as <span class="Chemical">sulfonates and sulfur-esters [24], and assimilation of these compounds by rhizobia is important for bacterial survival, competition in soils, and during symbiosis [23]. While Koch et al. [42] proposed that sulfonate monooxygenase is involved in host-specific adaptation by B. japonicum, little is known about organic sulfur utilization in sinorhizobia. Genome annotation indicated the presence of organic sulfur utilization genes (Table 3) and likely species-specific differences in the presence of genes for sulfonate monooxygenases (sulfonate sulfur utilization) or sulfatases (ester-sulfur utilization). S. meliloti and S. medicae specifically had cluster I (ssuDABCE encodes sulfonate transport and desulfonation proteins) and cluster II (tauRABCXD encodes taurine uptake and desulfonation proteins). In contrast, only S. medicae strains contained clusters III and IV, containing arylsulfatases (ester-sulfur utilization) [43] and ssuCBA-like organic sulfur transporter genes (Table 3; Figure S2 in Additional file 2). We tested for sulfatase activity in nodules induced in Medicago genotypes (HM011, HM014, HM019, HM028, HM101) by five S. meliloti (RM1021, M243, M210, M270, M30) and five S. medicae strains (WSM419, M102, M161, A321, M58). With few exceptions, sulfatase activity was greater in nodules induced by S. medicae than by S. meliloti, averaging 6.1 and 29.4 units/HM011 nodule, respectively. In addition, because S. medicae strains commonly have arylsulfatase genes associated with transporter genes (in clusters III and IV), strains of this species may uptake and utilize a wider variety of organosulfur compounds than S. meliloti.

Phenotypic interactions between sequenced Sinorhizobium spp. strains and diverse M. truncatula genotypes

We assessed the symbiotic interaction of 46 <span class="Species">S. meliloti or S. medicae strains with 27 M. truncatula genotypes. Symbiotic analyses indicated highly significant rhizobial-plant genotype interactions among the tested Sinorhizobium strains and M. truncatula genotypes (Figure 7; Tables S1 and S8 in Additional file 1). Most strains formed nodules on the roots of all M. truncatula genotypes, although S. meliloti strain M162 did not form nodules on 17 of 27 M. truncatula genotypes. The noeA gene, which was characterized as a host-specific nodulation gene [44], was found to be truncated in the nodulation-deficient strain S. meliloti M162, suggesting that the failure of this strain to nodulate some Medicago genotypes might be caused by a natural mutation in noeA. A cluster analysis using normalized and averaged values for each phenotype category obtained from all 27 M. truncatula genotypes is presented as a heat map (Figure 7). Strains were divided into phenotype clusters I (PC I) and II (PC II). The PC I included 30 strains that showed high compatibility with M. truncatula as measured by the increase in chlorophyll content and plant biomass, significantly more than the 16 strains in the PC II. Strains of both S. meliloti and S. medicae were present in both PC I and II, suggesting that differences in the symbiotic compatibility with M. truncatula were likely caused by strain-specific differences in symbiotic genes.
Figure 7

Symbiotic phenotypes of each . meliloti and S. medicae strain with M. truncatula. Dendrogram and heatmap showing the results of clustering analysis based on the phenotype values. Averaged raw values of each phenotype from three biological replicates were normalized to the range 0 to 1 in each M. truncatula genotype. The normalized values were then averaged for 27 genotypes of M. truncatula, and clustered. The color in the heatmap indicates the level of value; red indicates the highest and green indicates the lowest value. Black colored names indicate S. meliloti strain, and red colored names indicate S. medicae strain. PC, phenotype cluster.

Symbiotic phenotypes of each . <span class="Species">meliloti and S. medicae strain with M. truncatula. Dendrogram and heatmap showing the results of clustering analysis based on the phenotype values. Averaged raw values of each phenotype from three biological replicates were normalized to the range 0 to 1 in each M. truncatula genotype. The normalized values were then averaged for 27 genotypes of M. truncatula, and clustered. The color in the heatmap indicates the level of value; red indicates the highest and green indicates the lowest value. Black colored names indicate S. meliloti strain, and red colored names indicate S. medicae strain. PC, phenotype cluster. To investigate the sinorhizobial genes that may affect symbiosis and <span class="Chemical">nitrogen fixation with <span class="Species">M. truncatula, we searched previously identified symbiosis-related genes in Sinorhizobium or other rhizobia from the annotated genome data set of 46 S. meliloti or S. medicae strains. The proportion of strains having a full-length gene or gene clusters in each phenotypic cluster were obtained and compared to the proportions in other phenotypic clusters (Table 4). The T4SSb gene cluster (Figure 6) was conserved in 47% of S. meliloti and all S. medicae strains grouped in PC I; however, it was absent in all strains grouped in PC II (Table 4). In addition, hemN, involved in heme biosynthesis, and nirKV, norECBQD, and nosRZDFYLX, involved in microaerobic denitrification, were also conserved in relatively greater numbers of strains grouped in PC I (Table 4). In contrast, the proportion of strain containing previously reported symbiosis-related genes, such as T3SSa, genes involved in polysaccharide biosynthesis, and acdS (encoding 1-aminocyclopropane-1-carboxylate deaminase), were not differenct between among PC I and PC II strains. Taken together, these results suggest that protein secretion by the newly identified T4SSb and anaerobic respiration by denitrification might have an important role in symbiotic compatibility with M. truncatula.
Table 4

Presence of variable length symbiosis-related genes in each phenotype cluster of S. meliloti and S. medicae

Species and phenotype cluster (PC)a

S. meliloti S. medicae


Gene or gene clusterI (n = 19)II (n = 14)I (n = 11)II (n = 2)
Nodulation
nodN95 (18)64 (9)00
noeA100 (19)93 (13)100 (11)100 (2)
noeJ1K15 (1)000
noeJ2K2009 (1)0
noeLnolK5 (1)000
Nitrogen fixation
fixQ100 (19)86 (12)100 (11)100 (2)
fixR100 (19)93 (13)00
fixU95 (18)79 (11)100 (11)100 (2)
nifD100 (19)100 (14)100 (11)50 (1)
nifE100 (19)100 (14)90 (10)100 (2)
Succinoglycan (EPS I) biosynthesis
exoF226 (5)14 (2)
exoI95 (18)100 (14)00
exoI232 (6)36 (5)00
exoP226 (5)14 (2)00
exoW100 (19)93 (13)100 (11)100 (2)
Galactoglucan (EPS II) biosynthesis
expD295 (18)86 (12)100 (11)100 (2)
expE895 (18)100 (14)100 (11)100 (2)
Cyclic β-glucan biosynthesis
cgmB07 (1)00
Capusular polysaccharide biosynthesis
rkpLMNOPQ16 (3)7 (1)00
rkpRSTZ1100 (19)93 (13)100 (11)100 (2)
rkpT284 (16)86 (12)100 (11)100 (2)
rkpZ216 (3)14 (2)00
Type III secretion system
 T3SSa: rhc, nolUV26 (5)29 (4)00
Type IV secretion system
 T4SSa: rctA, vir100 (19)100 (14)00
 T4SSb: vir47 (9)0100 (11)0
 T4SSd: tra, trb07 (1)100 (11)100 (2)
 T4SSe: tra, trb, virD2, cogG 014 (2)00
 T4SSf: avh37 (7)71 (10)18 (2)0
 T4SSg: tra, trb07 (1)00
Denitrification
napEFDABC100 (19)93 (13)100 (11)100 (2)
nirKV84 (16)29 (4)82 (9)0
norECBQD84 (16)29 (4)82 (9)0
nosRZDFYLX89 (17)36 (5)0
Heme biosynthesis
hemA216 (3)29 (4)00
hemN74 (14)36 (5)73 (8)0
1-Aminocyclopropane-1-carboxylate deaminase
acdS (Smed_5532 ortholog)21 (4)036 (4)100 (2)
acdS (Smed_6456 ortholog)5 (1)36 (5)36 (4)0

aThe percentage and number (in parentheses) of strains possessing a gene or gene cluster are shown for each species group and phenotype cluster.

Presence of variable length symbiosis-related genes in each phenotype cluster of <span class="Species">S. meliloti and <span class="Species">S. medicae aThe percentage and number (in parentheses) of strains possessing a gene or gene cluster are shown for each species group and phenotype cluster.

Conclusions

The results of comparative genomics analysis of the <span class="Species">Sinorhizobium genus provide useful information for understanding the genetic functional features of a wide variety of <span class="Species">Sinorhizobium species strains, and a tool to better understand incompatibility in legume-rhizobia interactions. The correlation between the presence of T4SS and symbiotic efficiency suggest that each Sinorhizobium strain uses a slightly different strategy to obtain maximum compatibility with a host plant. Moreover, these large genomic data sets provide the opportunity to understand the evolution of rhizobia [20] together with mechanisms of host determination, nodulation, and nitrogen fixation. Our overall goal is to combine these data with our previous studies reporting SNPs in M. truncatula [21] and the sinorhizobia reported here [20] to provide a resource for genome-wide association mapping of genes and traits associated with symbiosis and nodulation. Moreover, the information provided here will be useful to study the population genomics of this bacterium and its evolution with Medicago.

Materials and methods

Bacteria used in this study

Illumina GAIIx sequencing was used to sequence the genomes of 32 strains of <span class="Species">S. meliloti, 12 strains of <span class="Species">S. medicae, 2 strains of S. fredii, and 1 strain each of S. saheli and S. terangae (Table S1 in Additional file 1). The S. meliloti and S. medicae strains were chosen from the USDA-ARS Rhizobium Germplasm Collection as representatives of different multi-locus sequence types [45] or obtained from nodules on M. truncatula trap hosts inoculated with slurries of soils obtained from several locations in France [46]. Sinorhizobia were also obtained from nodules of seven M. truncatula genotypes (HM004, HM006, HM007, HM0013, HM014, HM015 and A17) as trap hosts using Salses soil from France. The type-strains of S. fredii (USDA 205), S. saheli (USDA 4893) and S. terangae (USDA 4894) were chosen from the USDA-ARS Rhizobium Germplasm Collection, and S. fredii USDA 207 (syn. HH103) was also included. The Sinorhizobium strains were grown in TY medium at 30°C. DNA from each strain was used for Illumina library construction and extracted from culture grown cells using the Wizard Genomic DNA Purification kit (Promega Corp. Madison, WI, USA) with further purification by phenol extraction.

Illumina DNA sequencing

Paired end libraries were generated using Illumina's Phusion-based library kits following the manufacturer's protocols (Illumina, Hayward, CA, USA). Insert sizes averaged 332 nucleotides (range = 245 to 443). Four samples were multiplexed per lane and sequenced on Illumina GAIIx machines and base-called following the manufacturer's protocols. Sequence reads were paired 90-nucleotide reads. Individual samples averaged just over 1 Gb of sequence (range of 724 to 1,584 Mb per genome for <span class="Species">S. meliloti and <span class="Species">S. medicae strains) translating into an average and minimum coverage of 174× and 108×, respectively, of the approximately 6.7 Mb genome before aligning reads. Raw reads and derived SNP calls were analyzed previously [20]. Sequences were de novo assembled using ABySS [47]. For each strain, several kmers were run and the best resulting assembly was chosen based on assembly contiguity statistics, placement of a subset of high quality read pairs in the assembly with correct spacing, orientation, and comparisons to reference genome sequences.

Automatic gene annotation and clustering CDSs found in the Sinorhizobium genomes

CDSs were predicted using AMIGene (Annotation of Microbial Genomes) software [48] and predicted genes were functionally annotated as described by Vallenet et al. [49]. More than 20 bioinformatics methods were used for functional and relational analyses: homology search in a generalist databank (UniProt) and in more specialized databases (COG, InterPro, and PRIAM profiles for enzymatic classification), prediction of protein localization using TMHMM, SignalP and PsortB tools, computation of synteny groups with all available complete and incomplete (WGS section at NCBI) proteomes, and metabolic network reconstruction using Pathway Tools [49]. This fully automated first round of annotation ended with a functional assignment procedure to infer specific function(s) for each individual gene. This functional assignment was first based on annotations of the S. meliloti 1021 reference genome [50] for strong orthologs (>85% identity over at least 80% of the length of the smallest protein). All data (syntactic and functional annotations and results of comparative analysis) were stored in the relational database SinorhizoScope. Complete sequence data for the 48 Sinorhizobium genomes are publicly available via the MaGe interface [51]. The SRA sequences have also been deposited under accession SRA048718 and sequences and annotation data have been deposited in GenBank under project number PRJNA172127. All protein sequences, including automatic and manually annotated CDSs from the 48 sinorhizobial strains and those of reference strains (<span class="Species">S. meliloti 1021 and <span class="Species">S. medicae WSM419), were clustered by the CD-HIT algorithm [52] using a 70% cut-off for protein identity. Twenty-eight truncated CDSs in the reference strain genomes and 32 annotated CDSs having less than 11 amino acids identified from all strains were removed from analyses.

Phylogenetic analyses

<span class="Species">Sinorhizobium phylogenetic trees were first created based on 645 concatenated protein-coding sequences; genes were included if they were present in a single copy in all strains and the outgroup (<span class="Species">Rhizobium leguminosarum bv. trifolii WSM1325). Homologous sequences were identified in the outgroup by using the MaGe phyloprofile tool to search for bidirectional best hits with at least 70% protein identity across at least 80% of the length of both sequences between the outgroup and S. meliloti 1021. A phylogenetic tree was also created based on 16S rRNA gene sequences and alignment to reference genomes in GenBank. Distances between strains were calculated using the dnadist program in phylip [53] v3.69 with the F84 model of evolution, and a neighbor-joining tree was assembled using the neighbor program. Support for the splits in the neighbor-joining tree was assessed by constructing neighbor-joining trees on 1,000 bootstrapped datasets created with seqboot, then mapping the support values on to the tree created from the whole dataset using the sumtrees program [54]. The tree was rooted by treating the R. leguminosarum strain as an outgroup, and splits with less than 60% support were collapsed to polytomies.

Sinorhizobium symbiotic phenotype assays

The Sino<span class="Species">rhizobium strains and <span class="Species">Medicago genotypes used for phenotype analyses are listed in Table S1 in Additional file 1. Medicago seeds were prepared as described by Bucciarelli et al. [55]. Plant assays were run as a completely randomized block design with three replications in sterile Leonard jar assemblies containing a 1:1 mixture of Sunshine mix #5 (SunGro Horticulture Inc., Vancouver, Canada) and Turface MVP (Profile Product LLC, IL, USA) and inoculated approximately 107 TY-grown Sinorhizobium cells as described previously [56]. Nodulation studies were done at different times, with six plant genotypes tested each time, with one genotype in common. Plants were watered with nitrogen-free plant nutrient solution [55] and incubated in a plant growth chamber at 25°C with a 16-h light condition and at 21°C for 8-h in the dark. Nodule number, color (pink or white), and dry weight, plant dry weight and height, and chlorophyll content of each plant were determined 5 weeks after inoculation. Chlorophyll content in top trifoliate leaves was measured by using a SPAD-502 Chlorophyll Meter (MINOLTA Inc.) and values were averaged. The phenotype data were statistically analyzed by analysis of variance (ANOVA) and Duncan-Waller test using the SAS software package at α = 0.05. A heatmap was created by using default setting of the 'heatmap.2' program in R 2.14.1 software [57].

Construction of type IV secretion system gene mutants

<span class="Species">S. meliloti strain <span class="Species">KH46c and S. medicae strain M2 were selected as recipients for mutation of T4SSb since these strains formed effective nodules on all tested M. truncatula genotypes. Mobilizable virBinactivation plasmids were constructed as follows. The 2.9-kb virBcoding regions from both Sinorhizobium strains were amplified by PCR using the oligonucleotide primers virB XbaI_F (5'-GCTCTAGAAGTCTGGGCTCGTTTCAGA-3') and virB_XbaI_R (5'-CGTCTAGAGCGGACGTCTTGAGGTAGAA-3') containing the newly created XbaI sites (underlined). The PCR products were digested by XbaI and followed by ligation into suicide vector pK18mob to create pMS21 (for KH46c virB) or pMS22 (for M2 virB). These plasmids were digested by SspI and ScaI to delete a 1.6-kb fragment containing the virBto virBcoding region, and the Ω cassette from pHP45Ω was inserted to create pMS25 (KH46c virB::Ω), or pMS26 (M2 virB::Ω). The plasmids pMS25 or pMS26 were introduced into S. meliloti KH46c or S. medicae M2 by triparental mating. Mutated strains were selected on TY agar plates containing 20 μg of chloramphenicol (Cm) per ml and 100 μg of spectinomycin/streptomycin (Sp/Sm) per ml. Gene replacement, double crossover mutants were verified by their antibiotic resistance phenotype (Cm and Sp/Sm resistant, and neomycin sensitive), and by PCR amplification using primers that spanned the insertion sites.

Acetylene reduction assay

The nodulated plant roots were removed aseptically with scissors. Detached roots were placed in air-tight 150 ml serum bottles. Three ml of the air volume in each bottle was replaced by pure <span class="Chemical">acetylene gas (99.8%) using hypodermic syringes. The bottles were incubated at room temperature for 60 minutes. The <span class="Chemical">ethylene concentration in each bottle, before and after incubation, was analyzed by gas chromatography using a Nucon-5765 gas chromatograph (AIMIL Instruments, New Delhi, India) equipped with a flame ionization detector (FID) and a Rt-Alumina BOND/Na2SO4 column (30 m × 0.53 mm) (Restek Corp., Bellefonte, PA, USA). Nitrogen was used as the carrier gas. The operation temperatures for oven, injector, and detector were set at 50°C, 20°C and 104°C, respectively. All the experiments were conducted in triplicate.

Sulfatase activity test

Enzyme solutions were prepared by crushing 10 nodules aseptically in 150 μl sterilized 0.85% <span class="Chemical">NaCl and the mixture was homogenized by votexing for 15 s. <span class="Gene">Sulfatase assays were done as previously described [58]. The method was modified by using 50 mM phosphate buffer, pH 7.0, instead of 0.5 M Tris acetate buffer, pH 8.75.

Abbreviations

CDS: coding sequence; <span class="Disease">Cm: <span class="Chemical">chloramphenicol; COG: Clusters of Orthologous Groups; LCO: lipo-chito-oligosaccharide; N2: dinitrogen; PC: phenotype cluster; Sm: streptomycin; SNP: single nucleotide polymorphism; Sp: spectinomycin; T3SS, T4SS, and T6SS: bacterial type III, IV, and VI protein secretion systems, respectively.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

MS, MJS, NDY, PT and BE wrote the manuscript. MS, BE, LX, JR and RD carried out plant experiments. MS, BE, BB, JM, AKB, ADF, AF, GM and JEW participated in genome sequencing, assembly, and gene annotation. MS, MJS, BE, BB, TU, LX, GP, MJS, <span class="Disease">CM, DV, AL, ZR, JM, AKB, ADF and BMV carried out analysis of the genome sequences. MJS, NY, PT and BM were the principal investigators (PIs) of this study.

Additional file 1

Tables S1 to S8. Click here for file

Additional file 2

Figure S1 and S2. Click here for file
  46 in total

1.  AMIGene: Annotation of MIcrobial Genes.

Authors:  Stéphanie Bocs; Stéphane Cruveiller; David Vallenet; Grégory Nuel; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 2.  Legume-nodulating betaproteobacteria: diversity, host range, and future prospects.

Authors:  Prasad Gyaneshwar; Ann M Hirsch; Lionel Moulin; Wen-Ming Chen; Geoffrey N Elliott; Cyril Bontemps; Paulina Estrada-de Los Santos; Eduardo Gross; Fabio Bueno Dos Reis; Janet I Sprent; J Peter W Young; Euan K James
Journal:  Mol Plant Microbe Interact       Date:  2011-11       Impact factor: 4.171

3.  Comparative genomics of rhizobia nodulating soybean suggests extensive recruitment of lineage-specific genes in adaptations.

Authors:  Chang Fu Tian; Yuan Jie Zhou; Yan Ming Zhang; Qin Qin Li; Yun Zeng Zhang; Dong Fang Li; Shuang Wang; Jun Wang; Luz B Gilbert; Ying Rui Li; Wen Xin Chen
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-14       Impact factor: 11.205

4.  Intergenomic epistasis and coevolutionary constraint in plants and rhizobia.

Authors:  Katy D Heath
Journal:  Evolution       Date:  2009-11-26       Impact factor: 3.694

5.  Genetic Diversity in Bradyrhizobium japonicum Serogroup 123 and Its Relation to Genotype-Specific Nodulation of Soybean.

Authors:  M J Sadowsky; R E Tully; P B Cregan; H H Keyser
Journal:  Appl Environ Microbiol       Date:  1987-11       Impact factor: 4.792

6.  Expression and functional roles of Bradyrhizobium japonicum genes involved in the utilization of inorganic and organic sulfur compounds in free-living and symbiotic conditions.

Authors:  Masayuki Sugawara; Gopit R Shah; Michael J Sadowsky; Oleg Paliy; Justin Speck; Andrew W Vail; Prasad Gyaneshwar
Journal:  Mol Plant Microbe Interact       Date:  2011-04       Impact factor: 4.171

7.  The type IV secretion system of Sinorhizobium meliloti strain 1021 is required for conjugation but not for intracellular symbiosis.

Authors:  Kathryn M Jones; Javier Lloret; Joseph R Daniele; Graham C Walker
Journal:  J Bacteriol       Date:  2006-12-08       Impact factor: 3.490

8.  Regulation and symbiotic significance of nodulation outer proteins secretion in Sinorhizobium fredii HH103.

Authors:  Francisco Javier López-Baena; José María Vinardell; Francisco Pérez-Montaño; Juan Carlos Crespo-Rivas; Ramón A Bellogín; Ma Del Rosario Espuny; Francisco Javier Ollero
Journal:  Microbiology (Reading)       Date:  2008-06       Impact factor: 2.777

9.  A novel highly unsaturated fatty acid moiety of lipo-oligosaccharide signals determines host specificity of Rhizobium.

Authors:  H P Spaink; D M Sheeley; A A van Brussel; J Glushka; W S York; T Tak; O Geiger; E P Kennedy; V N Reinhold; B J Lugtenberg
Journal:  Nature       Date:  1991-11-14       Impact factor: 49.962

10.  Genome sequence of the beta-rhizobium Cupriavidus taiwanensis and comparative genomics of rhizobia.

Authors:  Claire Amadou; Géraldine Pascal; Sophie Mangenot; Michelle Glew; Cyril Bontemps; Delphine Capela; Sébastien Carrère; Stéphane Cruveiller; Carole Dossat; Aurélie Lajus; Marta Marchetti; Véréna Poinsot; Zoé Rouy; Bertrand Servin; Maged Saad; Chantal Schenowitz; Valérie Barbe; Jacques Batut; Claudine Médigue; Catherine Masson-Boivin
Journal:  Genome Res       Date:  2008-05-19       Impact factor: 9.043

View more
  60 in total

1.  Epidemic Spread of Symbiotic and Non-Symbiotic Bradyrhizobium Genotypes Across California.

Authors:  A C Hollowell; J U Regus; K A Gano; R Bantay; D Centeno; J Pham; J Y Lyu; D Moore; A Bernardo; G Lopez; A Patil; S Patel; Y Lii; J L Sachs
Journal:  Microb Ecol       Date:  2015-10-14       Impact factor: 4.552

2.  Comparative Genomics, from the Annotated Genome to Valuable Biological Information: A Case Study.

Authors:  Sabina Zoledowska; Agata Motyka-Pomagruk; Agnieszka Misztak; Ewa Lojkowska
Journal:  Methods Mol Biol       Date:  2021

Review 3.  Biotic interactions in the rhizosphere: a diverse cooperative enterprise for plant productivity.

Authors:  Clelia De-la-Peña; Víctor M Loyola-Vargas
Journal:  Plant Physiol       Date:  2014-08-12       Impact factor: 8.340

4.  An Alkane Sulfonate Monooxygenase Is Required for Symbiotic Nitrogen Fixation by Bradyrhizobium diazoefficiens (syn. Bradyrhizobium japonicum) USDA110T.

Authors:  Justin J Speck; Euan K James; Masayuki Sugawara; Michael J Sadowsky; Prasad Gyaneshwar
Journal:  Appl Environ Microbiol       Date:  2019-11-27       Impact factor: 4.792

5.  Microevolution Rather than Large Genome Divergence Determines the Effectiveness of Legume-Rhizobia Symbiotic Interaction Under Field Conditions.

Authors:  Cintia Jozefkowicz; Silvina Brambilla; Romina Frare; Margarita Stritzler; Mariana Puente; Carlos Piccinetti; Gabriela Soto; Nicolás Ayub
Journal:  J Mol Evol       Date:  2017-08-21       Impact factor: 2.395

6.  Minimal gene set from Sinorhizobium (Ensifer) meliloti pSymA required for efficient symbiosis with Medicago.

Authors:  Barney A Geddes; Jason V S Kearsley; Jiarui Huang; Maryam Zamani; Zahed Muhammed; Leah Sather; Aakanx K Panchal; George C diCenzo; Turlough M Finan
Journal:  Proc Natl Acad Sci U S A       Date:  2021-01-12       Impact factor: 11.205

7.  Replicon-dependent differentiation of symbiosis-related genes in Sinorhizobium strains nodulating Glycine max.

Authors:  Hui Juan Guo; En Tao Wang; Xing Xing Zhang; Qin Qin Li; Yan Ming Zhang; Chang Fu Tian; Wen Xin Chen
Journal:  Appl Environ Microbiol       Date:  2013-12-06       Impact factor: 4.792

8.  Adaptive Evolution of Extreme Acidophile Sulfobacillus thermosulfidooxidans Potentially Driven by Horizontal Gene Transfer and Gene Loss.

Authors:  Xian Zhang; Xueduan Liu; Yili Liang; Xue Guo; Yunhua Xiao; Liyuan Ma; Bo Miao; Hongwei Liu; Deliang Peng; Wenkun Huang; Yuguang Zhang; Huaqun Yin
Journal:  Appl Environ Microbiol       Date:  2017-03-17       Impact factor: 4.792

9.  LDSS-P: an advanced algorithm to extract functional short motifs associated with coordinated gene expression.

Authors:  Hiroyuki Ichida; Sharon R Long
Journal:  Nucleic Acids Res       Date:  2016-05-17       Impact factor: 16.971

Review 10.  Gene Expression in Nitrogen-Fixing Symbiotic Nodule Cells in Medicago truncatula and Other Nodulating Plants.

Authors:  Peter Mergaert; Attila Kereszt; Eva Kondorosi
Journal:  Plant Cell       Date:  2019-11-11       Impact factor: 11.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.