| Literature DB >> 24704842 |
Damien Mornico1, Lucie Miché2, Gilles Béna3, Nico Nouwen4, André Verméglio5, David Vallenet6, Alexander A T Smith7, Eric Giraud8, Claudine Médigue9, Lionel Moulin10.
Abstract
Tropical aquatic species of the legume genus Aeschynomene are stem- and root-nodulated by bradyrhizobia strains that exhibit atypical features such as photosynthetic capacities or the use of a nod gene-dependent (ND) or a nod gene-independent (NI) pathway to enter into symbiosis with legumes. In this study we used a comparative genomics approach on nine Aeschynomene symbionts representative of their phylogenetic diversity. We produced draft genomes of bradyrhizobial strains representing different phenotypes: five NI photosynthetic strains (STM3809, ORS375, STM3847, STM4509 and STM4523) in addition to the previously sequenced ORS278 and BTAi1 genomes, one photosynthetic strain ORS285 hosting both ND and NI symbiotic systems, and one NI non-photosynthetic strain (STM3843). Comparative genomics allowed us to infer the core, pan and dispensable genomes of Aeschynomene bradyrhizobia, and to detect specific genes and their location in Genomic Islands (GI). Specific gene sets linked to photosynthetic and NI/ND abilities were identified, and are currently being studied in functional analyses.Entities:
Year: 2011 PMID: 24704842 PMCID: PMC3899966 DOI: 10.3390/genes3010035
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Maximum likelihood recA phylogeny of Aeschynomene bradyrhizobial symbionts indicating the position of genome-sequenced strains. Adapted from [16]. Numbers at nodes are bootstrap percentages from 100 replicates. Roman numbers indicate clades defined from AFLP analyses and recA phylogeny in [16]. Abbreviations: PB: photosynthetic bradyrhizobia; Sep: separate group with no affiliated number; ND/NI: nod gene dependent/independent symbiotic pathway.
Bacterial strains and genome sequencing statistics.
| Strain/Replicon | Group * | CI Group | Geographical Origin | ND/NI | Sequencing Status | Nb Contigs | Size (bp) | Accession Number |
|---|---|---|---|---|---|---|---|---|
| Photosynthetic | ||||||||
| ORS278 | VIII | 3 | Senegal | NI | Complete | 1 | 7456587 | NC_009445 |
| BTAi1 Chr | Sep | 3 | USA | NI | Complete | 1 | 8264689 | NC_009485 |
| BTAi1 Pl | Complete | 1 | 228826 | NC_009475 | ||||
| III | 3 | Senegal | NI | 454 + Solexa | 497 | 7909110 | CAFI01000001-497 & | |
| X | 3 | F. Guiana | NI | 454 + Solexa | 803 | 7391986 | CAFJ01000001-803 & | |
| VI | 2 | Senegal | Both % | 454 (30X) | 301 | 7632258 | CAFH01000001-301 & | |
| VII | 3 | F. Guiana | NI | Solexa 20X | 152475 | 10121035 § | ERP000868 $ | |
| II | 3 | Mexico | NI | Solexa 20X | 190139 | 9206235 § | ERP000868 $ | |
| Sep | 3 | Mexico | NI | Solexa 20X | 158371 | 8610503 § | ERP000868 $ | |
| Non-photosynthetic strain | ||||||||
| XI | 3 | F. Guiana | NI | 454 + Solexa | 350 | 8469730 | CAFK01000001-350 & | |
|
| ||||||||
| USDA110 | Sep | 1 | USA | ND | Complete | 1 | 9105828 | NC_004463 |
Strain genomes sequenced in this study are in bold. * Species or AFLP group was defined according to [16]; Sep: separate group with no affiliated number; ND/NI: nod-dependent or nod-independent strain according to [13,16]; CI: Aeschynomene cross inoculation group according to [14,15]; %: ORS285 hosts both systems; §: genome size estimates from Solexa read contig assemblies; & EBI accession numbers; $ ENA (trace archive) accession number for solexa reads [18].
Genomic features of bradyrhizobial genomes.
| Strain | GC% | CDS N | CDS L | IGR (bp) | PCD (%) | rRNA | tRNA | MscRNA | NRR (%) |
|---|---|---|---|---|---|---|---|---|---|
| ORS278 | 65.51 | 6748 | 952.1 | 180.23 | 85.50 | 6 | 50 | 10 | 8.76 |
| BTAi1 chr | 64.92 | 7466 | 959.42 | 176.72 | 85.59 | 6 | 52 | 12 | 9.33 |
| BTAi1 pl | 60.71 | 257 | 800.2 | 186.02 | 79.42 | - | - | - | 4.75 |
| ORS285 | 65.23 | 6848 | 955.92 | 184.32 | 85.41 | 4 | 49 | 11 | 10.45 |
| ORS375 | 65.49 | 7348 | 921.28 | 182.92 | 84.99 | 3 | 52 | 11 | 10.22 |
| STM3809 | 66.18 | 7142 | 879.79 | 182.97 | 84.19 | 3 | 47 | 13 | 9.42 |
| STM3843 | 63.3 | 8399 | 878.57 | 159.54 | 86.30 | 3 | 48 | 10 | 5.10 |
| BjUSDA110 | 64.06 | 9648 | 862.74 | 135.17 | 88.98 | 3 | 50 | 3 | 9.53 |
Abbreviations used: CDS: Coding Sequences; CDS-N: CDS number; CDS L: mean CDS Length; PCD: Protein coding density; Msc: Miscellanous; NRR: Nosferatu Repeated Region; bp: base pair.
Figure 2(A) Comparative gene orthology between seven genomes of Bradyrhizobium. Numbers indicates the shared gene sets between groups of strains, satisfying the following criteria: BBH (bidirectional best hits) with 40% amino-acid identity on 80% of the smallest protein; (B) Core and pangenome gene count in the Bradyrhizobiaceae family. CDS indicates the number of protein-coding genes in each genome listed in x-axis. The y-axis indicates the number of genes in core and pangenome when adding genomes in the x-axis to the comparative analysis. The presented tree was built by Maximum likelihood (GTR + I + G model) on a partition of 5 taxonomic marker genes (atpD, dnaK, recA, rpoB, rpoD), with bootstrapped nodes (100 replicates). Japonicum/Bj: B. japonicum USDA110; Rpal: Rhodopseudomonas palustris; Nwi: Nitrobacter winogradskii; Nham: Nitrobacter hamburgensis; Azo: Azorhizobium caulinodans.
Figure 3Circular view of whole genome alignments of bradyrhizobial genomes. Genomes were aligned to a reference genome [for (A) ORS278, (B) ORS285, (C) STM3843, (D) USDA110)]. The localization of aligned genomes is given as a list from inner to outter circles in the center of each figure. Legend (from inner to outter layer of circles: genome scale (kb), GC deviation (gray), GC skew (orange), aligned genomes (aligned parts are in blue, each layer is a different genome, strains are listed close to the reference genome name in the center of each figure, from inner to outer layers), GI (purple), core genome (red), specific genes to reference genomes (pink). Higher resolution images for each genome analysis are available in Supplementary Figure S4.
Figure 4Comparison of core-genome based phylogeny and a dendrogram based on GI distribution across bradyrhizobial genomes.
Figure 5Principal Component Analysis (PCA) of seven Bradyrhizobium genomes, performed on a two dimensional matrix compiling metabolic pathway completion values (the number of enzymatic reactions with coding genes for pathway x in a given organism, divided by the total number of reactions in pathway x as defined in the MetaCyc database; see Experimental Section) across all genomes. In the plot, each genome is represented by a point, whereas pathways are shown as colored vectors. Pathways with correlated completion values across organisms (vectors with similar orientation) have been clustered (corresponding to numbers 1 to 7) and drawn in the same color (vectors of pathways with identical completions overlap). Genomes can be associated with their representative and characteristic groups of metabolic pathways (i.e., vectors “pointing in their direction”). The corresponding pathway functions are listed in Supplementary Table S2. Vector length encodes the quality of representation of the pathway in the presented plot (i.e., the longer a vector is, the more representative its pathway is for genomes on the same side of the plot). Abbreviations: photo−/photo+: non-photosynthetic/photosynthetic strains; Nod+/Nod−: nod-dependent/nod-independent strains).
Figure 6Phyloprofile exploration of genomes according to photosynthetic (A) and nodulation (B) abilities.
List of specific annotated genes in nod-independent bradyrhizobia. Genes in bold are located in HAIs detected in ORS278.
| CDS labels | Gene | Product |
|---|---|---|
| Two component regulatory systems | ||
| BRADO4688 | putative two-component system sensor protein with Hpt domain | |
| BRADO4689 | putative sensor histidine kinase with a response regulator receiver domain | |
| BRADO5320 | putative sensor histidine kinase (PAS & response regulator receiver) | |
| BRADO6874 | putative two component system, regulator receiver (CheY-like protein) | |
| BRADO7009 | putative response regulator receiver (CheY-like protein) | |
| BRADO7010 | putative sensor histidine kinase (receiver & phosphotransferase domains) | |
| BRADO5682 | putative two component sensor histidine kinase | |
| BRADO1407 | putative Two-component system histidine kinase | |
| BRADO5100 | Putative Two-component sensor histidine kinase | |
| Transcriptional Regulators | ||
| BRADO3272 | putative transcriptional regulatory protein, TetR/AcrR family | |
| BRADO3678 | putative transcriptional regulatory protein, TetR family | |
| BRADO5987 | putative transcriptional regulator, TetR family | |
| Cyclases | ||
| BRADO0188 | putative diguanylate cyclase (GGDEF) domain | |
| BRADO1402 | putative diguanylate cyclase (GGDEF) | |
| BRADO2821 | putative Adenylate cyclase with a CHAD domain | |
| BRADO6510 | putative diguanylate cyclase with GGDEF and EAL domains | |
| Circadian clock operon | ||
| BRADO1478 | kaiC | circadian clock protein kinase kaiC |
| BRADO1479 | kaiB | circadian clock protein |
| BRADO1480 | putative signal transduction histidine kinase with PAS/PAC domains | |
| BRADO1481 | putative response regulator receiver (CheY-like protein) | |
| Plant wall degradation | ||
| BRADO1851 | vanB | vanillate
|
| BRADO2335-2343 | protochatechuate transport and degradation (ligJABC) | |
| BRADO2379 | protocatechuate 4,5-dioxygenase (4,5-PCD), alpha chain | |
| BRADO2380 | 2,3-dihydroxyphenylpropionate 1,2-dioxygenase | |
| BRADO2382 | putative transcriptional regulator, PadR-like family | |
| BRADO4665 | hpcB | homoprotocatechuate 2,3-dioxygenase |
| Various Catabolism/Detoxification | ||
| BRADO0718 | putative intradiol ring-cleavage dioxygenase | |
| BRADO1842 | putative amine oxidase | |
| BRADO1843 | putative ATPase, AAA family | |
| BRADO1844 | putative Glutathione S-transferase | |
| BRADO1848 | putative thiosulfate sulfurtransferase with Rhodanese-like domain | |
| BRADO1849 | conserved HP; putative oxidoreductase | |
| BRADO4030 | putative TauD/TfdA family dioxygenase (Taurine catabolism) | |
| BRADO4031 | putative dioxygenase; putative taurine dioxygenase (Taurine catabolism) | |
| NUDIX hydrolase | ||
| BRADO3028 | putative MutT/nudix family protein | |
| BRADO4664 | putative NUDIX hydrolase | |
| BRADO3973 | putative UDP-
| |
| BRADO4235 | putative
| |
| BRADO6639 | UDP-
| |
| BRADO7123 | putative
| |
| Polysaccharide biosynthesis & glycans | ||
| BRADO4794 | putative D-3-phosphoglycerate dehydrogenase | |
| BRADO4795 | putative carbohydrate kinase (xylulose/erythritol kinase, lyx/eryA-like) | |
| BRADO4796-4800 | putative sugar ABC transporter (permease + ATP-binding) | |
| BRADO4801 | putative transcription regulator (EryD-like) | |
| BRADO4802 | putative carbohydrate kinase | |
| BRADO4803 | putative hydrolase (HAD superfamily) | |
| BRADO4804 | putative aldolase/epimerase (AraD-like) | |
| BRADO6336 | putative NiFe-hydrogenase/urease accessory HupE/UreJ family protein | |
| Others | ||
| BRADO0151 | putative Type III polyketide synthase; putative chalcone synthase | |
| BRADO0860 | putative asparagine synthetase (glutamine-hydrolyzing) | |
| BRADO1346 | putative Phenazine biosynthesis protein | |
| BRADO1560 | putative aldolase | |
| BRADO4898 | putative cobalt-precorrin-6A synthase, cbiD (Vitamin B12 biosynthesis) | |
| BRADO4900 | putative CobE protein (Vitamin B12 biosynthesis) | |
| BRADO4906 | putative precorrin-3B synthase (cobG) (Vitamin B12 biosynthesis) | |
| BRADO4910 | putative HupE/UreJ protein | |
| BRADO4916 | putative cobalamin synthase (CobS family) (Vitamin B12 biosynthesis) | |
| BRADO4917 | alpha-ribazole phosphatase (anaerobic pathway cobalamin biosynthesis) | |
| BRADO7016 | indole-3-pyruvate decarboxylase (IAA biosynthesis) | |
| BRADO5389 | cysE | serine acetyltransferase |
| BRADO5390 | nifV | homocitrate synthase |
| BRADO5396 | putative transcriptional regulatory protein (protein cheY homolog) | |
| BRADO5403 | putative carboxymethylenebutenolidase(DLH) | |
| BRADO5410 | putative Ferredoxin, 2Fe-2S | |
| BRADO5411 | putative Aminotransferase, DegT/DnrJ/EryC1/StrS family | |
| BRADO5414 | conserved HP (NifZ domain) | |
| BRADO5416 | LRV protein FeS4 cluster in
| |
| BRADO5417 | conserved HP; TPR repeat | |
| BRADO5418 | putative iron-sulfur cluster assembly protein (SufA-like) | |
| BRADO5424 | putative nifU protein (C-terminal fragment) | |
| BRADO6025 | putative Sensor histidine kinase (HWE family) with a GAF domain | |
| BRADO6026 | coxF | carbon monoxide dehydrogenase, coxF accessory protein |
| BRADO6027 | coxE | carbon monoxide dehydrogenase, coxE accessory protein |
| BRADO6029 | coxL | carbon monoxide dehydrogenase large chain |
| BRADO6031 | coxM | carbon monoxide dehydrogenase medium chain (CO-DH M) |
| BRADO6032 | coxC | carbon monoxide dehydrogenase, coxC signalling protein |
| BRADO0659-0662 | carotenoid synthesis | |
| BRADO2007 | putative heme oxygenase | |
| BRADO2008 | putative bacteriophytochrome | |
| BRADO0941 | autoinducer (acylhomoserine lactone) synthase | |
| BRADO2926 | putative Methyl-accepting chemotaxis protein | |
| BRADO0678 | putative methyl-accepting chemotaxis protein | |
| BRADO3031 | putative methyl-accepting chemotaxis receptor/sensory transducer | |
| BRADO4514 | putative bacterial chemotaxis sensory transducer | |
| BRADO5051 | conserved HP-cheL | |
| Transporters | ||
| BRADO0677 | putative TRAP-dicarboxylate transporter (DctP subunit) | |
| Secretion | ||
| BRADO6338-6344 | general secretion pathway proteins (GspI, GspL) | |
| BRADO7122 | putative HlyD-family secretion protein | |