| Literature DB >> 16640791 |
J Peter W Young1, Lisa C Crossman, Andrew W B Johnston, Nicholas R Thomson, Zara F Ghazoui, Katherine H Hull, Margaret Wexler, Andrew R J Curson, Jonathan D Todd, Philip S Poole, Tim H Mauchline, Alison K East, Michael A Quail, Carol Churcher, Claire Arrowsmith, Inna Cherevach, Tracey Chillingworth, Kay Clarke, Ann Cronin, Paul Davis, Audrey Fraser, Zahra Hance, Heidi Hauser, Kay Jagels, Sharon Moule, Karen Mungall, Halina Norbertczak, Ester Rabbinowitsch, Mandy Sanders, Mark Simmonds, Sally Whitehead, Julian Parkhill.
Abstract
BACKGROUND: Rhizobium leguminosarum is an alpha-proteobacterial N2-fixing symbiont of legumes that has been the subject of more than a thousand publications. Genes for the symbiotic interaction with plants are well studied, but the adaptations that allow survival and growth in the soil environment are poorly understood. We have sequenced the genome of R. leguminosarum biovar viciae strain 3841.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16640791 PMCID: PMC1557990 DOI: 10.1186/gb-2006-7-4-r34
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Genome statistics for Rlv3841
| Replicon | Base pairs | Percentage G+C | Protein-encoding genes | Percentage Coding | Mean protein length (aa) | rRNA operons | tRNA genes |
| Chromosome | 5,057,142 | 61.1 | 4,736 | 86.3 | 309 | 3 | 52 |
| pRL12 | 870,021 | 61.0 | 790 | 90.3 | 335 | ||
| pRL11 | 684,202 | 61.0 | 635 | 87.5 | 318 | ||
| pRL10 | 488,135 | 59.6 | 461 | 81.7 | 304 | ||
| pRL9 | 352,782 | 61.0 | 313 | 88.8 | 337 | ||
| pRL8 | 147,463 | 58.7 | 140 | 83.4 | 306 | ||
| pRL7 | 151,546 | 57.6 | 188 | 74.6 | 224 | ||
| Total | 7,751,309 | 60.86 | 7,263 | 86.4 | 309 | 3 | 52 |
aa, amino acids; Rlv3841, Rhizobium leguminosarum biovar viciae 3841.
Figure 1The chromosome and six plasmids of Rlv3841. The plasmids are shown at the same relative scale, and the chromosome at one-fourth of that scale. Circles from outermost to innermost indicate genes in forward and reverse orientation: all genes, membrane proteins (bright green), conserved and unconserved hypotheticals (brown conserved, pale green unconserved), phage and transposons (pink, shown for pRL7 only), and (for the chromosome only) DNA transcription/restriction/helicases (red) and transcriptional regulators (blue). Inner circles indicate deviations in G+C content (black) and G-C skew (olive/maroon). The full list of Sanger Institute standard colors for functional categories is as follows: white = pathogenicity/adaptation/chaperones (shown here in black); dark grey = energy metabolism (glycolysis, electron transport, among others); red = information transfer (transcription/translation + DNA/RNA modification); bright green = surface (inner membrane, outer membrane, secreted, surface structures [lipopolysaccharide, among others]); and dark blue = stable RNA; turquoise = degradation of large molecules; pink/purple = degradation of small molecules; yellow = central/intermediary/miscellaneous metabolism; pale green = unknown; pale blue = regulators; orange/brown = conserved hypo; dark brown = pseudogenes and partial genes (remnants); light pink = phage/insertion sequence elements; light grey = some miscellaneous information (for example, Prosite) but no function. bp, base pairs; Rlv3841, R. leguminosarum biovar viciae strain 3841.
Figure 2Distribution of functional classes of genes within replicons. The classes are based on those presented by Riley [86].
Figure 3Protein-encoding genes on the chromosome and six plasmids of Rlv3841, showing their nucleotide composition. GC3s (G+C content of silent third positions of codons) is a sensitive measure of composition. Symbols indicate whether each gene encodes a quartop protein (with orthologs in A. tumefaciens, S. meliloti, and M. loti) and, if so, which phylogenetic topology it supports (RA-SM denotes the tree that pairs R. leguminosarum with A. tumefaciens, and S. meliloti with M. loti; RM-AS and RS-AM are similarly defined). In addition, the nodulation genes nodOTNMLEFDABCIJ are identified on pRL10. Rlv3841, R. leguminosarum biovar viciae strain 3841.
Figure 4Detail of part of Figure 3, showing a chromosomal island. The island extends from 855 to 908 kilobases, genes RL0790-RL0841, and is recognizable by low GC3s (G+C content of silent third positions of codons) and absence of quartop genes. RA-SM denotes the tree that pairs R. leguminosarum with A. tumefaciens, and S. meliloti with M. loti; RM-AS and RS-AM are similarly defined.
Figure 5Dinucleotide compositional analysis of 100-kilobase windows of the genomes of Rlv3841 and A. tumefaciens C58. On the first two axes of a principal components analysis of the symmetrized dinucleotide relative abundance (DRA) of both genomes analyzed jointly, sequences from each chromosome (chr) and plasmid are identified by distinct symbols. PC1 accounts for 48.9% and PC2 for 35.6% of the total variance. Rlv3841, R. leguminosarum biovar viciae strain 3841.
Figure 6Cumulative distribution of the eight-base motif GGGCAGGG in the genome of Rlv3841. The motif is shown in forward and reverse orientation on chromosome and plasmids. Rlv3841, R. leguminosarum biovar viciae strain 3841.
Figure 7Phylogeny of completely sequenced genomes of selected α-proteobacteria. The phylogeny is based on the concatenated sequences of 648 orthologous proteins. Neighbor-Joining method with % bootstrap support indicated. Scale indicates substitutions per site.
Phylogenies supported by quartets of orthologous proteins shared between Rlv3841, A. tumefaciens, S. meliloti, and M. loti
| Total proteins | Number in quartops | Percentage in quartops | Phylogeny supported | |||
| RA-SMa | RS-AM | RM-AS | ||||
| Chromosome | 4,736 | 1,798 | 38.0 | 488 | 165 | 92 |
| pRL12 | 790 | 70 | 8.9 | 7 | 23 | 10 |
| pRL11 | 635 | 124 | 19.5 | 36 | 22 | 10 |
| pRL10 | 461 | 28 | 6.1 | 10 | 6 | 2 |
| pRL9 | 313 | 30 | 9.6 | 9 | 5 | 7 |
| pRL8 | 140 | 0 | 0 | 0 | 0 | 0 |
| pRL7 | 188 | 6 | 3.2 | 1 | 1 | 4 |
| All | 7,263 | 2,056 | 28.3 | 551 | 222 | 125 |
aNumber of quartops supporting the phylogeny ([R. leguminosarum, A. tumefaciens], [S. meliloti, M. loti]) with at least 99% probability (and likewise for the other two possible topologies). Rlv3841, Rhizobium leguminosarum biovar viciae 3841.
Numbers of genes unique to Rlv3841 or shared with one or more related genomes
| Distributiona | Chromosome | pRL12 | pRL11 | pRL10 | pRL9 | pRL8 | pRL7 | All replicons |
| R | 1,145 | 293 | 276 | 214 | 128 | 91 | 123 | 2,270 |
| R+A | 236 | 88 | 34 | 46 | 28 | 15 | 21 | 468 |
| R+S | 267 | 61 | 42 | 36 | 25 | 5 | 9 | 445 |
| R+M | 280 | 78 | 50 | 41 | 27 | 3 | 11 | 490 |
| R+A+S | 347 | 67 | 37 | 20 | 19 | 3 | 6 | 499 |
| R+A+M | 170 | 47 | 19 | 17 | 19 | 17 | 3 | 292 |
| R+S+M | 365 | 68 | 33 | 43 | 25 | 6 | 6 | 546 |
| R+A+S+M | 1,926 | 88 | 144 | 44 | 42 | 0 | 9 | 2,253 |
| Total genes | 4,736 | 790 | 635 | 461 | 313 | 140 | 188 | 7,263 |
| (R+S+M)+Bb | 183 | 25 | 12 | 26 | 10 | 5 | 4 | 264 |
aGenes of Rhizobium leguminosarum biovar viciae 3841 (Rlv3841) are classified by whether they are unique to this genome (R) or have homologs in the genomes of A. tumefaciens (A), S. meliloti (S), or M. loti (M). bOf the 546 genes shared by the three rhizobia (R+S+M) but missing from A, 264 had homologs in B. japonicum, a distantly related rhizobium. Rlv3841, Rhizobium leguminosarum biovar viciae 3841.
The σ factors of Rlv3841
| Gene and name (if known)a | GC3s, quartopb | Closest homologsc | ||
| Species | Gene | BLAST e value | ||
| RL0422 (RpoN, used for N assimilation) | 0.791 Y | SMc01139 | 0.0 | |
| AGR_C_581 | 0.0 | |||
| mll3196 | 2E-161 | |||
| BR0158 | 3E-156 | |||
| RL0788 | 0.732 N | Pflu02001224 | 9E-56 | |
| ECA2017 | E-43 | |||
| BambDRAFT_3017 | 9E-30 | |||
| SMa01432 | 2E-27 | |||
| AGR_C_2739 | 2E-16 | |||
| mlr77732 | E-16 | |||
| RL1138 | 0.714 N | Reut_B3794 | E-47 | |
| bll6484 | 2E-44 | |||
| SPO31254 | E-29 | |||
| mlr7773 | 7E-29 | |||
| AGR_C_2739 | 2E-28 | |||
| SMc01419 | 9E-23 | |||
| RL3020 | 0.749 Y | AGR_C_4121 | 6E-56 | |
| BRA0021 | 2E-32 | |||
| Bcep02004893 | 7E-32 | |||
| CC3253 | 2E-30 | |||
| mll3493 | 4E-25 | |||
| SMb20531 | 2E-18 | |||
| RL3235 | 0.777 N | SMc01419 | E-48 | |
| AGR_C_3679p | 9E-44 | |||
| SalaDRAFT_1954 | 5E-23 | |||
| bll2628 | 6E-23 | |||
| mlr04075 | E-18 | |||
| RL3402 (RpoD, 'housekeeping' σ) | 0.823 Y | AGR_C_3929p | 0.0 | |
| SMc01563 | 0.0 | |||
| BR1479 | 0.0 | |||
| mll2466 | 0.0 | |||
| RL3509 | 0.758 N | SMc02713 | 2E-38 | |
| SalaDRAFT_1954 | 4E-18 | |||
| Bcepa03005434 | 5E-18 | |||
| DSM 12444 | 2E-17 | |||
| AGR_C_3679 | E-14 | |||
| mlr7773 | 5E-12 | |||
| RL3703 (RpoZ) | 0.790 Y | SMc01506 | 5E-84 | |
| BR1658 | 3E-80 | |||
| BH13830 | 5E-73 | |||
| AGR_L_1386p | 5E-68 | |||
| mll3697 | 7E-68 | |||
| RL3766 (RpoH, heat shock σ) | 0.794 Y | AGR_C_4439p | 2E-155 | |
| SMc00646 | 3E-148 | |||
| mlr3741 | 5E-138 | |||
| BR1650 | E-133 | |||
| RL4524 | 0.780 N | mlr8088 | 5E-38 | |
| Rrub02003750 | 3E-37 | |||
| 'rpoE' | 3E-34 | |||
| Magn03011107 | E-33 | |||
| None listed in top 50 matches | ||||
| None listed in top 50 matches | ||||
| RL4614 (RpoH2, heat shock σ) | 0.771 N | SMc03873 | 5E-118 | |
| mlr3862 | 2E-98 | |||
| BR1763 | 5E-97 | |||
| BH15210 | 3E-95 | |||
| AGR_C_4439pd | 7E-58 | |||
| pRL10385 | 0.720 N | mll1224 | 2E-52 | |
| Mdeg02003584 | E-28 | |||
| Rsph03003515 | E-27 | |||
| AcidDRAFT_4778 | 6E-22 | |||
| None listed in top 50 matches | ||||
| None listed in top 50 matches | ||||
| pRL11007 (EcfR) | 0.778 N | mlr0407 | 4E-57 | |
| Rsph03002607 | E-45 | |||
| Magn03006880 | 8E-39 | |||
| BPP1584 | 4E-37 | |||
| AGR_L_1386p | 3E-24 | |||
| SMa0143 | 2E-22 | |||
| pRL110418 | 0.794 N | blr7812 | 2E-83 | |
| SAV7424 | 7E-74 | |||
| nfa4970 | 3E-73 | |||
| Francci3DRAFT_3517 | 7E-66 | |||
| mll6869 | 8E-33 | |||
| None listed in top 50 matches | ||||
| None listed in top 50 matches | ||||
| pRL120319 (RpoI, iron uptake sigma) | 0.640 N | AvinDRAFT_4065 | 4E-20 | |
| 'pvdS'e | 3E-18 | |||
| SMc04203 | 8E-09 | |||
| AGR_pAT_442 | 2E-08 | |||
| pRL120580 | 0.587 N | RPA0550 | 4E-40 | |
| Rrub02003750 | 8E-38 | |||
| Magn03011107 | 5E-37 | |||
| PSPTO1043 | 3E-28 | |||
| mlr8088 | 4E-26 | |||
| AGR_C_3605 | 6E-14 | |||
| None listed in top 50 matches | ||||
aRhizobium leguminosarum biovar viciae 3841 (Rlv3841) locus tags are shown and, if the gene has been named in R. leguminosarum, this name is shown. bGC3s (G+C content of silent third positions of codons), member of quartop (yes [Y]/no [N]). cEach of the Rlv3841 sigma factors was used in BLASTP searches (maximum of 50 matches). The sequences and the locus tag numbers, together with the 'e' values, are shown for the top four matches. In those genera (Brucella, Pseudomonas, Streptomyces, Burkholderia) with more than one sequenced species, only one example is shown for each genus. In those cases in which one or more of the species used in the quartop analysis is not in the top four matches, these are also included. dA. tumefaciens has only one rpoH gene (AGR_C_4439p) whereas R. leguminosarum, M. loti, and S. meliloti have two. AGR_C_4439p is more closely related to Rlv3841 RpoH1 (RL3766) than to Rlv3841 RpoH2 (RL4614). eAlso known as PfrI in P. putida, and PbrA in P. fluorescens.
Classification of ABC transporters of Rlv3841 compared to those of S. meliloti 1021
| Family | ||||
| ABC genes | Complete systems | ABC genes | Complete systems | |
| Uptake systems | ||||
| CUT1 | 46 | 36 | 29 | 26 |
| CUT2 | 35 | 29 | 37 | 37 |
| FeCT | 5 | 3 | 5 | 5 |
| FeT | 2 | 2 | 1 | 1 |
| HAAT | 22 | 11 | 10 | 5 |
| MolT | 1 | 1 | 1 | 1 |
| MZT | 3 | 3 | 2 | 2 |
| NitT | 11 | 5 | 4 | 4 |
| PAAT | 16 | 13 | 16 | 13 |
| PepT | 49 | 30 | 33 | 25 |
| PhnT | 3 | 1 | 1 | 1 |
| PhoT | 2 | 2 | 1 | 1 |
| POPT | 8 | 7 | 12 | 12 |
| QAT | 6 | 5 | 5 | 4 |
| SulT | 1 | 1 | 2 | 1 |
| TauT | 0 | 0 | 1 | 1 |
| ThiT | 2 | 0 | 1 | 1 |
| Unclassified | 26 | 13 | 18 | 6 |
| Efflux systems | ||||
| Export | 31 | 21 | 21 | 18 |
| Total | 269 | 183 | 200 | 164 |
Families are defined according to Saier [93]. Complete uptake systems possess at least one solute binding protein, one ATP-binding cassette (ABC), and one integral membrane protein. Complete export systems need only possess one ABC and one integral membrane protein.
Distribution and phylogeny of ABC transporter genes
| Transporter genes | Percentage of all genes | Number in quartops | Percentage in quartops | Phylogeny supported | |||
| RA-SMa | RS-AM | RM-AS | |||||
| Chromosome | 383 | 8.1 | 128 | 33.4 | 31 | 20 | 16 |
| pRL12 | 172 | 21.7 | 12 | 7.0 | 0 | 9 | 0 |
| pRL11 | 77 | 12.1 | 29 | 37.7 | 11 | 3 | 1 |
| pRL10 | 83 | 18.0 | 8 | 9.6 | 5 | 1 | 0 |
| pRL9 | 86 | 27.5 | 14 | 16.3 | 3 | 3 | 4 |
| pRL8 | 15 | 10.6 | 0 | 0 | 0 | 0 | 0 |
| pRL7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| All | 816 | 11.2 | 191 | 28.3 | 50 | 36 | 21 |
aNumber of quartops supporting the 'core' phylogeny ([R. leguminosarum, A. tumefaciens], [S. meliloti, M. loti]) with at least 99% probability (and likewise for the other two possible topologies). ABC, ATP-binding cassette.
Figure 8Genes on the chromosome and six plasmids of Rlv3841 encoding components of ABC transporter systems. The nucleotide composition (GC3s, i.e. G+C content of silent third positions of codons) of the components is shown. Members of the most abundant families are indicated, defined according to Saier [93]. ABC, ATP-binding cassette; Rlv3841, R. leguminosarum biovar viciae strain 3841.