| Literature DB >> 12952534 |
Marina V Omelchenko1, Kira S Makarova, Yuri I Wolf, Igor B Rogozin, Eugene V Koonin.
Abstract
BACKGROUND: Shuffling and disruption of operons and horizontal gene transfer are major contributions to the new, dynamic view of prokaryotic evolution. Under the 'selfish operon' hypothesis, operons are viewed as mobile genetic entities that are constantly disseminated via horizontal gene transfer, although their retention could be favored by the advantage of coregulation of functionally linked genes. Here we apply comparative genomics and phylogenetic analysis to examine horizontal transfer of entire operons versus displacement of individual genes within operons by horizontally acquired orthologs and independent assembly of the same or similar operons from genes with different phylogenetic affinities.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12952534 PMCID: PMC193655 DOI: 10.1186/gb-2003-4-9-r55
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Examples of horizontally transferred operons
| Operon | Recipient organism and correspondent genes | Probable source | Other probable recipients | Comment |
| Pyruvate:ferredoxin oxidoreductase | Archaea | Aae, Hpy, Bha/Sau | Apparently, the related operon for 2-oxoisovalerate oxidoreductase (TM1758-TM1759) was also transferred from archaea | |
| Sulfate/molybdate transport | Gram-negative bacteria | - | No other such operons in | |
| Putative effector of murein hydrolase | Bacteria | Pab, Mac | ||
| Allophanate hydrolase subunits | Bacteria | Pab | ||
| Dipeptide transporter | Thermotoga/Archaea | Tma | It has several another bacterial operons including VC1091-VC1095 | |
| Ribonucleotide reductase alpha and beta subunit | Bacteria | - | Additional to "archaeal:" Ribonucleotide reductase alpha subunit VNG1644G, beta subunit is apparently lost | |
| Aromatic amino-acid biosynthesis | Bacteria | - | Paralogs of this pair are VNG1646G-VNG1647G | |
| Histidine biosynthesis suboperon | Epsilon-Proteobacteria | - | ||
| Panthothenate synthesis | Gram-positive bacteria | - | ||
| DNA repair SbcDC | Gram-positive bacteria | - | ||
| DNA gyrase A and B | Bacteria | Hbs, Tac, Tvo, Afu, | ||
| Dipeptide transporter | Gamma-Proteobacteria | - | ||
| Glutamate synthase complex | Archaea | - | There is another homolog for gene TM0397 of possible archaeal origin | |
| NADH:ubiquinone oxidoreductase | Bacteria | - | ||
| Phosphate transporter | Bacteria | - | ||
Examples of probable mosaic operons
| Species | Predicted operon | General operon function | Horizontally acquired genes | Probable source of horizontally acquired genes | Functions of horizontally acquired genes |
| RP633-661, RC0980-1008 | Ribosomal operon | RP651 RC0998 | Chlamydia | L29 | |
| Aq001-021 | Ribosomal operon | Aq018a | Archaea | L29 | |
| RP800-804, RC1234-1238 | F0F1-type ATPase | RP804 RC1238 Gram-positive bacteria | Delta subunit | ||
| UU128-138 | F0F1-type ATPase | UU128, UU132_1, UU133, UU134 | Gram-negative bacteria | Epsilon subunit, alpha subunit, delta subunit, delta subunit | |
| ML1139-1146 | F0F1-type ATPase | ML1139 | Gram-negative bacteria | A chain protein | |
| RP134-139, RC175-180 | Ribosomal proteins, transcription antiterminator, SecE | RP134 RC175 | Gram-positive bacteria | Preprotein translocase subunit SecE | |
| Aq1968_1_2 two domains | Histidine biosynthesis | Gram-negative bacteria | Phosphoribosyl-AMP cyclohydrolase | ||
| MJ1037-1038 | Tryptophan biosynthesis | MJ1037 | Bacteria | Tryptophan synthase beta chain | |
| MTH1655-1661 | Tryptophan biosynthesis | MTH1660 | Gram-negative bacteria | Tryptophan synthase alpha chain | |
| VNG0305-0309 | Tryptophan biosynthesis | VNG0307G | Bacteria | Tryptophan synthase beta chain | |
| PabB-folK BH0090-0095 | Tryptophan biosynthesis | PabB, BH0090 | Gram-negative bacteria | Anthranilate/para-aminobenzoate synthases component I | |
| VNG0635G-0647G | NADH:ubiquinone oxidoreductase | VNG0640G | Gram-negative bacteria | NADH dehydrogenase-like protein | |
| RP423-425, RC0588-0590 | Lipid metabolism | RP425, RC0590 | Spirochetes | Undecaprenyl pyrophosphate synthase | |
| VNG1306G-1310G | Succinate dehydrogenase/fumarate reductase | VNG1310G | Actinobacteria | Succinate dehydrogenase subunit C | |
| MG461-466 MPN677-682 | Housekeeping | MG466 MPN682 | Gram-negative bacteria | Ribosomal protein L34 | |
| TM0548-0556 | Leucine/isoleucine biosynthesis | TM0552 TM0555 TM0554 | 2-Isopropylmalate synthase 3-Isopropylmalate dehydratase, small subunit 3-Isopropylmalate dehydratase, large subunit | ||
| PAB888-895 | PAB0890 PAB0893 | Bacteria | 2-Isopropylmalate synthase ( | ||
| CAC3169-3174 | Leucine/isoleucine biosynthesis | CAC3172 CAC3173 CAC3174 Archaea | 3-Isopropylmalate dehydratase, small subunit 3-Isopropylmalate dehydratase, large subunit 2-Isopropylmalate synthase | ||
| TM1243-1251 | Nucleotide metabolism | TM1243 | Archaea | Phosphoribosylaminoimidazole-succinocarboxamide synthase | |
| L0104-0108 | Arginine biosynthesis | L0107 | Gram-negative bacteria | Acetylglutamate kinase | |
| TM1780-1785 | Arginine biosynthesis TM1784 | Archaea | Acetylglutamate kinase | ||
| BB0054-0061 | Carbohydrate metabolism (glycolysis, gluconeogenesis) | BB0057 | Gram-positive bacteria | Glyceraldehyde-3-phosphate dehydrogenase | |
| TM1780-1785 | Arginine biosynthesis | TM1780 | Gram-negative bacteria | Argininosuccinate synthase | |
| MPN573-574 MG391-392 | Molecular chaperones | MPN574 MG393 | Gram-negative bacteria | Heat shock protein (groES) | |
| MPN535-536 MG358-359 | DNA replication, recombination and repair | MPN536 MG359 | Gram-negative bacteria | Holliday junction resolvasome helicase subunit | |
| UU448-449 | DNA replication, recombination and repair | UU448 | Gram-negative bacteria | Holliday junction resolvasome helicase subunit | |
| VNG6305CC-6306C | Tetrahydrobiopterin biosynthesis | VNG6305C | Gram-negative bacteria | Organic radical activating enzyme | |
| VNG0582C-0586C | Energy production and conversion | VNG0582, VNG0583G | Bacteria | Cytochrome b subunit of the bc complex Cytochrome b subunit of the bc complex | |
| AF0321-0325 | Lipopolysaccharide biosynthesis | AF0323b | Bacteria | dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes | |
| DRA0037-DRA0044 | Lipopolysaccharide biosynthesis | DRA0044 | Archaea | dTDP-4-dehydrorhamnose epimerase | |
| MTH1789-1792 | Lipopolysaccharide biosynthesis | MTH1789, MTH1790, MTH1791 | Gram-positive bacteria Bacteria Bacteria | dTDP-D-glucose 4,6-dehydratase dTDP-4-dehydrorhamnose 3,5-epimerase dTDP-glucose pyrophosphorylase |
*The numbering of gene clusters is from the previously published analysis of gene neighborhoods in prokaryotic genomes [25].
Figure 1Genes with different phylogenetic affinities in a ribosomal operon from Aquifex aeolicus and Rickettsia prowazekii. (a) A fragment of ribosomal operon in Aquifex aeolicus (the operon from Thermotoga maritima is shown for comparison), Rickettsia prowazekii and Rickettsia conorrii (operons from other alpha-proteobacteria are shown for comparison). Genes are shown not to scale; the direction of transcription is indicated by arrows and gene numbers/names are given inside each arrow. Orthologous genes are shown by the same color. White arrows show genes in each genome that are unique in this operonic context. Phylogenetic affinity of a gene is shown as a thick colored border on the respective arrow; black denotes belonging to the reference taxon, red denotes not belonging to reference taxon. COG0197 - ribosomal protein L16/L10E; COG0255 - ribosomal protein L29; COG0186 - ribosomal protein S17. For species abbreviations, see Materials and methods. (b) Unrooted maximum-likelihood tree for ribosomal protein L16. Branches supported by bootstrap probability >70% are marked by black circles. Names of the genes from mosaic operons and the respective branches are shown in red. Branches for which the likelihoods of alternative placements were assessed using the RELL method are indicated by circles with numbers (see Table 3). (c) Unrooted maximum-likelihood tree for ribosomal protein L29;. the designations are as in Figure 1b. (d) Unrooted maximum-likelihood tree for ribosomal protein S17; the designations are as in Figure 1b.
Figure 2In situ displacement of the ruvB gene in Mycoplasma. (a) Organization of the Holliday junction resolvasome operon and surrounding genes in bacteria. COG0632 - Holliday junction resolvasome, DNA-binding subunit, COG2255 - Holliday junction resolvasome, DNA-binding subunit, COG0817 - Holliday junction resolvasome, endonuclease subunit, COG0392 - Predicted integral membrane protein, COG0282 - acetate kinase, COG0839 - NADH:ubiquinone oxidoreductase subunit 6 (chain J), COG0244 - ribosomal protein L10, COG0732 - restriction endonuclease S subunits, COG0809 - S-adenosylmethionine:tRNA-ribosyltransferase-isomerase, COG0772 - bacterial cell division membrane protein, COG0624 - acetylornithine deacetylase/succinyl-diaminopimelate desuccinylase and related deacylases, COG1487 - predicted nucleic acid-binding protein, COG1132 - ABC-type multidrug transport system, ATPase and permease components, COG0442 - prolyl-tRNA synthetase, COG0323 - DNA mismatch repair enzyme, COG1408 - predicted phosphohydrolases. The designations are as in Figure 1a. For species abbreviations, see Materials and methods. (b,c) Unrooted maximum-likelihood tree for RuvA (b) and RuvB (c); the designations are as in Figure 1b.
Kishino-Hasegawa test for the analyzed cases of apparent xenologous gene displacement in situ
| Tree* | Diff lnL† | S.E.‡ | RELL-BP§ |
| L19 original | 0.0 | ML | 0.8004 |
| 1R2 | -12.6 | 7.7 | 0.0480 |
| 3R4 | -6.6 | 6.6 | 0.1516 |
| 0.00 | ML | 0.9631 | |
| 1R2 | -27.1 | 15.4 | 0.0369 |
| 0.00 | ML | 0.9883 | |
| 1R2 | -29.3 | 12.8 | 0.0117 |
| 0.00 | ML | 0.8336 | |
| 1R2 | -7.4 | 7.9 | 0.1664 |
| 0.00 | ML | 1.0000 | |
| 1R2 | -151.1 | 25.0 | 0.0000 |
| 0.00 | ML | 0.9005 | |
| 1R2 | -17.0 | 13.3 | 0.0995 |
| 0.00 | ML | 1.0000 | |
| 1R2 | -150.2 | 25.8 | 0.0000 |
| 3R4 | -418.6 | 31.5 | 0.0000 |
| 5R6 | -245.0 | 27.8 | 0.0000 |
| 0.00 | ML | 0.9847 | |
| 1R2 | -52.9 | 18.1 | 0.0007 |
| 3R4 | -31.7 | 14.9 | 0.0146 |
| 0.00 | ML | 1.0000 | |
| 1R2 | -302.7 | 31.6 | 0.0000 |
| 3R4 | -439.1 | 32.1 | 0.0000 |
| 0.00 | ML | 1.0000 | |
| 1R2 | -66.6 | 17.2 | 0.0000 |
| 3R4 | -76.7 | 16.8 | 0.0000 |
*The numbers refer to local rearrangements of the tree as indicated on the corresponding figures. †Difference of the Log-likelihoods relative to the best tree. ‡Standard error of Diff lnL. §Bootstrap probability of the given tree calculated using the RELL method (Resampling of Estimated Log-likelihoods).
Figure 3Genes with different phylogenetic affinities in the lipid biosynthesis operon of Rickettsia. (a) Organization of the lipid biosynthesis operon and surrounding genes in Rickettsia prowazekii and Rickettsia conorrii (operons from three other alpha-proteobacteria are shown for comparison). COG0020 - undecaprenyl pyrophosphate synthase, UppS; COG0575 - CDP-diglyceride synthetase; COG0750 - predicted membrane-associated Zn-dependent proteases; COG0233 - ribosome recycling factor; COG0528 - uridylate kinase; COG0745 - OmpR-like response regulator; COG0642 - signal transduction histidine kinase; COG0729 - outer membrane protein; COG2919 - septum formation initiator; COG0743 - 1-deoxy-D-xylulose 5-phosphate reductoisomerase. The designations are as in Figure 1a. For species abbreviations, see Materials and methods. (b,c) Unrooted maximum-likelihood tree for UppS (b) and CdsA (c); the designations are as in Figure 1b.
Figure 4In situ gene displacement in the NADH-ubiquinone oxidoreductase operon in Halobacterium. (a) Organization of the NADH-ubiquinone oxidoreductase operon in selected archaeal and bacterial genomes. COG0838 - NADH:ubiquinone oxidoreductase subunit 3 (chain A), COG3077 - DNA-damage-inducible protein J, COG0852 - NADH:ubiquinone oxidoreductase 27 kD subunit, COG0649 - NADH:ubiquinone oxidoreductase 49 kD subunit 7, COG1905 - NADH:ubiquinone oxidoreductase 24 kD subunit, COG1894 - NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit, COG1034 - NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G), COG1005 - NADH:ubiquinone oxidoreductase subunit 1 (chain H), COG1143 - Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I), COG0839 - NADH:ubiquinone oxidoreductase subunit 6 (chain J), COG0713 - NADH:ubiquinone oxidoreductase subunit 11 or 4L (chain K), COG1009 - NADH:ubiquinone oxidoreductase subunit 5 (chain L), COG1008 - NADH:ubiquinone oxidoreductase subunit 4 (chain M), COG1007 - NADH:ubiquinone oxidoreductase subunit 2 (chain N). The designations are as in Figure 1a. For species abbreviations, see Materials and methods. (b-d) Unrooted maximum-likelihood tree for NuoH (b), NuoI (c) and NuoJ (d); the designations are as in Figure 1b.
Figure 5Genes with different phylogenetic affinities in the lipopolysaccharide biosynthesis operon of Methanothermobacter thermoautotrophicus and Deinococcus radiodurans. (a) Organization of the lipopolysaccharide biosynthesis operon in different prokaryotes. COG1091 - dTDP-4-dehydrorhamnose reductase; COG1209 dTDP-glucose pyrophosphorylase; COG1898 - dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes; COG1088 - dTDP-D-glucose 4,6-dehydratase. The designations are as in Figure 1a. For species abbreviations, see Materials and methods. (b-e) Unrooted maximum-likelihood tree for RfbB (b), RfbC (c), RfbA (d) and RfbD (e); the designations are as in Figure 1b.
Figure 6Genes with different phylogenetic affinities in the leucine/isoleucine biosynthesis operon. (a) Operon organization in different prokaryotic species. COG0028 - acetolactate synthase, large subunit; COG0440 - acetolactate synthase, small subunit; COG0059 - ketol-acid reductoisomerase; COG0129 - dihydroxyacid dehydratase; COG0119 - isopropylmalate synthases; COG0473 - isocitrate/isopropylmalate dehydrogenase; COG0066 - 3-isopropylmalate dehydratase, small subunit; COG0065 - 3-isopropylmalate dehydratase, large subunit. The designations are as in Figure 1a. For species abbreviations, see Materials and methods. (b-e) Unrooted maximum-likelihood tree for LeuA (b), LeuB (c), LeuC (d) and LeuD (e); the designations are as in Figure 1b.