| Literature DB >> 18226230 |
Christoph Kneip1, Christine Voss, Peter J Lockhart, Uwe G Maier.
Abstract
BACKGROUND: Bacteria occur in facultative association and intracellular symbiosis with a diversity of eukaryotic hosts. Recently, we have helped to characterise an intracellular nitrogen fixing bacterium, the so-called spheroid body, located within the diatom Rhopalodia gibba. Spheroid bodies are of cyanobacterial origin and exhibit features that suggest physiological adaptation to their intracellular life style. To investigate the genome modifications that have accompanied the process of endosymbiosis, here we compare gene structure, content and organisation in spheroid body and cyanobacterial genomes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18226230 PMCID: PMC2246100 DOI: 10.1186/1471-2148-8-30
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Gene content in, and downstream of, the . Blue and red bars represent orfs coded on the leading or lagging strand of DNA, respectively. The locations of pseudogenes in the spheroid body fragment have been indicated with green bars. Genes have been named either according to homology matches in BLAST analyses or numbered consecutively for each organism (see also additional files 1 and 2). A GATA [29] plot is shown and indicates regions of high synteny between both organisms. GATaligner settings were: Window size: 100; Match: 5; MisMatch:-4; Gap Creation:-10; Gap Extension:-4; Raw Score Cut Off: 80. GATAPlotter score settings: Max: 141 bits, expect 1E-33; Min: 46.8 bits, expect 5E-5. GATAPlotter scores have been represented using a greyscale bar. Regions of the spheroid body genome showing modifications of special interest have been indicated. A) Gene inactivation by pseudogenisation (e.g. fdxN*); B) Gene deletion with DNA loss (e.g. cyl0012); C) Gene deletion without DNA loss resulting in large non-coding regions (e.g. cyl0016); D) Gene deletion with DNA loss resulting in gene fusion (e.g. cyl0019). See text for further description of individual modifications.
Figure 2Supernetworks displaying relationships inferred from phylogenetic analysis of different gene regions. A. Supernetwork reconstructed using strict consensus maximum likelihood trees for NifB, NifN, NifS and ABC, HesA, NdhK, ModC, PerM, UvrD, SAM, Sbr0014 of different cyanobacetria (see Materials and Methods). B. Supernetwork reconstructed using strict consensus maximum likelihood trees for NifD, NifH, NifK, NifE, and ABC, HesA, NdhK, ModC, PerM, UvrD, SAM, Sbr0014. The networks are outgroup rooted using Synechocystis PCC 6803. A reticulation occurs in the centre of both graphs because of the local instability in the placement of Synechocystis PCC 6803.
Genome features of obligate intracellular symbiotic and parasitic bacteria
| Species | Genome size (Mbp) | AT-content genome (%) | AT-content 16S rDNA (%) | ATcontent 1st codon | ATcontent 2nd codon | ATcontent 3rd codon | Ref. |
|---|---|---|---|---|---|---|---|
| 0.66 | 73.7 | 49.86 | 62.62 | 69.41 | 85.77 | [39] | |
| 0.71 | 72.6 | 52.89 | 61.33 | 68.08 | 83.95 | [7] | |
| 0.16 | 83.4 | 64.15 | 80.26 | 79.61 | 92.22 | [9] | |
| 1.04 | 58.7 | 48.58 | 48.39 | 61.17 | 65.52 | [40] | |
| 1.11 | 71.0 | 49.5 | 58.95 | 68.24 | 81.53 | [22] | |
| 4.17 | 45.3 | 45.29 | 38.83 | 57.86 | 34.73 | [12] | |
| 0.7 | 77.5 | 51.19 | 69.12 | 71.84 | 88.12 | [38] | |
| n.d. | 60.2 (fragment) | 45.42 | 49.16 | 61.40 | 66.81 | this paper |
Genome size, AT-content and nucleotide composition of each codon position are indicated
(References: [7,9,12,22,38–40]). n.d.: not determined
Figure 3Analysis of Ferredoxin N. A. Multiple alignment of cyanobacterial FdxN proteins. Accession numbers: CyaATCC51142: AAW56985.1 (Cyanothece sp. ATVV51142); CyaPCC8801: AAC33373.1 (Cyanothece sp. PCC8801); CyaCCY0110: ZP_01727762.1 (Cyanothece sp. CCY0110); GloeK068DGA: BAF47148.1 (Gloeothece sp. KO68DGA); NostPCC7120: AAA22005.1 (Nostoc punctiforme PCC 7120); AnabATCC29413: YP_324413.1 (Anabaena variabilis ATCC 29413). B. Alignment of Cyanothece sp. ATCC51142 FdxN protein with the spheroid body fdxN* pseudogene translated in 3 forward reading frames. Evidence of homology, at the level of amino acid similarity, is distributed across all 3 reading frames of the pseudogene, indicating multiple substitutions and single nucleotide deletion events.
Figure 4Truncation of . A. GATA plot for Cyanothece and spheroid body nifU indicating conserved regions. GATaligner settings: Window size: 100; Match: 5; MisMatch:-4; Gap Creation:-10; Gap Extension:-4; Raw Score Cut Off: 92.0. GATAPlotter score settings: Max: 141 bits, expect 9E-38; Min: 28 bits, expect 9E-4. GATAPlotter scores are indicated. B. Multiple alignment of predicted amino acid sequences for NifU indicating an N-terminal truncation in the homologue from the endosymbiont. NifU accession numbers: Cya51142: AAW56987.1 (Cyanothece sp. ATCC 51142); Cya0110: ZP_01727764.1 (Cyanothece sp. CCY0110); GloKO68DGA: BAF47150.1 (Gloeothece sp. KO68DGA); Cya8801: AAC33371.1 (Cyanothece sp. PCC 8801); Lyn8106: ZP_01620769.1 (Lyngbya sp. PCC 8106); TriIMS101: AAF82636.1 (Trichodesmium sp. IMS101); NdCCY9414: ZP_01628437.1 (Nodularia spumigena CCY9414); Nos73102: ZP_00112317.1 (Nostoc punctiforme PCC 73102); SynJA2: YP_476679.1 (Synechococcus sp. JA-2-3B'a(2–13)); sb8: AAW57048.1 (spheroid body); Syn6803: NP_442853.1 (Synechocystis sp. PCC6803); Glo7421: NP_925823.1 (Gloeobacter violaceus PCC 7421); Cro8501: ZP_00516385.1 (Crocosphaera watsonii WH 8501).
Figure 5Spheroid body . A. Alignment of predicted amino acid sequences for spheroid body protein Sbl0010 and Cyanothece proteins Cyl0018 and Cyl0020. Deletion in the endosymbiont genome of cyl0019 in the creation of sbl0010 can be inferred during reductive genome evolution. In Sb10010, homologues of Cyl0018 and Cyl0020 have been conserved in full length and are separated by a 17 amino acid residues. Cyl0018: green, Cyl0020: orange, Sbl0010: black. B. Cyl0018 and Cyl0020 are highly conserved in cyanobacteria closely related to the spheroid body. They are aseparated by 1–5 genes when they co-occur at the same locus, but in some cases they are encoded at different loci of the genome (indicated by x).
Figure 6A/T-G/C frequencies in . A/T-G/C-plot for the genome region shown in Fig. 1 (red: AT; blue:G/C), indicating a high AT composition in the large non-coding regions (black arrows) of the spheroid body fragment.
Figure 7PCR analysis of missing or pseudogenisised genes. PCR analysis of Cyanothece sp. ATCC 51142 (CY), R. gibba (RG) and spheroid body (SB) DNA. Amplification with primers specific for the cyanobacterial fdxN, petJ, psbC, cyl0012 and cyl0017 genes show that the analysed genes are not encoded in R. gibba and the endosymbiont's genome. CYrecF and SBrecF were used as positive controls. GeneRuler™ Express DNA Ladder (Fermentas) was used as molecular weight standard.
Figure 8Pairwise alignments for RecA and for RecF. Alignment of RecA (left) and RecF (right) homologues from the spheroid body (SB) and Cyanothece sp. ATCC51142 (CY) genomes. The spheroid body encodes complete full length recA and recF orfs.
Pseudogenes in all sequenced spheroid body genome fragments (totalling 192335 bp) and pseudogenes identified in the Cyanothece sp. ATCC 51142 genome region shown in Figure 1 (63344 bp)
| Name | Best BlastX hit | Organism | Accession | e-value | |
|---|---|---|---|---|---|
| sbp1* | FdxN | 0.070 | |||
| sbp2* | Flavodoxin, long chain | 2e-05 | |||
| sbp3* | hypothetical protein CwatDRAFT_1967 | 7e-05 | |||
| sbp4* | hemolysin | 3e-09 | |||
| sbp5* | Putative esterase | 6e-16 | |||
| sbp6* | COG0654: 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | 2e-17 | |||
| sbp7* | hypothetical protein CY0110_23906 | 5e-09 | |||
| sbp8* | hypothetical protein CY0110_11227 | 1e-18 | |||
| sbp9* | Peptidase S49, SppA 67 kDa type:Peptidase S49, SppA | 3e-09 | |||
| sbp10* | GDP-D-mannose dehydratase | 1e-131 | |||
| sbp11* | NAD-dependent epimerase/dehydratase | 5e-103 | |||
| sbp12* | hypothetical protein CwatDRAFT_2668 | 3e-10 | |||
| sbp13* | Uma4 (transposase homolog) | 4e-05 | |||
| sbp14* | hypothetical protein CY0110_27355 | 9e-07 | |||
| sbp15* | thioredoxin reductase | 8e-88 | |||
| sbp16* | putative transposase | 6e-65 | |||
| sbp17* | hypothetical protein DSY2261 | 1e-14 | |||
| sbp18* | Mg chelatase-related protein | 2e-06 | |||
| sbp19* | hypothetical protein CY0110_07314 | 2e-103 | |||
| sbp20* | Putative Anti-Sigma regulatory factor (Ser/Thr protein kinase) | Cyanothece sp. CCY0110 | 0.011 | ||
| sbp21* | Anti-Sigma-factor antagonist (STAS) and sugar transfersase | 7e-53 | |||
| sbp22* | transposase | 4e-20 | |||
| sbp23* | hypothetical protein CY0110_12822 | 5e-28 | |||
| sbp24* | orfAB | 3e-05 | |||
| sbp25* | Protein of unknown function DUF820 | 2e-13 | |||
| sbp26* | probable sulfotransferase | 4e-50 | |||
| sbp27* | COG3464: Transposase and inactivated derivatives | Nostoc punctiforme PCC 73102 | 2e-11 | ||
| sbp28* | serine/threonine protein kinase | 4e-26 | |||
| sbp29* | hypothetical protein CwatDRAFT_4770 | 8e-07 | |||
| sbp30* | DedA | 2e-53 | |||
| sbp31* | hypothetical protein CY0110_03639 | 5e-06 | |||
| sbp32* | Sodium:galactoside symporter | 5e-22 | |||
| sbp33* | photosystem II CP43 protein | 1,0 | |||
| sbp34* | photosystem II D2 protein | 5e-05 | |||
| cyp1* | hypothetical protein CY0110_22382 | Cyanothece sp. CCY0110 | 2e-11 |
Pseudogenes within the nif-operon and downstream regions (genome regions shown in Figure 1) are indicated in grey.