| Literature DB >> 16472375 |
Jean-François Pombert1, Claude Lemieux, Monique Turmel.
Abstract
BACKGROUND: The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. The basal position of the Prasinophyceae has been well documented, but the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae is currently debated. The four complete chloroplast DNA (cpDNA) sequences presently available for representatives of these classes have revealed extensive variability in overall structure, gene content, intron composition and gene order. The chloroplast genome of Pseudendoclonium (Ulvophyceae), in particular, is characterized by an atypical quadripartite architecture that deviates from the ancestral type by a large inverted repeat (IR) featuring an inverted rRNA operon and a small single-copy (SSC) region containing 14 genes normally found in the large single-copy (LSC) region. To gain insights into the nature of the events that led to the reorganization of the chloroplast genome in the Ulvophyceae, we have determined the complete cpDNA sequence of Oltmannsiellopsis viridis, a representative of a distinct, early diverging lineage.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16472375 PMCID: PMC1402334 DOI: 10.1186/1741-7007-4-3
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
General features of Oltmannsiellopsis and other chlorophyte cpDNAs
| Size (bp) | |||||
| Total | 200,799 | 150,613 | 151,933 | 195,867 | 203,827 |
| IR | 46,137 | --a | 18,510 | 6,039 | 22,211 |
| LSC | 92,126 | --a | 33,610 | 140,914 | 81,307b |
| SSC | 16,399 | --a | 81,303 | 42,875 | 78,088b |
| A+T (%) | 57.9 | 68.4 | 59.5 | 68.5 | 65.5 |
| Coding sequences (%)c | 68.7 | 60.9 | 59.2 | 62.3 | 50.1 |
| Genes (no.)d | 128 | 112 | 105 | 105 | 94 |
| Introns (no.) | |||||
| Group I | 0 | 3 | 5 | 27 | 5 |
| Group II | 0 | 0 | 0 | 0 | 2 |
a Because Chlorella cpDNA lacks an IR, only the total size of this genome is given.
b The LSC and SSC regions of Chlamydomonas cpDNA cannot be distinguished unambiguously.
c Conserved genes, unique ORFs and introns were considered as coding sequences.
d Genes present in the IR were counted only once. Unique ORFs and intron ORFs were not taken into account.
Figure 1Gene maps of . Genes (filled boxes) outside each map are transcribed clockwise. The transcription direction of the rRNA genes is indicated by arrows. Genes shown in yellow, blue and red map to the IR, LSC and SSC regions in Mesostigma cpDNA, respectively. On the Oltmannsiellopsis map, genes characteristic of the LSC region that reside in the single copy region corresponding to SSC in both Oltmannsiellopsis and Pseudendoclonium cpDNAs are denoted by asterisks. Genes absent from Mesostigma cpDNA are shown in grey. tRNA genes are indicated by the one-letter amino acid code followed by the anticodon in parentheses (Me, elongator methionine; Mf, initiator methionine). A total of five introns (open boxes) were identified, some of which feature ORFs (narrow boxes).
Differences between the gene repertoires of Oltmannsiellopsis and other UTC algal cpDNAs
| ● | ● | ● | ○ | |
| ● | ● | ○ | ● | |
| ● | ● | ● | ○ | |
| ● | ● | ○ | ● | |
| ● | ● | ○ | ● | |
| ● | ○ | ○ | ○ | |
| ● | ○ | ○ | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ● | ● | ○ | |
| ● | ○ | ● | ○ | |
| ● | ○ | ● | ○ | |
| ● | ○ | ○ | ○ | |
| ● | ○ | ● | ○ | |
| ○ | ● | ● | ○ | |
| ● | ○ | ○ | ○ | |
| ● | ○ | ○ | ○ |
a A filled/open circle denotes the presence/absence of a gene. Only the genes that are missing in one or more genomes are indicated. A total of 91 genes are shared by all compared cpDNAs: atpA, B, E, F, H, I, ccsA, cemA, clpP, ftsH, petA, B, D, G, L, psaA, B, C, J, psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z, rbcL, rpl2, 5, 14, 16, 20, 23, 36, rpoA, B, C1, C2, rps2, 3, 4, 7, 8, 9, 11, 12, 14, 18, 19, rrf, rrl, rrs, tufA, ycf1, 3, 4, 12, trnA(ugc), C(gca), D(guc), E(uuc), F(gaa), G(gcc), G(ucc), H(gug), I(cau), I(gau), K(uuu), L(uaa), L(uag), Me(cau), Mf(cau), N(guu), P(ugg), Q(uug), R(acg), R(ucu), S(gcu), S(uga), T(ugu), V(uac), W(cca), Y(gua).
Compared sizes of expanded genes in Oltmannsiellopsis and other UTC algal cpDNAs
| 801 | 1.6 | 1059 | 2.2 | 909 | 1.8 | 1503 | 3.0 | |
| 5163 | 1.9 | 6879 | 2.6 | 7791 | 2.9 | 8916 | 3.3 | |
| 837 | 0.9 | 1527 | 1.6 | 1734 | 1.8 | 2213b | 2.3 | |
| 3906 | 1.2 | 4251 | 1.3 | 6537 | 2.0 | 4967c | 1.5 | |
| 2511 | 1.3 | 3066 | 1.5 | 4737 | 2.4 | 5739c | 2.9 | |
| 4689 | 1.3 | 5580 | 1.5 | 10389 | 2.8 | 9363 | 2.5 | |
| 2460 | 2.0 | 2427 | 2.0 | 2505 | 2.0 | 5988 | 4.9 | |
a Size relative to the corresponding gene in Mesostigma. Each value was obtained by dividing the size of the UTC algal gene by the size of the Mesostigma gene.
b The indicated size is derived from our unpublished sequence data. We found that a portion of the rpoA coding sequences is missing in [GenBank:NC_005353] as a result of a sequencing error introducing a frameshift mutation.
c The indicated size includes the intergenic spacer separating the ORFs corresponding to the 5' and 3' portions of the gene.
Group I introns in Oltmannsiellopsis cpDNA
| Ov. | IB | 1322 | L8 | LAGLIDADG (2) | 264 |
| Ov. | IA2 | 1127 | L6 | GIY-YIG | 240 |
| Ov. | IB4 | 830 | L8 | LAGLIDADG (1) | 167 |
| Ov. | IB4 | 1129 | L9.1 | LAGLIDADG (2) | 250 |
| Ov. | IA3 | 767 | L6 | LAGLIDADG (1) | 165 |
a Introns were classified according to Michel and Westhof [44]. The subcategory of the Ov.petB.1 intron could not be identified unambiguously.
b L followed by a number refers to the loop extending the base-paired region identified by the number.
c The conserved motif in the predicted homing endonuclease is given, with the number of copies of the LAGLIDADG motif indicated in parentheses.
Group I introns at identical gene locations in Oltmannsiellopsis cpDNA, other green algal cpDNAs and land plant cpDNAs
| Ov. | [GenBank: | |
| [GenBank: | ||
| Ov. | [GenBank: | |
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| Ov. | [GenBank: | |
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| Ov. | [GenBank: | |
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
| [GenBank: | ||
a The letter in parentheses indicates the specific chlorophyte/streptophyte lineage comprising the green algal/plant indicated. P, Prasinophyceae; U, Ulvophyceae; T, Trebouxiophyceae; C, Chlorophyceae; E, Embryophyta; and ?, uncertain affiliation within the Chlorophyta.
b The intron number is given when more than one intron is present.
Figure 2Gene clusters shared between . Shared clusters are shown on the Oltmannsiellopsis gene map as alternating series of green and red boxes. Genes located outside conserved clusters are shown in grey. Genes missing from Chlamydomonas, Pseudendoclonium and Chlorella are represented in beige. The extent of the IR and transcription direction of the rRNA genes are denoted by arrows. Genes outside the map are transcribed clockwise.
Figure 3Fragmented ancestral gene clusters in the cpDNAs of UTC algae. The indicated clusters are found in both Mesostigma and Nephroselmis cpDNAs. Note that rpl22 has not been represented in the large ribosomal protein cluster because it is present only in Mesostigma cpDNA (between rps19 and rps3). Sites of fragmentation are denoted by arrowheads of different shades above the clusters: Chlamydomonas, filled arrowheads; Pseudendoclonium, dark grey arrowheads; Oltmannsiellopsis, light grey arrowheads; and Chlorella, open arrowheads. Genes missing from Chlamydomonas, Pseudendoclonium, Oltmannsiellopsis and Chlorella are indicated by squares, circles, asterisks and double dagger, respectively. Gene polarities are not shown.
Figure 4Derived gene clusters shared between the cpDNAs of UTC algae. Filled/open boxes represent the presence/absence of clusters. Gene polarities are not shown.
Minimal numbers of inversions accounting for gene rearrangements between green algal cpDNAs
| 75 | 74 | 75 | 75 | 77 | |
| 54 | 55 | 52 | 55 | ||
| 54 | 55 | 50 | |||
| 46 | 47 | ||||
| 43 | |||||
a Numbers of gene permutations by inversions were computed using GRIMM.
Figure 5Positions of SDR elements in . The Oltmannsiellopsis cpDNA sequence was aligned against itself using PipMaker. Regions containing SDRs can be identified as clusters of dots. Similarities between aligned regions are shown as average percent identity (between 50 and 100% identity). Genes and their polarities are denoted by horizontal arrows and coding sequences are represented by filled boxes.
SDR units in Oltmannsiellopsis cpDNA
| A | 11 | CAACACTYCCA | 252 | 426 |
| B | 11 | AWAGCGAAGCW | 89 | 126 |
| C | 7 | GCCCCCC | 126 | 126 |
| D | 8 | GGGGAGGG | 45 | 45 |
| E | 21 | AGGGGCTTTGCTTCGCGGTTT | 17 | 17 |
a Numbers of SDR units were determined in searches performed using 100% or 90% sequence identity. The genomic locations of these repeated elements are reported in [GenBank: DQ291132].
Figure 6Predicted secondary structures formed by SDR units A and B in Oltmannsiellopsis cpDNA.
Figure 7Densities of SDR elements in . Repeated elements with identical sequences are connected on the circular representations of the genomes. Repeats larger than 30 bp and 45 bp are shown on the top and bottom panels, respectively. For these analyses, one copy of the IR sequence was deleted from the Nephroselmis, Pseudendoclonium, Oltmannsiellopsis and Chlamydomonas genomes.