| Literature DB >> 18558012 |
Jean-Simon Brouard1, Christian Otis, Claude Lemieux, Monique Turmel.
Abstract
BACKGROUND: To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales).Entities:
Mesh:
Substances:
Year: 2008 PMID: 18558012 PMCID: PMC2442088 DOI: 10.1186/1471-2164-9-290
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Gene map of . Thick lines on the inner circle represent the two copies of the IR (IRA and IRB), with the arrows indicating the transcription direction of the IR-encoded rRNA operon. Genes (filled boxes) on the outside of the outer circle are transcribed in a clockwise direction. Introns are represented by open boxes and intron ORFs are denoted by narrow, filled boxes. The rpoB gene consists of two separate ORFs (rpoBa and rpoBb) that are not associated with sequences typical of group I or group II introns. tRNA genes are indicated by the one-letter amino acid code followed by the anticodon in parentheses (Me, elongator methionine; Mf, initiator methionine). Only one of the two isomeric forms of the genome is shown here; these isomers differ with respect to the relative orientation of the SC regions.
General features of Oedogonium and other chlorophycean cpDNAs
| Size (bp) | ||||
| Total | 196,547 | 223,902 | 161,452 | 203,827 |
| IR | 35,492 | -a | 12,022 | 22,211 |
| SC1b | 80,363 | -a | 72,440 | 81,307 |
| SC2c | 45,200 | -a | 64,968 | 78,088 |
| A+T (%) | 70.5 | 71.1 | 73.1 | 65.5 |
| Sidedness index | 0.74 | 0.95 | 0.88 | 0.87 |
| Conserved genes (no.) d | 99 | 97 | 96 | 94 |
| Introns | ||||
| Fraction of genome (%) | 17.9 | 7.9 | 8.6 | 6.8 |
| Group I (no.) | 17 | 16 | 7 | 5 |
| Group II (no.) | 4 | 5 | 2 | 2 |
| Intergenic sequences e | ||||
| Fraction of genome (%) | 22.6 | 46.7 | 34.3 | 49.2 |
| Average size (bp) | 370 | 1026 | 517 | 937 |
| Short repeated sequences | ||||
| Fraction of genome (%) f | 1.3 | 17.8 | 3.0 | 15.8 |
a Because Stigeoclonium cpDNA lacks an IR, only the total size of this genome is given.
b SC region with the larger size.
c SC region with the smaller size.
d Conserved genes refer to free-standing coding sequences usually present in chloroplast genomes. Genes present in the IR were counted only once.
e The Oedogonium orf485 and the ORFs encoding the maturase domain of reverse transcriptase in Oedogonium and Scenedesmus (orf1252 and orf932, respectively) were considered as gene sequences.
f Non-overlapping repeated elements were identified as described in the Methods.
Differences between the repertoires of conserved genes in Oedogonium and other chlorophycean cpDNAs
| + | - | + | - | |
| - | - | + | + | |
| + | + | - | - | |
| - | - | + | - | |
| + | + | - | - | |
| + | + | - | - | |
| + | - | - | - | |
| + | - | - | - | |
| - | + | - | - |
a Only the genes that are missing in one or more genomes are indicated. Plus and minus signs denote the presence and absence of genes, respectively. A total of 93 genes are shared by all compared cpDNAs: atpA, B, E, F, H, I, ccsA, cemA, chlB, L, N, clpP, ftsH, petB, D, G, L, psaA, B, C, J, psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z, rbcL, rpl2, 5, 14, 16, 20, 23, 36, rpoA, B, C1, C2, rps2, 3, 4, 7, 8, 9, 11, 12, 14, 18, 19, rrf, rrl, rrs, tufA, ycf1, 3, 4, 12, trnA(ugc), C(gca), D(guc), E(uuc), F(gaa), G(gcc), G(ucc), H(gug), I(cau), I(gau), K(uuu), L(uaa), L(uag), Me(cau), Mf(cau), N(guu), P(ugg), Q(uug), R(acg), R(ucu), S(gcu), S(uga), T(ugu), V(uac), W(cca) and Y(gua).
Compared codon usage of the free-standing ORFs and conserved protein-coding genes in Oedogonium cpDNA
| 38.5 | 29.8 | 14.4 | 0.686 | |
| 48.0 | 34.3 | 32.9 | 0.465 | |
| 33.8 | 26.4 | 16.0 | 0.682 | |
| 19.0 | 17.6 | 13.3 | 0.733 | |
| Protein-coding genes a | 36.5 | 30.6 | 14.8 | 0.723 |
a Average values obtained from the 68 conserved protein-coding genes.
Figure 2Conservation of gene pairs in . For each gene pair, adjoining termini of the genes are indicated. Filled boxes indicate the presence of gene pairs with the same relative polarities in two or more genomes. Grey or open boxes indicate the absence of gene pairs. A grey box indicates that the two genes associated with a gene pair are found in the genome but are unlinked. An open box indicates that one or both genes associated with a gene pair are absent from the genome. Two horizontal lines at the bottom of the figure denote the gene pairs that are shared specifically by Chlamydomonas and Scenedesmus (CS clade) and by Oedogonium and Stigeoclonium (OCC clade). The gene pairs shared by the prasinophycean Nephroselmis and the streptophyte Mesostigma viride were presumably present in the last common ancestor of all green algae. Chlorella vulgaris belongs to the Trebouxiophyceae, whereas Oltmannsiellopsis and Pseudendoclonium are representatives of the Ulvophyceae.
Introns in Oedogonium cpDNA
| Oc. | 489 | IB | L8 | LAGLIDADG (2) | 274 |
| Oc. | 179 | IA1 | L9 | LAGLIDADG (1) | 184 |
| Oc. | 276 | IA1 | L5 | HNH | 165 |
| Oc. | 384 | IA3 | L3.2 | GIY-YIG | 311 |
| Oc. | 408 | IA1 | L5 | HNH | 157 |
| Oc. | 525 | IA2 | - | - | - |
| Oc. | 548 | IA1 | L5 | HNH | 202 |
| Oc. | 570 | IA1 | - | - | - |
| Oc. | 645 | IA1 | |||
| Oc. | 754 | IA1 | L5 | HNH | 270 |
| Oc. | 898 | IA1 | L5 | HNH | 258 |
| Oc. | 579 | IA2 | - | - | - |
| Oc. | 1009 | IA2 | - | - | - |
| Oc. | 426 | IA2 | - | - | - |
| Oc. | 1931 | IA3 | L8 | LAGLIDADG (1) | 215 |
| Oc. | 2596 | IA3 | L6 | LAGLIDADG (2) | 225 |
| Oc. | 35 | IC3 | - | - | - |
| Oc. | 25 | IIB | - | - | - |
| Oc. | 148 | IIA | - | - | - |
| Oc. | 22 | IIB | DIV | RT-X | 458 |
| Oc. | 4 | IIB | - | - | - |
a Insertion sites of introns in genes coding for tRNAs and proteins are given relative to the corresponding genes in Mesostigma cpDNA; insertion sites in rrs and rrl are given relative to E. coli 16S and 23S rRNAs, respectively. For each insertion site, the position corresponding to the nucleotide immediately preceding the intron is reported.
b Group I introns were classified according to Michel and Westhof [38], whereas classification of group II introns was according to Michel et al. [39].
c L followed by a number refers to the loop extending the base-paired region identified by the number; D refers to a domain of group II intron secondary structure.
d For the group I intron ORFs, the conserved motif in the predicted endonuclease is given, with the number of copies of the LAGLIDADG motif indicated in parentheses. For the psbI intron ORF, RT and X refer to the reverse transcriptase and maturase domains of group II intron-encoded proteins.
Figure 3Distribution of introns in . Circles denote the presence of group I introns and squares denote the presence of group II introns. Divided squares represent trans-spliced group II introns. Open symbols denote the absence of intron ORFs, whereas filled symbols denote their presence. Intron insertion sites are designated as indicated in Table 4. The column at the extreme right indicates the introns of Chlamydomonas species other than C. reinhardtii that are known to have homologues in completely sequenced UTC algal genomes. References for the latter introns are as follows: psaB [70]; psbA [71]; psbC [70]; rrs [72]; and rrl [73-76]. An asterisk denotes the absence of the ORF in some Chlamydomonas species.
Figure 4Consensus secondary structure model of the . The model is displayed according to Michel et al. [39]. Exon sequences are shown in lowercase letters. Positions exhibiting different nucleotides are denoted by dots. Positions showing deletions/additions are denoted by arrows labelled with two numbers; the left and right numbers refer to the nucleotides found in the Oedogonium and Stigeoclonium introns, respectively. Conserved base pairings are represented by dashes. Roman numerals specify the six major structural domains, whereas uppercase letters in bold denote the helices in domain I. Upper and lower numbers inside the variable loops indicate the sizes of these loops in the Oedogonium and Stigeoclonium introns, respectively. Tertiary interactions are represented by dashed lines, curved arrows, or Greek lettering. Nucleotides that potentially participate in the δ-δ' interaction are boxed. EBS and IBS are exon-binding and intron-binding sites, respectively. The putative site of lariat formation is denoted by an asterisk.