| Literature DB >> 21546564 |
Jean-Simon Brouard1, Christian Otis, Claude Lemieux, Monique Turmel.
Abstract
In the Chlorophyceae, the chloroplast genome is extraordinarily fluid in architecture and displays unique features relative to other groups of green algae. For the Chaetophorales, 1 of the 5 major lineages of the Chlorophyceae, it has been shown that the distinctive architecture of the 223,902-bp genome of Stigeoclonium helveticum is consistent with bidirectional DNA replication from a single origin. Here, we report the 182,759-bp chloroplast genome sequence of Schizomeris leibleinii, a member of the earliest diverging lineage of the Chaetophorales. Like its Stigeoclonium homolog, the Schizomeris genome lacks a large inverted repeat encoding the rRNA operon and displays a striking bias in coding regions that is associated with a bias in base composition along each strand. Our results support the notion that these two chaetophoralean genomes replicate bidirectionally from a putative origin located in the vicinity of the small subunit ribosomal RNA gene. Their shared structural characteristics were most probably inherited from the common ancestor of all chaetophoralean algae. Short dispersed repeats account for most of the 41-kb size variation between the Schizomeris and Stigeoclonium genomes, and there is no indication that homologous recombination between these repeated elements led to the observed gene rearrangements. A comparison of the extent of variation sustained by the Stigeoclonium and Schizomeris chloroplast DNAs (cpDNAs) with that observed for the cpDNAs of the chlamydomonadalean Chlamydomonas and Volvox suggests that gene rearrangements as well as changes in the abundance of intergenic and intron sequences occurred at a slower pace in the Chaetophorales than in the Chlamydomonadales.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21546564 PMCID: PMC3138424 DOI: 10.1093/gbe/evr037
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
General Features of Schizomeris and Other Sequenced Chlorophycean cpDNAs
| OCC Clade | CS Clade | |||||||
| Oedogoniales | Chaetopeltidales | Chaetophorales | Chlamydomonadales | Sphaeropleales | ||||
| Feature | ||||||||
| Size (bp) | ||||||||
| Total | 196,547 | 521,168 | 223,902 | 182,759 | 203,827 | 461,064 | 269,044 | 161,452 |
| IR | 35,492 | — | — | — | 22,211 | 15,948 | 14,409 | 12,022 |
| SC1 | 80,363 | — | — | — | 81,307 | 227,676 | 127,339 | 72,440 |
| SC2 | 45,200 | — | — | — | 78,088 | 200,100 | 112,887 | 64,968 |
| A + T (%) | 70.5 | 65.5 | 71.1 | 72.8 | 65.5 | 57.0 | 67.9 | 73.1 |
| Sidedness index | 0.74 | 0.91 | 0.95 | 0.97 | 0.87 | 0.83 | 0.86 | 0.88 |
| Conserved genes (no.) | 99 | 97 | 97 | 98 | 94 | 94 | 94 | 96 |
| Introns | ||||||||
| Fraction of genome (%) | 17.9 | 3.4 | 10.9 | 13.4 | 7.0 | 4.4 | 10.3 | 7.9 |
| Group I (no.) | 17 | 19 | 16 | 24 | 5 | 3 | 21 | 7 |
| Group II (no.) | 4 | 7 | 5 | 9 | 2 | 6 | 2 | 2 |
| Intergenic sequences | ||||||||
| Fraction of genome (%) | 22.6 | 77.8 | 44.4 | 31.5 | 49.2 | 75.3 | 54.1 | 34.3 |
| Average size (bp) | 370 | 3,824 | 975 | 538 | 937 | 3,405 | 1,347 | 517 |
| Short repeated sequences | ||||||||
| Fraction of genome (%) | 1.3 | 49.9 | 17.8 | 1.2 | 16.8 | 45.5 | 8.6 | 3.0 |
NOTE.—Abbreviations: Oc, Oedogonium cardiacum, NC_011031 (Brouard et al. 2008); Ft, Floydiella terrestris, NC_014346 (Brouard et al. 2010); Sh, Stigeoclonium helveticum, NC_008372 (Bélanger et al. 2006); Sl, Schizomeris leibleinii, HQ700713 (this study); Cr, Chlamydomonas reinhardtii, NC_005353 (Maul et al. 2002); Vc, Volvox carteri, GU084820 (Smith and Lee 2010); Ds, Dunaliella salina, GQ250046 (Smith et al. 2010); So, Scenedesmus obliquus, NC_008101 (de Cambiaire et al. 2006).
Values provided for the Volvox cpDNA should be considered as estimates because some intergenic regions could not be entirely sequenced.
Because the Floydiella, Stigeoclonium, and Schizomeris cpDNAs lack an IR, only the total sizes of these genomes are given.
Single-copy region with the larger size.
Single-copy region with the smaller size.
Conserved genes refer to free-standing coding sequences usually present in chloroplast genomes. Genes present in the IR as well as others duplicated were counted only once.
ORFs < 130 codons showing no sequence similarity with known genes were considered as intergenic sequences.
Nonoverlapping repeated elements ≥30 bp were identified as described in the Materials and Methods.
FGene maps of the Schizomeris and Stigeoclonium chloroplast genomes. Coding sequences on the outside of the map are transcribed in a clockwise direction. Genes included in the nine blocks of colinear sequences (numbered 1 to 9) shared between Schizomeris and Stigeoclonium are shown in blue, whereas rearranged or unique genes are shown in red. Group I and group II introns are shown in yellow and orange, respectively; intron ORFs are distinguished from exons by narrower boxes. Arrows mark the putative origin (ori) and terminus (ter) of replication. The duplicated Schizomeris trnS(gcu) is indicated by an asterisk. The rpoB gene consists of two separate ORFs (rpoBa and rpoBb) that are not associated with sequences typical of group I or group II introns. Two free-standing ORFs containing more than 130 codons (orf589 and orf1432) were identified. These ORFs, which are found upstream of the partially duplicated psaA gene and the duplicated trnS(gcu) copy, show no homology with any known DNA sequences. tRNA genes are indicated by the one-letter amino acid code followed by the anticodon in parentheses (Me, elongator methionine; Mf, initiator methionine).
FIdentification of the putative origin and terminus of replication in Schizomeris cpDNA. (A) Plot of cumulative GC skew over the entire genome sequence. The cumulative GC skew was calculated as indicated in the Materials and Methods; base at position 150,000 was arbitrarily selected to represent the starting point. Above the plot is a representation of the coding strand, either the strand whose sequence has been deposited in GenBank (the top strand) or the alternate strand (bottom strand). The numbers of conserved genes encoded in the filled boxes are indicated. (B) Positions of three highly conserved noncoding sequences (S1, S2, and S3) in the Schizomeris, Stigeoclonium, and Uronema cpDNA regions containing a putative origin of replication. (C) Alignments of the Schizomeris (Sl), Stigeoclonium (Sh), and Uronema (Ub) S1 and S2 sequences.
Minimal Numbers of Inversions Accounting for Gene Rearrangements between Chlorophycean cpDNAs
| Number of Gene Inversions | |||||||
| cpDNA | |||||||
| 73 | 80 | 78 | 81 | 79 | 80 | 82 | |
| — | 72 | 70 | 84 | 83 | 82 | 85 | |
| — | 20 | 89 | 90 | 88 | 89 | ||
| — | 90 | 90 | 90 | 87 | |||
| — | 24 | 51 | 58 | ||||
| — | 51 | 62 | |||||
| — | 58 | ||||||
NOTE.—Abbreviations: Oc, Oedogonium cardiacum; Ft, Floydiella terrestris; Sh, Stigeoclonium helveticum; Sl, Schizomeris leibleinii; Cr, Chlamydomonas reinhardtii; Vc, Volvox carteri; Ds, Dunaliella salina; So, Scenedesmus obliquus.
Inversions were computed using GRIMM (Tesler 2002).
FSequence divergence, gene rearrangements, and variation in the abundance of diverse categories of sequences in the Schizomeris/Stigeoclonium and Chlamydomonas/Volvox chloroplast genomes. The tree on the left shows the evolutionary distances as inferred from 61 concatenated protein-coding genes as well as the number of inversions for each pair of compared genomes. The numbers of nonsynonymous (upper numbers) and synonymous (lower numbers) nucleotide substitutions estimated by CODEML are shown along each branch. The histograms on the right illustrate the lengths of coding sequences, introns, repeats, and intergenic spacers excluding the repeats. Note that the sequences within the IR were counted only once. Abbreviations: Oc, Oedogonium cardiacum; Ft, Floydiella terrestris; Sh, Stigeoclonium helveticum; Sl, Schizomeris leibleinii; Cr, Chlamydomonas reinhardtii; Vc, Volvox carteri; Ds, Dunaliella salina; So, Scenedesmus obliquus.