| Literature DB >> 18462506 |
Rose Ann Cattolico1, Michael A Jacobs, Yang Zhou, Jean Chang, Melinda Duplessis, Terry Lybrand, John McKay, Han Chuan Ong, Elizabeth Sims, Gabrielle Rocap.
Abstract
BACKGROUND: Heterokont algae form a monophyletic group within the stramenopile branch of the tree of life. These organisms display wide morphological diversity, ranging from minute unicells to massive, bladed forms. Surprisingly, chloroplast genome sequences are available only for diatoms, representing two (Coscinodiscophyceae and Bacillariophyceae) of approximately 18 classes of algae that comprise this taxonomic cluster. A universal challenge to chloroplast genome sequencing studies is the retrieval of highly purified DNA in quantities sufficient for analytical processing. To circumvent this problem, we have developed a simplified method for sequencing chloroplast genomes, using fosmids selected from a total cellular DNA library. The technique has been used to sequence chloroplast DNA of two Heterosigma akashiwo strains. This raphidophyte has served as a model system for studies of stramenopile chloroplast biogenesis and evolution.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18462506 PMCID: PMC2410131 DOI: 10.1186/1471-2164-9-211
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Fosmid cloning technique. High molecular weight, total DNA is subject to pulse-field electrophoresis to recover sheared DNA of 45 to 50 kb. This DNA is used to generate a fosmid library which is selectively screened for cpDNA-containing clones, which are then sequenced, annotated and assembled.
Figure 2. Outer rim: genes on plus and minus strand, color coded according to function (see legend); Second ring: small inverted (red) and tandem (blue) repeats; Third ring: sequence comparison to the other H. akashiwo genome, including SNPs (blue), small insertions (green), deletions (red) and regions of extremely poor alignment (orange); Fourth ring: Location and size of fosmid clones color coded according to their orientation: supports depicted isoform (green), supports alternate isoform (pink), uninformative (black); Fifth ring: location of inverted repeats, large and small single copy domains. Red bar depicts location of 8 kb region inverted in CCMP452 relative to NIES293; inner circle: GC content.
Overview of H. akashiwo strains CCMP 452 and NIES 293 chloroplast genomes
| CCMP 452 | NIES 293 | |
| Length (bp) | 160,149 | 159,370 |
| Small Single Copy | 39,035 | 38,834 |
| Large Single Copy | 77,470 | 77,206 |
| Inverted Repeat | 21,822 | 21,665 |
| G+C content (%) | 30.5 | 30.4 |
| Protein coding (%) | 68.5 | 69.0 |
| Avg. protein length | 703 | 704 |
| Protein coding genes | 156 | 156 |
| With assigned function | 130 | 130 |
| Conserved hypothetical (ycf) | 19 | 19 |
| Hypothetical | 7 | 7 |
| Ribosomal RNA operons | 2 | 2 |
| Transfer RNA genes | 34 | 34 |
| Pseudo tRNA genes | 1 | 1 |
| tmRNA genes | 1 | 1 |
* This table reports total numbers of genes. Each of the two IRs contains 12 protein-coding genes, 7 genes for tRNAs and 1 rRNA operon.
Figure 3Isomeric cpDNA populations. Single copy regions are flipped resulting from a recombination event. Arrows show positions of sequence in large and small single copy regions.
Figure 4Comparison of . Gray shading indicates residues completely conserved among the four proteins. Stars indicated conserved residues important in catalytic function. Overline and underline are box I and box II respectively.
Comparison of putative dif sites in H. akashiwo chloroplast genomes with those of selected bacteria and viruses
| XerC | Binding | Xer D | |
| # | ACTGAGCTAAT | AGCCCAACA | TTATGTTAAAT |
| ATAGGCCTTCG | TCCCCT | TTATGTTAAAT | |
| ATTGAGGATCA | TTTTTG | TTATGTTAAAG | |
| % | AAAAACCAAAA | AATAAT | TTATGTTAAAG |
| GGTGCGCATAA | TGTATA | TTATGTTAAAT | |
| GGTGCGCATAA | TGTATA | TTATGTTAAAT | |
| GGAGCGCATAA | TGTATA | TTATGTTAAAT | |
| AGTGCGCATTA | TGTATG | TTATGTTAAAT | |
| AATGCGCATTA | CGTGCG | TTATGTTAAAT | |
| ATTTCGCATAA | TATAAA | TTATGTTAAAT | |
| ACTTCCTAGAA | TATATA | TTATGTAAACT | |
| *ColE1 cer | GGTGCGTACAA | TTAAGGGA | TTATGGTAAAT |
| *pSC101 psi | GGTGCGCGCAA | GATCC | TTATGTTAAAC |
# Present in CCMP452 and NIES293, on inverted repeat
& Present in CCMP452 and NIES293, on large single copy
% Present in NIES293, on inverted repeat
* From Lesterlin et al, 2004 [77]
Figure 5Synteny among stramenopile and red-lineage chloroplast genomes. H. akashiwo vs (A) Odontella sinensis and (B) Porphyra purpurea; Thalassiosira pseudonana vs (C) Odontella sinensis and (D) Porphyra purpurea.
Presence of Single Nucleotide Polymorphisms in protein coding genes between H. akashiwo CCMP452 and NIES293
| Gene | Synonymous | Non-synonymous |
| Ribosomal | ||
| - | 1 | |
| - | 1 | |
| 1 | - | |
| - | 1 | |
| - | 1 | |
| Translational | ||
| - | 2 | |
| 2 | 1 | |
| Photosystem/energy | ||
| 2 | - | |
| 1 | - | |
| - | 1 | |
| 1 | - | |
| 1 | - | |
| - | 1 | |
| 1 | - | |
| 1 | - | |
| Metabolic | ||
| 1 | - | |
| 1 | - | |
| 1 | - | |
| 1 | 1 | |
| 1 | - | |
| 2 | 2 | |
| 1 | - | |
| 1 | - | |
| ycf/orf | ||
| 1 | - | |
| - | 1 | |
| - | 1 | |
| 1 | - | |
| 4 | - | |
| 1 | 1 | |
| 2 | 5 | |
| - | 5 | |
| Highly impacted genes | ||
| 2 | 10 | |
| - | 1 | |
* located on the inverted repeat
A photosystem 1 assembly protein
B ABC transporter protein
C within the first 275 AAs
D within the first 65 AAs
Occurrence of tandem and inverted repeats in chloroplast genomes of rhodophytes and algae with chloroplasts of secondary endosymbiotic origin
| Organism | Genome Size | Inverted # (bp) | % | Tandem # (bp) | % | Repeat % Total |
| 160,149 | 40 (3060) | 1.91 | 25 (1150) | 0.71 | 2.62 | |
| 159,370 | 36 (2879) | 1.81 | 23 (916) | 0.57 | 2.38 | |
| (C) | 119,704 | 21(1394) | 1.16 | 2 (74) | 0.06 | 1.22 |
| (C) | 128,814 | 26 (1,600) | 1.24 | 5 (210) | 0.16 | 1.40 |
| (P) | 117,369 | 14(751) | 0.64 | 4 (132) | 0.11 | 0.75 |
| 105,309 | 15 (784) | 0.74 | 1 (34) | 0.03 | 0.77 | |
| 121,524 | 16 (1080) | 0.09 | 1 (32) | 0.03 | 0.12 | |
| (F) | 183,883 | 8 (421) | 0.23 | 5 (184) | 0.10 | 0.33 |
| (F) | 191,028 | 10 (489) | 0.26 | 1 (30) | 0.02 | 0.28 |
| (F) | 191,952 | 13 (672) | 0.35 | 3 (132) | 0.07 | 0.42 |
| (B) | 164,921 | 9 (526) | 0.32 | 2 (62) | 0.04 | 0.36 |
| (B) | 149,987 | 3 (152) | 0.10 | 71 (1984) | 1.32 | 1.42 |
| 135,599 | 47 (3268) | 2.41 | 26 (1,435) | 1.06 | 3.46 |
C-Centric diatom; P-Pennate diatom F-Floridiophyte B-Bangiophyte
Conservation of gene-associated repeats
| Cytochrome Associated Genes | |||||||||||
| Organism | B/D | F | G | J | L | M/N | |||||
| IR | T/IR | T | IR | IR | IR | IR | IR | - | IR# | IR | |
| IR | - | T | IR | IR | IR | IR | IR | - | IR# | IR | |
| (C) | IR | IR | IR | IR/T* | - | IR/T* | IR/IR | 0 | - | IR# | - |
| (C) | - | T/IR | IR/T | IR | - | - | - | 0 | - | IR | - |
| (P) | IR | IR | IR | IR* | IR | IR* | - | 0 | IR# | - | - |
| IR | IR | - | IR | - | 0 | IR | 0 | - | - | IR | |
| IR | T/IR* | - | IR | T | T* | IR# | 0 | - | - | IR# | |
| (F) | - | IR* | - | - | IR | IR* | - | IR | 0 | - | - |
| (F) | - | IR | IR | - | IR | - | IR | IR | - | - | - |
| (F) | IR | IR | IR | - | - | - | IR/T | IR | - | - | - |
| (B) | IR | - | - | - | - | T | - | - | 0 | - | - |
| (B) | IR | - | T/T | T | - | T# | - | - | - | - | T/T |
| IR | IR | IR | IR | IR | IR | IR | 0 | IR | IR | IR | |
IR: inverted repeat; T: tandem repeat; – no repeat; 0 gene absent from cpDNA ; * shared repeat; # located 5' to the gene. All other repeats are 3' to the gene.