| Literature DB >> 24646409 |
Shawn R Starkenburg1, Kyungyoon J Kwon, Ramesh K Jha, Cedar McKay, Michael Jacobs, Olga Chertkov, Scott Twary, Gabrielle Rocap, Rose Ann Cattolico.
Abstract
BACKGROUND: Microalgae in the genus Nannochloropsis are photosynthetic marine Eustigmatophytes of significant interest to the bioenergy and aquaculture sectors due to their ability to efficiently accumulate biomass and lipids for utilization in renewable transportation fuels, aquaculture feed, and other useful bioproducts. To better understand the genetic complement that drives the metabolic processes of these organisms, we present the assembly and comparative pangenomic analysis of the chloroplast and mitochondrial genomes from Nannochloropsis salina CCMP1776.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24646409 PMCID: PMC3999925 DOI: 10.1186/1471-2164-15-212
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Whole chloroplast genome alignments of , , and The red and green co‐‐linear blocks indicate regions of synteny and homology between the four algal species. The lines connecting the genomes indicate orthologous gene clusters. The solid blue lines indicate the locations of gaps in the N. oceanica genome assembly.
General characteristics of the organellar genomes
| Chloroplast | Size (bp) | 114821 | 114875 | 117463 | 115980* |
| | GC content | 32.92 | 32.96 | 33.4 | 33.5 |
| | Genes | 132 | 132 (124) | 136 (126) | 136 |
| | tRNA | 28 | 28 | 29(34) | 27 |
| | rRNA | 6 | 6 | 6 | 6 |
| | Nucleotide identity (%)† | 100.0 | 98.4 | 84.3 | 81.3 |
| Mitochondria | Size (bp) | 41992 | 42067 | 41721* | 38067 |
| | GC content | 31.4 | 31.4 | 32.2 | 31.9 |
| | Genes | 43 | 43 (36) | 40 (35) | 41 |
| | tRNA | 27 | 27 | 26 (28) | 25 |
| | rRNA | 2 | 2 | 2 | 2 |
| | Intronic ORF | 0 | 0 | 1 | 0 |
| Nucleotide identity (%)† | 100.0 | 97.0 | 76.2 | 73.5 |
†Percent global nucleotide identity relative to N. salina.
#The quantity of genes previously reported for N. gaditana CCMP527[13] and N. oculata CCMP525[12] are shown in parentheses.
*Indicates the amount of assembled bases; 1 or more gaps remain in the assembly.
Figure 2Circular diagram of the mitochondrial genome. The inset gene cluster, located on the N. oculata mitochondrial genome, shows genetic variation around cox1; insertion of the group IIA intron insertion (red dashes), and insertion of sequence in N. salina between cox1 and atp1 (green dashes). Genes are color-coded based on related metabolic function (see legend for categories).
Figure 3Circular diagram of the . chloroplast genome. The inset gene cluster indicates the genomic variation of the inverted repeat in N. oculata. The red dashes indicates the location of the three gene deletion in N. salina. Genes are color-coded based on the metabolic function (see legend).
Pangenomes of the organelles
| | ||
| | ||
| | ||
| | ||
| | | |
*Transcript was detected for all genes in bold.
+Representative genes are shown; one or more orthologs are present on the Nannochloropsis replicons.
Highly divergent genes on the organellar genomes
| Nsk00019 | rps6; 30S ribosomal protein S6 | 30S ribosomal protein S6 [Thalassiosira pseudonana];|YP_874616.1| | 106 | 103 | 96 | 36.5 | 2.00E-11 |
| Nsk00027 | atpD; Atp synthase delta subunit | Hypothetical protein MldDRAFT_4321 [delta proteobacterium MLMS-1]; |ZP_01290127.1| | 232 | 331 | 162 | 24.7 | 0.02 |
| Nsk00028 | atpF; ATP synthase b subunit | CF0 subunit I of ATP synthase [Oltmannsiellopsis viridis]; |YP_635887.1| | 155 | 183 | 106 | 33 | 5.00E-06 |
| Nsk00029 | atpG; ATP synthase b’ subunit | ATP synthase CF0 subunit II [Vaucheria litorea];|YP_002327468.1| | 160 | 154 | 145 | 29.7 | 7.00E-11 |
| Nsk00053 | Hypothetical; putative peroxidase | Hypothetical protein tlr1577 [Thermosynechococcus elongatus BP-1];|NP_682367.1| | 195 | 99 | 54 | 35.2 | 0.072 |
| Nsk00055 | psb28; photosystem II protein (ycf79) | Photosystem II protein W [Guillardia theta]; |NP_050669.1| | 113 | 116 | 94 | 30.9 | 2.00E-04 |
| Nsk00062 | ycf4; photosystem I assembly protein | Photosystem I assembly protein Ycf4 [Coccomyxa subellipsoidea C-169]; |YP_004222004.1| | 195 | 189 | 155 | 29 | 4.00E-12 |
| Nsk00063 | ycf49; DUF2499 | Unknown DUF2499 [Picea sitchensis]; |ABK25760.1| | 97 | 216 | 88 | 35.2 | 5.00E-11 |
| Nsk00087 | Unknown; ORFan | Hypothetical protein SPPN_02855 [Streptococcus pseudopneumoniae IS7493 | 117 | 282 | 85 | 25.9 | 0.81 |
| Nsk00113 | rpoA; RNA polymerase alpha chain | RNA polymerase alpha subunit [Cryptomonas paramecium]; |YP_003359271.1| | 447 | 310 | 195 | 34.9 | 4.00E-14 |
| Nsk00135 | ycf34 | Chloroplast protein Ycf34 [Gloeobacter violaceus PCC 7421]; |NP_927340.1| | 86 | 80 | 81 | 28.4 | 0.27 |
| Nsk00142 | clpN | ATP-dependent Clp protease ATP-binding subunit ClpA [Desulfobulbus propionicus DSM 2032; |YP_004196194.1| | 149 | 756 | 96 | 29.2 | 2.4 |
| Nsk00202 | Unknown; ORFan | Predicted protein with ABC transporter signatures [Fibroporia radiculosa]; |CCM01526.1| | 93 | 613 | 54 | 38.9 | 1.3 |
| Nsk00204 | Unknown; ORFan | Hyp. periplasmic binding protein MARHY3762 [Marinobacter hydrocarbonoclasticus ATCC 49840]; |YP_005431639.1| | 119 | 404 | 50 | 46 | 1.9 |
| Nsk00206 | Unknown; ORFan | Coiled-coil domain-containing protein 141 [Nomascus leucogenys]; |XP_003253834.1| | 99 | 1530 | 59 | 37.3 | 5.5 |
| Nsk00212 | rps10; 30S ribosomal protein S10 | 30S ribosomal protein S10 [Spirochaeta smaragdinae DSM 11293]; |YP_003802682.1| | 112 | 102 | 84 | 36.9 | 5.00E-05 |
| Nsk00213 | rps11; 30S ribosomal protein S11 | 30S ribosomal protein S11, partial [uncultured bacterium]; |EKD46317.1| | 156 | 140 | 110 | 39.1 | 2.00E-18 |
| Nsk00217 | rps2; 30S ribosomal protein S2 | Hypothetical protein [Batrachochytrium dendrobatidis JAM81];|EGF78568.1| | 212 | 195 | 169 | 29 | 5.00E-16 |
| Nsk00218 | rps4: 30S ribosomal protein S4 | Ribosomal protein S4 [Synedra acus]; |YP_003359457.1| | 241 | 246 | 176 | 33.5 | 4.00E-09 |
| Nsk00219 | Unknown; ORFan | Hypothetical protein [Trichomonas vaginalis G3]; |XP_001579587.1| | 323 | 744 | 118 | 28 | 1.8 |
| Nsk00222 | rpl5; 50S ribosomal protein L5 | Ribosomal protein L5 [Thalassiosira pseudonana]; |YP_316605.1| | 179 | 178 | 176 | 34.7 | 8.00E-19 |
| Nsk00231 | atp8; ATP synthase F0 subunit 8 | ATP synthase F0 subunit 8 [Fucus vesiculosus; |YP_448633.1| | 105 | 53 | 60 | 51.7 | 2.00E-07 |
| Nsk00232 | Unknown; ORFan | fmhA protein [Staphylococcus saprophyticus ATCC 15305]; |YP_300577.1| | 231 | 410 | 156 | 23.08 | 6.7 |
| Nsk00235 | rps13; 30S ribosomal protein S13 | NADH dehydrogenase s9- S13 fusion protein [endosymbiont of Durinskia baltica] |gb|AEP20701.1| | 118 | 310 | 117 | 41.9 | 8.00E-18 |
| Nsk00013, Nsk0014, Nsk00150, Nsk00085, Nsk00203, Nsk00223 | Unknown; ORFans | No homologs | - | - | - | - | - |
*Genes with locus tags that have a numerical value of <200 and > 200 are located on the chloroplast and mitochondrial genomes, respectively. Nsk00013, Nsk0014, Nsk00150, Nsk00085, Nsk00203, Nsk00223 were also identified as highly divergent with no BLASTP hit in the NR database.
#Identity of the aligned amino acids.
Figure 4CbbX phylogenetic tree. Bootstrap values higher than 50% are indicated at the nodes. The scale bar represents 0.4 mutations per site. Branch lengths are drawn to scale. Cyanobacterial CbbX sequences are boxed in green. CbbX sequences from red algae or secondary endosymbiotic events with red algae are boxed in red. CbbX sequences from all Stramenopiles (except Nannochloropsis) are boxed in brown. CbbX sequences encoded in the nucleus (nuc) or nucleomorph (nm) are boxed in orange. The Bacillus subtilis sporulation factor SpoVK is used as the outgroup.
Figure 5Primary and secondary structures of AtpD variants. The secondary structures above the first sequence indicate the predicted secondary structure of the AtpD found in Nannochloropsis. The secondary structures depicted below the last sequence indicate the approximate location of the consensus secondary structures of the E. coli ATP synthase delta subunit and the bovine OSCP derived from the predictions made by PSIPRED and Porter.
Figure 6Transcript profiles of the ATP synthase genes in . The top panel indicates the coverage of transcript reads mapped to the given region of N. salina chloroplast genome. The bottom panel indicates the locations of the coding regions of the ATP synthase genes (red) and neighboring genes (blue, green, white). The arrowed blocks in gray indicate the location of t‒RNAs (from 5′ to 3′; tRNA‒Lys, tRNA‒Gly, tRNA‒Glu).
Figure 7Structural models of ATP synthase subunits. Intermolecular interaction between N-terminal sequence of AtpA and homologs of AtpD: Minimum energy docked conformation of predicted N. salina AtpA-N terminus and N. salina AtpD (residues 31-154) (A), NMR structure of E. coli AtpA N-terminal and δ-subunit (PBD code: 2A7U) (B), NMR structure of Bovine AtpA N-terminal and OSCP subunit (PDB code:2JMX) (C).
Figure 8Divergence of the chloroplast Clp orthologs. ClpC contains several conserved domains: an N-domain (green), a D1-domain (yellow), a middle domain (M, purple), and a D2-domain (blue). The D1 and D2 domains each contain an AAA module (red). The D-2 domain in Bacillus contains a conserved ClpP-binding loop (P, orange). Homologous structural and functional features identified between bacterial ClpC and translated Nannochloroposis Clp orthologs are color matched. The question marks indicate that the M-domain and ClpC-binding regions were not clearly identified.