| Literature DB >> 26613001 |
Lina Yao1, Tin Wee Tan2, Yi-Kai Ng1, Kenneth Hon Kim Ban3, Hui Shen1, Huixin Lin1, Yuan Kun Lee1.
Abstract
BACKGROUND: For many years, increasing demands for fossil fuels have met with limited supply. As a potential substitute and renewable source of biofuel feedstock, microalgae have received significant attention. However, few of the current algal species produce high lipid yields to be commercially viable. To discover more high yielding strains, next-generation sequencing technology is used to elucidate lipid synthetic pathways and energy metabolism involved in lipid yield. When subjected to manipulation by genetic and metabolic engineering, enhancement of such pathways may further enhance lipid yield.Entities:
Keywords: Dunaliella tertiolecta; Lipid metabolism; Next-generation sequencing; RNA-Seq; Random insertional mutant
Year: 2015 PMID: 26613001 PMCID: PMC4660794 DOI: 10.1186/s13068-015-0382-0
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1Genotyping PCR results of D9 mutant and WT D. tertiolecta. The template for lanes 1, 2, 3, 4, and 5: D9 mutant, wild type, negative control (without template), 1 kb ladder, positive control (using the reconstructed plasmid as the template), respectively. The primer pair for lanes 1, 2, 3, and 5: ble_F + ble_R (the predicted size is 503 bp). The two arrows indicate a non-specific binding band and the target bleomycin band. ble_F: AAGCTGACCAGCGCCGTTC, ble_R: CCACGAAGTGCACGCAGTT
Fig. 2Physiological performance of D9 mutant and WT D. tertiolecta. a Growth curve of D9 mutant and WT D. tertiolecta; b photosynthetic rate and maximum photochemical capacity of D9 mutant and WT D. tertiolecta; c Photosynthetic efficiency of D9 mutant and WT D. tertiolecta; d, e, f Detection of carbon fixation parameters, namely neutral lipid content, glycerol content, and starch content. Asterisk indicate the statistically significant difference between D9 mutant and WT D. tertiolecta after two-tailed T test (*0.01 < p value ≤ 0.05; **0.001 < p value ≤ 0.01; ***p value ≤0.001)
Fig. 3RNA-Seq data analysis flowchart used in this study. The general pipeline includes sample preparation and harvesting, sequencing, data analyses, and biological interpretation. The red highlighted rectangle illustrates the construction of the D. tertiolecta reference library using Bag2D-workflow scripts. Bag2D: Blast1-annotation1-gene model1-Delete redundant genes-Blast2-annotation2-gene model2 (protein name, transcriptID, geneID)
Run summary on the Illumina MiSeq platform
| Sample name | Sequencing stats and pre-alignment QA/QC of raw data | Post-alignment QA/QC after alignment | |||||
|---|---|---|---|---|---|---|---|
| Total reads | Avg. read length | Avg. read quality | % N | % GC | Total reads | Aligned (%) | |
| D9_S7_L001_R1 | 954,340/ | 137.28/ | 38/ | 9.62E−04/ | 47.74/ | 954,300 | 56.72 |
| D9_S7_L001_R2 | 954,340/ | 137.07/ | 37.69/ | 1.04E−03/ | 47.67/ | ||
| WT_S8_L001_R1 | 1,084,599/ | 136.67/ | 38/ | 1.11E−03/ | 47.41/ | 1,084,546 | 57.24 |
| WT_S8_L001_R2 | 1,084,599/ | 136.36/ | 37.76/ | 1.25E−03/ | 47.34/ | ||
QA/QC Quality assurance and quality control
Avg. read quality >30 indicating 99.9 % accuracy or greater
The ones in italic font were data generated from post-trimming
Determining the reference transcripts through de novo assemblies using different sets of data
| No. | Samples used for assembly | Size of the assembled file (MB) | Gene model | Number of non-redundant ranscripts |
|---|---|---|---|---|
| (i) | WT | 9.4 | 3818 | 14,903 |
| (ii) | WT + D9 | 16.9 | 5360 | 23,564 |
| (iii) | Enlarged Dt-1 | 45.8 | 7023 | 47,276 |
The BlastX cut-off value is E-value ≤10−6 for this comparison. The results of D. tertiolecta transcripts with BlastX hits to C. reinhardtii as reference
Fig. 4GO enrichment result of up-regulated genes in D9 mutant compared to wild-type D. tertiolecta. If a functional group has an enrichment score over 1, functional category is over expressed. A value of 3 corresponds to significant over expression (p-value of less than 0.05)
Fig. 5Pathway analyses for mutant D9 and wild-type D. tertiolecta. Inositol phosphate metabolism, ascorbate acid synthesis, citrate cycle, pyruvate metabolism, and fatty acid biosynthesis pathways in D. tertiolecta constructed based on the information from C. reinhardtii and the transcript abundance of enzymes in D. tertiolecta. Key components are represented in black rectangle boxes. The green color indicates that the respective gene is down-regulated, and red color indicates up-regulation. The corresponding pathways are also represented in the yellow filled circles
Comparisons of homologues between D. tertiolecta and other five green algae with reference genome sequences
| Algal reference |
|
|
|
|
|
|---|---|---|---|---|---|
| Reference Transcripts or proteins # | 19,526 | 9629 | 7796 | 15,285 | 9791 |
| BlastN | |||||
| Transcript hit (E-value = 0) | 82 (0.42 %) | 18 (0.19 %) | 0 (0.00 %) | 24 (0.16 %) | 20 (0.20 %) |
| Transcript hit (E-value ≤10−10) | 574 (2.94 %) | 229 (2.38 %) | 15 (0.19 %) | 246 (1.61 %) | 281 (2.87 %) |
| Transcript hit (E-value ≤10−6) | 611 (3.13 %) | 249 (2.59 %) | 16 (0.21 %) | 269 (1.76 %) | 307 (3.14 %) |
| Transcript hit (E-value ≤10−3) | 654 (3.35 %) | 252 (2.62 %) | 16 (0.21 %) | 278 (1.82 %) | 315 (3.22 %) |
| BlastX | |||||
| Transcript hit (E-value = 0) | 583 (2.99 %) | 356 (3.70 %) | 205 (2.63 %) | 523 (3.42 %) | 239 (2.44 %) |
| Transcript hit (E-value ≤10−10) | 6610 (33.85 %) | 5025 (52.19 %) | 4002 (51.33 %) | 6249 (40.88 %) | 5106 (52.15 %) |
| Transcript hit (E-value ≤10−6) | 7023 (35.97 %) | 5643 (58.60 %) | 4592 (58.90 %) | 6986 (45.70 %) | 5773 (58.96 %) |
| Transcript hit (E-value ≤10−3) | 8561 (43.84 %) | 6393 (66.39 %) | 5267 (67.56 %) | 8162 (53.40 %) | 6604 (67.45 %) |
The numbers before and within brackets are the number of D. tertiolecta transcripts with Blast hits from reference alga and the hitting percentage in the corresponding reference transcripts/proteins
Cre, C. reinhardtii; Csu, C. subellipsoidea; Olu, O. lucimarinus; Vca, V. carteri; ChlN, C. variabillis N64A
Fig. 6Venn diagram of the numbers of D. tertiolecta transcripts with BlastX hits from five model algae. The BlastX cut-off value is 0 for this comparison. A The number of D. tertiolecta transcripts with BlastX hits to C. reinhardtii; B The number of D. tertiolecta transcripts with BlastX hits to C. subellipsoidea C-169; C The number of D. tertiolecta transcripts with BlastX hits to O. lucimarinus; D The number of D. tertiolecta transcripts with BlastX hits to V. carteri; and E The number of D. tertiolecta transcripts with BlastX hits to C. variabilis NC64A
Fig. 7Length distributions of the contigs from Dt_v10. X-axis: number of the contigs. Y-axis: E-value of the contigs
Comparison studies of transcripts from Bag2D program and NCBI
|
| E-value | Length (bps) |
| E-value | Length (bps) | Aligned score | Elongated |
|---|---|---|---|---|---|---|---|
| >Locus_1123_4Transcript_1/2_Confidence_0.800_Length_1500 | 0 | 1500 | >gi|46981381|gb|AY575952.1| Dunaliella tertiolecta assimilatory nitrate reductase (nar) gene, partial cds | 2e−18 | 1313 | – | |
| >gi|311818483|emb|HH768845.1| Sequence 5032 from Patent EP2221382 | 0 | 2531 | 90.6 | ||||
|
| 0 | 3447 | 90.6 | + | |||
|
| 0 | 1881 | >gi|3869303|gb|AF065142.1| Dunaliella tertiolecta glutamine synthetase mRNA, partial cds | 2e−109 | 601 | 92.8453 | |
|
| 1e−144 | 1463 | >gi|2645974|gb|AF034201.1| Dunaliella tertiolecta proliferating cell nuclear antigen (PCNA) mRNA, partial cds | 7e−122 | 616 | 97.8896 | |
|
| 2e−115 | 959 | >gi|12232559|gb|AF036312.2| Dunaliella tertiolecta mitotic cyclin mRNA, partial cds | 6e−47 | 490 | 96.1224 | + |
|
| 1e−132 | 1220 | >gi|167984|gb|M60049.1|DUNCAB D.tertiolecta 28.5-kDa LHCII apoprotein mRNA, complete cds | 2e−132 | 1041 | 91.0663 | |
|
| 3e−122 | 1062 | >gi|225322931|gb|FJ769282.1| Dunaliella tertiolecta ascorbate peroxidase mRNA, partial cds | 1e−72 | 546 | 87.5458 | |
|
| 0 | 1811 | >gi|585087513|gb|KF193066.1| Dunaliella tertiolecta sedoheptulose-1,7-bisphosphatase (SBP) mRNA, complete cds | 2e−180 | 1631 | 97.9767 | |
| >Locus_613_6Transcript_1/1_Confidence_1.000_Length_1405 | 0 | 1405 |
| 0 | 1687 | 90.3915 | + |
| >Locus_284_4Transcript_1/1_Confidence_1.000_Length_1355 | 2e−16 | 1355 |
| 2e−16 | 1560 | 98.2288 | |
| >Locus_17606_9Transcript_1/1_Confidence_1.000_Length_2042 | 0 | 2042 |
| 0 | 2049 | 99.3144 | |
|
| 4e−139 | 2662 | >gi|144601644|gb|EF471039.1| Dunaliella tertiolecta isolate 1 NRT2 (Nrt2) gene, partial cds | 5e−50 | 644 | 94.4099 | |
| >Locus_1062_4Transcript_1/1_Confidence_1.000_Length_630 | 2e−82 | 630 |
| 7e−57 | 840 | 42.5397 | * |
|
| 0 | 1955 | >gi|459938225|gb|KC572136.1| UNVERIFIED: Dunaliella tertiolecta isolate PL1 acetyl-CoA carboxylase beta subunit-like (accD) gene, partial sequence; chloroplast | 1e−42 | 925 | 44.973 | |
|
| 0 | 1448 | >gi|4165328|gb|AF038570.1| Dunaliella tertiolecta cyclin-dependent kinase 1 (DUNCDC2) mRNA, complete cds | 0 | 1061 | 99.7172 | |
| >Locus_652_4Transcript_1/1_Confidence_1.000_Length_2435 | 0 | 2435 |
| 0 | 1515 | 42.8383 | * |
|
| 0 | 2170 | >gi|371532798|gb|JQ039033.1| Dunaliella tertiolecta ATP synthase CF1 beta subunit (atpB) mRNA, partial cds; plastid | 5e−177 | 1167 | 46.8723 | * |
|
| 0 | 2042 | >gi|371532912|gb|JQ039091.1| Dunaliella tertiolecta elongation factor Tu (tufA) mRNA, complete cds; plastid | 8e−154 | 1257 | 47.494 | |
|
| 1e−64 | 943 | >gi|371532800|gb|JQ039034.1| Dunaliella tertiolecta ATP synthase CF1 epsilon subunit (atpE) mRNA, complete cds; plastid | 3e−07 | 408 | 26.4706 | * |
|
| 1e−117 | 1870 | >gi|761262915|gb|KJ930371.1| Dunaliella tertiolectaglucose-6-phosphate dehydrogenase (G6PDH) mRNA, complete cds;chloroplast | 3e−72 | 3246 | 34.6524 | * |
BlastX with C. reinhardtii using cut-off value at E-value ≤10−6 for this comparison
The one in italic font was chosen in the database optimization for each gene
+, the NCBI nucleotide was chosen for the corresponding gene, but it was elongated by part of the sequence from Bag2D
*, the two nucleotides did not quite match from the two methods, which may have been caused by the existence of isoforms or errors of the one from NCBI (determined by E-value and analyses of the sequences)
Fig. 8Length distribution of the contigs in Dt_v10 and Dt_v10-hit. a Statistical results of the length of D. tertiolecta reference contigs after assembly and filtering, total 181Mega-base nucleotides (481,381 contigs) with minimum length of 74 bps, maximum length of 17,995 bps, and average length of 376 bps. b Statistical results of the length of D. tertiolecta reference contigs after assembly and filtering, total 11 Mega-base nucleotides (10,185 contigs) with minimum length of 101 bps, maximum length of 15,975 bps, and average length of 1106 bps