| Literature DB >> 25254555 |
Qianqian Li1, Jianguo Liu2, Litao Zhang2, Qian Liu2.
Abstract
BACKGROUND: Algae in the order Trentepohliales have a broad geographic distribution and are generally characterized by the presence of abundant β-carotene. The many monographs published to date have mainly focused on their morphology, taxonomy, phylogeny, distribution and reproduction; molecular studies of this order are still rare. High-throughput RNA sequencing (RNA-Seq) technology provides a powerful and efficient method for transcript analysis and gene discovery in Trentepohlia jolithus. METHODS/PRINCIPALEntities:
Mesh:
Substances:
Year: 2014 PMID: 25254555 PMCID: PMC4177907 DOI: 10.1371/journal.pone.0108488
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Rocks covered with red T. jolithus.
(A) Red-Stone-Valley in winter. (B) Reddish stones along rivers. (C) Microscopic structure of dried T. jolithus. (D) Microscopic structure of rehydrated T. jolithus after a few drops of water was added to the dried material.
Summary of sequence analysis.
| Description | Number (n) | Bases (bp) |
|
| ||
| Total raw reads | 55,007,830 | |
| Total clean reads | 52,390,656 4,715,159,040 | |
| Q20 percentage (%) | 97.95 | |
| N percentage (%) | 0.02 | |
| GC content (%) | 39.88 | |
|
| ||
| Total contigs | 292,122 | 73,002,694 |
| Min length (bp) | 200 | |
| Max length (bp) | 9198 | |
| Average length (bp) | 250 | |
| N50 (bp) | 287 | |
|
| ||
| Total unigenes | 92,414 | 57,175,212 |
| Min length (bp) | 200 | |
| Max length (bp) | 8785 | |
| Average length (bp) | 619 | |
| N50 (bp) | 986 | |
Figure 2Length distribution of assembled contigs and unigenes.
BLAST analysis of non-redundant unigenes against public databases.
| Database | Number of annotated unigenes | Percentage of annotated unigenes |
|
| 37,869 | 91.63% |
|
| 9,674 | 23.41% |
|
| 23,274 | 56.32% |
|
| 26,217 | 63.44% |
|
| 22,018 | 53.28% |
|
| 22,921 | 55.46% |
Figure 3Characteristics of similarity search of unigenes against NR database.
(A) E-value distribution of BLAST hits for each unigene with a cutoff E-value of 1.0E-5. (B) Similarity distribution of the top BLAST hit for each unigene. (C) Species distribution of the top BLAST hit for each unigene in the NR database.
Figure 4Gene Ontology classification of assembled unigenes.
The results are summarized in three main categories: biological process, cellular component and molecular function. In total, 22,018 unigenes with BLAST matches to known proteins were assigned to this gene ontology.
Figure 5Clusters of orthologous groups (COG) functional classification.
In total, 22,921 of the 41,328 sequences with an NR hit were grouped into 25 COG classifications.
Figure 6Distribution of identified SSRs using MISA software.
Summary of putative SNPs found in T. jolithus unigenes.
| SNP Type | Count |
|
| 87,327 (66.42%) |
| A-G | 43,348 |
| C-T | 43,979 |
|
| 44,151 (33.58%) |
| A-C | 8,845 |
| A-T | 21,153 |
| C-G | 5,604 |
| G-T | 8,549 |
|
| 131,478 |
Determination of carotenoid and chlorophyll contents in T. jolithus.
| Type | Average (% of DW) | Range (% of DW) | Ratio (Car/Chl) |
|
| 2.20 | 1.99–2.41 | 3.30 |
|
| 0.67 | 0.63–0.70 |
Figure 7Putative pathway of carbon fixation in T. jolithus, generated by KEGG.
The numbers within the small boxes are enzyme codes. The boxes with a red border are enzymes identified in this study. The boxes with a black border are enzymes not identified in this study.
Figure 8Carotenoid biosynthesis model for T. jolithus based on annotations in the transcriptome library.
Abbreviations of chemical compounds are as follows: GAP, D-glyceraldehyde-3-phosphate; PYR, pyruvate; DXP, 1-Deoxy-D-xylulose 5-phosphate; MEP, 2-C-Methyl-D-erythritol 4-phosphate; CDP-ME, 4-(Cytidine 5′-diphospho)-2-C-methyl-D-erythritol; CDP-ME2P, 2-Phospho-4-(Cytidine 5′-diphospho)-2-C-methyl-D-erythritol; MECP, 2C-methyl-D-erythritol-2,4-cyclodiphosphate; HMBPP, 1-Hydroxy-2-methyl-2-butenyl 4-diphosphate; DMAPP, Dimethylallyl-PP; IPP, Isopentenyl-PP; GPP, Geranyl-PP; FPP, (E,E)-Farnesyl-PP; GGPP, Geranylgeranyl-PP; PPFPP, prephytoene diphosphate; PE, phytoene; PF, 15,9′-dicis-Phytofluene; TζCar, 9,15,9′-tricis-ζ-Carotene; DζCar, 9,9′-Di-cis-ζ-Carotene; LYC, Lycopene; β-CRY, β-Cryptoxanthin; ZEA, Zeaxanthin; ANT, Antheraxanthin; VIO, Violaxanthin; 9-cis-VIO, 9-cis-Violaxanthin; XAN, Xanthoxin; ABAL, Abscisic aldehyde; ABA, Abscisate; α-CRY, α-Cryptoxanthin; ZEI, Zeinoxanthin; LUT, Lutein; α-Car, α-carotene; β-Car, β-carotene; γ-Car, γ-carotene; δ-Car, δ-carotene; ε-Car, ε-carotene. Abbreviations of enzymes are as follows: DXS, 1-deoxy-D-xylulose-5-phosphate synthase; DXR, 1-deoxy-D-xylulose-5-phosphate reductoisomerase; ISPD, 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; CMK, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; MCS, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; HDS, (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase; HDR, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; IDI, isopentenyl-diphosphate delta-isomerase; FDPS, farnesyl diphosphate synthase; GGPS, geranylgeranyl diphosphate synthase, type III; PSY, phytoene synthase; PDS, 15-cis-phytoene desaturase; ZISO, zeta-carotene isomerase; CrtISO, prolycopene isomerase; CrtR-b, lycopene beta-cyclase; CruA, lycopene cyclase CruA; CrtL-b, lycopene beta-cyclase; CrtL-e, lycopene epsilon-cyclase; LUT5, cytochrome P450, family 97, subfamily A (beta-ring hydroxylase); CrtZ, beta-carotene 3-hydroxylase; LUT1, carotene epsilon-monooxygenase; ZEP, zeaxanthin epoxidase; VDE, violaxanthin de-epoxidase; NCED, 9-cis-epoxycarotenoid dioxygenase; ABA2, xanthoxin dehydrogenase; AAO3, abscisic-aldehyde oxidase.