| Literature DB >> 26927096 |
Ning Zhao1, Xi Lin2, Shan-Shan Qi3, Zhi-Mei Luo4, Shuang-Lin Chen5, Shu-Zhen Yan6.
Abstract
Shiraia bambusicola is a species of the monotypic genus Shiraia in the phylum Ascomycota. In China, it is known for its pharmacological properties that are used to treat rheumatic arthritis, sciatica, pertussis, tracheitis and so forth. Its major medicinal active metabolite is hypocrellin A, which exhibits excellent antiviral and antitumor properties. However, the genes involved in the hypocrellin A anabolic pathways were still unknown due to the lack of genomic information for this species. To investigate putative genes that are involved in the biosynthesis of hypocrellin A and determine the pathway, we performed transcriptome sequencing for Shiraia bambusicola S4201-W and the mutant S4201-D1 for the first time. S4201-W has excellent hypocrellin A production, while the mutant S4201-D1 does not. Then, we obtained 38,056,034 and 39,086,896 clean reads from S4201-W and S4201-D1, respectively. In all, 17,923 unigenes were de novo assembled, and the N50 length was 1970 bp. Based on the negative binomial distribution test, 716 unigenes were found to be upregulated, and 188 genes were downregulated in S4201-D1, compared with S4201-W. We have found seven unigenes involved in the biosynthesis of hypocrellin A and proposed a putative hypocrellin A biosynthetic pathway. These data will provide a valuable resource and theoretical basis for future molecular studies of hypocrellin A, help identify the genes involved in the biosynthesis of hypocrellin A and help facilitate functional studies for enhancing hypocrellin A production.Entities:
Keywords: RNA-Seq; Shiraia bambusicola; differentially expressed genes; ultraviolet mutagenesis
Mesh:
Substances:
Year: 2016 PMID: 26927096 PMCID: PMC4813174 DOI: 10.3390/ijms17030311
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Statistics of transcriptome sequencing.
| Sample | S4201-W | S4201-D1 |
|---|---|---|
| Raw reads | 38,230,680 | 39,301,794 |
| Raw bases | 4,778,835,000 | 4,912,724,250 |
| Clean reads | 38,056,034 | 39,086,896 |
| Clean bases | 4,751,476,731 | 4,880,073,139 |
| Valid ratio (base)/% | 99.42 | 99.33 |
| Q30/% | 90.20 | 90.43 |
| GC content/% | 51.00 | 49.00 |
Summary of annotations of unigenes in public databases.
| Database | No. of Matched Unigenes | Percentage/% |
|---|---|---|
| Nr | 14,243 | 79.47 |
| SWISS-PROT | 9011 | 50.28 |
| KOG | 7225 | 40.31 |
| KEGG | 3581 | 19.98 |
| GO | 9647 | 53.82 |
Figure 1KOG of unigenes of Shiraia bambusicola. KOG Functional Classification of transcriptome. In all, 7225 unigenes were clustered into 25 functional categories.
Figure 2Gene Ontology classification of Shiraia bambusicola transcriptome. A total of 9647 (53.82%) unigenes were assigned to their main GO categories (Biological processes (BP), Cellular components (CC), molecular functions (MF)), including 52 sub-categories.
Significant differentially expressed genes between S4201-W and S4201-D1.
| Gene ID | Description | |Log2Fold Change| | Up_Down * | |
|---|---|---|---|---|
| comp12495_c0_seq3 | o-methyltransferase/FAD-dependent monooxygenase | 19.98004 | 0.0000384 | down |
| comp12440_c0_seq1 | hydroxylase | 17.92012 | 0.0000204 | down |
| comp12566_c0_seq1 | fasciclin | 17.23691 | 0.000032 | down |
| comp12222_c0_seq4 | o-methyltransferase/FAD-dependent monooxygenase | 16.88793 | 0.0000458 | down |
| comp11902_c0_seq9 | polyketide synthase | 15.5618 | 0.000334 | down |
| comp9636_c0_seq2 | similar to NADPH-cytochrome P450 reductase | 15.35454 | 0.000492 | down |
| comp11380_c0_seq2 | putative multicopper oxidase | 14.86465 | 0.001313 | down |
| comp6304_c0_seq2 | carboxypeptidase | 14.7857 | 0.00154 | down |
| comp11297_c0_seq3 | putative MFS drug efflux transporter protein | 14.73989 | 0.00169 | down |
| comp11445_c0_seq2 | amino acid transporter | 13.91772 | 0.008498 | down |
| comp10001_c0_seq5 | FAD/FMN-dependent oxidoreductase | 13.86749 | 0.009314 | down |
| comp17278_c0_seq1 | plastin-3 | 13.68616 | 0.012799 | down |
| comp11463_c0_seq1 | glycoside hydrolase family 95 protein | 13.57229 | 0.015462 | down |
| comp17195_c0_seq1 | mannitol 1-phosphate dehydrogenase 1 | 13.24513 | 0.02557 | down |
| comp15202_c0_seq1 | alpha/beta-hydrolase | 12.89003 | 0.040099 | down |
| comp17704_c0_seq1 | chitin synthase D | 12.7117 | 0.04854 | down |
| comp11073_c0_seq3 | ATP synthase subunit 6 | 20.3094 | 0.0000172 | up |
| comp15750_c0_seq1 | cytochrome oxidase subunits 1 and 2 polyprotein | 18.0501 | 0.00000343 | up |
| comp6469_c0_seq2 | putative methyltransferase type 11 protein | 17.7353 | 0.00000373 | up |
| comp2455_c0_seq1 | similar to MFS peptide transporter Ptr2 | 17.5382 | 0.00000412 | up |
| comp10718_c0_seq3 | heat shock protein 90a | 17.2418 | 0.00000508 | up |
| comp17877_c0_seq1 | NADH dehydrogenase subunit 1 | 16.9127 | 0.00000697 | up |
| comp7051_c0_seq1 | putative family transcriptional regulator protein | 16.8352 | 0.00000761 | up |
| comp2192_c0_seq2 | decaprenyl-diphosphate synthase subunit 1 | 16.4804 | 0.000012 | up |
| comp7011_c0_seq2 | 60S ribosomal protein L2 | 16.4285 | 0.0000129 | up |
| comp7761_c0_seq1 | similar to kelch repeat protein | 16.2001 | 0.0000182 | up |
| comp7729_c0_seq1 | similar to protein phosphatase 2C | 15.9167 | 0.0000292 | up |
| comp22695_c0_seq1 | similar to sugar transporter | 15.5938 | 0.0000525 | up |
| comp16042_c0_seq1 | elongation factor 3 | 15.5433 | 0.0000578 | up |
| comp15117_c0_seq1 | chitin synthase | 15.5174 | 0.0000609 | up |
| comp5090_c0_seq1 | phosphate permease (PHO89/Pi cotransporter PHO89) | 15.3805 | 0.0000805 | up |
| comp1206_c0_seq1 | laccase | 15.3337 | 0.0000889 | up |
| comp7656_c0_seq1 | chitin synthase 4 | 15.2418 | 0.000108 | up |
| comp9653_c1_seq2 | glutathione | 15.1638 | 0.000129 | up |
| comp8271_c0_seq2 | efflux pump antibiotic resistance protein | 15.0455 | 0.000168 | up |
| comp10576_c0_seq1 | similar to 40s ribosomal protein S15 | 15.0162 | 0.00018 | up |
| comp18277_c0_seq1 | similar to 60s acidic ribosomal protein P0 | 15.0088 | 0.000183 | up |
| comp7104_c0_seq1 | high affinity glucose transporter | 14.9635 | 0.000203 | up |
| comp9911_c0_seq2 | cytochrome P450 monooxygenase, putative | 14.7665 | 0.000318 | up |
| comp15511_c0_seq1 | glycoside hydrolase family 105 protein | 14.7576 | 0.000324 | up |
| comp5983_c0_seq2 | aspartyl-tRNA synthetase | 14.7218 | 0.000352 | up |
| comp9603_c0_seq2 | MFS general substrate transporter | 14.6184 | 0.000445 | up |
| comp12046_c0_seq3 | chp3 | 14.6086 | 0.000456 | up |
| comp1492_c0_seq1 | putative duf341 domain protein | 14.5174 | 0.000561 | up |
| comp9605_c0_seq1 | similar to WSC domain-containing protein | 14.4313 | 0.000683 | up |
| comp8564_c0_seq1 | conserved hypothetical protein | 14.2793 | 0.000966 | up |
| comp17817_c0_seq1 | cysteine desulfurase, mitochondrial precursor | 14.2163 | 0.001115 | up |
| comp20949_c0_seq1 | similar to beta-1,3-glucan synthase | 14.1368 | 0.001334 | up |
| comp6798_c0_seq1 | similar to 40S ribosomal protein S9 | 14.1368 | 0.001334 | up |
| comp16828_c0_seq1 | transcription factor PacC | 14.1093 | 0.001419 | up |
| comp8237_c0_seq2 | similar to delta-12 fatty acid desaturase | 14.1093 | 0.001419 | up |
| comp9104_c0_seq1 | eukaryotic translation initiation factor 1A | 14.0813 | 0.001511 | up |
| comp9278_c0_seq1 | transport protein, putative | 14.0813 | 0.001511 | up |
| comp5653_c0_seq1 | 40S ribosomal protein S5 | 14.0671 | 0.00156 | up |
| comp8053_c0_seq1 | putative short-chain dehydrogenase reductase family protein | 14.0383 | 0.001663 | up |
| comp2722_c0_seq1 | integral membrane protein, putative | 14.0088 | 0.001774 | up |
| comp8290_c0_seq2 | ferric reductase transmembrane component | 13.9939 | 0.001834 | up |
| comp6711_c0_seq1 | conserved hypothetical protein | 13.9788 | 0.001895 | up |
| comp2304_c0_seq1 | cytochrome P450 | 13.8846 | 0.002321 | up |
| comp2698_c0_seq1 | short chain dehydrogenase family protein | 13.8846 | 0.002321 | up |
| comp18073_c0_seq1 | mitochondrial fusion GTPase protein | 13.8683 | 0.002403 | up |
| comp15182_c0_seq1 | vacuolar protein sorting-associated protein 45 | 13.8518 | 0.002488 | up |
| comp9144_c0_seq2 | cytochrome P450 monooxygenase | 13.8518 | 0.002488 | up |
| comp82_c0_seq1 | 60S ribosomal protein L14 | 13.8183 | 0.002668 | up |
| comp20237_c0_seq1 | similar to aromatic amino acid aminotransferase | 13.6757 | 0.003561 | up |
| comp7159_c0_seq1 | similar to 40S ribosomal protein S14 | 13.6377 | 0.003835 | up |
| comp7442_c0_seq1 | actin | 13.5174 | 0.004829 | up |
| comp542_c0_seq1 | pyrrolocin enoylreductase | 13.475 | 0.005228 | up |
| comp2644_c0_seq1 | heat shock protein Hsp88 | 13.4089 | 0.005901 | up |
| comp2635_c0_seq1 | C protein immunoglobulin-A-binding beta antigen | 13.3862 | 0.006148 | up |
| comp8431_c0_seq2 | mitochondrial citrate synthase | 13.3862 | 0.006148 | up |
| comp20213_c0_seq1 | similar to elongation of fatty acids protein | 13.3631 | 0.006406 | up |
| comp949_c0_seq1 | similar to adenosine deaminase family protein | 13.2163 | 0.00823 | up |
| comp2229_c0_seq2 | similar to zinc/cadmium resistance protein | 13.1368 | 0.009341 | up |
| comp6853_c0_seq2 | cyclin | 12.9325 | 0.012519 | up |
| comp15194_c0_seq1 | RNA polymerase II mediator complex component | 12.8352 | 0.014188 | up |
| comp617_c0_seq1 | chloroperoxidase | 12.8012 | 0.014797 | up |
| comp15885_c0_seq1 | similar to heat shock protein | 12.7665 | 0.015435 | up |
| comp8520_c0_seq1 | Beta-lactamase family protein | 12.7665 | 0.015435 | up |
| comp14744_c0_seq1 | SH3 domain containing protein | 12.7308 | 0.016106 | up |
| comp18002_c0_seq1 | NADH-ubiquinone oxidoreductase | 12.6943 | 0.016811 | up |
| comp2251_c0_seq1 | similar to cystathionine beta-synthase | 12.6943 | 0.016811 | up |
| comp14820_c0_seq1 | geranylgeranyl diphosphate synthase, putative | 12.6184 | 0.018339 | up |
| comp16310_c0_seq1 | conserved hypothetical protein | 12.6184 | 0.018339 | up |
| comp3019_c0_seq1 | similar to benzoate 4-monooxygenase cytochrome P450 | 12.4533 | 0.021976 | up |
| comp17256_c0_seq1 | similar to phosphoglucomutase | 12.3631 | 0.024168 | up |
| comp8045_c0_seq3 | MFS transporter | 12.2163 | 0.028088 | up |
| comp14626_c0_seq1 | similar to 26 proteasome complex subunit Sem1 | 12.1093 | 0.031252 | up |
| comp11395_c0_seq4 | NAD(P)-binding Rossmann-fold containing protein | 12.0528 | 0.033043 | up |
| comp5357_c0_seq1 | TPA: cytochrome P450, putative (Eurofung) | 12.0528 | 0.033043 | up |
| comp16027_c0_seq1 | putative cytochrome P450 | 11.9939 | 0.034993 | up |
| comp19911_c0_seq1 | AF360398_16-methylsalicylic acid synthase | 11.9325 | 0.037123 | up |
| comp5843_c0_seq1 | proteasome regulatory subunit 12 | 11.9325 | 0.037123 | up |
| comp16424_c0_seq1 | microbial terpene synthase-like protein | 11.8683 | 0.03946 | up |
| comp9499_c0_seq2 | annexin A7 | 11.8012 | 0.042037 | up |
| comp5096_c1_seq1 | pyrrolocin synthetase | 11.6568 | 0.048082 | up |
| comp22148_c0_seq1 | integral membrane protein | 11.6568 | 0.048082 | up |
* Significant upregulation or downregulation of genes in S4201-D1 as compared to those in S4201-W.
Figure 3Putative HA biosynthetic pathway-related genes and products in Shiraia bambusicola. Reactions catalyzed at every step are shown. The broken arrow represents putative steps, in which the enzymes involved are not yet clear.
Figure 4Quantitative RT-PCR validation. The relative expression levels of mRNA were normalized with internal reference gene (GAPDH) and relative expression to the corresponding values of S4201-W (control) were given an arbitrary value of 1. (A) 1: Multicopper oxidase, 2: Fasciclin, 3: Polyketide synthase, 4: O-methyltransferase/FAD-dependent monooxygenase, 5: Hydroxylase, 6: FAD/FMN-dependent oxidoreductase, 7: Pyridoxal reductase; (B) 8: Aldehyde dehydrogenase (NAD+), 9: Cytochrome P450, 10: Alcohol dehydrogenase.