| Literature DB >> 29895888 |
Ha-Yeon Song1, Dae-Hyuk Kim2, Jung-Mi Kim3.
Abstract
Lentinula edodes is a popular cultivated edible mushroom with high nutritional and medicinal value. To understand the regulation of gene expression in the dikaryotic mycelium and mature fruiting body in the commercially important Korean L. edodes strain, we first performed comparative transcriptomic analysis, using Illumina HiSeq platform. De novo assembly of these sequences revealed 11,675 representative transcripts in two different stages of L. edodes. A total of 9,092 unigenes were annotated and subjected to Gene Ontology, EuKaryotic Orthologous Groups, and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. Gene expression analysis revealed that 2,080 genes were differentially expressed, with 1,503 and 577 upregulated in the mycelium and a mature fruiting body, respectively. Analysis of 18 KEGG categories indicated that fruiting body-specific transcripts were significantly enriched in 'replication and repair' and 'transcription' pathways, which are important for premeiotic replication, karyogamy, and meiosis during maturation. We also searched for fruiting body-specific proteins such as aspartic protease, gamma-glutamyl transpeptidase, and cyclohexanone monooxygenase, which are involved in fruiting body maturation and isolation of functional substances. These transcriptomes will be useful in elucidating the molecular mechanisms of mature fruiting body development and beneficial properties, and contribute to the characterization of novel genes in L. edodes.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29895888 PMCID: PMC5997629 DOI: 10.1038/s41598-018-27318-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic of the comparative transcriptome analysis in Lentinula edodes. (A) Representative images of different developmental stages: vegetative mycelium (left) and mature fruiting body of L. edodes (right). Scale bar = 1 cm. (B) Overall workflow of the transcriptome assembly and analysis of the RNA sequencing (RNA-seq) data in L. edodes.
Throughput and quality of Illumina sequencing of the L. edodes transcriptome.
| Samples | Raw data | DynamicTrim dataa | LengthSort datab | |||
|---|---|---|---|---|---|---|
| Mycelium | Fruiting body | Mycelium | Fruiting body | Mycelium | Fruiting body | |
| Total reads | 33,810116 | 37,248796 | 33,810116 | 37,248,796 | 27,646,278 | 30,109,768 |
| Total length (bp) | 3,414,821,716 | 3,762,128,396 | 2,464,553,530 | 2,691,940,823 | 2,296,017,777 | 2,484,920,543 |
| Avg. length | 101 | 101 | 72.91 | 72.26 | 83.06 | 82.50 |
aQuality value: each read was trimmed individually based on a quality score of less than 20.
bStandard minimum read length: reads from which low-quality regions were trimmed, and reads with 25 base pairs (bp) were filtered out.
Functional annotation statistics of the L. edodes assembly data.
| Data | Num. of transcripts | Length of transcripts (bp) | ||||
|---|---|---|---|---|---|---|
| Total | Min | Max | Average | N50a | ||
| Total transcriptsb | 32,001 | 68,886,143 | 200 | 13,637 | 2,152 | 2,900 |
| Representative transcriptsc | 11,675 | 19,872,247 | 200 | 13,637 | 1,702 | 2,443 |
aHalf of all bases are in transcripts at least as long as N50.
bTotal number of transcripts assembled using k-mer 51.
cTranscripts of multi-copy genes are collapsed into a single sequence.
Figure 2Species distribution of the BLASTx results. Species distribution of BLASTx results against the NCBI non-redundant protein database (e-value < 1e−10) and the proportions for each species are shown. Different colors represent different species, and the top 10 species are shown. Species with proportions >1% are shown.
Figure 3Summary of the differentially expressed genes (DEGs) between the mycelium and maturing fruiting body of L. edodes. (A) MA plot showing the means (x-axis) and log2 ratios (y-axis) of differential expression (adjusted p-value less than the significance level) during different developmental stages, based on normalized RNA-seq read counts. Each data point represents a transcript, and those indicating differential expression are colored in red and green. Gray dashed lines indicate thresholds specified for the fold change. (B) Estimates of significantly up-regulated and down-regulated genes in the mycelium versus fruiting body are indicated as a number distribution in each comparison.
Figure 4Venn diagrams showing the intersection of significant DEGs between the mycelium and maturing fruiting body in L. edodes. The numbers of up-regulated and down-regulated genes are summarized relative to the number of DEGs between the two groups in the RNA-seq dataset (false discovery rate ≤ 0.01, log2 ratio ≥ 2).
Figure 5Gene ontology (GO) classification of the L. edodes transcriptome and DEGs between the mycelium and mature fruiting body. L. edodes unigenes were annotated to three functional categories: biological process (BP), cellular component (CC), and molecular function (MF). Bars indicate the number of genes in each GO category. The black and gray bars represent the up- and down-regulated DEGs of the mature fruiting body in L. edodes, respectively.
Figure 6Functional annotation of DEGs in accordance with euKaryotic Orthologous Groups (KOG) categories. All development stage-associated DEGs were assigned to the 26 KOG functional categories using BLASTx. Bars indicate the number of genes in each KOG category that revealed significant changes in expression (log2 ratio ≥ 2) between the two developmental stages in L. edodes. The black and gray bars represent the up- and down-regulated DEGs of the mature fruiting body in L. edodes, respectively.
Figure 7KEGG pathway analysis of the DEGs. Bars indicate the number of genes in each KEGG pathway that revealed significant changes in expression (log2 ratio ≥ 2) between the two developmental stages in L. edodes. The black and gray bars represent the up- and down-regulated DEGs of the mature fruiting body in L. edodes, respectively.
Functional annotation of specific DEGs in the mature fruiting body.
| Unigene ID | Mycelium read count | Fruiting body read count | Log2 fold changea | Description | Speciesb | Evalue | Identity |
|---|---|---|---|---|---|---|---|
| LE_007412 | 5 | 8,057 | 10.60 | Hydrophobin 1 | 2.00E-34 | 99.21 | |
| LE_009311 | 35 | 7,282 | 7.67 | Hypothetical protein GYMLUDRAFT_39929 | 2.00E-47 | 43.02 | |
| LE_007730 | 11 | 6,695 | 9.23 | Aspartic protease | 0 | 98.78 | |
| LE_006927 | 15 | 6,593 | 8.73 | MFS transporter | 2.00E-164 | 41.88 | |
| LE_007440 | 47 | 4,597 | 6.61 | Heat shock protein 9 | 1.00E-53 | 100.00 | |
| LE_003624 | 37 | 3,315 | 6.48 | Hypothetical protein | 0 | 89.00 | |
| LE_007464 | 24 | 3,310 | 7.10 | Gamma-glutamyl transepetidase | 0 | 91.85 | |
| LE_006999 | 28 | 3,192 | 6.80 | DNA mismatch repair protein | 6.00E-90 | 61.46 | |
| LE_007861 | 4 | 3,037 | 9.49 | GPI-anchor protein | 1.00E-29 | 54.90 | |
| LE_007450 | 9 | 2,890 | 8.34 | Cyclohexanone monooxygenase | 0 | 97.00 | |
| LE_007442 | 12 | 2,826 | 7.83 | O-acetylhomoserine (thiol)-lyase | 6.00E-180 | 64.80 | |
| LE_011335 | 4 | 2,723 | 9.32 | Hypothetical protein GYMLUDRAFT_486395 | 0 | 68.95 | |
| LE_006542 | 7 | 2,330 | 8.26 | Protein kinase | 9.00E-67 | 39.38 | |
| LE_006982 | 40 | 2,113 | 5.70 | Amidotransferase | 1.00E-160 | 44.00 | |
| LE_006156 | 7 | 1,962 | 8.13 | GPI-anchor protein | 1.00E-20 | 32.45 | |
| LE_007727 | 3 | 1,853 | 9.08 | Hypothetical protein GYMLUDRAFT_73314 | 4.00E-125 | 52.70 | |
| LE_006715 | 15 | 1,726 | 6.79 | Cysteine hydrolase | 1.00E-175 | 99.00 | |
| LE_007912 | 3 | 1,612 | 9.01 | Hypothetical protein | 1.00E-128 | 100.00 | |
| LE_007695 | 7 | 1,586 | 7.70 | Hypothetical protein GYMLUDRAFT_944329 | 3.00E-22 | 63.77 | |
| LE_006892 | 34 | 1,580 | 5.52 | Hypothetical protein | 3.00E-06 | 56.10 | |
| LE_005107 | 36 | 1,530 | 5.40 | Transcription factor | 0 | 88.12 | |
| LE_007489 | 19 | 1,396 | 6.19 | Hypothetical protein PLICRDRAFT_103741 | 2.00E-93 | 55.29 | |
| LE_007467 | 9 | 1,385 | 7.22 | Hypothetical protein GYMLUDRAFT_89343 | 4.00E-80 | 60.74 | |
| LE_007396 | 22 | 1,266 | 5.83 | Cytochrome P450 | 0 | 86.68 | |
| LE_007448 | 4 | 1,253 | 8.11 | Cytochrome P450 | 0 | 100.00 | |
| LE_005239 | 7 | 1,240 | 7.33 | Hypothetical protein GYMLUDRAFT_39615 | 3.00E-120 | 69.20 | |
| LE_007459 | 2 | 1,138 | 9.26 | Glycoside hydrolase family 61 | 2.00E-54 | 63.87 | |
| LE_011349 | 11 | 1,085 | 6.59 | Splicing factor | 0 | 100 | |
| LE_006616 | 31 | 927 | 4.89 | Hypothetical protein GYMLUDRAFT_265499 | 0 | 66.11 | |
| LE_007328 | 5 | 901 | 7.23 | Hypothetical protein GYMLUDRAFT_172776 | |||
| LE_007951 | 2 | 894 | 8.73 | Hypothetical protein | 5.00E-109 | 96.57 | |
| LE_010885 | 13 | 844 | 6.00 | Cyclin | 3.00E-111 | 50.20 | |
| LE_006898 | 23 | 844 | 5.17 | Reverse transcriptase-RNase H-integrase | 0 | 99.00 | |
| LE_001650 | 49 | 746 | 3.92 | ARM repeat-containing protein | 0 | 94.63 | |
| LE_004420 | 16 | 740 | 5.46 | Alcohol oxidase | 0 | 98.43 | |
| LE_007446 | 16 | 733 | 5.51 | Hypothetical protein | |||
| LE_006790 | 8 | 682 | 6.24 | Carbohydrate esterase family | 0 | 82.79 | |
| LE_007462 | 7 | 670 | 6.44 | Meiosis protein | 0 | 97.83 | |
| LE_005246 | 17 | 608 | 5.11 | DDR48-Heat shock protein | 2.00E-106 | 87.76 | |
| LE_007461 | 10 | 605 | 5.89 | MFS transporter | 0 | 67.12 | |
| LE_006895 | 8 | 604 | 6.22 | Hypothetical protein | 3.00E-122 | 52.14 | |
| LE_001942 | 47 | 602 | 3.65 | Alpha/beta-hydrolase | 8.00E-113 | 92.27 | |
| LE_010744 | 40 | 601 | 3.90 | MFS transporter | 0 | 97.54 | |
| LE_007512 | 48 | 599 | 3.63 | CIPA | 3.00E-88 | 87.62 | |
| LE_007498 | 1 | 587 | 8.55 | Aquaporin | 3.00E-161 | 77.99 | |
| LE_005252 | 11 | 572 | 5.95 | Hypothetical protein GYMLUDRAFT_775031 | 1.00E-156 | 57.88 | |
| LE_007439 | 11 | 544 | 5.52 | Hypothetical protein | 0 | 85.58 | |
| LE_007456 | 1 | 531 | 8.41 | Phospholipase C | 3.00E-16 | 54.65 | |
| LE_009426 | 42 | 531 | 3.64 | Isopentenyl diphosphate isomerase | 0 | 99.23 | |
| LE_006701 | 46 | 531 | 3.53 | Glycosyltransferase family 2 protein | 6.00E-93 | 65.40 | |
| LE_011394 | 5 | 525 | 6.72 | Protein kinase | 0 | 99.58 | |
| LE_005524 | 31 | 513 | 4.01 | Lysine decarboxylase |
| 2.00E-65 | 49.35 |
aLog2 fold change indicates a significant difference between fruiting body gene counts and mycelium gene counts.
bL. edodes, Letinula edodes; G. luxurians, Gymnopus luxurians; G. trabeum, Gloeophyllum trabeum; G. frondosa, Grifola frondosa; R. solani Rhizoctonia solani; F. hepatica, Fistulina hepatica; S. hirsutum, Stereum hirsutum; L. bicolor, Laccaria bicolor; C. hominis, Cryptosporidium hominis; L. sulphureus, Laetiporus sulphureus; P. crispa, Plicaturopsis crispa; M. roreri, Moniliophthora roreri; A. gallica, Armillaria gallica; M. chlorophos, Mycena chlorophos; P. noxius, Phellinus noxi.
Figure 8Validation of the RNA-seq results by quantitative real-time polymerase chain reaction. The log2 fold change is denoted as the ratio of expression in the fruiting body to that in the mycelium. The y-axis represents the expression levels of the genes relative to the housekeeping gene (18S rRNA) for three biological and technical replicates. Significant differences of p < 0.01 were analyzed using Student’s t-test.