
Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum.

Hongmei Luo, Chao Sun, Jingyuan Song, Jin Lan, Ying Li, Xiwen Li, Shilin Chen.

Abstract

BACKGROUND: Little genomic or transcriptomic information is available for Ganoderma lucidum (Lingzhi). This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library.
METHODS: A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences comprising contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was performed by online analysis.
RESULTS: A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified.
CONCLUSION: The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum.


Year:  2010        PMID: 20230644      PMCID: PMC2848221          DOI: 10.1186/1749-8546-5-9

Source DB:  PubMed          Journal:  Chin Med        ISSN: 1749-8546            Impact factor:   5.455


Background

Ganoderma lucidum (Curtis: Fr.) P. Karst, Lingzhi in Chinese, which belongs to the Polyporaceae family, has been used in China as medicine for centuries to promote health and longevity [1,2]. In other countries, its fruiting body is used to treat a variety of ailments, such as cancers, hypertension, diabetes, and hepatitis, apart from being a dietary supplement [2-4]. G. lucidum has been reported to exert anti-tumour activity via immune modulation or stimulation of cytokine production [5-7]. The bioactive constituents of G. lucidum include more than 120 different triterpenes as well as polysaccharides, proteins and other compounds [2,8]. Genes involved in the triterpenoid biosynthesis pathway in G. lucidum, including squalene synthase (SQS), farnesyl-diphosphate synthase (GlFPS) and HMG-CoA reductase (Gl-HMGR), have been isolated and characterized [9-11]. Joo et al. identified a laccase gene (GLLac1) from G. lucidum [12]. However, little is known about the molecular biology of its fruiting body and its secondary metabolism. Identification of expressed genes, in particular the transcript profile, of the G. lucidum fruiting body would be a key to understanding its molecular biology. Expressed sequence tag (EST) analysis allows rapid and large-scale identification of uniquely expressed genes [13,14]. EST analysis has been used in transcriptome analysis of Lentinula edodes [15], Aspergillus niger [16], Ustilago maydis [17] and Neurospora crassa [18]. Sequencing information from ESTs may help discover genes in the biosynthesis of secondary metabolites [19]. For example, van de Loo et al. identified a gene involved in the ricinoleic acid biosynthetic pathway [20]. Recently, genes encoding enzymes involved in the biosynthesis of ginsenoside, triterpene saponin and diterpenes were identified [21-23]. EST sequencing has also been used to identify simple sequence repeats (SSRs) for genetic mapping [24].
Using EST analysis, the present study annotated functional genes involved in the biosynthesis of secondary metabolites and the developmental regulation of the fruiting body of G. lucidum. Unique sequences very similar to squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. We also discuss several candidate transcripts possibly associated with the cellular development of G. lucidum, such as hydrophobin, MOB2, profilin and PHO84. Moreover, the SSRs identified in the EST data may be useful in marker-assisted breeding programs.

Methods

RNA extraction and cDNA library construction

The fruiting body of G. lucidum was obtained from the co-author Jin Lan, who has long been engaged in Ganoderma research in the Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China. She authenticated the specimen by morphological identification with reference to the Fungi Identification Manual [25]. After 50 days of growth on basswood medium at 25-30°C in a shade shelter, approximately 0.5 g of fruiting body was harvested and immediately frozen in liquid nitrogen. The mRNA of the fruiting body was isolated and purified directly with a Dynabeads® mRNA DIRECT™ kit (Invitrogen, USA) according to the manufacturer's recommendations. The cDNA library was constructed from purified mRNA with a Creator™ SMART™ cDNA Library Construction kit (Clontech, USA). The double-stranded cDNA was directionally ligated into the Sfi I restriction site of the pDNR-lib vector (Clontech, USA) and electroporated into a DH5α Escherichia coli strain (TaKaRa, Japan).

EST sequencing, assembly and annotation

A total of 1,023 randomly selected clones were cultured in liquid LB medium containing 34 mg/l chloramphenicol and incubated overnight at 37°C with shaking at 220 rpm. Plasmid DNA was prepared with an Axyprep-96 plasmid kit (Axygen, USA). The plasmid DNA was submitted for direct sequencing from the 5' end with an M13 forward primer on an ABI 3730 DNA sequencer using BigDye 3.1 sequencing chemistry (Applied Biosystems, USA). The ABI-formatted chromatogram sequences were processed automatically with a local EST analysis pipeline. The Phred/Phrap program was applied for trace file conversion and for base calling with quality assessment [26,27]. Vector and low-quality regions were removed from the sequences with the Cross_Match program included in the Phred/Phrap package. Short sequences (less than 100 bp) and poly A/T tails were filtered from the EST database. The high-quality ESTs were assembled into contigs (clusters of assembled ESTs) and singletons (sequences found only once) by Phrap [28]. The unique sequences were searched against public databases including the SwissProt [29], NCBI non-redundant protein (Nr) [30] and non-redundant nucleotide (Nt) [31] databases using the BLAST [32] algorithm, with an E-value cut-off of 10⁻⁵. The unique sequences were assigned to broad functional categories: metabolism, energy production, cell signalling, cell defence and stress response, cell structure and growth, transcription, protein synthesis, protein degradation, transport and secretion, as well as unclassified and unknown function.
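The filtering step described above (removal of poly A/T tails and of reads shorter than 100 bp) can be sketched in a few lines of Python. This is only an illustration, not the authors' local pipeline: the function names and the tail threshold of 10 bases are assumptions, since the text does not state how a poly A/T tail was defined.

```python
import re

MIN_LENGTH = 100  # reads shorter than this are discarded, as stated in the text

# Assumption: a run of 10 or more A's at the 3' end, or T's at the 5' end,
# counts as a poly A/T tail. The paper does not give its threshold.
POLY_A_TAIL = re.compile(r"A{10,}$")
POLY_T_HEAD = re.compile(r"^T{10,}")


def trim_poly_tails(seq: str) -> str:
    """Remove a trailing poly-A run and a leading poly-T run."""
    seq = seq.upper()
    seq = POLY_A_TAIL.sub("", seq)
    seq = POLY_T_HEAD.sub("", seq)
    return seq


def filter_ests(ests: dict) -> dict:
    """Trim tails, then keep only reads of at least MIN_LENGTH bases."""
    cleaned = {}
    for name, seq in ests.items():
        trimmed = trim_poly_tails(seq)
        if len(trimmed) >= MIN_LENGTH:
            cleaned[name] = trimmed
    return cleaned
```

Trimming before the length check matters: a read that is long only because of its tail would otherwise pass the 100 bp filter.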

SSR detection

The detection of simple sequence repeats (SSRs) from the total high-quality ESTs of the fruiting body of G. lucidum was performed with the Simple Sequence Repeat Identification Tool (SSRIT) [33]. The SSRIT accepts FASTA-formatted sequence files and reports the sequence ID, SSR motif, number of repeats (di- and tri-nucleotide repeat units), repeat length and position of the SSR and the total length of the sequence in which the SSRs were found [34]. The search parameters for the maximum motif-length group were set to hexamer and those for the minimum number of repeats were set to five.
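The search described above can be approximated with a regular expression. The sketch below is not SSRIT itself; it is a minimal stand-in, assuming motifs of two to six bases and at least five tandem copies as in the stated parameters. The function name `find_ssrs` and the 1-based, inclusive coordinates are choices made for this illustration and may differ from the conventions SSRIT uses in its report.

```python
import re

MAX_MOTIF_LEN = 6  # up to hexamer, as in the text
MIN_REPEATS = 5    # minimum number of tandem copies, as in the text


def is_compound(motif: str) -> bool:
    """True if the motif is itself a repeat of a shorter unit (e.g. ACAC, AAA)."""
    k = len(motif)
    return any(
        motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0
    )


def find_ssrs(seq, max_motif_len=MAX_MOTIF_LEN, min_repeats=MIN_REPEATS):
    """Return (motif, repeat_count, start, end) tuples, 1-based inclusive."""
    seq = seq.upper()
    hits = []
    for k in range(2, max_motif_len + 1):
        # A k-base motif followed by at least (min_repeats - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_compound(motif):
                continue  # already covered by a shorter motif length
            repeats = len(m.group(0)) // k
            hits.append((motif, repeats, m.start() + 1, m.end()))
    return sorted(hits, key=lambda h: h[2])
```

For a FASTA file, the function would be applied to each record's sequence in turn; mono-nucleotide runs are deliberately ignored because the motif length starts at two.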

Results and discussion

General characteristics of G. lucidum fruiting body cDNA library and ESTs

A cDNA library was constructed from the fruiting body of G. lucidum for the identification of the transcripts and the expression profiles involved in its cellular development and biosynthesis of secondary metabolites. The cDNA library had a titre of 1.25×10⁶ colony forming units per millilitre (ml). A total of 1,023 cDNA clones were randomly selected from the library for sequencing, yielding 879 (85.9%) high-quality ESTs after vector screening and short sequence (<100 bp) filtering. These ESTs were assembled into 82 contigs and 518 singletons for a total of 600 unique sequences (Table 1). The average length of these unique sequences was 288 bp, ranging from 0.15 kb to 1.5 kb. Of the contigs, 63.4% had two sequences, 22.0% had three to four sequences and 14.6% had five to 40 sequences. Approximately 31.75% of the ESTs were redundant. A total of 427 unique sequences (71.1%) displayed no similarity to any sequence in the public databases and are probably new transcripts. The moderate redundancy (31.75%) suggests that continued sequencing of random colonies from this cDNA library is likely to yield many more new transcripts. The sequenced transcripts have been deposited in the GenBank database (GO447131-GO448009).
Table 1

Overview of the characteristics of the cDNA library of G. lucidum fruiting body

Description                                  Number
Total number of clones sequenced             1,000
Total high quality ESTs                      879
Total unique genes                           600
Average length per unique sequence (bp)      288
Number of contigs                            82
Number of singletons                         518
Redundancy (%)                               31.75
Number of annotated unique sequences         173
Number of non-annotated unique sequences     427
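The summary figures for the library can be re-derived from the counts reported in the text. The sketch below assumes that redundancy is defined as the share of ESTs beyond the first copy of each unique sequence, (ESTs − unique sequences) / ESTs; the paper does not state its formula, so this definition and the function name are assumptions.

```python
def library_summary(clones, ests, contigs, singletons, annotated):
    """Derive headline library statistics from raw counts."""
    unique = contigs + singletons
    return {
        "unique_sequences": unique,
        "high_quality_pct": 100 * ests / clones,
        # Assumed definition: ESTs that add no new unique sequence.
        "redundancy_pct": 100 * (ests - unique) / ests,
        "non_annotated": unique - annotated,
        "non_annotated_pct": 100 * (unique - annotated) / unique,
    }


# Counts taken from the text: 1,023 clones, 879 ESTs, 82 contigs,
# 518 singletons and 173 annotated unique sequences.
summary = library_summary(
    clones=1023, ests=879, contigs=82, singletons=518, annotated=173
)
for key, value in summary.items():
    print(key, round(value, 2))
```

Under this definition the redundancy comes out at roughly 31.7%, in line with the reported value.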

Expressed profile of the unique sequences

The expression profile of the unique sequences identified in the G. lucidum fruiting body is shown in Table 2. Among the 600 unique sequences, 518 (86.3%) were sequenced only once, 72 (12%) were sequenced 2-5 times, five (0.8%) 6-10 times and five (0.8%) 12 times or more. The most abundantly expressed unique sequences in the G. lucidum fruiting body encoded a hypothetical protein (48 ESTs) and a cell wall-associated hydrolase (36 ESTs) (Table 3). A unique sequence consisting of five ESTs, with sequence similarity to hydrophobin 2 of Lentinula edodes (Xianggu), was identified for the first time in G. lucidum (Table 3). Unique sequences matching elongation factors and ribosomal proteins were also expressed at high levels (Table 3).
Table 2

Occurrence of ESTs in unique sequences

Number of ESTs in a unique sequence    Number of unique sequences
1                                      518
2                                      52
3                                      13
4                                      5
5                                      2
6                                      1
7                                      1
8                                      2
10                                     1
12                                     1
13                                     1
22                                     1
36                                     1
48                                     1
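The grouping of unique sequences by EST count quoted in the text can be reproduced from the rows of Table 2. The following is a small sketch; the bin labels and the function name are chosen for this illustration only.

```python
# Rows of Table 2: number of ESTs in a unique sequence -> number of such sequences.
occurrence = {
    1: 518, 2: 52, 3: 13, 4: 5, 5: 2, 6: 1, 7: 1,
    8: 2, 10: 1, 12: 1, 13: 1, 22: 1, 36: 1, 48: 1,
}


def bin_occurrence(table):
    """Group unique sequences into the abundance classes used in the text."""
    bins = {"1": 0, "2-5": 0, "6-10": 0, ">10": 0}
    for est_count, n_sequences in table.items():
        if est_count == 1:
            bins["1"] += n_sequences
        elif est_count <= 5:
            bins["2-5"] += n_sequences
        elif est_count <= 10:
            bins["6-10"] += n_sequences
        else:
            bins[">10"] += n_sequences
    return bins


bins = bin_occurrence(occurrence)
total = sum(bins.values())
for label, count in bins.items():
    print(f"{label}: {count} ({100 * count / total:.1f}%)")
```

The singleton class corresponds to the 518 singletons of Table 1, and the remaining classes together account for the 82 contigs.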
Table 3

Highly expressed transcripts in G. lucidum fruiting body cDNA library

No. of ESTs    BLASTX annotation                                                E-value
48             Hypothetical protein [Rattus norvegicus]                         2.00E-16
36             Cell wall-associated hydrolase [Capnocytophaga sputigena Capno]  2.00E-14
8              Predicted protein [Coprinopsis cinerea okayama7#130]             1.00E-11
8              Protein TAR1 [Kluyveromyces lactis]                              8.00E-21
5              Hydrophobin 2 [Lentinula edodes]                                 3.00E-16
5              Elongation factor 1-alpha [Schizophyllum commune]                2.00E-70
4              Predicted protein [Laccaria bicolor S238N-H82]                   1.00E-28
3              Predicted protein [Laccaria bicolor S238N-H82]                   3.00E-68
3              Acyl-CoA-binding protein [Chaetophractus villosus]               8.00E-21
3              Elongation factor 2 [Debaryomyces hansenii]                      8.00E-73
3              40S ribosomal protein S11 [Schizosaccharomyces pombe]            3.00E-57

Annotation of expressed sequence tags

The list of the annotated ESTs found in the fruiting body of G. lucidum is shown in Additional file 1. Sixty-two (62) ESTs showed sequence similarities to uncharacterized genes encoding hypothetical proteins and were omitted from the list. The unique sequences from this cDNA library were analyzed for similarities by performing BLAST searches against public databases, including SwissProt [29], Nr [30] and Nt [31]. A total of 139 (23.2%) and 67 (11.2%) unique sequences were assigned a putative identity based on significant sequence similarities to at least one sequence in the Nr and Nt databases, respectively. These annotated unique sequences provide a resource for applied and basic microbiology. Furthermore, among the 879 ESTs, only three (0.3%) were identified as homologues of previously reported nucleotide sequences from G. lucidum in the GenBank database, indicating that the vast majority of the ESTs in our dataset were new. The three unique sequences showed similarities to cytochrome c oxidase subunit 2 (GO447869), glyceraldehyde-3-phosphate dehydrogenase (GO447698) and FPS (GO447502) (Additional file 1).
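Assigning a putative identity from BLAST results amounts to keeping, for each unique sequence, the best hit that passes the E-value cut-off. The sketch below assumes the 12-column tabular output produced by NCBI BLAST+ with `-outfmt 6`, in which the E-value is the eleventh column; the study may well have used a different BLAST version or output format, and the function name is hypothetical.

```python
E_VALUE_CUTOFF = 1e-5  # cut-off stated in the Methods


def best_hits(lines, cutoff=E_VALUE_CUTOFF):
    """Return {query_id: (subject_id, e_value)} for the best hit per query.

    Assumes tab-separated BLAST+ `-outfmt 6` lines:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    best = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        e_value = float(fields[10])
        if e_value > cutoff:
            continue  # not significant at the chosen cut-off
        if query not in best or e_value < best[query][1]:
            best[query] = (subject, e_value)
    return best
```

The number of annotated unique sequences is then simply `len(best_hits(open_file))` for each database searched, and sequences absent from the result are the candidates for new transcripts.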

Functional distribution of ESTs

The functions of the proteins that the identified sequences encoded were classified into categories of metabolism, energy production, cell signalling, cell defence and stress response, cell structure and growth, transcription, protein synthesis, protein degradation, transport and secretion as well as unclassified and unknown functions. The functional distribution of identified sequences from the G. lucidum fruiting body cDNA library is shown in Figure 1. The unique sequences associated with metabolism (22%), protein synthesis (22.7%), unclassified and unknown function (12.7%) and energy production (11.3%) were strongly represented, whereas those with cell signalling (2.0%) and cell defence and stress response (3.3%) were not. In summary, a total of 173 unique sequences showed similarities to known genes involved in the biosynthesis of secondary metabolites and developmental regulation. While the EST sequencing scale is limited, it provides some information about the expressed transcript profile of the fruiting body of G. lucidum.
Figure 1

Distribution of ESTs by broad functional categories. The functional categories include: metabolism, energy production, cell signaling, cell defence and stress response, cell structure and growth, transcription, protein synthesis, protein degradation, transport and secretion, and unclassified and unknown function.

SSRs, also known as microsatellites, are useful genetic markers in molecular biology. A total of 13 SSR motifs were identified from the EST sequences of the fruiting body of G. lucidum (Table 4). The di-nucleotide SSRs comprised AC (two), AT (two), CT (one), GA (one), GT (one), TA (one), TC (one) and TG (two). Only two tri-nucleotide repeats, with the compositions CGA and GGT, were found in the EST dataset (Table 4). Each SSR-containing unique sequence contained only one SSR motif. Eight of the repeats were nine bases long and the other five were between 11 and 15 bases. Tetra-, penta- and hexa-nucleotide repeats were not present in this EST dataset. The frequency of SSR motifs is known to vary widely among species [35].
Table 4

The di- and tri-nucleotide repeats in G. lucidum fruiting body ESTs

SSR-containing EST    Motif    Repeat No.    SSR start    SSR end    SSR length    Sequence length
GO447147              AC       5             246          255        9             340
GO447693              AC       8             133          148        15            160
GO447650              AT       5             64           73         9             202
GO447380              AT       5             45           54         9             180
GO447641              CT       5             315          324        9             402
GO447753              GA       7             71           84         13            131
GO447502              GT       6             86           97         11            172
GO447423              TA       5             138          147        9             158
GO447358              TC       5             299          308        9             520
GO447397              TG       5             12           21         9             180
GO447977              TG       5             96           105        9             131
GO447241              CGA      5             88           102        14            293
GO447840              GGT      5             38           52         14            195

Candidate genes involved in the biosynthesis of triterpenoids

EST analysis is an important tool to identify secondary metabolite genes in the fruiting body of G. lucidum. Triterpenoids, the major bioactive compounds in G. lucidum, are synthesized from acetyl-CoA in the isoprenoid pathway. While genes involved in the triterpenoid biosynthetic pathway, including SQS, GlFPS and Gl-HMGR, have been cloned from and identified in G. lucidum [9-11], other genes for the key enzymes in this pathway remain to be identified. According to studies of triterpene biosynthesis [10,36], SE and FPS are rate-limiting enzymes in triterpenoid biosynthesis in G. lucidum. A unique sequence (GO447913) with 71% identity to SE (E-value = 1.00E-12) and a unique sequence (GO447502) with 98% identity to FPS (E-value = 4.00E-27) were present in our EST data. SE acts as an important regulatory enzyme in the triterpenoid biosynthetic pathway. SE (EC 1.14.99.7), a monooxygenase, converts squalene into 2,3-oxidosqualene [36,37]. The enzyme requires molecular oxygen, flavin adenine dinucleotide (FAD) and either NADH or NADPH, depending on the organism [38]. Since the gene encoding SE has not been identified in G. lucidum, the unique sequence (GO447913) will help identify and characterize SE in G. lucidum. The EST for FPS (GO447502) shows sequence similarity to GlFPS, suggesting that this EST is a partial sequence of the full-length GlFPS. The genes encoding other key enzymes involved in triterpenoid biosynthesis, such as SQS and Gl-HMGR, are not present in this EST dataset, which indicates either a low abundance of these transcripts in the fruiting body of G. lucidum or incomplete sequencing of the library. The absence of ESTs associated with polysaccharide biosynthesis, which should be abundant in G. lucidum, may likewise be due to the limited sequencing scale.

Candidate genes involved in regulation of G. lucidum development

Several transcripts present in the EST dataset encode proteins that may be associated with the developmental processes of the fruiting body of G. lucidum. The unique sequence (GO447641) showed sequence similarity to the inorganic phosphate transporter gene PHO84, which controls phosphate uptake and regulates the development of Saccharomyces cerevisiae [39]. MOB2 is a nonessential yeast gene that plays a role in the maintenance of ploidy [40]. The unique sequences homologous to MOB2 (GO447972) and PHO84 (GO447641) may have the same functions as those in yeast. Hydrophobin is expressed specifically in filamentous fungi and is important during fungal morphogenesis and the fruiting body development of mushrooms [41]. The unique sequence homologous to hydrophobin 2 of Lentinula edodes in this cDNA library consisted of five ESTs (GO447695, GO447166, GO447364, GO447414, GO447512), suggesting its abundance in the fruiting body of G. lucidum. Suizu et al. (2008) reported that ESTs for hydrophobins were also among the most frequently identified in the cDNA library of Lentinula edodes [15]. Profilin is a universal small eukaryotic protein that binds to monomeric actin (G-actin) and is involved in diverse functions such as maintenance of cell structural integrity, cell motility and growth factor signal transduction [42]. Sequences (GO447955, GO447282) encoding profilin were present in the G. lucidum cDNA library. The unique sequence encoding an argonaute-like protein (GO447302) may be involved in the RNAi pathway, suggesting that gene knock-out by RNA interference may be feasible in G. lucidum. Cloning and characterization of these candidate genes is under way.

Limitations of the study

The ESTs sequenced in this study from the fruiting body of G. lucidum were insufficient to cover all functional genes, although this EST dataset showed some characteristics of gene expression in the fruiting body of G. lucidum.

Conclusion

The present study used EST analysis to identify transcripts involved in the biosynthesis of secondary metabolites and the developmental regulation of G. lucidum. For example, a candidate transcript encoding SE, a rate-limiting enzyme in triterpenoid biosynthesis, was identified. Several genes associated with the developmental processes of G. lucidum, such as hydrophobin, MOB2, profilin and PHO84, were also identified.

Abbreviations

BLAST: Basic Local Alignment Search Tool; bp: base pair; cDNA: complementary DNA; EST: expressed sequence tag; FPS: farnesyl-diphosphate synthase; HMGR: HMG-CoA reductase; NCBI: National Center for Biotechnology Information; Nr: NCBI non-redundant protein; Nt: NCBI non-redundant nucleotide; SE: squalene epoxidase; SQS: squalene synthase; SSRs: simple sequence repeats

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

HML analyzed the data and drafted the manuscript. CS and YL participated in the data analysis. JYS participated in the study design. JL and XWL collected tissue samples. SLC evaluated the results and revised the manuscript. All authors read and approved the final version of the manuscript.

Additional file 1

Putative functions of partial ESTs. This table summarizes putative functions of partial ESTs of G. lucidum fruiting body.