| Literature DB >> 34586388 |
Yuanchao Liu1,2,3, Longhua Huang2, Huiping Hu2, Manjun Cai2, Xiaowei Liang2, Xiangmin Li2, Zhi Zhang3, Yizhen Xie2,3, Chun Xiao2, Shaodan Chen2, Diling Chen2, Tianqiao Yong2, Honghui Pan2, Xiong Gao2, Qingping Wu1,2.
Abstract
Ganoderma leucocontextum, a newly discovered species of Ganodermataceae in China, has diverse pharmacological activities. Ganoderma leucocontextum was widely cultivated in southwest China, but the systematic genetic study has been impeded by the lack of a reference genome. Herein, we present the first whole-genome assembly of G. leucocontextum based on the Illumina and Nanopore platform from high-quality DNA extracted from a monokaryon strain (DH-8). The generated genome was 50.05 Mb in size with an N50 scaffold size of 3.06 Mb, 78,206 coding sequences, and 13,390 putative genes. Genome completeness was assessed using the Benchmarking Universal Single-Copy Orthologs (BUSCO) tool, which identified 96.55% of the 280 Fungi BUSCO genes. Furthermore, differences in functional genes of secondary metabolites (terpenoids) were analyzed between G. leucocontextum and Ganoderma lucidum. Ganoderma leucocontextum has more genes related to terpenoids synthesis compared to G. lucidum, which may be one of the reasons why they exhibit different biological activities. This is the first genome assembly and annotation for G. leucocontextum, which would enrich the toolbox for biological and genetic studies in G. leucocontextum.Entities:
Keywords: zzm321990 Ganoderma leucocontextumzzm321990 ; Illumina; Nanopore; genome; medicinal fungi; secondary metabolism; terpenoids
Mesh:
Substances:
Year: 2021 PMID: 34586388 PMCID: PMC8664445 DOI: 10.1093/g3journal/jkab337
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Fruiting bodies of G. leucocontextum. Scale bar: 1.0 cm.
The quality assessment for genome assembly of G. leucocontextum
| Quality assessment | Values |
|---|---|
| Library | 350 bp |
| Mapped (%) | 99.39 |
| Properly mapped (%) | 97.66 |
| Coverage (%) | 96.13 |
| Depth (X) | 74.36 |
| Complete BUSCOs (C) | 280 (96.55%) |
| Complete and single-copy BUSCOs (S) | 276 (95.17%) |
| Complete and duplicated BUSCOs (D) | 4 (1.38%) |
| Fragmented BUSCOs (F) | 1 (0.34%) |
| Missing BUSCOs (M) | 9 (3.10%) |
| Total lineage BUSCOs | 290 |
Represents Illumina sequencing library size.
Represents the percentage of clean reads mapped to the genome assembly of G. leucocontextum to all clean reads.
Represents the paired-end sequencing; sequences were all located on the genome assembly of G. leucocontextum and the distance was consistent with the length distribution of the sequenced fragments.
Represents genome coverage of data from Illumina sequence.
Represents genome coverage depth of data from Illumina sequence.
Represents the number and percentage of complete genes found in the database (contains 290 conserved core genes of fungi).
Represents the number and percentage of complete single-copy genes.
Represents the number and percentage of complete duplicated genes.
Represents the number of predictions for incomplete genes.
Represents the unpredicted number of genes.
Represents the number of conserved gene sets in fungi from the database of fungi_odb9.
Genomic comparison of important species of Ganoderma
|
| Strain | GenBank assembly accession | Genome size (Mb) | Number of scaffolds | GC% | Genome coverage | Assembly level | Contig N50/bp | Sequencing technology |
|---|---|---|---|---|---|---|---|---|---|
|
| HMGIM-I160015 | AHKGY000000000 | 50.05 | 58 | 55.85 | 288.16x | Scaffold | 3,064,430 | Illumina, Nanopore |
|
| BCRC 36111 | GCA_012655175.1 | 48.91 | 173 | 55.1 | 97.73x | Contig | 1,281,108 | PacBio |
|
| BCRC 37177 | GCA_000338035.1 | 44.08 | 3,275 | 55.5 | 824x | Contig | 63,041 | Illumina |
|
| G.260125-1 | GCA_000271565.1 | 43.29 | 82 | 56.1 | 440x | Scaffold | 649,708 | Illumina |
|
| Xiangnong No.1 | GCA_000262775.1 | 39.95 | 634 | 55.3 | 70x | Scaffold | 80,796 | Illumina |
|
| s90 | GCA_003057275.1 | 45.50 | 6,638 | — | 101.0x | Scaffold | 11,659 | Illumina HiSeq |
|
| ZZ0214-1 | GCA_002760635.1 | 48.96 | 69 | 55.6 | 500.0x | Scaffold | 753,893 | 454; Illumina HiSeq |
|
| BCRC 37180 | GCA_000338015.1 | 46.38 | 6,173 | 55.3 | 824x | Contig | 50,471 | Roche/454; Illumina/ABI |
|
| BRIUMSc | GCA_008694245.1 | 52.28 | 12,158 | 55.6 | 99.35x | Contig | 6,197 | Illumina |
|
| NJ3 | GCA_001855635.1 | 60.33 | 18,903 | 55.9 | 20.0x | Contig | 6,116 | Illumina HiSeq; 454 |
|
| G3 | GCA_002900995.2 | 79.19 | 495 | 55.9 | 50.0x | Contig | 272,644 | PacBio |
Data of G. leucocontextum were from this study; other data were from NCBI.
Gene information statistics of G. leucocontextum
| Gene statistics | Values |
|---|---|
| Gene number | 13,390 |
| CDs number | 78,206 |
| Exon number | 82,221 |
| Intron number | 68,831 |
| Gene length | 29,573,839 |
| CDs length | 19,331,970 |
| Exon length | 23,157,769 |
| Intron length | 6,416,070 |
| Average gene length | 2,208.65 |
| Average CDs length | 247.19 |
| Average exon length | 281.65 |
| Average intron length | 93.21 |
| Average CDs number | 5.84 |
| Average exon number | 6.14 |
| Average intron number | 5.14 |
Functional annotation of G. leucocontextum genes from public databases
| Public database | Number of genes | Percentage |
|---|---|---|
| GO | 3,110 | 24.4 |
| KEGG | 3,401 | 26.7 |
| KOG | 5,658 | 44.5 |
| Pfam | 7,894 | 62.0 |
| Swiss-Prot | 6,345 | 49.9 |
| TrEMBL | 12,566 | 98.8 |
| Nr | 12,703 | 99.8 |
| All annotated | 12,724 | 100.0 |
Figure 2Annotated result classification chart by KOG database.
Figure 3Annotated result classification chart by GO database.
Figure 4Annotated result classification chart by KEEG database.
Annotation results of carbohydrate-active enzymes
| Type | Number | Percentage |
|---|---|---|
| AAs | 84 | 13.68 |
| CBMs | 65 | 10.58 |
| CEs | 103 | 16.77 |
| GHs | 273 | 44.46 |
| GTs | 76 | 12.37 |
| PLs | 13 | 2.11 |
Figure 5The CAZymes annotation of G. leucocontextum.
The gene distribution of CAZymes in six different fungi
| Nutritional ecological type | Species | The proportion of match number and percentage | ||||||
|---|---|---|---|---|---|---|---|---|
| GHs | GTs | PLs | CEs | CBMs | AAs | Totals | ||
| Wood-rotting fungi |
| 273 (44.46%) | 76 (12.37%) | 13 (2.11%) | 103 (16.77%) | 65 (10.58%) | 84 (13.68%) | 614 |
| Wood-rotting fungi |
| 288 (58.90) | 70 (14.31%) | 10 (2.04%) | 30 (6.13) | 53 (10.84) | 38 (7.77%) | 489 |
| Wood-rotting fungi |
| 106 (31.55%) | 29 (8.63%) | 14 (4.17%) | 22 (6.55%) | 103 (30.65%) | 66 (18.45%) | 340 |
| Wood-rotting fungi |
| 161 (47.21%) | 59 (17.30%) | 7 (2.05%) | 26 (7.62%) | 4 (1.17%) | 84 (24.63%) | 341 |
| Straw-rotting fungi |
| 159 (47.60%) | 41 (12.28%) | 20 (5.99%) | 13 (3.89%) | 57 (17.07%) | 44 (13.17%) | 334 |
| Mycorrhizal fungi |
| 112 (51.85%) | 41 (18.98) | 5 (2.31) | 5 (2.31%) | 21 (9.72%) | 32 (14.81%) | 216 |
| Entomogenous fungi |
| 159 (50.15%) | 84 (26.49%) | 4 (1.26%) | 13 (4.10%) | 2 (0.63%) | 55 (17.35%) | 317 |
Genes of ganoderic acids and ergosterol biosynthesis in G. leucocontextum and G. lucidum
| Secondary metabolite | Gene name | Gene number | |
|---|---|---|---|
|
|
| ||
| Ganoderic acids | AACT-1 | GL23502 | EVM0010239.1 |
| AACT-2 | GL26574 | EVM0000725.1 | |
| FPS-1 | GL22068 | EVM0004420.2 | |
| FPS-2 | GL25499 | EVM0009008.1 | |
| HMGR | GL24088 | EVM0008852.1 | |
| HMGS | GL24922 | EVM0003163.1 | |
| IDI | GL29704 | EVM0010983.1 | |
| LSS | GL18675 | EVM0005702.1 | |
| MVD | GL25304 | EVM0004601.1 | |
| MVK | GL17879 | EVM0008302.1 | |
| PMVK | GL17808 | EVM0009209.1 | |
| SE | GL23376 | EVM0008824.1 | |
| SQS | GL21690 | EVM0010547.1 | |
| Ergosterol | ERG11-1 | GL26139 | EVM0001186.1 |
| ERG11-2 | GL22375 | / | |
| ERG2 | GL22516 | EVM0010304.1 | |
| ERG24 | GL23832 | EVM0011570.1 | |
| ERG25 | GL23074 | EVM0000338.1 | |
| ERG26 | GL16838 | EVM0012405.1 | |
| ERG27 | GL22371 | EVM0011171.1 | |
| ERG3 | GL26052 | EVM0000544.3 | |
| ERG4 | GL21870 | EVM0000917.1 | |
| ERG5 | GL30444 | EVM0010271.1 | |
| ERG6 | GL18323 | EVM0002811.1 | |
Figure 6Triterpenoid and ergosterol biosynthesis pathways of G. leucocontextum.
Pathway of metabolism of terpenoids and polyketides in G. leucocontextum and G. lucidum
| Pathway of metabolism of terpenoids and polyketides | Gene name | Definition | KO term | EC number |
|
|
|---|---|---|---|---|---|---|
| Terpenoid backbone biosynthesis/map00900 | ACAT, atoB | Acetyl-CoA C-acetyltransferase | K00626 | EC:2.3.1.9 | Present | Present |
| E2.3.3.10 | Hydroxymethylglutaryl-CoA synthase | K01641 | EC:2.3.3.10 | Present | Present | |
| HMGCR | Hydroxymethylglutaryl-CoA reductase (NADPH) | K00021 | EC:1.1.1.34 | Present | Present | |
| E2.7.4.2, mvaK2 | Phosphomevalonate kinase | K00938 | EC:2.7.4.2 | Present | Absent | |
| MVD, mvaD | Diphosphomevalonate decarboxylase | K01597 | EC:4.1.1.33 | Present | Present | |
| idi, IDI | Isopentenyl-diphosphate Delta-isomerase | K01823 | EC:5.3.3.2 | Present | Absent | |
| FDPS | Farnesyl-diphosphate synthase | K00787 | EC:2.5.1.1, EC:2.5.1.10 | Present | Present | |
| PCYOX1, FCLY | Prenylcysteine oxidase/farnesylcysteine lyase | K05906 | EC:1.8.3.5, EC:1.8.3.6 | Present | Present | |
| ICMT, STE14 | Protein- | K00587 | EC:2.1.1.100 | Present | Present | |
| STE24 | STE24 endopeptidase | K06013 | EC:3.4.24.84 | Present | Absent | |
| RCE1, FACE2 | Prenyl protein peptidase | K08658 | EC:3.4.22.- | Present | Absent | |
| FNTB | Protein farnesyltransferase subunit beta | K05954 | EC:2.5.1.58 | Present | Present | |
| DHDDS, RER2, SRT1 | Ditrans, polycis-polyprenyl diphosphate synthase | K11778 | EC:2.5.1.87 | Present | Absent | |
| GGPS1 | Geranylgeranyl diphosphate synthase, type III | K00804 | EC:2.5.1.29 | Present | Present | |
| hexPS, COQ1 | Hexaprenyl-diphosphate synthase | K05355 | EC:2.5.1.82, EC:2.5.1.83 | Present | Present | |
| Sesquiterpenoid and triterpenoid biosynthesis/map00909 | FDFT1 | Farnesyl-diphosphate farnesyltransferase | K00801 | EC:2.5.1.21 | Present | Absent |
| SQLE, ERG1 | Squalene monooxygenase | K00511 | EC:1.14.14.17 | Present | Absent |
Figure 7Enzyme in G. leucocontextum and G. lucidum involved in the pathway of Sesquiterpenoid and triterpenoid biosynthesis (map00909, A) and Terpenoid backbone biosynthesis (map00900, B). The colored box indicated existing homologous genes of the enzyme; green represent G. leucocontextum, red represent G. lucidum, white box means not, the same below.