| Literature DB >> 31597238 |
Young-Jin Park1, Chang-Soo Lee2, Won-Sik Kong3.
Abstract
Next-generation sequencing (NGS) of the Flammulina rossica (wood-rotting basidiomycete) genome was performed to identify its carbohydrate-active enzymes (CAZymes). De novo genome assembly (31 kmer) revealed a total length of 35,646,506 bp (49.79% GC content). In total, 12,588 gene models of F. rossica were predicted using an ab initio gene prediction tool (AUGUSTUS). Orthologous analysis with other fungal species revealed that 7433 groups contained at least one F. rossica gene. Additionally, 12,033 (95.6%) of 12,588 genes for F. rossica proteins had orthologs among the Dikarya, and F. rossica contained 12 species-specific genes. CAZyme annotation in the F. rossica genome revealed 511 genes predicted to encode CAZymes including 102 auxiliary activities, 236 glycoside hydrolases, 94 glycosyltransferases, 19 polysaccharide lyases, 56 carbohydrate esterases, and 21 carbohydrate binding-modules. Among the 511 genes, several genes were predicted to simultaneously encode two different CAZymes such as glycoside hydrolases (GH) as well as carbohydrate-binding module (CBM). The genome information of F. rossica offers opportunities to understand the wood-degrading machinery of this fungus and will be useful for biotechnological and industrial applications.Entities:
Keywords: Flammulina rossica; basidiomycete; carbohydrate active enzyme; fungi; genome
Year: 2019 PMID: 31597238 PMCID: PMC6843371 DOI: 10.3390/microorganisms7100421
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Flammulina rossica Genome Sequencing Statistics.
|
| Total reads (100 bp) | 38,390,380 |
| Reads after trimming (%), >Q30 | 35,908,618 (93.53) | |
|
| Optimized Velvet hash value (kmer) | 31 |
| Total number of contigs | 15,546 | |
| Number of contigs (>1 kb) | 1843 | |
| Contig N50 (bp) | 48,718 | |
| Length of longest contig (bp) | 418,162 | |
| Total bases in contigs (bp) | 35,645,506 | |
| Total bases in contigs (>1 kb) | 33,499,878 | |
| GC content (%) | 49.79 | |
|
| Predicted gene | 12,588 |
| Average gene length (bp) | 1911 | |
| Average protein length (aa) | 510.29 | |
| Average exon per gene | 6.52 | |
| Average exon size (bp) | 234.67 | |
| Average intron size (bp) | 68.03 |
Comparison of the Genome Characteristics of Flammulina rossica and other Basidiomycetes.
|
|
|
|
|
|
|
|
|
|
|
|
| ASI4194 | KACC46186 | KACC46185 | KACC46182 | KACC42780 | S238N-H82 | Okayama7#130 | RP78 | H4-8 |
|
| 35.6 | 34.5 | 32.4 | 35 | 35.6 | 64.9 | 37.5 | 35.1 | 38.5 |
|
| 12,588 | 12,269 | 11,591 | 12,536 | 12,218 | 20,614 | 13,544 | 10,048 | 13,181 |
|
| 49.79 | 49.76 | 39 | 49.7 | 48.99 | 46.6 | 51.6 | 53.2 | 56.6 |
|
| 1911 | 2009 | 1980 | 1973 | 2294 | 1533.0 | 1679.0 | 1667.0 | 1794.9 |
|
| 234.67 | 234.09 | 230.53 | 233.91 | 231.4 | 210.1 | 251.0 | 232.0 | 249.3 |
|
| 68.03 | 68.14 | 68.93 | 69.29 | 190.3 | 92.7 | 75.0 | 117.0 | 79.0 |
Ortholog analysis of Flammulina rossica and other fungal species.
| Fungal Species | Genes | Genes in Orthogroups (%) | Unassigned Genes (%) | Orthogroups Containing Species (%) | Number of Species-Specific Orthogroups | Genes in Species-Specific Orthogroups |
|---|---|---|---|---|---|---|
| 12,588 | 12,024 (95.5) | 564 (4.5) | 7485 (58.7) | 5 | 12 | |
| 11,591 | 11,235 (96.9) | 356 (3.1) | 7307 (57.3) | 2 | 4 | |
| 12,269 | 11,881 (96.8) | 388 (3.2) | 7572 (59.4) | 1 | 2 | |
| 12,536 | 11,987 (95.6) | 549 (4.4) | 7592 (59.5) | 1 | 2 | |
| 12,218 | 10,774 (88.2) | 1444 (11.8) | 6679 (52.4) | 4 | 10 | |
| 10,448 | 9102 (87.1) | 1346 (12.9) | 5712 (44.8) | 29 | 265 | |
| 13,356 | 10,837 (81.1) | 2519 (18.9) | 6393 (50.1) | 59 | 329 | |
| 7826 | 6477 (82.8) | 1349 (17.2) | 4719 (37.0) | 39 | 129 | |
| 18,215 | 12,662 (69.5) | 5553 (30.5) | 6217 (48.7) | 73 | 524 | |
| 14,079 | 11,678 (82.9) | 2401 (17.1) | 6674 (52.3) | 61 | 340 | |
| 13,602 | 10,443 (76.8) | 3159 (23.2) | 6223 (48.8) | 36 | 265 | |
| 13,194 | 10,976 (83.2) | 2218 (16.8) | 6214 (48.7) | 59 | 437 | |
| 9561 | 8433 (88.2) | 1128 (11.8) | 5709 (44.8) | 4 | 24 | |
| 16,389 | 9336 (57.0) | 7053 (43.0) | 6263 (49.1) | 17 | 64 | |
| 9651 | 8369 (86.7) | 1282 (13.3) | 6131 (48.1) | 5 | 13 | |
| 10,812 | 8659 (80.1) | 2153 (19.9) | 6217 (48.7) | 10 | 26 | |
| 6002 | 4624 (77.0) | 1378 (23.0) | 3485 (27.3) | 12 | 33 | |
| 9115 | 8332 (91.4) | 783 (8.6) | 6283 (49.3) | 4 | 9 |
* Basidiomycota, ** Ascomycota.
Figure 1Phylogenetic tree of fungal species based on ortholog clustering.
Figure 2Identification and annotation of carbohydrate-active enzyme genes in the F. rossica genome based on three different databases including the HMMER (dbCAN carbohydrate-active enzymes domain HMM database), DIAMOND (CAZy database), and Hotpep (short conserved motifs in the (PRR library database) [32].
CAZymes of Flammulina rossica and other fungal species.
| Fungal Species | CAZymes | No. of CAZyme (Annotation DB) | Total | Reference | |||||
|---|---|---|---|---|---|---|---|---|---|
| AA 1 | GH 2 | GT 3 | CE 4 | CBM 5 | PL 6 | ||||
| 93 | 182 | 64 | 56 | 7 | 17 | 419 (Hmmer dbCAN) | 528 | This study | |
| 61 | 145 | 53 | 21 | 5 | 15 | 300 (Hotpep) | |||
| 39 | 155 | 72 | 11 | 11 | 6 | 294 (CAZy database) | |||
| 86 | 220 | 85 | 57 | 45 | 20 | - | 513 | [ | |
| 87 | 228 | 87 | 61 | 40 | 21 | - | 524 | [ | |
| 82 | 218 | 89 | 59 | 42 | 18 | - | 508 | [ | |
| 85 | 239 | 84 | 63 | 44 | 25 | - | 540 | [ | |
| 81 | 174 | 54 | 33 | 44 | 9 | - | 395 | JGI database | |
| 111 | 195 | 83 | 60 | 105 | 16 | - | 570 | [ | |
| 55 | 170 | 96 | 18 | 31 | 7 | - | 377 | JGI database | |
| 82 | 254 | 85 | 44 | 61 | 11 | - | 537 | [ | |
| 85 | 175 | 65 | 16 | 62 | 4 | - | 407 | JGI database | |
| 78 | 241 | 85 | 57 | 37 | 18 | - | 516 | [ | |
| 14 | 97 | 70 | 5 | 12 | 4 | - | 202 | CAZy database | |
| 54 | 165 | 91 | 34 | 39 | 5 | - | 388 | [ | |
| 59 | 210 | 90 | 32 | 44 | 5 | - | 440 | [ | |
| 5 | 57 | 68 | 2 | 12 | 0 | - | 144 | CAZy database | |
| 33 | 267 | 91 | 30 | 46 | 23 | - | 490 | CAZy database | |
| 35 | 177 | 80 | 21 | 42 | 4 | - | 359 | CAZy database | |
| 77 | 287 | 119 | 37 | 89 | 10 | - | 619 | CAZy database | |
* Basidiomycota, ** Ascomycota, 1 auxiliary activitie (AA), 2 glycoside hydrolase (GH), 3 glycosyltransferase (GT), 4 carbohydrate esterase (CE), 5 carbohydrate-binding module (CBM), 6 polysaccharide lyase (PL).
Figure 3Number of CAZymes in F. rossica. Number of (A) GT families; (B) CE families; (C) GH families; (D) PL families; (E) AA families; (F) CBM families. AA, Auxiliary Activities; GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrates-binding module; PL, polysaccharide lyase.
Figure 4Distribution of CAZymes in F. rossica and other fungal species. (A) GT families; (B) CE families; (C) GH families; (D) PL families; (E) AA families; (F) CBM families.