| Literature DB >> 31138126 |
Mari Mäkinen1,2, Jaana Kuuskeri1, Pia Laine3, Olli-Pekka Smolander3,4, Andriy Kovalchuk5, Zhen Zeng5, Fred O Asiegbu5, Lars Paulin3, Petri Auvinen3, Taina Lundell6.
Abstract
BACKGROUND: The white rot fungus Phlebia radiata, a type species of the genus Phlebia, is an efficient decomposer of plant cell wall polysaccharides, modifier of softwood and hardwood lignin, and is able to produce ethanol from various waste lignocellulose substrates. Thus, P. radiata is a promising organism for biotechnological applications aiming at sustainable utilization of plant biomass. Here we report the genome sequence of P. radiata isolate 79 originally isolated from decayed alder wood in South Finland. To better understand the evolution of wood decay mechanisms in this fungus and the Polyporales phlebioid clade, gene content and clustering of genes encoding specific carbohydrate-active enzymes (CAZymes) in seven closely related fungal species was investigated. In addition, other genes encoding proteins reflecting the fungal lifestyle including peptidases, transporters, small secreted proteins and genes involved in secondary metabolism were identified in the genome assembly of P. radiata.Entities:
Keywords: ABC transporters; Phlebia radiata; carbohydrate-active enzyme genes; co-regulation; comparative genomics; lignin biodegradation; peptidases; secondary metabolism; small secreted proteins; wood decay
Mesh:
Substances:
Year: 2019 PMID: 31138126 PMCID: PMC6540522 DOI: 10.1186/s12864-019-5817-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the genome assembly features of the phlebioid fungal genomes included in the comparative study
| Genome features |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Assembly size (Mbp) | 40.41 | 49.96 | 34.85 | 42.73 | 30.14 | 46.29 | 35.15 |
| # of contigs | 92 | 3178 | 3667 | 1263 | 1195 | 2272 | 1253 |
| # of scaffolds | 92 | 1645 | 3022 | 508 | 573 | 1137 | 232 |
| # of scaffolds ≥ 2 Kbp | 92 | 1645 | 1652 | 508 | 506 | 1136 | 204 |
| Scaffold L50 | 8 | 12 | 201 | 13 | 72 | 6 | 8 |
| Scaffold N50 (Mbp) | 1.98 | 1.31 | 0.05 | 1.03 | 0.12 | 3.53 | 1.91 |
| # of gaps | 0 | 1533 | 645 | 755 | 622 | 1135 | 1021 |
| Average gene lenght (bp) | 1847 | 1627 | 1378 | 1703 | 1714 | 1765 | 1684 |
| Average transcript length (bp) | 1463 | 1274 | 1134 | 1388 | 1380 | 1446 | 1401 |
| Average exon length (bp) | 227 | 225 | 274 | 248 | 230 | 269 | 259 |
| Average intron length (bp) | 72 | 78 | 80 | 71 | 69 | 75 | 66 |
| Average protein length (bp) | 488 | 400 | 378 | 406 | 411 | 384 | 408 |
| Exons per gene | 6.46 | 5.66 | 4.14 | 5.59 | 6 | 5.36 | 5.41 |
| # of gene models | 14629 | 16170 | 13785 | 15473 | 11891 | 13937 | 13602 |
Data were collected from the JGI MycoCosm (genome.jgi.doe.gov/programs/fungi/index.jsf)
Fig. 1Gene Ontology classification of P. radiata proteins. Functional classification by GO categories (a) and GO terms (b)
Number of the identified CAZyme-encoding genes involved in breakdown of plant cell wall lignocellulose components in the genomes of the studied phlebioid fungi
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| GH1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| GH3 | 10 | 8 | 7 | 9 | 9 | 11 | 10 |
| GH5_5 | 4 | 4 | 2 | 4 | 4 | 6 | 2 |
| GH5_22 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| GH6 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| GH7 | 6 | 4 | 3 | 5 | 5 | 5 | 8 |
| GH9 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| GH12 | 2 | 2 | 2 | 2 | 3 | 3 | 2 |
| GH44 | 1 | 2 | 1 | 0 | 0 | 0 | 0 |
| GH45 | 3 | 3 | 1 | 1 | 1 | 1 | 2 |
| GH131 | 2 | 4 | 2 | 3 | 2 | 2 | 3 |
|
| |||||||
| CE1 | 2 | 1 | 1 | 1 | 2 | 3 | 4 |
| CE5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| CE15 | 2 | 2 | 6 | 2 | 1 | 3 | 2 |
| CE16 | 10 | 8 | 6 | 15 | 6 | 5 | 7 |
| GH2 | 3 | 2 | 3 | 3 | 3 | 2 | 2 |
| GH5_7 | 3 | 2 | 3 | 2 | 2 | 2 | 3 |
| GH10 | 7 | 8 | 6 | 4 | 4 | 5 | 6 |
| GH11 | 1 | 0 | 0 | 0 | 2 | 1 | 1 |
| GH27 | 2 | 2 | 2 | 3 | 3 | 3 | 3 |
| GH29 | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| GH31 | 4 | 5 | 6 | 4 | 6 | 8 | 6 |
| GH35 | 3 | 4 | 3 | 4 | 2 | 4 | 3 |
| GH43 | 2 | 2 | 2 | 6 | 7 | 4 | 4 |
| GH51 | 1 | 1 | 4 | 2 | 2 | 2 | 2 |
| GH74 | 2 | 1 | 1 | 2 | 2 | 2 | 4 |
| GH95 | 1 | 3 | 1 | 1 | 0 | 1 | 1 |
| GH115 | 1 | 2 | 2 | 2 | 1 | 1 | 1 |
|
| |||||||
| CE8 | 2 | 3 | 1 | 2 | 4 | 2 | 2 |
| CE12 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| GH28 | 7 | 5 | 5 | 6 | 10 | 4 | 5 |
| GH53 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| GH78 | 2 | 1 | 2 | 2 | 1 | 1 | 1 |
| GH88 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| GH105 | 3 | 0 | 2 | 1 | 0 | 0 | 0 |
| PL1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| PL4 | 1 | 1 | 1 | 1 | 0 | 0 | 0 |
|
| |||||||
| AA1 | 7 | 11 | 6 | 2 | 5 | 10 | 5 |
| AA2 | 10 | 15 | 8 | 21 | 9 | 11 | 15 |
| AA3 | 32 | 39 | 23 | 39 | 23 | 36 | 38 |
| AA3_1 (CDH) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| AA5 | 10 | 8 | 6 | 7 | 6 | 6 | 7 |
| AA9 | 12 | 12 | 11 | 28 | 15 | 11 | 16 |
| AA14 | 3 | 2 | 2 | 3 | 3 | 2 | 2 |
| DyP | 1 | 3 | 2 | 10 | 4 | 1 | 0 |
| HTP | 5 | 2 | 4 | 4 | 4 | 3 | 3 |
CAZy GH, CE, PL and AA classes, dye-decolorizing peroxidases (DyP), and heme-thiolate peroxidases (HTP) are indicated
Fig. 2Number of cellulose decomposing-enzyme coding genes in the genomes of the phlebioid fungi
Fig. 3Number of genes encoding hemicellulose-decomposing enzymes in the genomes of the phlebioid fungi
Fig. 4Number of pectin-decomposing enzyme coding genes in the genomes of the phlebioid fungi
Fig. 5Number of lignin-modifying and auxiliary oxidoreductase-coding genes in the genomes of the phlebioid fungi
Fig. 6Clustering and co-regulation of GH10 endo-1,4-β-xylanase (a) and AA9 LPMO encoding genes (b, c) of P. radiata. Intensity of the color indicates the scale of up- or down-regulation (dark red for high up-regulation, green for down-regulation, yellow for no significant difference) of gene expression on spruce wood substrate [26]. Direction of the arrow indicates gene location and orientation (on plus or minus strand) in the unitigs of the genome assembly (this study). The arrowhead indicates the 2-week time point of spruce wood cultivation whereas the end of the arrow refers to the 4-week time point. Beginning and end of the cluster (as genomic coordinates) is marked above the unitig. Gene model minus.g10275 in unitig 83 encodes a non-CAZyme protein
Genomic clusters of GH10 endoxylanase encoding genes in the genome assemblies of the phlebioid fungi
| Fungus | Cluster location in the genome assembly | Number of genes in the cluster | Number of non-CAZyme genes in the cluster |
|---|---|---|---|
|
| Unitig 9/Scaffold 8:672573-681272 | 4 | 0 |
|
| Scaffold 12:834597-856661 | 9 | 3 |
|
| Scaffold 2925:6494-13943 | 3 | 0 |
|
| absent | - | - |
|
| absent | - | - |
|
| Scaffold 3:3347462-3351815 | 3 | 1 |
|
| absent | - | - |
Locations of P. radiata gene models are indicated as original assembly unitig positions, and as scaffold positions of the MycoCosm genome repository. Other data were retrieved from the MycoCosm
Genomic clusters of AA9 encoding genes in the genome assemblies of the seven phlebioid fungi
| Fungus | Cluster location in the genome assembly | Number of genes in the cluster | Number of non-CAZyme genes in the cluster |
|---|---|---|---|
|
| Unitig 6/Scaffold 23:237827-242972 | 2 | 0 |
| Unitig 9/Scaffold 8:1780628-1784094 | 2 | 0 | |
| Unitig 83/Scaffold 30:124277-133397 | 4 | 1 | |
|
| Scaffold 7:1939036-1942768 | 2 | 0 |
| Scaffold 8:1713073-1722627 | 4 | 1 | |
| Scaffold 9:703327-712085 | 3 | 1 | |
|
| Scaffold 809:16725-20452 | 2 | 0 |
|
| Scaffold 3:699996-705149 | 3 | 0 |
| Scaffold 12:565705-571395 | 3 | 0 | |
| Scaffold 19:267134-284710 | 9 | 2 | |
| Scaffold 35:28836-31774 | 2 | 0 | |
|
| Scaffold 13:72697-82083 | 4 | 2 |
| Scaffold 167:19857-23789 | 2 | 0 | |
| Scaffold 240:27459-32596 | 3 | 0 | |
|
| Scaffold 3:465671-473281 | 3 | 0 |
| Scaffold 10:958421-970775 | 4 | 2 | |
|
| Scaffold 7:1587181-1592893 | 4 | 1 |
| Scaffold 10:522449-536923 | 7 | 4 |
Locations of P. radiata gene models are indicated as original assembly unitig positions, and as scaffold positions of the MycoCosm genome repository. Other data were retrieved from the MycoCosm
Number of peptidase encoding genes identified in the genomes of the seven phlebioid fungi
| Peptidase family Fungal species | Aspartic | Cysteine | Glutamic | Metallo | Serine | Threonine | Sum of peptidases |
|---|---|---|---|---|---|---|---|
|
| 48 | 50 | 2 | 60 | 149 | 18 | 327 |
|
| 67 | 59 | 0 | 73 | 207 | 19 | 425 |
|
| 58 | 45 | 0 | 51 | 138 | 18 | 310 |
|
| 60 | 69 | 0 | 68 | 173 | 18 | 388 |
|
| 43 | 51 | 15 | 62 | 159 | 18 | 348 |
|
| 70 | 91 | 34 | 61 | 170 | 17 | 443 |
|
| 64 | 45 | 11 | 55 | 149 | 19 | 343 |
Data was searched and retrieved from the MycoCosm
Number of ABC transporter encoding genes identified in the genomes of the seven phlebioid fungi
| Fungal species / ABC Subfamily |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| ABC-A | 1 | 1 | 1 | 0 | 0 | 0 | 0 |
| ABC-B (FL) | 3 | 3 | 4 | 4 | 3 | 3 | 3 |
| ABC-B (HT) | 8 | 6 | 6 | 8 | 9 | 7 | 8 |
| ABC-C | 18 | 26 | 20 | 13 | 19 | 18 | 18 |
| ABC-D | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| ABC-E | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| ABC-F | 5 | 5 | 5 | 4 | 5 | 5 | 5 |
| ABC-G | 8 | 8 | 7 | 7 | 6 | 8 | 10 |
| ABC-I | 3 | 3 | 3 | 3 | 3 | 3 | 3 |
Data was searched and retrieved from the MycoCosm
Fig. 7Venn diagram illustrating the number of shared SSP candidates of P. radiata. The SSP genes were computationally analysed by using EffectorP and LOCALIZER and according to the criteria of being small and cysteine-rich
Summary of the SSP candidates of P. radiata that fulfilled the three criteria used in the analyses
| Protein coding gene ID | Length | Number of cysteines | EffectorPa | LOCALIZERb |
|---|---|---|---|---|
| minus.g2681 | 265 | 10 | 0.610 | NLS |
| minus.g6180 | 200 | 6 | 0.669 | 0.998, C |
| plus.g1485 | 119 | 11 | 0.916 | NLS |
| plus.g2909 | 150 | 8 | 0.676 | 0.998, C |
| plus.g2957 | 146 | 8 | 0.570 | 0.896, C |
| plus.g7370 | 229 | 19 | 0.767 | 0.716, C |
| plus.g8107 | 158 | 8 | 0.747 | 0.971, C; 0.824, M |
| plus.g881 | 179 | 4 | 0.781 | 0.993, C; 0.995, M |
aThe number is the probability of being an effector given by EffectorP
bThe number is the probability of having a Chloroplast (C) transit peptide-like or Mitochondria (M) transit peptide-like sequences given by LOCALIZER; for the cases of searching for nuclear localization signals (NLS), no probability was returned