| Literature DB >> 29580224 |
Zhen Zeng1, Hui Sun1,2, Eeva J Vainio3, Tommaso Raffaello1, Andriy Kovalchuk1, Emmanuelle Morin4, Sébastien Duplessis4,5, Fred O Asiegbu6.
Abstract
BACKGROUND: Heterobasidion parviporum is an economically most important fungal forest pathogen in northern Europe, causing root and butt rot disease of Norway spruce (Picea abies (L.) Karst.). The mechanisms underlying the pathogenesis and virulence of this species remain elusive. No reference genome to facilitate functional analysis is available for this species.Entities:
Keywords: Comparative genomics; CpG-biased mutation; Heterobasidion parviporum; Oxidation-reduction process; Saprotrophic wood decay; Secreted proteins; Transcription factors; Virulence factors
Mesh:
Substances:
Year: 2018 PMID: 29580224 PMCID: PMC5870257 DOI: 10.1186/s12864-018-4610-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of H. parviporum isolates, mapping and variants relative to the reference isolate S15
| Isolate | Namea | Origins | HAMBI/FBCC number | Mapped reads | Reads depth (x) | Reads coverage (%) | SNPs | InDels |
|---|---|---|---|---|---|---|---|---|
| 03020 | S1 | Pernaja, Horslök | 2362 | 20,338,386 | 47.76 | 93.52 | 146,387 | 34,661 |
| 04121 | S2 | Artjärvi | 2365 | 20,914,722 | 49.13 | 92.97 | 140,039 | 34,113 |
| 93242 | S3 | Siilinjärvi | 2366 | 21,352,488 | 50.13 | 93.80 | 138,032 | 34,749 |
| 04051 | S4 | Mäntsälä | 2368 | 19,057,716 | 44.78 | 93.48 | 136,927 | 33,259 |
| 99055 | S5 | Kolari, Ylläs | 2361 | 18,024,412 | 42.34 | 93.60 | 144,819 | 34,027 |
| 91271 | S6 | Karkkila | 2363 | 20,342,018 | 47.78 | 95.21 | 94,638 | 29,362 |
| 99058 | S7 | Kittilä, Kukasjärvi | 2354 | 20,662,349 | 48.54 | 92.44 | 141,327 | 32,800 |
| 01039 | S8 | Hattula, Korkee | 2369 | 18,110,965 | 42.55 | 93.48 | 150,875 | 34,805 |
| 96160 | S9 | Askola | 2355 | 19,847,739 | 46.58 | 93.07 | 135,994 | 32,959 |
| 05029 | S10 | Kirkkonummi, Yövilä | 2364 | 19,240,310 | 45.20 | 93.86 | 143,212 | 33,761 |
| 99067 | S11 | Loppi, Launonen | 2360 | 21,492,624 | 50.51 | 93.13 | 143,010 | 34,019 |
| 94174 | S12 | Loimaa, Köyliön kylä | 2358 | 17,832,237 | 41.90 | 92.72 | 140,559 | 32,891 |
| 98038 | S13 | Suomusjärvi, Kettula | 2370 | 21,039,687 | 49.36 | 93.06 | 153,255 | 35,875 |
| 03014 | S14 | Kuhmoinen | 2367 | 19,767,540 | 46.42 | 92.79 | 143,762 | 33,717 |
| 96026 | S15 | Åland Islands | 2359 | – | – | – | – | – |
aReferred name in this study
Fig. 1Sampling sites of the 15 H. parviporum isolates used in this study in Finland. X- and y-axes are longitude and latitude, respectively
Fig. 2Phenotypic characterization of 15 H. parviporum isolates and virulence validation of two selected isolates. a Mortality rate of Norway spruce seedlings infected by 15 Heterobasidion parviporum isolates and the control at 15 and 25 dpi. b Percentage of wood dry mass losses caused by 15 H. parviporum isolates and the control. c Norway spruce clone C30560 infected by S15 and S12 as well as wounded control in phloem and xylem in virulence validation. Black line is 10 mm scale bar
Summary of genome assembly and annotation in isolate S15
| Feature | Value |
|---|---|
| Scaffold numbera | 287 |
| Total scaffold length (Mb)a | 37.76 |
| Scaffold N50 (Kb) | 630.6 |
| Scaffold N90 (Kb) | 89.4 |
| Maximum scaffold length (Mb) | 3.49 |
| Minimum scaffold length (Kb) | 4.1 |
| GC content (%)a | 53.00 |
| Number of predicted genesa | 10,502 |
| Number of genes with RNA-seq supportb | 8859 |
| Mean protein length (aa) | 479 |
| Total gene length (Mb) | 22.35 |
| Number of genes with Blast hits | 9789 |
| Number of genes with InterProScan hits | 7941 |
| Number of genes with GO annotations | 6977 |
| Benchmarking Universal Single-Copy Orthologs (BUSCOs) assessmentc | C: 92.6% (1332), D: 11% (158), F: 6.2% (89), M: 1.2% (17) |
aThe corresponding genome information of H. irregulare TC32–1 were 15 scaffolds, 33.6 Mb, GC content of 52.23% and 13,405 genes predicted
bBased on alignment with constructed transcripts by Trinity v2.3.2 (E-value <1e-5)
cC: complete (including complete single copy and duplicated BUSCOs), D: duplicated, F: fragmented, M: missing
Fig. 3The Heterobasidion parviporum S15 genome. The 287 scaffolds were arranged clockwise. The largest 40 scaffolds were labeled. Each circle from the outside to the inside depicts: (a) GC content (red > 0.65, green < 0.45); (b) Transposable elements density; (c) Gene density; (d) to (h) Secreted CAZymes, peptidase and peptidase inhibitors, peroxidases, cytochrome P450 and effector homologs of PHI-base, respectively; (i) Genome-wide nucleotide diversity, color coded from blue to red representing values from low to high; (j) SNPs density, color coded from blue to red representing values from low to high; (k) InDels density
Summary of assembly of re-sequenced isolates and alignments to S15 genome
| Isolates | Contig number | N50 (Kb) | Assembly size (Mb) | GC content (%) | Aligned length (Mb) | Aligned sequence (%) | Sequence identity (%) | |
|---|---|---|---|---|---|---|---|---|
| S1 | 6116 | 50.39 | 33.12 | 52.87 | 0.26 | 30.61 | 92.42 | 96.46 |
| S2 | 5800 | 55.67 | 33.02 | 52.88 | 0.24 | 30.21 | 91.49 | 96.45 |
| S3 | 6156 | 51.20 | 33.09 | 52.91 | 0.21 | 30.58 | 92.42 | 96.60 |
| S4 | 6328 | 52.94 | 33.30 | 52.89 | 0.26 | 30.55 | 91.75 | 96.66 |
| S5 | 6499 | 49.51 | 33.19 | 52.86 | 0.29 | 30.49 | 91.85 | 96.55 |
| S6 | 19,952 | 44.43 | 39.63 | 52.96 | 1.01 | 30.25 | 76.33 | 97.27 |
| S7 | 6016 | 45.83 | 32.93 | 52.88 | 0.22 | 30.25 | 91.87 | 96.58 |
| S8 | 6096 | 47.81 | 33.02 | 52.86 | 0.23 | 30.41 | 92.09 | 96.52 |
| S9 | 5838 | 52.80 | 32.95 | 52.88 | 0.21 | 30.65 | 93.01 | 96.62 |
| S10 | 5892 | 52.53 | 33.16 | 52.89 | 0.22 | 30.68 | 92.54 | 96.69 |
| S11 | 5599 | 55.60 | 32.81 | 52.92 | 0.20 | 30.40 | 92.64 | 96.59 |
| S12 | 5883 | 53.90 | 32.76 | 52.94 | 0.21 | 30.42 | 92.85 | 96.70 |
| S13 | 6086 | 47.96 | 33.28 | 52.86 | 0.24 | 30.39 | 91.33 | 96.24 |
| S14 | 5631 | 49.64 | 32.90 | 52.85 | 0.19 | 30.21 | 91.81 | 96.43 |
Secreted protein located within outlier window with negative Tajima’s D value (D < −1.9365)
| Gene ID | Brief description | InterPro accession | InterPro name | PHI-hita |
|---|---|---|---|---|
| evm.scaffold1.1114 | CE8 | IPR000070; | Pectinesterase, catalytic; | – |
| evm.scaffold13.112 | CBM1 | IPR000254 | Cellulose-binding domain, fungal | Yesb |
| evm.scaffold35.37 | CBM50 | IPR018392 | 2 LysM domains | |
| evm.scaffold67.10 | Laccase | IPR001117; | Multicopper oxidase, type 1; | Yes |
| evm.scaffold7.263 | AA9 | IPR005103 | Glycoside hydrolase, family 61 | Yesb |
| evm.scaffold15.16 | GH10 | IPR001000 | Glycoside hydrolase family 10 domain; | Yesb |
| evm.scaffold8.166 | GH76 | IPR005198; | Glycoside hydrolase, family 76; | – |
| evm.scaffold7.23 | GH92 | IPR012939; | Glycosyl hydrolase family 92; | – |
| evm.scaffold38.42 | GH92 | IPR012939; | Glycosyl hydrolase family 92; | – |
| evm.scaffold2.822 | Thioredoxin protein disulfide isomerase (TrxM) | IPR013766 | Thioredoxin domain | Yes |
| evm.scaffold7.192 | Chloroperoxidase (HalPrx) | IPR000028 | Chloroperoxidase | Yes |
| evm.scaffold20.57 | Phosphatidylethanolamine binding protein (I51) | IPR008914 | Phosphatidylethanolamine-binding protein | – |
| evm.scaffold5.277 | Subtilisin-like protease (S08A) | IPR015500; | Peptidase S8, subtilisin-related; | – |
| evm.scaffold53.38 | S53 protease (S53) | IPR015366; | Peptidase S53, activation domain; | – |
| evm.scaffold53.39 | S53 protease (S53) | IPR015366; | Peptidase S53, activation domain; | – |
| evm.scaffold16.111 | Cytochrome P450 | IPR002401 | Cytochrome P450, E-class, group I | Yes |
| evm.scaffold7.193 | Phenol 2-monooxygenase | IPR023753; | FAD/NAD(P)-binding domain; | – |
| evm.scaffold10.132 | Putative hydrophobic surface-binding protein | IPR021054 | Cell wall mannoprotein 1 | – |
| evm.scaffold11.199 | Hypothetical protein | IPR006689; | Small GTPase superfamily, ARF/SAR type; | Yes |
| evm.scaffold14.153 | Hypothetical protein | IPR008030; | NmrA-like domain; | |
| evm.scaffold27.89 | Hypothetical protein | – | – | – |
| evm.scaffold2.412 | Hypothetical protein | IPR000994 | Peptidase M24 | – |
| evm.scaffold38.41 | Hypothetical protein | IPR001623; | DnaJ domain; | Yes |
| evm.scaffold4.300 | Hypothetical protein | – | – | – |
| evm.scaffold57.22 | Hypothetical protein | IPR017905 | ERV/ALR sulfhydryl oxidase domain | – |
| evm.scaffold58.12 | – | – | – | – |
| evm.scaffold58.13 | Fasciclin-like protein | IPR008972 | Cupredoxin | – |
a“Yes” signifies the proteins having hits in PHI-base database and “b” means the hits were annotated as effectors
Secreted proteins exclusively affected by nonsynonymous SNPs and frameshift mutations in S12
| Gene ID | Variant type | Brief description | InterPro accession | InterPro name | PHI-hita |
|---|---|---|---|---|---|
| evm.scaffold1.948 | nonsynonymous SNPs | GH18, CBM5 | IPR003610; | Carbohydrate-binding module family 5/12; | |
| evm.scaffold3.480 | nonsynonymous SNPs | PL1 | IPR002022; | Pectate lyase | Yes |
| evm.scaffold67.10 | nonsynonymous SNPs | Laccase | IPR001117; | Multicopper oxidase, type 1; | Yes |
| evm.scaffold31.72 | nonsynonymous SNPs | Hypothetical protein | – | – | |
| evm.scaffold35.14 | nonsynonymous SNPs | Hypothetical protein | – | – | |
| evm.scaffold13.87 | frameshift mutation | Hypothetical protein | – | – |
a“Yes” signifies the proteins cse
Fig. 4Distribution of gene numbers in each gene category. Gene categories include (a) core genes, (b) deletion genes, (c) reference-specific genes. Duplicated genes and secreted protein coding genes in relation to each type of gene set were displayed
Summary of different categories of genes in relation to S15
| Isolates | Duplicated regionsa | Duplicated genes | Completely deleted genes | Partially deleted genes | Novel genes | Novel genes with Blastp hit | Novel genes with GO terms | Novel genes with domain(s) |
|---|---|---|---|---|---|---|---|---|
| S1 | 62 | 34 | 91 | 212 | 142 | 114 | 39 | 34 |
| S2 | 83 | 52 | 98 | 219 | 153 | 126 | 36 | 34 |
| S3 | 64 |
| 94 | 194 | 134 | 109 | 33 | 34 |
| S4 | 78 | 50 | 59 | 218 | 157 | 124 | 27 | 32 |
| S5 | 67 | 50 | 82 | 195 | 156 | 137 | 35 | 39 |
| S6 |
|
|
|
|
|
|
|
|
| S7 | 81 | 53 |
|
| 155 | 127 | 34 | 34 |
| S8 | 61 | 45 | 107 | 190 | 137 | 111 | 28 | 28 |
| S9 |
| 28 | 95 | 219 | 130 | 104 | 23 | 26 |
| S10 | 71 | 43 | 71 | 197 | 155 | 125 | 30 | 32 |
| S11 | 58 | 32 | 103 | 213 | 121 | 99 | 24 | 24 |
| S12 | 59 | 39 | 106 | 216 |
|
|
|
|
| S13 | 66 | 38 | 84 | 207 | 150 | 116 | 34 | 32 |
| S14 | 80 | 43 | 95 | 219 | 142 | 113 | 27 | 28 |
a the largest and the smallest numbers in each category are in bold
Reference-specific genes on scaffold38 and scaffold51 with annotations
| Gene ID | Brief description | GO ID | InterPro accession | InterPro name |
|---|---|---|---|---|
| evm.scaffold38.25a | Short chain dehydrogenase reductase | GO:0016491;GO:0055114 | IPR002347; | Short-chain dehydrogenase/reductase SDR; NAD(P)-binding domain |
| evm.scaffold38.27a | MFS general substrate transporter | GO:0055085;GO:0016021 | IPR011701; | Major facilitator superfamily; Major facilitator superfamily domain |
| evm.scaffold38.28a | Alpha beta-hydrolase | – | IPR029058 | Alpha/Beta hydrolase fold |
| evm.scaffold38.29a | Cytochrome P450 | GO:0005506;GO:0016705;GO:0055114;GO:0020037; | IPR001128 | Cytochrome P450 |
| evm.scaffold38.31 | Transmembrane |
| IPR008547; IPR029058 | Protein of unknown function DUF829, TMEM53; |
| evm.scaffold38.32a | Short chain dehydrogenase reductase | GO:0016491;GO:0008152 | IPR002347; | Short-chain dehydrogenase/reductase SDR; NAD(P)-binding domain |
| evm.scaffold38.33 | Hypothetical protein | GO:0016020 | – | Alpha/Beta hydrolase fold |
| evm.scaffold51.36b | – | – | – | – |
| evm.scaffold51.42 | Glucose-6-phosphate isomerase | GO:0004347;GO:0006094;GO:0006096; | IPR001672; | Phosphoglucose isomerase (PGI); |
| evm.scaffold51.44b | Hypothetical protein | – | – | – |
a denotes the proteins having PHI-base database hits
b denotes the secreted proteins
Fig. 5Domain distribution of secreted proteins in the deletion gene set. Symbol “*” signifies the presence of partial deletions but not counted as present in the isolates. Isolates were ranked by their virulence in descending order as in Fig. 2a
Fig. 6Domain distribution of predicted proteins in the novel gene set. PFAM domains related to TE were highlighted in red. Isolates were ranked by their virulence in descending order as in Fig. 2a