Literature DB >> 23341945

A genome-wide association study identifies genomic regions for virulence in the non-model organism Heterobasidion annosum s.s.

Kerstin Dalman1, Kajsa Himmelstrand, Åke Olson, Mårten Lind, Mikael Brandström-Durling, Jan Stenlid.   

Abstract

The dense single nucleotide polymorphisms (SNP) panels needed for genome wide association (GWA) studies have hitherto been expensive to establish and use on non-model organisms. To overcome this, we used a next generation sequencing approach to both establish SNPs and to determine genotypes. We conducted a GWA study on a fungal species, analysing the virulence of Heterobasidion annosum s.s., a necrotrophic pathogen, on its hosts Picea abies and Pinus sylvestris. From a set of 33,018 single nucleotide polymorphisms (SNP) in 23 haploid isolates, twelve SNP markers distributed on seven contigs were associated with virulence (P<0.0001). Four of the contigs harbour known virulence genes from other fungal pathogens and the remaining three harbour novel candidate genes. Two contigs link closely to virulence regions recognized previously by QTL mapping in the congeneric hybrid H. irregulare × H. occidentale. Our study demonstrates the efficiency of GWA studies for dissecting important complex traits of small populations of non-model haploid organisms with small genomes.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23341945      PMCID: PMC3547014          DOI: 10.1371/journal.pone.0053525

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Genome-wide association (GWA) studies have been used to elucidate the genetic basis for disease traits [1]. The feasibility of GWA studies in plants was demonstrated by Atwell and colleagues [2] who tested 107 phenotypes in Arabidopsis thaliana inbred lines, and has also been used to investigate the genetic basis of resistance in different crops or various traits in wild plants and trees [3]. So far, very few GWA studies have been applied to fungi, even though the tools required are available and fungi have the advantages of small genomes, a homokaryotic (haploid) stage, a sexual life cycle and the ability to multiply genotypes for reproducibility. In Saccharomyces cerevisiae GWA mapping of mtDNA copy number identified one significant SNP [4] and population based GWA analysis of clinical vs. nonclinical yeast identified several genetic loci associated with the clinical background [5]. However, several different genetic traits have been studied in fungi using quantitative trait loci (QTL) analysis. In Pleurotus ostreatus, Santoyo et al. [6] located QTLs linked to lignin-degrading enzymatic activities on six different chromosomes. A QTL associated with pathogenicity on wheat has been identified in the Fusarium head blight fungus Gibberella zeae [7]. In Nectria haematococca MPI, several QTLs for virulence on pumpkin have been found [8]. The genetic components controlling virulence in the necrotrophic forest pathogen Heterobasidion annosum (Fr.) Bref. sensu lato (s.l.) have been studied by QTL mapping of fungal growth within sapwood and induced lesion length in phloem in Norway spruce (Picea abies) and Scots pine (Pinus sylvestris) [9]. Four QTLs, for both lesion length and growth in sapwood on spruce and pine were found located within the same linkage group, while two other pine-specific QTLs associated with growth or lesion length were located to two other groups [9]. In addition to nuclear genetic factors, the virulence has been shown to be influenced by the mitochondrial genome using hybrids of H. irregulare × H. occidentale [10]. The advantage of using GWA studies over QTL mapping is that no mapping population needs to be generated. Furthermore, because many more recombination events have occurred in natural populations compared to in a single crossing, the linkage blocks are expected to be smaller than in QTL mapping, which implies a higher map resolution [3]. Population genomics and genome-wide mapping have revealed recent hybridization and genetic introgression between the pathogenic fungi Coccidioides posadasii and C. immitis [11] while a survey of the domesticated and wild yeasts Saccharomyces cerevisiae and S. paradoxus showed that phenotypic variation correlated broadly with global genome-wide phylogenetic relationships [12]. By SNP identification and whole-transcriptome sequencing, Ellison et al. [13] discovered two recently diverged populations of Neurospora crassa in which they identified genomic islands of high divergence that may be the result of local adaptation to a temperature difference between the two populations. The necrotrophic pathogen H. annosum s.l. causes severe damage in coniferous forests throughout the Northern Hemisphere [14]. H. annosum s.l. is a complex consisting of five species with different but partially overlapping host preferences [15], [16]. H. annosum sensu stricto (s.s.) mainly infects Pinus spp. but is also able to attack Picea spp. and other soft and hard wood tree species throughout Europe [17]. The economic losses due to devaluation of saw timber because of stem rot, reduced tree growth and tree mortality reach 1 billion Euros yearly for European forest owners. In addition, H. annosum s.l. challenges the coniferous forests potential to act as a carbon sink by releasing CO2 from decayed wood. In this study, we identify novel and well-known virulence genes in the conifer pathogen Heterobasidion annosum s.s. through GWA studies based on large scale single nucleotide polymorphism (SNP) identification and typing through next generation sequencing. Our results demonstrate the efficiency of this technique in a haploid non-model organism.

Results

Generation of a Genomic Data Set

We sequenced the genomes of 23 haploid H. annosum s.s. isolates with an Illumina Genome Analyzer. We obtained paired end reads from a 400-base pair (bp) insert library from three of the isolates (90211/2, Sä_16-4 and W_15), while 36-bp single end reads were generated from the remaining 20. The number of reads ranged between 3.2 and 8.9 million for the single end read isolates, and between 12.3 and 15.3 million for the paired end read isolates (Table 1). A combined reference genome was de novo assembled from the paired end sequenced isolates using the Velvet assembler [18], [19]. The assembly resulted in a reference genome of 56,195 contigs with an N50 of 41,157 bp and a median coverage of 38.7×. The number of contigs of at least 1000 bp was 2330, which comprise in total 30,569,260 bp (internal gaps excluded). These contigs were used as reference sequences when sequence reads of each isolate were assembled including the three used in the de novo assembly. The assembly of the individual isolates was achieved through mapping towards the combined reference genome using MOSAIK (http://bioinformatics.bc.edu/marthlab/Mosaik). For each isolate, 48% to 75% of the sequence reads were aligned to the reference and the coverage ranged between 80% and 94% with gap regions included (Table 1). The mean read coverage was between 2.6× and 12.6× of the individual isolates. There were 4,398,305 positions where all 23 individuals had sequence coverage of at least 2×. In these positions, a total of 64,055 SNPs were found and 33,018 of these were not singletons. We tested our SNP panel for population structure with the program Structure [20], [21], [22], using one SNP per contig. No signs of population structure were detected.
Table 1

Sequence data for the fungal isolates.

IsolateNumber of readsAligned (%a)Bp covered in %b Mean coverage (×)
87087/880855005663815 (70)91.86.5
90231/277092025812383 (75.4)91.76.7
87068/248816803350754 (68.6)87.74.0
Rb_28-2057451163869565 (67.4)88.84.6
W_15c 1525925410853302 (71.1)9412.5
V5:91_C465384844670833 (71.4)915.4
91202/287853616411227 (73)91.77.4
92153/146659453205708 (68.7)873.8
92182/183874924033923 (48.1)86.34.7
93030/153248513675732 (69)88.34.3
Fr_38-957363283943813 (68.8)89.14.7
AR_1839841532754913 (69.1)86.43.3
Sä_34-174289055504802 (74.1)916.3
V12:56_C448205943162468 (65.6)86.73.7
MJT155_b-483069306038861 (72.7)91.56.9
V3:47_C432196902181339 (67.7)80.22.6
Rb_30-338896672780017 (71.5)86.83.3
90211/2c 1467572610950439 (74.6)93.912.6
93014/187246536316731 (72.4)91.97.3
FW2-244544123123319 (70.1)86.43.7
Sä_16-4c 123385779018462 (73.1)93.910.4
L12-156615203239493 (57.2)85.43.8
V13:53_C243854063034864 (69.2)87.83.6

Percent aligned compared with total number of reads.

Coverage on reference sequence, Sä16-4, 90211-2 and W15 combined.

Paired-end sequenced isolates.

Percent aligned compared with total number of reads. Coverage on reference sequence, Sä16-4, 90211-2 and W15 combined. Paired-end sequenced isolates.

Fungal Virulence on Pine and Spruce Differs Significantly between Isolates

Virulence of H. annosum s.s. on Scots pine and Norway spruce was measured both as lesion length formed under the bark and fungal growth in the sapwood of the plants. The average infection success was 90% for each plant species and ranged between 30% and 100% for individual isolates (Table 2). There was a significant difference in virulence between isolates on pine and on spruce, measured both as lesion length and fungal growth in sapwood (P<0.05, ANOVA). In spruce, fungal growth (SFG) was normally distributed. However, the measurements for growth in pine (PFG) and lesion length in both spruce (SLL) and pine (PLL) had to be transformed to generate a normal distribution. A correlation between lesion length and fungal growth for spruce (R 2 = 0.6684) was found, but not for pine. A weak but still significant correlation (R 2 = 0.2363, P = 0.0187) was found between PFG and SFG.
Table 2

Heterobasidion annosum s.s. virulence analysis on Picea abies and Pinus sylvestris.

Isolatea Pine, lesionb Pine, growthb Spruce, lesionb Spruce, growthb
Mean (mm)Successfulc Mean (mm)Successfulc Mean (mm)Successfulc Mean (mm)Successfulc
87087/833.0±37.092.8±5.795.9±4.491.1±2.29
90231/26.3±5.391.1±3.391.0±1.733.3±2.93
87068/635.4±61.497.0±8.6106.1±2.396.7±10.39
Rb_28-2016.4±31.199.5±9.0105.3±3.41021.5±9.110
W159.3±5.51020.5±14.8106.6±3.01023.0±13.010
V5:91_C410.3±6.1109.0±5.2109.6±6.61026.0±13.710
91202/211.8±14.3916.1±17.198.2±5.4927.0±12.310
92153/116.6±16.41011.5±9.8106.6±1.9927.5±12.310
92182/117.7±18.11020.5±16.8106.5±4.91028.0±11.810
93030/19.1±7.5811.0±9.7109.7±7.11029.0±8.410
Fr_38_919.9±26.61026.5±21.4106.3±4.01030.0±15.310
AR_1814.1±9.21020.0±14.3107.7±3.6934.5±14.010
Sä_34-118.4±24.41017.5±11.81011.3±8.21035.5±12.810
V12:56_C46.1±5.185.0±4.4914.6±12.0835.5±12.68
MJT_155b-412.5±15.11020.0±11.1109.7±5.51036.0±13.310
V3:47_C-49.1±6.51019.0±16.8107.3±4.81036.0±9.910
Rb_30-317.7±19.11011.0±10.81012.8±18.71037.0±11.610
90211/29.4±6.91014.0±16.01016.0±11.2938.0±16.210
93014/17.3±4.21021.5±14.2107.6±3.51038.0±11.410
FW2_218.2±23.0931.1±23.6910.4±6.51039.5±7.310
Sä_16-424.6±38.11029.0±24.01011.8±6.0939.5±12.810
L12-16.1±3.9912.5±12.81010.0±3.31040.5±9.010
V13:53_C29.2±3.964.2±3.8641.2±24.0562.0±19.65

Isolates are sorted from low to high mean growth in sapwood of spruce.

Virulence was measured as lesion length and growth in sapwood up- and downstem from the wound combined in mm, ± s.d.

Number of successful infections out of 10.

Isolates are sorted from low to high mean growth in sapwood of spruce. Virulence was measured as lesion length and growth in sapwood up- and downstem from the wound combined in mm, ± s.d. Number of successful infections out of 10.

Twelve SNPs are Significantly Associated with Fungal Virulence

The general linear model revealed an association between 12 SNP markers and virulence with an adjusted P-value lower than 0.0001 (10 000 permutations) (Table 3). Out of these twelve, six SNP markers were found on the same contig (contig 41480); the remaining six SNP markers were distributed on six different contigs (Fig. 1). Four SNP markers were associated with SFG and the remaining eight with PFG. No markers were associated with SLL or PLL.
Table 3

Significantly associated SNP markers (P_adj≤1.00×10−4).

SNP IdContigPosition in contig P P_adja Traitb Genotype Minor/majorHomolog in Hetan2 Scaffold:PositionProtein Idc Gene OntologyGeneSubstitutionAA change (+strand) Minor/Major
25414128415297.50×10−6 1.00×10−4 SFGT/C2:90374IntergenicNone
318884148041514.82×10−5 1.00×10−4 PFGG/A13:5806291488820016787, Hydrolase activityPutative calcineurinSynonymousNone
318904148043484.82×10−5 1.00×10−4 PFGT/G13:5808261488820016787, Hydrolase activityPutative calcineurinIntronNone
318914148044144.82×10−5 1.00×10−4 PFGG/A13:5808921488820016787, Hydrolase activityPutative calcineurinSynonymousNone
318984148086094.82×10−5 1.00×10−4 PFGT/C13:575283IntergenicNone
3191241480114614.82×10−5 1.00×10−4 PFGT/C13:5693184828430005515, Protein binding; 0008270, Zinc ion bindingUnknownNon-synonymousCys (C)/Arg (R)
3191541480121274.82×10−5 1.00×10−4 PFGT/C13:5699854828430005515, Protein binding; 0008270, Zinc ion bindingUnknownSynonymousNone
716416590178299.63×10−5 1.00×10−4 PFGA/G10:1046969454667ExopolyphosphataseNon-synonymousPhe (F)/Ser (S)
49819600303234.46×10−5 1.00×10−4 PFGA/G1:56228899077 SWI5 transcription factorSynonymousNone
376564532266356.93×10−5 1.00×10−4 SFGT/C3:163145102298UnknownIntronNone
6505156277927.50×10−6 1.00×10−4 SFGT/C13:53143IntergenicNone
5197150191174542.12×10−5 1.00×10−4 SFGC/T5:438018IntergenicNone

P-value adjusted after 10,000 permutations.

SFG, fungal growth in spruce sapwood, upstem and downstem combined; PFG, fungal growth in pine sapwood, upstem and downstem combined.

ProteinId number for the homologous gene in Hetan2, http://genome.jgi-psf.org/Hetan2/Hetan2.home.html.

Figure 1

Overview of p-values (−log10 scale) for SNP associations to Heterobasidion virulence in spruce and pine.

The p-values are plotted against the site number from the SNP extraction. The contigs with significant SNPs are indicated with orange lines. Abbreviations: PFG, fungal growth in pine sapwood; PLL, lesion length in pine; SFG, fungal growth in spruce sapwood; SLL, lesion length in spruce.

Overview of p-values (−log10 scale) for SNP associations to Heterobasidion virulence in spruce and pine.

The p-values are plotted against the site number from the SNP extraction. The contigs with significant SNPs are indicated with orange lines. Abbreviations: PFG, fungal growth in pine sapwood; PLL, lesion length in pine; SFG, fungal growth in spruce sapwood; SLL, lesion length in spruce. P-value adjusted after 10,000 permutations. SFG, fungal growth in spruce sapwood, upstem and downstem combined; PFG, fungal growth in pine sapwood, upstem and downstem combined. ProteinId number for the homologous gene in Hetan2, http://genome.jgi-psf.org/Hetan2/Hetan2.home.html. The length of the contigs varied between 2.5 kilobase pairs (kb) and 76.5 kb (Table 4) and the SNP density varied in the contigs from 0.4 to 1.7 SNPs per kb (minor allele frequency ≥2/23 individuals; coverage in all individuals). The linkage disequilibrium (LD) heat maps show clear blocks of LD in the two contigs 4128 and 41480 associated with the markers for virulence traits (Fig. 2). In the remaining five contigs the SNP marker associated with virulence was not associated with any LD block (Fig. 2 and Fig. S1). The six SNP markers located in contig 41480 were found in two distinct LD blocks with the same statistical support (P<0.0001). The less virulent genotypes were present at low frequencies in the population (Fig. 3).
Table 4

Data for the contigs with SNPs associated with virulence.

Contig numberSize of contig (bp)Number of SNPs Minor allele 2 Coverage 23/23 isolatesNumber of SNPs Minor allele 2 Coverage 15/23 isolates
412866260112764
414807653293387
165904614619124
96003400637518
453221254318230
156272528315
501914023760510
Figure 2

Overview of four genomic regions significantly associated with Heterobasidion virulence in spruce and pine.

The upper part of each figure plots the p-values (−log10 scale) for the four traits (up- and downstem combined) to the genomic position (in bp). Abbreviations: PFG, fungal growth in pine sapwood; PLL, lesion length in pine; SFG, fungal growth in spruce sapwood; SLL, lesion length in spruce. The lower part displays linkage disequilibrium (LD) heat maps. The heat map illustrates the LD value r 2 from white to red where red indicates high r 2 -values. Significant SNP markers are in red. (A) Contig 4128; (B) Contig 41480; (C) Contig 16590; (D) Contig 9600.

Figure 3

Genotypes for SNPs significantly associated with virulence in spruce and pine.

Genotypes for SNPs significantly associated with: (A) fungal growth in spruce sapwood; contig 4128, 15627, 45322, 50191 and (B) fungal growth in pine sapwood; contig 9600, 16590 and 41480. All associations were adjusted by a permutation test, the positions shown have a p adj <0.0001. Non-synonymous substitutions are labelled with an asterisk.

Overview of four genomic regions significantly associated with Heterobasidion virulence in spruce and pine.

The upper part of each figure plots the p-values (−log10 scale) for the four traits (up- and downstem combined) to the genomic position (in bp). Abbreviations: PFG, fungal growth in pine sapwood; PLL, lesion length in pine; SFG, fungal growth in spruce sapwood; SLL, lesion length in spruce. The lower part displays linkage disequilibrium (LD) heat maps. The heat map illustrates the LD value r 2 from white to red where red indicates high r 2 -values. Significant SNP markers are in red. (A) Contig 4128; (B) Contig 41480; (C) Contig 16590; (D) Contig 9600.

Genotypes for SNPs significantly associated with virulence in spruce and pine.

Genotypes for SNPs significantly associated with: (A) fungal growth in spruce sapwood; contig 4128, 15627, 45322, 50191 and (B) fungal growth in pine sapwood; contig 9600, 16590 and 41480. All associations were adjusted by a permutation test, the positions shown have a p adj <0.0001. Non-synonymous substitutions are labelled with an asterisk.

Surveying the Genomic Regions Associated with Fungal Virulence

The seven H. annosum s. s. contigs containing SNPs associated with virulence were highly homologous to the H. irregulare genome (http://genome.jgi-psf.org/Hetan2/Hetan2.home.html) [23] and the gene order within each contig was well conserved. The contigs were not linked among each other and they were distributed over different scaffolds. Contig 4128 had sequence homology to positions 49591–113806 on scaffold 2 in the genome of H. irregulare. The LD block harbouring the SNP associated with SFG was located between position 40886 and 66145 in the contig (corresponding to position 89756-113692, scaffold 2 in H. irregulare). This region contained nine genes that encode: a serine protease, a transcriptional co-repressor, a quinone oxidoreductase (ToxD), an inner centromere protein, a urea transporter, an enzyme similar to a DNA-dependent ATPase, a sorbitol dehydrogenase and two flavin-containing monooxygenases (Fig. 2A). Marker 2541, associated with SFG, was located in between the two genes; the serine protease and the transcriptional co-repressor (Fig. 2A). Linkage was found between this marker and SNPs in the genes that encode a cytochrome P450, a urea transporter, a DNA-dependent ATPase and a sorbitol dehydrogenase. Contig 41480 had homology to positions 499810–584466 on scaffold 13 in the genome of H. irregulare and they were syntenic, although there was a small break in homology between positions 532643 and 533812, where H. irregulare has a transposon inserted. The LD block between 2.2 and 7.0 kb in the contig spanned a putative calcineurin, similar to S. cerevisiae CNA-1, and the end of an acetylglutamate kinase/synthase gene (Fig. 2B). Two markers at positions 11461 and 12127 were found in a smaller LD block that was located in a gene of unknown function. An absolute LD (r 2 = 1) between the significant markers and the rest of the positions in the LD blocks was observed. This linkage is localized to the genes that encode calcineurin, acetylglutamate kinase and the longer unknown gene. Two of the markers associated with virulence in the calcineurin gene were synonymous whereas the third was located in an intron (Table 3). In addition, the small contig 15627 (2.5 kb) also had homology to scaffold 13 (51802–56242) in the genome of H. irregulare. This region contains the last 200 bp of a polyprenylsynthetase gene and three unknown genes, out of which two encode secreted proteins. Contig 16590 had homology to positions 1029321-1073048 on scaffold 10 of the H. irregulare genome (Fig. 2C). The significant SNP marker 7164 at position 17829 in the contig was located in a non-synonymous position in an exopolyphosphatase gene (Table 3). It had LD (r 2 = 1) to one unknown gene at position 14325 and to three intergenic positions. Contig 9600 had homology to positions 533823-565834 on scaffold 1 in the genome of H. irregulare. Only a few, quite small LD blocks were found, the largest being 1.7 kb (Fig. 2D). The significant marker at position 30323 was located to a synonymous position in the transcription factor SWI5 with an absolute LD (r 2 = 1) restricted to the same gene at positions 30227, 30247, 30352 and 30602 (Table 3). Contig 45322 and 50191 had homology to scaffold 3 and 5 of the genome of H. irregulare, respectively. The SNP in contig 45322 was localized to a gene of unknown function and the one in contig 50191 was found in an intergenic region (Fig. S1).

Discussion

Applying GWA analysis to an organism in a haploid stage that can be clonally reproduced in high numbers and phenotyped repeatedly increases the accuracy of the phenotypic measurements and the power of the association analysis with several orders of magnitude as compared to diploids. We demonstrated that as few as 23 haploid individuals could successfully be used to identify convincing associations in small fungal genomes, whereas several hundreds of individuals are needed in diploid organisms with large genomes such as humans and plants [24], [25]. Lind et al. [9] used a mapping population generated from a cross between H. irregulare and H. occidentale to show that several H. annosum s.l. QTLs were associated with fungal virulence on spruce and pine. An improved linkage map [26] located two QTLs on scaffolds 1 and 2 close to or overlapping the H. annosum s.s. contigs 9600 and 4128, suggesting that these regions harbour general virulence factors important in several Heterobasidion species, e.g. H. irregulare, H. occidentale and H. annosum s.s. Such general virulence regions have previously been suggested by Lind et al. [9] who found one linkage group with QTLs for four virulence traits. The overlapping results between these different studies and mapping methods strengthen the assumption that the regions target important virulence genes. These combined data suggest that virulence in H. annosum s. l. is controlled by regions of virulence factors that occur at different positions in the genome, of which some confer general virulence conserved between the different Heterobasidion species and other species-specific virulence that have evolved separately. Some of these species-specific genes found in H. annosum s.s. may encode virulence factors that are specialized for Scots pine infection. Interestingly, several Norway spruce specific virulence factors were also found with a significant association for SFG, confirming the ability of H. annosum s.s. to infect spruce as an alternative host. In addition, this fact may indicate that H. annosum s.s. uses different mechanisms to infect spruce than H. occidentale. The success and power of an association study is dependent on the number of SNP markers and on the LD decay. We analysed LD in six contigs using an extended SNP data set and the results indicate a variable LD block size present in our population (300 bp to 31 kb). The very limited LD for the associated SNPs in four of the contigs indicates a high-resolution mapping of these regions. The fact that only part of the genome could be included in the analysis makes it difficult to predict the number of SNPs that are needed for complete coverage in H. annosum s.s. Maize has been shown to have an LD decay between 1 and 10 kb that increased with the increase of minor allelic frequencies and with smaller sample sizes [27]. For the diploid maize, small sample size was defined as 25 individuals and a sample size of more than 50 individuals generated no significant differences in the mean r. The long LD regions found in H. annosum s.s. might therefore partly be an effect of the relatively low number of individuals used.

Gene models associated with fungal growth in spruce

Markers associated with fungal growth in spruce (SFG) were found to be located in gene models with varying functions (contig 4128, Fig. 2A) as well as to unknown/novel virulence genes (contig 45322, 50191, 15627, Fig. S1). Contig 4128 is located close to a previously known QTL for fungal growth in pine sapwood, which is why the genes found in this region are candidates as conserved, general, virulence factors. The gene encoding a quinone oxidoreductase (contig 4128) is similar to a gene that encodes zinc-binding oxidoreductase ToxD, a host-selective toxin produced by Pyrenophora tritici-repentis Pt-1C-BFP [28], with 27% sequence identity and 36% similarity. Ptr ToxD is a zinc enzyme, specific for NADPH that catalyses the one-electron reduction of certain quinones [29]. Pyrenophora tritici-repentis, causal agent of tan spot of wheat, is known to produce several additional host-selective toxins, including Ptr ToxA, Ptr ToxB and Ptr ToxC [30]. Serine carboxypeptidases, found in contig 4128, belong to the serine-type catalytic family and use the amino acid as a nucleophile to form an acyl intermediate. They can also act as transferases and are common in viruses, bacteria and eukaryotes [31]. In Schizosaccharomyces pombe it was shown that deletion of a serine carboxypeptidase gene, sxa2, causes hypersensitivity to the P-factor mating pheromone and a reduction in mating efficiency in M cells [32]. Moreover, in Cryptococcus neoformans a deletion mutant of KIN1, a serine/threonine protein kinase, was shown to have attenuated virulence [33]. The position of a significant marker between two genes that encode a serine protease and a transcriptional co-repressor indicate that the transcriptional regulation of either of the genes could be affected. Flavin-containing monooxygenases (FMOs), found in contig 4128, are xenobiotic-metabolizing enzymes that catalyse the oxygenation of nucleophilic nitrogen, sulphur, phosphorous and selenium atoms using NADPH as a cofactor and FAD as a prosthetic group [34]. They have been implicated in the metabolism of several pharmaceuticals, pesticides and other toxicants. Arabidopsis mutants overexpressing FMO1 had an increased basal resistance against Pseudomonas syringae pv. tomato and Hyaloperonospora parasitica, which suggests that the FMO was involved in detoxifying the virulence factors produced by the pathogens [35]. Another FMO was previously found within a QTL for lesion lengths in pine bark close to contig 9600 suggesting a general and conserved role in pathogenicity [9] (Lind M, Dalman K, Olson Å, Brandström-Durling M, Stenlid J in prep.).

Gene models associated with fungal growth in pine

Three contigs were found to harbour markers associated with fungal growth in pine (PFG) (contig 41480, 16590, 9600). The general picture of LD for the significant markers in contig 41480 is that it is limited to three genes, calcineurin, acetylglutamate kinase and a longer unknown gene. Therefore these genes, along with exopolyphosphatase (contig 16590) and SWI5 (contig 9600), are the strongest candidates for the PFG virulence trait. The putative gene for calcineurin found in this study represents a strong candidate for virulence signalling in H. annosum s.s. Calcineurin is a phosphatase regulated by Ca2+ and calmodulin, and is heavily involved in the calcium-dependent signal transduction pathways of many processes in eukaryotes, such as T cell activation, muscle hypertrophy, memory development, glucan synthesis, ion homeostasis and cell cycle control [36]. Furthermore, the protein confers a conserved function for virulence in several fungi; e.g. Candida albicans [37], Ustilago maydis [38] and Sclerotinia sclerotiorum [39]. The plant pathogen Botrytis cinerea calcineurin-responsive transcription factor CRZ1 was found to be required for penetration of plant surfaces [40]. In the rice blast fungi Magnaporthe oryzae, deletion mutants for the genes encoding a calcineurin-responsive transcription factor had a reduced virulence due to a defect in host penetration [41], [42]. A gene encoding N-acetylglutamate was found in contig 41480. N-acetylglutamate is involved in the biosynthesis of arginine in prokaryotes, lower eukaryotes and plants [43]. An insertional mutation or deletion of genes encoding N-acetylglutamate lead to reduced virulence in Gibberella zeae (anamorph, Fusarium graminearum) [43] and in Colletotrichum higginsianum [44]. Exopolyphosphatases (PPXs), found in contig 16590, hydrolyse and release the terminal phosphate from linear polyphosphate containing three or more phosphoanhydride bonds [45]. Together with polyphosphate kinases (PPKs), responsible for polyphosphate synthesis, PPXs maintain the dynamic balance of the polyphosphate level. Polyphosphate is essential for growth of cells, responses to stresses and virulence in several bacteria [45]. Mutants lacking PPX in the human pathogen Neisseria meningitidis had an increased resistance to complement-mediated killing [46]. In contig 9600 the significant marker was located to a SWI5 transcription factor. The homolog in C. albicans, CaACE2, was shown to affect virulence and deletion mutants of CaACE2 were avirulent in a mouse model [47]. Strong QTLs for lesions in pine and spruce bark, and growth in spruce sapwood, are overlapping this contig [9](Lind M, Dalman K, Olson Å, Brandström-Durling M, Stenlid J in prep.). The corresponding peaks of these QTLs were at approximately 400 kb (pine) and 600 kb (spruce) respectively on scaffold 1 in the genome of H. irregulare. Candidate genes from this region are likely to play an important part in general Heterobasidion pathogenicity.

Conclusions

In this study we show that GWA studies are useful for dissecting important complex traits of non-model organisms, in particular those with small genomes and a haploid life style. We characterized seven genomic regions associated with fungal growth in the sapwood of spruce and pine and present eight candidate virulence genes that encode quinone oxidoreductase (ToxD), serine carboxypeptidase, two flavin-containing monooxygenases, calcineurin, acetylglutamate kinase, exopolyphosphatase and the SWI5 transcription factor. Of these, all except calcineurin, acetylglutamate kinase and exopolyphosphatase were found very close to or directly overlapping with previously known virulence QTLs.

Methods

Plant and Fungal Material

Two-year-old Norway spruce (Picea abies) and Scots pine (Pinus sylvestris) plants, originating from Latvia and Gotthardsberg in Sweden, respectively, were washed and planted in 2 L pots with fertilized peat. The plants were grown for one month in the greenhouse at 20°C before inoculation. The 23 haploid H. annosum s.s. isolates used in this study originated from field studies from different geographic locations in Europe (Table 5). They were all single spore isolates isolated from basidiospores or conidiospores. All isolates were grown on Hagem medium [48] at 21°C in darkness for one week whereafter autoclaved pine wood blocks (5×5×5 mm) were placed on the mycelia. The cultures were incubated for a further four weeks to allow thorough colonization of the wood blocks.
Table 5

Haploid isolates of Heterobasidion annosum s.s. used in the study.

Isolatea Geographic originHostColl.b
87087/8Wettingen, Zürich, Switzerland Pinus sp.OH
90231/2Stolptsky, Belarus Juniperus sp.KK
87068/6Pistoia, Italy Pinus nigra PC
Rb_28-20Ramsåsa, Sweden Picea abies JS
W_15England, UKUnknownJS
V5:91_C4Vedby, Sweden Larix eurolepis JS
91202/2Tartu, Estonia Pinus sp.KK
92153/1Bayern, Happerg, Germany Picea abies OH
92182/1Stors, Hillerød, Denmark Picea abies IT
93030/1Bayern, Happerg, Germany Picea abies OH
Fr_38-9Frossarbo, Sweden Picea abies JS
AR_18England, UK Pinus sylvestris JS
Sä_34-1Sätuna, Sweden Picea abies JS
V12:56_C4Vedby, Sweden Larix eurolepis JS
MJT_155b-4Mjölby, Sweden Pinus sylvestris JS
V3:47_C-4Vedby, Sweden Larix eurolepis JS
Rb_30-3Ramsåsa, Sweden Picea abies JS
90211/2Negoreloje, Belarus Pinus sylvestris KK
93014/1Bayern, Happerg, Germany Picea abies OH
FW2-2Scotland, UK Picea sitchensis JS
Sä_16-4Sätuna, Sweden Picea abies JS
L12-1Scotland, UK Picea sitchensis JS
V13:53_C2Vedby, Sweden Larix eurolepis JS

Single spore isolates from basidiospores or conidiospores.

Collectors: JS – J. Stenlid; KK – K. Korhonen; OH – O. Holdenrieder; PC – P. Capretti; IT – I. Thomsen.

Single spore isolates from basidiospores or conidiospores. Collectors: JS – J. Stenlid; KK – K. Korhonen; OH – O. Holdenrieder; PC – P. Capretti; IT – I. Thomsen.

Virulence Assay

The spruce and pine plants were inoculated with an H. annosum s.s. isolate by cutting a small window (5×10 mm) in the cambium of the plant, halfway between two nodes, inserting a colonised wood block in the wound and then wrapping it with Parafilm®. The experiment was executed in blocks: two spruce and two pine plants were inoculated with each of the 23 fungal isolates each day for five consecutive days, so that in total ten spruce and ten pine plants were inoculated with each isolate. As a control, ten spruce and ten pine plants were inoculated with sterile wood blocks. After four weeks the plants were harvested and the lesion lengths upstem and downstem from the wound was measured. The stems were then cut into 5-mm pieces and placed on wet filter paper in Petri dishes for 7 days. Growth within sapwood upstem and downstem from the wound was scored by the presence of conidiophores of H. annosum s.s. Inoculation was considered to have been unsuccessful in plants that showed no visible lesion, no fungal growth in sapwood and no conidia in the wound: these stems were discarded.

Statistical Tests for the Virulence Assay

A one-way ANOVA, performed using Minitab® 15.1.0.0, was used to test the difference in virulence between isolates. The measured values for virulence were normally distributed for SFG, but not for PFG or for SLL and PLL. To obtain a normal distribution for the three latter traits, 2 were added to every measure whereupon they were log transformed. The resulting mean values for all four traits were used in the regression analysis and in the association mapping.

SNP Extraction

Heterobasidion annosum s.s. mycelia were grown and DNA extracted according to Lind et al. [49]. All library preparations and sequencing were performed at the SNP&SEQ Technology Platform of Uppsala University Hospital on the Illumina Genome Analyzer according to Illumina’s standard protocol. For three of the isolates (90211/2, Sä_16-4 and W_15), libraries with an insert length of 400 bp were sequenced from both ends (paired end). Reads from the three paired-end-sequenced isolates were de novo assembled using Velvet version 0.7.60 (hash length 21 bp) [18], [19]. Contigs larger than 1000 bp were used as reference sequences. Reads from each isolate (including those isolates that were used for the de novo assembly) were aligned to the reference contigs using MOSAIK version 1.0.1384 (http://bioinformatics.bc.edu/marthlab/Mosaik). Two mismatches per read were allowed using a hash size of 10; only the uniquely aligned reads were accepted. The SAMtools program (pileup -c) [50], [51] and an in-house Python script were used to find SNPs. Each SNP had to have a SNP quality of at least 10 according to Li et al. [51]. Only sites where all individuals had coverage of two reads or more were used. The data was submitted to the Sequence Read Archive, SRA (SRA050098).

Population Structure Analysis

Structure within the sample population was explored using the program STRUCTURE [20], [21], [22]. To minimise the impact of linkage between SNPs in the structure analysis, we selected a single SNP representing each contig. These were analysed with different assumed population subdivisions in the range from one to four subpopulations in a model assuming the SNPs were not linked and the organism haploid.

Association Mapping and Linkage Disequilibrium Analysis

The association between SNP markers and virulence estimates was analysed using TASSEL version 2.1 [52]. A general linear model (GLM) approach assuming a completely random mating population was used. The imported phenotypic data consisted of transformed and non-transformed mean values of the four traits. The association analysis was performed in two rounds; first using the SNP dataset from the whole genome and then using an extended dataset from each selected genomic region found to be significantly associated with virulence in the first round. The SNP dataset for the first round consisted of SNPs with a minor allelic frequency of 9% and a minimum coverage of 100% and in the second round a minor allelic frequency of 9% and a minimum coverage of 65%. A permutation test with 10 000 replicates, implemented in TASSEL, was used to correct the p-values for each site. Linkage disequilibrium (LD) heat maps, based on r calculations implemented in TASSEL, were constructed for the selected genomic areas using the second SNP dataset.

Alignment and Annotation

The SAMtools pileup -c output [50] and an in-house Python script were used to retrieve the consensus sequences from the selected genomic regions of all isolates. Corresponding genomic regions were identified using BLAST searches of the complete sequenced genome of H. irregulare, isolate TC32-1 (http://genome.jgi-psf.org/Hetan2/Hetan2.home.html). The gene annotations found in TC32-1 were transferred to the reference sequence using PROT_MAP, FGENESH-2 (SoftBerry, Mount Kisco, NY) and Artemis [53]. The alignments were further analysed for synonymous and non-synonymous substitutions using MEGA version 4 [54]. Overview of two genomic regions significantly associated with virulence in spruce and pine. The upper part of each figure plots the p-values (−log10 scale) for the four traits (up- and downstem combined) to the genomic position (in bp). Abbreviations: PFG, fungal growth in pine sapwood; PLL, lesion length in pine; SFG, fungal growth in spruce sapwood; SLL, lesion length in spruce. The lower part displays linkage disequilibrium (LD) heat maps. The heat map illustrates the LD value r 2 from white to red where red indicates high r 2 -values. Significant SNP markers are in red. (A) Contig 45322; (B) Contig 50191. (TIF) Click here for additional data file.
  47 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

Review 3.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

4.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

5.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

Review 6.  Using association mapping to dissect the genetic basis of complex traits in plants.

Authors:  David Hall; Carolina Tegström; Pär K Ingvarsson
Journal:  Brief Funct Genomics       Date:  2010-01-06       Impact factor: 4.241

7.  Population genomic sequencing of Coccidioides fungi reveals recent hybridization and transposon control.

Authors:  Daniel E Neafsey; Bridget M Barker; Thomas J Sharpton; Jason E Stajich; Daniel J Park; Emily Whiston; Chiung-Yu Hung; Cody McMahan; Jared White; Sean Sykes; David Heiman; Sarah Young; Qiandong Zeng; Amr Abouelleil; Lynne Aftuck; Daniel Bessette; Adam Brown; Michael FitzGerald; Annie Lui; J Pendexter Macdonald; Margaret Priest; Marc J Orbach; John N Galgiani; Theo N Kirkland; Garry T Cole; Bruce W Birren; Matthew R Henn; John W Taylor; Steven D Rounsley
Journal:  Genome Res       Date:  2010-06-01       Impact factor: 9.043

8.  Calcineurin-responsive zinc finger transcription factor CRZ1 of Botrytis cinerea is required for growth, development, and full virulence on bean plants.

Authors:  Julia Schumacher; Inigo F de Larrinoa; Bettina Tudzynski
Journal:  Eukaryot Cell       Date:  2008-02-08

9.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

10.  Identification of quantitative trait loci affecting virulence in the basidiomycete Heterobasidion annosum s.l.

Authors:  Mårten Lind; Kerstin Dalman; Jan Stenlid; Bo Karlsson; Ake Olson
Journal:  Curr Genet       Date:  2007-06-14       Impact factor: 2.695

View more
  21 in total

1.  Estimating the Fitness Effect of Deleterious Mutations During the Two Phases of the Life Cycle: A New Method Applied to the Root-Rot Fungus Heterobasidion parviporum.

Authors:  Pierre-Henri Clergeot; Nicolas O Rode; Sylvain Glémin; Mikael Brandström Durling; Katarina Ihrmark; Åke Olson
Journal:  Genetics       Date:  2018-12-31       Impact factor: 4.562

2.  Call for a California coccidioidomycosis consortium to face the top ten challenges posed by a recalcitrant regional disease.

Authors:  George R Thompson; David A Stevens; Karl V Clemons; Josh Fierer; Royce H Johnson; Jane Sykes; George Rutherford; Michael Peterson; John W Taylor; Vishnu Chaturvedi
Journal:  Mycopathologia       Date:  2014-10-16       Impact factor: 2.574

3.  Identification of Diverse Mycoviruses through Metatranscriptomics Characterization of the Viromes of Five Major Fungal Plant Pathogens.

Authors:  Shin-Yi Lee Marzano; Berlin D Nelson; Olutoyosi Ajayi-Oyetunde; Carl A Bradley; Teresa J Hughes; Glen L Hartman; Darin M Eastburn; Leslie L Domier
Journal:  J Virol       Date:  2016-07-11       Impact factor: 5.103

Review 4.  The frequency of sex in fungi.

Authors:  Bart P S Nieuwenhuis; Timothy Y James
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-10-19       Impact factor: 6.237

Review 5.  Evolutionary Perspectives on Human Fungal Pathogens.

Authors:  John W Taylor
Journal:  Cold Spring Harb Perspect Med       Date:  2014-11-10       Impact factor: 6.915

6.  A Quantitative Genomic Approach for Analysis of Fitness and Stress Related Traits in a Drosophila melanogaster Model Population.

Authors:  Palle Duun Rohde; Kristian Krag; Volker Loeschcke; Johannes Overgaard; Peter Sørensen; Torsten Nygaard Kristensen
Journal:  Int J Genomics       Date:  2016-04-19       Impact factor: 2.326

Review 7.  Using Population and Comparative Genomics to Understand the Genetic Basis of Effector-Driven Fungal Pathogen Evolution.

Authors:  Clémence Plissonneau; Juliana Benevenuto; Norfarhan Mohd-Assaad; Simone Fouché; Fanny E Hartmann; Daniel Croll
Journal:  Front Plant Sci       Date:  2017-02-03       Impact factor: 5.753

8.  Bacterial Genetic Architecture of Ecological Interactions in Co-culture by GWAS-Taking Escherichia coli and Staphylococcus aureus as an Example.

Authors:  Xiaoqing He; Yi Jin; Meixia Ye; Nan Chen; Jing Zhu; Jingqi Wang; Libo Jiang; Rongling Wu
Journal:  Front Microbiol       Date:  2017-11-27       Impact factor: 5.640

9.  Gene expression profiling of candidate virulence factors in the laminated root rot pathogen Phellinus sulphurascens.

Authors:  Holly L Williams; Rona N Sturrock; Muhammad A Islam; Craig Hammett; Abul K M Ekramoddoullah; Isabel Leal
Journal:  BMC Genomics       Date:  2014-07-17       Impact factor: 3.969

10.  Comparative Genomics of Sibling Fungal Pathogenic Taxa Identifies Adaptive Evolution without Divergence in Pathogenicity Genes or Genomic Structure.

Authors:  Fabiano Sillo; Matteo Garbelotto; Maria Friedman; Paolo Gonthier
Journal:  Genome Biol Evol       Date:  2015-11-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.