Literature DB >> 26602546

Genome-wide analysis of Fusarium graminearum field populations reveals hotspots of recombination.

Firas Talas1, Bruce A McDonald2.   

Abstract

BACKGROUND: Fusarium graminearum (Fg) is a ubiquitous pathogen of wheat, barley and maize causing Fusarium head blight. Large annual yield losses and contamination of foodstuffs with harmful mycotoxins make Fg one of the most-studied plant pathogens. Analyses of natural field populations can lead to a better understanding of the evolutionary processes affecting this pathogen. Restriction site associated DNA sequencing (RADseq) was used to conduct population genomics analyses including 213 pathogen isolates from 13 German field populations of Fg.
RESULTS: High genetic diversity was found within Fg field populations and low differentiation (FST = 0.003) was found among populations. Linkage disequilibrium (LD) decayed rapidly over a distance of 1000 bp. The low multilocus LD indicates that significant sexual recombination occurs in all populations. Several recombination hotspots were detected on each chromosome, but different chromosomes showed different levels of recombination. There was some evidence for selection hotspots.
CONCLUSIONS: The population genomic structure of Fg is consistent with a high degree of sexual recombination that is not equally distributed across the chromosomes. The high gene flow found among these field populations should enable this pathogen to adapt rapidly to changes in its environment, including deployment of resistant cultivars, applications of fungicides and a warming climate.

Entities:  

Mesh:

Year:  2015        PMID: 26602546      PMCID: PMC4659151          DOI: 10.1186/s12864-015-2166-0

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Population genomics analyses can provide insight into the evolutionary history of populations by providing rich, genome-wide measures of genetic variation and its related parameters including gene flow, population size and reproductive system [1, 2]. Population genomics studies can also identify genomic regions experiencing exceptional degrees of recombination or selection. Coupled with annotations that come out of genome sequencing projects, it becomes possible to identify candidate genes within interesting hotspots that may underlie differences among populations that arise through selection for local adaptation. Recombination has an important effect on both genotypic diversity and evolutionary potential. The evolutionary potential reflects a pathogen’s response to control measures such as fungicide applications and deployment of resistant cultivars. Both homothallic inbreeding and asexual reproduction can reduce the effective recombination rate relative to random outcrossing [3, 4], generating a non-random association among loci over the genome called linkage disequilibrium (LD). The pattern of LD can provide insight into an organism’s evolutionary history [5, 6]. Different genomic regions within the same population can exhibit different LD patterns as a result of differences in selection or differences in recombination rates [6, 7]. However the size of LD blocks will depend on the methods used for detection. Recent advances in sequencing technology allowed a significant expansion in the number of loci used for LD measurements, which now include thousands of single nucleotide polymorphisms (SNPs) that cover the entire genome [8-10]. A recent study on Drosophila melanogaster reported an extremely rapid decay of LD within the surprisingly small distance of 10 bp [11, 12]. Population genomic studies that search for associations between markers and quantitative traits require a large number of genetic markers [1, 2], often using hybridization approaches based on dense panels of SNP markers that cover the genome. An alternative approach invented by Baird et al. [3, 4], called restriction site associated DNA sequencing (RADseq), combines a high density of SNP markers, with a high genome coverage at a relatively low cost [5, 6]. Fusarium head blight (FHB) disease causes significant annual losses in cereal production around the world, in addition to frequent contamination of foodstuffs with harmful mycotoxins such as nivalenol, deoxynivalenol, and zearalenone. FHB is caused by different species related to the genus Fusarium, including F. graminearum (anamorph: Gibberella zeae (Schwein.) petch) and F. culmorum. F. graminearum sensu stricto (Fg ss) is one of 14 cryptic species within the F. graminearum species complex [6, 7]. Fg ss is a homothallic fungus with a mixed reproductive system including inbreeding, outcrossing and asexual reproduction. Sexual reproduction in this fungus is highly dependent on temperature, with an optimum of 25–28 °C [8-10]. The pattern of ascospore release varies between regions and years [11, 12] depending on the temperature and relative humidity. Hence, recombination rates may differ among pathogen populations as a result of differences in their local environments. Several studies of population genetic structure have been conducted in different countries [9, 13–17] to better understand the connection between genetic variation and phenotypic variation at the population scale. Most of the earlier population studies of Fg included a limited number of isolates or a limited number of markers that were not equally distributed over the four Fg chromosomes [4, 13, 14, 16, 18–20]. Our goals in this study were to use population genomics analyses to: (i) Determine the genetic structure of field populations of Fg ss to assess the reproductive system and the degree of population subdivision occurring over regional spatial scales; (ii) Determine the extent of linkage disequilibrium in the genome and conduct a genome-scale search for recombination hotspots; (iii) Search for evidence of selection hotspots that may contain candidate genes under strong selection.

Results

Genetic variance analysis

Our analyses included thirteen field populations of Fg ss coming from different geographical areas and environments across Germany (Table 1). The filtered RADseq dataset included 1129 SNPs with a maximum of 1.8 % missing data per SNP. Partitioning of the genetic variance within and among field populations using AMOVA revealed that 99.7 % of the total genetic variance was within field populations and only 0.3 % was among populations. The corresponding overall Fixation index (FST) was 0.003 while pairwise differentiation between populations ranged from 0 to 0.036 (Fig. 1). FST can range between 0 and 1 according to the degree of population differentiation, with 0 representing populations that are indistinguishable.
Table 1

The Fusarium graminearum field populations analyzed in this experiment

No.Country/LocationPopulationNo. isolatesPlace of collectionYear of collectionLatitudeLongitudeAvg. rainAvg. tempAvg night temp
1South GermanyHOH127Hohenheim2008N 48 42' 50''E 9 12' 58''68.2 mm23.3 °C14.2 °C
2HOH27Hohenheim2008N 48 42' 50''E 9 12' 58''68.2 mm23.3 °C14.2 °C
3PLN17Plieningen2008N 48 42' 2''E 9 12' 54''68.2 mm23.3 °C14.2 °C
4BIR22Birkach2008N 48 43' 19''E 9 12' 30''68.2 mm23.3 °C14.2 °C
5TUB23Tübingen2008N 48 31' 22''E 9 3' 7''63.4 mm22.8 °C13.1 °C
6NUF10Nufringen2008N 48 37' 0''E 8 52' 59''63.4 mm22.8 °C13.1 °C
7ENT11Entringen2008N 48 33' 14''E 8 58' 22''63.4 mm22.8 °C13.1 °C
8HER6Herrenberg2008N 48 35' 46''E 8 52' 12''63.4 mm22.8 °C13.1 °C
9BOL18Bohlingen2008N 48 16' 59''E 8 50' 59''153.5 mm23.7 °C14.2 °C
10KEL6Kehl2008N 48 34' 59''E 7 49' 0''-24.1 °C-
11North GermanySCHICK21Schickelisheim2007N 52 15' 16''E 10 51' 54''110.7 mm21.8 °C11.6 °C
12WET124Wetze2006N 51 44' 27''E 9 54' 34''64.5 mm21.0 °C11.2 °C
13WET221Wetze2009N 51 44' 27''E 9 54' 34''48.9 mm24.1 °C13.1 °C
Fig. 1

Differentiation of Fg ss populations. Phylogenetic tree and heat map based on the pairwise fixation index between 13 field populations. Red indicates pairwise population comparisons that are more similar and green indicates populations that are genetically more distant

The Fusarium graminearum field populations analyzed in this experiment Differentiation of Fg ss populations. Phylogenetic tree and heat map based on the pairwise fixation index between 13 field populations. Red indicates pairwise population comparisons that are more similar and green indicates populations that are genetically more distant

Genetic diversity and population structure

Genetic dissimilarity between isolates based on modified Rogers’ distance was used to conduct a principle coordinate analysis (PCoA). Though the first 8 coordinates were consistent with a continuous distribution, the second and third coordinates explained the highest overall amount of genetic variation (34 %, and 39 % respectively, Fig. 2). A small group of 11 isolates coming from different field populations was visible to the right side of the main cluster relative to coord. 2 (Fig. 2). An additional group of seven isolates was located at an intermediate distance between the other two groups relative to coord. 2. The Structure analyses indicated a maximum of three subdivisions occurring among all isolates (Additional file 1: Figure S1) with no geographical pattern evident.
Fig. 2

Principal coordinate analysis. The two dimensions shown explain 74 % of the genetic diversity found among the 213 isolates included in the analysis. Different colors and shapes identify the field population associated with each isolate

Principal coordinate analysis. The two dimensions shown explain 74 % of the genetic diversity found among the 213 isolates included in the analysis. Different colors and shapes identify the field population associated with each isolate

Pattern of linkage disequilibrium

Analyses of linkage disequilibrium (LD) using a sliding window of 50 kbp revealed different patterns of LD on different chromosomes (Fig. 3). The correlation between LD and physical distance was very low in an analysis that considered all isolates as a single combined population (r = −0.028), whereas it ranged from −0.41 to 0.11 when analyzing each field population separately (Table 2). The locally fitted regression showed a low average LD (i.e., r < 0.10) across the four chromosomes. Linked markers showing r values > 0.5 dropped rapidly to r < 0.2 within a physical distance ranging from 600 to 1000 bp on different chromosomes (Fig. 3). The slowest LD decay was detected on chromosome 4.
Fig. 3

Linkage disequilibrium (LD). Pairwise LD in a sliding window of 50 kbp along each chromosome in the combined population of 213 Fusarium graminearum isolates sampled from 13 fields. Each point represents the LD (r ) between two SNPs among the 1129 SNPs used in the analysis. The red line is the locally fitted regression of LD over physical distance

Table 2

Recombination and selection parameters measured in each Fg ss field population organized according to chromosome

PopulationChr.Sa Tajima’DCorr. (r 2, d)b 2Ner (10−5)c IA d P value (IA)rd e P value (rd)Bonferroni adj.
Hoh11264−1.350.000.831.610.000.030.001***f
Hoh21146−1.02−0.020.761.810.010.050.007
PLN1197−1.31−0.015.100.300.130.010.129
BIR1190−1.460.012.030.570.030.010.026
TUB156−1.130.001.051.300.000.020.003
NUF1199−1.060.011.171.470.000.040.003
ENT1161−1.11−0.012.341.140.020.030.017
HER1152−0.74−0.011.313.000.000.080.001***
BOL1230−1.300.000.461.960.000.040.001***
KEL1109−0.88−0.025.29−0.240.66−0.010.663
SCHICK1130−1.390.111.461.730.000.030.001***
WET11135−1.390.000.711.600.000.030.001***
WET21199−1.220.017.280.500.030.010.032
Hoh12148−1.290.010.681.150.000.030.001***
Hoh2288−1.120.020.571.160.030.060.028
PLN2101−1.340.011.45−0.460.97−0.020.966
BIR295−1.10−0.012.42−0.140.70−0.010.703
TUB237−1.16−0.011.150.810.020.020.017
NUF2105−1.08−0.040.722.330.000.110.001***
ENT282−1.19−0.022.831.730.000.080.002
HER273−0.960.012.410.670.150.040.15
BOL2129−1.320.001.011.070.000.040.003
KEL243−1.050.051.59−0.170.63−0.030.634
SCHICK254−1.38−0.111.391.020.010.030.007
WET1281−1.430.000.950.370.100.010.101
WET22105−1.22−0.021.170.140.270.010.266
Hoh13166−1.45−0.020.781.240.000.030.001***
Hoh2380−0.98−0.030.292.970.000.140.001***
PLN3113−1.13−0.021.600.290.140.010.139
BIR3113−1.47−0.021.72−0.170.78−0.010.781
TUB349−1.210.012.490.130.500.000.501
NUF3139−1.060.001.100.740.050.020.045
ENT3103−1.310.000.990.910.020.050.024
HER382−0.86−0.021.191.200.040.060.042
BOL3160−1.340.000.941.810.000.050.001***
KEL365−1.010.013.431.710.020.140.015
SCHICK385−1.310.001.870.810.020.020.022
WET1392−1.47−0.011.231.710.000.040.001***
WET23118−1.25−0.013.860.330.090.010.089
Hoh14227−1.09−0.091.601.800.000.050.001***
Hoh24126−1.05−0.112.413.190.000.250.001***
PLN4164−0.87−0.121.290.870.020.040.015
BIR4189−0.86−0.092.820.330.100.010.095
TUB495−0.18−0.082.370.810.020.020.015
NUF4184−0.74−0.072.500.710.060.020.059
ENT4143−0.50−0.112.680.070.380.000.379
HER4124−0.41−0.142.171.100.030.070.026
BOL4191−0.75−0.011.610.790.010.030.013
KEL495−0.05−0.112.78−0.340.79−0.060.794
SCHICK4106−0.490.101.131.240.000.030.001***
WET14153−0.69−0.072.810.380.110.010.112
WET24171−0.98−0.083.550.420.070.020.066

aNumber of polymorphic (segregating) sites

bCorrelation between LD (r ) and physical distance (d)

cChromosome-wide population recombination rate (crossovers/bp/generation)

dIndex of association

eAdjusted value of IA

fSignificant after Bonferroni correction at P (IA) < 0.001

Linkage disequilibrium (LD). Pairwise LD in a sliding window of 50 kbp along each chromosome in the combined population of 213 Fusarium graminearum isolates sampled from 13 fields. Each point represents the LD (r ) between two SNPs among the 1129 SNPs used in the analysis. The red line is the locally fitted regression of LD over physical distance Recombination and selection parameters measured in each Fg ss field population organized according to chromosome aNumber of polymorphic (segregating) sites bCorrelation between LD (r ) and physical distance (d) cChromosome-wide population recombination rate (crossovers/bp/generation) dIndex of association eAdjusted value of IA fSignificant after Bonferroni correction at P (IA) < 0.001

Recombination rates

By analyzing the index of association (IA) and its standardized form (rd) for each population and each chromosome we found that all populations showed very low values of disequilibrium, on average, across all chromosomes. However, some field populations showed varying values of IA, including some with significant P-values, on different chromosomes within the same population. These differences may reflect variation in the degree of sexual recombination experienced by different populations and/or variation in frequencies of recombination among chromosomes (Additional file 2: Figure S2). Some populations (e.g., HOH1) exhibit less recombination on some chromosomes even after Bonferroni adjustment (Table 2). The high density of genetic markers allowed us to detect the recombination events found on each chromosome in each field population (Table 2). The chromosome-wide recombination rate (2Ner) ranged from 0.3 × 10−5 to 7.3 × 10−5 crossovers per bp per generation (HOH2: Chr3, WET2: Chr1, respectively), but in the combined population it was 0.4 × 10−5 crossovers/bp/generation over the four chromosomes (genome-wide recombination rate). By analyzing SNP markers on a finer scale, hotspots of recombination were revealed across the genome for the combined population (Fig. 4). The recombination hotspots displayed a wide variation in recombination frequency, ranging from 2.9 × 10−11 to 22.9 × 10−2 crossovers/bp/generation across the genome. The number of base pairs found in a hotspot ranged from 2 to 640 bp for the combined population. The population recombination parameter (2Ner/chromosome/generation) was highest on Chr1 followed by Chr3, Chr4 and was lowest on Chr2 (Fig. 5). Different populations showed different distributions of recombination hotspots, though several hotspots were shared among some populations (data not shown). Significant enrichment for some domains was found within the recombination hotspots (Table 3).
Fig. 4

Pooler plot of genomic parameters. Inner circle shows distribution of most extreme recombination hotspots (in orange scaled from 0 to 100 crossovers) across the genome of 213 isolates sampled from 13 fields. Tajima’s D values (in red scaled from 2 to −2) were calculated and shown for all recombination hotspots. The outer circle shows the distribution of 1129 SNPs in blue. Chromosomes are shown in different colors with a scale of 1 Mbp

Fig. 5

Chromosome-based measure of recombination (2Ner) in each field population. Chromosomes showed different degrees of recombination independent of differences in sample size for each population

Table 3

Enrichment tests for gene ontology categories found within recombination hotspots

GO-IDMolecular functionCategory P-valueGenes within hotspotsa Genes outside hotspotsb Enrichment
GO:0005773VacuoleC6.92E-0410177OVER
GO:0009423Chorismate biosynthetic processP1.44E-0322OVER
GO:0016020Membrane transportC1.57E-03441804OVER
GO:0005215Transporter activityF1.57E-0320602OVER
GO:0005507Copper ion bindingF2.57E-03433OVER
GO:0043231Intracellular membrane-bounded organelleC3.11E-03693322OVER
GO:0016049Cell growthP3.12E-03435OVER
GO:0043227Membrane-bounded organelleC3.17E-03693324OVER
GO:0005774Vacuolar membraneC3.44E-037118OVER
GO:0000324Fungal-type vacuoleC3.48E-038151OVER
GO:0000323Lytic vacuoleC3.48E-038151OVER
GO:0000322Storage vacuoleC3.48E-038151OVER
GO:0072423Response to DNA damage checkpoint signalingP3.52E-0324OVER
GO:0072402Response to DNA integrity checkpoint signalingP3.52E-0324OVER
GO:0046417Chorismate metabolic processP3.52E-0324OVER
GO:0006378mRNA polyadenylationP3.55E-03317OVER
GO:0031224Intrinsic component of membraneC3.72E-03311189OVER
GO:0044765Single-organism transportP3.79E-03331294OVER
GO:0006810Transport activityP3.98E-03361456OVER
GO:0016776Phosphotransferase activity, phosphate group as acceptorF4.09E-03318OVER
GO:0009987Cellular processP4.11E-03924827OVER
GO:0044437Vacuolar partC4.27E-037123OVER
GO:0022804Active transmembrane transporter activityF4.35E-038157OVER
GO:1902578Single-organism localizationP4.45E-03341360OVER
GO:0031501Mannosyltransferase complexC4.88E-0325OVER
GO:0051234Establishment of localizationP5.26E-03361481OVER
GO:0043631RNA polyadenylationP5.32E-03320OVER
GO:0040007GrowthP5.99E-03569OVER
GO:1902626Assembly of large subunit precursor of preribosomeP6.44E-0326OVER

aNumber of genes with the corresponding GO function located within recombination hotspots. bNumber of domains found in genes with the corresponding GO function, located outside of recombination hotspots

Pooler plot of genomic parameters. Inner circle shows distribution of most extreme recombination hotspots (in orange scaled from 0 to 100 crossovers) across the genome of 213 isolates sampled from 13 fields. Tajima’s D values (in red scaled from 2 to −2) were calculated and shown for all recombination hotspots. The outer circle shows the distribution of 1129 SNPs in blue. Chromosomes are shown in different colors with a scale of 1 Mbp Chromosome-based measure of recombination (2Ner) in each field population. Chromosomes showed different degrees of recombination independent of differences in sample size for each population Enrichment tests for gene ontology categories found within recombination hotspots aNumber of genes with the corresponding GO function located within recombination hotspots. bNumber of domains found in genes with the corresponding GO function, located outside of recombination hotspots

Neutral or non-neutral mutations

We used Tajima’s D test to determine the type and strength of selection operating in genomic regions that were in recombination hotspots. Measures of selection were based on departures of SNP allele frequencies from neutral expectations [6, 21]. A small number of regions had small positive values (Fig. 4). The chromosome-wide Tajima’s D values ranged from −1.47 (BIR: Chr3) to −0.05 (KEL: Chr4). The selection hotspots in the combined population had D values ranging from −1.4 to 1.2. High negative values are consistent with selective sweeps while high positive values are consistent with balancing selection.

Discussion

Population differentiation

We used RADseq to conduct the first population genomics study in Fg ss. Though geographical distances among the 13 field populations ranged from ~10 to 540 km, the overall population differentiation in Germany was very low (FST = 0.003), consistent with high gene flow across Germany and indicating that the individual field populations are part of a single metapopulation. Zeller et al. [13] reported a similar genetic structure when comparing 253 isolates from two field populations in Kansas and one in Minnesota separated by ~70–900 km using 94 AFLP markers. These findings stand in contrast to another study that included over 712 isolates of Fg ss sampled over a spatial scale of ~330–1670 km, that found significant divergence among populations in the Upper Midwest of the USA for 10 polymorphic RFLP probes [22]. Significant population subdivision (FST = ~0.02–0.44) was also found for chemotypes in the North American populations based on nine variable number of tandem repeat (VNTR) markers using 130 isolates [23]. In an earlier study using the same German isolates, we reported much higher FST values (i.e., FST = 0.20) [2, 16]. We believe that the different outcomes of the two studies reflect differences in the types and numbers of markers used [4, 24] and their distribution across the genome. The earlier study used 19 SSR markers that were unevenly distributed across the genome. The SSR alleles were separated using 3 % agarose gels, a method that is prone to scoring error due to imprecise binning of alleles. High FST values (0.09–0.76) were also reported by Gale et al. [6, 19] in a population analysis that included 534 North American isolates using 10 PCR-RFLP markers.

Population structure

Principle coordinate analysis (PCoA) was shown to be a suitable tool to monitor structure in human populations [1, 6, 25]. Our PCoA revealed a large cluster containing isolates from all 13 populations with respect to principle coordinates 2 and 3 (Fig. 2). The PCoA also defined two smaller clusters; one consisting of 11 and the other containing 7 isolates. Statistical analysis of population structure identified a maximum of three possible subdivisions (Additional file 1: Figure S1). Using a Bayesian model of clustering, a specialized nivalenol-producing population was identified in southern Louisiana [3, 9, 10, 19]. The groups we identified by PCoA contained isolates originating from different field populations and contained different chemotypes. Other studies of Fg ss populations in Europe identified three different chemotypes without any related clustering [12, 16, 26, 27]. We speculate that the small clusters of isolates identified with the PCoA may represent the emergence of new sub-populations that are associated with higher aggressiveness or production of novel chemotypes. A novel chemotype called NX-2 was reported in 2.8 % of 463 F. graminearum isolates analyzed in the USA, suggesting the presence of a distinct subpopulation [1, 7, 28]. We performed PCR-RFLP (according to [3, 28]) on 40 isolates representing all three PCoA clusters to determine if they carried new chemotypes, but found only wild-type alleles.

Analyses of linkage disequilibrium reveal high recombination

The pattern of LD across a genome reflects a population’s history, including the effects of genetic drift, selection, recent population admixture and frequency of recombination [5, 8, 29]. The correlation between physical distance and linkage disequilibrium was low in sliding windows of 50 kbp (Fig. 4, Table 2). A rapid decay in LD to less than r = 0.2 was found within a distance of ~600–1000 bp for different chromosomes. A similar rapid decay of LD was reported across a 10 bp scale on autosomes and across a 30 bp scale on the X chromosome of Drosophila melanogaster [7, 11, 12] and within ~ 600 bp in wild nematode populations [8, 30]. In some human populations, the LD decays within a few kbp [6, 11]. The rapid decay of LD we observed in field populations of F. graminearum is consistent with a high degree of recombination in these populations. Similar findings of very low LD and frequent recombination were reported [9, 13–17, 31] for the close relative F. culmorum using multilocus sequence typing of 111 isolates from Australia, West Africa/North Asia, and Europe (14 countries). FST was 0.002 among these populations, similar to our results. In our dataset, some SNPs on chromosome 3 maintained moderate LD (r = 0.50–0.30) despite being separated by long physical distances (25 kbp, on average). This may result from epistatic selection acting on the sites in disequilibrium [4, 13, 14, 16, 18–20, 32] or could result from inversions [6, 21, 33].

Evidence of sexual recombination

The importance of sexual recombination is cryptic for many homothallic fungi [13, 34], yet understanding a pathogen’s reproductive system can affect disease management strategies. The index of association (a multilocus measure of LD) can provide insight into the relative contributions of sexual and asexual reproduction in a population [22, 35]. A mixed reproductive system was hypothesized in Fg populations from Southern Louisiana and the Gulf Coast based on analyzing 10 PCR-RFLP loci in 534 isolates [19, 23]. Our results indicate varying degrees of recombination among the German field populations (Table 2) and also among chromosomes within individual field populations (Fig. 5). The combined population had a very small IA value of 0.01, but this value was significant at P = 0.001. A similar value of multilocus linkage disequilibrium (IA = 0.02) was reported in Chinese populations (169 isolates from 15 provinces) of Fg ss using 12 VNTR markers [36]. The individual German field populations showed an overall pattern consistent with recurring cycles of sexual recombination, with some populations at linkage equilibrium across all chromosomes (e.g., TUB), but others had significant disequilibrium on one or two chromosomes (e.g., SCHICK). Though the overall value of rd was low (0.03), HOH1 exhibited significant disequilibrium across all chromosomes even after applying a Bonferroni correction. We postulate that this reflects recent population admixture. HOH1 was sampled from naturally infected wheat within an experimental research station operated by University of Hohenheim. The naturally infected field included in our analyses was located near an artificially infected field that was inoculated with strains from across Germany, hence we expect that some ascospore movement among fields would introduce inoculated strains into our sampled field, generating a significant degree of population admixture. Overall, we believe that most of the observed disequilibrium in the combined population as well as the individual field populations likely reflects admixture rather than a significant degree of asexual reproduction within the German field populations.

Hot spots of recombination and selection

Different degrees of recombination (2Ner) were measured among chromosomes in different field populations (Fig. 5). These findings are in agreement with previous work of Gale et al. [37] that reported differences in chromosome-wide recombination rates in different genomic regions based on an analysis of 111 Fg ss progeny using 235 genetic markers. An analysis of 10,495 SNPs between the two reference isolates of Fg ss (PH1: NRRL31084 and GZ23639: NRRL29169) also identified recombination hotspots [38]. The earlier analyses also found the highest recombination rate on chromosome 1, in agreement with our findings (Fig. 5), but our analyses provided much finer resolution of hotspots compared to the earlier studies by combining a large number of wild-type field strains with a large number of markers. We identified ~240 recombination hotspots characterized by 2.9 × 10−11 to 22.9 × 10−2 crossovers/bp/generation. Tsai et al. [39] reported the persistence over evolutionary time of hotspots in yeast by comparing the hotspot positions in Saccharomyces cerevisiae and S. paradoxus, finding shared recombination hotspots with 9–45 × 10−4 crossovers/bp. We detected several shared recombination hotspots among the individual field populations when they were analyzed individually (data not shown). Analyses of nucleotide diversity can be used to infer processes affecting population evolution, based on deviations from the null hypothesis of neutral variation in an isolated population of constant size. We tested for departures from neutrality using Tajima’s D test (Fig. 4). Generally, positive values of Tajima’s D are interpreted to indicate balancing selection and/or decreasing population size, values near zero indicate neutrality, and negative values indicate an excess of rare alleles resulting from a selective sweep combined with recent population expansion or purifying selection [8, 21]. Significance thresholds for Tajima’s D are usually chosen as values greater than +2 or lower than −2. None of our D estimates reached this significance threshold, but this does not mean that selection is not operating in the genomic regions covered by our RADseq dataset. In a recent RADseq study on three malaria vectors (Anopheles gambiae, A. arabiensis, and A. merus), Tajima’s D values ranged between −0.58 and 0.38 and a t-test was used to show that the region with the highest negative value had been affected by a selective sweep [33]. Genome regions with high negative D values may be interesting candidates for resequencing studies to determine if genes in these regions have experienced recent selective sweeps as a result of local adaptation, for example as a response to fungicide applications, or changes in cultivars or crop rotations [40]. Recombination hotspots can develop in important genomic regions carrying favorable mutations [41]. The recombination hotspots identified in our analysis were enriched (P < 0.01) for genes encoding DNA repair and membrane transport (Table 3).

Conclusions

We found that field populations of Fusarium graminearum in Germany belong to a single, highly recombined meta-population. We identified ~240 recombination hotspots in this German meta-population. The high recombination rate observed across the genome coupled with the high gene flow observed among populations may enable Fg ss to adapt to a broad array of environmental stresses, including climate warming, deployment of resistant cultivars and fungicide applications. This study provides a baseline measure of recombination in natural field populations and a first indication of genomic regions that may be under selection. The low LD found over short physical distance coupled with a lack of population structure among these field populations of Fg ss optimizes all the parameters needed to perform a genome wide association study using these isolates.

Methods

Fungal collections

Thirteen field populations composed of an average of 16 isolates were sampled from naturally infected wheat fields in major wheat growing regions of Germany. All 213 single spore isolates were identified as Fg ss based on morphology [16] and using species-specific genetic markers [42].

RAD library preparation

DNA was extracted using DNeasy Plant Mini Kits (QIAGEN) and quantified using a Qubit 2.0 fluorimeter (Invitrogen). DNA concentrations were adjusted to ~50 ng/μl and digested with PstI. We adapted the RADseq protocol developed by Baird et al. [3] with minor modifications. Fungal DNA was barcoded using 21 P1 adapters (5’) and 6 P2 adapters (3’). Twenty-four libraries resulted after barcoding, with each library containing 21 isolates with different P1 barcodes. Twelve libraries, each with different P2 barcodes, were pooled into one lane and fungal DNA was sequenced using 100 bp paired-end reads on an Illumina HiSeq2000.

Short read alignment on the reference genome and calling SNPs

The raw data were quality checked using the FASTX toolkit 0.13. Sequences were quality trimmed from the 3’ end using Trimmomatic 0.32 [43], omitting sequences where the average quality scores in sliding windows of 5 bp dropped below 20. All sequences shorter than 30 bp in length were also omitted. Alignments were performed using Bowtie2-2.2.0 [44] to generate bam files. Alignments were sorted and indexed using samtools-0.1.19 [45]. Alignments were recoded with the isolate names according to the P1 adapter. A reference genome dictionary was created using Picard-tools-1.111 based on the chromosomal sequence of F. graminearum PH-1 (FG3, Broad Institute). SNPs were called using the Genome Analysis Tool Kit (GATK)-3.1-1 [46] from bam files and stored as a vcf file after setting the genome dictionary of ploidy to haploid. VCFtools v.3.5 [47] was used to discard all SNPs with a quality less than 600 while keeping a minimum depth of 20. The minimum allele frequency was set to 0.07 and the maximum allele frequency to 0.92. An additional filter was applied to keep the minimum missing data at 4 individuals per locus. After all filtering, there were 378, 193, 245, and 313 SNPs on chromosomes 1, 2, 3, and 4 respectively.

Analyses of molecular variance and population structure

All isolates were grouped into field populations according to their source. GenAlEx 6.5 [48] was used to perform an AMOVA analysis across the 1129 SNP loci with 1000 permutations. The principle coordinate analysis (PCoA) included 213 isolates from 13 field populations. The matrix of pairwise population differentiation was visualized using the R package heatmap3. The All Admixture model was implemented in Structure [49] with 5000 burned-in iterations and 2000 Markov Chain Monte Carlo (MCMC) iterations. The proposed number of populations was set to 13 with 100 permutations each. The maximum number of subpopulations was predicted using the formula [50]: where K = the proposed number of populations, L = average value of LnP (D) for the 100 permutations of Kth, and s = average of corresponding permutations. After taking into account the physical position of each SNP, the pairwise linkage disequilibrium was calculated using VCFtools v.3 [47] using a sliding window of 50 kbp over each chromosome. Sliding windows of 50 kbp were chosen based on results of genome analyses of two wild populations of Saccharomyces that showed a pairwise LD decay over 25–50 kbp [51]. Multilocus LD was calculated using 1000 permutations with the R package Poppr 1.0.5 [52] based on a subset of 87, 50, 57, and 55 SNP loci on chromosomes 1, 2, 3, and 4 respectively. These 249 SNP loci were separated by at least 50 kbp to decrease the contribution of linkage to disequilibrium among the loci.

Population genomic features

The recombination rate in the German population was analyzed using LDhat 2.2 [53] in two ways: (i) by considering all Fg isolates as a single large population; (ii) by considering each field population separately. These analyses took into account the physical position of each SNP. The correlation between LD and physical distance was calculated using 1000 permutations. Recombination hotspots were detected using Fearnhead’s method [54], which divides each genomic region into 6–15 SNP-based sub-regions that take into account the physical distance between SNPs. A composite likelihood function was then applied to identify the hotspots. Departure from the null hypothesis of no rate variation was then tested using a standard coalescent model. A hotspot is called in a sub-region if: (i) there is at least a five-fold increase in the local recombination rate and (ii) the statistical test is significant at P < 0.001. Estimates of recombination rate were based on a Bayesian reversible jump MCMC model [53]. Tajima’s D was calculated based on pairwise nucleotide differences (π) and the number of segregating sites (S) [55]. Visualization of SNP distributions, population recombination rates, Tajima’s D value and the positions of important genes on each chromosome were performed using Circos-0.66 [56]. Enrichment analyses of GO categories using Fisher exact tests were performed using Blast2go v.3.1.2 based on the annotations of the reference isolate PH-1, and categorizing the molecular function of each domain of the genes in the hotspots relative to the whole genome. Since most of the identified hotspots are fine-scaled (i.e., located in a single gene), we chose the genes where the hotspots were located for this analysis. We applied a cutoff of P < 0.01 to generate the results shown in Table 3.

Ethics statement

No human or animal subjects or genetically modified organisms were included in this study.

Availability of supporting data

A VCF file containing the nucleotide variation among the used isolates and basic information about the isolates is available at the European Nucleotide Archive under the accession number PRJEB11357. Using the data require citation.
  46 in total

1.  Population structure of and mycotoxin production by Fusarium graminearum from maize in South Korea.

Authors:  Jungkwan Lee; Hun Kim; Jae-Jin Jeon; Hye-Seon Kim; Kurt A Zeller; Laurel L A Carter; John F Leslie; Yin-Won Lee
Journal:  Appl Environ Microbiol       Date:  2012-01-27       Impact factor: 4.792

2.  Genomic regions exhibiting positive selection identified from dense genotype data.

Authors:  Christopher S Carlson; Daryl J Thomas; Michael A Eberle; Johanna E Swanson; Robert J Livingston; Mark J Rieder; Deborah A Nickerson
Journal:  Genome Res       Date:  2005-11       Impact factor: 9.043

3.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

4.  Temporal patterns of ascospore release in Leptosphaeria maculans vary depending on geographic region and time of observation.

Authors:  David Savage; Martin J Barbetti; William J MacLeod; Moin U Salam; Michael Renton
Journal:  Microb Ecol       Date:  2012-12-28       Impact factor: 4.552

5.  Fusarium culmorum is a single phylogenetic species based on multilocus sequence analysis.

Authors:  Friday Obanor; G Erginbas-Orakci; B Tunali; J M Nicol; S Chakraborty
Journal:  Fungal Biol       Date:  2010-07-15

6.  Population genomics of the wild yeast Saccharomyces paradoxus: Quantifying the life cycle.

Authors:  Isheng J Tsai; Douda Bensasson; Austin Burt; Vassiliki Koufopanou
Journal:  Proc Natl Acad Sci U S A       Date:  2008-03-14       Impact factor: 11.205

7.  A novel Asian clade within the Fusarium graminearum species complex includes a newly discovered cereal head blight pathogen from the Russian Far East.

Authors:  Tapani Yli-Mattila; Tatiana Gagkaeva; Todd J Ward; Takayuki Aoki; H Corby Kistler; Kerry O'Donnell
Journal:  Mycologia       Date:  2009 Nov-Dec       Impact factor: 2.696

8.  An adaptive evolutionary shift in Fusarium head blight pathogen populations is driving the rapid spread of more toxigenic Fusarium graminearum in North America.

Authors:  Todd J Ward; Randall M Clear; Alejandro P Rooney; Kerry O'Donnell; Don Gaba; Susan Patrick; David E Starkey; Jeannie Gilbert; David M Geiser; Tom W Nowicki
Journal:  Fungal Genet Biol       Date:  2007-10-16       Impact factor: 3.495

Review 9.  Linkage disequilibrium--understanding the evolutionary past and mapping the medical future.

Authors:  Montgomery Slatkin
Journal:  Nat Rev Genet       Date:  2008-06       Impact factor: 53.242

10.  Genomic analyses of three malaria vectors reveals extensive shared polymorphism but contrasting population histories.

Authors:  Samantha M O'Loughlin; Stephen Magesa; Charles Mbogo; Franklin Mosha; Janet Midega; Susan Lomas; Austin Burt
Journal:  Mol Biol Evol       Date:  2014-01-09       Impact factor: 16.240

View more
  17 in total

Review 1.  Evolution and genome architecture in fungal plant pathogens.

Authors:  Mareike Möller; Eva H Stukenbrock
Journal:  Nat Rev Microbiol       Date:  2017-08-07       Impact factor: 60.633

Review 2.  Fungal Sex: The Basidiomycota.

Authors:  Marco A Coelho; Guus Bakkeren; Sheng Sun; Michael E Hood; Tatiana Giraud
Journal:  Microbiol Spectr       Date:  2017-06

3.  Temporal dynamics, population characterization and mycotoxins accumulation of Fusarium graminearum in Eastern China.

Authors:  Jian-Bo Qiu; Jing-Tao Sun; Ming-Zheng Yu; Jian-Hong Xu; Jian-Rong Shi
Journal:  Sci Rep       Date:  2016-11-17       Impact factor: 4.379

Review 4.  Using Population and Comparative Genomics to Understand the Genetic Basis of Effector-Driven Fungal Pathogen Evolution.

Authors:  Clémence Plissonneau; Juliana Benevenuto; Norfarhan Mohd-Assaad; Simone Fouché; Fanny E Hartmann; Daniel Croll
Journal:  Front Plant Sci       Date:  2017-02-03       Impact factor: 5.753

5.  Candidate gene based association mapping in Fusarium culmorum for field quantitative pathogenicity and mycotoxin production in wheat.

Authors:  Valheria Castiblanco; Jose J Marulanda; Tobias Würschum; Thomas Miedaner
Journal:  BMC Genet       Date:  2017-05-19       Impact factor: 2.797

6.  Characterization of the Two-Speed Subgenomes of Fusarium graminearum Reveals the Fast-Speed Subgenome Specialized for Adaption and Infection.

Authors:  Qinhu Wang; Cong Jiang; Chenfang Wang; Changjun Chen; Jin-Rong Xu; Huiquan Liu
Journal:  Front Plant Sci       Date:  2017-02-14       Impact factor: 5.753

7.  A European Database of Fusarium graminearum and F. culmorum Trichothecene Genotypes.

Authors:  Matias Pasquali; Marco Beyer; Antonio Logrieco; Kris Audenaert; Virgilio Balmas; Ryan Basler; Anne-Laure Boutigny; Jana Chrpová; Elżbieta Czembor; Tatiana Gagkaeva; María T González-Jaén; Ingerd S Hofgaard; Nagehan D Köycü; Lucien Hoffmann; Jelena Lević; Patricia Marin; Thomas Miedaner; Quirico Migheli; Antonio Moretti; Marina E H Müller; Françoise Munaut; Päivi Parikka; Marine Pallez-Barthel; Jonathan Piec; Jonathan Scauflaire; Barbara Scherm; Slavica Stanković; Ulf Thrane; Silvio Uhlig; Adriaan Vanheule; Tapani Yli-Mattila; Susanne Vogelgsang
Journal:  Front Microbiol       Date:  2016-04-06       Impact factor: 5.640

8.  Whole genome sequencing and comparative genomics of closely related Fusarium Head Blight fungi: Fusarium graminearum, F. meridionale and F. asiaticum.

Authors:  Sean Walkowiak; Owen Rowland; Nicolas Rodrigue; Rajagopal Subramaniam
Journal:  BMC Genomics       Date:  2016-12-09       Impact factor: 3.969

9.  A population genomics approach shows widespread geographical distribution of cryptic genomic forms of the symbiotic fungus Rhizophagus irregularis.

Authors:  Romain Savary; Frédéric G Masclaux; Tania Wyss; Germain Droh; Joaquim Cruz Corella; Ana Paula Machado; Joseph B Morton; Ian R Sanders
Journal:  ISME J       Date:  2017-10-13       Impact factor: 10.302

10.  Population genomics of Fusarium graminearum reveals signatures of divergent evolution within a major cereal pathogen.

Authors:  Amy C Kelly; Todd J Ward
Journal:  PLoS One       Date:  2018-03-27       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.