| Literature DB >> 29584736 |
Abstract
The cereal pathogen Fusarium graminearum is the primary cause of Fusarium head blight (FHB) and a significant threat to food safety and crop production. To elucidate population structure and identify genomic targets of selection within major FHB pathogen populations in North America we sequenced the genomes of 60 diverse F. graminearum isolates. We also assembled the first pan-genome for F. graminearum to clarify population-level differences in gene content potentially contributing to pathogen diversity. Bayesian and phylogenomic analyses revealed genetic structure associated with isolates that produce the novel NX-2 mycotoxin, suggesting a North American population that has remained genetically distinct from other endemic and introduced cereal-infecting populations. Genome scans uncovered distinct signatures of selection within populations, focused in high diversity, frequently recombining regions. These patterns suggested selection for genomic divergence at the trichothecene toxin gene cluster and thirteen additional regions containing genes potentially involved in pathogen specialization. Gene content differences further distinguished populations, in that 121 genes showed population-specific patterns of conservation. Genes that differentiated populations had predicted functions related to pathogenesis, secondary metabolism and antagonistic interactions, though a subset had unique roles in temperature and light sensitivity. Our results indicated that F. graminearum populations are distinguished by dozens of genes with signatures of selection and an array of dispensable accessory genes, suggesting that FHB pathogen populations may be equipped with different traits to exploit the agroecosystem. These findings provide insights into the evolutionary processes and genomic features contributing to population divergence in plant pathogens, and highlight candidate genes for future functional studies of pathogen specialization across evolutionarily and ecologically diverse fungi.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29584736 PMCID: PMC5870968 DOI: 10.1371/journal.pone.0194616
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Evolutionary history and population structure of North American F. graminearum.
A phylogenetic network (A) and maximum likelihood phylogeny (B) was inferred from SNPs identified by reference-based mapping of whole genome sequences to the PH-1 F. graminearum genome (Ensembl, version RR). Taxa comprised 20 NA1/15ADON isolates, 20 NA2/3ADON isolates, 20 NX-2 isolates, and for the maximum likelihood analyses, representatives from three FGSC species were included: F. boothii (N = 5), F. gerlachii (N = 3) and F. louisianense (N = 2). Different colors indicate clustering assignment of each isolate in the three populations inferred from Bayesian analyses: NA2 = red, NA1 = blue and NA3 = green. A. SplitsTree4 [98] was used to construct a network wherein F. graminearum isolates are represented by terminal nodes, and relationships are depicted as branches with parallel edges indicating reticulate events (i.e. recombination, gene transfer). B. In the maximum likelihood phylogeny, all bootstrap values were 100%, except those indicated at branch nodes. The tree was rooted with F. boothii and drawn to scale, with branch lengths measured in the number of substitutions per site. Colored bars indicate Bayesian estimates of ancestry (q) for each isolate in the three populations.
Average genomic diversity within F. graminearum populations.
| Population | θ | Tajima's | |
|---|---|---|---|
| NA1 | 0.0012 | 0.0013 | 0.14 |
| NA2 | 0.0006 | 0.0006 | 0.15 |
| NA3 | 0.001 | 0.001 | -0.34 |
1 nucleotide polymorphism per site based on the number of segregating sites.
a-c The genomic distributions of each sliding-window summary statistic were compared across populations using nonparametric tests. Different letters indicate significant (P < 0.001) differences in the population distributions.
Genomic regions showing genetic signatures of selection in F. graminearum populations.
| Outlier region | Significant window (chromosome: coordinates) | Genes (InterPro domains) | References | |
|---|---|---|---|---|
| Targeted population(s) | ||||
| o1 | 1:7,080,001–7,090,000 | FGRRES_15944 (Zinc finger transcription factor), FGRRES_02168 (aminotransferase), FGRRES_02169 (peptidase), FGRRES_02170 (glycoside hydrolase/ chitinase), FGRRES_02171 (glutathione-dependent formaldehyde-activating enzyme), FGRRES_02172 (α,β hydrolase fold) | NA2 | |
| o2 | 1:10,880,001–10,890,000 | FGRRES_17361 (Major facilitator superfamily), FGRRES_10437, FGRRES_17362 (pleckstrin, CDC24 calponin), FGRRES_13758 (Ankyrin repeat) | NA2, NA3 ( | |
| o3 | 1:11,280,001–11,290,000 | FGRRES_10552, FGRRES_10553, FGRRES_20102, FGRRES_10554 (secreted cysteine-rich effector), FGRRES_10555, FGRRES_10556 | NA2 | [ |
| 1:11,290,001–11,300,000 | FGRRES_17388_M, FGRRES_10557_M, FGRRES_10558 (Heterokaryon incompatibility), FGRRES_10559 | NA2 | ||
| o4 | 1:11,300,001–11,310,000 | FGRRES_10560, FGRRES_10561 (Hce2 effector protein, Glycoside hydrolase/chitinase, LysM), FGRRES_10562 | NA3Fg | [ |
| o5 | 1:11,500,001–11,510,000 | FGRRES_17407 (Urease, Amidohydrolase), FGRRES_17408 (Zinc finger transcription factor), FGRRES_17409 | NA3 Fb, Fg, Fl | |
| o6 | 2:180,001–190,000 | FGRRES_13457 (Zinc finger transcription factor), FGRRES_08017 (citrate lyase domain), FGRRES_13456 (dehydrogenase), FGRRES_13455 (Zinc finger transcription factor), FGRRES_17114 (General substrate transporter) | NA1, NA3 Fb, Fg | |
| 2:190,001–200,000 | FGRRES_17113 (Gluconolactonase), FGRRES_08021, FGRRES_08022 (Peptidase), FGRRES_08023 (Cytochrome P450) | NA1, NA3 Fg | ||
| o7 | 2:200,001–210,000 | FGRRES_08024 (Glucose dehydrogenase), FGRRES_08025 (secreted Cyanovirin-N), FGRRES_08026, FGRRES_13453 | [NA1,NA2], NA3 ( | [ |
| o8 | 2:810,001–820,000 | FGRRES_08248 (hydrolase), FGRRES_17074_M (decarboxylase), FGRRES_08250, FGRRES_08251 (Gluconolactonase) | [NA1,NA2], NA3( | |
| o9 | 2:2,890,001–2,900,000 | FGRRES_08850 (SH3 domain), FGRRES_08851 (methylcitrate dehydratase), FGRRES_16955 (DNA photolyase, Cryptochrome) | NA3 | [ |
| o10 | 2:5,390,001–5,400,000 | NA1Fb, [NA2,NA3] | [ | |
| 2:5,400,001–5,410,000 | NA1Fb, [NA2, NA3 ( | |||
| 2:5,410,001–5,420,000 | NA1Fb, [NA2, NA3] | |||
| o11 | 2:8,520,001–8,530,000 | FGRRES_04599 (Glucose dehydrogenase), FGRRES_04600, FGRRES_15251 (secreted cysteine-rich effector), FGRRES_12220, FGRRES_16086 (Protein kinase-like) | NA3Fb, Fl | [ |
| 2:8,530,001–8,540,000 | FGRRES_12218, FGRRES_12217 (ankyrin repeat), FGRRES_04603 (Alpha carbonic anhydrase), FGRRES_04604 (Cullin), FGRRES_04605 (Glucose dehydrogenase) | [NA1Fg, Fl, NA2 Fg, Fl], NA3 ( | ||
| o12 | 2:8,570,001–8,580,000 | FGRRES_12216, FGRRES_12215 (HeLo, heterokaryon, incompatibility), FGRRES_04620 (α,β hydrolase fold), FGRRES_12214 (secreted cysteine-rich effector), FGRRES_15258 | NA2, NA3 Fb, Fl | [ |
| o13 | 4:4,060,001–4,070,000 | FGRRES_07631 (General substrate transporter), FGRRES_07632 (Major facilitator superfamily), FGRRES_07633, FGRRES_07634 (Sodium symporter) | NA1( | |
| o14 | 4:7,360,001–7,370,000 | NA2 | [ |
1Outliers (o1-o14) showed significantly (P < 0.001, unless otherwise indicated) elevated divergence between populations based on pairwise (interpopulation) values of Tajima’s D, D and F, and significantly reduced diversity (π) within the targeted population(s), based on null distributions generated through random permutation.
2Genes and domains are based on annotations in the PH-1 Fusarium graminearum genome [27] (Ensembl, version RR).
3Populations with the same allele are grouped by brackets, and superscripts indicate the allele was also found in outgroup species: Fg = F. gerlachii, Fl = F. louisianense, Fb = F. boothii.
Fig 2Genomic distribution of outlier regions exhibiting signatures of selection.
Sliding-window values of intrapopulation diversity and recombination (Panel A, Tajima’s D, π and R) and interpopulation differentiation (Panel B, Tajima’s D, D and F) were calculated in 10 kb windows to identify genomic regions with signatures of selection in NA1 (omitting admixed NX-2 isolates), NA2 or NA3 populations of F. graminearum. The fourteen outliers (o1-o14, yellow highlight) showed significant (P < 0.001) divergence between populations based on pairwise (interpopulation) values of Tajima’s D, D and F, coupled with significantly reduced diversity (π) within the divergent population(s). Significance was assessed by comparing observed sliding-window values of each metric against a null genome-wide distribution derived through random permutation.
Fig 3SNP haplotype alignments show divergence of NA1, NA2 and NA3 F. graminearum at outlier regions.
For each alignment, rows correspond to the polymorphisms found in a single isolate and columns show the SNPs found at each variant site. Gene designations and conserved domains are based on the PH-1 F. graminearum genome [27] (Ensembl, version RR). Panels A-B show examples of the genetic patterns observed at outliers. A. chromosome 4: 7,360,000–7,367,500; a single population (NA2, grey arrow) exhibited reduced within-population diversity and high divergence from the other two populations. B. chromosome 2: 5,390,000–5,420,000, TRI cluster genes; diversity was reduced in all three populations and two highly divergent haplotypes (grey arrows: 15ADON haplotype in NA1 and 3ADON haplotype in NA2/NA3) were segregating among the three populations.
Fig 4F. graminearum populations are differentiated by divergent gene sets that are enriched for different functions.
Functional enrichment was assessed based on Gene Ontology (GO) terms assigned to domains annotated in candidate genes identified in NA1 (left, blue bars), NA2 (center, red bars) and NA3 (right, green bars). Candidate genes included genes in outlier regions showing signatures of selection and differentially conserved genes in each population. Hypergeometric tests were performed by calculating Z-scores that compared the relative frequency of GO terms assigned to each candidate gene set to GO terms assigned to all functionally annotated F. graminearum genes. Z-scores and the number of domains in each gene set (N) are only shown for GO terms that were significantly overrepresented after a Benjamini–Hochberg adjustment for multiple testing (adjusted P < 0.01) [134]. InterPro domains associated with each GO term are listed in parentheses below the term description in bold type.
Fig 5Dendrogram of the F. graminearum pan-genome.
Pairwise distances were computed between genomes based on gene presence/absence polymorphism, and the resulting matrix was used to cluster isolates with the neighbor joining method implemented in MEGA 7 [126]. Branch lengths represent the number of genes that differ in presence/absence status between strains, such that genomes with identical gene content would have a distance of 0.
Fig 6Descriptions of genes that were differentially conserved among populations of F. graminearum.
The matrix indicates the presence (solid grey) and absence (white) of genes that were differentially conserved (E-test P-value < 0.01) in NA1, NA2 or NA3. Each column in the matrix represents a single isolate and each row represents a gene ortholog. Population structure based on Bayesian clustering (indicated by red, blue and green highlight) is shown, along with the maximum likelihood phylogeny inferred from SNPs. Admixed isolates (adm, shaded brown) were not included in calculations but are shown to emphasize their mosaic genomes. Functional descriptions of the domains encoded by each gene are based on annotations of the PH-1 F. graminearum genome [27] (Ensembl, version RR) and HMMER, BLAST, BLAST2GO, CD-Search, InterPro and Signal P analyses. Bold symbols (§, ‡, †) indicate accessory proteins with similar domain content and structure that were found in different genomic contexts in each population. Differentially conserved genes with uncharacterized functions were not depicted in the figure (N = 34).