| Literature DB >> 29722810 |
Norfarhan Mohd-Assaad1,2, Bruce A McDonald1, Daniel Croll3.
Abstract
Coevolution between hosts and pathogens generates strong selection pressures to maintain resistance and infectivity, respectively. Genomes of plant pathogens often encode major effect loci for the ability to successfully infect specific host genotypes. Hence, spatial heterogeneity in host genotypes coupled with abiotic factors could lead to locally adapted pathogen populations. However, the genetic basis of local adaptation is poorly understood. Rhynchosporium commune, the pathogen causing barley scald disease, interacts at least partially in a gene-for-gene manner with its host. We analyzed global field populations of 125 R. commune isolates to identify candidate genes for local adaptation. Whole genome sequencing data showed that the pathogen is subdivided into three genetic clusters associated with distinct geographic and climatic regions. Using haplotype-based selection scans applied independently to each genetic cluster, we found strong evidence for selective sweeps throughout the genome. Comparisons of loci under selection among clusters revealed little overlap, suggesting that ecological differences associated with each cluster led to variable selection regimes. The strongest signals of selection were found predominantly in the two clusters composed of isolates from Central Europe and Ethiopia. The strongest selective sweep regions encoded protein functions related to biotic and abiotic stress responses. Selective sweep regions were enriched in genes encoding functions in cellular localization, protein transport activity, and DNA damage responses. In contrast to the prevailing view that a small number of gene-for-gene interactions govern plant pathogen evolution, our analyses suggest that the evolutionary trajectory is largely determined by spatially heterogeneous biotic and abiotic selection pressures.Entities:
Mesh:
Year: 2018 PMID: 29722810 PMCID: PMC5972619 DOI: 10.1093/gbe/evy087
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Population structure analyses of 125 Rhynchosporium commune isolates from a worldwide collection based on genome-wide single nucleotide polymorphisms. (A) Sampling locations of R. commune included in this study. (B) Principal component analysis of genetic differentiation among isolates. The percentage of variance explained by the first two principal components is shown in parentheses. (C) Delta K plot of the STRUCTURE Bayesian clustering analyses for Ks of 2–9. (D) Genetic clustering using STRUCTURE for K = 3 (above) and K = 6 (below). Each colour represents one cluster and the vertical bar represents the proportion of cluster assignment for each genotype. Dotted lines are the minimum threshold value of membership coefficient for cluster assignment (90%).
. 2.—Genome-wide selection scans using the standardized integrated haplotype |iHS| score in three genetic clusters of Rhynchosporium commune. (A) Scaffolds of the reference genome and positions in Mb. (B) Density of single nucleotide polymorphisms (SNPs) across the whole genome shown in 10 kb nonoverlapping windows (gradient shows coverage between 0% and 5%). The total number of single nucleotide polymorphisms (SNPs) with known ancestry was 481,424. (C) Coverage of repetitive elements across the whole genome shown in 10 kb nonoverlapping windows (gradient shows coverage between 0% and 100%). (D–F) Dots show |iHS| values for each SNP within the clusters 1, 2, and 3, respectively. The most significant SNPs (top 0.05% for clusters 1 and 3, and top 0.01% for cluster 2) were highlighted in red. Triangles near the outermost plot indicate the location of loci most significantly associated with resistance to the fungicide cyproconazole (Bonferroni threshold α = 0.05) in a genome-wide association study (Mohd-Assaad et al. 2016).
Genes in Selective Sweep Regions Identified Using Integrated Haplotype Score (|iHS|) Analyses in the Three Genetic Clusters of Rhynchosporium commune
| Cluster | Scaffold | SNP Position | |iHS| | Genes in Sweep Region |
|---|---|---|---|---|
| Cluster 1 | RCO7_scaffold002 | 262395 | 4.57691295 | RCO7_04421 (glucose–methanol–choline (GMC) oxidoreductase) |
| RCO7_04422 (related to DUF 202 domain protein) | ||||
| RCO7_04423 (multi-antimicrobial extrusion (MATE) multidrug transporter) | ||||
| Cluster 1 | RCO7_scaffold004 | 707629 | 4.655198545 | RCO7_06862 (KES1 oxysterol-binding protein involved in ergosterol biosynthesis) |
| Cluster 1 | RCO7_scaffold051 | 125465 | 3.966106351 | RCO7_03673 (uncharacterized protein) |
| RCO7_03674 (acyltransferase GNAT family) | ||||
| RCO7_03675 (glycosyl hydrolase family 47 alpha-mannosidase) | ||||
| Cluster 2 | RCO7_scaffold061 | 469958 | 9.75730438 | RCO7_01309 (class I glutamine amidotransferase (GAT1)-like protein) |
| Cluster 2 | RCO7_scaffold012 | 1864317 | 9.31713174 | RCO7_02708 (uncharacterized protein) |
| RCO7_02709 (UbiA prenyltransferase family) | ||||
| Cluster 2 | RCO7_scaffold020 | 646208 | 8.83413424 | RCO7_14617 (uncharacterized protein) |
| Cluster 3 | RCO7_scaffold042 | 286253 | 7.03521568 | RCO7_07300 (uncharacterized protein) |
| RCO7_07301 (concentrative nucleoside transporter (CNT)) | ||||
| Cluster 3 | RCO7_scaffold040 | 75641 | 6.70227808 | Intergenic region |
| Cluster 3 | RCO7_scaffold013 | 900028 | 5.4617104 | RCO7_00065 (uncharacterized protein with SKP1/BTB/POZ domain) |
| RCO7_00064 (uncharacterized protein) |
Note.—Genes were selected if they were within 5,000 bp from the most significant SNPs (top 0.05% in clusters 1 and 3, and 0.01% in cluster 2).
. 3.—Genome-wide scan of the cross-population extended haplotype homozygosity (XP-EHH) analyses. (A) Scaffolds of the reference genome and positions in Mb. XP-EHH analyses of (B) cluster 2 versus cluster 1, (C) cluster 2 versus cluster 3, (D) cluster 1 versus cluster 3. The positive and negative values in (B)–(D) indicate the direction of selection. Positive values indicate positive selection in the reference population (cluster 2 was used as the reference population in (B) and (C), cluster 1 in (D)). The most significant SNPs (top 0.05% of |XP-EHH|) are highlighted in red, blue, and green for signals of positive selection in cluster 1, 2, and 3, respectively. Triangles near the outermost plot show the most significant SNPs identified by the standardized integrated haplotype |iHS| analyses (red for cluster 1, blue for cluster 2 and green for cluster 3).
Genes Identified in Overlapping Selective Sweep Regions
| Regions | Scaffold | Selection Analyses/Cluster Identities | SNP Position | |iHS|/XPEHH | Candidate Genes under Selection |
|---|---|---|---|---|---|
| 1 | RCO7_scaffold001 | (a) XP-EHH— | 962864 | 4.785 | (a) RCO7_01115 (transcription factor Som1) |
| (b) iHS—cluster 3 | 963272 | 4.343 | |||
| 2 | RCO7_scaffold003 | (a) XP-EHH— | 624918 | 3.564 | |
| (b) iHS—cluster 3 | 624918 | 4.6574 | |||
| 3 | RCO7_scaffold012 | (a) XP-EHH—cluster 2 versus | 1822306 | −3.977 | (a) RCO7_02695 (uncharacterized protein) |
| (b) iHS—cluster 2 | 1823054 | 8.694 | |||
| 1823058 | 8.694 | ||||
| 4 | RCO7_scaffold020 | (a) XP-EHH—cluster 2 versus | 647275 | −4.251 | |
| (b) iHS—cluster 2 | 646208 | 8.834 | (a) RCO7_03084 (uncharacterized protein) | ||
| 5 | RCO7_scaffold001 | (a) XPEHH—cluster2 versus | 1026013 | 3.914 | |
| (b) XPEHH— | 1025700 | 4.130 | |||
| 6 | RCO7_scaffold009 | (a) XPEHH—cluster 2 versus | 205732 | −4.091 | (a) intergenic |
| (b) XPEHH—cluster 1 versus | 204366 | −4.684 | (b) RCO7_06601 (Cu/Zn superoxide dismutase) | ||
| 7 | RCO7_scaffold002 | (a) XPEHH—cluster 2 versus | 818548 | −3.749 | |
| (b) XPEHH—cluster 2 versus | 818548 | −3.574 |
Note.—Regions 1–4 show selective sweep regions identified by both integrated haplotype score (|iHS|) and cross-population extended haplotype homozygosity (XP-EHH) analyses. Regions 5–7 show selective sweep regions identified in two separate XP-EHH analyses. Genes were reported if they were within 5,000 bp of the most significant single nucleotide polymorphism (SNP) locus (top 0.05%). For each XP-EHH selection analyses, the genetic cluster in which selection was detected is shown in bold. Gene names shown in bold are genes found in the overlap of both selection analyses. Gene names labelled with either (a) or (b) refer to which selection analyses identified these genes (see third column).
. 4.—Gene ontology (GO) enrichment analyses for genes within selective sweep regions. Enriched GO terms related to biological process (BP), molecular functions (MF) and cellular components (CC) are shown for analyses of the integrated haplotype score (iHS) and cross-population extended haplotype homozygosity (XP-EHH). Loci were combined for all individual analyses of the three genetic clusters. GO terms were simplified using REVIGO prior to enrichment analyses and only significantly enriched terms P < 0.01 are shown.