Literature DB >> 32554716

An Avirulence Gene Cluster in the Wheat Stripe Rust Pathogen (Puccinia striiformis f. sp. tritici) Identified through Genetic Mapping and Whole-Genome Sequencing of a Sexual Population.

Chongjing Xia1,2,3, Yu Lei3,4, Meinan Wang3, Wanquan Chen5, Xianming Chen6,7.   

Abstract

Puccinia striiformis f. sp. tritici, the causal agent of wheat stripe (yellow) rust, is an obligate, biotrophic fungus. It was difficult to study the genetics of the pathogen due to the lack of sexual reproduction. The recent discovery of alternate hosts for P. striiformis f. sp. tritici makes it possible to study inheritance and map genes involved in its interaction with plant hosts. To identify avirulence (Avr) genes in P. striiformis f. sp. tritici, we developed a segregating population by selfing isolate 12-368 on barberry (Berberis vulgaris) plants under controlled conditions. The dikaryotic sexual population segregated for avirulent/virulent phenotypes on nine Yr single-gene lines. The parental and progeny isolates were whole-genome sequenced at >30× coverage using Illumina HiSeq PE150 technology. A total of 2,637 high-quality markers were discovered by mapping the whole-genome sequencing (WGS) reads to the reference genome of strain 93-210 and used to construct a genetic map, consisting of 41 linkage groups, spanning 7,715.0 centimorgans (cM) and covering 68 Mb of the reference genome. The recombination rate was estimated to be 1.81 ± 2.32 cM/10 kb. Quantitative trait locus analysis mapped six Avr gene loci to the genetic map, including an Avr cluster harboring four Avr genes, AvYr7, AvYr43, AvYr44, and AvYrExp2 Aligning the genetic map to the reference genome identified Avr candidates and narrowed them to a small genomic region (<200 kb). The discovery of the Avr gene cluster is useful for understanding pathogen evolution, and the identification of candidate genes is an important step toward cloning Avr genes for studying molecular mechanisms of pathogen-host interactions.IMPORTANCE Stripe rust is a destructive disease of wheat worldwide. Growing resistant cultivars is the most effective, easy-to-use, economical, and environmentally friendly strategy for the control of the disease. However, P. striiformis f. sp. tritici can produce new virulent races that may circumvent race-specific resistance. Therefore, understanding the genetic basis of the interactions between wheat genes for resistance and P. striiformis f. sp. tritici genes for avirulence is useful for improving cultivar resistance for more effective control of the disease. This study developed a high-quality map that facilitates genomic and genetic studies of important traits related to pathogen pathogenicity and adaptation to different environments and crop cultivars carrying different resistance genes. The information on avirulence/virulence genes identified in this study can be used for guiding breeding programs to select combinations of genes for developing new cultivars with effective resistance to mitigate this devastating disease.

Entities:  

Keywords:  Puccinia striiformiszzm321990; QTL mapping; avirulence; genetics; host-pathogen interaction; wheat stripe rust; whole-genome sequencing

Mesh:

Year:  2020        PMID: 32554716      PMCID: PMC7300351          DOI: 10.1128/mSphere.00128-20

Source DB:  PubMed          Journal:  mSphere        ISSN: 2379-5042            Impact factor:   4.389


INTRODUCTION

Rust diseases, caused by fungi in the order Pucciniales, are a large threat to food security and impact ecosystems (1). Economically important rusts include leaf rust in coffee (2), soybean rust (3), crown rust in oat (4), flax rust (5), poplar leaf rust (6), and wheat rusts (7). Numerous epidemics of these rusts have been recorded in diverse agriculture systems and in many countries. Extensive efforts have been made to incorporate resistance genes into cultivars to protect plants from attacks of rust pathogens. However, rapidly evolving rust fungi are highly capable of evading plant immunity systems, resulting in ineffective host resistance. To avoid resistance failure and elongate the effectiveness of resistance genes, understanding the molecular mechanisms underlying host-pathogen interactions is essential. Plant-pathogen interactions were initially studied by Harold Flor and explained by his gene-for-gene concept (8). In this concept, host defense responses are activated by the recognition of a pathogen avirulence (Avr) gene by a cognate resistance (R) gene in the host. This concept has been supported by the fact that many Avr genes from bacteria, oomycetes, and fungi and R genes from different plant hosts have been cloned, and direct or indirect interactions between the products of some of the cloned R and Avr genes have been demonstrated (9). Such pioneering works have considerably increased our understanding of host-pathogen interactions. While Avr gene recognition is often referred to as effector-triggered immunity (ETI), pathogen-associated molecular pattern (PAMP)-triggered immunity (PTI) is generally thought of as a different type of resistance (10). PTI is activated by the recognition of conserved pathogen PAMPs such as chitin in the fungal cell wall. Pathogen effectors encoded by pathogenicity genes then suppress host PTI, enabling the pathogen to infect and cause disease. During coevolution, host plants gain R genes to detect evading pathogens. The recognition of Avr proteins by R proteins leads to ETI. As a coevolutionary arms race, a pathogen avoids the perception of a host R protein by mutating its Avr genes or developing new Avr genes to overcome or suppress host ETI. Classical approaches for cloning Avr genes and effectors include reverse-genetics and mapping-based positional methods. The reverse-genetics approach has been used to successfully clone many Avr genes in different pathosystems, including Cladosporium fulvum-tomato, from which the first fungal Avr gene, avr9, was identified and cloned (11, 12). Even though many effectors from oomycetes and fungi have been cloned using this approach, the reverse-genetics approach for Avr identification is not suitable for wheat-Puccinia pathosystems because the techniques usually used in reverse genetics, such as transformation, effector delivery systems, and RNA interference, are still not mature for rust fungi (13). An alternate approach for cloning Avr genes in plant pathogens is based on genetic mapping. Briefly, in the genetic mapping-based approach, an Avr locus is genetically mapped along with molecular markers; next, the genomic interval between two flanking markers is completely sequenced; and finally, the Avr gene is identified, followed by functional validation. The efficiency of this approach has been well demonstrated in cloning avirulence genes in the ascomycete fungus Leptosphaeria maculans, the causal agent of stem canker/black leg of oilseed rape and canola (14–18). Recently, this approach has been complemented by comparative genomics approaches, which has significantly accelerated the identification of Avr genes in L. maculans (19, 20). The mapping-based cloning approach has also been successful in basidiomycetes. In fact, this approach was successfully used to clone UhAvr1 from the barley smut pathogen, Ustilago hordei, the first avirulence gene cloned from basidiomycete fungi (21, 22). Particularly in Melampsora lini, the flax rust fungus, which is also in the order Pucciniales containing cereal rust pathogens, several Avr genes that follow the gene-for-gene relationship with flax resistance genes have been genetically mapped since the work of Flor and cloned using genome sequencing technology (23, 24). More recently, two Avr genes (AvrSr35 and AvrSr50) in Puccinia graminis f. sp. tritici, the causal agent of stem rust of wheat and barley, were cloned. AvrSr35 was identified through whole-genome sequencing (WGS) and comparison of chemically mutagenized mutants with a natural isolate (25), while AvrSr50, located in a loss-of-heterozygosity region, was identified by analyzing the genome variation and gene expression of spontaneous mutants (26). Among the rust fungi, Puccinia striiformis Westend. f. sp. tritici Erikss. causes wheat stripe (yellow) rust and is recognized as one of the most serious plant pathogens threatening global food security (27–30). P. striiformis f. sp. tritici is a macrocyclic, heteroecious fungus having five spore stages in its complete life cycle. Its urediniospores (n + n) are produced on and can reinfect primary hosts (wheat and grasses), on which teliospores (2n) are usually produced in the late crop season. Teliospores germinate to produce basidiospores (n) after meiosis, and basidiospores infect alternate hosts (Berberis spp. and Mahonia spp.), on which pycniospores (n) are produced, and fertilize receptive hyphae (n) to produce aeciospores (n + n). Aeciospores infect the primary host and produce urediniospores (31–33). Although economically important, avirulence genes have not been molecularly identified and characterized in P. striiformis f. sp. tritici. Cloning of Avr genes has not been conducted in P. striiformis f. sp. tritici, and the potential of a genetic mapping-based approach has been impeded, mainly due to the unknown alternate hosts of P. striiformis f. sp. tritici until the recent discovery of its alternate hosts in the genera Berberis and Mahonia (31–33). Since then, sexual populations of P. striiformis f. sp. tritici have been developed, and genetic studies of the inheritance of virulence phenotypes have been conducted (34–36). However, no Avr genes could be precisely defined due to the limited number of codominant molecular markers and the fragmented reference genomes. To conquer these limitations, we developed a segregating population through self-fertilizing a P. striiformis f. sp. tritici isolate on barberry and developed a high-density genetic map consisting of a large number of genome-wide molecular markers by whole-genome sequencing of the progeny population using next-generation sequencing technology. We mapped six Avr genes, including four in a gene cluster. Comparison of the high-density map regions with a reference genome of the pathogen enabled us to identify Avr candidates in narrow genome regions. The results set the basis for cloning the Avr genes for understanding the molecular mechanisms underlying rapid virulence changes in the wheat stripe rust fungus.

RESULTS

Virulence phenotyping.

The P. striiformis f. sp. tritici isolate 12-368 was selected to generate a self-fertilized sexual population based on its capability of producing abundant teliospores and high heterozygosity revealed by molecular markers, representing a different race group from those of our previously established sexual populations. The parental isolate and progeny isolates were kept as urediniospores that have two nuclei and can be asexually reproduced on susceptible wheat plants for a large quantity. The dikaryotic uredinial stage was genetically treated as diploid in the present study and used for virulence phenotyping and genomic DNA sequencing. Based on avirulence/virulence characterization of the set of 18 wheat Yr (yellow rust) single-gene differentials, isolate 12-368 was identified as belonging to race PSTv-4, with avirulence to the resistance genes Yr5, Yr7, Yr8, Yr10, Yr15, Yr24, Yr32, Yr43, Yr44, YrTr1, and YrExp2 and virulence to the genes Yr1, Yr6, Yr9, Yr17, Yr27, YrSP, and Yr76 (37). The sexual reproduction of the parental isolate 12-368 through self-fertilization on barberry plants produced 117 progeny isolates (Fig. 1). Besides the above-mentioned 18 Yr single-gene lines for differentiating P. striiformis f. sp. tritici races, the parental and progeny isolates were also tested on 16 additional wheat lines, each with a single different resistance gene (see Materials and Methods). Virulence tests showed that the parental and progeny isolates were all avirulent to 12 Yr genes, including Yr5, Yr10, Yr15, Yr24, Yr32, YrTr1, Yr26, YrCV, YrTr1, Yr45, Yr53, and Yr64, indicating that the Avr genes corresponding to these Yr genes were homozygous in the parental isolate. Similarly, the parental and progeny isolates were all virulent to 13 Yr genes, including Yr1, Yr2, Yr6, Yr9, Yr21, Yr25, Yr28, Yr29, Yr31, Yr76, YrA (i.e., Yr73 plus Yr74), Yr74 (in the Avocet Susceptible [AvS] cultivar), and YrSP, suggesting that the virulence loci corresponding to these Yr genes were also homozygous in the parental isolate. Therefore, these avirulence or virulence loci could not be mapped in this study. Detailed infection types (ITs) of parental and progeny isolates are provided in Data Set S1 in the supplemental material.
FIG 1

Experimental procedure for sexual population construction in this study.

Experimental procedure for sexual population construction in this study. Infection types of parental and progeny isolates on 34 wheat lines carrying 34 Yr genes for resistance to stripe rust. Download Data Set S1, XLSX file, 0.03 MB. In contrast, the avirulence/virulence phenotypes of the parental isolate to nine Yr genes (Yr7, Yr8, Yr17, Yr27, Yr35, Yr41, Yr43, Yr44, and YrExp2) were segregating in the progeny population (Table 1). Thus, the Avr genes corresponding to these resistance genes could be mapped. Since the dikaryotic (two unfused nuclei in a cell) urediniospores are heterozygous at these loci, the parental isolate was considered F1, and the progeny isolates produced through self-fertilization of the parental isolate were considered F2. Therefore, the avirulence/virulence phenotypes of progeny isolates should follow the segregation patterns in an F2 population. On wheat lines carrying Yr7, Yr8, Yr17, Yr41, Yr43, Yr44, and YrExp2, the avirulence/virulence phenotypes of progeny isolates fit the models that would be expected if the avirulence phenotype were dominant, whereas on the wheat lines with Yr27 or Yr35, to which the parental isolate was virulent, phenotypes of the progeny isolates were segregating, suggesting that the virulence phenotypes were dominant (Table 1).
TABLE 1

Segregation of avirulence/virulence in the progeny isolates derived from selfing parental isolate 12-368 of Puccinia striiformis f. sp. tritici on wheat Yr single-gene lines

Wheat Yr gene lineITa of 12-368No. of progeny isolates
Expected ratio (A/V)Pb Avirulence gene(s)
AV
Yr73 (A)86313:10.71AvYr7
Yr434 (A)91363:10.20AvYr43
Yr443 (A)91363:10.20AvYr44
YrExp23 (A)82353:10.22AvYrExp2
Yr82 (A)1051215:10.07AvYr8-1, AvYr8-2
Yr277 (V)31861:30.71avYr27
Yr178 (V)21951:30.09avYr17
Yr418 (V)161013:130.16AvYr41, AvYr41-Inh
Yr357 (V)48697:90.55avYr35-1, avYr35-2

IT, infection type based on a scale from 0 to 9, with 0 to 6 being avirulent (A) and 7 to 9 being virulent (V).

P, probability of goodness of fit by a χ2 test.

Segregation of avirulence/virulence in the progeny isolates derived from selfing parental isolate 12-368 of Puccinia striiformis f. sp. tritici on wheat Yr single-gene lines IT, infection type based on a scale from 0 to 9, with 0 to 6 being avirulent (A) and 7 to 9 being virulent (V). P, probability of goodness of fit by a χ2 test. The segregations of the parental avirulent phenotypes on wheat lines with Yr7, Yr43, Yr44, and YrExp2 fit the 3:1 avirulent/virulent (A/V) ratio, suggesting that each of the avirulent phenotypes of the parental isolate was controlled by a dominant gene. Therefore, the avirulence genes were designated AvYr7, AvYr43, AvYr44, and AvYrExp2, respectively. The segregation of avirulence on the wheat line with Yr8 fit the 15:1 A/V ratio, indicating two dominant avirulence genes, designated AvYr8-1 and AvYr8-2. The virulence phenotypes on wheat lines possessing Yr17 and Yr27 segregated at the 1:3 A/V ratio, indicating a recessive avirulence or a dominant virulence gene corresponding to each of the resistance genes. These P. striiformis f. sp. tritici genes were designated avYr17 and avYr27, respectively. The segregation of virulence to Yr41 best fit the 3:13 A/V ratio, indicating an epistatic interaction of a dominant inhibitor (AvYr41-Inh) over a dominant avirulence gene (AvYr41). On the wheat line with Yr35, the observed 7:9 A/V ratio indicated two independent recessive genes for avirulence, designated avYr35-1 and avYr35-2.

Genotyping by whole-genome sequencing.

To identify molecular markers for genetic mapping, Illumina HiSeq 150-bp paired-end (PE) technology was used to sequence the whole genomes of all 117 progeny isolates as well as the parental isolate. Twenty-six million pairs of reads (7.89 Gb in total) and 1.5 billion pairs of reads (471 Gb in total) were generated from the parental and progeny isolates, respectively. The numbers of filtered reads, percentages of mapped reads, and mapping coverages and qualities for the parental and all progeny isolates are summarized in Table S1. On average, 4.02-Gb sequences were generated for each progeny isolate. The high-quality reads of progeny isolates were mapped to the reference genome of isolate P. striiformis f. sp. tritici 93-210 (38). On average, 95.72% of reads were mapped to the reference genome. The mapping coverage of the progeny isolates ranged from 24.16× to 50.99×, with a mean coverage of 35.45×. The deep sequencing and high-quality reads enabled us to identify genome-wide variations for genetic mapping. Summary of whole-genome sequencing of progeny isolates derived from selfing parental isolate 12-368 of Puccinia striiformis f. sp. tritici. Download Table S1, DOCX file, 0.03 MB. In total, 2,487 heterozygous single nucleotide polymorphisms (SNPs) and 150 indels in the parental isolate genome were obtained based on their segregation at the 1:2:1 (AA:AB:BB) ratio (P ≥ 0.05 by a chi-squared test) in the progeny population (Fig. 2A). Contamination analysis suggested that 23 of the progeny isolates had abnormal numbers of crossover events, and these were therefore excluded from subsequent analyses. A total of 2,637 codominant markers were selected from 251 (out of 492) contigs, covering 71.35 Mb (out of 84.62 Mb) of the reference genome (Fig. 2B). The mean distance between two markers in the reference genome was 42.93 kb. Detailed genotypes of parental and progeny isolates are listed in Data Set S2.
FIG 2

Genotyping by whole-genome sequencing and genetic map construction. (A) Frequencies of homozygous AA, heterozygous (AB), and homozygous BB marker loci. (B) Circos plot of the distribution of markers along the genome and the linkage groups. The outer layer shows the contigs of the reference genome. Each colored bar represents one contig. The middle layer shows the distribution of markers in the reference genome. Each solid dot represents one marker. The inner layer shows the linkage groups of the genetic map. Note that the reference genome is reorganized corresponding to genetic groups, and the empty white bar in the inner layer represents the contigs that cannot be mapped to the linkage groups. Pst, P. striiformis f. sp. tritici. (C) Distribution of different numbers of single-crossover events. (D) Distribution of different numbers of double-crossover events.

Genotyping by whole-genome sequencing and genetic map construction. (A) Frequencies of homozygous AA, heterozygous (AB), and homozygous BB marker loci. (B) Circos plot of the distribution of markers along the genome and the linkage groups. The outer layer shows the contigs of the reference genome. Each colored bar represents one contig. The middle layer shows the distribution of markers in the reference genome. Each solid dot represents one marker. The inner layer shows the linkage groups of the genetic map. Note that the reference genome is reorganized corresponding to genetic groups, and the empty white bar in the inner layer represents the contigs that cannot be mapped to the linkage groups. Pst, P. striiformis f. sp. tritici. (C) Distribution of different numbers of single-crossover events. (D) Distribution of different numbers of double-crossover events. Molecular marker genotypes of progeny isolates derived from self-fertilized isolate 12-368 of Puccinia striiformis f. sp. tritici. Download Data Set S2, XLSX file, 1.0 MB.

Genetic map of P. striiformis f. sp. tritici.

After correcting allele switches and filtering potentially contaminated isolates, a genetic map was generated using the minimum spanning tree algorithm at a P value of 1E−10, which contained 2,631 markers in 41 linkage groups (LGs) (LG-1 to LG-41) (Table 2; Data Set S3); the remaining 6 markers could not be linked in the genetic map. The genetic map spanned a total of 7,715.0 centimorgans (cM), with individual LGs of up to 1,011.0 cM (LG-1, with 312 markers). The average genetic distance between markers was 2.94 cM throughout the genome. Comparison between LGs and the reference genome showed that multiple genome contigs could be tagged to one LG. For example, LG-1 included 39 contigs, covering over 10.86 Mb. With this information, we were able to estimate the recombination rate for LGs with at least 10 markers, and the average recombination rate across these LGs was 1.81 ± 2.32 cM/10 kb (Table 3). This rate was higher than those of the flax rust fungus M. lini (1.18 cM/10 kb) and Zymoseptoria tritici (1.25 cM/10 kb) and much higher than that of Fusarium graminearum (0.3 to 05 cM/10 kb) (Table 3).
TABLE 2

General features of the constructed linkage map

ChrNo. of markersLength (cM)Avg genetic distance between markers (cM) ± SDNo. of contigsContig length (bp)Mean recombination rate (cM/10 kb) ± SD
LG-13121,011.03.25 ± 3.043910,869,4790.66 ± 3.07
LG-2288762.72.65 ± 1.99276,567,2331.94 ± 2.21
LG-3232696.53.01 ± 2.23214,370,4141.66 ± 2.43
LG-4216655.53.04 ± 2.54246,208,2001.72 ± 2.37
LG-5183436.72.39 ± 2.36185,300,8671.35 ± 1.61
LG-6190413.42.18 ± 1.66131,825,6961.48 ± 2.18
LG-7124387.03.14 ± 2.75132,688,3371.80 ± 2.20
LG-8132343.62.62 ± 1.6392,509,1861.84 ± 2.71
LG-9101293.72.93 ± 2.40112,209,5121.99 ± 2.07
LG-1093292.13.17 ± 2.295943,7981.95 ± 2.01
LG-1167268.94.07 ± 3.5871,838,7582.11 ± 2.01
LG-1276248.33.31 ± 2.55102,280,8121.61 ± 1.78
LG-1372232.53.27 ± 3.00112,224,7641.94 ± 2.02
LG-1451179.03.58 ± 3.6581,834,7192.11 ± 2.25
LG-1567167.72.54 ± 1.5551,431,5741.77 ± 2.15
LG-1648153.83.27 ± 1.7471,390,0432.17 ± 2.78
LG-1749152.13.16 ± 2.293549,1492.01 ± 2.11
LG-1858147.42.58 ± 1.4671,158,9361.42 ± 1.70
LG-1930135.04.65 ± 3.479833,0853.19 ± 3.37
LG-202881.93.03 ± 1.282425,9501.05 ± 1.52
LG-212781.43.13 ± 2.354923,0421.92 ± 1.86
LG-221362.05.14 ± 3.483768,9771.46 ± 1.06
LG-231554.13.86 ± 3.431602,1191.60 ± 1.14
LG-241950.92.82 ± 1.511512,1941.84 ± 2.48
LG-251948.42.68 ± 1.525866,4271.99 ± 1.49
LG-262643.91.75 ± 1.793602,6080.76 ± 1.45
LG-271243.33.94 ± 2.641287,3921.94 ± 0.93
LG-28938.34.78 ± 3.312838,633NC
LG-291432.21.16 ± 1.462476,4023.55 ± 2.49
LG-301031.03.44 ± 3.262947,8702.58 ± 2.44
LG-31726.94.48 ± 1.881295,440NC
LG-32525.46.36 ± 2.671336,304NC
LG-33320.910.46 ± 3.391536,190NC
LG-34719.43.23 ± 0.681295,889NC
LG-35618.73.73 ± 1.191261,047NC
LG-36617.33.45 ± 1.552397,004NC
LG-37514.43.59 ± 0.741815,364NC
LG-38313.56.74 ± 0.322630,032NC
LG-3949.53.16 ± 1.462554,726NC
LG-4024.11181,331NC
LG-4120.51230,054NC
 
Overall2,6317,715.02.94 ± 2.3925168,819,5571.81 ± 2.32

Chr, chromosome; NC, not calculated because the linkage has <10 markers.

TABLE 3

Genome-wide CpG islands in selected plant pathogens

ParameterValue for pathogen
C. quercuum f. sp. fusiformea F. graminearumb M. linic P. striiformis f. sp. triticid Z. triticie
Genome length (bp)76,567,84237,946,458189,516,65384,531,32539,686,251
Mean recombination rate (cM/10 kb) ± SD (reference)Unknown (51)g 0.3–0.5 (52)g 1.18 (24)g 1.81 ± 2.321.25 (53)g
Total no. of CpGs1,310,3601,846,8274,449,4492,003,5932,796,481
No. of CpG dinucleotides in CpG islands (%)483,899 (36.9)128,731 (6.97)765,979 (17.2)587,056 (29.30)105,599 (3.77)
No. of predicted CpG islands29,7546,26040,81529,6715,454
Island coverage (%)f 7.372.875.097.261.64
Island length (bp)
    Avg ± SD189.68 ± 130.65174.07 ± 108.73236.65 ± 164.82207.02 ± 135.58120.05 ± 77.74
    Min68668
    Max1,4909183,2231,733697
Avg island GC% ± SD54.62 ± 7.2560.58 ± 6.9254.85 ± 8.9857.51 ± 2.0866.64 ± 8.07
Avg CpG O/E ratio ± SD1.37 ± 0.351.53 ± 0.311.31 ± 0.361.37 ± 0.081.73 ± 0.30

The reference genome was from G11 (https://genome.jgi.doe.gov/portal/Croqu1/download/Croqu1_AssemblyScaffolds.fasta.gz).

The reference genome was from isolate RRES (GenBank accession no. HG970335).

The reference genome was from isolate CH5 (https://genome.jgi.doe.gov/portal/Melli1/download/Melli1_AssemblyScaffolds.fasta.gz).

The reference genome was from isolate PST93-210.

The reference genome was from isolate IPO323 (https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/219/625/GCF_000219625.1_MYCGR_v2.0/GCF_000219625.1_MYCGR_v2.0_genomic.fna.gz).

Percentage of the genome covered by CpG islands.

Reference from which the previously estimated recombination rates were retrieved.

General features of the constructed linkage map Chr, chromosome; NC, not calculated because the linkage has <10 markers. Genome-wide CpG islands in selected plant pathogens The reference genome was from G11 (https://genome.jgi.doe.gov/portal/Croqu1/download/Croqu1_AssemblyScaffolds.fasta.gz). The reference genome was from isolate RRES (GenBank accession no. HG970335). The reference genome was from isolate CH5 (https://genome.jgi.doe.gov/portal/Melli1/download/Melli1_AssemblyScaffolds.fasta.gz). The reference genome was from isolate PST93-210. The reference genome was from isolate IPO323 (https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/219/625/GCF_000219625.1_MYCGR_v2.0/GCF_000219625.1_MYCGR_v2.0_genomic.fna.gz). Percentage of the genome covered by CpG islands. Reference from which the previously estimated recombination rates were retrieved. Genetic map of Puccinia striiformis f. sp. tritici constructed from a self-fertilized sexual population from isolate 12-368. Download Data Set S3, XLSX file, 0.2 MB. To investigate potential genome features that might contribute to the relatively large P. striiformis f. sp. tritici genetic map and high recombination rate, we estimated the numbers of single- and double-crossover events in each isolate, with averages of 133.15 and 35.94, respectively (Fig. 2C and D). We also calculated the genome coverage by CpG islands in P. striiformis f. sp. tritici and compared this value with those of a few other plant-pathogenic fungi. The CpG islands in the P. striiformis f. sp. tritici reference isolate covered 7.26% of the genome (Table 3). The CpG island coverage was higher than those of M. lini (5.09%), Fusarium graminearum (2.87%), and Zymoseptoria tritici (1.64%) but slightly lower than that of the pine fusiform rust fungus Cronartium quercuum f. sp. fusiforme (7.37%) (Table 3). The average recombination rate of the P. striiformis f. sp. tritici CpG islands was estimated to be 1.81 cM/10 kb, the same as the average recombination rate of the overall genome mentioned above. Due to the lack of data for the other fungi, we were unable to compare the CpG island recombination rates of P. striiformis f. sp. tritici with those of these fungi.

QTL mapping for avirulence loci.

The genetic map and infection type (IT) data of the segregating avirulence/virulence phenotypes of the progeny population were used for quantitative trait locus (QTL) mapping. Six avirulence genes (AvYr7, AvYr43, AvYr44, AvYrExp2, AvYr8-1, and avYr27) were mapped to three LGs (Table 4). AvYr8-1 was mapped to LG-19, at the 6.82- to 31.09-cM region flanked by markers C085_283131 and C182_5504. avYr27 was mapped to LG-4, between the 446.65- and 528.51-cM positions flanked by markers C162_20837 and C086_188415, respectively (Data Set S3). The QTL confidence interval of AvYr8-1 covered three contigs, contig 1.085 (from kb 200 to kb 283), contig 1.137 (from kb 12 to kb 29), and contig 1.182 (from kb 5 to kb 14) in the reference genome of isolate 93-210. The avYr27 interval covered two contigs, contig 1.086 (from kb 10 to kb 188) and contig 1.162 (from kb 1 to kb 164). Four avirulence genes, AvYr44, AvYr7, AvYr43, and AvYrExp2, were mapped to the same LG region between the 29.06- and 57.49-cM positions flanked by markers C022_56722 and C022_180222 in LG-22, respectively (Fig. 3A; Data Set S3).
TABLE 4

Quantitative trait loci for Avr genes identified in the selfing population of isolate 12-368 of Puccinia striiformis f. sp. tritici

QTLGenetic map
Physical mapb
Linkage groupFlanking markersPeak position (cM)Interval (cM)LODP valuePVEa ContigInterval (kb)
AvYr8-1LG-19C085_283131, C182_550419.006.82–31.096.73<0.000130.851.085200–283
1.13712–29
1.1825–14
 
avYr27LG-4C162_20837, C086_188415484446.65–528.516.16<0.000128.651.08610–188
1.1621–164
 
AvYr44LG-22C022_56722, C022_18022247.0029.06–57.4912.07<0.000148.401.0220–200
 
AvYr7LG-22C022_56722, C022_18022246.0029.06–57.4910.3<0.000143.141.0220–200
 
AvYr43LG-22C022_56722, C022_18022243.0029.06–57.4910.91<0.000144.981.0220–200
 
AvYrExp2LG-22C022_56722, C022_18022247.0029.06–57.499.36<0.000140.131.0220–200

PVE, percentage of variance explained by the QTL, calculated as , where n is the total number of individuals.

Based on the reference genome of isolate 93-210 (38).

FIG 3

The AvYr44-AvYr7-AvYr43-AvYrExp2 cluster. (A) QTL mapping identified a region carrying the AvYr44, AvYr7, AvYr43, and AvYrExp2 genes. (B) Manhattan plot of the genome-wide association study (GWAS) performed on contig 1.022. (C) Syntenic analysis of contig 1.022 from the reference genome of isolate 93-210 with homologous contigs from the genomes of isolates 104E 137A− and 11-281. Note that the first 10 kb of contig 1.022, which harbors most of the significant GWAS signals, were absent in 104E 137A− and 11-281. Blue blocks are syntenic regions. Red stars are genes. The darker the linking lines between two genomes, the higher degree of similarity between the genomes. (D) SNP density and GC content of contig 1.022.

Quantitative trait loci for Avr genes identified in the selfing population of isolate 12-368 of Puccinia striiformis f. sp. tritici PVE, percentage of variance explained by the QTL, calculated as , where n is the total number of individuals. Based on the reference genome of isolate 93-210 (38). The AvYr44-AvYr7-AvYr43-AvYrExp2 cluster. (A) QTL mapping identified a region carrying the AvYr44, AvYr7, AvYr43, and AvYrExp2 genes. (B) Manhattan plot of the genome-wide association study (GWAS) performed on contig 1.022. (C) Syntenic analysis of contig 1.022 from the reference genome of isolate 93-210 with homologous contigs from the genomes of isolates 104E 137A− and 11-281. Note that the first 10 kb of contig 1.022, which harbors most of the significant GWAS signals, were absent in 104E 137A− and 11-281. Blue blocks are syntenic regions. Red stars are genes. The darker the linking lines between two genomes, the higher degree of similarity between the genomes. (D) SNP density and GC content of contig 1.022.

Genomic location of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster.

All markers from similar QTL regions for AvYr44, AvYrYr7, AvYr43, and AvYrExp2 were located in a single contig, namely, contig 1.002, in the reference genome of isolate 93-210. The leftmost marker was at bp 6812, and the rightmost marker was at bp 180222, indicating that the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster was most likely within the region of the first 200 kb of contig 1.002 in the reference genome. Our previous annotation of this region of the reference genome (39) identified 47 protein-coding genes, 4 of which have signal peptides in the N terminus (Table 5).
TABLE 5

Candidate genes of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster in the confidence interval defined by QTL analysis

93-210 gene nameFunctional annotationSecretedMature length (amino acids)% cysteine
PSTG_03388Hypothetical proteinNo2911.03
PSTG_03389Enoyl-(acyl carrier protein) reductaseNo3511.14
PSTG_03390ABC transporterNo1,0530.66
PSTG_03391Hypothetical proteinNo1021.96
PSTG_03392Hypothetical proteinNo8141.11
PSTG_03393Hypothetical proteinNo1990.50
PSTG_03394Hypothetical proteinYes2280.88
PSTG_03395Noncatalytic module family expansinNo3510.57
PSTG_03396Hypothetical proteinNo5450.37
PSTG_03397Hypothetical proteinNo4581.09
PSTG_03398Hypothetical proteinNo4890.82
PSTG_03399Hypothetical proteinYes1431.40
PSTG_03400Threonine dehydratase INo4861.03
PSTG_03401Hypothetical proteinNo3940.25
PSTG_03402Hypothetical proteinNo4000.75
PSTG_03403Hypothetical proteinNo4140.97
PSTG_03404Hypothetical proteinNo1540.65
PSTG_03405Ca2+/calmodulin-dependent protein kinaseNo4520.66
PSTG_03406zap1 metalloregulator involved in zinc-responsive transcriptional regulationNo6281.27
PSTG_03407Ubiquitin-specific protease 7No1,1160.45
PSTG_03408Hypothetical proteinYes1302.31
PSTG_03409Hypothetical proteinNo5560.72
PSTG_03410RING-H2 finger ATL54-likeNo2430.82
PSTG_03411P-loop-containing nucleoside triphosphate hydrolaseNo1891.59
PSTG_03412Per1-like; involved in manganese homeostasisNo4580.87
PSTG_03413Inositol monophosphataseNo3150.95
PSTG_03414Subunit of cytochrome bd ubiquinol oxidaseNo1280.00
PSTG_03415Hypothetical proteinYes3380.30
PSTG_03416Hypothetical proteinNo1481.35
PSTG_03417Hypothetical proteinNo1730.58
PSTG_03418Hypothetical proteinNo1350.00
PSTG_03419Hypothetical proteinNo1890.53
PSTG_034203-Deoxy-d-arabinoheptulosonate 7-phosphate (DAHP) synthetase I familyNo4351.38
PSTG_03421Hypothetical proteinNo6781.18
PSTG_03422tRNA
PSTG_03423Hypothetical proteinNo6961.01
PSTG_03424Hypothetical proteinNo4201.43
PSTG_03425Hypothetical proteinNo3801.05
PSTG_03426Hypothetical proteinNo1421.41
PSTG_03427Hypothetical proteinNo1930.52
PSTG_03428Hypothetical proteinNo3010.66
PSTG_03429Hypothetical proteinNo4091.22
PSTG_03430Hypothetical proteinNo230.00
PSTG_03431Hypothetical proteinNo2690.74
PSTG_03432Hypothetical proteinNo900.00
PSTG_03433Hypothetical proteinNo2410.41
PSTG_03434RNA polymerase I-specific transcription initiation factor rrn11No2701.48
Candidate genes of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster in the confidence interval defined by QTL analysis To validate the QTL mapping results and to identify putative casual variations, we performed a genome-wide association study (GWAS) for each of the AvYr44, AvYr7, AvYr43, and AvYrExp2 genes. Instead of using only markers in nontransposable elements (non-TEs) and without significant distortion for the 1:2:1 ratio in the QTL analysis, all markers and all 117 progeny isolates were used for the GWAS. In this way, 609 markers from contig 1.022 were obtained and used in the GWAS. Seventeen out of the 18 significantly associated SNPs were within the first 200 kb on contig 1.022. Surprisingly, a major GWAS signal peak was within the first 5-kb region (Fig. 3B). This peak matched the gene PSTG_03388, and several of the SNPs within the gene were nonsynonymous (Table 6). Thus, both QTL analyses and GWASs located the AvYr7-AvYr43-AvYrExp2 locus to the first 200-kb region of contig 1.022.
TABLE 6

SNPs within the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster significantly associated with avirulent (Avr) and virulent (avr) phenotypes

Avr geneSNPP_FDR_adjGenotype(s)
Annotation
Avravr
AvYr44C022_25623.30E−05GAAAIntergenic
C022_1701660.00144TCTTIntergenic
 
AvYr7C022_16340.00018GAGGPSTG_03388, nonsynonymous, Pro/Leu
C022_16430.00018TCCCPSTG_03388, nonsynonymous, Glu/Gly
C022_23430.00018TCCCIntergenic
C022_25620.00018GAAAIntergenic
C022_25680.00018GTTTIntergenic
C022_25760.00018ACAAIntergenic
C022_25840.00018GAAAIntergenic
C022_82830.00064GC, CCGGPSTG_03390, nonsynonymous, Asp/His
C022_20990.00109GTTTPSTG_03388, nonsynonymous, Pro/Gln
C022_15550.00172TAAAPSTG_03388, synonymous
C022_15610.00172GAAAPSTG_03388, synonymous
 
AvYr43C022_15551.12E−10TAAAPSTG_03388, synonymous
C022_1883430.00045ATAAIntergenic
C079_1502090.00226GAAAPSTG_03421, intron
 
AvYrExp2C022_16345.47E−05GAGGPSTG_03388, nonsynonymous, Pro/Leu
C022_16435.47E−05TCCCPSTG_03388, nonsynonymous, Glu/Gly
C022_23435.47E−05TCCCIntergenic
C022_23705.54E−05CTTTIntergenic
C022_25628.70E−05GAAAIntergenic
C022_25688.70E−05GTTTIntergenic
C022_25768.70E−05ACCCIntergenic
C022_25848.70E−05GAAAIntergenic
C022_22820.00034AGGGIntergenic
C022_22940.00034TCCCIntergenic
C022_820800.00034AG, GGAAPSTG_03406, synonymous
C022_15310.00038GTTTPSTG_03388, synonymous
C022_18900.00038CGGGPSTG_03388, nonsynonymous, Glu/Gln
C022_20990.00051GTTTPSTG_03388, nonsynonymous, Pro/Gln
C022_26760.00051AC, CCAAIntergenic
C022_3201180.00059TT, T–Intergenic
C022_82830.00096GC, CCGGPSTG_03390, nonsynonymous, Asp/His
C022_469830.00096TA, AATTPSTG_03396, synonymous
SNPs within the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster significantly associated with avirulent (Avr) and virulent (avr) phenotypes We calculated the SNP density and GC content in contig 1.022. The region harboring the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster had 49 SNPs within the first 5 kb and only 4 between kb 5 and kb 20. The GC content was 47% within the first 20 kb, relatively in balance with the AT content along the contig (Fig. 3C). We further investigated whether AT-rich genomic regions exist in the P. striiformis f. sp. tritici genome and whether contig 1.022 is located in an AT-rich region. The unimodal distributions of GC content indicated that there were no distinct AT-rich segments in either the whole genome or contig 1.022 (Fig. S1). Therefore, we conclude that the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster is not located in an AT-rich region. Next, we attempted to investigate the genomic environment of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster. Considering that this cluster resides in one of the contig terminal regions, we tried to extend contig 1.022 by aligning it to three other well-assembled genomes, those of isolates 104E 137A− (40), 11-281 (41), and 93TX-2, in addition to the reference genome (93-210) (Fig. S2). Surprisingly, the homologous contigs of these genomes terminated around the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster, except for the genomes of the parental and reference isolates (Fig. 3D). In fact, the first 10-kb region of contig 1.022 in the reference genome of isolate 93-210 was mostly or partially absent in the 104E 137A−, 93TX-2, and 11-281 genomes (Fig. S2). Besides the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster, the remaining regions were highly conserved among these three genomes. Most SNPs associated with the Avr cluster were found within the first 2,600 bp of the contig in the parental isolate and within the first 2,700 bp of the contig in the reference genome. The first seven SNPs were within the first exon identified in the contig. In summary, the results suggested that the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster resides in a genetically complex region attached to a highly conserved genomic region. Unimodal (one-peak) distributions of GC contents in the whole genome (A) and contig 1.022 (B) of Puccinia striiformis f. sp. tritici. Download FIG S1, RTF file, 1.0 MB. Alignment of the genomic sequences of the parental isolate 12-368 (12368) of Puccinia striiformis f. sp. tritici (Pst) containing SNPs significantly associated with the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster to the first 20-kb region of P. striiformis f. sp. tritici isolate 93-210 contig 1.022 and isolates 104E 137A− (104E_137A) and 93TX-2. The SNPs significantly associated with virulence phenotypes are highlighted in bold red, and gene exons are highlighted with a yellow background. Note that the first 10 kb from isolate 93-210 was partially absent in the parental genome and mostly absent in the other three genomes. Download FIG S2, RTF file, 0.4 MB.

DISCUSSION

In the present study, we used Illumina sequencing technology to construct a high-density genetic map for mapping Avr genes in the wheat stripe rust fungus. We generated a P. striiformis f. sp. tritici sexual population for genetic mapping by self-fertilizing P. striiformis f. sp. tritici isolate 12-368 (race PSTv-4). Whole-genome deep sequencing of progeny isolates generated 2,637 high-quality codominant molecular markers, which enabled us to construct a high-density genetic map for P. striiformis f. sp. tritici comprising 41 LGs. QTL analysis mapped six Avr genes in three LG regions. Moreover, an avirulence gene cluster carrying four Avr genes was identified and located at a single contig in the P. striiformis f. sp. tritici reference genome. Aligning the genetic map to the reference genome enabled us to further locate the Avr candidates at a small genomic region (<200 kb). This study provides the resources for functional cloning of Avr genes and a better understanding of the genomic basis of the rapid evolution of virulence in the wheat-P. striiformis f. sp. tritici pathosystem.

Isolate-dependent inheritance of avirulence/virulence in P. striiformis f. sp. tritici.

The segregation patterns of phenotypes in the progeny population suggest a complex inheritance of avirulence/virulence in P. striiformis f. sp. tritici, which is consistent with previous observations (34–36). First, different phenotypes of avirulence/virulence to individual Yr genes could be controlled by one or two genes in a single isolate. In isolate 12-368 used in the present study, the avirulence phenotypes to Yr7, Yr43, Yr44, and YrExp2 were each controlled by a single dominant gene, whereas the avirulence phenotype to Yr8 was controlled by two dominant genes. Different numbers of genes controlling avirulence/virulence phenotypes in a single isolate have also been observed in other P. striiformis f. sp. tritici selfing populations (34–36) as well as in other fungi (42). One possible explanation is that there may be an unidentified resistance gene(s) in the wheat Yr gene lines used in the genetic studies of P. striiformis f. sp. tritici, even though many of these wheat lines are nearly isogenic in the Avocet Susceptible (AvS) background. AvS was reported to have Yr74, which is complementary to Yr73, to provide resistance to some P. striiformis f. sp. tritici races in Australia (43). It is likely that some of the Yr near-isogenic lines also have Yr74. Yuan et al. (36) reported that the virulence phenotype in the parental isolate (08-220, race PSTv-11) was heterozygous and segregated in its selfed progeny population (36). In contrast, the parental isolate and the progeny isolates in the present study were all virulent on AvS (Yr74). Therefore, the presence of Yr74 in the AvS background of many Yr single-gene lines did not influence the segregation ratios of the wheat lines with an AvS background. Thus, it is possible for two avirulence genes to interact with a single Yr gene. Second, the particular avirulence/virulence phenotype to a single Yr gene could be controlled by one gene in one isolate but by two or more genes in another isolate. Third, the complex inheritance in P. striiformis f. sp. tritici is also suggested by the fact that the avirulence/virulence phenotype to a single Yr gene could be dominant in one isolate but recessive in another. For example, the avirulence phenotype on Yr7 was determined by a single dominant avirulence gene in isolate 12-368 in the present study and in isolate GS-2013 (34) but was determined by a single dominant virulence gene in isolates CY32 (35) and PSTv-11 (36). A possible explanation for this is the underestimation of the interaction between a dominant avirulence gene and a dominant inhibitor gene by misclassification of the segregation of the 3:13 ratio as 1:3, at least in some cases. Taken together, the inheritance of avirulence/virulence and the interactions between avirulence/virulence-controlling genes in P. striiformis f. sp. tritici are isolate dependent. The complex interactions between fungal Avr genes and their corresponding R genes have been described in many plant pathosystems (9). The isolate-dependent nature of avirulence/virulence inheritance observed in P. striiformis f. sp. tritici has also been reported in other rust fungi (44–46). Until now, no universal genetic models have been available to explain such complex inheritance. Therefore, further identification and comparison of genes and genomic regions between different isolates showing different avirulence/virulence inheritance are needed to test the association of complex inheritance and the plasticity of the genomic environment.

Avirulence gene cluster in P. striiformis f. sp. tritici.

The QTL analysis and the GWAS mapped four Avr genes in a small genomic region (Table 4 and Fig. 3A and B), indicating a cluster of Avr genes in P. striiformis f. sp. tritici isolate 12-368. The existence of Avr/Vir clusters has also been revealed from different P. striiformis f. sp. tritici mapping populations in previous studies (34–36). However, due to the limited number of molecular markers (34, 35) or the lack of codominant markers (36), either previous studies were not able to identify cosegregating markers or the flanking markers were too far away from the cluster to precisely define the genomic intervals of the cluster. In contrast, QTL mapping with the highly improved reference genome enabled us to map an Avr cluster, AvYr44-AvYr7-AvYr43-AvYrExp2, in LG-22 to a single contig of the reference genome. Avr and effector genes usually reside in plastic genomic regions, and gene clusters are often located in such regions (47). Such Avr gene clusters are not uncommon in cereal rust fungi. For example, the Avr genes in the flax rust fungus M. lini were genetically mapped to four small regions, and the genes within each region were tightly linked and inherited as a unit (5). Several Avr genes have been cloned from these clusters. Interestingly, single Avr genes (e.g., AvrL567 and AvrM14) in M. lini controlling avirulence to several R genes have been reported (23, 24). Such a single Avr gene (allele) recognized by multiple R genes has also been reported in the pathosystem of Leptosphaeria maculans-oilseed rape (48). Syntenic analysis revealed that the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster resides in one of the contig termini, which was difficult to assemble in different isolates (Fig. 3D; Fig. S1). It will be interesting to determine whether this Avr cluster is located in a subtelomere-like region, similar to AvrPita from Pyricularia oryzae (49) and AvrStb6 from Z. tritici (50), or a region adjacent to a repetitive region of the genome. Further work using linked- or long-read sequencing, e.g., bacterial artificial chromosome sequencing, PacBio technology, or nanopore sequencing, is needed to dissect the cluster of AvYr44-AvYr7-AvYr43-AvYrExp2 loci for cloning the Avr genes and for providing a genomic basis for the rapid avirulence changes in P. striiformis f. sp. tritici. The genes identified in the first 20 kb of the contig can be studied for expression to determine if they are functionally associated with avirulence.

Large genetic map with a high recombination rate of P. striiformis f. sp. tritici.

Generally, our genetic map was consistent with the physical map (Data Set S3). For example, the markers of each large contig (>200 kb in length) were always located nearby in the genetic map. However, we noticed two types of inconsistencies. First, some of the markers from different contigs were interwoven in the genetic map, especially the markers from short contigs. We speculate that these might be due to either the highly repetitive nature of the P. striiformis f. sp. tritici genome or the high heterozygosity between two marker loci. Second, even though markers from the same contig were located nearby in the genetic map, their locations were not linearly correlated. This inconsistency might be caused by structural variations (e.g., genome rearrangement) between the parental and reference isolates or by possible genotyping errors from intrinsic sequencing bias and errors. An improved and haplotype-solved reference genome, especially from the parental isolate, as well as long-read sequencing with high accuracy (e.g., PacBio HiFi), is needed to solve such inconsistencies. We took the advantage of our previously assembled high-continuity reference genome for comparison between the genetic map and the physical map (38, 39). This enabled us to discover new genetic features from a sexual P. striiformis f. sp. tritici population. Our study showed that P. striiformis f. sp. tritici has a large genetic map with a total genetic distance of 7,715.0 cM, which is comparable to other rust fungi. In the pine fusiform rust fungus (C. quercuum f. sp. fusiforme), a genetic map of 3,006 cM was constructed using 421 (including 208 randomly amplified polymorphic DNA [RAPD], 34 simple sequence repeat [SSR], and 184 amplified fragment length polymorphism [AFLP]) markers (51). Similarly, Anderson et al. (24) generated a genetic map of 5,860 cM using 13,412 restriction site-associated DNA sequence (RADseq) markers in the flax rust fungus (M. lini). One of the factors contributing to the slightly larger map in the present study might be the genotyping platform used since markers from whole-genome sequencing could potentially cover the whole genome. In the present study, 68.81 Mb, out of the 84.53-Mb (81.4%) genome, were covered by the markers from whole-genome sequencing. In the M. lini study, a slightly lower coverage (68.9%) of the genome was anchored to the genetic map using RADseq markers (24). We also noticed that P. striiformis f. sp. tritici has a slightly higher recombination rate (1.81 cM/10 kb) (Table 2) than other fungal pathogens, e.g., 0.3 to 0.5 cM/10 kb in F. graminearum (52), 1.25 cM/10 kb in Z. tritici (53), and 1.18 cM/10 kb in M. lini (24). We speculate that the higher recombination rate in P. striiformis f. sp. tritici is associated with the high number of detected single- and double-crossover events. Compared with the average of 114.6 crossovers per F2 individual in M. lini, P. striiformis f. sp. tritici has a relatively high number of crossovers per individual, at 133.15 (Fig. 2C). In addition to the high number of crossover events, we also noticed that rust fungi with high recombination rates also have high percentages of CpG islands in their genomes. It has been proposed that the depletion of nucleosome occupancy in particular functional features such as CpG islands increases the accessibility of the recombination machinery (54). We found that the percentages of genome coverage by CpG islands in the basidiomycete rust fungi are higher than those in the ascomycete fungi in our comparison. We noticed that Z. tritici was an exception in its slightly higher recombination rate than that of M. lini but much lower CpG island coverage (Table 3). Because only a few plant-pathogenic fungi have genome-wide recombination rates available, it is not possible to explicitly determine the role of CpG islands in the variation of recombination rates across different plant pathogens. Further studies are needed to test this hypothesis when more plant pathogens have their genome-wide recombination rates available. It is also useful to further investigate the genomic features of recombination hot spot regions besides the CpG islands to explain the recombination rate variation across the genome and among different plant pathogens. In summary, our high-density genetic map reveals a generally high recombination rate of P. striiformis f. sp. tritici. More studies are needed to investigate the contribution of the high number of sexual recombination events to P. striiformis f. sp. tritici genome architecture and the rapid evolution of virulence. Moreover, our high-quality genetic map with dense markers will provide a valuable resource for anchoring genomic contigs to chromosomes. In conclusion, using molecular markers generated through whole-genome sequencing of a self-fertilized population, we generated a high-density genetic map for P. striiformis f. sp. tritici comprising 41 lineage groups. Moreover, the SNP and indel markers are attached to the sequences of fragments that have been mapped to the reference genome, which allows direct comparison with data from future similar studies. The high-density genetic map will be valuable to further anchor fragmented contigs to chromosomes. Furthermore, the avirulence gene cluster of AvYr44-AvYr7-AvYr43-AvYrExp2 was identified from QTL mapping and located at a short genome region through genome comparison. Further studies on the detailed genomic environment of this Avr cluster and cloning of these genes will shed light on the genomic basis of the rapid virulence changes of this destructive pathogen.

MATERIALS AND METHODS

Isolate selection, urediniospore multiplication, and teliospore production.

P. striiformis f. sp. tritici isolate 12-368 (race PSTv-4), collected from Washington State in the United States in 2012, was selected to generate a segregating population based on its avirulence/virulence profile and its abilities to produce telia and infect barberry plants. The purification, urediniospore multiplication, and teliospore production of the isolate were conducted according to a previously described procedure (36). Briefly, the pure isolate was obtained from a single uredinium. Urediniospores were multiplied on seedlings of the winter wheat cultivar Nugaines (37). The spring wheat cultivar Avocet Sensitive (AvS) was inoculated at the flag-leaf stage and grown under controlled conditions to produce telia.

Developing a sexual population.

A sexual population was developed by self-fertilization of isolate 12-368 according to procedures described previously (36, 55) (Fig. 1). Briefly, wheat leaves bearing telia were prepared by removing the epidermal layers and then placed on moist filter paper in petri dishes. Teliospore germination and basidiospore production were checked under a microscope periodically. Suspensions of germinated teliospores and produced basidiospores were used to inoculate 10-day-old barberry leaves. After incubation in a dew chamber at 10°C in the dark with 100% relative humidity for 36 to 48 h, the inoculated barberry plants were transferred to a growth chamber for pycnial production. For self-fertilization, the nectar containing pycniospores from one pycnium was transferred to another pycnium. To avoid duplicated fertilization of a pycnium, the transfer of nectar was conducted in one direction such that the nectar from the first pycnium was transferred to the second pycnium but the nectar from the second pycnium was never retransferred back to the first pycnium or another pycnium. As expected, in this way, an aecium was produced only in the opposite leaf surface of the second pycnium but not in the first pycnium. Usually, a cluster of aecial cups was formed after one pycnium was fertilized. When an aecium was mature, only a single aecial cup was excised with a sterile razor blade, the aeciospores from the aecial cup were used to inoculate seedlings of wheat cultivar Nugaines, and the leftover cups within one aecium were stored in liquid nitrogen for backup. About 13 to 15 days after aeciospore inoculation, a single uredinium from an inoculated Nugaines leaf was isolated, and the urediniospores from the uredinium were used to inoculate Nugaines seedlings again to multiply enough urediniospores, which were considered one progeny isolate produced from infection by a single aeciospore. The increased urediniospores of both parental isolate 12-368 and progeny isolates were used for virulence testing and DNA extraction.

Virulence phenotyping and genetic analysis.

A total of 34 wheat genotypes, each carrying a single Yr resistance gene, were used to obtain avirulence/virulence phenotypes of isolate 12-368 and the progeny isolates. These 34 Yr genes were Yr1, Yr5, Yr6, Yr7, Yr8, Yr9, Yr10, Yr15, Yr17, Yr24, Yr27, Yr32, Yr43, Yr44, YrSP, YrTr1, YrExp2, Yr76, Yr2, Yr21, Yr25, Yr26, Yr28, Yr29, Yr31, Yr35, YrCV, YrTr1, YrCN19, YrA, YrAvS, Yr45, Yr53, and Yr64. Five to seven seeds of each wheat line were planted for each isolate test. About 18 days after inoculation, the IT data of an isolate on each plant were recorded based on a scale of 0 to 9 (56). As genetically pure seeds of each Yr single-gene line were specially produced and each P. striiformis f. sp. tritici isolate was obtained initially from a single uredinium and carefully multiplied, the up to seven plants of each line in a single isolate test mostly had identical ITs. In cases where different ITs were observed, the identical ITs of most plants or at least three plants were used. An isolate was considered avirulent to a specific Yr gene when the IT was 0 (inoculated leaves showing no visible symptoms), 1 (showing necrotic or chlorotic flecks), 2 (showing necrotic or chlorotic blotches without sporulation), or 3 to 6 (showing necrotic or chlorotic blotches with trace to moderate sporulation) or virulent when the IT was 7 to 9 (showing abundant sporulation with or without necrosis or chlorosis) (37). Since urediniospores of isolate 12-368 are dikaryotic, typical for P. striiformis f. sp. tritici and other rust fungi, we assumed that the parental isolate is heterokaryotic or “heterozygous” for many loci and therefore treated it as an F1 generation in the present study. The progeny isolates generated by selfing isolate 12-368 were considered the F2 generation. Therefore, the segregation of virulence phenotypes and molecular markers was expected to follow the segregation ratios of an F2 population. The chi-squared test was used to determine whether the observed segregation of A/V phenotypes on a particular wheat line fit a theoretical genetic ratio.

DNA extraction and whole-genome sequencing.

Genomic DNA was extracted from urediniospores using the cetyltrimethylammonium bromide (CTAB) method as previously described (57). The quality and quantity of the extracted DNA were checked using a NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific) and 0.8% agarose (Thermo Fisher Scientific) gel electrophoresis. DNA libraries were constructed and whole-genome sequencing was performed using Illumina HiSeq PE150 technology (Novogene Co. Ltd., Sacramento, CA).

Genomic variation calling.

For each isolate, Illumina sequencing reads were checked and trimmed using Trimmomatic (version 0.36) (58). Parameters were set as LEADING:10 TRAILING:10 SLIDINGWINDOW:4: 15 MINLEN:50 in the paired-end (PE) model. After trimming, reads with lengths of less than 50 bp or that were not paired were excluded from subsequent analyses. Potential sequencing errors in trimmed reads were corrected using Lighter software (59), with the parameter set as –K 21 90000000. The high-quality and error-corrected Illumina reads were used for genomic variation calling. The previously assembled genome of U.S. isolate P. striiformis f. sp. tritici 93-210 was used as a reference genome (38) since it has a well-assembled genome (38). Also collected from the U.S. Pacific Northwest, the reference isolate was identified as belonging to race PSTv-20 and was avirulent to all 18 wheat Yr single-gene lines, except Yr17, used to differentiate P. striiformis f. sp. tritici races in the United States (37). Genome-wide markers were identified according to a previously proposed framework (60). Briefly, the Burrows-Wheeler alignment (BWA) tool version 0.7.15 (61) was used to index the sequence reads to the reference genome, and the mem algorithm with default parameters was used to map filtered paired-end Illumina reads to the indexed reference genome. Next, Genome Analysis Toolkit (GATK) version 3.3 (62) was used to identify genomic variations. RealignerTargetCreator software was used to define interval targets for local realignment, and IndelRealigner was used to perform indel realignment. Two rounds of genomic variation calling were performed using GATK HaplotypeCaller with default parameters, as described previously (38, 63). VCFtools software (version 0.1.13) was used to manipulate SNPs and indels stored in vcf format. SNPs and indels of high confidence were filtered using VCFtools, with parameters set as –min-alleles 2 –max-alleles 2 –minQ 1000 –min-meanDP 30 –max-meanDP 60 –max-missing 1 (meaning that only biallelic SNPs and indels with a minimum quality of 1,000, a minimum mean depth of 30, a maximum mean depth of 60, and no missing data were used). By setting –min-meanDP 30 –max-meanDP 60, the variations in repetitive regions could be partially filtered since our sequencing depth ranged from 30× to 60×. To further remove potentially problematic variations in repetitive regions, we filtered SNPs and indels from transposable element (TE) regions as previously defined (38, 39). By doing this, genome-wide variations from only non-TE regions were kept for subsequent analyses. Moreover, duplicate markers (that is, markers with identical genotypes across all isolates) were identified and excluded from the construction of maps. To further reduce the computational burden, only one marker was selected within a 1,000-bp sequence region. The markers were named as contig no._position. For example, C001_1000 is the marker located in contig 1.001 at bp 1000 in the reference genome of P. striiformis f. sp. tritici 93-210 (38).

Genetic map construction and QTL mapping.

Before map construction, we first checked the segregation ratios of markers. In general, markers with P values from chi-squared tests of ≥0.05 for the 1 (homozygous AA):2 (heterozygous AB):1 (homozygous BB) ratio expected for a single locus were used for map construction. In total, 2,637 high-quality genome-wide markers and 94 progeny isolates were used for map construction. Genetic map construction was performed using qtl version 1.41-6 (64, 65) and ASMap version 1.0-2 (66) in the R package. The qtl program was used to analyze the genotypes. Even though the progeny population generated from selfing the isolate 12-368 could be considered an F2 population, the linkage phases were unknown due to the lack of paternal and maternal isolates. Therefore, some markers might have switched alleles. This issue was solved by checking logarithm of the odds (LOD) scores against the estimated recombination fractions for all marker pairs using the qtl program. The markers were considered to have alleles switched when they were tightly associated with other markers but had recombination fractions of >0.5, which were corrected according to a procedure described by Broman et al. (64). After correcting allele switches, the ASMap package using the minimum spanning tree algorithm was used to infer linkage groups and optimally order markers, with parameters set as dist.fun=“kosambi,” bychr=FALSE, p.value=1e-10, anchor=TRUE, noMap.dis=20, and detectBadData=TRUE. The genotyping errors and rates were calculated using the qtl package, and the number of observed crossovers per individual was estimated using the ASMap package. The genetic map was reconstructed until no genotype errors were detected. In addition, we excluded isolates that were potentially contaminated if their large proportion of alternative alleles was absent from the parental isolate. Analysis with such potentially contaminated isolates showed that these isolates had >3-times-larger numbers of crossover events than other isolates. The number of single- and double-crossover events per individual was calculated using the countXO function in the R/qtl package (64, 65). After the genetic map was constructed, we explored genome-wide features that might contribute to the high recombination rate of P. striiformis f. sp. tritici. The AT-rich regions were determined using OcculterCut v1.1 with default parameters (67). Since recombination often increases at the CpG islands in mammalian genomes (68), we also analyzed the distribution of CpG islands in P. striiformis f. sp. tritici and several other plant pathogens with genetic maps available. CpG islands were identified using CpGcluster v2.0 software (69), with a distance threshold of 75 and a P value of 1E−3. The genetic map generated with ASMap was used for QTL mapping of Avr genes, and QTL mapping was performed using the qtl package. First, the QTL genotype probabilities given the available marker genotypes were calculated using the calc.genoprob function, with parameters set as step=1, error.prob=0.02, map.function=“kosambi,” and stepwidth=“fixed.” The recorded ITs were used as phenotypes for QTL mapping. The QTL mapping approach was used, instead of single-gene locus-like markers, to map Avr genes because it was not always possible to classify isolates into homozygous avirulent, homozygous virulent, and heterozygous as for the genotypic data. Since the ITs did not follow a normal distribution, the nonparametric interval mapping method was selected by setting model=“np” in the scanone function. Instead of using an arbitrary LOD threshold for all phenotypes, we calculated 5% LOD thresholds via a permutation test for each phenotype by setting n.perm=1000. To determine the QTL confidence intervals, the Bayesian credible intervals were calculated using the bayesint function. QTL mapping was performed separately for the segregating phenotype data on each wheat Yr single-gene line.

Association analysis and genomic environment of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster.

To validate our QTL mapping results and to further identify potential Avr-associated variations, we used all the isolates from the sexual population for association analysis. Here, we focused on AvYr44, AvYr7, AvYr43, and AvYrExp2 since (i) each of the avirulence phenotypes of these genes was controlled by a single gene and (ii) these four genes were mapped to a cluster in the genetic map and located in a single contig (contig 1.022) (see Results) in the reference genome. Genomic variations in contig 1.022 with <20% missing data were retrieved, and all the sequence variations were used for the GWAS, including those in repetitive regions and those with P values by chi-squared tests of less than 0.05. The FarmCPU method implemented in GAPIT v3 software was used to perform association analysis with three principal components as covariates (70). The associations between genomic variations and virulence phenotypes were considered significant if the P value was <1E−05 and the false discovery rate-adjusted P value (P_FDR_adj) was 0.01. Significantly associated variations were annotated based on our previous 93-210 reference genome (38). To investigate the genomic environment of the AvYr44-AvYr7-AvYr43-AvYrExp2 cluster, we compared the homologous regions of this cluster among three well-assembled P. striiformis f. sp. tritici genomes, namely, isolates 93-210 (38), 11-281 (41), and 104E 137A− (40). First, contig 1.022 of 93-210 was subjected to a BLAST search against the 11-281 and 104E 137A− genome sequences to detect potential homologous contigs. Next, the three homologous contigs were aligned using progressiveMauve software (71), and the generated alignment in a “.backbone” file was visualized using genoPlotR (72).

Data availability.

All data sets generated for this study are included in the figures, tables, and supplemental material of the article. The complete set of sequence data was deposited in the National Center for Biotechnology Information (NCBI) database under accession no. PRJNA599033 and in the SRA under accession no. SRP239501. Further inquiries can be directed to the corresponding authors.
  64 in total

Review 1.  Life, death and rebirth of avirulence effectors in a fungal pathogen of Brassica crops, Leptosphaeria maculans.

Authors:  Thierry Rouxel; Marie-Hélène Balesdent
Journal:  New Phytol       Date:  2017-01-13       Impact factor: 10.151

Review 2.  Phakopsora pachyrhizi, the causal agent of Asian soybean rust.

Authors:  Katharina Goellner; Marco Loehrer; Caspar Langenbach; Uwe Conrath; Eckhard Koch; Ulrich Schaffrath
Journal:  Mol Plant Pathol       Date:  2010-03       Impact factor: 5.663

3.  Secreted protein gene derived-single nucleotide polymorphisms (SP-SNPs) reveal population diversity and differentiation of Puccinia striiformis f. sp. tritici in the United States.

Authors:  Chongjing Xia; Anmin Wan; Meinan Wang; Derick A Jiwan; Deven R See; Xianming Chen
Journal:  Fungal Biol       Date:  2016-03-03

4.  Century-old mystery of Puccinia striiformis life history solved with the identification of Berberis as an alternate host.

Authors:  Yue Jin; Les J Szabo; Martin Carson
Journal:  Phytopathology       Date:  2010-05       Impact factor: 4.025

5.  First Report of Oregon Grape (Mahonia aquifolium) as an Alternate Host for the Wheat Stripe Rust Pathogen (Puccinia striiformis f. sp. tritici) Under Artificial Inoculation.

Authors:  M N Wang; X M Chen
Journal:  Plant Dis       Date:  2013-06       Impact factor: 4.438

Review 6.  The Top 10 fungal pathogens in molecular plant pathology.

Authors:  Ralph Dean; Jan A L Van Kan; Zacharias A Pretorius; Kim E Hammond-Kosack; Antonio Di Pietro; Pietro D Spanu; Jason J Rudd; Marty Dickman; Regine Kahmann; Jeff Ellis; Gary D Foster
Journal:  Mol Plant Pathol       Date:  2012-05       Impact factor: 5.663

7.  Quantitative trait locus mapping of melanization in the plant pathogenic fungus Zymoseptoria tritici.

Authors:  Mark H Lendenmann; Daniel Croll; Ethan L Stewart; Bruce A McDonald
Journal:  G3 (Bethesda)       Date:  2014-10-29       Impact factor: 3.154

8.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

9.  Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph.

Authors:  Yonghui Wu; Prasanna R Bhat; Timothy J Close; Stefano Lonardi
Journal:  PLoS Genet       Date:  2008-10-10       Impact factor: 5.917

10.  High-Resolution Mapping of Crossover and Non-crossover Recombination Events by Whole-Genome Re-sequencing of an Avian Pedigree.

Authors:  Linnéa Smeds; Carina F Mugal; Anna Qvarnström; Hans Ellegren
Journal:  PLoS Genet       Date:  2016-05-24       Impact factor: 5.917

View more
  3 in total

1.  Identification of Secreted Protein Gene-Based SNP Markers Associated with Virulence Phenotypes of Puccinia striiformis f. sp. tritici, the Wheat Stripe Rust Pathogen.

Authors:  Qing Bai; Meinan Wang; Chongjing Xia; Deven R See; Xianming Chen
Journal:  Int J Mol Sci       Date:  2022-04-08       Impact factor: 6.208

Review 2.  Current Status and Future Perspectives of Genomics Research in the Rust Fungi.

Authors:  Chongjing Xia; Age Qiu; Meinan Wang; Taiguo Liu; Wanquan Chen; Xianming Chen
Journal:  Int J Mol Sci       Date:  2022-08-25       Impact factor: 6.208

3.  Potential Infection Risks of the Wheat Stripe Rust and Stem Rust Pathogens on Barberry in Asia and Southeastern Europe.

Authors:  Parimal Sinha; Xianming Chen
Journal:  Plants (Basel)       Date:  2021-05-11
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.