| Literature DB >> 30288743 |
Gaetan J A Thilliez1,2, Miles R Armstrong1, Tze-Yin Lim3, Katie Baker3, Agathe Jouet4, Ben Ward5, Cock van Oosterhout6, Jonathan D G Jones4, Edgar Huitema2, Paul R J Birch1,2, Ingo Hein1,2.
Abstract
The oomycete pathogens Phytophthora infestans and P. capsici cause significant crop losses world-wide, threatening food security. In each case, pathogenicity factors, called RXLR effectors, contribute to virulence. Some RXLRs are perceived by resistance proteins to trigger host immunity, but our understanding of the demographic processes and adaptive evolution of pathogen virulence remains poor. Here, we describe PenSeq, a highly efficient enrichment sequencing approach for genes encoding pathogenicity determinants which, as shown for the infamous potato blight pathogen Phytophthora infestans, make up < 1% of the entire genome. PenSeq facilitates the characterization of allelic diversity in pathogen effectors, enabling evolutionary and population genomic analyses of Phytophthora species. Furthermore, PenSeq enables the massively parallel identification of presence/absence variations and sequence polymorphisms in key pathogen genes, which is a prerequisite for the efficient deployment of host resistance genes. PenSeq represents a cost-effective alternative to whole-genome sequencing and addresses crucial limitations of current plant pathogen population studies, which are often based on selectively neutral markers and consequently have limited utility in the analysis of adaptive evolution. The approach can be adapted to diverse microbes and pathogens.Entities:
Keywords: zzm321990Phytophthora capsicizzm321990; zzm321990Phytophthora infestanszzm321990; PenSeq; RXLR effectors; avirulence; population genomics; virulence
Mesh:
Substances:
Year: 2018 PMID: 30288743 PMCID: PMC6492278 DOI: 10.1111/nph.15441
Source DB: PubMed Journal: New Phytol ISSN: 0028-646X Impact factor: 10.151
Proportion of on‐target reads for the T30‐4 and LT1534 reference genomes
| MM | Read input | Reads mapped | Bait design sites (100% sequence identity) | Bait binding sites (min 80% sequence identity) | ||||
|---|---|---|---|---|---|---|---|---|
| Reads | Percentage of total reads (%) | Percentage of mapped reads (%) | Reads | Percentage of total reads (%) | Percentage of mapped reads (%) | |||
| T30‐4 | ||||||||
| 1% | 1457 534 | 1136 644 | 427 792 | 29.35 | 37.64 | 573 168 | 39.32 | 50.43 |
| 2% | 1457 534 | 1262 455 | 461 847 | 31.69 | 36.58 | 621 805 | 42.66 | 49.25 |
| 5% | 1457 534 | 1349 756 | 484 648 | 33.25 | 35.91 | 654 139 | 44.88 | 48.46 |
| LT1534 | ||||||||
| 1% | 2085 062 | 971 963 | 432 549 | 20.75 | 44.50 | 646 163 | 30.99 | 66.48 |
| 2% | 2085 062 | 1231 333 | 520 357 | 24.96 | 42.26 | 831 119 | 39.86 | 67.50 |
| 5% | 2085 062 | 1475 423 | 619 139 | 29.69 | 41.96 | 990 977 | 47.53 | 67.17 |
The on‐target rate for PenSeq reads (Read input) is shown for the Phytophthora infestans T30‐4 genome and the P. capsici LT1534 reference genome. High‐quality PenSeq reads are mapped to the respective reference isolates under different mismatch (MM) conditions. A 1% mismatch rate allows for a maximum of one sequence polymorphism in a 100‐bp target sequence, whereas 2% and 5% allow for up to two and five polymorphisms, respectively.
Figure 1Representation of target gene coverage in (a) Phytophthora infestans reference strain T30‐4 and (b) P. capsici reference strain LT1534 at a high‐stringent, 1% mismatch mapping rate. The x‐axis represents the percentage gene coverage of RXLRs (blue) and non‐RXLR target genes (red), which ranges from 0 (not covered) to 1 (100% sequence representation with PenSeq reads). The y‐axis reveals the average read depth per kb of the target genes.
Sequence coverage of effectors that are absent from the reference genomes across diverse isolates
| Effector | Sequence coverage in isolates (0 = 0%–1 = 100%) | |||||
|---|---|---|---|---|---|---|
|
| T30‐4 | 3928A | 88069 | 110059 | 110153 | EC1‐C7 |
| PITG_04085 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_04089 | 0 | 0.92 | 0.76 | 0.96 | 0.60 | 0.63 |
| PITG_04097 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_04099 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_04182 | 0 | 0.97 | 1 | 1 | 0 | 1 |
| PITG_04279 | 0 | 0 | 1 | 1 | 1 | 1 |
| PITG_12010 | 0 | 0 | 1 | 1 | 1 | 1 |
| PITG_16282 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_16283 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_16285 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_19800 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_21107 | 0 | 1 | 0 | 0 | 0 | 1 |
| PITG_21778 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_22724 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_22727 | 0 | 1 | 1 | 1 | 1 | 1 |
| PITG_23011 | 0 | 1 | 0 | 1 | 0 | 0 |
The sequence coverage of 16 Phytophthora infestans effectors that were not enriched in the T30‐4 genome is shown for the P. infestans isolates 3928A (13_A2), 88069, 110059 (US23), 110153 (US24) and EC1‐C7 (EC1). The sequence coverage of the P. capsici effectors that were not identified in the LT1534 reference gene is shown for the P. capsici isolates LT123, Pc204, LT6535, Y006 and Q108. The sequence coverage with PenSeq‐derived reads is shown as a percentage, ranging from 0 (no coverage) to 1 (full coverage).
Figure 2(a) Comparative nucleotide diversity study of 259 RXLRs and 174 non‐RXLRs across six Phytophthora infestans isolates. The mean (± SE) nucleotide diversity of the RXLRs (1.63 × 10−3 (± 1.10 × 10−4)) is significantly higher than that of non‐RXLRs (1.14 × 10−3 (± 0.90 × 10−4)) (Mann–Whitney test, W = 58962.5, P = 0.0282 (adjusted for ties)). (b) Nucleotide diversity calculated for 433 P. infestans PITGs, with 259 RXLRs (blue) and 174 non‐RXLRs (red) across six P. infestans isolates. An outlier analysis identified seven PITGs (PITG_06432, PITG_07947, PITG_10396, PITG_11484, PITG_13018, PITG_14986 and PITG_18325) which all encode for RXLRs and display significantly elevated nucleotide diversity (πi), indicative of non‐neutral evolution. The difference between the RXLR and non‐RXLR distributions of πi is driven by these seven outlier loci, without which the difference in πi is no longer significant (Mann–Whitney test, W = 55952.5, P = 0.0802 (adjusted for ties)).
Figure 3Presence/absence variations of Phytophthora infestans Avr genes. The x‐axis represents the nucleotide sequence of the full‐length reference genes Avr1 (PITG_16663), Avr2 (PITG_22870), Avr3a (PITG_14371), Avr3b (PITG_18215), Avr4 (PITG_07387), Avr8/AvrSmira2 (PITG_07558), Avr10 (PITG_11484), AvrSmira1 (PITG_07550) and Avr_vnt1 (PITG_16294), and the y‐axis represents the PenSeq read coverage. PenSeq reads from the isolates T30‐4, 88069, EC1‐C7 (EC1), 3928A (13_A2), 110059 (US23) and 110153 (US24) are shown in distinct colours. Mapping of the reads is carried out at 1% mismatch rate
Sequence diversity within selected, known Phytophthora infestans Avr genes
| Gene name | Amino acid (s) | Cons. | Ref. | Alt. | Isolates | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| T30‐4 | 3928A (13_A2) | 88069 | 110059 (US23) | 110153 (US24) | EC1‐C7 | |||||
|
| S19C | NS | A | T | A | A | A | A/T | A | A |
| S19S | S | C | T | C/T | C | C | C | C | C | |
| E80K | NS | G | A | G | G | G | G/A | G | G | |
| M103I | NS | G | T | G | G | G | G/T | G | G | |
| L121L | S | T | C | T | T | T | T/C | T | T | |
| R124G | NS | C | G | C | C | C | C/G | C | C | |
|
| R41L | NS | C | A | C | . | . | A | . | . |
| G85R | NS | C | G | C | . | . | G | . | . | |
| R124K | NS | C | T | C | . | . | T | . | . | |
|
| L139S | NS | T | C | T | . | C | T | C | C |
| L221V | NS | T | G | T | . | G | T | G | G | |
|
| P107P | S | A | G | A | A | G | A/G | G | G |
|
| R2R | S | T | G | T/G | T/G | T/G | G | T | T/G |
| L26L | S | C | A | C | C | C/A | C | C | C | |
| A113A | S | T | C | T | T/C | T | T | C | T | |
| A134G | NS | C | G | C | C/G | C | C | ‐ | C | |
| K143N | NS | A | C | A | A/C | A | A | ‐ | A | |
|
| R78K | NS | G | A | G | . | G/A | . | . | G |
|
| D84A | NS | T | G | T/G | T/G | T | T | T/G | T |
| D84N | NS | C | T | C | C/T | C | C | C/T | C/T | |
| D118G | NS | T | C | T | T/C | T | T | T/C | T/C | |
| D118N | NS | C | T | C/T | C/T | C | C | C/T | C | |
| K122R | NS | T | C | T | T | T | T | T | T/C | |
|
| A44A | S | T | C | T | T | T/C | T/C | T/C | T |
| P45P | S | T | A | T | T | T/A | T/A | T/A | T | |
| N123N | S | C | T | C | C | C | C | C | T | |
| K131R | NS | A | G | A | A/G | G | G | G | . | |
| M156L | NS | A | T | T | A/T | A | A | A | . | |
| L162L | S | G | A | G | G | . | A | G | G | |
| R170Q | NS | G | A | G | G | . | . | A | . | |
|
| N18D | NS | A | G | A | A | A | A/G | G | A/G |
The sequence diversity of selected known P. infestans Avr genes is shown in isolates T30‐4, 3928A (13_A2), 88069, 110059 (US23), 110153 (US24) and EC1‐C7 (EC1). The nucleotide sequence variations (Alt.) compared with the reference allele (Ref.), which refers to the T30‐4 genome sequence from Haas et al. (2009), are shown. The consequence (Cons.) of the variation is marked: S, synonymous polymorphism; NS, nonsynonymous variation. Sites that failed the single nucleotide polymorphism (SNP) filtering parameters are shown as (‐). (.), absence of sequences at the site and therefore the absence of polymorphisms.