| Literature DB >> 32647195 |
Celine Prakash1,2, Friederike Ch Trognitz3, Peter Venhuizen4, Arndt von Haeseler1,5, Bodo Trognitz6,7.
Abstract
SolariX is a compendium of DNA sequence tags from the nucleotide binding site (NBS) domain of disease resistance genes of the common potato, Solanum tuberosum Group Tuberosum. The sequences, which we call NBS tags, for nearly all NBS domains from 91 genomes-representing a wide range of historical and contemporary potato cultivars, 24 breeding programs and 200 years-were generated using just 16 amplification primers and high-throughput sequencing. The NBS tags were mapped to 587 NBS domains on the draft potato genome DM, where we detected an average, over all the samples, of 26 nucleotide polymorphisms on each locus. The total number of NBS domains observed, differed between potato cultivars. However, both modern and old cultivars possessed comparable levels of variability, and neither the individual breeder or country nor the generation or time appeared to correlate with the NBS domain frequencies. Our attempts to detect haplotypes (i.e., sets of linked nucleotide polymorphisms) frequently yielded more than the possible 4 alleles per domain indicating potential locus intermixing during the mapping of NBS tags to the DM reference genome. Mapping inaccuracies were likely a consequence of the differences of each cultivar to the reference genome used, coupled with high levels of NBS domain sequence similarity. We illustrate that the SolariX database is useful to search for polymorphism linked with NBS-LRR R gene alleles conferring specific disease resistance and to develop molecular markers for selection.Entities:
Mesh:
Year: 2020 PMID: 32647195 PMCID: PMC7347568 DOI: 10.1038/s41598-020-67848-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Capture of NBS domains of the R gene pool. (a) R-genes consist of a nucleotide binding (NB) and a leucine rich repeat (LRR) domain. They further have either the toll/interleukin-1 receptor (TIR) or the coiled-coil (CC) domain. Amplification primers were chosen to be complementary to one of the highly conserved P-loop, Kinase-2 and GLPL motifs within the NBS domain. PCR primers were designed using degeneracy at individual nucleotides to match the sequence diversity across 435 distinct NBS nucleotide sequences, obtained from tblastn searches in R genes mapped by Jupe et al.[13]. (b) The number of cultivars sequenced per country across Europe.[We adapted the map based on the File “Blank map of Europe cropped.svg” at https://commons.wikimedia.org/wiki/File:Blank_map_of_Europe_cropped.svg#file provided by ‘Maix’ (me@de) under a CC BY SA 3.0 Unported license (https://creativecommons.org/licenses/by-sa/3.0/legalcode).] The graphic in (b) is available under this same Creative Commons BY SA 3.0 Unported license. (c) The sequence of steps involved in targeted amplification of NBS domain containing genomic fragments.
Degenerate primers for selective PCR amplification of sequence tags from NB-LRR (Nucleotide Binding site and Leucine Rich Repeat) domains of R (Resistance) genes in the potato genome
| Primer name | Primer Sequence 5′–3′ | Annealing temperature (°C) | ||
|---|---|---|---|---|
| kinase2_1 | TNVTNITNITDGAYGAYNTNTGG | 60 | ||
| kinase2-3 | TNITTGTNCTTGATGAYRTNRA | 55 | ||
| kinase2-4 | TYSTYHTWGATGAYRTNTGG | 55 | ||
| NBS5 | YYTKRTHGTMITKGATGATGTITGG | 55 | ||
| ploop_R1 | GBATGGGNGGRCAAGGNAAAAC | 60 | ||
| ploop_R2-2 | GGHATDGSIGGHITNGGNAARAC | 55 | ||
| ploop1 | GGIGGINTRGGIAARACRAC | 50 | ||
| ploop_PGT | CCIGGIACIGGIAARACIAC | 60 | ||
| ploop_Sw5-2 | TGGGYGGWNTIGGNAATACHAC | 55 | ||
| ploop_Mi | GGNATGCCNGGNINIGGNAARAC | 60 | ||
| GLP-2 | GCDMMMGGNCWTCCDTTAGC | 55 | ||
| GLPL2-2 | TGYMARGGRYTKCCNYTDRYVVT | 55 | ||
| GLPL2-3 | TGYARRGGRYTDCCTYTDDYDVT | 55 | ||
| GLPL3-1 | TGYVRDGGNITRCCNYTDDC | 55 | ||
| GLPL3-2 | TGTGGMGGRTTGCCTCTCTTC | 55 | ||
| Additional degenerate primers for amplification using a MiSeq apparatus (Illumina), of | ||||
| R2ch4-F1 | I*GTGTTTGAGATCAACTCTATTGCTAATG | 55 | ||
| R2ch4-F2 | I*CAATTGTTGTATTGAGCGGACT | 55 | ||
| R2ch4-F3 | I*GGAAAGATGTTGACCCTGTTG | 55 | ||
| R2ch4-F4 | I*TGTGCAGTGATAACAGCTTCA | 55 | ||
| R2ch4-R2 | I*GCTGCTAATGTTGTTTAGGGAGT | 55 | ||
| R2ch4-R3 | I*TGGATCGAAGAACATAATTGACC | 55 | ||
| R2ch4-R4 | I*AATGACTCTGCTTCCATTCTTG | 55 | ||
| R2ploop1 | I*GAAWGGGBGGHTTRGGCAAGAC | 55 | ||
| R2ploop2 | I*GCATGGGHGGWTTRGGCAAGAC | 55 | ||
| R2ploop3 | I*GTATGGGMGGWTTRGGCAAGA | 55 | ||
| R2ploop4 | I*GGATGGGTGGATTGGGCAAGA | 55 | ||
| R2GLPL | I*AGRAGAAAGTTGGKAYCTCTTT | 55 | ||
| R2Kinase2 | I*GATGATBTRYGGMAKADWGAAG | 55 | ||
Primer nucleotide sequence and individual annealing temperature for optimal amplification results.
I*- the Illumina adapter for MiSeq sequencing: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG.
The universal adapter primer for Illumina MiSeq was: GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGACTCGATTCTCAACCCGAAAG.
Summary information on potato cultivars and breeding clones used in SolariX, and their breeders.
| Country | No breeders | No cultivars |
|---|---|---|
| Austria | 1 | 3 |
| Canada | 1 | 1 |
| Denmark | 1 | 2 |
| France | 4 | 12 |
| Germany | 5 | 27 |
| Hungary | 2 | 6 |
| India | 1 | 1 |
| Ireland | 1 | 1 |
| Netherlands | 4 | 11 |
| Poland | 2 | 7 |
| Spain | 2 | 9 |
| UK | 2 | 7 |
| USA | 2 | 2 |
| Total | 28 | 89 |
Sources European Cultivated Potato Database, SASA Edinburgh, UK, www.europotato.org/. For a detailed presentation of the materials, see Supplementary Table S2.
Figure 2NBS domain sequencing statistics. (a) Percentage of total sequenced reads that were mapped to the doubled-monoploid (DM) potato clone version 4.03 genome and the distribution of the mapped reads within NBS domains. (b, c) Enriched sequencing coverage at NBS domains of R genes illustrates the specificity of the NBS-capture protocol—distributions of read coverage, normalized by length of the region (b) and fraction of bases in the region with non-zero read coverage (c) for all 96 sequencing libraries at annotated regions: Non-resistance genes, resistance genes, NB domains of R genes, Non NB domains of R genes (boxplots left to right). (d) NBS domain detection rate among the cultivars—the number of cultivars (y-axis) in which we observed at least 1 read mapped to the NBS domain (x-axis) in our NBS-capture sequencing data, shown for each of the 587 domains. (e) Extent of the NBS domains captured for each cultivar—The number of NBS domains (y-axis) that had at least one read mapped in the NBS-capture sequencing data of each cultivar (x-axis), shown for each cultivar.
Figure 3NBS domain sequence variation. (a) The number of polymorphisms (y-axis) detected among the cultivars (x-axis) at sites within NBS domains that had at least ×10 coverage in all of the 96 cultivars analyzed. (b) The distributions of number of polymorphisms per 100 bp (y-axis), at positions within NBS domains that had at least tenfold coverage in all libraries, over the different NBS-LRR R gene clusters (x-axis) for the 96 libraries. *Lengths of each cluster (shown in brackets in the x-axis labels) were computed by the sum, for all R genes assigned to the cluster, of the lengths of regions within NBS domains that had at least ×10 coverage in all of the 96 cultivars. Clusters with less than 100 bp of at least ×10 coverage in NBS domains of all cultivars have boxplots shaded grey, as the short stretches evaluated in this comparison are unlikely to capture the true extent of the number of polymorphisms in these regions.
Figure 4The SolariX website at www.cibiv.at/SolariX provides information obtained from sequencing the NBS domains of 96 cultivars or samples. (a) Screenshot of the Quick Guide to navigating the SolariX website. (b) Screenshot of the IUPAC sequence per cultivar at the NBS domain of the resistance gene PGSC0003DMG400011525. Clicking on a variant provides read coverage support for each nucleotide of this sequence tag.
Figure 5(a) Average read count (log scale for y-axis) and (b) number of haplotypes phased for 13 cultivars (MFII, TPS67, poolcrpR, poolcrpS, SarpoMira, Alegria, Granola, Russet Burbank, R1, R3, Danva and Balatony Rozsa) at 38 NBS regions on chromosome XII (x-axis). The black 'error' lines represent the minimum and maximum of the 13 cultivars rather than standard deviation.
Figure 6Inferring genetic similarity between cultivars based on variations in read coverage for nucleotide binding site domains. Read coverage frequencies (RCF), by resistance (R) locus on the reference genome DM, of all 96 cultivars represented as a compressed heat map. The sum of RCFs over R loci per cultivar is 1. The dendrogram represents hierarchical clustering based on the Euclidean distance between cultivar RCFs. The color key at the top right corner illustrates the correspondence between RCF value (x-axis) and color. A histogram (y-axis) of RCF values observed in the heatmap (turquoise line) is included in the color key. Cultivars were classified into the following groups (letters in parentheses at the labels of the dendrogram tips) – Five libraries representing parents and pooled progenies of population MT (X); Four libraries of our outgroup taxon, S. caripense, consisting of parents C1 and K4 and two pools of cross progenies (Y); Six individuals registered before 1930 (1810–1929) (a); Six late blight resistance differentials (ca 1950–1970) (b); Four cultivars released 1972–1975 in Germany and Poland (c); Nine contemporary cultivars registered by seven breeders between 1999 and 2011 in four countries (d); Two cultivars of (partially) American descent (e); All other sequenced cultivars ( )
SNPs showing patterns proportional to late blight resistance in S. caripense (CRP).
| SNP position† | C1‡ | K4 | PoolR | PoolS | Resistant progeny | Susceptible progeny | # No match§ | Chromosome IX; 15,686,521 bp¶ |
|---|---|---|---|---|---|---|---|---|
| Locus PGSC0003DMG400004413 Illumina: 37 SNPs | ||||||||
| 004 | C/T | T | T | C/T | T | C/T | 13 | 004; C, T |
| 100 | A/T | A | A/T | A/T | A/T | A | 14 | 095; A, G, T |
| 109 | A/T | T | T | A/T | T | A/T | 7 | – |
| 198 | A/G | A | A/G | A/G | A | A/G | 6 | 197; A, G |
| 237 | C/T | C | C | C/T | C | C/T | 6 | – |
| 259 | A/G | G | G | A/G | G | A/G | 5 | – |
| 275 | C/G | G | G | C/G | G | C/G | 6 | – |
| 296 | C/T | T | T | C/T | T | C/T | 5 | 294; T, C |
| 301 | C/T | C | C | C/T | C | C/T | 6 | 302; A, G |
| 355 | A/G | A | A | A/G | A | A/G | 7 | 349; C, T |
| 370 | C/T | T | C | C/T | C | C/T | 5 | 369; C, T |
| 397 | C | T | C/T | C | C/T | C | 4 | 395; C, T |
| 408 | A/G | A | A | A/G | A | A/G | 5 | 403; A, G |
| 421 | A/G | A | A | A/G | A | A/G | 6 | 421; C, G |
| 503 | A/G | G | G | A/G | G | A/G | 6 | – |
Test for association of the late blight resistance phenotype (as determined in multiple bioassays by Nakitandwe et al.[23]) with SNP haplotypes on PCR fragments of genomic DNA amplified using site-specific PCR primers on CRP parents (C1; heterozygous for dominant late blight resistance, K4; susceptible), pools representing 10 resistant or 10 susceptible progenies, and 71 individual progenies (30 late blight resistant, 41 susceptible). The PCR primers pinpoint a conserved stretch of R gene locus PGSC0003DMG402020585. The fragment was Sanger-sequenced. The first nucleotide of this locus (position on fragment, 001) maps to position 59,553,199 bp on potato chromosome IX. The locus is a member of R gene cluster 64 which also holds a late blight resistance gene isolated from S. venturii[53].
†SNP as observed on Sanger-sequenced fragments PCR-amplified from C1, K4, and CK progenies, ‘Pool_R’ and ‘Pool_S’; pooled DNA samples of 10 resistant or 10 susceptible CK progeny individuals that were randomly chosen, Resistant progeny, Susceptible progeny; total of 30 individual CK segregants possessing a late blight resistant phenotype and 41 susceptible segregants, respectively.
‡C1; late blight resistant parent (K4; susceptible parent) of CK population (cross of C1 and K4, S. caripense, 2n = 2x = 24),
§number of individual samples tested (in total 75) where resistance phenotype and SNP genotype were in disaccord with the rule (e.g., genotype C/T when the rule was T for resistant individuals among the progenies),
¶position of SNP as observed on the Illumina Hiseq-2000 data (closest to those detected on Sanger-sequenced fragments, left columns). For the R gene locus on chromosome IX shown here, the best coincidence of SNPs as found on Sanger sequenced fragments out of nine loci investigated (see Results) was obtained. However, this locus is distant from the region where the resistance was mapped (Nakitandwe et al.[23]).
Figure 7Genetic map of Alegria chromosome IX. Right; relative molecular marker positions, left; genetic distance (cM). Markers in bold print are linked in coupling phase with the PVY resistance.