| Literature DB >> 26825382 |
E M Schönhals1, F Ortega2, L Barandalla3, A Aragones3, J I Ruiz de Galarreta3, J-C Liao1, R Sanetomo4, B Walkemeier1, E Tacke5, E Ritter3, C Gebhardt6.
Abstract
KEY MESSAGE: SNPs in candidate genes Pain - 1, InvCD141 (invertases), SSIV (starch synthase), StCDF1 (transcription factor), LapN (leucine aminopeptidase), and cytoplasm type are associated with potato tuber yield, starch content and/or starch yield. Tuber yield (TY), starch content (TSC), and starch yield (TSY) are complex characters of high importance for the potato crop in general and for industrial starch production in particular. DNA markers associated with superior alleles of genes that control the natural variation of TY, TSC, and TSY could increase precision and speed of breeding new cultivars optimized for potato starch production. Diagnostic DNA markers are identified by association mapping in populations of tetraploid potato varieties and advanced breeding clones. A novel association mapping population of 282 genotypes including varieties, breeding clones and Andean landraces was assembled and field evaluated in Northern Spain for TY, TSC, TSY, tuber number (TN) and tuber weight (TW). The landraces had lower mean values of TY, TW, TN, and TSY. The population was genotyped for 183 microsatellite alleles, 221 single nucleotide polymorphisms (SNPs) in fourteen candidate genes and eight known diagnostic markers for TSC and TSY. Association test statistics including kinship and population structure reproduced five known marker-trait associations of candidate genes and discovered new ones, particularly for tuber yield and starch yield. The inclusion of landraces increased the number of detected marker-trait associations. Integration of the present association mapping results with previous QTL linkage mapping studies for TY, TSC, TSY, TW, TN, and tuberization revealed some hot spots of QTL for these traits in the potato genome. The genomic positions of markers linked or associated with QTL for complex tuber traits suggest high multiplicity and genome wide distribution of the underlying genes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26825382 PMCID: PMC4799268 DOI: 10.1007/s00122-016-2665-7
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
PCR information for five new candidate loci and the specific allele InvCD141-Sa evaluated for association with tuber traits in the QUEST population
| Locus | Chromosome: position (bp) (pseudomolecule version v4.03) | Locus ID PGSC0003… | Primer sequence (5′–3′) |
| Amplicon size (bp) | Scored SNPs/indels |
|---|---|---|---|---|---|---|
|
| chr01:62723718..62724360 | DMG400009042 | F_GGCAACAATTGCTGGTGTTAa R_GCCTAATTCATAGCATTCAAGATTC | 59 | 453 | 11/1 |
|
| chr02:30142740..30152314 | DMG400008322 | F_CTCAATGAAGCTCGTGTCCA R_CAAAATTCCGAAGGCATCTCa | 50 | 869 | 15/0 |
|
| chr04:64775429..64782339 | DMG400012910 | F_AGCATCTACTCACCTTCTTCATCTTTC R_TGCAAACTGGCAAACAGCTTa | 56 | 492 | 14/1 |
|
| chr05:04538880..04541736 | DMG400018408 | F_CCGCGATGTAATAGCATGGAa R_GTTCCAAGGGTTTGCTACGG | 56 | 591 | 16/0 |
|
| chr06:54709882..54713896 | DMG400005930 | F_CGATTATGGAAGCCAATGGT R_GGAAATCGCTTATTCCCACTCa | 57 | 668 | 20/0 |
|
| chr10:55851945..55856805 | DMG402028252 | F_GGGCAACATTATTTGGGCT R_TTGGCTTTTGTTGATGGTTTTA | 65–60 | 295 | –b |
Primers, annealing temperature (Ta), amplicon size and number of SNPs/indels scored in the amplicons are shown
aPrimer used for amplicon sequencing
bAllele-specific assay
Fig. 1Boxplots of the adjusted entry means for tuber starch content (TSC), tuber yield (TY), tuber starch yield (TSY), tuber number (TN), and tuber weight (TW) evaluated in two-year field trials in 191 tetraploid cultivars (CUL), 73 tetraploid breeding clones (BRE) and 16 Andean landraces (LAN) of the QUEST population
Bivariate (Pearson) correlation coefficients between traits, and heritabilities H 2
| Trait | TSC | TY | TSY | TN | TW | PM1 |
|
| |
|---|---|---|---|---|---|---|---|---|---|
| TSC | – | 0.830 | 0.836 | ||||||
| TY | ns | – | 0.745 | 0.707 | |||||
| TSY | 0.361*** | 0.902*** | – | 0.762 | 0.730 | ||||
| TN | 0.163** | 0.452*** | 0.459*** | – | 0.728 | 0.743 | |||
| TW | −0.177** | 0.370*** | 0.370*** | −0.350*** | – | 0.723 | 0.688 | ||
| PM1 | −0.212** | −0.379*** | −0.439*** | −0.418*** | ns | – | – | _ | |
| PM2 | −0.248* | ns | −0.215* | −0.249* | ns | 0.510*** | – | _ |
ns not significant; * 0.05 > p > 0.01, ** 0.01 > p > 0.001, *** p < 0.001
Fig. 2Principal coordinate plots of the QUEST population including (a) and excluding (b) the landraces, based on 183 alleles at 29 microsatellite loci. Genotypes were separated by the first two principal coordinates (PC) which were calculated on the basis of Jaccard’s distances. Numbers in parentheses are the percentage of explained variance by the PC
Cytoplasm types in the QUEST population and their distribution in the K = 2 subpopulations inferred by Bayesian clustering
| Cytoplasm type | Total | Subpopulation | % per subpopulation | ||
|---|---|---|---|---|---|
| 1 | 2 | 1 | 2 | ||
| T | 192 | 174 | 18 | 90.6 | 9.4 |
| D | 36 | 19 | 17 | 52.8 | 47.2 |
| W | 38 | 33 | 5 | 86.6 | 13.2 |
| A | 9 | – | 9 | – | 100 |
| P | 6 | – | 6 | – | 100 |
| M | 1 | – | 1 | – | 100 |
| Total | 282 | 226 | 56 | ||
Number of marker–trait associations (MAF ≥1 %) detected at p-values <0.05, <0.01, and <0.001 with the models GLM, MM-PK with (+LAN) and without landraces (−LAN)
| Trait | No. of markers ( | No. of markers ( | No. of markers ( | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GLM (−LAN) | MM-PK (+LAN) | MM-PK (−LAN) | Persistent in MM | GLM (−LAN) | MM-PK (+LAN) | MM-PK (−LAN) | Persistent in MM | GLM (−LAN) | MM-PK (+LAN) | MM-PK (−LAN) | Persistent in MM | |
| TSC | 104 (109) | 66 (71) | 56 (61) | 44 (48) | 47 (49) | 29 (31) | 28 (31) | 22 (23) | 24 (26) | 11 (11) | 10 (10) | 10 (10) |
| TY | 41 (46) | 26 (28) | 21 (23) | 13 (15) | 13 (14) | 5 (7) | 2 (3) | 2 (3) | 2 (2) | 1 (1) | 1 (1) | 1 (1) |
| TSY | 29 (30) | 46 (49) | 29 (31) | 24 (26) | 3 (4) | 11 (12) | 8 (9) | 5 (6) | 1 (1) | 2 (2) | 2 (2) | 2 (2) |
| TN | 32 (34) | 30 (38) | 0 | 0 | 7 (8) | 16 (21) | 0 | 0 | 3 (4) | 2 (5) | 0 | 0 |
| TW | 60 (63) | 38 (40) | 0 | 0 | 24 (24) | 8 (9) | 0 | 0 | 4 (4) | 1 (1) | 0 | 0 |
The number of marker–trait associations when including markers with MAF <1 % are shown in parenthesis
Marker–trait associations in the QUEST population without landraces (−LAN) based on the mixed model MM-PK
| Marker | Marker allelesa | Locus position (Mbp) | Allele frequencyb | TSC ( | TY ( | TSY ( | PM1 ( |
|---|---|---|---|---|---|---|---|
|
| T/C | Chr01:86.09 | 0.095 (C) |
| ns | ns | ns |
|
| A/T | Chr02:30.14 | 0.063 (T) | ns |
| *(2.0) ↑ | ns |
|
| T/A | 0.041 (T) | ns | ns | ns | **(4.5) | |
|
| 0/1 | Chr03:39.25 | 0.155 (1) |
| ns |
| ns |
|
| C/T | 0.195 (T) | ns | ns | ns | **(4.4) | |
|
| A/G | Chr04:64.78 | 0.033 (G) |
| ns | ns | ns |
|
| A/G | 0.092 (G) | **(2.5) ↓ | ns | ns | ns | |
|
| G/A | 0.086 (A) |
| ns | ns | ns | |
|
| T/A | 0.087 (A) |
| ns | ns | ns | |
|
| C/T | 0.115 (T) | **(2.9) | ns | ns | ns | |
|
| 0/1 | 0.322 (1) |
| ns | ns | ns | |
|
| A/T | Chr05:04.54 | 0.157 (T) | ns |
|
| ns |
|
| C/A | 0.074 (A) | ns | ns | ns | **(4.4) | |
|
| T/C | Chr06:54.71 | 0.055 (C) | ns | ns | ns | **(5.5) ↑ |
|
| C/A | Chr09:60.56 | 0.300 (A) |
| ns | ns | ns |
|
| C/T | 0.302 (T) |
| ns | ns | ns | |
|
| 0/1 | Chr10:50.94 | 0.462 (1) |
| ns | ns | ns |
|
| 0/1 | Chr10:55.85 | 0.112 (1) |
| ns | ns | ns |
|
| 0/1 | Chr10:55.85 | 0.665 (1) |
| ns | ns | ns |
|
| G/A | Chr10:55.85 | 0.135 (A) |
| ns | *(2.1) ↓ | ns |
|
| C/T | 0.168 (T) |
| ns | ns | ns | |
|
| C/T | 0.148 (T) |
| ns | ns | ns | |
|
| C/T | 0.214 (T) |
| ns | ns | ns | |
|
| T/C | 0.197 (C) |
| ns | *(2.2) ↓ | ns | |
|
| C/T | 0.147 (T) |
| ns | *(2.1) ↓ | ns | |
|
| G/A | 0.138 (A) |
| ns | ns | ns | |
|
| 0/1 | Chr11:08.21 | 0.814 (1) |
| ns | **(3.4) ↓ | ns |
|
| G/A | Chr12:02.34 | 0.092 (A) |
| *(1.6) |
| ns |
|
| T/C | 0.181 (C) |
| ns | ns | ns | |
| Cytoplasm type | – | – | – | **(3.4) | ns | **(3.3) | ns |
Markers with minor allele frequency ≥1 % and at least one association at p < 0.01 are shown
ns not significant at α = 0.05; * significant at 0.01 ≤ p < 0.05, ** significant at 0.001 ≤ p < 0.01, *** significant at p < 0.001; Arrows indicate the direction of the effect of presence of the microsatellite or PCR marker allele or the effect of increasing dosage of the minor frequency SNP allele on the trait: ↑ increasing, ↓ decreasing mean values for TSC, TY, TSY, and PM1. Numbers in bold indicate markers that were persistent in the two association models MM-PK (+LAN) and MM-PK (−LAN) at p < 0.01)
aThe nucleotide present in the reference potato genome sequence (S. phureja) is on the left position, the alternative allele (S. tuberosum) on the right
bFor SNPs the minor allele frequency (MAF) is shown with the minor frequency SNP allele in parenthesis; MAF was calculated including the allele dosage. The ‘allele’ frequency of microsatellite and allele-specific PCR markers corresponds to the frequency of the presence (1) of the marker allele without counting allele dosage
cThe four PGI-4 SNPs are in nearly complete LD and form a haplotype
dThe two PWD SNPs are in nearly complete LD and form a haplotype
eThe six InvCD141 SNPs are in nearly complete LD and form a haplotype, the allele-specific marker InvCD141-Sa detects this haplotype
Fig. 3Phenotypic effects of genotype classes of SNPs StCDF1_snp1812 (a), SSIV_snp2679 (b), and cytoplasm type (c) in the QUEST(−LAN) population. Means (grey bars) and standard deviations of TSC, TY, and/or TSY are shown. Significant differences between genotype classes are indicated by a and b (ANOVA post hoc test LSD, p < 0.05). The number of individuals in each genotype class is indicated at the bottom of the bar. The three genotypes with the highest dosage (ATTT) of the allele StCDF1_T associated with increased TY and TSY were the variety Isla and the breeding clones 2001Q29-10 and 2003P54-4. The four genotypes with the highest dosage (AATT) of the allele SSIV_T associated with increased yield were the varieties Murato, Melody, Riviera, and Opal (Online Resource 1)
Fig. 4Physical maps of the 12 potato chromosomes, showing to the right of each chromosome the genomic positions of the markers genotyped in the QUEST population, and to the left markers linked to previously mapped QTL for tuber starch content (TSC, specific gravity (SG) in Bonierbale et al. 1993), yield (TY), starch yield (TSY), tuber weight (TW), tuber number (TN), and tuberization (TZ, in vitro tuberization (ivt) and greenhouse tuberization (gt) in Šimko et al. 1999). Numbers in parenthesis after the markers are numerical codes for the corresponding reference: 1 (Bonierbale et al. 1993), 2 (Schäfer-Pregl et al. 1998), 3 (van den Berg et al. 1996), 4 (Šimko et al. 1999), 5 (Zhou et al. 2014), and 6 (Navarro et al. 2011). RFLP marker sequences were retrieved from the databases Sol Genomics Network (TG, CT, and CD markers, http://www.sgn.cornell.edu/) and GABI Primary Database (GP and CP markers, GluA, SK2, SbeI, AGPaseB, pat, prp1, http://www.gabipd.org/). Microsatellite primer sequences were retrieved from the literature cited in Materials and Methods. Gene sequences were obtained from GenBank entries (http://www.ncbi.nlm.nih.gov/). Sequences were BLASTed against the potato genome sequence (pseudomolecules v4.03, http://potato.plantbiology.msu.edu/blast.shtml). QTL are shown red next to the linked or associated marker locus or next to the brackets indicating genome segments harbouring the QTL. Loci associated with TSC, TY, TSY, TW, and/or TN in the QUEST(+LAN) population but not in the QUEST(−LAN) population (MM-PK model, p < 0.01, MAF >1 %, see Online Resource 5) are shown in blue letters. Loci associated with TSC, TY, and/or TSY in both the QUEST(−LAN) and QUEST(+LAN) population are shown in green letters (see Table 5). Genes with known function are in italics