Literature DB >> 33240629

Genetic loci determining potato starch yield and granule morphology revealed by genome-wide association study (GWAS).

Vadim K Khlestkin1,2, Tatyana V Erst1, Irina V Rozanova1,3, Vadim M Efimov1,4, Elena K Khlestkina1,3.   

Abstract

BACKGROUND: It is well-documented that (bio)chemical reaction capacity of raw potato starch depends on crystallinity, morphology and other chemical and physical properties of starch granules, and these properties are closely related to gene functions. Preparative yield, amylose/amylopectin content, and phosphorylation of potato tuber starch are starch-related traits studied at the genetic level. In this paper, we perform a genome-wide association study using a 22K SNP potato array to identify for the first time genomic regions associated with starch granule morphology and to increase number of known genome loci associated with potato starch yield.
METHODS: A set of 90 potato (Solanum tuberosum L.) varieties from the ICG "GenAgro" collection (Novosibirsk, Russia) was harvested, 90 samples of raw tuber starch were obtained, and DNA samples were isolated from the skin of the tubers. Morphology of potato tuber starch granules was evaluated by optical microscopy and subsequent computer image analysis. A set of 15,214 scorable SNPs was used for the genome-wide analysis. In total, 53 SNPs were found to be significantly associated with potato starch morphology traits (aspect ratio, roundness, circularity, and the first bicomponent) and starch yield-related traits.
RESULTS: A total of 53 novel SNPs was identified on potato chromosomes 1, 2, 4, 5, 6, 7, 9, 11 and 12; these SNPs are associated with tuber starch preparative yield and granule morphology. Eight SNPs are situated close to each other on the chromosome 1 and 19 SNPs-on the chromosome 2, forming two DNA regions-potential QTLs, regulating aspect ratio and roundness of the starch granules. Thirty-seven of 53 SNPs are located in protein-coding regions. There are indications that granule shape may depend on starch phosphorylation processes. The GWD gene, which is known to regulate starch phosphorylation-dephosphorylation, participates in the regulation of a number of morphological traits, rather than one specific trait. Some significant SNPs are associated with membrane and plastid proteins, as well as DNA transcription and binding regulators. Other SNPs are related to low-molecular-weight metabolite synthesis, and may be associated with flavonoid biosynthesis and circadian rhythm-related metabolic processes. The preparative yield of tuber starch is a polygenic trait that is associated with a number of SNPs from various regions and chromosomes in the potato genome.
© 2020 Khlestkin et al.

Entities:  

Keywords:  GWAS; Inheritance; Principal components ; Solanum tuberosum; Starch; Starch ganula morphology; Starch yield; Two-block partial least squares

Year:  2020        PMID: 33240629      PMCID: PMC7664467          DOI: 10.7717/peerj.10286

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


Introduction

Starch is one of the most important renewable and economically notable organic resources of humankind (Khlestkin, Peltek & Kolchanov, 2018). Simplicity of potato growing and ease of industrial production of pure potato starch makes it a necessary component and starting compound in food and chemical industries and determines valuability of an exhaustive study of all aspects of manufacturing and commercial application of potato starch is necessary. There are many publications related to chemical and biochemical transformations of starch of diverse botanical origin and differences properties of various starches. Surprisingly, little attention is paid to difference in starch properties at the level of different cultivars, such as potato cultivars. Commercial starch is still subdivided as “potato starch”, “rice starch”, “corn starch” and so on, despite it is well-known that there are significant differences in the properties of starch of the same botanical origin. From general considerations, it is evident that chemical and biochemical reaction ability of the raw potato starch must depend on its granules crystallinity and morphology. If so, starches from different potato varieties with different granules’ properties should fit different applications better. (Bio)chemical reaction capacity of raw potato starch depends on the crystallinity, morphology and other chemical and physical properties of starch granules. For example, after chemical acylation of the raw potato starch and fractionation of resulting acetylated starch according to granule size, amylose and amylopectin may be isolated and characterized by degree of substitution (DS) and degradability with α-amylase, β-amylase and amyloglucosidase. In contrast to amylose, the DS of the amylopectin from the differently sized granules increased with decreasing granule size. The acetyl groups of the amylose molecules from small granules are more heterogeneously distributed and located more closely to the non-reducing ends compared to amylose from larger granules. The amylose populations from small granules of the acetylated starches were less susceptible to all the enzyme degradation reactions than the amylose from the large granules, even though the DS was similar. Additionally, the acetyl group distributions were different for amylopectin from different granule size fractions (Chen, Schols & Voragen, 2004; Chen et al., 2005). Some benefits of small potato starch granules are summarized in European patent (De Vetten & Heeres, 2004). The large size of the potato granules often hampers the utilization of potato starches, for example, in printing ink formulations, wherein potato starch granules may obstruct the printing apertures. Smaller granules facilitate chemical modification of the starch, since the surface-volume ratio is important for the accessibility of the modifying compound to the starch chains. In textile printing, for instance, carboxymethylated and roll dried potato starch of small granule size enables the production of finer prints. Furthermore, in adhesives, particularly in highly concentrated bag adhesives, crosslinked fine potato starches may be used as fillers and in drilling fluids, crosslinked and modified fine starches are expected to reduce fluid loss. In addition, smaller granule potato starches may be used as filler/structurant in soap, since this starch provides a pleasant sensation to the skin. Small granular potato starches are particularly suitable for applications in the food industry. Small granular potato starch provides such advantages as lower starch dosage level, better taste profile, and smooth but not excessively swollen granules (Singh & Kaur, 2004). Other practical and important properties that are sensitive to starch granule size are liquid composite viscosity (Zhou et al., 2002), chemical modification and flowability (Wang et al., 2016). It is evident that the shape of potato starch granules is closely related to genes function. Amylopectin polysaccharide is predominantly responsible for the smooth granule morphology. Even low-amylose starch granules still maintain a smooth shape, whereas starch with reduced amylopectin content (by antisense knockout of SBE and GWD genes) is associated with more oddly fissured granules (Blennow et al., 2003). The role of starch synthases and related genes (GBSS, SSI–SSIII) in potato starch granule morphology has been discussed elsewhere (Ball & Morell, 2003). In general, the role of certain genes in starch biosynthesis and starch granule morphology in particular is summarized in the review (Khlestkin, Peltek & Kolchanov, 2017). Traditionally, the preparative yield of potato tuber starch is the only starch-related trait generally accepted as important for potato breeding. In the past decade, starch phosphorylation and amylose/amylopectin content were also added to the short list of the traits studied at the genetic level by modern molecular biology methods. Thus, forward genetic tools, such as QTL analysis (Carreno-Quintero et al., 2012; Werij et al., 2012) and association mapping (Carpenter et al., 2015), were used to reveal loci associated with potato traits. The tuber starch characteristics, such as starch granular size and shape, and chemical-thermal properties of 21 potato varieties were determined and associated with genetic diversity through SSR markers. SSR-based cluster analysis revealed that varieties with interesting quality attributes were distributed among all clusters and subclusters, suggesting that the genetic basis of analyzed traits might differ among the varieties (Werij et al., 2012). Recently developed 22K SNP potato array is characterized by a high average density of markers, one locus per 40 kb (in the abovementioned studies (Khlestkin, Peltek & Kolchanov, 2017; Carreno-Quintero et al., 2012; Werij et al., 2012; Carpenter et al., 2015), this value does not exceed one marker per 4 Mbp). We used the chip to identify eight novel genomic regions on the chromosomes 1, 4, 5, 7, 8, 10 and 11 associated with starch phosphorylation. Some of the identified SNPs were located in noncoding genomic regions (Khlestkin et al., 2019). In this article, for the first time we perform a genome-wide association study using a 22K SNP potato array to locate genomic regions associated with starch granule morphology and to increase the number of known genomic loci associated with potato starch yield.

Materials and Methods

Plant material

The same set of 90 potato (Solanum tuberosum L.) varieties from the ICG “GenAgro” collection (Novosibirsk, Russia) described in Khlestkin et al. (2019) was used in this paper as well. Complete list of the accessions used in the study and their origin is presented in Supplemental Data S1 (Excel file). All plants are grown during the period May to October 2017 in the same field in Novosibirsk (54°52′ N and 83°00′ E). Seed tubers of all cultivars were planted in two rows with 0.75 m spacing and 0.3 m distance between the plants on the rows. In total, 10 plants were planted in the row; the length of each row was 10 m. Each cultivar was planted in three replicates; distances between the replicates’ plots were 2 m. Sowing: the first decade of May. Harvesting: the third decade of September. After harvesting tubers were stored for three weeks at +4 °C before starch isolation.

Starch Isolation

Potato starch was isolated from tubers according to the typical procedure described elsewhere (for example, see Khlestkin & Erst (2017)).

DNA Isolation and Genotyping

DNA was isolated from tuber skin using DNeasy Plant Mini Kit (Qiagen, Hilden, Germany) according to the standard procedure. А set of 15, 214 (71.7%) scorable SNPs (Khlestkin et al., 2019) was used for GWAS analysis. Genotyping information is available in Supplemental Data S2 (Excel file).

Microscopy and Image Processings

Sample preparation, microscopic image acquisition and processing were performed according to a previously developed procedure (Khlestkin & Erst, 2017). Five milligrams of raw starch were suspended in one mL of distilled water and dyed while shaking with 50 µL of iodine solution. A total of 20 mL of suspended dyed raw starch granules were placed on a microplate and covered with glass. At least four pictures of every sample were acquired (250–300 granules in every image) in transmitted light mode with bright-field technique. Micro images of starch granules obtained with the research optical microscope Axio Scope A1 (Carl ZEISS), objective—A-Plan 10x/0.25, CCD camera—AxioCam ICc 3, adaptor—TV 2/3″C 0.63x, software ZEN, total magnification 10 (objective) × 10 (ocular) × 0.63 (adaptor). Automatic image processing and analysis were performed in the freely distributed ImageJ program. Seven morphological parameters were analyzed (Table S1; Fig. S1).

Principal components and 2B PLS analysis

Two sets of principal components were calculated for both phenotypic morphological traits of starch granules, as well as for the genotyping data for potato varieties, through the distance matrix using the JACOBI 4 software package (Polunin, Shtayger & Efimov, 2014). To calculate the distance matrix between the potato varieties, we recoded the tetraploid potato genome from four-letter codes to numerical codes, taking into account the dose of a certain allele. After the recoding, 0 was assigned to the effector allele, and 1 was assigned as the non-effector allele, and their intermediate forms were coded as 0.75, 0.5 and 0.25. For example, the AAAA allele is reflected as 1, AAAG—as 0.75, AAGG—as 0.5, AGGG—as 0.25 and GGGG—as 0. Both sets of principal components were applied as blocks for two-block partial least squares (2B-PLS) analysis, where the first block related to phenotypic traits and the second block related to genotypic. Population structure matrix (Q-matrix) and the genotyping data were analyzed by Bayesian cluster analysis in STRUCTURE v.2.3.4 (Falush, Stephens & Pritchard, 2007). Cluster analysis of the same population has been previously discussed in the authors’ previous article (Khlestkin et al., 2019).

Association analysis

Association analysis was performed with the TASSEL 5 package (Bradbury et al., 2007). Four different statistical models were tested to identify significant marker associations with potato starch yield and granule morphology: (1) general linear model (GLM) without taking into account population structure, (2) GLM using a Q-matrix of population membership (GLM+Q) taking into account the population structure, (3) GLM taking into account population membership estimates derived from principal components analysis (GLM+PCA) and (4) a composite approach that combines both Q-matrix and the average relationship between individuals or lines (null matrix), represented in TASSEL as mixed linear model (MLM). Adaptation of MLM for GWAS has been discussed in Zhang et al. (2010). But MLM approach did not result into valuable results and no significant SNPs were identified with this tool. To identify significant SNPs, two corrections were used: (i) the Bonferroni correction, where the significant threshold (0.05) is divided by the total number of tests; in this work, the total number of markers (15,214) yields a threshold of 3.29 × 10−6 and (ii) the false discovered rate (FDR) (Benjamini & Hochberg, 1995), which was calculated for each isolate in each model. Some markers did not exceed the threshold but still possessed low p-value (>10−4). We referred to the markers as “suggestive”. The percentage of random was <10%. Belonging of identified SNPs to genes and their association with certain proteins were confirmed on the site https://plants.ensembl.org.

Results and Discussion

Phenotyping

Our initial study (Khlestkin & Erst, 2017) on potato tuber starch granule morphology provided a reliable method for granule shape evaluation. This work also indicated that granule shape is specific for different potato varieties and represents a set of quantitative traits that may be applied in modern potato breeding for production starch best suitable for industrial processing. Thus, 90 varieties of potato were harvested in 2017, and 90 samples of raw tuber starch and related DNA were isolated. Seven morphological parameters were automatically captured and evaluated for the 90 varieties: area (area of granule projection in microscope visible bright field), Feret’s diameter, minimal Feret’s diameter, aspect ratio (AR), roundness, circularity and solidity. Exact explanations and formulas for the parameters are provided in Table S1 and Fig. S1. The preparative yield of starch was also captured. It varied significantly from 7.4% (variety Agata) to 18.8% (variety Tango) (Supplemental Data S1, Excel file). The process of capturing starch micro images, treatment and evaluation were discussed in our previous study (Khlestkin & Erst, 2017). For GWAS, we used average values of all the morphological parameters for every cultivar.

Genome-wide association study

Principal components analysis and 2B PLS analysis

“Phenotype–genotype” covariation was calculated as a set of linear bicomponents (Table S2). It was shown that the first three linear bicomponents capture 92.7% of the total covariation. All the phenotypic traits studied, as well as preparative yield, correlate with the first bicomponent. The second component showed correlations with morphology traits and no correlations with the “preparative yield of starch”. “Preparative yield of starch” is the only trait that correlates significantly with the third bicomponent (Table S3). Plotting of the first and the third pairs of bicomponents showed good positive correlation between genotypes and phenotypes in both planes (Fig. S2). The haplotypes most interesting for further selection are situated at the opposite corners of the plots and are highlighted by the circles (Fig. S3). Despite the close values of the “Preparative yield of starch” in varieties Udacha and Svitanok Kievsky, they are genetically highly different. Analysis of genotypic data gives three clusters (Fig. S4). The first and the second clusters comprise single trait—preparative yield of starch and aspect ratio, respectively. All other morphological traits studied formed the third cluster. Preparative yield of starch and aspect ratio are opposite traits on the plate. Therefore, the breeding process for optimizing the traits should be performed in opposite directions in genetic coordinates.

Quantile-quantile plots

For all the traits studied, we evaluated whether the analysis of the population structure possesses additional accuracy in finding significant SNPs. First, the GLM analysis data were compared in the QQ-plots (quantile–quantile plots). They show that only for traits “circularity” and “Feret’s diameter” one may expect correct evaluation of noticeable SNPs with the GLM method. For the other traits, GLM demonstrates inflation (overestimated p-values) of noticeable SNP evaluation (Fig. S5). The GLM+PCA model takes into account the population structure and returns results of significantly higher quality (Fig. S6): calculated p-values are closer to the expected ones even in the region of high values. Thus, calculated data for “Circularity” and “Area” traits are close to theoretical ones, for “Solidity”, “Feret’s diameter” and “Minimal Feret’s diameter”, (−lg p)-values are somewhat lower. For other traits, obtained (−lg p)-values are slightly higher than expected. A similar QQ correlation for the first PLS bicomponent, which includes all the genotypic traits, shows significant deviations from theoretical data and thus a lower quality of p-value evaluation (Fig. S6). To account for the population structure, the GLM+Q statistical model was applied. The model worked well for “Feret’s diameter”, “Minimal Feret’s diameter”, “Circularity” and “Area” traits giving a good conformity, but did not give (–lg p)-value > 4, which indicates the absence of significant markers. The “Preparative tuber starch” and “Roundness” traits showed a number of false positive SNPs (Fig. S7).

Manhattan plots

In total, 53 significant SNPs were associated with morphological and starch yield traits (Table 1).
Table 1

Significant SNPs associated with preparative yield of starch and morphological traits.

NSNPStatistical modelSignificance levelsChrPositionp-Value*PolymorphismQTL effectHeritability valueTrait
1PotVar0026637GLMBonferroni4282,8282.5E−06T/G25.347660.22363Preparative yield
GLM+QBonferroni4282,8282.3E−06T/G25.730740.23036Preparative yield
GML+PCASuggestive4282,8284.8E−05T/G18.418150.14736Preparative yield
2PotVar0098904GLM+QFDR (0.05)461,699,9892.2E−05T/C20.209080.19196Preparative yield
3PotVar0098903GLM+QFDR (0.05)461,700,0411.8E−05A/G20.669120.19391Preparative yield
4solcap_snp_c2_32042GLMFDR (0.05)467,130,7193.9E−06A/G24.299340.21832Preparative yield
GLM+QFDR (0.05)467,130,7194.74E−06A/G23.938460.21998Preparative yield
5solcap_snp_c2_52081GLM+QSuggestive51,883,1653.8E−05A/G18.949880.174461st bi-component
GLM+QFDR (0.05)51,883,1651.2E−05A/G21.681740.20465Preparative yield
GML+PCASuggestive51,883,1654.6E−05A/G18.575340.14972Preparative yield
6PotVar0034580GLMFDR (0.05)551,697,9272.1E−05T/C19.339180.18535Preparative yield
GLM+QFDR (0.05)551,697,9272.09E−05T/C20.393750.19744Preparative yield
7solcap_snp_c1_1250GLMFDR (0.05)551,978,3844.94E−06A/C23.690150.21211Preparative yield
GLM+QFDR (0.05)551,978,3844.09E−06A/C24.263860.22014Preparative yield
8solcap_snp_c2_3174GLMFDR (0.05)64,419,1128.6E−06A/G22.419380.20678Preparative yield
GLM+QFDR (0.05)64,419,1121.22E−05A/G21.653850.20504Preparative yield
9solcap_snp_c1_5970GLM+QFDR (0.05)754,862,0764.9E−05T/G18.327870.17771Preparative yield
10PotVar0012073GLMFDR (0.05)92,679,6151.2E−05A/G19.874460.18596Preparative yield
GLM+QFDR (0.05)92,679,6151.17E−05A/G21.731430.20345Preparative yield
11solcap_snp_c2_5957GLM+QFDR (0.05)112,855,0445E−05A/G18.354780.18175Preparative yield
12solcap_snp_c2_6309GLMFDR (0.05)112,979,7841.2E−05T/C21.810450.21214Preparative yield
GLM+QFDR (0.05)112,979,7841.64E−05T/C21.117260.20921Preparative yield
13PotVar0067347GLMFDR (0.05)112,973,5092.1E−05T/C20.287430.19087Preparative yield
GLM+QFDR (0.05)112,973,5092.29E−05T/C20.143930.19353Preparative yield
14solcap_snp_c2_6285GLMFDR (0.05)113,080,2408.2E−06A/G22.487150.20539Preparative yield
GLM+QFDR (0.05)113,080,2405.57E−06A/G23.540870.21734Preparative yield
15solcap_snp_c2_6185GLMFDR (0.05)113,572,4452.5E−05A/C19.874850.18772Preparative yield
GLM+QFDR (0.05)113,572,4453.37E−05A/C19.234020.1859Preparative yield
16PotVar0120075GLM+QFDR (0.05)1433,6965.9E−05A/G17.916990.16272Aspect ratio
GLM+QFDR (0.05)1433,6968.1E−05A/G17.167880.15815Roundness
17PotVar0119973GLM+QFDR (0.05)1472,4226.7E−05T/C17.609490.16041Aspect ratio
GLM+QFDR (0.05)1472,4222.1E−05T/C20.33930.18167Roundness
18PotVar0119913GLM+QFDR (0.05)1473,4876.7E−05A/G17.609490.16041Aspect ratio
GLM+QFDR (0.05)1473,4872.1E−05A/G20.33930.18167Roundness
19solcap_snp_c2_36659GLM+QFDR (0.05)1532,0035.4E−05T/C18.111930.16418Aspect ratio
GLM+QFDR (0.05)1532,0037.3E−05T/C17.42120.16008Roundness
20solcap_snp_c2_36664GLM+QFDR (0.05)1535,4548.9E−05A/G16.956170.15717Aspect ratio
GLM+QFDR (0.05)1535,4549.6E−05A/G16.778460.1568Roundness
21solcap_snp_c2_36665GLM+QFDR (0.05)1536,0333.4E−05A/G19.203150.17223Aspect ratio
GLM+QFDR (0.05)1536,0333.7E−05A/G18.963980.17165Roundness
22PotVar0071846GLM+QFDR (0.05)11,155,6033.3E−05A/C19.200530.17141Aspect ratio
GLM+QFDR (0.05)11,155,6032.5E−05A/C19.874540.17799Roundness
23PotVar0071852GLM+QFDR (0.05)11,155,7703.3E−05T/C19.260690.17265Aspect ratio
GLM+PCASuggestive11,155,7708.8E−05T/C17.023990.15571Aspect ratio
GLM+QFDR (0.05)11,155,7703.5E−05T/C19.13430.17291Roundness
24solcap_snp_c2_56617GLMFDR (0.05)24,833,2895.5E−05A/G17.962390.16952Aspect ratio
GLMFDR (0.05)24,833,2896.3E−05A/G17.676250.16727Roundness
25solcap_snp_c1_16379GLMBonferroni24,839,5249.2E−07T/C28.023260.24794Aspect ratio
GLM+QFDR (0.05)24,839,5245.4E−05T/C18.170520.16451Aspect ratio
GLM+PCASuggestive24,839,5246.4E−05T/C17.814610.15835Aspect ratio
GLMBonferroni24,839,5249.2E−07T/C27.293350.24305Roundness
GLM+QFDR (0.05)24,839,5242.8E−05T/C19.738630.17492Roundness
GLM+PCASuggestive24,839,5243E−05T/C19.584470.17361Roundness
26PotVar0032432GLMFDR (0.05)25,126,9682.7E−05A/G19.762420.19231Aspect ratio
GLMFDR (0.05)25,126,9682.7E−05A/G19.283040.18853Roundness
27PotVar0032402GLMFDR (0.05)25,128,0629.6E−06A/G22.143070.20476Aspect ratio
GLMFDR (0.05)25,128,0629.9E−06A/G22.073460.20424Roundness
28solcap_snp_c2_4353GLMFDR (0.05)25,158,3057.9E−06T/C22.649380.2104Aspect ratio
GLMFDR (0.05)25,158,3058.7E−06T/C22.423380.20874Roundness
29solcap_snp_c2_4354GLMFDR (0.05)25,159,3012.2E−05T/C20.071710.18573Aspect ratio
GLMFDR (0.05)25,159,3012.5E−05T/C19.857270.18411Roundness
30PotVar0032114GLMFDR (0.05)25,159,3771.9E−05A/C20.512080.19258Aspect ratio
GLMFDR (0.05)25,159,3771.9E−05A/C20.512080.19258Roundness
31solcap_snp_c2_4360GLMBonferroni25,356,4986.7E−07A/G28.686960.24585Aspect ratio
GLM+QFDR (0.05)25,356,4982.1E−05A/G20.28770.17924Aspect ratio
GLM+PCASuggestive25,356,4981.5E−05A/G21.202110.18928Aspect ratio
GLMBonferroni25,356,4986.7E−07A/G27.444110.23773Roundness
GLM+QFDR (0.05)25,356,4989.6E−06A/G22.186730.19441Roundness
GLM+PCASuggestive25,356,4981.5E−05A/G21.202110.18928Roundness
32solcap_snp_c2_57190GLMFDR (0.05)25,394,4201.7E−05A/C20.700860.19221Aspect ratio
GLMFDR (0.05)25,394,4201.8E−05A/C20.559060.19114Roundness
33solcap_snp_c2_48725GLMFDR (0.05)25,510,7742.2E−05T/C20.071710.18573Aspect ratio
GLMFDR (0.05)25,510,7742.5E−05T/C19.857270.18411Roundness
34solcap_snp_c1_1503GLMFDR (0.05)25,362,5012.4E−05T/C19.917470.18629Aspect ratio
GLMFDR (0.05)25,362,5012.4E−05T/C19.919340.1863Roundness
35solcap_snp_c2_48735GLMFDR (0.05)25,552,6792.4E−05A/G19.917470.18629Aspect ratio
GLMFDR (0.05)25,55,26792.4E−05A/G19.919340.1863Roundness
36solcap_snp_c1_3750GLMFDR (0.05)26,090,0744.2E−06T/C24.1210.21707Aspect ratio
GLMFDR (0.05)26,090,0749E−06T/C22.270710.20381Roundness
37solcap_snp_c1_16405GLMFDR (0.05)26,707,4276.6E−06A/G23.032160.20932Aspect ratio
GLM+PCASuggestive26,707,4279.6E−05A/G16.832240.15409Aspect ratio
GLMFDR (0.05)26,707,4276.6E−06A/G23.896460.21548Roundness
GLM+PCASuggestive26,707,4275E−05A/G18.349470.17054Roundness
38solcap_snp_c1_3746GLMFDR (0.05)27,050,5951.5E−05T/C21.040940.19657Aspect ratio
GLMFDR (0.05)27,050,5951.4E−05T/C21.242320.19808Roundness
39solcap_snp_c1_3747GLMBonferroni27,050,7141.9E−06T/C26.126170.23301Aspect ratio
GLM+QFDR (0.05)27,050,7143.2E−05T/C19.379540.17319Aspect ratio
GLM+PCASuggestive27,050,7142.2E−05T/C20.288520.17686Aspect ratio
GLMBonferroni27,050,7141.9E−06T/C27.645940.24326Roundness
GLM+QFDR (0.05)27,050,7142.2E−05T/C20.284770.17974Roundness
GLM+PCASuggestive27,050,7148.6E−06T/C22.589850.19625Roundness
40solcap_snp_c2_14652GLMBonferroni28,703,4761.7E−06A/G26.357010.23458Aspect ratio
GLM+QFDR (0.05)28,703,4765.1E−05A/G18.284070.16778Aspect ratio
GLM+PCASuggestive28,703,4766.5E−05A/G17.748890.16164Aspect ratio
GLMBonferroni28,703,4761.7E−06A/G24.017680.21831Roundness
GLM+QFDR (0.05)28,703,4764.4E−05A/G18.625520.17228Roundness
GLM+PCASuggestive28,703,4764.4E−05A/G18.657390.17427Roundness
41solcap_snp_c2_14648GLMBonferroni28,799,5271.4E−06A/C27.010560.24553Aspect ratio
GLM+QFDR (0.05)28,799,5274.9E−05A/C18.456960.17279Aspect ratio
GLM+PCASuggestive28,799,5271E−04A/C16.831190.15898Aspect ratio
GLMBonferroni28,799,5271.4E−06A/C24.681370.22921Roundness
GLM+QFDR (0.05)28,799,5274.8E−05A/C18.489830.17371Roundness
GLM+PCASuggestive28,799,5276.7E−05A/C17.740910.17059Roundness
42solcap_snp_c2_32254GLMFDR (0.05)213,697,5234.9E−05A/G18.293190.1754Aspect ratio
GLMFDR (0.05)213,697,5234.9E−05A/G18.827510.1796Roundness
43PotVar0022442GLMFDR (0.05)71,120,0117.4E−05T/C17.330250.16772Aspect ratio
GLMFDR (0.05)71,120,0111.94E−04T/C15.168080.14993Roundness
44solcap_snp_c2_33657GLMFDR (0.05)112,274,0634.4E−05A/G18.551250.17744Aspect ratio
GLMFDR (0.05)112,274,0634.00E−05A/G18.757150.17905Roundness
45solcap_snp_c1_1504GLMFDR (0.05)24,479,3896.5E−06A/G23.100750.21174Aspect ratio
GLMFDR (0.05)24,479,3891.36E−05A/G21.295540.19848Roundness
46solcap_snp_c2_50824GLM+QFDR (0.05)1254,479,7732.7E−05T/C20.018520.18645Aspect ratio
GLM+QFDR (0.05)1254,479,7736.56E−05T/C17.880540.17686Circularity
GLM+QFDR (0.05)1254,479,7732.78E−04T/C14.549520.14453Roundness
47solcap_snp_c2_54815GLM+QFDR (0.05)11,125,6399.1E−05A/G16.983760.16372Aspect ratio
GLM+QFDR (0.05)11,125,6396.3E−05A/G17.822370.17128Roundness
48solcap_snp_c1_15462GLM+QBonferroni7205,9491.3E−06A/G27.206380.230951st bi-component
GLM+PCASuggestive7205,9491.3E−05A/G21.516240.175251st bi-component
GLM+QFDR (0.05)7205,9495.6E−06A/G23.54530.20251Сircularity
GLM+PCASuggestive7205,9495E−05A/G18.378380.1726Сircularity
49PotVar0047235GLM+PCAFDR (0.05)1139,417,9585.7E−06T/C23.511330.20444Сircularity
50solcap_snp_c2_52067GLMSuggestive Suggestive51,847,5563.7E−05A/G18.891640.176741st bi-component
GLM+QFDR (0.05)51,847,5568.8E−05A/G16.975740.155891st bi-component
51PotVar0097020GLM+QFDR (0.05)960,563,4364.3E−06T/G24.17610.210341st bi-component
GLM+PCASuggestive960,563,4361.8E−05T/G20.697390.16831st bi-component
52PotVar0097065GLM+QBonferroni960,565,0112.8E−06T/C25.310050.221431st bi-component
GLM+PCASuggestive960,565,0112.6E−05T/G19.926140.167311st bi-component
53PotVar0052560GLMSuggestive1259,793,4716.3E−05A/G17.68548A0.168941st bi-component

Note:

Only minimal p-value for each SNP presented.

Note: Only minimal p-value for each SNP presented. Recently, several genome association studies with starch granules size distribution were published for maize (Liu et al., 2018), wheat (Li et al., 2017; Chia et al., 2020) and Japonica rice (Biselli et al., 2019). In the present study, no noticeable SNPs with p-values higher than Bonferroni or FDR levels were found for average values for the “Area”, “Feret’s diameter”, “Minimal Feret’s diameter” and “Solidity” traits. This result is surprising, since three of the traits are related to the starch granule size, which is expected to be genotype-dependent. Indeed, if we apply ANOVA to the granules’ sizes of 90 varieties involved in the study, we see that factor “variety” clearly influences the size-related traits (“Area”, “Feret’s diameter” or “Minimal Feret’s diameter”). Building a “tree” of varieties’ starch granule size identifies five clusters of varieties (genotypes) with relatively close granule size, and ANOVA within the clusters enables us to conclude that the size of the granules in the clusters is dependent on the genotype (Fig. S8). It appears that granule size is not single- or oligogenic but is a polygenic trait. To determine the SNP pattern associated with this trait, analysis of p-values and their comparison with Bonferroni or FDR levels is not sufficient, and more advanced and complicated analysis is warranted.

Preparative tuber starch yield

Association of SNPs with phenotypic data for the “preparative tuber starch yield” trait revealed 10 significant SNPs when the GLM model was applied. The p-value of one SNP exceeded the Bonferroni level, and the p-values of the other nine exceeded the FDR (Fig. 1). The SNPs were assigned to chromosomes 4, 5, 6, 7, 9 and 11.
Figure 1

Manhattan plots for “Preparative tuber starch yield” trait; (A) GLM; (B) GLM+PCA; (C) GLM+Q.

For the “preparative tuber starch yield” trait, GLM+PCA appeared to be the best model according to the QQ plot (Fig. S6), but for the model (–lg p)-values are lower than for GLM (Fig. 1). No significant SNPs were revealed, but two SNPs on chromosomes 4 and 5 may be referred to as “suggestive”. Nevertheless, the GLM+Q model confirmed a significant SNP on chromosome 4. Some significant regions on chromosomes 4, 5, 9 and 11 were revealed with GLM and confirmed by GLM+Q (Table 1). Taking into account possible false-positive SNPs predicted by the QQ-plot, we lowered the FDR level following the suggestion of Chan, Rowe & Kliebenstein (2010). Setting the FDR level at the 10.05 percentile of p-values gave a reasonable number of detectable SNPs (Table 1).

Aspect ratio

The most promising results obtained for morphological traits related not to size but to shape of the starch granules. GLM without population structure analysis yielded five SNPs with (–lg p)-values exceeding the Bonferroni level (Table 1; Fig. 2). All of these SNPs are tightly grouped on chromosome 2, forming qualitative trait loci (QTL). Four other noticeable SNPs determined by GLM have (–lg p)-values exceeding FDR level and are located on chromosomes 1, 7, 11 and 12. The same three SNPs were confirmed by the GLM+PCA method and are probably parts of appropriate QTLs. Belonging to QTLs increases the probability that certain SNPs are associated with the trait studied.
Figure 2

Manhattan plots for “Aspect Ratio” trait. (A) GLM; (B) GLM+PCA; (C) GLM+Q.

Roundness

Analysis of SNPs for the “Roundness” trait in detail reproduced the results obtained for the “Aspect ratio”. Table 1 contains SNPs associated with the “Aspect ratio” and includes SNPs for “Roundness” as well. Manhattan plots are highly similar for the two traits because both describe the shape of potato starch granules.

Circularity

Three SNPs associated with the “Circularity” trait were revealed by GWAS using the statistical models GLM+PCA and GLM+Q (Table 1; Fig. S9).

SNPs correlated with the first phenotypic bicomponent

Since the “Preparative yield”, “Area”, “Circularity”, “Feret’s diameter”, “Minimal Feret’s diameter” and “Solidity” traits mutually contribute the first principal component significantly (Table S3), it was logical to analyze SNPs’ associations with the first bicomponent to determine if there were any SNPs associated with all of the traits simultaneously. One of the SNPs found for the first bicomponent (solcap_snp_c1_15462 on the seventh chromosome) fits the SNP, which is significant for the “Circularity” trait. The “Circularity” trait contributes the first bicomponent (−0.67992). The other four significant SNPs are unique and not associated with any single trait. All of these SNPs are a part of appropriate QTLs (“supported” with other SNPs with lower (−lg p)-values) (Fig. S10). Genes around the SNPs may be responsible for the general starch carbohydrate polymer arrangement in the starch granule. No significant SNPs associated with the second and third bicomponents were revealed.

SNPs and Related Genes/Proteins

In total, 53 SNPs were found to be significant in the association study of the potato genotypes and starch morphology/yield-related traits. (−lg p)-Values of one SNP were associated with “preparative yield of starch”, five SNPs—with “aspect ratio” and “roundness” traits and one SNP—with the first bicomponent exceeded strict Bonferroni criteria. Nevertheless, some associated SNPs overcame 0.1 and 0.05 FDR (false discovery rate) and suggestive levels. Most of the SNPs are located on the first and second chromosomes, the least on chromosomes 4, 5, 6, 7, 9, 11 and 12. A total of 37 of 53 SNPs are located in protein coding regions. Not one of the SNPs studied in this paper coincides with the ones associated with covalently bound phosphorus content in starch (Khlestkin et al., 2019) (Table S4).

Preparative Yield of Starch

Three out of four SNPs found on chromosome 4 are located in noncoding sequences and relate to two different DNA regions. There are some other related SNPs with considerably lower significance in these regions, indicating possible QTLs. The fourth SNP is situated in the different region of the gene, coding for a DNA transcription regulator consisting of 95 amino acid residues. All three SNPs found on chromosome 5 are related to the sequences coding different proteins. One SNP corresponds to coiled coil protein with unclear function; two others are related to low-molecular-weight organic metabolite conversion, specifically to the enzymes aldehyde dehydrogenase and phenyl alanine lyase, which are responsible for alcohol-aldehyde equilibria in the cell and for flavonoid synthesis. Taking into account SNPs that are less significant but still associated with starch yield, we may speak about two trait-associated DNA regions that are potential QTLs. The significant SNP on chromosome 6 seems also be a part of a QTL and relates to plastid transcriptionally active protein 16 (PTAC16), which is suspected to be involved in the regulation of plastid gene expression. The significant SNP on chromosome 7 encodes the protein translation factor SUI1. The significant SNP on chromosome 9 encodes a protein with unknown function. A group of five SNPs located within 2855044–3572445 bp on chromosome 11 belongs to the same QTL, and the SNPs are included in the coding regions of several proteins: DEGP10, methylenetetrahydrofolate reductase, acetolactate synthase, and two proteins of unknown function. In general, most of the proteins associated with potato starch preparative yield variations are involved in plastid activity and low-molecular-weight metabolite biosynthesis.

Aspect Ratio and Roundness

These traits are well-correlated with each other and describe the shape of starch granules in a similar way; thus, the traits are associated with the same SNPs. There is a locus on chromosome 1 containing eight SNPs in the same DNA region. Some of these proteins are related to circadian rhythm-regulating proteins. The circadian clock regulates numerous plant developmental and metabolic processes. In crop species, the circadian clock contributes significantly to plant performance and productivity and to the adaptation and geographical range over which crops can be grown. Other SNPs are related to the phosphorylation of important biochemical intermediates and plastid organization. On chromosome 2, a total of 19 significant SNPs were identified. The SNPs are narrowly situated, forming a single DNA region, a potential trait-related QTL. Most of the SNPs are located in noncoding regions or related to proteins with unknown functions. Two SNPs are related to WPP domain-associated protein-encoding genomic regions, one to plastid high chlorophyll fluorescence 136, another to pentatricopeptide repeat-containing protein, and one SNP is related to DNA binding protein. Two SNPs on chromosomes 7 and 11 were found in the genes encoding proteins with unknown functions.

Circularity

Only three SNPs associated with the circularity trait were identified. The first SNP is related to a noncoding region of chromosome 12, the second is located on chromosome 7 in the minichromosome maintenance 5 protein coding sequence, and the third is located on chromosome 11 in the flavonoid 3′,5′-hydroxylase-encoding region.

First “Phenotype” bicomponent

Six SNPs were found to be associated with the first bicomponent from 2B-PLS, which is a complex component comprising all of the traits studied. Thus, one SNP was the same for the first bicomponent and Preparative yield (chromosome 7), while another SNP was the same for the first bicomponent and Circularity (chromosome 5). Among the four other unique 1st bicomponent associated SNPs, two are related to the GWD gene, one of the key starch biosynthesis genes (Khlestkin, Peltek & Kolchanov, 2017), which is responsible for the phosphorylation-dephosphorylation of glucans (chromosome 9). The other two SNPs are associated with cytochrome B561 family protein (chromosome 5) and Pto-interacting protein 1 (chromosome 12). In summary, chromosomes 1 and 2 contain important regions responsible for the roundness and aspect ratio of tuber starch granules. There are indications that granule shape may depend on circadian rhythm-related metabolic processes and starch phosphorylation processes. The GWD gene, which is known to regulate phosphorylation and dephosphorylation participates in the regulation of a whole number of morphological traits, rather than a single certain one. Nevertheless, some other mechanisms and proteins located on chromosome 2 influence the granule formation process. The preparative yield of tuber starch is probably a polygenic trait, regulated by a number of proteins that are encoded by sequences in various parts and chromosomes of the potato genome.

Conclusions

A genome-wide association study using a 22K SNP potato array enabled 53 novel SNPs to be identified on chromosomes 1, 2, 4, 5, 6, 7, 9, 11 and 12; these SNPs are associated with tuber starch preparative yield and with starch granule morphology (aspect ratio, roundness, circularity, and the first bicomponent). Some of the SNPs observed in this study are located in noncoding regions. The coding regions are associated with membrane and plastid proteins, DNA transcription and binding regulators, low-molecular-weight metabolite synthesis as well as flavonoid biosynthesis. The information on significant regions can be used to convert SNPs to PCR-markers, convenient for screening breeding material in programs aimed on development of potato varieties with desired starch properties. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  14 in total

1.  Starch phosphorylation associated SNPs found by genome-wide association studies in the potato (Solanum tuberosum L.).

Authors:  Vadim K Khlestkin; Irina V Rozanova; Vadim M Efimov; Elena K Khlestkina
Journal:  BMC Genet       Date:  2019-03-18       Impact factor: 2.797

2.  A carbohydrate-binding protein, B-GRANULE CONTENT 1, influences starch granule size distribution in a dose-dependent manner in polyploid wheat.

Authors:  Tansy Chia; Marcella Chirico; Rob King; Ricardo Ramirez-Gonzalez; Benedetta Saccomanno; David Seung; James Simmonds; Martin Trick; Cristobal Uauy; Tamara Verhoeven; Kay Trafford
Journal:  J Exp Bot       Date:  2020-01-01       Impact factor: 6.992

Review 3.  Review of direct chemical and biochemical transformations of starch.

Authors:  Vadim K Khlestkin; Sergey E Peltek; Nikolay A Kolchanov
Journal:  Carbohydr Polym       Date:  2017-10-10       Impact factor: 9.381

4.  Mixed linear model approach adapted for genome-wide association studies.

Authors:  Zhiwu Zhang; Elhan Ersoz; Chao-Qiang Lai; Rory J Todhunter; Hemant K Tiwari; Michael A Gore; Peter J Bradbury; Jianming Yu; Donna K Arnett; Jose M Ordovas; Edward S Buckler
Journal:  Nat Genet       Date:  2010-03-07       Impact factor: 38.330

5.  Untargeted metabolic quantitative trait loci analyses reveal a relationship between primary metabolism and potato tuber quality.

Authors:  Natalia Carreno-Quintero; Animesh Acharjee; Chris Maliepaard; Christian W B Bachem; Roland Mumm; Harro Bouwmeester; Richard G F Visser; Joost J B Keurentjes
Journal:  Plant Physiol       Date:  2012-01-05       Impact factor: 8.340

6.  The molecular deposition of transgenically modified starch in the starch granule as imaged by functional microscopy.

Authors:  Andreas Blennow; Michael Hansen; Alexander Schulz; Kirsten Jørgensen; Athene M Donald; James Sanderson
Journal:  J Struct Biol       Date:  2003-09       Impact factor: 2.867

7.  Inference of population structure using multilocus genotype data: dominant markers and null alleles.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Mol Ecol Notes       Date:  2007-07-01

8.  Starch phosphorylation in potato tubers is influenced by allelic variation in the genes encoding glucan water dikinase, starch branching enzymes I and II, and starch synthase III.

Authors:  Margaret A Carpenter; Nigel I Joyce; Russell A Genet; Rebecca D Cooper; Sarah R Murray; Alasdair D Noble; Ruth C Butler; Gail M Timmerman-Vaughan
Journal:  Front Plant Sci       Date:  2015-03-10       Impact factor: 5.753

9.  GWAS for Starch-Related Parameters in Japonica Rice (Oryza sativa L.).

Authors:  Chiara Biselli; Andrea Volante; Francesca Desiderio; Alessandro Tondelli; Alberto Gianinetti; Franca Finocchiaro; Federica Taddei; Laura Gazza; Daniela Sgrulletta; Luigi Cattivelli; Giampiero Valè
Journal:  Plants (Basel)       Date:  2019-08-19

10.  Identification of Quantitative Trait Loci and Candidate Genes for Maize Starch Granule Size through Association Mapping.

Authors:  Na Liu; Zhanhui Zhang; Yadong Xue; Shujun Meng; Yubi Huang; Weihua Li; Jihong Huang; Jihua Tang
Journal:  Sci Rep       Date:  2018-09-24       Impact factor: 4.379

View more
  3 in total

1.  Association mapping reveals novel genomic regions controlling some root and stolon traits in tetraploid potato (Solanum tuberosum L.).

Authors:  Muhammad Farhan Yousaf; Ufuk Demirel; Muhammad Naeem; Mehmet Emin Çalışkan
Journal:  3 Biotech       Date:  2021-03-18       Impact factor: 2.406

2.  Different Reactivity of Raw Starch from Diverse Potato Genotypes.

Authors:  Vadim Khlestkin; Ilia Eltsov
Journal:  Molecules       Date:  2021-01-05       Impact factor: 4.411

Review 3.  The Glycemic Index and Human Health with an Emphasis on Potatoes.

Authors:  Venkata Sai Sagili; Priyadarshini Chakrabarti; Sastry Jayanty; Hemant Kardile; Vidyasagar Sathuvalli
Journal:  Foods       Date:  2022-08-02
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.