Literature DB >> 35865281

The Population Structure of a Globe Artichoke Worldwide Collection, as Revealed by Molecular and Phenotypic Analyzes.

Domenico Rau1, Giovanna Attene1, Monica Rodriguez1, Limbo Baghino2, Anna Barbara Pisanu2, Davide Sanna2, Alberto Acquadro3, Ezio Portis3, Cinzia Comino3.   

Abstract

The knowledge of the organization of the domesticated gene pool of crop species is an essential requirement to understand crop evolution, to rationalize conservation programs, and to support practical decisions in plant breeding. Here, we integrate simple sequence repeat (SSR) analysis and phenotypic characterization to investigate a globe artichoke collection that comprises most of the varieties cultivated worldwide. We show that the cultivated gene pool of globe artichoke includes five distinct genetic groups associated with the major phenotypic typologies: Catanesi (which based on our analysis corresponds to Violetti di Provenza), Spinosi, Violetti di Toscana, Romaneschi, and Macau. We observed that 17 and 11% of the molecular and phenotypic variance, respectively, is between these groups, while within groups, strong linkage disequilibrium and heterozygote excess are evident. The divergence between groups for quantitative traits correlates with the average broad-sense heritability within the groups. The phenotypic divergence between groups for both qualitative and quantitative traits is strongly and positively correlated with SSR divergence (FST) between groups. All this implies a low population size and strong bottleneck effects, and indicates a long history of clonal propagation and selection during the evolution of the domesticated gene pool of globe artichoke. Moreover, the comparison between molecular and phenotypic population structures suggests that harvest time, plant architecture (i.e., plant height, stem length), leaf spininess, head morphology (i.e., head shape, bract shape, spininess) together with the number of heads per plant were the main targets of selection during the evolution of the cultivated germplasm. We emphasize our findings in light of the potential exploitation of this collection for association mapping studies.
Copyright © 2022 Rau, Attene, Rodriguez, Baghino, Pisanu, Sanna, Acquadro, Portis and Comino.

Entities:  

Keywords:  Cynara cardunculus var. scolymus L.; QST; UPOV traits; divergent selection; germplasm collection; microsatellite; quantitative traits; simple sequence repeats

Year:  2022        PMID: 35865281      PMCID: PMC9294547          DOI: 10.3389/fpls.2022.898740

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   6.627


Introduction

The study of the organization of the gene pool of crops is an essential prerequisite for both theoretical and practical reasons. This provides insight into the evolutionary history of a crop, strengthens our knowledge for the implementation of correct conservation strategies, and rationalizes operational choice in plant breeding. For major crops, many studies on population structure are available, such as Triticum aestivum (Würschum et al., 2013), Oryza sativa (Jin et al., 2010), Zea mays (Vigouroux et al., 2008) Hordeum vulgare (Malysheva-Otto et al., 2006; Tanto Hadado et al., 2010; Rodriguez et al., 2012), Lycopersicon esculentum (Aflitos et al., 2014; Rodriguez et al., 2020), Solanum melongena (Cericola et al., 2013, 2014; Portis et al., 2015), Phaseolus spp. (Angioi et al., 2010; Bitocchi et al., 2013; Rodriguez et al., 2013), and Glycine max (Bandillo et al., 2015). However, for other minor crops, such as Cynara cardunculus var. scolymus L., the cultivated globe artichoke, information remains limited. Globe artichoke (2n = 2x = 34) is a perennial open-pollinated crop (De Vos, 1992; Pecaut, 1993) that is probably native to the Mediterranean basin (Sonnante et al., 2007; Gatto et al., 2013) where the cultivation of this crop is also particularly relevant. Based on data from the Food and Agriculture Organization Corporate Statistical Database[1] (FAOSTAT, 2020), the total world artichoke production in 2020 was 1,516,955 tons. In total, three circum-Mediterranean countries, namely Italy (367,080 tons), followed by Egypt (308,844 tons), and Spain (196,970 tons), were the most important world producers, accounting for 57.5% of the global production. The edible portion of the artichoke plant is the head, or capitulum, which is formed as a composite inflorescence that is harvested before the flowers bloom. Furthermore, globe artichoke is considered a functional food and a source of nutraceutical ingredients, such as bioactive phenolic compounds, inulin, fiber, and minerals; in addition, artichoke leaf extracts have long been used in folk medicine, particularly for liver complaints (Comino et al., 2007, 2009; Lattanzio et al., 2009; Moglia et al., 2009, 2016; Menin et al., 2010, 2012; Eljounaidi et al., 2014, 2015). Many different clonal varietal groups best adapted to different local environments have been described in this species. Cultivated germplasm has traditionally been classified into four main groups based on the capitulum traits (Dellacecca et al., 1976; Porceddu et al., 1976; Vanella et al., 1981): (1) The “Spinosi” group has long spines on the bracts and leaves; (2) “Violetti” has medium-sized, violet-colored, and less spiny heads; (3) “Romaneschi” has spherical or subspherical non-spiny heads; and (4) “Catanesi” has relatively small, elongated, and non-spiny heads. Elia and Miccolis (1996) conducted a phenotypic characterization in a common garden experiment. They conducted a multivariate analysis considering eight quantitative traits across 104 accessions, and they distinguished five main morphophenological clusters. These were early; medium-early; late with small heads; late-violet; and late with large heads. Furthermore, Lanteri et al. (2004) studied a wide collection of cultivated globe artichoke using amplified fragment length polymorphism (AFLP) markers and found concordances between molecular and the proposed phenotypic classifications. Indeed, the performed multivariate analysis showed four main clusters that corresponded to the phenotypical typologies of cultivated globe artichoke: “Catanesi” (together with “Violet de Provence”), “Romaneschi,” “Violetti di Toscana,” and “Spinosi.” Later, by analyzing leafy cardoon and globe artichoke accessions using SSR and inter-SSR markers, Pagnotta et al. (2017) observed that “Catanesi” resulted well separated from the other accessions, and the “Spinosi” types formed a well-defined group together with some “Romaneschi” and “Violetti” types. The “Romaneschi” types showed low uniformity, as they were distributed into several genetic subgroups. Thus, except for the “Catanesi” group, the association between molecular groups and phenotypic types was not clear. Moreover, as noted by Pagnotta et al. (2017), the accessions clustered not only based on their typology but also based on the gene bank of origin. They argued that the environmental conditions of the different field gene banks or the methods used to conserve plant materials might have affected the germplasm characteristics. More recently, Pavan et al. (2018) used genotyping by sequencing to analyze samples of globe artichoke, cultivated cardoon (C. cardunculus var. altilis), and wild cardoon (C. cardunculus var. sylvestris), and showed a clear-cut genetic separation between globe artichoke and the cultivated and wild cardoon. When the analysis was repeated assuming three genetic groups, the authors observed the separation of “Catanesi” from all of the other accessions and that the group of green-headed landraces typical of the Apulia region, Italy (Green Apulian), also formed a well-defined group. Furthermore, similarly to Pagnotta et al. (2017) and Pavan et al. (2018) observed that “Romaneschi” were attributed to a variable group that consisted of admixed individuals, which suggested that they should not be considered a genetically uniform varietal typology. To the best of our knowledge, no studies are available that compare the molecular and phenotypic population structures for the same collection of globe artichokes. Thus, in this study, we characterized a germplasm collection comprising 110 of the most important globe artichoke varieties cultivated worldwide by conducting a deep phenotypic characterization in a common garden and under field conditions. A total of thirty-five quantitative phenological, morphological, and yield traits and 23 qualitative descriptors from the International Union for the Protection of New Varieties of Plants (UPOV) were considered to describe the plants in the stages of the first, secondary, and tertiary heads. Thus, the phenotypic data were compared with the results of SSR analysis, to investigate the relationships between molecular and phenotypic variability. This study advances our understanding of the evolution of globe artichoke, strengthens the knowledge of the organization of its gene pool, and can help in rationalizing the operational choices when applying association mapping and defining breeding strategies in this species.

Materials and Methods

Plant Material

A total of 110 globe artichoke accessions were analyzed using both SSR and quantitative traits, as listed in Supplementary Table 1. All of these came from the living artichoke collection maintained by AGRIS at the experimental farm located at Oristano (Sardinia, Italy). The collection comprises most of the globe artichoke varieties cultivated worldwide, with the accessions subdivided into seven main typologies based on their different morphophenotypic traits (primarily related to head characteristics). These typologies are named as “Catanesi” (CAT), “Violetti di Provenza” (VPR), “Violetti di Toscana” (VTO), “Romaneschi” (ROM), “Macau” (MAC), “Spinosi” (SPI), and “Green et al.” (GEA), as reported by Comino et al. (2016). Nine clones of “Spinoso sardo” selected by AGRIS based on morphological and production traits were also analyzed (Supplementary Table 1).

Simple Sequence Repeat Assay

Total genomic DNA was extracted from young leaves (three plants per accession) and characterized using 26 genomic-SSR (gSSR) markers targeting loci distributed through all of the 17 chromosomes of globe artichoke (Supplementary Table 2A). The markers were selected according to their good resolution power and reliability (Scaglione et al., 2009, 2016; Portis et al., 2012, 2016). Chloroplast-SSR (SR) variation was also investigated using 10 primer pairs designed by Weising and Gardner (1999) and 23 designed by Chung and Staub (2003) (Supplementary Table 2B); the markers have been already used to study the population structure of wild cardoon, the progenitor of globe artichoke (Rau et al., 2016), and other plants (see, e.g., Angioi et al., 2009, 2010; Desiderio et al., 2013).

Individual-Based Clustering

To infer the population structure of the globe artichoke collection using multilocus genetic profiles, we first used the model-based (Bayesian) clustering algorithm implemented in the Structure software (Pritchard et al., 2000; Falush et al., 2003). The data were analyzed by setting the admixture model and the correlated allele frequencies options. All runs were carried out with 50,000 burn-ins and 100,000 iterations (Falush et al., 2003). Phenotypic typology was not used as prior information to assist clustering. A total of twenty independent runs were carried out for each number of assumed ancestral populations (K). A range of K from 1 to 10 was explored (Evanno et al., 2005). The K value that best explained the data was determined using the ad hoc statistics (ΔK; Evanno et al., 2005) with the CLUMPAK software (Kopelman et al., 2015). To complement the population structure analysis, discriminant analysis of principal components (DAPC) was used as a multivariate unsupervised method implemented in the R package adegenet (Jombart, 2008; Jombart et al., 2010). DAPC divides genetic variation into between- and within-group components, to maximize the former, and builds linear combinations of alleles (linear discriminants) that best separate the groups. The find.clusters function was used to detect the number of clusters in the collection adopting the K-means clustering method and determining the most likely number of subpopulations based on the Bayesian Information Criterion (BIC). The number of principal components to be retained was determined using the cross-validation function Xval.dapc, based on the root mean square error.

Variation Among Genetic Groups

The total SSR genetic variance (σ2) was partitioned into two components: among-population (σ2a) and within-population (σ2b). The divergence between populations was quantified as FST = σ2a/σ2, where σ2 = σ2a + σ2b. The partition of molecular variance was accomplished by the AMOVA implemented in Arlequin v. 3.1.5.2 (Excoffier and Lischer, 2010). The level of group divergence was also quantified by calculating the RST. While FST is based on the infinite allele model (IAM), RST is based on the stepwise mutation model (SMM), i.e., it also considers the variance of SSR allele size (Gaggiotti et al., 1999). The FST and RST statistics have the same expectations when group divergence is caused only by drift, whereas RST is larger than FST if stepwise-like mutations are also contributing to divergence; thus, comparing the FST and RST values can provide insights into the relative contribution of drift and mutation to group divergence (Hardy et al., 2003). This comparison can be done by testing whether the observed RST is significantly larger than its value after randomizing allele sizes (pRST; Hardy et al., 2003). If the RST is not significantly different from pRST, it is possible to conclude that RST = FST (Hardy et al., 2003). The p-values were obtained after 999 random permutations using the SPAGeDI software (Hardy and Vekemans, 2002).

Diversity and Linkage Disequilibrium Within Genetic Groups

The number of alleles per locus (na), the observed heterozygosity (HO), and the gene diversity (HE, Nei, 1978) were estimated for each genetic group identified by the Structure and DAPC analyzes. When referring to the qualitative traits variation, to avoid confusion with SSR, we indicate the number of classes per trait as n and the gene diversity of Nei (1978) as INei (instead of HE). To determine the coefficient of inbreeding (FIS), total variance (σ2) was partitioned into three components: among populations (σ2a), among individuals within-populations (σ2b), and within individuals (σ2). The total FIS was calculated as σ2b/(σ2b + σ2). Group-specific FIS was also calculated. This coefficient compares the expected and observed heterozygosities and the expected value under the Hardy-Weinberg equilibrium is 0. Thus, FIS > 0 indicates heterozygote deficit compared to the Hardy-Weinberg equilibrium (as a consequence of inbreeding), while FIS < 0 indicates heterozygote excess (as in clonally propagated crops or when selection in favor of heterozygotes is applied). The BOTTLENECK ver. 1.2 software (Piry et al., 1999) was used to determine whether the globe artichoke groups experienced recent founder events or population bottlenecks. In this case, both a reduction of the number of alleles (na) and of expected heterozygosity (HE) is expected, with the former that reduces faster than the latter (Nei, 1975). Given na and the sample size, the program BOTTLENECK computes, for each sample and for each locus, the distribution of the heterozygosity expected under the assumption of mutation-drift equilibrium (H). As na reduces faster than HE, it is expected that, on average, after a bottleneck HE > H (Cornuet and Luikart, 1996). The number and percentage of pairs of SSR loci in linkage disequilibrium (LD; p < 0.05) were calculated using the Arlequin software (Excoffier and Lischer, 2010).

Phenotypic Characterization and Statistical Analysis

The collection was characterized in the AGRIS experimental field (Azienda Palloni, Oristano, Sardinia, Italy). Individual plants were obtained by vegetative propagation using “ovoli” in July. The plants were managed following standard agronomic practices for the crop, both for mineral nutrition and water supply. Phenotypic data were recorded under field conditions, measuring three plants for each variety (single plant = replicate) in a completely randomized design. A total of twenty-one traits were recorded at the beginning of the first head harvest (Supplementary Table 3A). Among these, 11 traits were also recorded for the secondary and tertiary heads (Supplementary Table 3B). The traits “Total number of heads” and “Total weight of heads” were measured for secondary and tertiary heads. The collection was also characterized by recording 23 qualitative phenotypic descriptors defined by UPOV (Supplementary Table 3C). The total phenotypic variance (σ2) was decomposed into two components, within SSR groups (σ2W) and between SSR groups (σ2B), applying Nested Analysis of Variance (NANOVA). For each trait, the variance associated with each component was estimated considering a model with groups, varieties within groups, and replicates as random factors and applying the restricted maximum likelihood (REML) method implemented in the JMP software (ver. 10.0.0, SAS, Institute, NC, United States). Variance components were first used to calculate the heritability of traits. In globe artichoke, cross-fertilization is promoted by protandry (Mauromicale and Ierna, 2000), and when the progenies are tested, they show wide segregation of morphological and production traits (Basnitzki and Zohary, 1987). Thus, high heterozygosity was expected a priori. As the globe artichoke varieties considered in this study were obtained by vegetative propagation (clonally), it is assumed that the three replicates of each variety have the same genotype. This means that genetic differences among varieties, both within groups (σ2W) and between groups (σ2B), can be partially due to maternal and non-additive effects (dominance and epistasis), rather than to additive effects. Thus, the sum σ2W + σ2B is equal to the total genetic variance (σ2) and the ratio between genetic variance (σ2) and total variance (σ2) represents the broad-sense heritability (h2B); i.e., h2B = σ2/σ2, where σ2 = σ2W + σ2B + σ2E, with σ2E representing the environmental variance. To explore the pattern of phenotypic divergence between genetic groups of globe artichoke varieties, variance components were used to calculate the degree of differentiation for quantitative traits between groups by applying the formula QST = σ2B/(σ2B + 2σ2W) (Spitze, 1993; Whitlock, 1999; Leinonen et al., 2008). The QST statistic was introduced by Spitze (1993) as an analog to FST for molecular markers. This equation assumes that FIS = 0 and that σ2B is the additive genetic variance due to differences between populations, and σ2W is the mean additive genetic variance within populations. Successively, Bonnin et al. (1996) derived a generalized relation for any possible value of FIS, for which QST(f) = (1 + FIS)σ2[(1 + FIS)σ2B + 2σ2W]. To calculate the QSTs, we also applied this formula by entering the estimated total FIS (see previous paragraph) and assuming that the FIS of a quantitative trait locus is the same as the FIS of the neutral markers. We presented two QST values calculated as in Spitze (1993) and Bonnin et al. (1996). In our case, σ2B and σ2W are components of the total genetic variance, as they can include both non-additive and maternal effects. To mark this difference compared to Spitze’s QST and Bonnin’s QST(f), we used the notation QST_B and QST(f)_B, where the subscript “B” means “Broad.” The qualitative descriptors of the UPOV were statistically treated as molecular markers, considering each trait as a single SSR marker and the states of the trait as different alleles. Thus, for each group, statistics of genetic diversity (na, HE) and divergence (FST_) were calculated as for the SSR markers (Tanto Hadado et al., 2009). The correlations between the three divergence measures (i.e., FST, QST_B, FST_qlt) were determined. The values of FST_ were compared with the FST values obtained for SSRs to flag qualitative traits putatively under selection, using the Arlequin software v. 3.5.1.2 (see below). The levels of molecular and quantitative variations were compared following the rationale of the FST – QST approach (Spitze, 1993; Whitlock, 1999; Leinonen et al., 2008). This is based on the relative magnitude of the FST and QST and allows inferences to be made about the relevance of selection in the evolution of a specific quantitative trait (see “Discussion” section). To exclude possible selection effects on FST values calculated for the SSR loci, the (putatively) neutral “benchmark” FST was calculated by conducting the outlier tests as implemented in Arelquin v. 3.5.1.2 (100,000 reps). This test is similar to that implemented in FDIST computer program (Beaumont and Nichols, 1996). We ran simulations reiteratively, eliminating after each simulation the markers that showed the signature of divergent selection (p < 0.05). After obtaining a dataset without markers with signatures of divergent selection, the average FST across loci was assumed as the putatively neutral benchmark. The confidence intervals (C.I.) of the putatively neutral FST were determined by bootstrapping over loci (100,000 replicates).

Results

Population Structure Based on Genomic SSRs

All SSR markers tested were polymorphic (Minimum Allele Frequency, MAF, > 1%). A total of 152 alleles were observed: two to 13 alleles per locus (ne) were detected, giving a mean of 5.85. Gene diversity (HE) per locus varied from 0.231 to 0.822 (Supplementary Table 2A). When the 26 SSRs were combined, each accession showed a unique multilocus profile. Based on Evanno’s ΔK statistics, the highest hierarchical level of the population structure was K = 2 (Supplementary Figure 1). However, a secondary peak at K = 5 was also evident, which indicated substructures (Supplementary Figure 1). The genetic subdivisions resembled phenotypic typologies quite well (Figure 1A). Indeed, at K = 2, most of the Catanesi and Violetti di Provenza types clustered together and were clearly separated from the remaining accessions; most of the individuals (> 95%) were assigned to their group with q > 0.80 (Figure 1B). The two additional clustering steps first separated most Spinosi (K = 3) and then a group that included all the Violetti di Toscana and most green-headed types (K = 4; Figure 1A). Finally, at K = 5, most of the Romaneschi were split from Macau. At K = 5 Catanesi and Violetti di Provenza still clustered together, while Violetti di Toscana grouped with many Green-headed accessions (Figure 1A). This remained true for K up to 10 (not shown). The phenotypic typology “Green et al.” was not genetically well-defined. Indeed, it comprised varieties assigned to different genetic groups that were typical of Violetti di Toscana, Macau, and Catanesi-Violetti di Provenza (Figure 1A).
FIGURE 1

Structure analysis. (A) Individual to population assignment. Individuals are ordered and grouped according to a priori knowledge of phenotypic typology. (B) Distribution between groups of individuals assigned (q > 0.80) or admixed (q < 0.20) at K = 2 and K = 5. (C) Net-nucleotide distances between pairs of genetic groups (K = 5). (D) Neighbor-joining (N-J) tree depicting distances between groups (K = 5); the N-J tree was drawn by Structure software based on the pairwise net-nucleotide distances among the five detected groups.

Structure analysis. (A) Individual to population assignment. Individuals are ordered and grouped according to a priori knowledge of phenotypic typology. (B) Distribution between groups of individuals assigned (q > 0.80) or admixed (q < 0.20) at K = 2 and K = 5. (C) Net-nucleotide distances between pairs of genetic groups (K = 5). (D) Neighbor-joining (N-J) tree depicting distances between groups (K = 5); the N-J tree was drawn by Structure software based on the pairwise net-nucleotide distances among the five detected groups. Based on these data, from here on, we adopted the label CAT-VPR to designate the nuclear SSR genetic group related to the Catanesi-Violetti di Provenza phenotypic typology; similarly, the labels SPI, VTO-GEA, ROM, and MAC indicate the genetic groups associated with the phenotypic typologies Spinosi, Violetti di Toscana-Green et al., Romaneschi, and Macau, respectively. Overall, at K = 5, most of the accessions were assigned to their group with high probability (q > 0 80; Figure 1B). Overall, there was a strong association between phenotypic typologies and genetic classifications (χ2 test: p < 10–4). The highest frequency (∼30%) of admixed individuals (q < 0.80) was observed within the VTO-GEA group. CAT-VPR and SPI showed the highest net-nucleotide distance (0.171), while the lowest distances were observed between MAC and ROM (0.084), VTO-GEA and MAC (0.073), and VTO-GEA and ROM (0.071) (Figure 1C). According to AMOVA, the average genetic divergence (FST) between these five groups was moderate, being 0.170 (Table 1) while putatively neutral FST was slightly lower (0.134).
TABLE 1

Genetic divergence (FST, RST) among the five groups of varieties of cultivated globe artichoke as determined by AMOVA over loci considering two sources of variation: among populations and among individuals within populations.

Fixation indexValueP-valueMedianConfidence interval
95%99%99.9%
FST0.170<10–50.1690.137–0.2030.125–0.2160.111–0.230
FST (PN)0.134<10–50.1340.111–0.1560.101–0.1660.089–0.175
RST0.148<10–50.1480.098–0.2020.083–0.2210.066–0.245
pRST0.1790.3640.119–0.239

p-value: for observed F

Median, Confidence interval, obtained by bootstrapping over loci.

F

F

R

pR

Genetic divergence (FST, RST) among the five groups of varieties of cultivated globe artichoke as determined by AMOVA over loci considering two sources of variation: among populations and among individuals within populations. p-value: for observed F Median, Confidence interval, obtained by bootstrapping over loci. F F R pR According to RST, the average divergence among the genetic groups was 0.148 (Table 1). Furthermore, the RST and FST values were not statistically different (Table 1) because the observed RST (0.148) and the RST calculated by randomizing allele size were not different (pRST = 0.179; p = 0.364; Table 1). Therefore, in this study, stepwise-like mutations are not contributing to group divergence. The neighbor-joining tree emphasizes the proximity between ROM, MAC, and VTO-GEA, and their distance from CAT-VPR and SPI (Figure 1D). When applying the DAPC analysis and the function find.cluster with the K-mean clustering method, the BIC values plot showed a knee when the number of groups was five, although the minimum BIC value was observed for K = 9 (Figure 2A). Thus, based on the BIC, five was a parsimonious number of clusters for the modeling of our data. DAPC was carried out considering the five groups detected by the K-means clustering method. Four linear discriminants were maintained for the analysis (Figure 2A). The first two linear discriminants were sufficient to identify five groups that essentially corresponded to those previously identified by Structure (Figure 2A). At opposite extremes of the first linear discriminant, there were CAT-VPR and SPI; the second linear discriminant mainly separated VTO-GEA from both ROM and MAC. The third and the fourth linear discriminants refined the relationships among the groups (Supplementary Figure 2). For further details on the classification of the individual accessions provided by DAPC see Supplementary Figure 3.
FIGURE 2

Discriminant analysis of the principal components (DAPC). (A) Five groups of cultivated globe artichoke varieties as depicted by the first two discriminant functions. Insets, left: eigenvalues for the first four discriminant functions. The small square embedded in the lower left part of the figure represents the BIC values as a function of the numbers of assumed genetic groups. Roman numbers (I and II) indicate the chloroplast alleles observed within each nuclear SSR group. (B) Correspondence between the Structure and DAPC classifications.

Discriminant analysis of the principal components (DAPC). (A) Five groups of cultivated globe artichoke varieties as depicted by the first two discriminant functions. Insets, left: eigenvalues for the first four discriminant functions. The small square embedded in the lower left part of the figure represents the BIC values as a function of the numbers of assumed genetic groups. Roman numbers (I and II) indicate the chloroplast alleles observed within each nuclear SSR group. (B) Correspondence between the Structure and DAPC classifications. The correspondence between the Structure and DAPC classifications is reported in Figure 2B. Only four varieties were classified into different groups by the two methods (Figure 2B). These were Testa di Ferro, Gross Camus, Spinoso di Gonnos, and Spinoso di Gela that, based on Structure, were all admixed, with the coefficient of membership (q) to the prevalent genetic groups between 0.489 and 0.644. In two of the four cases, the “mismatches” between the two clustering methods involved varieties that were moved between closely related groups. Indeed, Spinoso di Gonnos was assigned to ROM by Structure, while it was assigned to MAC by DAPC; on the contrary, Testa di Ferro was assigned to MAC by Structure, while it was assigned to ROM by DAPC. Therefore, both the model-based (Structure) and the unsupervised (DAPC) methods agreed when identifying five groups.

Genomic SSR Polymorphism Within Groups

There was a significant difference between the groups for the number of alleles per locus (na) (Wilcoxon’s test: χ2 = 25.381; d.f. = 4; p = 0.00004; Table 2) with VTO-GEA that showed the highest value. The groups also showed significantly different levels of gene diversity (HE) (Wilcoxon’s non-parametric test: χ2 = 18.69, d.f. = 4; p = 0.0009). The VTO-GEA group showed the highest HE and was significantly more diverse than SPI and CAT-VPR, which showed the lowest HE values; ROM and MAC were in intermediate positions (Table 2).
TABLE 2

Descriptive genetic statistics for the five SSR groups identified within the gene pool of the cultivated globe artichoke according to the Structure and DAPC analyzes.

Molecular groupN° of alleles per locus (na)N° polymorphic SSR lociPolymorphism (%)Gene diversity (HE)Inbreeding (FIS)1N° loci deviating HW equilibriumP (rand. FIS ≥ Obs.) LD
N° of pairs (p < 0.05)(%)
CAT-VPR3.1 b2596.20.440 b−0.617 b17 (23)1.00016856.0
ROM3.3 b26100.00.570 ab−0.440 b12 (23)0.99911736.0
MAC3.3 b2492.30.516 ab−0.051 a0 (15)0.6948631.2
SPI2.8 b2492.30.438 b−0.445 b9 (22)1.00012946.7
VTO-GEA5.1 a26100.00.606 a−0.122 a6 (18)0.99911635.7
Mean 3.5 25.0 96.2 0.514 0.317 11.6 (17.6) 0.939 123 41.1

The number of loci deviating from the H-W equilibrium is given for p < 0.001 (outside parentheses) and for p < 0.05 (within parentheses).

Descriptive genetic statistics for the five SSR groups identified within the gene pool of the cultivated globe artichoke according to the Structure and DAPC analyzes. The number of loci deviating from the H-W equilibrium is given for p < 0.001 (outside parentheses) and for p < 0.05 (within parentheses). AMOVA analysis performed considering the hierarchical level “within-individual” showed a global negative and significant coefficient of inbreeding, FIS (−0.317, p < 10–5; Table 2; C.I.99% from −0.408 to −0.153) that indicated an excess of heterozygotes within the five genetic groups. In all cases except MAC, the group-specific FIS were negative and statistically significant (p < 10–5). The FIS value varied significantly between groups (Wilcoxon’s non-parametric test: χ2 = 34.61; d.f. = 4; p = 5.58 × 10–7). CAT-VPR, SPI, and ROM showed stronger excess of heterozygotes than MAC and VTO-GEA (Table 2). Bottleneck analysis confirmed frequent significant excess of heterozygosity compared to the expectation under mutation-drift equilibrium. This was more evident when a pure IAM model was considered when the frequency of loci evolving under SMM was < 70%, and for CAT-VPR, ROM, and SPI (Supplementary Table 4). In general, the level of LD within the groups was relatively high, ranging from 35.7% (VTO-GEA) to 56.0% (CAT-VPR), with an average of 41.1% of pairs of loci in LD (p < 0.05). The three groups with the highest LD values were also those with the lowest FIS (i.e., CAT-VPR, SPI, ROM) (Table 2).

Chloroplast SSR Analysis

Universal cpSSR primers revealed low levels of cytoplasmic variability. Indeed, of the 27 assayed loci, only one (cpSSR-7) was polymorphic, with two alleles hereafter named “I” and “II.” The “I” allele was the most frequent (0.67) (Supplementary Figure 4). However, the chloroplast and nuclear patterns of variation were correlated (Supplementary Figure 4; χ2 test: p < 0.01). The two most divergent nuclear SSR groups (CAT-VPR, SPI) also show alternative cpSSR alleles: CAT-VPR was monomorphic for allele “I,” while SPI was monomorphic for allele “II” (Figure 2).

Patterns of Phenotypic Variation Within and Between the SSR-Derived Groups

Table 3 reports the results of the NANOVA for the 21 quantitative traits measured at the harvest of the first head. Within the SSR groups, the varieties showed different means for all of the quantitative traits (p < 10–5). The broad-sense heritability (h2B) within groups varied widely depending on the traits considered, from 0.48 (stem diameter) to 0.98 (harvest time) (Table 3), with an average of 0.65. The five SSR-derived groups showed different mean h2B across traits. Indeed, these ranged from 0.56 (CAT-VPR) to 0.77 (VTO-GEA), with ROM (0.60), MAC (0.63), and SPI (0.70) in intermediate positions. These differences between groups were statistically significant based on ANOVA (p < 0.01), while the Tukey-Kramer HD test separated CAT-VPR from the other four groups (p < 0.05).
TABLE 3

Nested Analysis of Variance (NANOVA) to test the significance of the differences among the varieties within the SSR groups and of the differences between the SSR groups for the 21 traits measured at the stage of the harvest of the first head.

TraitVariance
Average heritability within groups (h2B)
Within SSR groups
Between SSR groups
Divergence between groups
SSF P SSF P (QST_B)1[QST(f)_B]2
Harvest time243,597.7214.0 **** * 151,634.015.53 **** * 0.977 0.200 0.147
Plant height94,616.021.1 **** * 74,019.919.45 **** * 0.748 0.279 0.210
Polar length of head410.67.3 **** * 66.94.03 ** 0.6480.1020.073
Equatorial length of head242.54.9 **** * 29.42.98 * 0.6160.1120.080
Ratio polar/equatorial length of head7.119.3 **** * 1.96.77 **** * 0.812 0.223 0.164
Weight of head1,042,590.35.0 **** * 45,672.91.08n.s.0.5820.0000.000
Length of stem18,954.310.5 **** * 3,701.64.84 ** 0.637 0.183 0.133
Diameter of stem28.05.1 **** * 1.21.09n.s.0.4760.0110.007
Weight of stem1,549,831.08.5 **** * 170,423.42.72 * 0.5980.0910.065
Weight of head + stem4,199,756.26.2 **** * 329,774.61.94n.s.0.5820.0590.041
N° of leaves on stem306.48.2 **** * 22.31.80n.s.0.5180.0100.007
Weight of the leaves on stem3,508,214.18.7 **** * 493,150.13.48 * 0.6370.0630.044
Weight of head + stem + leaves10,850,343.37.5 **** * 1,178,972.72.69 * 0.6850.0660.047
Bract height775.54.84 **** * 86.22.61 * 0.5400.0790.055
Bract width8,331.614.72 **** * 2,614.47.38 **** 0.765 0.163 0.118
Bract height/width0.911.84 **** * 0.38.10 **** 0.7870.1510.109
Bract thickness169.07.61 **** * 25.93.60 ** 0.6190.1140.081
Ten-bracts weight28,020.88.52 **** * 3,283.52.75 * 0.6320.0700.049
Receptacle height1,588.44.24 **** * 116.61.72n.s.0.5190.0140.009
Receptacle width19,935.16.96 **** * 3,704.24.37 ** 0.6740.1130.081
Receptacle height/width0.73.61 **** * 0.25.58 *** 0.4950.1200.086

*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001; *****p < 10

Gray shading, traits for which the SSR groups have different means (p < 0.05; Tukey-Kramer multiple comparison tests).

For each trait, the value of the Q

The results of the F

Nested Analysis of Variance (NANOVA) to test the significance of the differences among the varieties within the SSR groups and of the differences between the SSR groups for the 21 traits measured at the stage of the harvest of the first head. *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001; *****p < 10 Gray shading, traits for which the SSR groups have different means (p < 0.05; Tukey-Kramer multiple comparison tests). For each trait, the value of the Q The results of the F Based on ANOVA and Tukey-Kramer HD multiple comparison test, 10 out of 21 quantitative traits differed between the five genetic groups (with p-values from < 0.01 to < 10–5; Table 3). The strongest group divergence was seen for plant height (QST_B = 0.279), polar/equatorial head-length ratio (QST_B = 0.223), harvest time (Q = 0.200), and length of stem (QST_B = 0.183) (Table 3). Among the traits that defined the size and shape of the bracts and the receptacle, the width of the bract and bract height/length ratio was the most diverse among the groups (QST_B = 0.163, 0.151, respectively; p < 10–4 in both cases; Table 3). The genetic groups showed low divergence for weight of first head (QST_B = 0.00), diameter of stem (QST_B = 0.011), number of leaves on stem (QST_B = 0.010), and receptacle height (QST_B = 0.014). When considering the stages of the harvest of the secondary and tertiary heads, these globe artichoke varieties showed statistically different means (p < 10–5) for all the traits examined (Table 4). Again, the SSR-derived groups were strongly divergent for head shape (polar/equatorial head-length ratio, QST_B = 0.26 and 0.28, at the two considered stages, respectively) and length of the stem (QST_B = 0.28, 0.22, respectively), and to a lesser extent, for harvest time (QST_B = 0.17, 0.16, respectively). Furthermore, this analysis showed differences in yield performances between the genetic groups, as QST_B were 0.17 and 0.12 for the secondary and tertiary heads, respectively. Among the yield components, QST_B tended to be higher for the average number of heads (0.13, 0.14, respectively) than for the average weight of the heads (0.11, 0.02, respectively). As expected, when QST_B was calculated considering the observed average negative FIS, the QST values were reduced (Tables 3, 4).
TABLE 4

Nested Analysis of Variance (NANOVA) for the 11 traits measured at the stage of secondary and tertiary heads.

Secondary heads
Tertiary heads
Average
Traith2BQST_BQST(f)_B p h2BQST_BQST(f)_B p h2BQST_BQST(f)_B
Average harvest time0.89 0.17 0.12 **** * 0.89 0.16 0.12 **** * 0.89 0.16 0.12
Average polar length of head0.790.160.11 *** 0.880.120.08 *** 0.830.140.10
Average equatorial length of head0.610.150.11 *** 0.650.030.02n.s.0.630.090.07
Average ratio polar/equatorial length0.74 0.26 0.19 **** 0.73 0.28 0.21 **** * 0.74 0.27 0.20
Average number of heads0.340.130.09 **** 0.510.140.10 **** * 0.420.130.10
Average weight of heads0.650.110.08 *** 0.710.020.02n.s.0.680.070.05
Total weight of heads0.53 0.17 0.13 **** * 0.580.120.09 **** * 0.550.150.11
Average length of the stem0.65 0.28 0.20 *** 0.66 0.22 0.16 **** 0.65 0.25 0.18
Average diamter of the stem0.44−0.01−0.01n.s.0.630.050.03n.s.0.530.020.01
Average weight of the stem0.690.100.07 * 0.560.080.06 * 0.620.090.07
Average number of leaves of the stem0.330.050.04 * 0.580.130.09 **** * 0.460.090.07
Overall average0.600.150.110.670.120.090.640.140.10
Correlation (r) between QST_B and h2B0.4990.5040.1870.374
Significance level of the correlation (P)0.118P > 0.05P > 0.05P > 0.05

*P < 0.05; ***P < 0.001; ****P < 0.0001; *****P < 10

Nested Analysis of Variance (NANOVA) for the 11 traits measured at the stage of secondary and tertiary heads. *P < 0.05; ***P < 0.001; ****P < 0.0001; *****P < 10

Relationship Between Molecular and Quantitative Trait Variation

At the within-group level, the correlation between HE and the average h2B was positive but not significant (Pearson r = 0.72; n = 5; p > 0.05). There was a strong and positive correlation between QST_B and the average within-groups h2B (REML method: r = 0.736, n = 21, p = 0.0001; 95% C.I.: 0.446–0.887; Figure 3A). Furthermore, of the 21 quantitative traits measured at the harvest of the first head, four had QST_B values that exceeded the C.I. for the molecular, putatively neutral FST (Figure 3A and Table 3). These were plant height, head shape (as polar/equatorial head-length ratio), harvest time, and stem length, which exceeded the upper C.I.99.9%. Furthermore, the bract width was also significant as it was outside the C.I.95%. Among these traits, head shape, stem length and, to a lesser extent, harvest time, confirmed their outlying behaviors also at the stages of the secondary and tertiary heads (Table 4). For the “total weight of heads,” QST > FST for the secondary heads, but not for the tertiary heads. The correlation between the degree of quantitative-genetic subdivision among populations (QST) and the average heritability within populations (h2B) did not reach statistical significance (p = 0.118, 0.573, respectively). Based in QST(f)_B, the traits “plant height” and “head shape” remain statistically significant (Figure 3B) exceeding the C.I.95% and C.I.99.9% for the neutral FST.
FIGURE 3

Relationships between the level of quantitative-genetic subdivision among the SSR groups (QST_B) and the average heritability within the SSR groups (h2B) for the 21 phenotypic traits recorded at the harvest of the first head, as listed in Table 3. QST_B was calculated assuming FIS = 0.00 (A) or for the neutral FIS = –0.315 estimated from the SSR data (B). Orange continuous line, regression line; green continuous line: average level of genetic subdivision among groups (FST) based on putatively neutral SSRs; green, yellow, and red dashed lines: lower and upper limits (95%, 99%, 99.9%, respectively) of the neutral FST based on bootstrap.

Relationships between the level of quantitative-genetic subdivision among the SSR groups (QST_B) and the average heritability within the SSR groups (h2B) for the 21 phenotypic traits recorded at the harvest of the first head, as listed in Table 3. QST_B was calculated assuming FIS = 0.00 (A) or for the neutral FIS = –0.315 estimated from the SSR data (B). Orange continuous line, regression line; green continuous line: average level of genetic subdivision among groups (FST) based on putatively neutral SSRs; green, yellow, and red dashed lines: lower and upper limits (95%, 99%, 99.9%, respectively) of the neutral FST based on bootstrap. The highest level of population structure that separated CAT_VPR from the other groups (K = 2; Figure 1) was associated with an abrupt variation in the harvest time (Figure 4A). Indeed, the CAT-VPR group was 35–42 days earlier, on average, than the other groups. The SPI group, which split at K = 3 (Figure 1), tended to be the earliest among the late groups, for both the secondary and tertiary heads (Figure 4A). Furthermore, the differences between the SSR-derived groups for yield performance paralleled those for harvest time (Figure 4B). Indeed, the correlation between these two variables was positive and significant (REML: r = 0.940; C.I.95% = 0.340–0.997; p = 0.0174), i.e., the yield increased with the lateness. Thus, the CAT-VPR yield was 41.7% lower than that of VTO-GEA, the most productive group (Figure 4B). The ROM, MAC, and SPI groups tended to have intermediate positions. In all of the groups, the secondary heads accounted for ∼50% of the total yield (from 46% to 52%). The variation between groups for the yield components was also notable (Table 5). This was particularly evident for the number of secondary and tertiary heads (Table 5), where the top-ranking group was VTO-GEA (4.1 and 4.8, respectively), while the bottom-ranking group was CAT-VPR (2.9 and 2.6, respectively). The CAT-VPR group also occupied the last position for the weights of the heads (157 g and 113 g, respectively), while the MAC group tended to have larger heads, particularly for the primary and secondary heads (253 and 209 g; Figure 4B and Table 5, respectively).
FIGURE 4

Differences between the SSR groups for the average harvest time (A) and yields of primary, secondary, and tertiary heads (B). Group means are compared referring to the same head type (i.e., primary, secondary, and tertiary) and means with different letters are significantly different (p < 0.05; Tukey-Kramer HD test). In panel (A) green boxes = interquartile range; red bar = mean; white bar = median.

TABLE 5

Average numbers and weights of the secondary and tertiary heads for the five SSR-derived groups of cultivated globe artichokes.

GroupNumber (n)
Weight (g)
SecondaryTertiarySecondaryTertiary
CAT-VPR2.9 a2.6 a157 a113 a
ROM3.7 ab3.7 b181 ab125 b
MAC3.4 ab3.6 b209 b123 b
SPI3.2 a3.5 b196 b123 b
VTO-GEA4.1 b4.8 c203 b129 b
Range (%)41.384.629.314.1

Data are means, where those with different letters are significantly different (p < 0.05; Tukey’s honestly significant difference multiple comparisons test).

Differences between the SSR groups for the average harvest time (A) and yields of primary, secondary, and tertiary heads (B). Group means are compared referring to the same head type (i.e., primary, secondary, and tertiary) and means with different letters are significantly different (p < 0.05; Tukey-Kramer HD test). In panel (A) green boxes = interquartile range; red bar = mean; white bar = median. Average numbers and weights of the secondary and tertiary heads for the five SSR-derived groups of cultivated globe artichokes. Data are means, where those with different letters are significantly different (p < 0.05; Tukey’s honestly significant difference multiple comparisons test). Bract shape (as width and height/width ratio) varied according to clustering history (Table 6). Indeed, CAT-VPR showed the most oblong bracts, while VTO-GEA, MAC, and ROM showed the narrowest and most rounded bracts. The SPI group occupied a transitional position. For head shape (as polar/equatorial length ratio), the SPI showed the most oblong head (Table 6). Two close groups based on SSRs, MAC, and ROM, both showed the most spherical heads while VTO and CAT showed intermediate characteristics. Plant height and stem length did not follow the history of clustering based on Structure (Table 6). The average plant height was positively correlated with the average inbreeding coefficient, FIS (Figure 5). As the group-specific FISs were all negative, this meant that the lower the heterozygote excess, the taller the plants. Furthermore, when the plant stature increased, HE and h2B also increased, while LD tended to decrease (Figure 5).
TABLE 6

Differences between SSR-derived groups for plant height, stem length, and head and bract shape.

SSR-derived groupPlant height (cm)Stem length (cm)Head shape (p/E)Bract shape
Width (mm)Height/width
CAT-VPR67.5 b28.6 ab1.17 ab42.7 a0.43 b
SPI82.9 ab29.8 ab1.2 ab41.6 ab0.46 ab
VTO-GEA100.5 a29.7 ab1.19 ab38.3 b0.48 a
MAC103.5 a36.1 a1.06 bc38.0 ab0.49 a
ROM79.1 b22.6 b1.01 c35.3 b0.49 a

p/E, polar/equatorial length ratio.

Data are means, where those with different letters are significantly different (p < 0.05; Tukey’s honestly significant difference multiple comparisons test).

FIGURE 5

Relationships between coefficient of inbreeding (FIS), plant height (cm), gene diversity (HE), linkage disequilibrium (LD), and average heritability (h2B) for the five SSR groups of globe artichokes. Bubble size is proportional to the level of LD within the groups (as% of pair of loci with p < 0.05; Table 2); bubble shading (darkness) is proportional to the level of HE (Table 2). The number within each bubble indicates the average h2B across the quantitative traits.

Differences between SSR-derived groups for plant height, stem length, and head and bract shape. p/E, polar/equatorial length ratio. Data are means, where those with different letters are significantly different (p < 0.05; Tukey’s honestly significant difference multiple comparisons test). Relationships between coefficient of inbreeding (FIS), plant height (cm), gene diversity (HE), linkage disequilibrium (LD), and average heritability (h2B) for the five SSR groups of globe artichokes. Bubble size is proportional to the level of LD within the groups (as% of pair of loci with p < 0.05; Table 2); bubble shading (darkness) is proportional to the level of HE (Table 2). The number within each bubble indicates the average h2B across the quantitative traits.

Comparison Between Molecular and Qualitative Trait Variations

Among the 23 UPOV descriptors applied, 20 were polymorphic throughout the entire collection (Table 7). Within each group, the number of polymorphic traits ranged from 15 for ROM and SPI, to 20 for VTO-GEA (Table 7). VTO-GEA was the group with the highest diversity, based on both the number of phenotypic classes, nc, (2.73) and Nei’s diversity index, INei, (0.43), while the lowest INei (0.28) was recorded for CAT-VPR (Table 7). This confirmed the observation based on the average h2B within the groups and the average HE with SSRs. The average divergence for qualitative traits (FST_qlt) among the five SSR-derived groups was 0.20. This is close to the estimated value for the SSR markers (FST = 0.17) and quantitative traits (QST_B = 0.18). However, FST_qlt varied widely among the traits (Figure 6). In five cases, FST_qlt was large enough to fit outside the “neutral envelope” obtained by simulation using SSRs (Figure 6). The best evidence here was for leaf incision (FST_qlt = 0.54; p < 0.01) and leaf spininess (FST_qlt = 0.8, p = 0.01), followed by bract curvature (FST_ql t = 0.44, p < 0.05), spine length (FST_qlt = 0.40, p < 0.05) and apex shape (FST_qlt = 0.38, p < 0.05).
TABLE 7

Diversity statistics (Nei’ diversity index, INei, and number of phenotypic classes, nc) for the 23 UPOV descriptors applied within and between the five SSR groups of the cultivated globe artichoke.

TraitParticularCAT-VPR
ROM
MAC
SPI
VTO-GEA
nc
Nei diversity index (INei)
ncINeincINeincINeincINeincINeiAverages.d.TotalAverages.d.Total
LobeShape of tip (excluding terminal lobe)10.0010.0010.0010.0010.001.00.0010.000.000.00
Shape of tip of secondary lobes10.0010.0010.0010.0010.001.00.0010.000.000.00
LeafAttitude20.1120.3320.3910.0030.572.00.7130.280.230.43
Spininess10.0010.0020.2030.6020.061.80.8430.170.250.14
Incisions30.5310.0010.0010.0020.061.60.8930.120.230.36
Hairiness on upper side10.0010.0010.0010.0010.001.00.0010.000.000.00
Leaf bladeIntensity of green color (upper side)20.5130.6920.3920.3330.682.40.5530.520.160.66
Hue of green color30.4110.0020.3920.3320.502.00.7130.330.190.41
Intensity of gray hue20.5120.5130.6720.6030.572.40.5530.570.070.55
Blistering30.3830.6530.6030.7340.513.20.4540.580.130.53
PetioleAnthocyanin coloration at base40.6940.7520.5620.6040.753.21.1040.670.090.71
Central flower headShape in longitudinal section30.5440.7540.7830.7350.743.80.8450.710.090.73
Shape of tip20.3630.6430.7330.7330.392.80.4540.570.180.62
Time of appearance50.7230.6440.7330.7340.603.80.8450.680.060.76
Anthocyanin coloration of inner bracts30.4220.5120.3620.6030.552.40.5530.490.100.55
Density of inner bracts30.4620.5130.6930.8030.642.80.4530.620.140.60
Outer bractMain shape10.0020.4420.2010.0030.171.80.8430.160.180.14
Apex shape20.1610.0030.3820.3330.592.20.8430.290.220.48
Color (external side)30.2630.5630.6420.3340.723.00.7140.500.200.64
Reflexing (of tip)30.1120.4420.2010.0030.492.20.8430.250.210.45
Spine length20.1110.0020.2030.6030.542.20.8450.290.270.43
Mucron10.0020.5120.3610.0020.451.60.5520.260.250.29
ReceptacleShape in longitudinal section20.2130.5620.5620.3320.302.20.4530.390.160.38
Mean 2.30 0.28 2.09 0.37 2.26 0.39 1.96 0.37 2.78 0.43 2.28 0.314 3.13 0.37 0.05 0.49
s.d. 1.06 0.24 1.00 0.29 0.86 0.26 0.83 0.31 1.04 0.26 0.96 0.108 1.14 0.27 0.03 0.18
FIGURE 6

FST outlier analysis for qualitative traits. White dots, qualitative traits; green dots, SSR loci. The qualitative traits were treated as multiallelic loci. The divergence between populations was calculated for both qualitative trait (FST_qlt) and SSR (FST_SSR) conditioned to the heterozygosity between populations. The data obtained for the qualitative traits were then overlapped with the neutral envelope obtained by simulation using 26 SSRs. Blue and red dotted lines, 95% and 99% limits of the neutral envelope, respectively.

Diversity statistics (Nei’ diversity index, INei, and number of phenotypic classes, nc) for the 23 UPOV descriptors applied within and between the five SSR groups of the cultivated globe artichoke. FST outlier analysis for qualitative traits. White dots, qualitative traits; green dots, SSR loci. The qualitative traits were treated as multiallelic loci. The divergence between populations was calculated for both qualitative trait (FST_qlt) and SSR (FST_SSR) conditioned to the heterozygosity between populations. The data obtained for the qualitative traits were then overlapped with the neutral envelope obtained by simulation using 26 SSRs. Blue and red dotted lines, 95% and 99% limits of the neutral envelope, respectively. Interestingly, there was strong correlation between the qualitative traits and SSR distances between the pairs of groups (REML: r = 0.918; n = 10; L95% = 0.685; U95% = 0.981; p = 1.76 × 10–4) (Figure 7A). This was stronger than the positive and marginally significant correlation observed between quantitative-genetic divergence (QST_B) and SSR genetic divergence (FST) between the pairs of SSR-derived groups (REML r = 0.670, n = 10; L95% = 0.070, U95% = 0.914; p = 0.034; Spearman-rank ρ = 0.636; p = 0.048) (Figure 7).
FIGURE 7

Relationships between the degree of phenotypic divergence for qualitative (A) and quantitative (B) traits for the phenotypic traits listed in Supplementary Table 3 and the pairwise genetic distances between groups for the SSR markers.

Relationships between the degree of phenotypic divergence for qualitative (A) and quantitative (B) traits for the phenotypic traits listed in Supplementary Table 3 and the pairwise genetic distances between groups for the SSR markers.

Discussion

The Globe Artichoke Cultivated Gene Pool Clusters Into Five Subgroups

This study shows that the analyzed globe artichoke accessions clustered into two main genetic groups. The first comprises the Catanesi and Violetti di Provenza types and is quite uniform. The second is more complex and includes four additional subgroups related to Spinosi, Green et al., Violetti di Toscana, Romaneschi, and Macau. Based on the phenotypic traits and multivariate analysis, Elia and Miccolis (1996) distinguished five main morphophenological clusters: “early,” “medium-early,” “late with small head,” “late-violet,” and “late with large head.” This genetic classification based on SSRs shows good agreement with that of Elia and Miccolis (1996) based on the phenotypic data. Indeed, their “early” and “medium-early” clusters correspond to our CAT-VPR group, which is composed of Catanesi and Violetti di Provenza, and is characterized by long-shaped and small main and secondary heads, low-medium yield, early harvest, and plant with low vigour (short, with low numbers of heads). Their “late with small head” cluster corresponds to our SPI group, which includes Spinosi types characterized by long-shaped and medium-small heads and late harvest. The “late-violet” cluster corresponds to VTO-GEA that includes “Violetti di Toscana” and several other varieties assigned to the category “Green et al.”. On average, these plants were vigorous (tall and high-yielding) and showed less elongated heads than the previous groups. Finally, the “late with large head” cluster correlates with our ROM and MAC groups, which are characterized by lateness and large sizes of the principal and secondary heads, with a shape that tends to be spherical. Based on AFLP markers, Lanteri et al. (2004) reported that Catanesi and Violetti di Provenza were closer to Romaneschi and Macau than to Spinosi and Violetti di Toscana. Differently, in our study, Romaneschi and Macau are closer to Violetti di Toscana and to several varieties classified as “Green et al.”. As the collections analysed in Lanteri et al. (2004) and in our study are similar, the observed differences could be inherent to the different markers used (AFLP vs. SSR). Indeed, AFLPs, are biallelic and dominant markers, while SSRs are multiallelic and codominant markers. Moreover, AFLPs and SSR markers have different mutational models and rates. The mutation rates of SSRs are between 10–3 and 10–4, as inferred from genetic data (Estoup and Angers, 1998; Mariette et al., 2001; Garoia et al., 2007) and estimated from observed mutations (Vigouroux et al., 2002; Thuillet et al., 2005). The mutation rate for AFLPs is lower being between 10–6 and 10–5 (Mariette et al., 2001; Gaudeul et al., 2004; Kropf et al., 2009). On this basis, different marker types can lead to different precision when assessing relationships between single individuals, as well as to different estimations of within- and between-population components of genetic diversity (see, e.g., Gaudeul et al., 2004). Moreover, high evolutionary rates necessarily result in low phylogenetic signals, and conversely, low evolutionary rates result in high phylogenetic signals (Revell et al., 2008); thus, with their slower mutation rate, AFLP markers can highlight more ancient population splits (Flanders et al., 2009). Therefore, the population genetic structure described by Lanteri et al. (2004) might represent a more ancient one than that depicted in this study.

The Signature of Long-Term Clonal Propagation

The chloroplast and nuclear genetic polymorphisms as well as the molecular and phenotypic population structures, were associated. This is expected if there are multiple germplasm sources contemporarily differentiated for nuclear, organellar loci, and phenotypic traits. Furthermore, LD was detected within each of the five groups, and the diversity statistics based on SSRs and quantitative and qualitative traits are correlated. These results indicate that the five groups of globe artichoke varieties were genetically relatively isolated, and that the recombination accumulated through crop evolution was overall not sufficiently high (or the time elapsed was too low) to break up the cytonuclear and marker-trait LD. The presence of LD in the whole dataset is also indicated by the correlation between the divergence of the group for SSR markers (FST) and the divergence for the quantitative traits (QST_B) and the qualitative traits (FST_qlt). Within the groups, in addition to LD, we observed negative FIS, which indicates an excess of heterozygotes compared to the Hardy-Weinberg expectation. Detecting LD is not sufficient to call for clonal reproduction (de Meeûs and Balloux, 2004; Halkett et al., 2005). However, negative FIS is the ultimate signature of clonal diploid populations (Halkett et al., 2005). These results are consistent with the expected structure of genetic diversity of a collection of artichoke cultivars where clonal propagation is a common practice; in this species, propagation is carried out with “carducci” (basal shoots) or “ovoli” (semi-dormant shoots with a limited root system). Furthermore, our study also shows that some groups have a higher excess of heterozygotes than others and this suggests that different breeding histories and various degrees of historical “clonality” characterize the evolution of the five identified genetic groups. Albeit selfing is not precluded, cross-fertilization in globe artichoke is promoted by protandry (Mauromicale and Ierna, 2000). In a recent re-sequencing study Acquadro et al. (2017) documented the significant hetorozigosity of some cultivated clonal varieties. Consistent with this finding, Basnitzki and Zohary (1987) showed that the (self) seed progenies of clonal varieties showed wide segregation of morphological and production traits. Thus, it can be hypothesized that the excess of heterozygotes at SSR loci might reflect the selection of plants with superior hybrid vigor because of the heterozygote advantage (overdominance). However, genomic surveys have suggested that loci with heterozygote advantage must be considered only a small minority of all loci in a species (Hedrick, 2012). Furthermore, complete clonality is comparable to a long-term population bottleneck with a population size of one, which implies that heterozygote excess is also expected if the selection is not acting (Balloux et al., 2003). We also observed that the groups with a strong excess of heterozygotes (and strong LD, low HE, low h2B) were shorter and produced fewer and smaller heads (i.e., were less vigorous) than groups with low heterozygotes (and lower LD, higher HE, higher h2B). This suggests that selection against deleterious mutations might be less effective in groups with high clonality (and high heterozygote excess) than in those with low clonality (with low or null heterozygote excess). Indeed, when there is no recombination, selection against deleterious mutations is less effective, and deleterious mutations have the potential to accumulate (i.e., the mutation load) (McKey et al., 2010; Gaut et al., 2015). The accumulation of mutations in a purely clonal line ultimately leads to lower fitness, which is associated with lower agronomic performance (McKey et al., 2010). However, in globe artichoke, this remains to be demonstrated with a dedicated study.

Comparison Between Molecular and Phenotypic Population Structures Indicates a Role for Divergent Selection

Under neutral evolution (i.e., absence of selection), QST = FST is expected, such that the proportion of total variance allocated among populations (or genetic groups, in this case) should be equal or similar for both quantitative traits and molecular markers (Leinonen et al., 2013). Therefore, assuming that FST has been estimated considering putatively “neutral” SSR loci, the inequality QST > FST indicates divergent selection among the populations. Indeed, this suggests that for a given quantitative trait, populations (or genetic groups) show different mean values not only because of random demographic processes (e.g., genetic drift, migration, mutation), but also due to the selection that exerts differential pressure among the populations. On the contrary, the inequality QST < FST suggests uniform selection across populations: populations tend to have the same trait means because the selection has the same direction in all of the populations. Following this rationale, we compared the FST and QST_B obtained among the genetic groups of the globe artichokes identified by the analysis of the population structure. This comparison suggests that some quantitative traits have an “excess” of divergence between groups (QST_B) compared to the putatively neutral markers (FST) (QST_B > FST). For these traits, divergent selection, rather than demographic factors, might be responsible for the phenotypic differences observed between the genetic groups. These traits are harvest time, plant height, head shape, and stem length. Among the qualitative traits, divergent selection can be suggested for leaf spininess, spine length, apex shape, and reflexing of the tip of the outer bracts as they showed an FST_qlt that excedeed the “neutral” FST. In general, it is possible that these traits were under the farmer’s attention when selecting plants for clonal propagation, as they strongly condition both agronomic practice (e.g., harvest time) and consumer preference (e.g., stem length, head shape, spininess). However, caution should be exercised when interpreting these data. Indeed, deviations from the Hardy-Weinberg equilibrium might affect the QST–FST comparison. Inbreeding (FIS > 0) can lead to QST < FST under neutrality (instead of QST = FST), which results in low power when detecting divergent selections. On the contrary, when, as in the present case, there is an excess of heterozygotes (FIS < 0), it is possible that under neutrality QST > FST (instead of QST = FST), which leads to a higher risk of false positives (Cubry et al., 2017). As expected, when the QST is calculated considering our negative neutral FIS estimate, the QST values are reduced. However, the traits “plant height” and “head shape” remain statistically significant. For the trait “plant height,” QST > FST is often observed in both trees (Yang et al., 1996) and herbaceous plants (Waldmann and Andersson, 1998; Kawakami et al., 2011; Eriksen et al., 2012; Michalski et al., 2017). Interestingly, very high QSTs were observed for plant height, flowering date, and capitulum size (diameter) in the sunflower species Helianthus maximilian, which like C. cardunculus var. scolymus belongs to the Asteraceae family, is perennial and is diploid (Kawakami et al., 2011). Variations in these traits are relevant to local adaptation to spatially heterogeneous environments. Furthermore, as might also be suggested for our study, Kawakami et al. (2011) noted that plant height and flowering time might have both been found under divergent selection because of the typical life-history trade-off between growth and reproductive timing (Obeso, 2002). The existence of this trade-off in cultivated artichoke was suggested by Crinò et al. (2008). Alternatively, plant height and flowering time might at least partially share the same genetic control, as has been suggested in other plant species (Tondelli et al., 2014; Bustos-Korts et al., 2019). The QST–FST comparison assumes that the genetic control of the quantitative traits is additive (Spitze, 1993). Non-additive (dominance or epistasis) and maternal effects might have different weights for different traits and might confound any inference (Leinonen et al., 2013). It is possible that traits with simple genetic control, a strong dominance variance component, and under certain demographic models show QST inflated over the neutral expectation, mimicking divergent selection (López-Fanjul et al., 2003, 2007). However, such inflation is unlikely for traits with polygenic control (Goudet and Martin, 2007), and, in general, with non-additive inheritance, the QST–FST approach to look for diversifying selection would be conservative (Whitlock, 1999, 2008; Cubry et al., 2017). Furthermore, the QST–FST approach can be suggested as an exploratory tool (Leinonen et al., 2008; Whitlock, 2008), particularly when detailed information on phenotypic traits is not available. In this case, as Whitlock (2008) noted, “the difficult statistical properties of QST become less important, and if the QST of several traits is measured, then the traits with the highest QST values may be good candidates for further study of selection.” In other words, non-additive and maternal effects probably did not significantly alter the conclusions of this study, and plant height and head shape followed by harvest time, stem length, and the number of heads are good candidates for having been subjected to divergent selection during globe artichoke evolution. Another observation indicates a role for divergent selection among the genetic groups. Indeed, we found that although there is a substantial amount of variation among the traits for QST_B, much of this variation appears to be associated with variation in h2B. Under neutrality, QST does not correlate with h2B (Lande, 1992). In contrast, the response to directional selection on a quantitative trait (and hence the QST_B between populations), ceteris paribus, is proportional to h2B (Falconer and Mackay, 1996). Therefore, the positive correlation between QST and h2B observed in this study is qualitatively consistent with the hypothesis that selection contributed to the phenotypic differences observed between the groups. In previous studies on Pinus contorta (Yang et al., 1996), Medicago truncatula (Bonnin et al., 1996), and Clarkia dudleyana (Podolsky and Holtsford, 1995), the correlation between QST and h2B was also positive; a similar situation has also been observed in animals, such as in the microcrustacean Daphnia pulex (Lynch et al., 1999).

Conclusion

This study shows that a globe artichoke collection that comprises most of the varieties cultivated worldwide has a clonal population structure and is moderately structured. The data consistently show that five genetic groups with different degrees of “historical” clonality characterize the domesticated gene pool of the globe artichoke. Clonal propagation was probably accompanied by clonal selection for a set of different phenotypic characteristics. The traits that affect phenology (harvest time), plant architecture (plant height, stem length), head shape, and spininess (of both leaves and bracts) are suggested as the main targets of selection. The overall covariance between the molecular and phenotypic population structures indicates that genome-wide association mapping strategies must deal with the identified population structure to minimize the risk of false positives.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

DR, GA, MR, and EP conceived and designed the experiments. LB, AP, and DS performed the field trial. LB, MR, and GA carried out field data curation. MR carried out DNA extraction. AA and CC performed SSR analysis. DR performed statistical analyzes. DR, EP, and CC wrote the manuscript. All authors contributed to the discussion, revised, and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  78 in total

1.  Clonal reproduction and linkage disequilibrium in diploids: a simulation study.

Authors:  Thierry de Meeûs; François Balloux
Journal:  Infect Genet Evol       Date:  2004-12       Impact factor: 3.342

2.  Phylogenetic signal, evolutionary process, and rate.

Authors:  Liam J Revell; Luke J Harmon; David C Collar
Journal:  Syst Biol       Date:  2008-08       Impact factor: 15.683

Review 3.  What is the evidence for heterozygote advantage selection?

Authors:  Philip W Hedrick
Journal:  Trends Ecol Evol       Date:  2012-09-10       Impact factor: 17.712

Review 4.  Q(ST)-F(ST) comparisons: evolutionary and ecological insights from genomic heterogeneity.

Authors:  Tuomas Leinonen; R J Scott McCairns; Robert B O'Hara; Juha Merilä
Journal:  Nat Rev Genet       Date:  2013-02-05       Impact factor: 53.242

5.  Genetic structure and linkage disequilibrium in landrace populations of barley in Sardinia.

Authors:  Monica Rodriguez; Domenico Rau; Donal O'Sullivan; Anthony H D Brown; Roberto Papa; Giovanna Attene
Journal:  Theor Appl Genet       Date:  2012-03-13       Impact factor: 5.699

6.  Spatial genetic structure in wild cardoon, the ancestor of cultivated globe artichoke: Limited gene flow, fragmentation and population history.

Authors:  D Rau; M Rodriguez; E Rapposelli; M L Murgia; R Papa; A H D Brown; G Attene
Journal:  Plant Sci       Date:  2016-09-29       Impact factor: 4.729

7.  The genome sequence of the outbreeding globe artichoke constructed de novo incorporating a phase-aware low-pass sequencing strategy of F1 progeny.

Authors:  Davide Scaglione; Sebastian Reyes-Chin-Wo; Alberto Acquadro; Lutz Froenicke; Ezio Portis; Christopher Beitel; Matteo Tirone; Rosario Mauro; Antonino Lo Monaco; Giovanni Mauromicale; Primetta Faccioli; Luigi Cattivelli; Loren Rieseberg; Richard Michelmore; Sergio Lanteri
Journal:  Sci Rep       Date:  2016-01-20       Impact factor: 4.379

8.  Genome-Wide Identification of BAHD Acyltransferases and In vivo Characterization of HQT-like Enzymes Involved in Caffeoylquinic Acid Synthesis in Globe Artichoke.

Authors:  Andrea Moglia; Alberto Acquadro; Kaouthar Eljounaidi; Anna M Milani; Cecilia Cagliero; Patrizia Rubiolo; Andrea Genre; Katarina Cankar; Jules Beekwilder; Cinzia Comino
Journal:  Front Plant Sci       Date:  2016-09-23       Impact factor: 5.753

9.  Chloroplast Microsatellite Diversity in Phaseolus vulgaris.

Authors:  F Desiderio; E Bitocchi; E Bellucci; D Rau; M Rodriguez; G Attene; R Papa; L Nanni
Journal:  Front Plant Sci       Date:  2013-01-22       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.