| Literature DB >> 29651296 |
Heena Ambreen1, Shivendra Kumar1, Amar Kumar1, Manu Agarwal1, Arun Jagannath1, Shailendra Goel1.
Abstract
Carthamus tinctorius L. (safflower) is an important oilseed crop producing seed oil rich in unsaturated fatty acids. Scarcity of identified marker-trait associations is a major limitation toward development of successful marker-assisted breeding programs in safflower. In the present study, a safflower panel (CartAP) comprising 124 accessions derived from two core collections was assayed for its suitability for association mapping. Genotyping of CartAP using microsatellite markers revealed significant genetic diversity indicated by Shannon information index (H = 0.7537) and Nei's expected heterozygosity (I = 0.4432). In Principal Coordinate Analysis, the CartAP accessions were distributed homogeneously in all quadrants indicating their diverse nature. Distance-based Neighbor Joining analysis did not delineate the CartAP accessions in consonance with their geographical origin. Bayesian analysis of population structure of CartAP demonstrated the unstructured nature of the association panel. Kinship analysis at population (Gij ) and individual level (Fij ) revealed absence of or weak relatedness between the CartAP accessions. The above parameters established the suitability of CartAP for association mapping. We performed association mapping using phenotypic data for eight traits of agronomic value (viz., seed oil content, oleic acid, linoleic acid, plant height, number of primary branches, number of capitula per plant, 100-seed weight and days to 50% flowering) available for two growing seasons (2011-2012 and 2012-2013) through General Linear Model and Mixed Linear Model. Our study identified ninety-six significant marker-trait associations (MTAs; P < 0.05) of which, several MTAs with correlation coefficient (R2) > 10% were consistently represented in both models and in both seasons for traits viz., oil content, oleic acid content, linoleic acid content and number of primary branches. Several MTAs with high R2-values were detected either in a majority or in some environments (models and/or seasons). Many MTAs were also common between traits (viz., oleic/linoleic acid content; plant height/days to 50% flowering; number of primary branches/number of capitula per plant) that showed positive or negative correlation in their phenotypic values. The marker-trait associations identified in this study will facilitate marker-assisted breeding and identification of genetic determinants of trait variability.Entities:
Keywords: SSR markers; association mapping; core collections; kinship analysis; population structure; safflower
Year: 2018 PMID: 29651296 PMCID: PMC5885069 DOI: 10.3389/fpls.2018.00402
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary statistics for genetic variability at 93 SSR loci.
| Number of accessions sampled | 124 |
| Total number of alleles | 311 |
| Average number of alleles per locus | 3.34 |
| Range of alleles | 2–8 |
| SSR loci with highest number of alleles | NGSaf_265 and NGSaf_282 (8 alleles) |
| Average | 0.112 |
| SSR locus with highest | NGSaf_294 (0.958) |
| SSR locus with lowest | NGSaf_14, NGSaf_45, NGSaf_63, NGSaf_98, NGSaf_114, NGSaf_117, NGSaf_130, NGSaf_145, NGSaf_151, NGSaf_154, NGSaf_173, NGSaf_248 (0) |
| Average | 0.438 |
| SSR locus with highest | NGSaf_281 (0.76) |
| SSR locus with lowest | NGSaf_117 (0.016) |
| Average PIC-value | 0.38 |
| SSR locus with highest PIC-value | NGSaf_281 (0.73) |
| SSR locus with lowest PIC-value | NGSaf_117 (0.02) |
He, gene diversity or expected heterozygosity; Ho, observed heterozygosity; PIC, polymorphic information content,
Values for each parameter are provided in parenthesis.
Genetic diversity indices of CartAP accessions based on their regional gene pool distribution.
| Far East | 15 | 84 | 90.3 | 0.6029 | 0.3782 |
| Indian subcontinent | 32 | 91 | 97.8 | 0.7028 | 0.4265 |
| Iran-Afghanistan | 12 | 85 | 91.4 | 0.6016 | 0.3596 |
| Egypt | 5 | 72 | 77.4 | 0.5289 | 0.3449 |
| Europe | 9 | 84 | 90.3 | 0.5966 | 0.3818 |
| Near East | 6 | 60 | 64.5 | 0.4158 | 0.2595 |
| Turkey | 5 | 60 | 64.5 | 0.4650 | 0.2981 |
| America | 30 | 90 | 96.8 | 0.7211 | 0.4227 |
N, Number of accessions; P.
Figure 1(A) Neighbor Joining dendrogram elucidating the genetic relationships between 124 CartAP accessions using 93 SSR loci based on simple matching coefficient. NJ clusters (NJ I–NJ III) are demarcated through dashed lines. (B) Scatter plot for Principal Coordinate Analysis (PCoA) exhibiting the distribution of CartAP accessions on two main axes. Primary axes 1 and 2 captured 29 and 25% of the total variance, respectively. Color codes used for different regional pools are provided.
Figure 2Population structure of CartAP collection inferred using Bayesian-based program STRUCTURE. (A) Estimation of hypothetical sub-populations using ΔK-values. The number of identified sub-populations were two (K = 2). (B) Population structure analysis of CartAP accessions at K = 2 based on inferred ancestry (Q matrix). The sub-populations were named STR I and STR II.
Figure 3Hierarchical population structure analysis (A) Estimation of sub-population for STR I. Maximum number of sub-populations were inferred at K = 2. (B) Population structure for STR I at K = 2 based on Q matrix. The sub-populations were designated as STR I(a) and STR I(b) (C) Estimation of sub-populations for STR II. Maximum number of subpopulations were inferred at K = 5. (D) Population structure for STR II at K = 5 based on Q matrix. The sub-populations were named as STR II(a)–STR II(e). Color codes for each sub-population are provided.
Distribution of CartAP accessions in different sub-populations derived from STRUCTURE analysis.
| Far East (FE) | 15 | 3 | – | 2 | 5 | – | – | 1 | 4 |
| Indian subcontinent (IS) | 32 | 5 | 12 | 2 | 1 | 8 | 1 | 3 | 4 |
| Iran-Afghanistan (IA) | 12 | 9 | – | – | – | – | – | 3 | 1 |
| Near East (NE) | 6 | 2 | 1 | – | – | 1 | – | 2 | – |
| Turkey (TU) | 5 | 3 | – | – | – | – | – | 1 | 1 |
| Egypt (EG) | 5 | 1 | – | 2 | 2 | – | – | – | – |
| Sudan (SU) | 1 | 1 | – | – | – | – | – | – | – |
| Kenya (KE) | 1 | – | – | – | – | 1 | – | – | – |
| Ethiopia (ET) | 1 | - | – | – | – | 1 | – | – | – |
| Europe (EU) | 9 | 2 | – | 2 | 1 | – | 3 | 1 | – |
| America (US) | 30 | 6 | – | 4 | 9 | 1 | 1 | 3 | 6 |
| Australia (AUS) | 1 | – | – | – | – | 1 | – | – | – |
| Unknown origin (UN) | 1 | – | 1 | – | – | – | – | – | – |
| Total | 124 | 32 | 14 | 12 | 18 | 13 | 5 | 14 | 16 |
Expected heterozygosity of the sub-populations derived through STRUCTURE analysis.
| STR I | 0.469 |
| STR I(a) | 0.4352 |
| STR I(b) | 0.2838 |
| STR II | 0.4217 |
| STR II(a) | 0.2230 |
| STR II(b) | 0.3475 |
| STR II(c) | 0.2669 |
| STR II(d) | 0.1726 |
| STR II(e) | 0.2845 |
Figure 4Matrix showing pairwise FST-values between sub-populations inferred through hierarchical STRUCTURE analysis. Color codes are provided for each sub-population. *Denotes FST-values between same sub-population.
Analysis of molecular variance (AMOVA) between and within regional gene pools and between and within sub-populations derived through hierarchical STRUCTURE analysis.
| Between populations | 1.413 | 7 | 0.001 |
| Within populations | 19.024 | 93 | 0.001 |
| Total | 20.437 | 100 | |
| Between populations | 3.367 | 16 | 0.001 |
| Within populations | 17.524 | 84 | 0.001 |
| Total | 20.891 | 100 | |
Estimated variance.
With 999 data permutations.
Figure 5Kinship estimates at individual and population level. (A) Frequency distribution for global pairwise kinship estimates (F) for CartAP accessions. (B) Matrix showing pairwise G-values between sub-populations inferred through hierarchical STRUCTURE analysis. Color codes are provided for each sub-population. *Denotes G-values values between same sub-population.
Mean and range of different phenotypic traits for CartAP accessions.
| Oil content (%) | 32 | 16–50 |
| Oleic acid (%) | 27 | 10–79 |
| Linoleic acid (%) | 65 | 13–87 |
| 100-seed weight (gm) | 4 | 1–8 |
| Plant height (cm) | 156 | 94–226 |
| Number of capitula per plant | 82 | 16–203 |
| Number of primary branches | 16 | 6–33 |
| Days to 50% flowering | 141 | 119–160 |
Marker-trait associations identified through General linear model (GLM) and Mixed linear model (MLM) using phenotypic data of season 2011–2012 and 2012–2013.
| NGSaf_15 | 3 | 22.3 | 1.363E-5 | 10.4 | 0.0436 | 23.4 | 6.85E-06 | 16 | 0.004 | |
| NGSaf_148 | 7 | – | – | – | – | 11.5 | 0.02 | – | – | |
| NGSaf_201 | 11 | 12.9 | 0.011 | – | – | – | – | – | – | |
| NGSaf_255 | 1 | 9.7 | 0.033 | – | – | 16 | 0.001 | – | – | |
| NGSaf_300 | 12 | 16.7 | 6.026E-4 | 16.2 | 0.016 | 17.9 | 2.99E-04 | 13.8 | 0.042 | |
| NGSaf_67 | 7 | 11.4 | .005 | 11.4 | 0.017 | 11.6 | 0.004 | 11.6 | 0.016 | |
| NGSaf_210 | 4 | – | – | 14.3 | 0.031 | – | – | 14.6 | 0.028 | |
| NGSaf_309 | 5 | – | – | 34.1 | .002 | – | – | 34 | 0.002 | |
| NGSaf_67 | 7 | 10.9 | 0.006 | 10.9 | 0.021 | – | – | 16.7 | 0.039 | |
| NGSaf_83 | 3 | – | – | – | – | 12.4 | 0.014 | – | – | |
| NGSaf_155 | 10 | – | – | – | – | 19.1 | 0.041 | – | – | |
| NGSaf_210 | 4 | 14.3 | 0.013 | 14.4 | 0.031 | 14 | 0.007 | 17.9 | 0.004 | |
| NGSaf_309 | 5 | – | – | 34.1 | 0.002 | – | – | – | – | |
| NGSaf_101 | 4 | – | – | – | – | 8.9 | 0.006 | 15.3 | 0.034 | |
| NGSaf_306 | 6 | 14.7 | 0.031 | 24.52 | 0.011 | 13 | 0.034 | – | – | |
| NGSaf_309 | 5 | 24.1 | 4.2291E-4 | 22.9 | 0.044 | 15.5 | 0.022 | – | – | |
| NGSaf_23 | 6 | 10 | 0.009 | – | – | – | – | – | – | |
| NGSaf_83 | 3 | 12.5 | 0.013 | – | – | – | – | – | – | |
| NGSaf_101 | 4 | 14.1 | 0.019 | – | – | 12.2 | 0.049 | – | – | |
| NGSaf_156 | 6 | 11.8 | 0.023 | – | – | 13.7 | 0.01 | 13.9 | 0.033 | |
| NGSaf_173 | 5 | 10.6 | 3.84E-4 | 4.4 | 0.027 | – | – | |||
| NGSaf_296 | 6 | 14.7 | 0.007 | – | – | 12.7 | 0.021 | 13.1 | 0.018 | |
| NGSaf_279 | 6 | 7.4 | 0.037 | 9.3 | 0.037 | 8.1 | 0.024 | 8.5 | 0.051 | |
| NGSaf_309 | 5 | 15.4 | 0.023 | 25.7 | 0.023 | – | – | – | – | |
| NGSaf_15 | 3 | – | – | – | – | 11.8 | 0.008 | 12.4 | 0.021 | |
| NGSaf_83 | 3 | 13.3 | 0.009 | – | – | 10.5 | 0.037 | 10.5 | 0.047 | |
| NGSaf_279 | 6 | 8.9 | 0.015 | 10.8 | 0.018 | 8 | 0.024 | 8.9 | 0.043 | |
| NGSaf_92 | 9 | – | – | 11.4 | 0.014 | 7.9 | 0.026 | 11.6 | 0.014 | |
| NGSaf_101 | 4 | 12.6 | 0.037 | – | – | – | – | – | – | |
| NGSaf_201 | 11 | 11.9 | 0.017 | 11.4 | 0.014 | 11.8 | 0.019 | – | – | |
| NGSaf_255 | 1 | 10.7 | 0.021 | – | – | – | – | – | – | |
| NGSaf_296 | 6 | – | – | – | – | 11.1 | 0.043 | – | – | |
Linkage group; R.
Comparative account of SSR marker statistics of the present and previous studies in safflower.
| Number of accessions sampled | 124 | 27 | 10 | 48 | 100 | 42 |
| Number of polymorphic SSR markers | 93 | 104 | 64 | 42 | 30 | 33 |
| Allelic range | 2–8 | 2–15 | 2–8 | 2–8 | 2–7 | 2–8 |
| Average number of alleles per locus | 3.3 | 6 | 3.2 | 3.4 | 2.8 | 3.8 |
| Mean expected heterozygosity ( | 0.438 | 0.54 | 0.52 | 0.37 | 0.386 | |
| PIC | 0.38 | 0.32 | 0.32 | 0.325 | 0.3 |
Not available.
Figure 6Demarcation of two major STRUCTURE populations (STR I and STR II) and seven sub-populations obtained by further hierarchical structure analysis on (A) PCoA scatter plot. (B) Neighbor Joining dendrogram. Color codes are provided for each sub-population.