| Literature DB >> 26635822 |
Deepak Bajaj1, Shouvik Das1, Hari D Upadhyaya2, Rajeev Ranjan1, Saurabh Badoni1, Vinod Kumar3, Shailesh Tripathi4, C L Laxmipathi Gowda2, Shivali Sharma2, Sube Singh2, Akhilesh K Tyagi1, Swarup K Parida1.
Abstract
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea.Entities:
Keywords: GBS; GWAS; QTL; SNP; chickpea; haplotype; seed coat color; wild accession
Year: 2015 PMID: 26635822 PMCID: PMC4647070 DOI: 10.3389/fpls.2015.00979
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Twenty-eight seed color-related known/candidate gene orthologs of chickpea selected for genetic association mapping.
| Soybean | Yang et al., | Ca05336 | 3793 | 12 | 3.16 | |||
| Soybean | Ca02395 | 2179 | 6 | 2.75 | ||||
| Soybean | Ca10786 | 5497 | 15 | 2.73 | ||||
| Soybean | Ca10787 | 4719 | 13 | 2.75 | ||||
| Soybean | Ca11427 | 1816 | 5 | 2.75 | ||||
| Soybean | Ca09972 | 3822 | 13 | 3.40 | ||||
| Soybean | Ca09973 | 3872 | 10 | 2.58 | ||||
| Wang et al., | Ca10786 | 5497 | 20 | 3.64 | ||||
| Wroblewski et al., | Ca08294 | 3010 | 11 | 3.65 | ||||
| Pourcel et al., | Ca03250 | 11604 | 37 | 3.19 | ||||
| Routaboul et al., | Ca13196 | 1443 | 4 | 2.77 | ||||
| Zhang et al., | Ca23340 | 3106 | 10 | 3.22 | ||||
| Debeaujon et al., | Ca10158 | 3676 | 9 | 2.45 | ||||
| Focks et al., | Ca04376 | 6919 | 21 | 3.04 | ||||
| Liu et al., | Ca02395 | 2179 | 4 | 1.84 | ||||
| Xu et al., | Ca03442 | 1903 | 3 | 1.58 | ||||
| Bowerman et al., | Ca05336 | 3793 | 9 | 2.37 | ||||
| Baxter et al., | Ca01370 | 10262 | 35 | 3.41 | ||||
| Sagasser et al., | Ca27078 | 2324 | 5 | 2.15 | ||||
| Chen et al., | Ca12762 | 2226 | 5 | 2.25 | ||||
| Chen et al., | Ca14089 | 2283 | 6 | 2.63 | ||||
| Erdmann et al., | Ca15760 | 3027 | 8 | 2.64 | ||||
| Nguyen et al., | Ca04058 | 2028 | 4 | 1.97 | ||||
| Pesch et al., | Ca17273 | 4380 | 16 | 3.65 | ||||
| Ichino et al., | Ca22116 | 16647 | 47 | 2.82 | ||||
| MATE secondary transporter (TRANSPARENT TESTA) | Zhao and Dixon, | Ca05556 | 3936 | 10 | 2.54 | |||
| MATE secondary transporter (TRANSPARENT TESTA) | Ca05557 | 5315 | 19 | 3.57 | ||||
| MATE secondary transporter (TRANSPARENT TESTA) | Ca05558 | 4540 | 15 | 3.30 | ||||
Tandem array of three seed color candidate co-orthologous tt12-MATE family transporter repeated genes located at the selective sweep target region of kabuli chromosome 4 reported by Varshney et al. (.
Figure 1(A) Frequency distribution of 9045 SNPs discovered employing reference kabuli genome (eight chromosomes and scaffolds)-based GBS and candidate gene-derived SNP genotyping assays. (B) Structural annotation of SNPs in the diverse coding (synonymous and non-synonymous) and non-coding (intron, URR, and DRR) sequence components of genes and intergenic regions of kabuli genome. The gene annotation information of kabuli genome (Varshney et al., 2013) was considered as reference to infer CDS (coding sequences)/exons, URR (upstream regulatory region) and DRR (downstream regulatory region) sequence components of genes. (C) A Circos circular ideogram depicting the relative distribution of 9045 SNPs, including non-synonymous SNPs (marked with red dots) physically mapped on eight kabuli chromosomes. The outermost circles illustrate the eight kabuli chromosomes coded with different colors, while the two inner circles “a” and “b” represent the distribution of SNPs mined from 93 cultivated (desi and kabuli) and 79 wild chickpea accessions, respectively.
Figure 2(A) Unrooted phylogenetic tree (Nei's genetic distance), (B) Population genetic structure (optimal population number K = 2 with two diverse color) and (C) Principal component analysis (PCA) using 8837 genome-wide GBS- and candidate gene-based SNPs assigned 93 BE (beige) and LB/YB (light/yellow brown) seed coat color representing kabuli and desi chickpea accessions mostly into two major populations-POP I and POP II, respectively. In population structure, the accessions represented by vertical bars along the horizontal axis were classified into K color segments based on their estimated membership fraction in each K cluster. In PCA, the PC1 and PC2 explained 7.1 and 17.4% of the total variance, respectively. (D) LD decay (mean r2) measured in a population of 93 cultivated desi and kabuli chickpea accessions using 7488 SNPs physically mapped on eight kabuli chromosomes. The plotted curved line denotes the mean r2-values among SNP loci spaced with uniform 50 kb physical intervals from 0 to 1000 kb across chromosomes. The plotted line in uppermost indicates the mean r2-values among SNPs spaced with uniform 20 kb physical intervals from 0 to 200 kb on chromosomes.
Figure 3(A) GWAS-derived Manhattan plot showing significant P-values (measured integrating GLM, MLM, EMMA, and CMLM) associated with seed coat color trait using 9045 genome-wide GBS- and candidate gene-based SNPs. The x-axis represents the relative density of reference genome-based SNPs physically mapped on eight chromosomes and scaffolds of kabuli genome. The y-axis indicates the −log10(P)-value for significant association of SNPs with seed color trait. The SNPs showing significant association with seed color at cut-off P ≤ 1 × 10−6 are marked with dotted lines. (B) Quantile-quantile plots illustrating the comparison between expected and observed −log10(P)-values with FDR cut-off < 0.05 to detect significant genomic loci (genes) associated with seed color trait in chickpea.
Fifteen seed coat color-regulating genomic loci (genes) identified by GWAS and candidate gene-based association mapping.
| CWSNP151 | 1 | 1858509 | (C/T) | Ca00233 (Inositol monophosphate) | CDS (Syn) | 2.2 × 10−7 | 25 |
| CWSNP156 | 1 | 1859914 | (G/A) | Ca00233 (Inositol monophosphate) | Intron | 2.0 × 10−7 | 27 |
| CWSNP208 | 1 | 2053788 | (A/G) | Ca00260 (Disease resistance protein) | CDS (NSyn) Threonine (ACG)-Alanine (GCG) | 1.8 × 10−7 | 22 |
| CWSNP1275 | 2 | 7701199 | (T/G) | Ca18123 ( | CDS (NSyn) Valine (GTG)-Glycine (GGG) | 1.7 × 10−9 | 43 |
| gSNP1 | 2 | 33782378 | (C/A) | Ca10158 ( | CDS (NSyn) Alanine (GCT)-Aspartic acid (GAT) | 1.2 × 10−7 | 20 |
| gSNP2 | 3 | 26031490 | (A/G) | Ca08294 ( | URR | 1.5 × 10−7 | 33 |
| gSNP5 | 4 | 48021943 | (G/T) | Ca10786 ( | CDS (NSyn) Glycine (GGG)-Valine (GTG) | 1.7 × 10−7 | 29 |
| gSNP3 | 4 | 7931791 | (T/C) | Ca03442 ( | URR | 1.4 × 10−7 | 30 |
| gSNP4 | 4 | 15386163 | (T/G) | Ca05557 ( | CDS (NSyn) Serine (TCA)-Alanine (GCA) | 1.3 × 10−6 | 24 |
| gSNP6 | 6 | 15514534 | (C/G) | Ca05336 (ban-ANR-Anthocyanin reductase) | URR | 2.1 × 10−8 | 35 |
| CWSNP5434 | 6 | 28325392 | (C/T) | Ca25968 (Ribonuclease T2) | CDS (NSyn) Serine (TCA)-Leucine (TTA) | 2.4 × 10−7 | 27 |
| CWSNP5762 | 6 | 56793021 | (G/T) | – | Intergenic | 1.5 × 10−7 | 21 |
| CWSNP6261 | 7 | 14281993 | (G/T) | Ca23042 (domain of unknown function DUF231) | Intron | 2.1 × 10−7 | 38 |
| gSNP7 | 8 | 2062481 | (G/A) | Ca02395 ( | URR | 2.1 × 10−6 | 26 |
| CWSNP8591 | Scaffold93 | 645442 | (A/C) | Ca23085 (Integrase) | URR | 1.9 × 10−7 | 23 |
CWSNP (cultivated wild SNP) and gSNP (gene-derived SNPs).
CDS, coding sequence; NSyn, non-synonymous; Syn, synonymous; URR, upstream regulatory region.
Details regarding SNPs are mentioned in the Table .
gSNP: gene-derived SNPs associated with seed coat color trait identified by candidate gene-based association mapping.
Figure 4The gene haplotype-based LD/association mapping, expression profiling and PA content estimation in a strong seed coat color-associated MATE secondary transporter . The genotyping of four SNP (one non-synonymous SNPs) and one SSR markers in different coding and non-coding sequence components of gene (A) among 172 cultivated and wild chickpea accessions constituted four haplotypes (B). The haplotypes I: [C-(GTTG)3-T-T-A] and II: [C-(GTTG)3-G-T-A] discriminated by one non-synonymous SNP (T/G at 1708 bp, yellow shaded) in the CDS of gene showed strong association potential for BE and LB/YB seed coat color differentiation, respectively. The four haplotype marker-based genotyping information imparted higher LD estimate covering the entire 6843 bp sequenced region of the gene (C). The columns below the diagonal denote the correlation frequency (r2) among a pair of four different haplotypes constituted in a gene, whereas columns above the diagonal indicate the P-value significance (P < 0.01) of LD estimates (r2) for these haplotype combinations at 1000 permutation. (D) The differential expression of BE and YB/LB seed color-associated MATE gene haplotypes (I and II) in the mature seed coats of accessions, mapping parents and homozygous RIL individuals representing BE and LB/YB seed colored haplotype groups. The elongation factor-1 alpha gene was utilized as an internal control in the RT-PCR assay. (E) The soluble and insoluble PA contents (nmol/mg seeds) estimated in the mature seed coats of desi and kabuli chickpea accessions, mapping parents and homozygous RIL individuals representing BE and LB/YB seed colored haplotypes. Each bars represent the mean (± standard error) of three independent biological replicates along with two technical replicates for each sample used in RT-PCR and PA estimation assays. *Significant differences in expression of gene haplotypes in mature seeds of LB/YB and BE seed colored accessions as compared to their leaf and seed (without seed coats) tissues at p < 0.001 (LSD-ANOVA significance). *Significant variation in PA contents in the matured seeds of LB/YB seed colored accessions as compared to BE seed coat colored accessions at p < 0.001 (ANOVA significance). YBH (yellow brown homozygous) and BEH (beige homozygous) RIL mapping individuals.