Literature DB >> 25942004

Ultra-high density intra-specific genetic linkage maps accelerate identification of functionally relevant molecular tags governing important agronomic traits in chickpea.

Alice Kujur1, Hari D Upadhyaya2, Tanima Shree1, Deepak Bajaj1, Shouvik Das1, Maneesha S Saxena1, Saurabh Badoni1, Vinod Kumar3, Shailesh Tripathi4, C L L Gowda2, Shivali Sharma2, Sube Singh2, Akhilesh K Tyagi1, Swarup K Parida1.   

Abstract

We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20-0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9-39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25942004      PMCID: PMC5386344          DOI: 10.1038/srep09468

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Chickpea (Cicer arietinum L.), the second most grown legume crop world-wide is an annual self-pollinated and diploid (2n = 2x = 16) pulse species with a genome size of ~740 Mb12. A member of Fabaceae family, this diploid food legume is a major source of human dietary protein packed with essential amino acids1. Chickpea is broadly categorized into two cultivars- Kabuli and Desi- types basing upon their plant characteristics and diverse gene pools-based geographical distribution. They are known to enrich the soil nutritional status and fertility by symbiotic nitrogen fixation. The seed and pod traits are considered the most prominent characteristics of chickpea defining its economic value as a diet for human race. Consequently, they draw major interest of researchers for yield and product quality improvement towards generation of high-yielding genetically tailored chickpea cultivars via genomics-assisted breeding3. A large genome with a narrow genetic base, chickpea requires numerous informative and genome-wide well-distributed SNP (single nucleotide polymorphism) markers for construction of ultra-high density genetic linkage map (≤1 cM average map-density45) in order to identify and fine-map the useful trait-influencing candidate genes/QTLs (quantitative trait loci) for marker-assisted genetic enhancement. A number of high-resolution inter-specific genetic linkage maps (with mean map-density: 0.59–1.7 cM) have been generated in chickpea by genotyping of large-scale SNP markers in its diverse mapping populations678910. Few recent efforts have also been made to construct SNP marker-based high-density genetic linkage maps (with map density: 1.74–3.68 cM) by utilizing diverse advanced generation desi and kabuli intra-specific mapping populations11121314. Indeed, all these studies have utilized high-throughput genotyping assays like Illumina GoldenGate/Infinium and Competitive Allele Specific PCR (KASPar) for large-scale validation and genotyping of SNP markers (~1000 SNPs till now) in intra- and inter-specific chickpea mapping populations to construct high-density genetic linkage maps. Aside chickpea89, similar efforts have also been undertaken to construct comprehensive high-density genetic linkage map (average inter-marker distance ≤1 cM) by deploying aforesaid strategies in diverse legume species (soybean, Medicago, pigeonpea, common bean and mung bean)1516171819202122232425. However, the need of prior information regarding SNP loci and their high-quality flanking sequence regions for genotyping of SNPs largely limits the use of these assays in both discovery and validation of SNPs at genome-wide scale for constructing efficient high-resolution genetic linkage maps in chickpea. It thus implicates the necessity of a suitable modern and fast advanced approach/assay for simultaneous large-scale genome-wide discovery and high-throughput genotyping of SNPs in diverse mapping populations to generate required ultra-high resolution genetic linkage maps in chickpea. Numerous past reports have established the essentiality of high-density linkage maps as references for more effective assembly and orientation of the high-quality sequenced scaffolds into the pseudomolecules corresponding the chromosomes of crop plant genomes1522232526272829303132333435363738. Therefore, it is indispensable to generate ultra-high density genetic linkage maps enriched with a greater number of SNP markers for efficient anchoring of assembled scaffolds into the chickpea chromosomes. This is expected to overall greatly facilitate the chickpea genome assembly and sequencing efforts in near future. Genotyping-By-Sequencing (GBS) is a simple, rapid and cost-effective next-generation sequencing (NGS)-based assay having potential for simultaneous large-scale discovery and genotyping of SNPs in diverse crop plants at a genome-wide scale3940. The high-barcoded multiplexing capacity (pooling up to 384-barcoded accessions in a single sequencing lane) at methylation-sensitive RE (restriction endonuclease)-sites (like APeKI) have enhanced the efficiency of GBS assay in fast discovery and genotyping of genome-wide SNPs in larger mapping populations. This in turn made the GBS assay much more advantageous over other available high-throughput traditional genotyping assays for constructing ultra-high density genetic linkage maps and genome-wide high-resolution (major and minor effects QTLs) gene/QTL mapping in small diploid as well as large genome crop species, including rice, wheat, barley, sorghum, Medicago and soybean2441424344454647484950. Considering its utility in many high-throughput marker-based genotyping applications in genomics-assisted breeding, the use of GBS assay for generating ultra-high resolution genome maps and molecular mapping of robust major as well as minor genes/QTLs with sub-optimal use of resources in a crop species like chickpea assumes significance. Nevertheless, very limited reports are available with regard to the use of GBS approach in discovery and genotyping of genome-wide SNPs in intra- and inter-specific mapping populations for constructing ultra-high resolution genetic linkage maps and fine mapping of QTLs regulating important agronomic traits in chickpea vis-à-vis other crop plants22455051. Over the past few years, a number of QTLs associated with important agronomic traits, including yield component and stress tolerance traits have been identified and mapped on low/high-resolution intra- as well as inter-specific genetic linkage maps of chickpea313142052535455565758596061626364656667686970717374757677. However, very limited number of these QTLs could either be successfully validated in diverse genetic backgrounds of accessions across geographical locations/years or fine-mapped through map-based cloning for their utilization in marker-assisted genetic improvement of chickpea131476. To expedite such process of identification and mapping of high-resolution and robust major and minor effect QTLs at genome-wide scale, the construction of SNP marker-based ultra-high density intra- and inter-specific genetic linkage maps using GBS assay at present is anticipated as an attractive approach for chickpea genetic enhancement. Keeping all above in view, we discovered 26785 and 16573 genome-wide SNPs showing differentiation between parental genotypes [ICC 12299 (desi) and ICC 8261 (kabuli)] by their genotyping in a 275 F7 RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 8261) using reference desi and kabuli genome-based GBS assay, respectively. A selected 3625 and 2177 high-quality SNPs of these, were integrated into eight desi and kabuli chromosomes, respectively to construct ultra-high density intra-specific genetic linkage maps in chickpea. These high resolution genetic maps were further used as a reference to identify and map the novel major genomic regions underlying robust QTLs associated with three important agronomic traits in chickpea. To delineate functionally relevant candidate gene(s) regulating seed weight, one of the strong trait-associated major genomic region harbouring robust QTL has been narrowed down by integrating comparative desi and kabuli genome-based QTL mapping with QTL region-specific haplotype (linkage disequilibrium)-based high-resolution trait association analysis, differential gene expression profiling and gene haplotype-based association mapping in chickpea.

Results

Discovery and large-scale validation of GBS-based genome-wide SNPs

We generated an average of 207.3 million high-quality sequence reads (ranged from 1.9 to 8.3 with a mean of 3.4 million reads per individual) (Fig. S1) by sequencing of each 3 × 96-plex (288-plex) ApeKI libraries made from 275 RIL mapping individuals and two parental genotypes (ICC 12299 × ICC 8261) using GBS assay. However, 84.5 and 82.6% of sequence reads of these, were mapped to unique physical location of desi and kabuli reference genomes, respectively. It identified 43358, including 26785 and 16573 high-quality SNPs [with read-depth ≥10, SNP base quality ≥20, <10% missing data in each individual, 5% minor allele frequency (MAF) and 100% reproducibility] differentiating two parental genotypes using the reference desi and kabuli genome-based GBS assay, respectively (Fig. 1A). Of these, 22331, including 7968 and 14363 SNPs were physically mapped on eight chromosomes of desi and kabuli genomes with average map-densities of 15.6 and 24.2 kb, respectively (Table S1, Fig. 1A, B, C). A higher proportion of SNPs were mapped on desi and kabuli chromosomes 3 (19.4%, 1548 SNPs) and 4 (19.9%, 2854), respectively. The desi chromosomes 7 and 8 (13.9 kb each) and kabuli chromosome 4 (17.2 kb) had maximum mean map-density (Table S1). The remaining 21027 SNPs, including 18817 and 2210 SNPs were physically mapped on scaffolds of desi and kabuli genomes, respectively (Fig. 1A). The A/G and C/T transitions (62.7 and 55.1% of total SNPs identified in desi and kabuli, respectively) were more abundant compared with that of transversions (Fig. 1D). Detail characteristics of 43358 SNPs discovered in our study are provided in the Table S2. Of these, 16573 SNPs discovered by kabuli genome-based GBS assay have been submitted to NCBI dbSNP (http://www.ncbi.nlm.nih.gov/SNP) with SNP submission (SS) accession numbers 1067289484 to 1067306056 for unrestricted use [NCBI db SNP Build (B142) year 2014 (http://www.ncbi.nlm.nih.gov/SNP/snp_viewTable.cgi?handle=NIPGR)].
Figure 1

(A) Relative distribution and frequency of 43358 SNPs discovered in eight chromosomes and scaffolds of desi and kabuli chickpea using reference genome-based GBS assay. Distribution of 22331 SNPs physically mapped on eight desi (B) and kabuli (C) chromosomes are depicted in the Circos circular ideogram. The innermost circles display the distribution of SNPs identified through GBS assay, while outermost circles represent the eight chickpea chromosomes coded with different colours. (D) Proportionate distribution of transition and transversion SNPs identified using the reference desi and kabuli genomes-based GBS assays.

Four hundred-eighty SNPs mined by reference desi and kabuli genome-based GBS assay were targeted for their validation through resequencing of PCR amplicons (384 SNPs) and MALDI-TOF mass array (96 SNPs). It validated 437 (91%) SNPs successfully in parental genotypes and selected mapping individuals of a RIL population (ICC 12299 × ICC 8261) (Fig. S2A, B). Remarkably, all these validated SNPs revealed expected homozygous and heterozygous alleles discrimination in RILs as detected through our GBS assay.

Construction of ultra-high density intra-specific genetic linkage maps

The genotyping information of 4448 and 2689 high-quality SNPs (MAF ≥ 20%) discovered by reference desi and kabuli genome-based GBS assay, respectively (Table S3) in parental genotypes and 275 individuals of a RIL mapping population (ICC 12299 × ICC 8261) were used to construct ultra-high density intra-specific genetic linkage maps in chickpea. The linkage analysis of SNPs showing significant Mendelian segregation ratio (1:1) mapped 3625 and 2177 SNPs on eight desi and kabuli linkage groups (LGs) (LG1 to LG8) of intra-specific genetic maps, respectively according to their physical positions (bp) on corresponding chromosomes (Table 1, Fig. S3, S4, S5). Considering significant variations in the genome assembly and differences in sequence coverage and length (bp) of chromosomal pseudomolecules between desi and kabuli genomes78, we preferred to construct two genetic maps separately by using desi and kabuli genome-based high-quality GBS-SNPs and their corresponding physical positions on eight chromosomes as reference. The constructed desi and kabuli SNP-based ultra-high density intra-specific genetic maps comprising of eight LGs covered a total map length of 714.089 and 798.467 cM with mean inter-marker distances of 0.20 and 0.37 cM, respectively (Table 1). Maximum numbers of SNPs were mapped on desi LG1 (736 SNPs) and kabuli LG3 (463), while desi LG8 (281) and kabuli LG5 (104) contained minimum numbers of SNPs (Table 1). The map length spanned by each LG of desi ranged from 85.662 cM (386 SNPs) in LG2 to 97.782 cM (574) in LG4, while for kabuli, it varied from 81.444 cM (292 SNPs) in LG5 to 120.359 cM (386) in LG2 (Table 1). In desi, most saturated genetic map was LG1 (average inter-SNP marker distance: 0.12 cM), whereas LG5 and LG8 were least saturated (0.32 cM). In kabuli, LG3 (mean inter-SNP marker distance: 0.18 cM) and LG5 (0.78 cM) had most and least saturated genetic maps, respectively (Table 1).
Table 1

Detailed characteristics of constructed ultra-high density intra-specific genetic linkage maps of desi and kabuli chickpea

Linkage groups (LGs)SNPs mappedMap length covered (cM)Average inter-marker distance (cM)
DesiKabuliDesiKabuliDesiKabuli
CaLG0173634386.07691.7040.120.27
CaLG0238619985.662120.3590.220.60
CaLG0358746385.99382.1240.150.18
CaLG0457439997.782105.1270.170.26
CaLG0529210494.06981.4440.320.78
CaLG0643028587.601110.0090.200.38
CaLG0733919487.821105.1840.260.54
CaLG0828119089.085102.5160.320.54
Total36252177714.089798.4670.200.37

Estimation of genome-wide LD patterns in chickpea

The genome-wide LD estimates (r2) and extent of LD decay using all possible pair-combinations of 3625 and 2177 SNPs genetically mapped on eight desi and kabuli LGs of intra-specific genetic maps, respectively were determined. The average LD estimates in eight LGs of desi genome (r2: 0.62) was almost comparable with that of kabuli genome (0.59) (Table S4). The LG4 of both desi (r2: 0.78) and kabuli (0.72) genetic maps had highest LD estimates. Maximum proportion of SNP-pairs showed significant LD (P < 0.0001) on desi LG4 (42.8%) and kabuli LG2 (34.6%) (Table S4). We determined the LD decay of 3625 and 2177 SNP-pairs by pooling the r2 estimates across eight desi and kabuli LGs and plotting their average r2 against the 5 cM equal intervals of genetic distance (maximum up to 120 cM). A decreasing trend of LD decay (r2 < 0.3) was observed with increase in the genetic distance (cM) of SNP markers mapped on the desi and kabuli LGs (Fig. S6). Remarkably, a rapid LD decay was observed at the genetic distance of 5 cM in both desi and kabuli genomes. The desi and kabuli LGs overall sustained a significant level of LD up to a genetic distance of 10 cM. A significant LD decay (r2 < 0.1) was observed near about 15 cM genetic distance (Fig. S6) in LGs of these two genomes.

Identification and mapping of high-resolution trait-governing QTLs

A significant difference of three quantitative agronomic traits, namely pod number/plant (PN) (51.1–141.2 with 79–80% heritability), seed number/plant (SN) (60.6–231.6 with 77–81%) and 100-seed weight SW (7.1–43.7 g with 88–90%, Fig. S7) in 275 RIL mapping individuals and parental genotypes across three years based on ANOVA was observed (Table S5). We observed bi-directional transgressive segregation of these traits beyond that of parental genotypes in RILs (Fig. S8). The normal frequency distribution of all three quantitative traits in RILs and parental genotypes was evident. The coefficient of variation (CV) ranged from 0.14 for PN to 0.33 in SW (Table S5). The Pearson's correlation coefficient estimated among three pair-wise combinations of quantitative traits indicated significant (P < 10−2) negative correlation of PN and SN with SW (r = −0.42 to −0.51) as well as positive correlation between PN and SN (0.95) (Fig. S9). For genetic/QTL mapping, the genotyping data of 3625 parental polymorphic desi-genome based SNPs mapped on an ultra-high density intra-specific genetic linkage map was integrated with phenotyping data of 275 RIL mapping population. It identified and mapped 33 genomic regions harbouring 35 major and significant (LOD: 7.25–14.5) QTLs associated with PN, SN and SW on seven LGs (except LG3) of chickpea (Table 2, Fig. S10, Fig. S11). The proportion of phenotypic variance explained (PVE) by individual QTL varied from 17.9–39.7%. Collectively, all these QTLs revealed 37.3% PVE. All these identified QTLs (considered as robust QTL) showing consistent phenotypic expression and major effects on traits individually with PVE of > 10% each, were validated across two geographical locations as well as years/seasons in field. A maximum number of QTLs (nine QTLs) were mapped on LG1 followed by LGs 2, 6 and 7 (5 QTLs each) and minimum (3 QTLs) on LG5 (Table 2). The genomic regions (from 0.015 cM on LG4 to 1.978 cM on LG1) harbouring the QTLs covered with 106 SNP markers were mapped on seven LGs.
Table 2

Significant and robust QTLs regulating three agronomic traits identified and mapped on an ultra-high density intra-specific genetic linkage map of chickpea

Linkage groups (LGs)QTL identityLGsMarker intervals with genetic positions (cM)Peak LOD scorePhenotypic variance explained (PVE R2%) by QTLsTraits associatedAdditive effect
Ca_Desi_LG01qSW1.1LG_1Ca_Desi_SNP_1 (0) to Ca_Desi_SNP_2 (0.146)13.529.8SW4.5
qPN1.1LG_1Ca_Desi_SNP_15 (5.54) to Ca_Desi_SNP_17 (7.518)8.222.4PN3.2
qPN1.2LG_1Ca_Desi_SNP_40 (13.733) to Ca_Desi_SNP_42 (14.48)8.3122.8PN2.8
qPN1.3LG_1Ca_Desi_SNP_250 (34.293) to Ca_Desi_SNP_252 (34.453)8.1122PN1.9
qSN1.1LG_1Ca_Desi_SNP_486 (54.514) to Ca_Desi_SNP_488 (54.597)10.9529.6SN2.7
qSW1.2LG_1Ca_Desi_SNP_513 (56.043) to Ca_Desi_SNP_515 (56.158)1328.2SW3.2
qPN1.4LG_1Ca_Desi_SNP_591 (61.419) to Ca_Desi_SNP_595 (61.738)1435PN5.1
qSW1.3LG_1Ca_Desi_SNP_636 (64.655) to Ca_Desi_SNP_638 (64.817)7.3923.2SW4.5
qPN1.5LG_1Ca_Desi_SNP_655 (67.106) to Ca_Desi_SNP_657 (67.22)7.2522.6PN3.8
Ca_Desi_LG02qSN2.1LG_2Ca_Desi_SNP_752 (9.224) to Ca_Desi_SNP_754 (10.101)8.9125.4SN2.5
qSN2.2LG_2Ca_Desi_SNP_902 (39.18) to Ca_Desi_SNP_904 (39.252)8.4223.3SN2.8
qPN2.1LG_2Ca_Desi_SNP_946 (48.4) to Ca_Desi_SNP_948 (48.982)10.5628PN3.4
qPN2.2LG_2Ca_Desi_SNP_1012 (58.324) to Ca_Desi_SNP_1014 (58.377)8.1222PN4.7
qSN2.3LG_2Ca_Desi_SNP_1113 (80.057) to Ca_Desi_SNP_1115 (81.06)10.7232.5SN4.2
Ca_Desi_LG04qPN4.1LG_4Ca_Desi_SNP_1842 (42.717) to Ca_Desi_SNP_1844 (43.31)9.1622.2PN3.8
qPN4.2LG_4Ca_Desi_SNP_2011 (51.285) to Ca_Desi_SNP_2013 (51.308)8.3222PN3.4
qPN4.3LG_4Ca_Desi_SNP_2023 (51.702) to Ca_Desi_SNP_2025 (51.717)9.7424.7PN4.3
qSW4.1LG_4Ca_Desi_SNP_2088 (56.304) to Ca_Desi_SNP_2090 (56.442)9.0221.6SW2.6
Ca_Desi_LG05qSN5.1LG_5Ca_Desi_SNP_2336 (23.792) to Ca_Desi_SNP_2338 (24.875)12.638.1SN6.7
qSW5.1LG_5Ca_Desi_SNP_2337 (23.805) to Ca_Desi_SNP_2338 (24.875)14.539.7SW8.9
qSN5.2LG_5Ca_Desi_SNP_2557 (76.802) to Ca_Desi_SNP_2559 (77.188)10.524.5SN1.9
Ca_Desi_LG06qSW6.1LG_6Ca_Desi_SNP_2608 (10.981) to Ca_Desi_SNP_2610 (11.552)7.817.9SW2.6
qSW6.2LG_6Ca_Desi_SNP_2902 (56.733) to Ca_Desi_SNP_2904 (57.25)8.3721.8SW3.5
qPN6.1LG_6Ca_Desi_SNP_2903 (56.755) to Ca_Desi_SNP_2907 (57.422)7.8620.7PN3.8
qSW6.3LG_6Ca_Desi_SNP_2931 (62.038) to Ca_Desi_SNP_2933 (62.381)11.6225.7SW4.6
qPN6.2LG_6Ca_Desi_SNP_2993 (77.157) to Ca_Desi_SNP_2995 (78.929)9.2226.4PN5.2
Ca_Desi_LG07qSW7.1LG_7Ca_Desi_SNP_3006 (0) to Ca_Desi_SNP_3007 (1.014)8.2822.7SW3.6
qPN7.1LG_7Ca_Desi_SNP_3089 (24.832) to Ca_Desi_SNP_3091 (24.848)8.3522.5PN4.3
qPN7.2LG_7Ca_Desi_SNP_3157 (34.458) to Ca_Desi_SNP_3159 (34.858)8.2822.1PN3.9
qPN7.3LG_7Ca_Desi_SNP_3230 (44.792) to Ca_Desi_SNP_3232 (45.282)9.6524.3PN4.5
qPN7.4LG_7Ca_Desi_SNP_3269 (54.679) to Ca_Desi_SNP_3271 (54.792)10.4527.6PN5.1
Ca_Desi_LG08qSW8.1LG_8Ca_Desi_SNP_3387 (19.508) to Ca_Desi_SNP_3389 (19.651)9.0621.7SW3.9
qSN8.1LG_8Ca_Desi_SNP_3394 (20.582) to Ca_Desi_SNP_3396 (21.005)7.4421.7SN2.8
qSN8.2LG_8Ca_Desi_SNP_3536 (54.055) to Ca_Desi_SNP_3538 (54.218)8.4527.6SN4.6
qPN8.1LG_8Ca_Desi_SNP_3555 (57.041) to Ca_Desi_SNP_3561 (58.066)9.2122.4PN5.1
For PN, 16 major genomic regions underlying 17 robust QTLs (PVE: 20.7–35% and LD: 7.25–14.0) were identified and mapped on six LGs. Collectively, all these QTLs showed 36.4% PVE. The PN QTL regions covered with 57 SNP markers on LGs (0.015 cm on LG4 to 1.978 cM on LG1) showed mostly the positive additive gene effects indicating the effective contributions of ICC 12299 alleles at these loci for increasing pod number in chickpea. For SN, eight major genomic regions harbouring eight robust QTLs explaining 21.7–38.1% PVE (LOD: 7.44–12.6) were identified. These SN QTLs covered with 24 SNPs (0.072 cM on LG2 to 1.083 cM on LG5) were mapped on four LGs (Table 2 and Fig. S10). The combined PVE estimated for all these QTLs was 35.8%. The QTLs showed positive additive gene effects for increasing seed number with large allelic contribution from ICC 12299. For SW, nine major genomic regions harbouring 10 robust QTLs covered with 27 SNP markers (0.115 cM on LG1 to 1.07 cM on LG5) were mapped on six LGs explaining 23.2–39.7% PVE (LOD: 7.4–14.5) (Table 2 and Fig. S10). The combined PVE for all these QTLs was 39.4%. These QTLs showed positive additive gene effects for increasing seed weight with allelic contributions from ICC 8261.

Structural and functional annotation of SNPs mapped on intra-specific genetic linkage maps

The structural annotation of 4448 and 2689, including 3625 and 2177 high-quality desi and kabuli GBS-SNPs mapped on intra-specific genetic maps revealed the presence of 3017 and 1349 SNPs in the intergenic regions, while 1431 and 1340 SNPs in the different sequence components of 635 and 760 genes, respectively (Table S6A and B). The detailed annotation of SNPs within genes identified maximum frequency (35.4%, 507 SNPs in desi and 49.4%, 662 SNPs in kabuli) of SNPs in the exons and minimum in the 1000-bp downstream regulatory regions (DRR) (27.9%, 399 and 17.7%, 238) (Fig. 2A and S12A). Remarkably, 278 and 229 coding SNPs-containing desi genes, while 352 and 328 coding SNPs-carrying kabuli genes showed synonymous and non-synonymous (missense and nonsense) substitutions, respectively (Fig. 2B and S12B). The non-synonymous SNPs identified in desi and kabuli comprised of 223 and 323 missense SNPs in the 149 and 234 genes showing amino acid substitutions as well as six and five nonsense SNPs in the six and five genes, respectively culminating into premature termination codons introduced by nucleotide replacements (Fig. 2B and S12B). Maximum (5.3 SNPs/gene) and minimum (1.6 SNPs/gene) average frequency of SNPs was observed in the non-coding intronic and coding sequence components of desi genes, respectively. In contrast, we observed comparable average SNP frequency between intronic (1.7 SNPs/gene) and coding (1.9) sequence components of kabuli genes.
Figure 2

(A) Proportionate distribution of 4448 SNPs identified in different coding and non-coding sequence components and intergenic regions of 635 genes annotated from desi chickpea genome. (B) Number of SNPs, including synonymous and non-synonymous (missense and nonsense) SNPs annotated in the coding (CDS) as well as non-coding (intronic and regulatory) sequences of genes and intergenic regions of desi genome. The URR (upstream regulatory region) and DRR (downstream regulatory region) of genes were defined as 2000-bp upstream sequences from the translation start codon (ATG) and 1000-bp downstream sequences from the stop codon, respectively.

The functional annotation of 635 desi and 760 kabuli genes with SNPs detected a higher proportion of SNPs belonging to growth, development and metabolism-related proteins (58 and 63% in desi and kabuli, respectively) followed by transcription factors (16 and 18%) (Fig. S13A). The KOG-based determination of putative functions (excluding unknown and general functions) for SNPs-carrying genes revealed their maximum correspondence to transcription (K, 25% in desi and 31% in kabuli) (Fig. S13B). The GO enrichment analysis of SNPs-carrying desi and kabuli genes showed a significant (P ≤ 5.9 × 10−3) overrepresentation/enrichment of GO terms in the genes associated with transport (15.4%) belonging to biological process (Fig. S13C).

QTL region-specific haplotype (LD)-based trait association mapping

One major and robust (LOD: 14.5 and R2: 39.7%) QTL (qSW5.1) region [Ca_Desi_SNP2337 (23.805 cM)-Ca_Desi_SNP2338 (24.875 cM)] governing SW mapped on desi LG5 of intra-specific genetic map was selected for fine mapping through QTL target-specific haplotype (LD)-based high-resolution trait association mapping (Fig. S10 and Fig. 3A). The genomic region underlying such a major QTL was defined by integrating the genetic linkage map information with that of physical map of desi genome. The target QTL interval genetically mapped on desi LG5 was compared and correlated with that mapped on kabuli LGs of a genetic map (based on QTL mapping). In kabuli, the SW trait-influencing QTL region spanned with 1.6 cM genetic interval between the markers Ca_Kabuli_SNP1459 (34.6 cM) and Ca_Kabuli_SNP1462 (36.2 cM) was mapped on LG5 (Fig. 3B). The integration of genetic linkage map information with physical map of kabuli genome, the 1.6 cM QTL interval corresponded to 713 kb genomic region (40020.4–40733.4 kb) on chromosome 5 (Fig. 3C).
Figure 3

Integration of desi (A) and kabuli (B) genetic maps with kabuli physical (C) map of one major genomic region harbouring robust SW QTL (qSW5.1) mapped target QTL on 713 kb sequence interval of chromosome 5. This QTL was further narrowed down to about 22.5 kb sequenced region on chromosome 5 by integrating QTL mapping with QTL region-specific haplotype (LD)-based high-resolution trait association analysis (D). The 22.5 kb genomic region between the markers CaSNP1 and CaSNP2 contained five candidate protein-coding genes, of which one of the regulatory SNP-carrying embryo defective kabuli gene (Ca07594) showed strong association with SW in chickpea (D). The genetic (cM) and physical (bp) distance and identity of the markers mapped on the LGs/chromosomes are indicated on the left and right side of the LGs/chromosomes, respectively.

The targeted resequencing of this 713 kb qSW5.1 QTL region in parental genotypes (ICC 12299 and ICC 8261) and 10 of each homozygous low and high seed weight RIL mapping individuals detected 804 high-quality SNPs (mean SNP density: 1/886.8 bp). It includes 435 intergenic SNPs and 369 SNPs derived from the diverse coding and non-coding sequence components of 64 genes. The decay of LD over longer genetic distance (~15 cM, Fig. S6) particularly at the target SW QTL interval mapped on kabuli LG5 gave us clue to narrow-down this QTL region of interest using SNP haplotype (LD)-based high-resolution trait association mapping. The genotyping of 804 SNPs identified and mapped on the 713 kb qSW5.1 QTL region in 244 accessions belonging to an SW-specific association panel (Table S7) enabled to constitute 12 haplotypes (Fig. 3D). The SNP haplotype-based genotyping information were further correlated with their SW-specific robust field phenotyping data (SW: 5.9–57.6 g) for QTL region-specific trait association analysis. The integration of GLM and MLM analysis with EMMA and false discovery rate (FDR) correction based on multiple-comparisons (minimizing the confounding effect of population structure) identified one best haplotype (H1) between two SNPs (CaSNP1 and CaSNP2) in two genes (Ca07590 and Ca07594) showing strong association (P: 2–4 × 10−6 and R2: 37.6–40.3%) with SW in contrast to any other SNP combinations. This haplotype region covered a maximum of 22.5 kb (40710.6–40733.1 kb) physical distance between CaSNP1 and CaSNP2 at qSW5.1 QTL interval. The structural and functional annotation of this 22.5 kb sequenced short QTL region with kabuli genome annotation database identified five protein-coding candidate genes (Fig. 3D). The comprehensive analysis of strong SW-associated haplotypes-constituting two genic SNPs detected one regulatory SNP (CaSNP2: C/T) in the URR (upstream regulatory region) of an embryo defective protein-coding gene showing strong association (P: 2.0 × 10−5 and R2 = 39.8%) with SW.

Expression profiling of SW-associated genes

To infer the differential regulatory gene expression patterns, the expression profiling of five protein-coding candidate genes annotated in the 22.5 kb delineated SW-specific robust QTL interval (qSW5.1) was performed. The gene-based primers were amplified using the RNA isolated from three different vegetative and reproductive tissues and two seed developmental stages of low and high SW parental genotypes (ICC 12299 and ICC 8261) of a RIL mapping population through quantitative RT-PCR assay (Fig. S14). One regulatory SNP-carrying embryo defective gene of these, constituting strong SW-associated haplotypes at qSW5.1 QTL interval revealed seed-specific expression (as compared to vegetative and reproductive tissues) as well as pronounced down-regulation (>7-fold) in parental genotypes during seed development (Fig. S14). Henceforth, this gene localized in a major SW-influencing QTL interval (qSW5.1) was selected as target candidate for understanding its significance in seed weight regulation through high-resolution gene haplotype-specific association/LD mapping in chickpea.

Haplotype-based association mapping in a strong SW-associated gene

The sequencing of 9286 bp cloned amplicon covering the entire coding and non-coding regions (including 2 kb URR) of a strong SW-influencing embryo defective gene (Fig. 4A) (validated by QTL mapping, QTL region-specific association analysis and differential expression profiling) among 244 accessions belonging to a SW-specific association panel identified 12 SNP loci (Fig. 4B). It includes four regulatory SNPs in the URR of this gene. The gene-based haplotype analysis combining the genotyping data of 12 SNPs constituted a maximum of four haplotypes among accessions (Fig. 4B).
Figure 4

The molecular haplotyping, LD mapping and gene haplotype-specific trait association analysis in an embryo defective gene validating its strong association potential for SW in chickpea.

The genotyping of 12 SNPs, including one regulatory SNP (C/T) in the URR of this gene (A) among 244 association panel constituted four haplotypes (B). The four SNP haplotype marker-based genotyping information produced higher LD estimates covering the entire 9286 bp sequenced region of gene (C). A significant correlation of parental genotypes and RIL mapping individuals representing high (haplotype I: TAC) and low (haplotype II: CGT) SW haplotypes [constituted by three SNPs (shaded with yellow colour in B)] in URR of gene with their seed-specific diverse transcript expression (D) during seed development was evident. A superior favorable high SW-regulating haplotype (TAC) with decreased transcript expression was identified in the URR of gene (D).

The association analysis using the four SNP marker-based haplotypes in an embryo defective gene with SW (5.9 to 57.6 g)-specific multi-location replicated field phenotyping data of 244 accessions (association panel) revealed its strong association with SW (P: 2.5 × 10−7 and R2: 44.7%). The haplotype-pair-based LD estimation produced a significant high degree of LD (r2 > 0.70 and P < 0.0001) across entire 9286 bp sequenced gene (Fig. 4C), which increased its overall potential for trait association. The 94 high SW (40–57.6 g) accessions represented by single haplotype group 1 (TAC) in the gene differentiated distinctly from another haplotype group 2 (CGT) consisting 77 accessions of low SW (5.9–18.4 g) (Fig. 4B) with higher phenotypic variance (43.4–44.7% at P < 2.0 × 10−6) and thus strong association potential of this gene for SW in chickpea is expected. The differential expression profiling using these high (TAC) and low (CGT) SW-specific haplotypes constituted by three SNPs in URR of an embryo defective gene revealed significant down-regulated expression (~6 times) specifically in two seed developmental stages of low and high SW parental genotypes and homozygous RIL mapping individuals as compared to leaf. It inferred that favourable natural allelic variants and superior haplotype (TAC) constituted in the URR of an embryo defective gene involved in decreased transcript expression and thus have significance in regulation of seed development and consequently seed weight in chickpea.

Discussion

We utilized NGS-based 3 × 96-plex GBS assay for large-scale validation and high-throughput genotyping of genome-wide SNPs in a 275 RIL mapping population (ICC 12299 × ICC 8261) to construct ultra-high density intra-specific genetic linkage maps in chickpea. The reference desi and kabuli genome-based GBS assay identified 26785 and 16573 high-quality SNPs (5% MAF) differentiating the two parental genotypes (ICC 12299 and ICC 8261) of a RIL mapping population with 100% reproducibility, respectively. Notably, 22331, including 7968 and 14363 SNPs of these, showed extensive genome coverage across eight desi and kabuli chromosomes with mean densities of 1/15.6 and 1/24.2 kb, respectively. A high experimental validation success rate (91%) of 480 randomly selected SNPs from a total of 43358 GBS-based SNPs by amplicon resequencing and MALDI-TOF mass array genotyping assay was evident. A number of studies in the past have accessed the potential of diverse NGS-based assay for efficient mining and genotyping of non-erroneous SNPs through experimental validation of randomly chosen smaller set (~1–2%) of SNPs from a whole genome SNP dataset in diverse crop plants, including chickpea12224245505179808182. The correspondence of on an average of 39% (16910 SNPs) desi and kabuli genome-derived GBS-SNPs with that of SNP allelic information available in various chickpea genotypes8912223751 in silico based on their congruent physical positions was clearly evident. These findings overall suggest the robustness and utility of GBS assay for rapid large-scale mining and high-throughput genotyping of valid and high-quality SNPs at a genome-wide scale in chickpea. The advantages of GBS-assay in simultaneous genome-wide discovery and genotyping of SNPs and their ability to expedite various large-scale genotyping applications have been demonstrated in many crop plants244041424344454647484950. Collectively, a large number of 43358 high-quality reference desi and kabuli genome-based GBS-SNPs (61% novel SNPs) developed by us (submitted to NCBI SNPdb for unrestricted use) could serve as a valuable genomic resource for their immense use in genomics-assisted breeding applications of chickpea. An ultra-high density 3625 and 2177 SNP marker-based intra-specific genetic linkage maps (ICC 12299 × ICC 8261) with eight desi (mean inter-marker distance 0.20 cM) and kabuli (0.37 cM) LGs constructed in our study are highly saturated compared to the previously reported intra- and inter-specific genetic linkage maps (0.50–8.01 cM) in chickpea6789102251. Remarkably, the currently generated genetic linkage maps had much lower marker map-density and thus highly saturated in contrast to latest available high-density intra-specific genetic linkage maps (0.50–3.68 cM) in chickpea111213142251. Beside chickpea, the efficacy of different high-throughput genome-wide SNP discovery and genotyping approaches, including GBS assay for construction of ultra-high density genetic linkage maps have been well demonstrated in several other crop plants4151617181920212223242543. The ultra-high density genetic linkage maps have been successfully deployed as a reference for effective assembly, precise orientation and anchoring of high-quality scaffolds into pseudomolecules of the chromosomes arising out of whole genome sequencing efforts (finished and/or in progress) in many plant species1522232526272829303132333435363738. Therefore, ultra-high density intra-specific genetic linkage maps constructed in our study can serve as a reference for efficient generation of integrated genetic, physical and genome map, including identification and fine-mapping of major as well as minor QTLs regulating important agronomic traits in chickpea. A significant LD estimates across eight desi (0.62) and kabuli (0.59) LGs of constructed intra-specific genetic maps using mapped SNP markers reflected direct correlation of LD patterns with the density of SNPs required to cover the genomes. In spite of large differences in SNP marker map density between desi (0.20 cM) and kabuli (0.37) LGs of genetic maps, we observed higher LD estimates and extended LD decay (~15 cM) in LGs of both desi and kabuli genomes. This is much higher than the LD decay estimated in other self- and cross-pollinated crop plants8384858687888990. It could be due to sequential bottlenecks during chickpea domestication resulting in its narrow genetic base in contrast to other crop plants363774919293. Overall, the genome-wide LD patterns and constructed LD maps gave an approximate perception regarding the SNP density required to identify the functionally relevant trait-regulatory robust genes/QTLs in a large chickpea genome with narrow genetic base using genetic (fine-mapping and map-based cloning) and association mapping. Our study utilized ‘α-lattice’ designs in multi-location experimental field trials to efficiently grow and comprehensively phenotype the 275 RIL mapping individuals (ICC 12299 × ICC 8261) along with their parental genotypes for three important yield component traits (PN, SN and SW) in chickpea. The ‘α-lattice’ designs are more preferred than the commonly adopted ‘randomized complete block design (RCBD)’, since they are less prone to experimental errors, cost-effective and provide more precise (~35% higher) phenotyping information in field experiments aside high reproducibility. The ‘α-lattice’ designs are also associated with smaller coefficient of variation and error mean square, which collectively gives them an edge over the traditionally used RCBD designs specifically when phenotyping of yield component trait variables was conducted for very large numbers of diverse crop genotypes in field experiments with a smaller plot size949596. Thirty-three major genomic regions harbouring 35 robust QTLs (LOD: 7.25–14.5 and PVE: 17.9–39.7%) associated with three important agronomic traits (SW, PN and SN) were identified and mapped on seven desi LGs of an ultra-high density genetic map, which were well validated in multiple geographical locations/environments across years in field. The reliability and validity of the identified PN, SN and SW QTLs were determined by correlating/comparing their underlying genomic regions with that of earlier QTL mapping studies13525964697477979899100101 based on correspondence of physical positions of markers covering the target QTL intervals. This analysis identified one of each PN (qPN8.1) and SW (qSW4.1) QTLs in line with previous QTL mapping studies using diverse intra- and inter-specific mapping populations of chickpea135964699798101. This analysis indicated novelty and population-specific characteristics of our identified 33 robust QTLs for SW, PN and SN traits in chickpea. Several QTL mapping studies in crop plants, including legumes have effectively utilized ultra-high density genetic linkage maps derived from different intra- and inter-specific mapping populations as foundation for rapid molecular mapping as well as fine mapping/map-based cloning of major genes harbouring robust QTLs governing multiple yield contributing traits (like pod and seed number and seed weight) to accelerate genomics-assisted crop improvement202148102103104105. The use of ultra-high density genetic linkage map as a reference in the present study, significantly narrowed down the marker intervals in the QTL regions to 0.015–1.978 cM (mean < 1 cM), which is lower than that reported (1–3 cM) so far in diverse QTL mapping studies of chickpea31213142022517273747576. It further authenticates the use of SNP markers that are tightly linked to the QTLs for subsequent validation in diverse genetic backgrounds along with fine mapping/map-based cloning of QTLs controlling three agronomic traits in chickpea. Moreover, SNPs localized in the QTL region mapped on eight LGs of an ultra-high density intra-specific genetic linkage map derived from the diverse coding and non-coding sequence components of genes could certainly expedite the process of validation and/positional cloning of QTLs governing three agronomic traits in chickpea. The required inputs obtained from these analyses will eventually help us to identify potential candidate genes underlying the QTLs for marker-assisted genetic enhancement of chickpea. The structural and functional annotation of SNPs mapped on eight desi and kabuli LGs, including SW, PN and SN QTL intervals of ultra-high density intra-specific genetic maps will enable us to select functional gene-based SNPs (like non-synonymous coding and regulatory SNPs) for establishing marker-trait linkages and identifying genes/QTLs associated with important qualitative and quantitative traits through genetic and association mapping in chickpea. One strong SW-associated major genomic region (LOD: 14.5, PVE: 38.7%) harbouring a robust QTL (qSW5.1) mapped on desi LG5 of an intra-specific genetic map was compared/correlated with that of integrated genetic and physical maps of kabuli to fine map the major QTL region. The extended LD decay (~15 cM) in desi and kabuli LGs of genetic maps suggested that high-resolution haplotype (LD)-based trait association mapping at target QTL interval could be an attractive alternative approach for narrowing-down the QTL regions to candidate gene(s) regulating three said agronomic traits in chickpea. Henceforth, we combined haplotype (LD)-based high-resolution trait association mapping at target SW-specific QTL region with differential gene expression profiling (during seed development in low and high seed weight parental genotypes), which delineated one regulatory SNP-containing embryo defective protein-coding gene governing seed weight in chickpea. Such haplotype-specific association analysis in presence of high-resolution significant LD at major QTL region have utility to scale-down the QTLs to specific candidate gene(s) regulating important agronomic traits in crop plants106107108109. The strong trait association potential of an embryo defective gene with SW has been ascertained by higher contribution of a significant superior haplotype (TAC) identified in the URR of this gene with 44.7% SW phenotypic variation in a constituted 244 association panel. This was further supported by the seed-specific expression and significant down-regulated expression of gene haplotype transcript during seed development in contrasting SW parental genotypes and RIL mapping individuals. The favourable natural allelic variants and optimal superior haplotype identified in an embryo defective gene will be helpful in dissection of gene regulatory networks underlying this complex quantitative SW trait in chickpea. This identified gene could eventually be used as a potential candidate in marker-assisted genetic enhancement of chickpea for increasing seed weight and higher yield. The embryo defective chickpea gene ortholog in Arabidopsis is known to control the normal embryo development, which implies the role of this vital gene in natural growth and seed development of crop plants110. Considering the availability of very limited information regarding embryo defective gene in plants, a further comprehensive molecular characterization and functional validation of this gene in over-expression/knock-down transgenics is required to ascertain its definite involvement in regulating seed development and seed weight of chickpea.

Methods

Generation of an intra-specific chickpea mapping population

An intra-specific F7 RIL mapping population (ICC 12299 × ICC 8261) comprising of 275 individuals derived from the crosses between two parental chickpea genotypes ICC 12299 and ICC 8261 was generated using a single seed descent method. The chickpea genotype ICC 12299 originated from Nepal is a low 100-seed weight (9 g) and high pod (103) and seed (158) number-containing desi (C. arietinum) landrace. In contrast, the genotype ICC 8261 (originated from Turkey) is a high 100-seed weight (30 g) and low pod (64) and seed (73) number-containing kabuli (C. arietinum) landrace.

Phenotyping of mapping population for agronomic traits

The RIL mapping individuals along with their parental genotypes were grown in the field according to ‘alpha (α)-lattice’ design for three consecutive years (2011–2013) with at least two replications during crop growing season at two diverse geographical locations of India. These RILs were phenotyped individually for three important quantitative agronomic traits, namely PN, SN and SW. The PN and SN was estimated by counting the average number of fully formed pods and seeds per plant (from 10–15 representative plants from each RILs at maturity), respectively. The SW (g) was measured by taking the average weight (g) of 100-matured seeds at 10% moisture content by selecting 10–15 representative plants from each RILs. The mean, standard deviation, coefficient of variation, broad-sense heritability, frequency distribution, correlation coefficient and analysis of variance (ANOVA) of three agronomic traits in RIL mapping population were analyzed using SPSS v17.0 and following the methods of Kujur et al.7499 and Saxena et al.77.

Genome-wide discovery and genotyping of SNPs using GBS assay

The genomic DNA isolated from 275 RILs, including two parental accessions (used as biological replicates) were digested with ApeKI, ligated to adapters containing one of 288 unique barcodes, constructed 288-plex GBS libraries and pooled together following the detailed procedures of Elshire et al.40 and Spindel et al.45. All these libraries were sequenced by Illumina TrueSeq V3.0 single end sequencing chemistry with read lengths of 100-bp using HiSeq2000 (Illumina Inc., San Diego, CA, USA) NGS platform. The FASTQ raw sequence reads were processed for high-quality sequence filtering and mapping through TASSEL-GBS. The sequence reads with a minimum phred Q-score of 10 across the first 72-bp nucleotide sequences were considered as high-quality, which were further rechecked for their quality using FASTQC v0.10.1. The retained good quality sequences were de-multiplexed using unique barcodes attached to each mapping individuals. The sequence reads of each individuals were aligned and mapped to reference draft desi (ICC 495837) and kabuli (CDC Frontier36) chickpea genome sequences separately using Burrows-Wheeler alignment tool (BWA) with default parameters. The missing SNP allele data in individuals were imputed using the haplotype probability method of fastPHASE. A maximum likelihood statistical model and a catalog was used to identify and genotype valid and high-quality SNPs (minimum sequence read depth: 10 with SNP base quality ≥20) in 275 RIL mapping individuals along with two parental genotypes.

Large-scale validation of SNPs

To validate the genome-wide SNPs identified through GBS assay, the primers designed from the 200-bp sequences flanking either-side of 384 selected SNPs were PCR amplified using the genomic DNA of two parental chickpea genotypes and selected RIL mapping individuals. The amplified PCR products was resequenced, aligned the high-quality sequences and the SNPs were detected among individuals (following the methods74111). A selected 96 SNPs identified by reference genome-based GBS assay were validated and genotyped in parental genotypes and representative mapping individuals using MALDI-TOF (matrix-assisted laser desorption ionization-time of flight) mass array SNP genotyping assay (http://www.sequenom.com) following the methods of Pandit et al.112 and Saxena et al.77111.

Construction of intra-specific genetic linkage maps

The genotyping data of reference desi and kabuli genome-based high-quality GBS-SNPs (MAF ≥ 0.2) showing differentiation between parental genotypes (ICC 12299 and ICC 8261) were analysed in 275 RIL mapping individuals using the χ2-test (p < 0.05). The SNP genotyping data showing goodness-of-fit to the expected Mendelian 1:1 segregation ratio were used further for linkage analysis using MAPMAKER/EXP3.0 and JoinMap 4.1 at higher LOD (logarithm of odds) threshold (>10.0) with Kosambi mapping function. The SNPs were integrated into eight desi and kabuli LGs of intra-specific genetic maps based on their centimorgan (cM) genetic distance using the methods of Kujur et al.7499 and Saxena et al.77. The marker ordering was performed using the RECORD algorithm implemented in RECORD_WIN. The best marker ordered genetic linkage map with shortest map distance (cM) was preferred. The LGs with genetically mapped SNPs were designated and numbered (LG1 to LG8) based on their corresponding physical positions (bp) on the chromosomes of chickpea genomes as determined in our study.

Determination of LD patterns

To determine genome-wide LD patterns in chickpea, the LD estimates (significant P-value ≤ 10−3) as average squared-allele frequency correlations (r2) among a pairs of SNP loci that are genetically mapped on eight desi and kabuli LGs of intra-specific genetic maps were analysed using the sliding window approach of TASSEL v5.0. The decay of LD with the genetic distance was measured by combining the r2 values of SNP-pairs mapped in a uniform genetic intervals of 5 cM (maximum up to 120 cM) across eight desi and kabuli LGs of genetic maps. The graph was plotted between pooled r2 and genetic distance (cM) based on nonlinear regression model considering the r2 value = 1 at marker genetic distance of 0 cM7499101113 to determine the trend of LD decay in desi and kabuli genomes.

Structural and functional annotation of genome-wide SNPs

The reference desi and kabuli genome-based SNPs mapped on the eight LGs of intra-specific genetic maps were annotated in diverse intragenic and intergenic sequence components of genomes (chromosomes/pseudomolecules) using the genome annotation information (GFF) of desi37 and kabuli36. The structural and functional annotation of SNPs, including synonymous and non-synonymous coding and regulatory SNPs were performed using the customized perl scripts and single-nucleotide polymorphism effect predictor (SnpEff v3.1h). The SNPs were plotted individually based on their unique physical positions (bp) on eight chromosomes (pseudomolecules) of desi and kabuli genomes using Circos v0.67-pre3. The functional annotation of SNP-containing genes was performed according to desi and kabuli genome annotation and PFAM database v27.0. The genes with SNPs were BLAST searched against the KOGnitor NCBI database. Gene Ontology (GO) enrichment analysis of genes with SNPs was performed using the BiNGO plugin of Cytoscape v2.6 based on Benjamini and Hochberg false discovery rate correction at 5% significance level.

QTL mapping

For identification and mapping of QTLs, the reference desi genome-based high-quality GBS SNP genotyping data and genetic linkage map information of SNPs mapped on eight desi LGs of intra-specific genetic map were correlated with multi-location/years replicated field phenotyping data (PN, SN and SW) of RIL mapping individuals and parental genotypes using single marker analysis, interval mapping and composite interval mapping functions of QTL Cartographer v2.5 and MapQTL v6.0. To identify and map the novel genomic regions harbouring the major QTLs associated with three agronomic traits (PN, SN and SW) on LGs, the LOD threshold score of ≥5.0 at 1000 permutations was considered significant (p < 0.05). The significant major trait-influencing QTLs, which were validated across multiple environments (locations)/years were considered as “robust QTLs” controlling agronomic traits (as per Ref. 77). The positional genetic effects, phenotypic variation explained (PVE %) by QTLs and their additive effect (evaluated by parental origin of favourable alleles) on traits were evaluated at significant LOD (p ≤ 0.05).

QTL-region targeted resequencing and association analysis

A selected strong SW major genomic region underlying robust QTL was sequenced in parental genotypes (ICC 12299 and ICC 8261) along with 10 of each homozygous low and high SW RIL mapping individuals using the multiplexed amplicons sequencing protocol (following manufacturer's instructions) of TruSeq Custom Amplicon v1.5 in Illumina MiSeq next-generation sequencer. The high-quality amplicon sequence reads (> 90% bases covered at 0.5× mean coverage) were mapped to reference desi and kabuli genomes and SNPs were detected among parental genotypes and mapping individuals adopting the methods of Agarwal et al.80, Jain et al.82 and Saxena et al.77. The sequenced genomic region underlying SW-specific QTL was structurally and functionally annotated (following aforementioned methods). The SNPs showing differentiation between low and high SW parental genotypes and mapping individuals at QTL region of interest were genotyped in 244 chickpea accessions (211 mini-core) belonging to an SW-specific association panel (Table S7) using the MALDI-TOF SNP genotyping assay (following Refs. 77, 111). The SNP genotyping data among accessions were used to constitute haplotypes at target QTL interval. The replicated multi-location/years SW field phenotyping data, population structure (K = 2) statistics and kinship matrix of 244 accessions were obtained from Kujur et al.99. The association analysis was performed using general linear model (GLM) and mixed linear model (MLM at optimum level of compression with P3D method) of TASSEL v5.0 and mixed model approach of EMMA adopting the detailed procedures of Kujur et al.7499, Saxena et al.77 and Thudi et al.101. To eliminate the confounding effect of population structure and correct the false discovery rate (FDR) based on multiple comparisons, the Bonferroni correction of P-value was performed for each trait-associated SNPs at 5% significance level. By integrating the results of GLM and MLM with EMMA and FDR correction, the SNPs showing strong association (R2 = correlation potential of significant SNPs with traits) with SW at significant cut-off P ≤ 10−4 were selected.

Differential gene expression profiling

The SNPs-carrying genes annotated at the short delineated major genomic region underlying the robust SW-specific QTL were selected to infer their differential regulation during seed development through expression profiling. The RNA isolated from three different vegetative and reproductive tissues (leaf, root and flower bud) and two seed developmental stages7499100 of high (ICC 8261) and low (ICC 12299) SW parental genotypes was amplified using the gene-specific primers through quantitative RT-PCR assay (following Refs. 74, 100).

Gene haplotype-based LD mapping

To perform high-resolution haplotype-based association/LD mapping, the fragments covering the entire coding and non-coding sequences, including 2 kb upstream regulatory region (URR) of one strong SW-associated gene (validated by QTL mapping, association mapping and differential expression profiling) were amplified using the genomic DNA of 244 SW-specific association panel. The cloning and sequencing of amplicons, SNP mining, haplotype constitution and LD pattern determination of a gene including, its association potential with SW were estimated using the methods of Kujur et al.7499, Saxena et al.77 and Bajaj et al.100. To determine the potential of diverse haplotypes constituted in the gene for regulating SW, the differential expression profiling in two seed developmental stages of parental genotypes and homozygous RIL mapping individuals representing the low and high SW haplotype groups was performed using the gene haplotype-specific primers.
  77 in total

1.  Molecular mapping of Fusarium oxysporum f. sp. ciceris race 3 resistance gene in chickpea.

Authors:  Kamal Dev Sharma; P Winter; G Kahl; Fred J Muehlbauer
Journal:  Theor Appl Genet       Date:  2003-12-19       Impact factor: 5.699

2.  The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla.

Authors:  Olivier Jaillon; Jean-Marc Aury; Benjamin Noel; Alberto Policriti; Christian Clepet; Alberto Casagrande; Nathalie Choisne; Sébastien Aubourg; Nicola Vitulo; Claire Jubin; Alessandro Vezzi; Fabrice Legeai; Philippe Hugueney; Corinne Dasilva; David Horner; Erica Mica; Delphine Jublot; Julie Poulain; Clémence Bruyère; Alain Billault; Béatrice Segurens; Michel Gouyvenoux; Edgardo Ugarte; Federica Cattonaro; Véronique Anthouard; Virginie Vico; Cristian Del Fabbro; Michaël Alaux; Gabriele Di Gaspero; Vincent Dumas; Nicoletta Felice; Sophie Paillard; Irena Juman; Marco Moroldo; Simone Scalabrin; Aurélie Canaguier; Isabelle Le Clainche; Giorgio Malacrida; Eléonore Durand; Graziano Pesole; Valérie Laucou; Philippe Chatelet; Didier Merdinoglu; Massimo Delledonne; Mario Pezzotti; Alain Lecharny; Claude Scarpelli; François Artiguenave; M Enrico Pè; Giorgio Valle; Michele Morgante; Michel Caboche; Anne-Françoise Adam-Blondon; Jean Weissenbach; Francis Quétier; Patrick Wincker
Journal:  Nature       Date:  2007-08-26       Impact factor: 49.962

3.  Whole-genome nucleotide diversity, recombination, and linkage disequilibrium in the model legume Medicago truncatula.

Authors:  Antoine Branca; Timothy D Paape; Peng Zhou; Roman Briskine; Andrew D Farmer; Joann Mudge; Arvind K Bharti; Jimmy E Woodward; Gregory D May; Laurent Gentzbittel; Cécile Ben; Roxanne Denny; Michael J Sadowsky; Joëlle Ronfort; Thomas Bataillon; Nevin D Young; Peter Tiffin
Journal:  Proc Natl Acad Sci U S A       Date:  2011-09-26       Impact factor: 11.205

4.  A linkage map of chickpea (Cicer arietinum L.) based on populations from Kabuli x Desi crosses: location of genes for resistance to fusarium wilt race 0.

Authors:  M J Cobos; M J Fernández; J Rubio; M Kharrat; M T Moreno; J Gil; T Millán
Journal:  Theor Appl Genet       Date:  2005-04-02       Impact factor: 5.699

5.  Genetic mapping of ascochyta blight resistance in chickpea (Cicer arietinum L.) using a simple sequence repeat linkage map.

Authors:  B Tar'an; T D Warkentin; A Tullu; A Vandenberg
Journal:  Genome       Date:  2007-01       Impact factor: 2.166

6.  Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa.

Authors:  Pavana J Hiremath; Andrew Farmer; Steven B Cannon; Jimmy Woodward; Himabindu Kudapa; Reetu Tuteja; Ashish Kumar; Amindala Bhanuprakash; Benjamin Mulaosmanovic; Neha Gujaria; Laxmanan Krishnamurthy; Pooran M Gaur; Polavarapu B Kavikishor; Trushar Shah; Ramamurthy Srinivasan; Marc Lohse; Yongli Xiao; Christopher D Town; Douglas R Cook; Gregory D May; Rajeev K Varshney
Journal:  Plant Biotechnol J       Date:  2011-05-25       Impact factor: 9.803

7.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

8.  A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species.

Authors:  Robert J Elshire; Jeffrey C Glaubitz; Qi Sun; Jesse A Poland; Ken Kawamoto; Edward S Buckler; Sharon E Mitchell
Journal:  PLoS One       Date:  2011-05-04       Impact factor: 3.240

Review 9.  Genomics-assisted breeding in four major pulse crops of developing countries: present status and prospects.

Authors:  Abhishek Bohra; Manish K Pandey; Uday C Jha; Balwant Singh; Indra P Singh; Dibendu Datta; Sushil K Chaturvedi; N Nadarajan; Rajeev K Varshney
Journal:  Theor Appl Genet       Date:  2014-04-08       Impact factor: 5.699

10.  A reference genome for common bean and genome-wide analysis of dual domestications.

Authors:  Jeremy Schmutz; Phillip E McClean; Sujan Mamidi; G Albert Wu; Steven B Cannon; Jane Grimwood; Jerry Jenkins; Shengqiang Shu; Qijian Song; Carolina Chavarro; Mirayda Torres-Torres; Valerie Geffroy; Samira Mafi Moghaddam; Dongying Gao; Brian Abernathy; Kerrie Barry; Matthew Blair; Mark A Brick; Mansi Chovatia; Paul Gepts; David M Goodstein; Michael Gonzales; Uffe Hellsten; David L Hyten; Gaofeng Jia; James D Kelly; Dave Kudrna; Rian Lee; Manon M S Richard; Phillip N Miklas; Juan M Osorno; Josiane Rodrigues; Vincent Thareau; Carlos A Urrea; Mei Wang; Yeisoo Yu; Ming Zhang; Rod A Wing; Perry B Cregan; Daniel S Rokhsar; Scott A Jackson
Journal:  Nat Genet       Date:  2014-06-08       Impact factor: 38.330

View more
  38 in total

1.  Genome-wide cis-regulatory signatures for modulation of agronomic traits as exemplified by drought yield index (DYI) in chickpea.

Authors:  Akash Sharma; Udita Basu; Naveen Malik; Anurag Daware; Virevol Thakro; Laxmi Narnoliya; Deepak Bajaj; Shailesh Tripathi; V S Hegde; Hari D Upadhyaya; Akhilesh K Tyagi; Swarup K Parida
Journal:  Funct Integr Genomics       Date:  2019-06-08       Impact factor: 3.410

2.  Genetic dissection of plant growth habit in chickpea.

Authors:  Hari D Upadhyaya; Deepak Bajaj; Rishi Srivastava; Anurag Daware; Udita Basu; Shailesh Tripathi; Chellapilla Bharadwaj; Akhilesh K Tyagi; Swarup K Parida
Journal:  Funct Integr Genomics       Date:  2017-06-09       Impact factor: 3.410

3.  ABC Transporter-Mediated Transport of Glutathione Conjugates Enhances Seed Yield and Quality in Chickpea.

Authors:  Udita Basu; Hari D Upadhyaya; Rishi Srivastava; Anurag Daware; Naveen Malik; Akash Sharma; Deepak Bajaj; Laxmi Narnoliya; Virevol Thakro; Alice Kujur; Shailesh Tripathi; Chellapilla Bharadwaj; V S Hegde; Ajay K Pandey; Ashok K Singh; Akhilesh K Tyagi; Swarup K Parida
Journal:  Plant Physiol       Date:  2019-02-08       Impact factor: 8.340

4.  High density SNP and DArT-based genetic linkage maps of two closely related oil palm populations.

Authors:  Siou Ting Gan; Wei Chee Wong; Choo Kien Wong; Aik Chin Soh; Andrzej Kilian; Eng-Ti Leslie Low; Festo Massawe; Sean Mayes
Journal:  J Appl Genet       Date:  2017-12-06       Impact factor: 3.240

5.  A high-density SNP-based linkage map using genotyping-by-sequencing and its utilization for improved genome assembly of chickpea (Cicer arietinum L.).

Authors:  Rashmi Gaur; Subodh Verma; Seema Pradhan; Heena Ambreen; Sabhyata Bhatia
Journal:  Funct Integr Genomics       Date:  2020-08-27       Impact factor: 3.410

6.  Fine Mapping of the High-pH Tolerance and Growth Trait-Related Quantitative Trait Loci (QTLs) and Identification of the Candidate Genes in Pacific White Shrimp (Litopenaeus vannamei).

Authors:  Wen Huang; Chuhang Cheng; Jinshang Liu; Xin Zhang; Chunhua Ren; Xiao Jiang; Ting Chen; Kaimin Cheng; Huo Li; Chaoqun Hu
Journal:  Mar Biotechnol (NY)       Date:  2019-11-22       Impact factor: 3.619

Review 7.  Current advances in chickpea genomics: applications and future perspectives.

Authors:  Uday Chand Jha
Journal:  Plant Cell Rep       Date:  2018-06-02       Impact factor: 4.570

8.  Genome-wide association study for phosphate deficiency responsive root hair elongation in chickpea.

Authors:  Pawandeep Singh Kohli; Pankaj Kumar Verma; Rita Verma; Swarup K Parida; Jitendra K Thakur; Jitender Giri
Journal:  Funct Integr Genomics       Date:  2020-09-05       Impact factor: 3.410

9.  CLAVATA signaling pathway genes modulating flowering time and flower number in chickpea.

Authors:  Udita Basu; Laxmi Narnoliya; Rishi Srivastava; Akash Sharma; Deepak Bajaj; Anurag Daware; Virevol Thakro; Naveen Malik; Hari D Upadhyaya; Shailesh Tripathi; V S Hegde; Akhilesh K Tyagi; Swarup K Parida
Journal:  Theor Appl Genet       Date:  2019-03-30       Impact factor: 5.699

10.  Co-localization of genomic regions associated with seed morphology and composition in a desi chickpea (Cicer arietinum L.) population varying in seed protein concentration.

Authors:  Runfeng Wang; Manu P Gangola; Craig Irvine; Pooran M Gaur; Monica Båga; Ravindra N Chibbar
Journal:  Theor Appl Genet       Date:  2019-01-19       Impact factor: 5.699

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.