Literature DB >> 26397045

Next-generation sequencing for identification of candidate genes for Fusarium wilt and sterility mosaic disease in pigeonpea (Cajanus cajan).

Vikas K Singh¹, Aamir W Khan¹, Rachit K Saxena¹, Vinay Kumar¹, Sandip M Kale¹, Pallavi Sinha¹, Annapurna Chitikineni¹, Lekha T Pazhamala¹, Vanika Garg¹, Mamta Sharma¹, Chanda Venkata Sameer Kumar¹, Swathi Parupalli¹, Suryanarayana Vechalapu¹, Suyash Patil¹, Sonnappa Muniswamy², Anuradha Ghanta³, Kalinati Narasimhan Yamini³, Pallavi Subbanna Dharmaraj², Rajeev K Varshney^1,4.

Abstract

To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea.

Entities: CellLine Chemical Disease Species

Keywords: Fusarium wilt; SNP index; bulked segregant analysis; nonsynonymous SNPs; sterility mosaic disease; whole-genome re-sequencing

Mesh：

Year: 2015 PMID： 26397045 PMCID： PMC5054876 DOI： 10.1111/pbi.12470

Source DB: PubMed Journal: Plant Biotechnol J ISSN： 1467-7644 Impact factor: 9.803

Introduction

Pigeonpea (Cajanus cajan L. Millsp.) is an economically important grain legume crop in the developing countries of the tropical and subtropical regions of the world (Varshney et al., 2012a). The crop productivity of pigeonpea is severely affected by biotic stresses such as Fusarium wilt (FW) and sterility mosaic disease (SMD). Fusarium wilt is caused by Fusarium udum, and SMD is caused by a pigeonpea sterility mosaic virus (PSMV) and transmitted by the eriophyid mite, Aceria cajani. These two biotic stresses of pigeonpea could result in complete yield loss (Reddy et al., 2012). The annual losses due to FW and SMD have been reported to be US $ 113 million (Saxena et al., 2010a). To develop FW‐ and SMD‐resistant pigeonpea lines through molecular breeding, identification of genomic regions/QTLs or candidate genes responsible for resistance to diseases is an important step. Once a marker (or candidate gene) associated with resistance is identified and validated; marker‐assisted selection (MAS) can be used for introgression of resistance in susceptible genotypes (Varshney et al., 2012b). However, traditional QTL mapping approach that involves identification of parental polymorphisms and genotyping the entire population with polymorphic markers is time‐consuming and labour intensive (Abe et al., 2012). To map simply inherited traits like disease resistance, bulked segregant analysis (BSA) approach was proposed by Michelmore et al. (1991). Bulked segregant analysis approach involves screening of the extreme bulks along with the parents with a large number of markers, and subsequently, polymorphic markers showing the similar pattern in the bulks with respect to their corresponding parental genotypes are used to screen the entire population. This approach has been extensively used for trait mapping in a number of crops (see Semagn et al., 2010). Advent of the next‐generation sequencing (NGS) technologies and due to declining cost in per sample sequencing has drastically accelerated the pace with which candidate genes/genomic regions were identified (Schneeberger et al., 2009). Consequently, many recent approaches were proposed using BSA combined with whole‐genome re‐sequencing (WGRS) for rapid identification of candidate genes of interest termed as ‘fast forward genetics’ (Mokry et al., 2011). Next‐generation sequencing‐based BSA approaches were successfully applied to model crop, Arabidopsis (~135 Mb of genome size) for identification of candidate genes for growth habit and colour of leaves (Schneeberger et al., 2009; approach SHOREmap), cell wall composition (Austin et al., 2011; approach next‐generation mapping), suppressor mutant (Uchida et al., 2011; approach SNPing; Hartwig et al., 2012; approach isogenic mapping‐by‐sequencing) and gametophyte lethal mutation (Lindner et al., 2012; approach SNP‐ratio mapping). Such methods have also been successfully applied in crop plants like rice (~389 Mbp of genome size) for identification of candidate genes for pale green leaves, semi dwarfism (Abe et al., 2012; approach MutMap), blast resistance (Takagi et al., 2013a; approach MutMap‐Gap) and lethal phenotype associated with plant development (Fekih et al., 2013; approach MutMap+). Recently, QTL‐seq method was proposed in rice, which is a powerful approach for handling quantitative traits (Takagi et al., 2013b). This approach was successfully utilized for mapping genomic region for blast resistance and seedling vigour in rice (Takagi et al., 2013b), flowering associated QTL in cucumber (Lu et al., 2014) and seed size and root trait ratio in chickpea (unpublished). Similarly, WGRS‐based BSA approach based on G′ statistics was utilized for identification of candidate genomic region for cold tolerance in rice (Yang et al., 2013). In addition, WGRS of contrasting parents has been used for identification of nonsynonymous SNPs (nsSNPs) to map the candidate genes for sheath blight resistance in rice and drought tolerance in maize (Silva et al., 2012; Xu et al., 2014). Fusarium wilt resistance in pigeonpea has been found to be controlled by different gene actions in various genetic backgrounds, ranging from single to multiple genes with complementary to duplicate gene actions (Saxena, 2008). On the other hand, SMD resistance has been found to be governed by a single gene or two recessive genes (Gnanesh et al., 2011). Therefore, for identification of SNPs for multiple genes, associated with FW and SMD resistance in pigeonpea, we have analysed extreme bulks of resistant (R‐bulk) and susceptible (S‐bulk) RILs using Seq‐BSA approach. Furthermore, nonsynonymous SNPs (nsSNPs) approach has been used to complement Seq‐BSA approach. Re‐sequencing of additional genotypes, In silico protein analysis and transcription profiling have validated and shortlisted candidate genomic regions/SNPs conferring resistance to FW and SMD in pigeonpea.

Results

This study uses a combined approach of Seq‐BSA and WGRS‐based nsSNPs for identification of candidate resistance genes for FW and SMD. The detailed approach has been illustrated in Figure 1.

Figure 1

Schematic representation of the combined approach of sequencing‐based bulked segregant analysis (Seq‐BSA) and nsSNPs substitution for identification of candidate genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea. (a) Two contrasting parents, ICPL 20096 (R) and ICPL 332 (S) were crossed to develop F 7 RILs segregating for FW and SMD resistance through single‐seed descent method. (b, c) Phenotypic score of RILs for FW and SMD resistance resulted in the selection of highly resistant and highly susceptible RILs to form the resistant (R) and susceptible (S) bulks. (d) These two bulks along with resistant parent were subjected to whole‐genome re‐sequencing (WGRS) for identification of SNPs and SNP index through QTL‐seq pipeline. (e) Candidate genome regions were identified based on the SNP index (0 and 1). (f and g) WGRS was performed on the four contrasting parents (ICPL 20097, ICP 8863, ICPL 99050 and ICPB 2049) to identify nsSNPs. (h) WGRS data of the contrasting parents and the bulks defined the nsSNPs associated to the genomic regions to FW and SMD resistance. (i–l) Based on the WGRS data and nsSNPs analysis, candidate genes were subsequently selected for functional validation.

FW and SMD screening of genotypes

Six genotypes namely ICPL 20096, ICPL 332, ICPL 20097, ICP 8863, ICPB 2049 and ICPL 99050 along with one mapping population (ICPL 20096 × ICPL 332) comprising of 188 lines were screened for resistance to FW and SMD in this study. Our screening results showed resistance and/or susceptibility of these genotypes to either or both FW and SMD in multilocation phenotyping. Three genotypes (ICPL 20096, ICPL 20097 and ICPL 99050) were found to be resistant to both FW and SMD, and their disease reaction ranged from 0% to 4.4% for FW and 0.0% for SMD. However, one genotype namely ICPL 332 was found highly susceptible for both FW and SMD with disease reaction of 100% for each disease. The genotypes, ICP 8863 and ICPB 2049, were found to be resistant only to FW (0.6%) and SMD (0%), respectively. The detailed observations are presented in Table 1. Similarly, screening of the mapping population showed disease incidence score from 0% to 100% for FW and 0% to 100% for SMD.

Table 1

List of pigeonpea genotypes with phenotypic score and summary of Illumina sequencing of parental lines and bulks

Sample	Pedigree	Phenotyping for FW		Phenotyping for SMD		Illumina Sequencing
Sample	Pedigree	Patancheru (average of 2012–2013 and 2013–2014)	Gulbarga (average of 2012–2013 and 2013–2014)	Patancheru (average of 2012–2013 and 2013–2014)	Tandur (average of 2012–2013 and 2013–2014)	Data generated (Gb)	% Alignment	% coverage	Mean depth
Genotypes
ICPL 20096a ^, b ^, d ^, e	ICPL 87119 × ICP 12746	4.20 (R)	–	0.0 (R)	–	9.27	90.61	89.21	13.4
ICP 8863c ^, e	Selection from landrace	0.6 (R)	–	95.4 (HS)	–	6.72	95.41	88.96	9.97
ICPB 2049c ^, e	CMS 2039 × ICP 6697	99.9 (HS)	–	0.0 (R)	–	9.98	92.94	89.10	14.88
ICPL 20097c ^, e	ICPL 87119 × ICP 12746	0.0 (R)	–	0.0 (R)	–	7.81	91.84	88.66	11.51
ICPL 99050c ^, e	C11 × Banda Palera	4.4 (R)	–	0.0 (R)	–	10.36	91.64	89.48	15.12
ICPL 332b ^, d ^, e	Selection from ICP 1903	100.0 (HS)	–	100.0 (HS)	–	9.60	93.02	89.58	14.25
ICPL 87119e ^, f	C11 × ICP1‐6‐W3‐W	0.0 (R)	–	0.0 (R)	–	–	–	–	–
HPL 24e	Baigani × A. scarabaeoides	0.0 (R)	–	–	–	7.39	95.16	89.19	11.32
ICPL 85063e	Selection from ICPL 1903	100 (HS)	–	0.0 (R)	–	9.94	93.31	89.31	14.87
ICPL 87e	T 21 × ICP 6393	47 (HS)	–	–	–	6.01	94.96	88.55	9.20
ICPL 88039e	Selection from ICPL 161	94.0 (HS)	–	–	–	6.35	93.28	86.77	9.59
Mapping population
188 RILs	ICPL 20096 × ICPL 332	0%–100%	0%–100%	0%–100%	0%–100%	–	–	–	–
R‐bulkb ^, c ^, e ^, g	ICPL 20096 × ICPL 332	0.0%–5.0%		0.0%–2.6%	–	8.99	99.77	88.33	12.30
S‐bulkb ^, c ^, e ^, h	ICPL 20096 × ICPL 332	80.0%–100%		72.62%–100%	–	8.43	80.56	89.15	14.76

ICPL 20096 short reads were aligned to the publicly available pigeonpea genome of Asha (ICPL 87119) (http://www.icrisat.org/gt-bt/iipg/genomedata.zip).

Parents and bulks used for Seq‐BSA analysis.

Parents and bulks used for nonsynonymous SNP (nsSNPs) analysis.

Parents used for expression profiling studies.

Parents and bulks used for identified nsSNPs validation.

Reference genome assembly of publicly available pigeonpea genotype ICPL 87119 genome (Asha) was used in this study (Varshney et al., 2012a).

R‐bulk was constituted using 16 resistant RILs identified at both locations (Patancheru and Gulbarga).

S‐bulk was constituted using 16 susceptible RILs identified at both locations (Patancheru and Gulbarga).

Note: genotypes were categorized in different groups based on per cent disease incidence (PDI): resistant (0%–9.99% of PDI), moderately resistance (10%–19.99% of PDI), susceptible (20%–40% of PDI) and highly susceptible (>40% of PDI).

List of pigeonpea genotypes with phenotypic score and summary of Illumina sequencing of parental lines and bulks ICPL 20096 short reads were aligned to the publicly available pigeonpea genome of Asha (ICPL 87119) (http://www.icrisat.org/gt-bt/iipg/genomedata.zip). Parents and bulks used for Seq‐BSA analysis. Parents and bulks used for nonsynonymous SNP (nsSNPs) analysis. Parents used for expression profiling studies. Parents and bulks used for identified nsSNPs validation. Reference genome assembly of publicly available pigeonpea genotype ICPL 87119 genome (Asha) was used in this study (Varshney et al., 2012a). R‐bulk was constituted using 16 resistant RILs identified at both locations (Patancheru and Gulbarga). S‐bulk was constituted using 16 susceptible RILs identified at both locations (Patancheru and Gulbarga). Note: genotypes were categorized in different groups based on per cent disease incidence (PDI): resistant (0%–9.99% of PDI), moderately resistance (10%–19.99% of PDI), susceptible (20%–40% of PDI) and highly susceptible (>40% of PDI).

Sequencing‐based BSA (Seq‐BSA) approach

Construction and sequencing of R‐ and S‐bulks

Based on the phenotyping data on 188 RILs, 16 resistant and 16 susceptible RILs to FW as well as SMD were selected for the constitution of R‐ and S‐bulks (Figure 2). The phenotypic disease score of 16 RILs of the R‐bulk ranged from 0% to 5.0% (for FW) and 0% to 2.6% (for SMD). Similarly, the phenotypic disease score of selected RILs of the S‐bulk varied from 80.0% to 100% (for FW) and 72.62% to 100% (for SMD). Each DNA bulk (R‐ and S‐bulk) along with the resistant parent (ICPL 20096) was subjected for whole‐genome re‐sequencing (WGRS) using Illumina (MiSeq) sequencing platform. As a result, a total of 8.99 Gb (14.85 X coverage) sequence data for R‐bulk, 8.43 Gb (13.91 X coverage) for S‐bulk and 9.27 Gb data (15.30 X coverage) for ICPL 20096 were generated (Table 1).

Figure 2

Phenotyping of RILs and parents for Fusarium wilt (FW) and sterility mosaic disease (SMD) resistance. (a) FW is seed and soil borne fungal disease caused by Fusarium udum. Fusarium wilt causes the complete death of plant and reduction in yield due to loss of leaf turgidity, interveinal clearing and mild chlorosis to the bright yellow colour of leaves. (b) SMD is a viral disease caused by Pigeonpea sterility mosaic virus (PSMV). This disease can be identified as patches of bushy, pale green plants. Due to excess vegetative growth, without growing into reproductive phase, this condition is known as green plague of pigeonpea. (c) Based on the phenotyping of RILs for FW and SMD resistance at two different locations, a total of 16 resistant RILs were used to develop resistant bulk (R‐bulk). (d) A total of 16 susceptible RILs with percent disease score comparable to susceptible parent were used to produce susceptible bulk (S‐bulk).

Genomewide SNP index analysis

A total of 37.53 million filtered reads of the resistant parent (ICPL 20096) were used for developing reference‐guided assembly. This resulted in genome coverage of 94.46%. The cleaned sequence data for the R‐ (36.38 million reads) and S‐bulks (33.91 million reads) were then used for mapping onto the developed reference‐guided assembly of ICPL 20096. Subsequently, SNP index was calculated from R‐ and S‐bulks using QTL‐seq pipeline (http://genome-e.ibrc.or.jp/home/bioinformatics-team/mutmap). Based on the stringent selection criteria of read depth ≥7 in both bulks and SNP index ≥0.3 in either of the bulks, 35 877 SNPs were identified on all 11 linkage groups (Figures S1 and S2). However, only 4139 (11.54%) SNPs were found to have homozygous allele calls in both bulks (Table S1). Finally, seven candidate SNPs with delta SNP index = −1 were selected that were present on five different chromosomes (CcLG02, CcLG07, CcLG08, CcLG10 and CcLG11) (Table 2 and Figure 3). To identify the causative SNPs for FW and SMD resistance, chi‐square test was conducted for the seven SNPs along with neighbouring SNPs (one on each flanking side) in sequence data of both R‐ and S‐bulks. The analysis revealed that the probability of these SNPs to follow 1 : 1 binomial distribution with a P‐value <0.01 did not fit the expected pattern of co‐segregation, indicating their causativeness (Table S2).

Table 2

Identification of SNPs between resistant and susceptible bulks using Seq‐BSA approach

Linkage group	Position	Resistant parent base	R‐bulk base	Read depth of R‐bulk (X coverage)	Phred quality score of R‐bulk	S‐bulk base	Read depth of S‐bulk (X coverage)	Phred quality score of S‐bulk	SNP index of S‐bulk	∆ SNP index
CcLG02	26 551 810	T	T	7	48	A	7	48	1	−1
CcLG07	16 064 896	G	G	8	51	C	8	51	1	−1
CcLG07	18 411 642	G	G	11	60	A	9	54	1	−1
CcLG08	354 473	G	G	10	57	C	7	48	1	−1
CcLG10	7 815 091	G	G	9	54	A	7	48	1	−1
CcLG11	19 958 148	A	A	9	54	C	10	57	1	−1
CcLG11	34 310 320	C	C	7	48	A	7	48	1	−1

SNP index = 0 means bulked DNA representing resistant parent genome.

SNP index = 1 means bulked DNA representing susceptible parent.

∆ SNP index = −1 bulked DNA representing resistant parent genome.

Phred quality score ≥40: Probability of incorrect base call 1 in 10 000 (99.99%); Phred quality score ≥50: Probability of incorrect base call 1 in 100 000 (99.999%); Phred quality score ≥60: Probability of incorrect base call 1 in 1 000 000 (99.9999%).

Figure 3

Global distribution of ∆ SNP index and nonsynonymous SNPs. (a) Psuedomolecules of reference genome Asha adopted from Varshney et al. (2012a). (b) Genomewide distribution of nonsynonymous SNPs (nsSNPs) identified between resistant and susceptible genotypes and bulks. (c) Positions of identified candidate genes with nsSNPs in the vicinity of identified genomic regions through delta SNP index. (d) Upper probability values at 99% confidence (P < 0.01). (e) Upper probability values at 95% confidence (P < 0.05). (f) Region in green colour representing SNP index ranging from 0 to 1. (g) Genomewide delta SNP index, including those genomic regions with 0 and 1 SNP index, that is same in resistant parent (RP) and resistant bulk (R‐bulk) but entirely different in susceptible bulk (S‐bulk). These particular positions are marked with red dots along with their upper and lower confidence interval values at 99% and 95% probability values. (h) Lower probability values at 99% confidence (P < 0.01). (i) Lower probability values at 95% confidence (P < 0.05). (j) Region in red colour representing SNP index ranging from 0 to −1. (k) 2‐Mb selected genomic regions flanked both sides to each identified genomic positions with 0 to 1 SNP index for identification of candidate nsSNPs in the target regions.

Identification of SNPs between resistant and susceptible bulks using Seq‐BSA approach SNP index = 0 means bulked DNA representing resistant parent genome. SNP index = 1 means bulked DNA representing susceptible parent. ∆ SNP index = −1 bulked DNA representing resistant parent genome. Phred quality score ≥40: Probability of incorrect base call 1 in 10 000 (99.99%); Phred quality score ≥50: Probability of incorrect base call 1 in 100 000 (99.999%); Phred quality score ≥60: Probability of incorrect base call 1 in 1 000 000 (99.9999%). Global distribution of ∆ SNP index and nonsynonymous SNPs. (a) Psuedomolecules of reference genome Asha adopted from Varshney et al. (2012a). (b) Genomewide distribution of nonsynonymous SNPs (nsSNPs) identified between resistant and susceptible genotypes and bulks. (c) Positions of identified candidate genes with nsSNPs in the vicinity of identified genomic regions through delta SNP index. (d) Upper probability values at 99% confidence (P < 0.01). (e) Upper probability values at 95% confidence (P < 0.05). (f) Region in green colour representing SNP index ranging from 0 to 1. (g) Genomewide delta SNP index, including those genomic regions with 0 and 1 SNP index, that is same in resistant parent (RP) and resistant bulk (R‐bulk) but entirely different in susceptible bulk (S‐bulk). These particular positions are marked with red dots along with their upper and lower confidence interval values at 99% and 95% probability values. (h) Lower probability values at 99% confidence (P < 0.01). (i) Lower probability values at 95% confidence (P < 0.05). (j) Region in red colour representing SNP index ranging from 0 to −1. (k) 2‐Mb selected genomic regions flanked both sides to each identified genomic positions with 0 to 1 SNP index for identification of candidate nsSNPs in the target regions.

Nonsynonymous SNPs (nsSNPs) substitution approach

WGRS of additional set of resistant and susceptible genotypes

Four other genotypes viz. ICPL 20097 (resistant to FW and SMD), ICP 8863 (resistant to FW and susceptible to SMD), ICPB 2049 (susceptible to FW and resistant to SMD) and ICPL 99050 (resistant to FW and SMD) were re‐sequenced at >10 X coverage (Table 1). In brief, 6.72–10.36 Gb data with 11.09–17.09 X coverage were generated. Alignment of cleaned data from these genotypes and R‐ and S‐bulks from ICPL 20096 × ICPL 332 population mentioned in Seq‐BSA to the draft genome sequence indicated a higher level mapping in the range of 91.64% to 95.41%. In terms of depth coverage, alignment of sequence reads for these samples with respect to the draft genome was found in the range of 9.97 to 15.12 mean depth.

Genomewide nsSNPs

Detailed analysis of WGRS data sets for above four samples and two bulks identified 226 393 SNPs, which ranged from 3915 (CcLG05) to 43 367 (CcLG11) with an average 20 581 SNPs per linkage group (Table S3). The majority of these SNPs (42.18%) were found in the intergenic region while only ~3% SNPs were present in the exonic region (Table S4). Of 226 393 SNPs, only 8362 SNPs were found to be nsSNPs.

Combined approach of Seq‐BSA and nsSNP analysis

To obtain converging evidences from the Seq‐BSA and nsSNP analysis for causative SNPs, 2‐Mb regions of the seven candidate SNPs identified through Seq‐BSA were investigated for the presence of nsSNPs identified in the second approach. A total of 60 nsSNPs were detected in seven genomic regions identified using Seq‐BSA (Figure 3). Of these 60 nsSNPs, 16 SNPs were found on each CcLG02 and CcLG11 while seven, eight and 13 SNPs were present on CcLG08, CcLG10 and CcLG07, respectively. Subsequently, haplotype analysis was carried out for 60 nsSNPs in all resistant (ICPL 20097 and ICP 99050) and susceptible (ICP 8863 and ICP 2049B) genotypes along with the resistant and susceptible bulks. Of 60 nsSNPs, eight nsSNPs (four on CcLG02, one on CcLG08 and three on CcLG11) showed specific haplotype in all resistant genotypes and R‐bulk, while the other (alternate) allele in all susceptible genotypes and S‐bulk. These candidate nsSNPs (haplotypes) were found in seven candidate genes present on three linkage groups (CcLG02, CcLG08 and CcLG11).

Association of nsSNPs/genes with resistance to FW/SMD

With an objective to identify association of nsSNPs/genes with resistance to a particular disease, eight nsSNPs were compared in two different combinations of genotypes one each for FW and SMD. For FW, the re‐sequencing data of three resistant genotypes (ICPL 99050, ICPL 20097 and ICP 8863) and one susceptible genotype (ICPB 2049) were compared. Similarly, in the case of SMD, the re‐sequencing data of three resistant genotypes (ICPL 99050, ICPL 20097 and ICPB 2049) and one susceptible genotype (ICP 8863) were compared (Table 3). Based on WGRS data sets, the SNP identified in candidate gene C.cajan_07067 (at position 27 324 239 and 27 324 261 bp) had T (R‐bulk) to G (S‐bulk) substitution at both SNP position and based upon the allele calls in other genotypes this gene was found to be associated with SMD resistance. Similarly, after comparative analyses in other genotypes T (R‐bulk) to G (S‐bulk) substitution for the candidate gene C.cajan_07078 (PHD finger protein) and G (R‐bulk) to A (S‐bulk) substitution for the candidate gene C.cajan_07124 (rRNA‐processing protein) were found specific for FW resistance.

Table 3

Association of nsSNPs to the candidate genes responsive to FW and SMD diseases

Linkage group	Genes	nsSNPs position (bp)	Seq‐BSA approach			nsSNPs substitution approach
			ICPL 20096 (Ra to FW & SMD)	R‐bulkb (Ra to FW & SMD)	S‐bulkc (Sa to FW & SMD)	FW				SMD
			ICPL 20096 (Ra to FW & SMD)	R‐bulkb (Ra to FW & SMD)	S‐bulkc (Sa to FW & SMD)	ICPL 99050 (Ra)	ICPL 20097 (Ra)	ICP 8863 (Ra)	ICPB 2049 (HSd)	ICPL 99050 (Ra)	ICPL 20097 (Ra)	ICPB 2049 (Ra)	ICP 8863 (HSd)
FW associated nsSNPs
CcLG02	C.cajan_07078	27 426 866	T	T	G	T	T	T	G	T	T	G	T
CcLG02	C.cajan_07124	27 861 114	G	G	A	G	G	G	A	G	G	A	G
CcLG11	C.cajan_02962	32 606 065	T	T	C	T	T	T	C	T	T	C	T
CcLG11	C.cajan_03203	35 228 097	C	C	A	C	C	C	A	C	C	A	C
SMD associated nsSNPs
CcLG02	C.cajan_07067	27 324 239	T	T	G	T	T	G	T	T	T	T	G
CcLG02	C.cajan_07067	27 324 261	T	T	G	T	T	G	T	T	T	T	G
CcLG08	C.cajan_15535	2 014 125	C	C	G	C	C	G	C	C	C	C	G
CcLG11	C.cajan_01839	19 958 148	A	A	C	A	A	C	A	A	A	A	C

R: resistant genotype.

R‐bulk: resistant bulk for FW and SMD.

S‐bulk: susceptible bulk for FW and SMD.

HS: highly susceptible genotype.

Association of nsSNPs to the candidate genes responsive to FW and SMD diseases R: resistant genotype. R‐bulk: resistant bulk for FW and SMD. S‐bulk: susceptible bulk for FW and SMD. HS: highly susceptible genotype. Note: genotypes were categorized in different groups based on per cent disease incidence (PDI): resistant (0%–9.99% of PDI), moderately resistance (10%–19.99% of PDI), susceptible (20%–40% of PDI) and highly susceptible (>40% of PDI). Comparative analysis of candidate genes C.cajan_15535 (copia‐like retrotransposable) and C.cajan_01839 (serine–threonine protein phosphatase) showed C (R‐bulk) to G (S‐bulk) and A (R‐bulk) to C (S‐bulk) substitution, respectively, and based on the allele calls in other genotypes, these two genes were found specific for SMD resistance. Analysis of the remaining candidate genes C.cajan_02962 (NADH dehydrogenase) and C.cajan_03203 (retrovirus‐like polyprotein) had T (R‐bulk) to C (S‐bulk) and C (R‐bulk) to A (S‐bulk) substitutions, respectively, and based on the allele calls in resistant (ICPL 20096, ICPL 99050, ICPL 20097 and ICP 8863) and susceptible (ICPB 2049) genotypes, both of the genes were found to be specific to FW resistance.

Validation of identified SNPs

In addition to using the re‐sequencing data from above‐mentioned genotypes, a total of 39.29 Gb data with 9.20–14.88 X coverage were generated for six additional genotypes (Table 1). Re‐sequencing data from above‐mentioned six genotypes and raw sequence reads from draft genome assembly (ICPL 87119) were used to test the association of nsSNPs with the targeted traits. For the validation of FW associated SNPs, two resistant (ICPL 87119, HPL 24) and four susceptible (ICPL 85063, ICPL 332, ICPL 87, ICPL 88039) genotypes were added. In the case of SMD, re‐sequencing data from two resistant (ICPL 87119, ICPL 85063) and one susceptible (ICPL 332) genotypes were added for the validation of SMD‐associated SNPs. In brief, sequence data of 11 genotypes (six resistant: ICPL 20096, ICPL 99050, ICPL 20097, ICPL 8863, ICPL 87119 and HPL 24; five susceptible: ICPL 85063, ICPL 332, ICPB 2049, ICPL 87 and ICPL 88039) were used to test and validate the association of four nsSNPs for FW resistance. T to G substitution for the gene C.cajan_07078 and G to A substitution for the gene C.cajan_07124 were observed in all the resistant and susceptible genotypes, respectively (Table S5). Similarly, T (resistant genotypes) to C (susceptible genotypes) and C (resistant genotypes) to A (susceptible genotypes) substitution were observed for the other two candidate genes, C.cajan_02962 and C.cajan_03203, respectively (Table S5). This analysis unequivocally provided four SNPs in four different candidate genes associated with FW resistance. Likewise, association of four SNPs targeting three candidate genes (C.cajan_07067, C.cajan_15535 and C.cajan_01839) with SMD was also validated using re‐sequencing data of eight genotypes (six resistant: ICPL 20096, ICPL 99050, ICPL 20097, ICPB 2049, ICPL 87119 and ICPL 85063; two susceptible: ICPL 8863 and ICPL 332). SNPs at positions 27 324 239 bp and 27 324 261 bp on CcLG02 targeting the same candidate gene, C.cajan_07067, showed T to G substitution for both the SNPs in all the resistant and susceptible genotypes (Table S6). Analysis of SNP for the genes C.cajan_15535 and C.cajan_01839 revealed C to G and A to C substitution, respectively, for resistant and susceptible genotypes. These results clearly suggested association of four SNPs in the three candidate genes with SMD resistance.

Functional annotation of candidate genes

The candidate gene C.cajan_07067 on CcLG02 has two nonsynonymous substitutions at 27 324 239 bp and 27 324 261 bp positions. The nsSNP at 27 324 239 bp had T (in resistant genotypes) to G (in susceptible genotypes) substitution, which leads to change in amino acid from isoleucine (ATT) to methionine (ATG). Another nsSNP at 27 324 261 bp position had a similar T (in resistant genotypes) to G (in susceptible genotypes) substitution but codes for different amino acid serine (TCC) to alanine (GCC). Functional annotation of this candidate gene reveals its role in serine–threonine protein kinase. Similarly, another candidate gene C.cajan_07078 on CcLG02 had T (in resistant genotypes) to G substitution (in susceptible genotypes), which leads to change the codon from ATG (methionine) to CTG (leucine). The functional annotation of this candidate gene showed similarity to PHD finger protein. C.cajan_07124 is a third candidate gene on CcLG02 and had G (in resistant genotype) to A (in susceptible genotype) substitution. This substitution leads to change in amino acid from glycine (GGC) to serine (AGC). This gene showed functional similarity to rRNA‐processing protein. The candidate gene C.cajan_15535 on CcLG08 has C (in resistant genotypes) to G (in susceptible genotypes) substitution, which leads to a silent change from CAA (glutamine) to GAA (glutamine). This gene is functionally characterized as copia‐like retrotransposable. The chromosome CcLG11 had three SNP positions 19 958 148 bp, 32 606 065 bp and 35 228 097 bp in three candidate genes C.cajan_01839, C.cajan_02962 and C.cajan_03203, respectively. The candidate gene C.cajan_01839 had A (in resistant genotypes) to C (in susceptible genotypes) substitution resulting effect on exon 1 and leads to change into the codon from TAC (tyrosine) to TCC (serine). Functional characterization of C.cajan_01839 reveals their role in serine–threonine protein phosphatase an important candidate gene for defence mechanism. At SNP position 32 606 065 bp candidate gene, C.cajan_02962 had T (in resistant genotypes) to C (in susceptible genotypes) substitution, which leads to change in the amino acid from proline (CCA) to glutamine (CAA) and characterized as NADH dehydrogenase. In the same way, another candidate gene C.cajan_03203 had C (in resistant genotypes) to A (in susceptible genotypes) substitution. The change in nucleotide substitution led to conversion in amino acid from leucine (TTG) to phenylalanine (TTT) and characterized as retrovirus‐like polyprotein. Detailed functional annotation of these seven candidate genes is presented in Table S7.

In silico structural analysis of the candidate genes

To understand the structural variation in the candidate genes, nonsynonymous SNPs substitution effects were calculated. For each nsSNPs, the mutation effect was calculated based on proven score value (cut‐off = −2.50). As a result, of seven nsSNPs, five mutations did not show any deleterious effect in the protein structure analysis. Therefore, the five genes causing no changes in protein structure were not selected for further study. The remaining two mutations targeting candidate genes, namely C.cajan_01839 (serine–threonine protein phosphatase) (Figure S3) and C.cajan_03203 (retrovirus‐like polyprotein) (Figure S4), showed deleterious effect on the protein structure with proven score value of −8.56 and −3.69, respectively (Table S7). Proteins 3D structure of the two genes was modelled with >90% accuracy to analyse conformational changes in the translated proteins. In the case of C.cajan_01839, secondary protein structure revealed 1% variation in β‐sheet model of the resistant and susceptible genotypes. For another gene C.cajan_03203, a variation of 32% for resistant and 31% for susceptible genotypes was observed in the α‐helix secondary protein structure.

Expression profiling of candidate genes

Above‐mentioned approaches provide converging evidences about causativeness of the C.cajan_03203 gene with FW and the C.cajan_01839 gene with SMD. For confirming this at functional level, quantitative real‐time PCR (qRT‐PCR) for these genes was undertaken on root (for FW) and leaf (for SMD) tissues of the resistant (ICPL 20096) and susceptible (ICPL 332) genotypes for both FW and SMD. The C.cajan_03203 gene showed 3.45‐fold down‐regulation in root tissues in the FW‐susceptible genotype as compared to 0.50‐fold down‐regulation in the FW‐resistant genotype. Similarly, the C.cajan_01839 gene showed 2.32‐fold up‐regulation in leaf tissues in the SMD‐susceptible genotype as compared to 0.83‐fold up‐regulation in the SMD‐resistant genotype (Figure S5).

Discussion

Conventional methods of trait mapping required genotyping of all the individuals of the developed mapping population. This process, however, is laborious, time‐consuming and costly. Additionally, due to low level of polymorphism in some crop species like, pigeonpea and groundnut, identification of polymorphic markers is another challenging task (Pandey et al., 2011; Saxena et al., 2010b). To overcome these issues, in the recent past, NGS‐based technologies have been successfully utilized in trait mapping (see Varshney et al., 2014). Recently sequenced pigeonpea genome opened the new avenues to enable the NGS‐based breeding for rapid trait mapping similar to other crops (Varshney et al., 2012a). Next‐generation sequencing technologies can generate large number of short reads in less time, and with the help of powerful bioinformatics tools, it is possible to assemble the reads for variant (SNPs) calling between and among genotypes. This approach can be useful for identification of higher number of genomewide SNPs, which can be used in number of applications such as in trait mapping, MAS, etc. (Varshney et al., 2009). In the near future, NGS‐derived WGS‐based approach is expected to be the approach for trait mapping. In this study, we have applied two WGRS‐based methods, that is Seq‐BSA and nonsynonymous SNPs substitution to find out the candidate genes for two most dreaded diseases namely FW and SMD for enabling genomics‐assisted breeding. To find out the SNPs associated with the trait of interest, we used the concept of traditional BSA approach as proposed by Michelmore et al. (1991). This method has been used to map large number of simply inherited traits (Semagn et al., 2010). Due to continuous advances and reduction in cost of sequencing technologies, we have used BSA approach by sequencing the extreme bulks instead of screening with molecular markers. We constituted and sequenced two pools of 16 plants each for resistance and susceptible to these two diseases and referred the approach as Seq‐BSA. Accordingly, the probability (as calculated based on Michelmore et al., 1991; 2 (1 ‐ [1/4]16)(1/4)16) of an unlinked locus being polymorphic between bulks of 16 such individuals would be equivalent to 4.65 × 10−10. Consequently, on the basis of SNP index analysis, we have identified the SNPs that were monomorphic for resistant parent and resistant bulk (SNP index = 0) but entirely different in susceptible bulk (SNP index = 1) with read depth of ≥7. This approach directly reduced the number of SNPs from 4139 to seven (0.0016%) (with SNP index 1 and 0). Out of seven SNPs identified, only one SNP was present in the genic region (C.cajan_01839) while the other SNPs were present in the nongenic region. Our findings are similar to the previous studies in which NGS‐based pooled analysis does not allow the identification of direct candidate genes but provides information on putative candidate genes in form of associated SNPs (Hartwig et al., 2012). As the parental genotypes are segregating for two traits, it was difficult to associate the identified SNPs with one or both (FW, SMD) diseases. Therefore, nsSNP substitution‐based approach was utilized. This approach was successfully utilized for mapping drought tolerance in maize (Xu et al., 2014) and sheath blight resistance in rice (Silva et al., 2012). This approach identified 8362 nsSNPs between the R‐ and S‐bulks which are too far to be associated with the target trait. In the present study, we combined both Seq‐BSA and nsSNP approach that narrowed down putative eight SNPs in seven candidate genes on three different chromosomes (CcLG02, CcLG08 and CcLG11) of pigeonpea. However, we did not identify any nsSNP within flanking regions of the SNPs identified through Seq‐BSA on CcLG07 and CcLG10. This may be attributed to the low sequencing depth in these regions. These speculations, however, need to be confirmed by re‐sequencing the particular regions at higher depth. For classification of identified genes in terms of their association with resistance to FW and SMD, detailed haplotype analysis in diverse set of lines, In silico structural (protein) analysis and transcript profiling finally provided causativeness of C.cajan_03203 gene to FW and C.cajan_01839 gene to SMD. For instance, the single nucleotide mutation in these genes was predicted with deleterious effects for translation at the protein level that was also reflected in the form of conformational change in the secondary structure of these genes. qRT‐PCR‐based expression studies also showed a similar expression in the susceptible genotype (ICPL 332). The gene C.cajan_03203 codes for a retrovirus‐like polyprotein, which is known to be involved in plant defence against pathogens (Grandbastien, 2014). This gene showed >2‐fold up‐regulation in the susceptible genotype carrying the mutation as compared to the resistant genotype. Retrovirus‐like polyprotein are the mobile genetic elements which can replicate and transpose from one position in the genome to the other position either by RNA intermediate (Class‐I) driven reverse transcription or by direct transposition (Class‐II) (Grandbastien, 2014). Many of the plant retrotransposons reported to date are transcriptionally activated by various abiotic and biotic factors (Grandbastien, 1998). Fungal pathogens like Trichoderma viride and Cladosporium fulvum or inoculation with various viral and bacterial pathogens have been shown to activate the retrotransposons (Mhiri et al., 1997; Pouteau et al., 1994). While the gene C.cajan_01839, showing causativeness with SMD, has been annotated as serine–threonine protein phosphatase, which is known to be involved in the regulation of specific signal transduction cascades. The mutation in this gene also predicted to have a deleterious effect on the susceptible genotype. This SNP may also be responsible for showing significant down‐regulation of the gene in the susceptible genotype and could possibly lead to a defect in the response to SMD. Serine–threonine protein phosphatase is the known principal classes of plant defence genes (Farkas et al., 2007). This gene has been shown to play an important role in defence mechanism of Tobacco mosaic virus resistance (Dunigan and Madlender, 1995). It is evident from this study that NGS‐based approaches do increase not only the precision and power but also saves time in the identification of candidate genes for targeted traits as compared to the conventional mapping methods. We anticipate the accelerated use of NGS‐based mapping strategies in all those species, where draft genome sequence has become available. The present study identifies one gene (C.cajan_03203) associated with FW and one gene (C.cajan_01839) with SMD in pigeonpea. The diagnostic SNPs in these genes can be assayed in some cost‐effective marker platform such as CAPS (cleaved amplified polymorphic sequences) or KASP Assay for use in the MAS. Molecular breeding using such genes will help the development of superior lines with enhanced resistance to FW and SMD that will eventually enhance crop productivity of pigeonpea.

Experimental procedures

Plant materials

A total of 11 pigeonpea genotypes were selected based on their FW and SMD responses identified in our previous experiments (Saxena et al., 2010a) (Table 1). Among the selected genotypes, ICPL 20096, ICPL 99050, ICPL 20097, ICPL 8863, ICPL 87119 and HPL 24 were resistant to FW, and ICPL 20096, ICPL 99050, ICPL 20097, ICPB 2049, ICPL 87119, ICPL 85063 were resistant to SMD. Furthermore, ICPL 85063, ICPL 332, ICPB 2049 ICPL 87 and ICPL 88039 were susceptible to FW, and ICP 8863 and ICPL 332 were susceptible to SMD. Two genotypes ICPL 20096 (FW and SMD resistant) and ICPL 332 (FW and SMD susceptible) with contrasting phenotypes were crossed, and the confirmed F1s were selfed through single‐seed descent method. Finally, a total of 188 F 7 recombinant inbred lines (RILs) were produced.

Phenotyping for FW and SMD resistance

The 11 contrasting genotypes for FW and SMD resistance used in the present study were phenotyped in sick plot nursery at Patancheru (Telangana State, India) during crop season 2012–2013 and 2013–2014. Similarly, the RILs (ICPL 20096× ICPL 332) along with contrasting parents were sown in the sick plot nursery Patancheru and Gulbarga (Karnataka State, India) for FW and Patancheru and Tandur (Telangana State, India) for SMD during crop season 2012–2013 and 2013–2014 in three replications using randomized complete block design (RCBD). The experimental plots were four metres long with row to row spacing of 75 cm and a 20 spacing cm between plants. For satisfactory evaluation, the selection of FW‐sick plots was on the basis of disease incidence seen every year and for SMD ‘Leaf Stapling Technique’ (Nene and Reddy, 1976) at two leaf stages was followed. The observations for FW and SMD incidence were recorded at 30 and 90 days after sowing (DAS), and susceptible and resistant RILs were identified using the scale described by Singh et al. (2003). Based on the per cent disease incidence (PDI) score, the RILs and their parental lines were classified into four categories (i) resistant (0%–9.99% of PDI), (ii) moderately resistance (10%–19.99% of PDI), (iii) susceptible (20%–40% of PDI), and (iv) highly susceptible (>40% of PDI).

Construction of sequencing libraries and Illumina sequencing

From 188 RILs, a total of 32 RILs (16 susceptible and 16 resistant) and 10 genotypes (ICPL 20096, ICP 8863, ICPB 2049, ICPL 20097, ICPL 99050, ICPL 332, HPL 24, ICP 85063, ICPL 87 and ICPL 88039) were selected for sequencing. The sequencing data of one genotype namely, ICPL 87119, were used from the already published genome (Varshney et al., 2012a). Genomic DNA was isolated from two to three young leaves from selected RILs and 10 genotypes using NucleoSpin Plant II kit (Macherey‐Nagel, Düren, Germany). Two DNA pools, one resistant bulk (R‐bulk) and one susceptible bulk (S‐bulk), were prepared by mixing equimolar concentration of DNA samples from resistant and susceptible RILs. The Illumina libraries for these two bulks and 10 genotypes were prepared using TruSeq DNA sample Prep kit LT (set A) FC‐121‐2001. Two microgram DNA from each sample was sheared using Bioruptor® NGS (Diogenode, Liege, Belgium), end repaired and adapter ligated. Size selection of libraries was performed using 2% agarose gel to obtain a target insert size of 500–600 bp and purified for further analysis. Further, the libraries were enriched using adaptor compatible PCR primers. The size distribution of amplified DNA libraries was checked on an Agilent Technologies 2100 bioanalyzer using a High Sensitivity chip (Agilent Technologies, Palo Alto, CA, USA). The DNA libraries were sequenced on Illumina MiSeq platform using MiSeq Reagent Kit v2 (500 cycles) (Illumina Inc., San Diego, CA, USA) to generate 250 base paired‐end reads.

Alignment of short reads of bulks for Seq‐BSA

QTL‐seq pipeline (http://genome-e.ibrc.or.jp/home/bioinformatics-team/mutmap, developed by Iwate Biotechnology Research Center, Japan) was used for calculating SNP indices. Briefly, the cleaned reads of resistant parent (ICPL 20096) were first aligned to the reference genome (Asha) using inbuilt BWA aligner (Li et al., 2009). Coval was used for postprocessing and filtering of the alignment files (Kosugi et al., 2013). The variants called for the resistant parent were then used to develop reference‐guided assembly of the resistant parent (ICPL 20096; resistant parent) by substituting the bases with confidence variants calls in the genome. The reads from R‐ and S‐bulks were then aligned, and variants were called for both the bulks against the developed assembly. SNP index was calculated at each SNP position for both the bulks as suggested by Abe et al. (2012) using the formula: The SNPs with read depth <7 in both the bulks and SNP index <0.3 in either of the bulks were filtered out and SNPs with homozygous alleles in both the bulks were used for ∆SNP index calculation using formula: Only, SNP positions with ∆SNP index = −1 (i.e. the allele called in R‐bulk was same as that of resistant parent while contrastingly different in S‐bulk) were considered as the causal SNPs responsible for the trait of interest. Additionally, chi‐square analysis was performed for both the bulks based on the read depth to test the level of significance of associated SNPs at P < 0.01.

Identification of nonsynonymous SNPs

The WGRS data of four parents, namely ICPL 20097 (R‐FW and R‐SMD) and ICP 8863 (R‐FW and S‐SMD), ICPB 2049 (S‐FW and R‐SMD) and ICPL 99050 (R‐FW and R‐SMD), segregating for FW and SMD along with R‐ and S‐bulks were used for nsSNPs identification. The data were aligned against the reference genome of Asha (Varshney et al., 2012a) using Bowtie 2 (http://bowtie-bio.sourceforge.net/bowtie2/index.shtml). The BAM files thus obtained were used for SNP identification using Samtools 1.0 (Li et al., 2009). The SNPs obtained were annotated using SnpEff tool (http://snpeff.sourceforge.net/), and nsSNPs were subsequently identified using the stringent criteria as described in Silva et al. (2012). Further, nsSNPs present in 2‐Mb region flanking the SNP positions identified from Seq‐BSA study were selected, and the functions of associated genes were predicted by searching the respective protein sequences against nonredundant (nr) database using BLASTP program implemented in Blast2GO software (Conesa et al., 2005). The results from both the approaches were combined to identify disease‐specific nsSNPs.

qRT‐PCR for expression profiling

One resistant (ICPL 20096) and one susceptible (ICPL 332) genotypes for both FW and SMD, respectively, were used to validate the functionality of two putative candidate genes. Primers were designed for each candidate genes using Primer 3 software for qRT‐PCR experiment using standard criteria (Rosen and Skaletsky, 2000). The list of primers used in the qRT‐PCR analysis is provided in Table S8. FW and SMD stresses were imposed on 10 days old seedlings of ICPL 20096 and ICPL 332 grown in two sets for each stress. Root dip inoculation (FW) and leaf staple techniques (SMD) were followed for stress imposition under glasshouse conditions, and tissues were harvested after seven days of stress. Total RNA was isolated from roots (FW) and leaves (SMD) using XcelGen Plant RNA Mini Kit (Xcelris Genomics, Gujarat, India), while cDNA was synthesized using SuperScript® III First‐Strand Synthesis SuperMix (Invitrogen, Life Technologies, Thermo Fisher Scientific Corporation, Waltham, MA, USA) for qRT‐PCR. The qRT‐PCR reactions were performed using SYBR green master mix in 96‐well plates with two technical replicates and three biological replicates. The qRT‐PCR reaction was performed as mentioned previously (Mir et al., 2014). The housekeeping gene Actin was used as an endogenous control to normalize the variations in the cDNA samples. The data were compiled from the mean C t values of all the biological replicates after normalizing with the C t values of the endogenous control. The relative transcriptional level in terms of fold change was calculated using the 2−ΔΔCt method (Livaka and Schmittgen, 2001). Tukey's post hoc multiple comparison test using SPSS (version 16.0; SPSS Inc., Chicago, IL, USA) was used for data analysis at P < 0.05 to present significant values statistically. The different and similar letters were considered as statistically nonsignificant.

Protein structure analysis

Phyre 2 (Protein Homology/analogY Recognition Engine v2.0 server (www.sbg.bio.ic.ac.uk/phyre2) was used for 3D modelling structures of proteins. Phyre 2 uses the alignment of hidden Markov models via HH search to significantly improve the accuracy of alignment to known 3D structure models. It also incorporates an ab initio folding simulation called Poing to model regions of proteins with no detectable homology (Kelley and Sternberg, 2009). Best models were selected based on superfamilies, confidence key, coverage and amino acid identities.

Conflict of interest

The author(s) declare that they have no competing interests. Figure S1 Read depth of R‐ and S‐ bulks calculated across the 11 chromosome of pigeonpea. Click here for additional data file. Figure S2 SNP counts per window across the 11 chromosome of pigeonpea. Click here for additional data file. Figure S3 Protein sequence and nsSNP effect analysis for C.cajan_01839 gene. Click here for additional data file. Figure S4 Protein sequence and nsSNP effect analysis for C.cajan_03203 gene. Click here for additional data file. Figure S5 Expression profiling of two genes. Click here for additional data file. Table S1 Chromosome wise SNPs distribution between resistant (R) and susceptible (S) bulks. Click here for additional data file. Table S2 Chi‐square analysis for selected SNPs. Click here for additional data file. Table S3 Genome wide identification of nsSNPs between resistant and susceptible genotypes along with their respective bulks. Click here for additional data file. Table S4 Distribution of nsSNPs among resistant and susceptible genotypes and both the bulks. Click here for additional data file. Table S5 Validation of nsSNPs to the candidate genes responsive to fusarium wilt resistance. Click here for additional data file. Table S6 Validation of nsSNPs to the candidate genes responsive to sterility mosaic disease resistance. Click here for additional data file. Table S7 Non‐synonymous SNP substitution effects and expression fold difference for selected associated genes. Click here for additional data file. Table S8 List of primer pairs used for qRT‐PCR analysis of target genes. Click here for additional data file.

30 in total

1. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors: K J Livak; T D Schmittgen
Journal: Methods Date: 2001-12 Impact factor: 3.608

2. Next-generation mapping of Arabidopsis genes.

Authors: Ryan S Austin; Danielle Vidaurre; George Stamatiou; Robert Breit; Nicholas J Provart; Dario Bonetta; Jianfeng Zhang; Pauline Fung; Yunchen Gong; Pauline W Wang; Peter McCourt; David S Guttman
Journal: Plant J Date: 2011-07-18 Impact factor: 6.417

3. Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations.

Authors: R W Michelmore; I Paran; R V Kesseli
Journal: Proc Natl Acad Sci U S A Date: 1991-11-01 Impact factor: 11.205

4. Protein structure prediction on the Web: a case study using the Phyre server.

Authors: Lawrence A Kelley; Michael J E Sternberg
Journal: Nat Protoc Date: 2009 Impact factor: 13.491

5. SHOREmap: simultaneous mapping and mutation identification by deep sequencing.

Authors: Korbinian Schneeberger; Stephan Ossowski; Christa Lanz; Trine Juul; Annabeth Høgh Petersen; Kåre Lehmann Nielsen; Jan-Elo Jørgensen; Detlef Weigel; Stig Uggerhø Andersen
Journal: Nat Methods Date: 2009-08 Impact factor: 28.547

6. QTL-seq identifies an early flowering QTL located near Flowering Locus T in cucumber.

Authors: Hongfeng Lu; Tao Lin; Joël Klein; Shenhao Wang; Jianjian Qi; Qian Zhou; Jinjing Sun; Zhonghua Zhang; Yiqun Weng; Sanwen Huang
Journal: Theor Appl Genet Date: 2014-05-21 Impact factor: 5.699

7. The Sequence Alignment/Map format and SAMtools.

Authors: Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal: Bioinformatics Date: 2009-06-08 Impact factor: 6.937

8. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.

Authors: Ana Conesa; Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Manuel Talón; Montserrat Robles
Journal: Bioinformatics Date: 2005-08-04 Impact factor: 6.937

9. Coval: improving alignment quality and variant calling accuracy for next-generation sequencing data.

Authors: Shunichi Kosugi; Satoshi Natsume; Kentaro Yoshida; Daniel MacLean; Liliana Cano; Sophien Kamoun; Ryohei Terauchi
Journal: PLoS One Date: 2013-10-08 Impact factor: 3.240

10. MutMap+: genetic mapping and mutant identification without crossing in rice.

Authors: Rym Fekih; Hiroki Takagi; Muluneh Tamiru; Akira Abe; Satoshi Natsume; Hiroki Yaegashi; Shailendra Sharma; Shiveta Sharma; Hiroyuki Kanzaki; Hideo Matsumura; Hiromasa Saitoh; Chikako Mitsuoka; Hiroe Utsushi; Aiko Uemura; Eiko Kanzaki; Shunichi Kosugi; Kentaro Yoshida; Liliana Cano; Sophien Kamoun; Ryohei Terauchi
Journal: PLoS One Date: 2013-07-10 Impact factor: 3.240

39 in total

Review 1. Prospects of next generation sequencing in lentil breeding.

Authors: Jitendra Kumar; Debjyoti Sen Gupta
Journal: Mol Biol Rep Date: 2020-10-10 Impact factor: 2.316

2. Association mapping to discover significant marker-trait associations for resistance against fusarium wilt variant 2 in pigeonpea [Cajanus cajan (L.) Millspaugh] using SSR markers.

Authors: Prakash G Patil; Jyotirmay Dubey; Abhishek Bohra; R K Mishra; P R Saabale; Alok Das; Meenal Rathore; N P Singh
Journal: J Appl Genet Date: 2017-06-03 Impact factor: 3.240

3. Genetic dissection of heterotic loci associated with plant weight by Graded pool-seq in heading Chinese cabbage (Brassica rapa).

Authors: Lixin Yue; Rifei Sun; Guoliang Li; Feng Cheng; Limin Gao; Qinghua Wang; Shifan Zhang; Hui Zhang; Shujiang Zhang; Fei Li
Journal: Planta Date: 2022-05-16 Impact factor: 4.116

4. An autopolyploid-suitable polyBSA-seq strategy for screening candidate genetic markers linked to leaf blight resistance in sugarcane.

Authors: Zhoutao Wang; Hui Ren; Chao Pang; Guilong Lu; Fu Xu; Wei Cheng; Youxiong Que; Liping Xu
Journal: Theor Appl Genet Date: 2021-11-13 Impact factor: 5.699

5. The GmSNAP11 Contributes to Resistance to Soybean Cyst Nematode Race 4 in Glycine max.

Authors: Abdulwahab S Shaibu; Shengrui Zhang; Junkui Ma; Yue Feng; Yuanyuan Huai; Jie Qi; Jing Li; Ahmed M Abdelghany; Muhammad Azam; Honey Thet Paing Htway; Junming Sun; Bin Li
Journal: Front Plant Sci Date: 2022-07-04 Impact factor: 6.627

6. Complete resistance to powdery mildew and partial resistance to downy mildew in a Cucumis hystrix introgression line of cucumber were controlled by a co-localized locus.

Authors: Kaijing Zhang; Xing Wang; Wenwei Zhu; Xiaodong Qin; Jian Xu; Chunyan Cheng; Qunfeng Lou; Ji Li; Jinfeng Chen
Journal: Theor Appl Genet Date: 2018-08-04 Impact factor: 5.699

7. SNP-based pool genotyping and haplotype analysis accelerate fine-mapping of the wheat genomic region containing stripe rust resistance gene Yr26.

Authors: Jianhui Wu; Qingdong Zeng; Qilin Wang; Shengjie Liu; Shizhou Yu; Jingmei Mu; Shuo Huang; Hanan Sela; Assaf Distelfeld; Lili Huang; Dejun Han; Zhensheng Kang
Journal: Theor Appl Genet Date: 2018-04-17 Impact factor: 5.699

Review 8. From Mendel's discovery on pea to today's plant genetics and breeding : Commemorating the 150th anniversary of the reading of Mendel's discovery.

Authors: Petr Smýkal; Rajeev K Varshney; Vikas K Singh; Clarice J Coyne; Claire Domoney; Eduard Kejnovský; Thomas Warkentin
Journal: Theor Appl Genet Date: 2016-10-07 Impact factor: 5.699

9. Genetic mapping of green curd gene Gr in cauliflower.

Authors: Huaqiang Tan; Xin Wang; Zhangjun Fei; Huanxiu Li; Yaakov Tadmor; Michael Mazourek; Li Li
Journal: Theor Appl Genet Date: 2019-11-01 Impact factor: 5.699

10. Next-generation sequencing to identify candidate genes and develop diagnostic markers for a novel Phytophthora resistance gene, RpsHC18, in soybean.

Authors: Chao Zhong; Suli Sun; Yinping Li; Canxing Duan; Zhendong Zhu
Journal: Theor Appl Genet Date: 2017-11-14 Impact factor: 5.699