| Literature DB >> 28027425 |
Vikas K Singh1, Aamir W Khan1, Rachit K Saxena1, Pallavi Sinha1, Sandip M Kale1, Swathi Parupalli1, Vinay Kumar1, Annapurna Chitikineni1, Suryanarayana Vechalapu1, Chanda Venkata Sameer Kumar1, Mamta Sharma1, Anuradha Ghanta2, Kalinati Narasimhan Yamini2, Sonnappa Muniswamy3, Rajeev K Varshney1,4.
Abstract
Identification of candidate genomic regions associated with target traits using conventional mapping methods is challenging and time-consuming. In recent years, a number of single nucleotide polymorphism (SNP)-based mapping approaches have been developed and used for identification of candidate/putative genomic regions. However, in the majority of these studies, insertion-deletion (Indel) were largely ignored. For efficient use of Indels in mapping target traits, we propose Indel-seq approach, which is a combination of whole-genome resequencing (WGRS) and bulked segregant analysis (BSA) and relies on the Indel frequencies in extreme bulks. Deployment of Indel-seq approach for identification of candidate genomic regions associated with fusarium wilt (FW) and sterility mosaic disease (SMD) resistance in pigeonpea has identified 16 Indels affecting 26 putative candidate genes. Of these 26 affected putative candidate genes, 24 genes showed effect in the upstream/downstream of the genic region and two genes showed effect in the genes. Validation of these 16 candidate Indels in other FW- and SMD-resistant and FW- and SMD-susceptible genotypes revealed a significant association of five Indels (three for FW and two for SMD resistance). Comparative analysis of Indel-seq with other genetic mapping approaches highlighted the importance of the approach in identification of significant genomic regions associated with target traits. Therefore, the Indel-seq approach can be used for quick and precise identification of candidate genomic regions for any target traits in any crop species.Entities:
Keywords: Indels; bulked segregant analysis; fusarium wilt; sterility mosaic disease; whole-genome resequencing
Mesh:
Year: 2017 PMID: 28027425 PMCID: PMC5466435 DOI: 10.1111/pbi.12685
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Pipeline of Indel‐seq approach. (a) Two contrasting parents (high trait parent (HTP) and low trait parent (LTP) are crossed to develop segregating population (F2/RILs) for target traits). (b) Based on the phenotyping of mapping population for the target traits, ~20 plants with extreme phenotype are selected for the constitution of extreme pools. (c) Low trait bulk (LTB) and high trait bulk (HTB) are constructed based on equimolar bulking of ~20 individuals of DNA for each bulk. (d) These two DNA bulks along with high trait parent (HTP) are used to whole‐genome resequencing. (e) Raw reads of HTP, HTB and LTB are aligned to the reference genome (RG, which is similar to high trait parent in term of target phenotype) for the identification of Indels. (f) Bulked segregant analysis (BSA) approach is applied to identify the associated Indels with the target traits using several Indel filtering criteria to identify putatively associated Indels between resistance and susceptibility. (g) List of putatively linked Indels identified.
Figure 2Phenotypic reaction of resistant and susceptible Fusarium wilt (FW) and sterility mosaic disease (SMD) plants. FW is a seed and soil borne fungal disease caused by Fusairum udum. Wilt symptoms usually appear when plants are in flowering and podding stage (a), but sometimes occur earlier when plants are 1‐2‐month‐old (b). SMD is a viral disease caused by Pigeonpea sterility mosaic virus (. This disease can be easily identified from a distance as patches of bushy, pale green plants (c) without flower or pods (d). Due to excess vegetative growth, without growing into reproductive phase, this disease is known as the green plague of pigeonpea.
Figure 3Flow diagram of Indel‐seq analysis for identification of candidate genes for FW and SMD resistance in pigeonpea. (a) Whole‐genome resequencing of the resistant parent (HTP), resistant bulk (HTB) and susceptible bulk (LTB) was performed with more than ≥10× genome coverage. (b) The generated raw reads of HTP, HTB and LTB were aligned with the reference genome (RG) for identification of genomewide Indels. The value presented in the funnel is the number of Indels identified/selected in each step, which is further classified as insertion (I) and deletion (D) (c) Total number of Indels identified after mapping of HTP, HTB and LTB on RG. (d) Further, only those Indels were selected, which possess ≥30 quality score. (e) Only homozygous Indels among HTP, HTB and LTB bulks were selected for further analysis (f) To remove false‐positive associations, only those Indels were selected which possesses reads ≥7 at both the bulk positions. (g) Homozygous polymorphic Indels were identified between both the bulks. (h) The classical concept of bulked segregant analysis (BSA) approach was implemented (RG = HTP = HTB ≠ LTB) for identification of putatively associated Indels (see text for the explanation). (i) Chi‐square test at 99% probability level was performed at each selected positions based on the presence of reads at selected Indel positions to select trait‐associated Indels.
Identification of Indels between resistant and susceptible bulks using Indel‐seq approach
| Gene | Type | Linkage group | Position (bp) | RG | HTP | Resistant bulk | Susceptible bulk | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HTB | Read depth | χ2 value |
| LTB | Read depth | χ2 value |
| ||||||
|
| I | CcLG02 | 12 535 647 | C | C | C | 15 | 8.07 | <0.001 | CA | 14 | 9.94 | <0.001 |
|
| |||||||||||||
|
| D | CcLG02 | 14 020 849 | CA | CA | CA | 7 | 7.00 | <0.001 | C | 15 | 8.00 | <0.001 |
|
| |||||||||||||
|
| I | CcLG02 | 14 397 213 | A | A | A | 19 | 8.89 | <0.001 | AT | 18 | 7.12 | <0.001 |
|
| |||||||||||||
|
| I | CcLG02 | 19 386 341 | T | T | T | 14 | 7.14 | <0.001 | TC | 11 | 8.00 | <0.001 |
|
| D | CcLG03 | 10 887 279 | GTA | GTA | GTA | 13 | 9.31 | <0.001 | G | 16 | 9.80 | <0.001 |
|
| I | CcLG06 | 890 690 | A | A | A | 12 | 12.00 | <0.001 | AT | 12 | 9.00 | <0.001 |
|
| |||||||||||||
|
| I | CcLG06 | 3 364 388 | C | C | C | 8 | 8.00 | <0.001 | CT | 7 | 8.33 | <0.001 |
|
| |||||||||||||
|
| D | CcLG07 | 405 527 | AT | AT | AT | 8 | 8.00 | <0.001 | A | 14 | 7.14 | <0.001 |
|
| I | CcLG08 | 7 106 619 | T | T | T | 8 | 8.00 | <0.001 | TG | 20 | 7.00 | <0.001 |
|
| |||||||||||||
| C.cajan_16060 (d) | I | CcLG08 | 7 820 397 | C | C | C | 19 | 11.84 | <0.001 | CCAACAA | 11 | 10.29 | <0.001 |
|
| I | CcLG09 | 2 209 342 | A | A | A | 14 | 7.14 | <0.001 | AT | 11 | 11.00 | <0.001 |
|
| |||||||||||||
|
| I | CcLG10 | 13 435 965 | C | C | C | 13 | 9.31 | <0.001 | CA | 17 | 8.07 | <0.001 |
|
| |||||||||||||
|
| D | CcLG10 | 13 516 086 | TTA | TTA | TTA | 15 | 8.07 | <0.001 | T | 8 | 14.22 | <0.001 |
|
| |||||||||||||
|
| D | CcLG10 | 18 889 276 | AT | AT | AT | 18 | 8.00 | <0.001 | A | 17 | 7.36 | <0.001 |
|
| D | CcLG11 | 17 030 340 | CA | CA | CA | 19 | 8.89 | <0.001 | C | 8 | 7.36 | <0.001 |
|
| |||||||||||||
|
| I | CcLG11 | 22 814 098 | G | G | G | 15 | 8.07 | <0.001 | GT | 11 | 7.36 | <0.001 |
Gene: u: upstream region; d: downstream region: i, intron; f, frame shift.
Type of Indels: ‘I’ stand for insertion and ‘D’ stand for deletion.
RG: Reference genome (Asha; ICPL 87119) (http://www.icrisat.org/gt-bt/iipg/genomedata.zip).
HTP: Resistant parent (ICPL 20096).
HTB: Resistant bulk.
LTB: Susceptible bulk.
Validation of candidate Indels in four known (resistant and susceptible) genotypes for FW and SMD resistance
| Linkage group | Indel positions (bp) | RG | HTP | HTB | LTB | ICPB 2049 | ICPL 99050 | ICPL 20097 | ICP 8863 |
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FW‐R | SMD‐R | FW‐R | SMD‐R | FW‐R | SMD‐R | FW‐S | SMD‐S | FW‐S | SMD‐R | FW‐R | SMD‐R | FW‐R | SMD‐R | FW‐R | SMD‐S | ||||
| CcLG02 | 12 535 647 | C | C | C | CA | CA | C | C | C |
| 0.42 | ||||||||
| CcLG02 | 14 020 849 | CA | CA | CA | C | CA | CA | CA | C | 0.42 |
| ||||||||
| CcLG02 | 14 397 213 | A | A | A | AT | A | A | A | A | 0.08 | 0.08 | ||||||||
| CcLG02 | 19 386 341 | T | T | T | TC | T | T | T | T | 0.08 | 0.08 | ||||||||
| CcLG03 | 10 887 279 | GTA | GTA | GTA | G | G | G | G | G | 0.27 | 0.27 | ||||||||
| CcLG06 | 890 690 | A | A | A | AT | A | A | A | A | 0.08 | 0.08 | ||||||||
| CcLG06 | 3 364 388 | C | C | C | CT | C | C | C | C | 0.08 | 0.08 | ||||||||
| CcLG07 | 405 527 | AT | AT | AT | A | A | AT | AT | AT |
| 0.42 | ||||||||
| CcLG08 | 7 106 619 | T | T | T | TG | TG | T | T | T |
| 0.42 | ||||||||
| CcLG08 | 7 820 397 | C | C | C | CCAACAA | C | C | C | C | 0.08 | 0.08 | ||||||||
| CcLG09 | 2 209 342 | A | A | A | AT | AT | AT | AT | AT | 0.27 | 0.27 | ||||||||
| CcLG10 | 13 435 965 | C | C | C | CA | C | C | C | C | 0.08 | 0.08 | ||||||||
| CcLG10 | 13 516 086 | TTA | TTA | TTA | T | TTA | TTA | TTA | TTA | 0.08 | 0.08 | ||||||||
| CcLG10 | 18 889 276 | AT | AT | AT | A | AT | AT | AT | A | 0.04 |
| ||||||||
| CcLG11 | 17 030 340 | CA | CA | CA | C | C | C | C | C | 0.27 | 0.27 | ||||||||
| CcLG11 | 22 814 098 | G | G | G | GT | G | G | G | G | 0.08 | 0.08 | ||||||||
RG: Reference genome (Asha; ICPL 87119) (http://www.icrisat.org/gt-bt/iipg/genomedata.zip).
HTP: Resistant parent (ICPL 20096).
HTB: Resistant bulk.
LTB: Susceptible bulk.
R: resistant reaction.
S: susceptible reaction.
Heterozygous calls.
P‐value <0.00 (boldface) found significant for specific disease resistance.