| Literature DB >> 35022582 |
Vikas Singh1,2, Pallavi Sinha1,2, Jimmy Obala1,3, Aamir W Khan1, Annapurna Chitikineni1, Rachit K Saxena1,4, Rajeev K Varshney5,6.
Abstract
To identify genomic segments associated with days to flowering (DF) and leaf shape in pigeonpea, QTL-seq approach has been used in the present study. Genome-wide SNP profiling of extreme phenotypic bulks was conducted for both the traits from the segregating population (F2) derived from the cross combination- ICP 5529 × ICP 11605. A total of 126.63 million paired-end (PE) whole-genome resequencing data were generated for five samples, including one parent ICP 5529 (obcordate leaf and late-flowering plant), early and late flowering pools (EF and LF) and obcordate and lanceolate leaf shape pools (OLF and LLS). The QTL-seq identified two significant genomic regions, one on CcLG03 (1.58 Mb region spanned from 19.22 to 20.80 Mb interval) for days to flowering (LF and EF pools) and another on CcLG08 (2.19 Mb region spanned from 6.69 to 8.88 Mb interval) for OLF and LLF pools, respectively. Analysis of genomic regions associated SNPs with days to flowering and leaf shape revealed 5 genic SNPs present in the unique regions. The identified genomic regions for days to flowering were also validated with the genotyping-by-sequencing based classical QTL mapping method. A comparative analysis of the identified seven genes associated with days to flowering on 12 Fabaceae genomes, showed synteny with 9 genomes. A total of 153 genes were identified through the synteny analysis ranging from 13 to 36. This study demonstrates the usefulness of QTL-seq approach in precise identification of candidate gene(s) for days to flowering and leaf shape which can be deployed for pigeonpea improvement.Entities:
Mesh:
Year: 2022 PMID: 35022582 PMCID: PMC9177671 DOI: 10.1038/s41437-021-00486-x
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.832
Sequencing details of parental line and bulks and mapping of sequence reads.
| Sample | Number of lines bulked | Total reads generated (Million reads) | High quality reads (Million reads) | Reads mapped (Million reads) | Genome coverage at 1X | Average depth (X) |
|---|---|---|---|---|---|---|
| ICP 5529a | 58.63 | 35.33 | 15.33 | 90.70% | 9.15 | |
| EFb | 15 | 66.34 | 62.64 | 30.28 | 93.13% | 15.10 |
| LFb | 15 | 76.46 | 63.99 | 30.25 | 93.26% | 15.08 |
| OLSb | 15 | 60.86 | 51.13 | 26.86 | 92.93% | 13.39 |
| LLSb | 15 | 61.88 | 52.23 | 26.90 | 92.95% | 13.41 |
EF Early flowering pool, LF Late flowering pool, OLS Obcordate leaf shape pool, LLS Lanceolate leaf shape pool.
aICP 5529 short reads were aligned to the publicly available pigeonpea genome of Asha (Varshney et al. 2012).
bThe short reads of bulks were aligned to the ICP 5529 assembly developed by replacement of SNPs between ICP 5529 and Asha.
Fig. 1A QTL-seq approach to identify genomic regions controlling days to flowering in pigeonpea.
a ICP 11605: early flowering parent; b ICP 5529: late flowering parent; c SNP index plot between early flowering pool (top), late flowering pool (middle) and ΔSNP index plot (bottom) of chromosome CcLG03 with statistical confidence interval under the null hypothesis of no QTLs (orange, P < 0.01; and green, P < 0.05). The significant genomic region identified for days to flowering is shaded (1.58 Mb region spanned through 19.22– 20.80 Mb).
Identification of SNPs in putative candidate genes for days to flowering.
| Linkage group | Gene | Position | ICP 5529 allele | LF allele | SNP index (LF pool)a | EF allele | SNP index (EF pool)b | Δ SNP-indexc | SNP effect | Function |
|---|---|---|---|---|---|---|---|---|---|---|
| CcLG03 | 19222701 | G (aCg)d | G (aCg)d | 0 | A (aTg)d | 1 | −1 | Exon (nsSNP) | Pentatricopeptide repeat-containing protein | |
| CcLG03 | 19549631 | A | A | 0 | G | 1 | −1 | Intron | Chromodomain-helicase-DNA-binding protein 4 | |
| CcLG03 | 19690312 | G | G | 0 | A | 1 | −1 | Intron | Maestro heat-like repeat-containing protein family | |
| CcLG03 | 19763754 | C | C | 0 | T | 1 | −1 | Intron | Phosphatidylinositol 4-phosphate 5-kinase 9 | |
| CcLG03 | 20439904 | G | G | 0 | C | 1 | −1 | Intron | 1,4-alpha-glucan-branching enzyme | |
| CcLG03 | 20635496 | T | T | 0 | A | 1 | −1 | Intron | Uridine nucleosidase 1 | |
| CcLG03 | 20745506 | G | G | 0 | T | 1 | −1 | Exon (sSNP) | Cell division protein | |
| CcLG03 | 20745771 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG03 | 20747419 | C | C | 0 | T | 1 | −1 | Intron |
nsSNP non-synonymous SNPS, sSNP synonymous SNP.
aSNP-index of late flowering (LF) was calculated based on the allele calls and read depth in comparison to ICP 5529 reference assembly.
bSNP-index of early flowering (EF) bulk was calculated based on the allele calls and read depth in comparison to ICP 5529 reference assembly.
cΔ SNP-index of each SNP positions was calculated using following formula: Δ SNP-index = SNP-index of LF − SNP-index of EF.
dValue in parenthesis indicates the codon change due to SNP/Code for changed amino acids.
Fig. 2A QTL-seq approach to identify genomic regions controlling obcordate leaf shape in pigeonpea.
a ICP 11605: parent of lanceolate type leaf; b ICP 5529: parent of obcordate type leaf; c SNP index plot between lanceolate leaf shape DNA pool (top), obcordate leaf shape DNA pool (middle) and ΔSNP index plot (bottom) of chromosome CcLG08 with statistical confidence interval under the null hypothesis of no QTLs (orange, P < 0.01; and green, P < 0.05). The significant genomic region identified for obcordate leaf shape is shaded (2.18 Mb region spanned from 6.69 to 8.88 Mb).
Identification of SNPs in putative candidate genes for leaf shape.
| Linkage group | Gene | Position | ICP 5529 allele | OLS allele | SNP index (OLS pool)a | LLS allele | SNP index (LLS Pool)b | Δ SNP-indexc | SNP Effect | Function |
|---|---|---|---|---|---|---|---|---|---|---|
| CcLG08 | 6701814 | A | A | 0 | T | 1 | −1 | Intron | Beta-carotene hydroxylase 2 | |
| CcLG08 | 6764651 | T (Tgt)d | T (Tgt)d | 0 | C (Cgt)d | 1 | −1 | Exon (nsSNP) | Ac-like transposase | |
| CcLG08 | 6915910 | T (aTg)d | T (aTg)d | 0 | C (aCg)d | 1 | −1 | Exon (nsSNP) | Uncharacterized protein | |
| CcLG08 | 6921340 | A | A | 0 | G | 1 | −1 | Exon (sSNP) | Pro-Pol polyprotein | |
| CcLG08 | 6923929 | T | T | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 6924854 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG08 | 6927480 | C | C | 0 | T | 1 | −1 | Intron | ||
| CcLG08 | 6927533 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG08 | 6927560 | T | T | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 6927678 | T | T | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 7057478 | G (Gca)d | G (Gca)d | 0 | A (Aca)d | 1 | −1 | Exon (nsSNP) | F-box protein | |
| CcLG08 | 7059171 | T | T | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 7068488 | G | G | 0 | C | 1 | −1 | Intron | Uncharacterized protein | |
| CcLG08 | 7068679 | G | G | 0 | A | 1 | − | Intron | ||
| CcLG08 | 7070780 | C (cCt)d | C (cCt)d | 0 | T (cTt)d | 1 | −1 | Exon (nsSNP) | ||
| CcLG08 | 7083922 | T | T | 0 | A | 1 | −1 | Intron | Transcriptional corepressor | |
| CcLG08 | 7093751 | G | G | 0 | A | 1 | −1 | Intron | ||
| CcLG08 | 7456634 | T | T | 0 | C | 1 | −1 | Intron | Cytochrome P450 | |
| CcLG08 | 7456764 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG08 | 7456831 | T | T | 0 | A | 1 | −1 | Intron | ||
| CcLG08 | 7456974 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG08 | 7457844 | C | C | 0 | A | 1 | −1 | Intron | ||
| CcLG08 | 7486941 | A | A | 0 | G | 1 | −1 | Intron | Uncharacterized protein | |
| CcLG08 | 7606346 | T | T | 0 | A | 1 | −1 | Intron | Transposon Ty3-I | |
| CcLG08 | 7641790 | A | A | 0 | C | 1 | −1 | Intron | E3 ubiquitin-protein ligase | |
| CcLG08 | 7642733 | G | G | 0 | A | 1 | −1 | Intron | ||
| CcLG08 | 7643315 | A | A | 0 | C | 1 | −1 | Exon (sSNP) | ||
| CcLG08 | 7666784 | A | A | 0 | G | 1 | −1 | Intron | Protein ROOT PRIMORDIUM DEFECTIVE 1 | |
| CcLG08 | 7667174 | T | T | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 7780600 | A | A | 0 | T | 1 | −1 | Intron | Probable methyltransferase PMT16 | |
| CcLG08 | 7838639 | C | C | 0 | T | 1 | −1 | Exon (sSNP) | 1-aminocyclopropane-1-carboxylate oxidase homolog 1 | |
| CcLG08 | 7870949 | A | A | 0 | G | 1 | −1 | Intron | ||
| CcLG08 | 7888623 | G | G | 0 | A | 1 | −1 | Intron | 1-aminocyclopropane-1-carboxylate oxidase homolog 12 | |
| CcLG08 | 7940765 | C | C | 0 | G | 1 | −1 | Intron | Tripeptidyl-peptidase 2 | |
| CcLG08 | 7947009 | G | G | 0 | A | 1 | −1 | Intron | ||
| CcLG08 | 7950673 | A | A | 0 | C | 1 | −1 | Intron | ||
| CcLG08 | 8008253 | C | C | 0 | A | 1 | −1 | Intron | Cytochrome P450 | |
| CcLG08 | 8168767 | C | C | 0 | A | 1 | −1 | Intron | Type I inositol | |
| CcLG08 | 8666995 | T | T | 0 | C | 1 | −1 | Exon (sSNP) | - |
nsSNP non-synonymous SNPS, sSNP synonymous SN.
aSNP-index of obcordate leaf shape (OLS) bulk was calculated based on the allele calls and read depth in comparison to ICP 5529 reference assembly.
bSNP-index of lanceolate leaf shape (LLS) bulk was calculated based on the allele calls and read depth in comparison to ICP 5529 reference assembly.
cΔ SNP-index of each SNP positions was calculated using following formula: Δ SNP-index = SNP-index of OLS − SNP-index of LLS.
dValue in parenthesis indicates the codon change due to SNP/Code for changed amino acids.