| Literature DB >> 32265983 |
Ze Peng1, Zifan Zhao1, Josh Paul Clevenger2, Ye Chu3, Dev Paudel1, Peggy Ozias-Akins3, Jianping Wang1,4.
Abstract
Cultivated peanut (Arachis hypogaea L.) forms root nodules to enable a symbiotic relationship with rhizobia for biological nitrogen fixation. To understand the genetic factors of peanut nodulation, it is fundamental to genetically map and clone the genes involved in nodulation. For genetic mapping, high throughput genotyping with a large number of polymorphic markers is critical. In this study, two sets of sister recombinant inbred lines (RILs), each containing a nodulating (Nod+) and non-nodulating (Nod-) line, and their Nod+ parental lines were extensively genotyped. Several next generation sequencing (NGS) methods including target enrichment sequencing (TES), RNA-sequencing (RNA-seq), genotyping by sequencing (GBS), and the 48K Axiom Arachis2 SNP array, and various analysis pipelines were applied to identify single nucleotide polymorphisms (SNP) among the two sets of RILs and their parents. TES revealed the largest number of homozygous SNPs (15,947) between the original parental lines, followed by the Axiom Arachis2 SNP array (1,887), RNA-seq (1,633), and GBS (312). Among the five SNP analysis pipelines applied, the alignment to A/B genome followed by HAPLOSWEEP revealed the largest number of homozygous SNPs and highest concordance rate (79%) with the array. A total of 222 and 1,200 homozygous SNPs were polymorphic between the Nod+ and Nod- sister RILs and between their parents, respectively. A graphical genotype map of the sister RILs was constructed with these SNPs, which demonstrated the candidate genomic regions harboring genes controlling nodulation across the whole genome. Results of this study mainly provide the pros and cons of NGS and SNP genotyping platforms for genetic mapping in peanut, and also provide potential genetic resources to narrow down the genomic regions controlling peanut nodulation, which would lay the foundation for gene cloning and improvement of nitrogen fixation in peanut.Entities:
Keywords: RNA sequencing; SNP array; genotyping by sequencing; nodulation; peanut; single nucleotide polymorphism; target enrichment sequencing
Year: 2020 PMID: 32265983 PMCID: PMC7105825 DOI: 10.3389/fgene.2020.00222
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Five different alignment and SNP filtering pipelines.
| M1 | A/B | Overall | Yes | No | Yes |
| M2 | A + B | Unique | No | No | Yes |
| M3 | A + B | Unique | Yes | No | Yes |
| M4 | A/B | Overall | No | Yes | No |
| M5 | A + B | Unique | No | Yes | No |
FIGURE 1(A) Probe target regions, (B) on-target rate of mapped reads, and (C) reads coverage for target enrichment sequencing data.
Summary of SNPs between PI 262090 and UF 487A from five different methods using target enrichment sequencing RNA sequencing, and genotyping by sequencing data and concordance rate with array-overlapped SNPs.
| TES | M1 | 88/7,540 | 86/7,316 | 2/224 | 17(19.32%) | 17 (19.77%) | 0 |
| M2 | 92/22,584 | 21/18,146 | 71/4,438 | 57(61.96%) | 9 (42.86%) | 48 (67.61%) | |
| M3 | 13/1,283 | 12/2,938 | 1/132 | 5(38.46%) | 4 | 1 | |
| M4 | 44/10,157 | – | 44/10,157 | 36(81.82%) | – | 36 (81.82%) | |
| M5 | 30/2,694 | – | 30/2,694 | 23(76.67%) | – | 23 (76.67%) | |
| RNA-seq | M1 | 30/1,199 | 26/1,175 | 4/24 | 8(26.67%) | 4 (15.38%) | 4 |
| M2 | 108/14,684 | 82/13,897 | 26/787 | 33(30.56%) | 10 (12.20%) | 23 (88.46%) | |
| M3 | 13/297 | 11/285 | 2/18 | 3(23.08%) | 2 | 1 | |
| M4 | 17/901 | – | 17/901 | 14(82.35%) | – | 14 (82.35%) | |
| M5 | 9/288 | – | 9/288 | 6(66.67%) | – | 6 (66.67%) | |
| GBS | M1 | 1/161 | 0/159 | 1/2 | − | – | – |
| M2 | 1/171 | 0/134 | 1/37 | − | – | – | |
| M3 | 0/9 | 0/9 | 0/0 | − | – | – | |
| M4 | 1/278 | – | 1/278 | − | – | – | |
| M5 | 0/15 | – | 0/15 | − | – | – | |
FIGURE 2Comparison of identified and concordant SNPs among the five SNP analysis pipelines for target enrichment sequencing and RNA-sequencing data. For panels (A,B), the number outside shows the total number of SNPs identified from each method. For panels (C,D), the number before “/” shows the number of validated SNPs, the number after “/” shows the number of SNPs from each method that are overlapped with the Axiom Arachis2 SNP array.
Comparison of target enrichment sequencing, RNA sequencing, genotyping by sequencing, and the Axiom Arachis2 array.
| Pre-knowledge of DNA sequences | Yes | No | No | Yes |
| Efforts of bioinformatics analysis | High | High | High | Low |
| Price/sample | ∼$450 | ∼$260 | ∼$35 | ∼$28 |
| No. of homozygous SNPs identified | 15,947 | 1,633 | 312 | 1,887 |
| Per SNP per sample cost | ∼$0.0282 | ∼$0.1592 | ∼$0.1122 | ∼$0.0148 |
FIGURE 3Graphical map showing polymorphic genomic regions between E4 and E5. Each line represents a homozygous SNP. Each circle represents a candidate gene.
FIGURE 4Graphical map showing polymorphic genomic regions between E6 and E7. Each line represents a homozygous SNP. Each circle represents a candidate gene.