| Literature DB >> 31185892 |
Paul I Otyama1, Andrew Wilkey2, Roshan Kulkarni1,2, Teshale Assefa1,2, Ye Chu3, Josh Clevenger4, Dan J O'Connor5, Graeme C Wright5, Stanley W Dezern6, Gregory E MacDonald6, Noelle L Anglin7, Ethalinda K S Cannon8, Peggy Ozias-Akins3, Steven B Cannon9.
Abstract
BACKGROUND: Due to the recent domestication of peanut from a single tetraploidization event, relatively little genetic diversity underlies the extensive morphological and agronomic diversity in peanut cultivars today. To broaden the genetic variation in future breeding programs, it is necessary to characterize germplasm accessions for new sources of variation and to leverage the power of genome-wide association studies (GWAS) to discover markers associated with traits of interest. We report an analysis of linkage disequilibrium (LD), population structure, and genetic diversity, and examine the ability of GWA to infer marker-trait associations in the U.S. peanut mini core collection genotyped with a 58 K SNP array.Entities:
Keywords: Genetic diversity; Genome wide association; Linkage disequilibrium; Phylogenetic network tree; Population structure
Mesh:
Year: 2019 PMID: 31185892 PMCID: PMC6558826 DOI: 10.1186/s12864-019-5824-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Genome-wide SNP and haplotype block distribution. a Distribution of SNPs identified in the minicore collection. b Distribution of haplotype blocks along each chromosome. Horizontal dashed lines represent pericentromeric start and end positions inferred from genetic-by-physical plots
Fig. 2Structure within the mini core collection. Population structure plots with each vertical bar representing an accession colored according to the group to which it has been assigned. Accessions assigned to more than one group represents the extent of their admixed set of alleles. a structure for K = 4 subgroups ordered according to market type and subspecies (Top and Bottom bar plots respectively). b Population structure for K = 5 subgroups ordered according to market type and subspecies (Top and Bottom bar plots respectively)
Fig. 3Phylogenetic network showing the relationship among accessions and cultivars. Phylogenetic network constructed using Neighbor-net based on an analysis of 6300 SNPs for all 107 minicore accessions plus 6 select cultivars used as standards in the field for a total of 113 genotypes. Accession labels were color coded according to subspecies and the nodes according to market types
Fig. 4Effect of MAF on the nature of LD and its decay in the mini core collection. a Mean LD estimates computed at different map distances across the entire genome for three varying MAF threshold cutoffs. b LD decay with distance across the 20 chromosomes measured with varying MAF thresholds for r2 = half decay distance. LD decay distance is given in mega base pairs
Mean LD estimates among all SNPs with MAF ≥ 0.05 at different physical distances across the genome
| Distance (kb) |
| mean | SD | Distance (kb) |
| mean | SD |
|---|---|---|---|---|---|---|---|
| 0–0.10 | 301 | 0.870 | 0.268 | 78.8–127.0 | 1600 | 0.440 | 0.388 |
| 0.10–0.16 | 141 | 0.827 | 0.318 | 127.0–204.0 | 1903 | 0.367 | 0.366 |
| 0.16–0.26 | 225 | 0.826 | 0.314 | 204.0–329.0 | 3005 | 0.348 | 0.352 |
| 0.26–0.42 | 278 | 0.828 | 0.321 | 329.0–530.0 | 3630 | 0.245 | 0.283 |
| 0.42–0.67 | 348 | 0.793 | 0.340 | 530.0–853.0 | 4385 | 0.208 | 0.248 |
| 0.67–1.08 | 416 | 0.796 | 0.344 | 853.0–1370.0 | 7389 | 0.236 | 0.265 |
| 1.08–1.74 | 489 | 0.776 | 0.357 | 1370.0–2210.0 | 12,202 | 0.218 | 0.243 |
| 1.74–2.81 | 427 | 0.743 | 0.369 | 2210.0–3560.0 | 17,026 | 0.210 | 0.236 |
| 2.81–4.52 | 378 | 0.745 | 0.358 | 3560.0–5740.0 | 22,778 | 0.210 | 0.235 |
| 4.52–7.28 | 331 | 0.669 | 0.398 | 5740.0–9240.0 | 25,869 | 0.206 | 0.225 |
| 7.28–11.7 | 331 | 0.649 | 0.406 | 9240.0–14,900.0 | 31,364 | 0.207 | 0.229 |
| 11.7–18.9 | 612 | 0.644 | 0.392 | 14,900.0–24,000.0 | 38,171 | 0.219 | 0.236 |
| 18.9–30.4 | 603 | 0.568 | 0.408 | 24,000.0–38,600.0 | 45,340 | 0.216 | 0.245 |
| 30.4–48.9 | 808 | 0.524 | 0.404 | 38,600.0–621,000 | 43,255 | 0.212 | 0.246 |
| 48.9–78.8 | 1027 | 0.450 | 0.386 | 62,100–100,000 | 94,337 | 0.236 | 0.254 |
n number of r2 pairwise values, SD standard deviation, kb 1000 base physical distance
Average LD decay distance estimated for each chromosome and sub-genome
| Chromosome | Minor Allele Frequency | ||
|---|---|---|---|
| Sub-genome A | ≥ 0.05 | ≥ 0.1 | ≥ 0.2 |
| 1 | 5.07 | 25.82 | 71.09 |
| 2 | 2.38 | 4.99 | 17.14 |
| 3 | 1.80 | 5.08 | 18.94 |
| 4 | 9.67 | 27.99 | 41.06 |
| 5 | 3.55 | 8.41 | 34.32 |
| 6 | 1.79 | 4.10 | 17.97 |
| 7 | 2.34 | 8.49 | 16.82 |
| 8 | 1.66 | 4.24 | 19.97 |
| 9 | 3.93 | 17.31 | 77.74 |
| 10 | 5.50 | 13.96 | 67.16 |
| Average | 3.77 | 12.04 | 38.22 |
| Sub-genome B | |||
| 11 | 5.95 | 24.13 | 98.31 |
| 12 | 1.41 | 4.03 | 42.65 |
| 13 | 1.54 | 4.03 | 25.84 |
| 14 | 4.18 | 18.74 | 75.79 |
| 15 | 3.09 | 8.14 | 62.11 |
| 16 | 2.36 | 5.70 | 48.67 |
| 17 | 3.26 | 9.86 | 91.70 |
| 18 | 3.51 | 9.32 | 57.06 |
| 19 | 5.77 | 18.49 | 52.59 |
| 20 | 6.75 | 16.72 | 98.13 |
| Average | 3.78 | 11.92 | 65.29 |
| Genome-wide Average | 3.78 | 11.98 | 51.75 |
Values reported for LD decay at r2 half decay distance in mb
Fig. 5Association results for arachidic and behenic saturated fatty acids. a Histogram showing phenotypic distribution of arachidic, behenic fatty acids and blanchability. The x-axis shows BLUP values over three replicates for fatty acids and BLUPs over three locations for blanchability. The y-axis represents the number of individuals. b Manhattan and QQ plots for Arachidic and Behenic fatty acids. In the Manhattan plots, the blue horizontal line represents a suggestive line defined at 3.71 × 10− 4 and the red line represents an adjusted 5% multiple tests Bonferroni corrected threshold cut off at 1.86 × 10–5
Summary of significant associations and predicted candidate genes for arachidic and behenic fatty acids
| Trait | Marker | Chr | Pos | Marker | Candidate genes | Gene annotation | |
|---|---|---|---|---|---|---|---|
| Arach-idic | AX-147237808 | B01 | 6,039,768 | 3.52E-09 | 0.51845424 | Araip.ZQ4WS | Non-lysosomal ceramidase |
| AX-147237808 | B01 | 6,039,768 | 3.52E-09 | 0.51845424 | Araip.9YT86 | Non-lysosomal ceramidase | |
| AX-147255620 | B07 | 37,296,448 | 4.96E-06 | 0.316757 | Araip.L66QB | Flavin-binding Monooxygenase family protein | |
| AX-147255620 | B07 | 37,296,448 | 4.96E-06 | 0.316757 | Araip.L66QB | ATP binding Microtubule motor family protein | |
| AX-147220134 | A04 | 74,357,243 | 1.80E-04 | 0.214561 | – | – | |
| Behenic | AX-147247765 | B04 | 90,017,134 | 1.74E-11 | 0.38628 | Araip.10024352 | Palmitoyl protein thioesterase family protein |
| AX-147248572 | B04 | 127,788,433 | 1.64E-11 | 0.38871 | Araip.10024375 | Acetylglucos-aminyltransferase familyprotein | |
| AX-147248572 | B04 | 127,788,433 | 1.64E-11 | 0.38871 | Araip.10024487 | Thioesterase superfamily protein | |
| AX-147251762 | B06 | 3,275,121 | 8.25E-11 | 0.39006 | Araip.10017394 | Fatty acid desaturase 2 | |
| AX-147259171 | B08 | 122,074,815 | 5.99E-06 | 0.23662 | – | – | |
| AX-147209429 | A01 | 9,155,638 | 1.30E-10 | 0.3855 | Aradu.10033983 | Diacylglycerol Acyltransferase family | |
| AX-147209429 | A01 | 9,155,638 | 1.30E-10 | 0.3855 | Aradu.10033986 | Diacylglycerol Acyltransferase family |
Abbreviations in the table: chr chromosome, pos Marker position