| Literature DB >> 27857720 |
Padma Nimmakayala1, Venkata L Abburi1, Thangasamy Saminathan1, Aldo Almeida1, Brittany Davenport1, Joshua Davidson1, C V Chandra Mohan Reddy1, Gerald Hankins1, Andreas Ebert2, Doil Choi3, John Stommel4, Umesh K Reddy1.
Abstract
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST ) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9-2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers.Entities:
Keywords: genome-wide association mapping; genotyping by sequencing; haplotyping; linkage disequilibrium; peduncle length; population structure
Year: 2016 PMID: 27857720 PMCID: PMC5093146 DOI: 10.3389/fpls.2016.01646
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Chromosome-wise distribution 13,129 SNPs with MAF of ≥0.05 for .
| 1 | 1443 |
| 2 | 1220 |
| 3 | 1447 |
| 4 | 1259 |
| 5 | 1198 |
| 6 | 1302 |
| 7 | 844 |
| 8 | 752 |
| 9 | 752 |
| 10 | 970 |
| 11 | 922 |
| 12 | 1020 |
| Total | 13,129 |
Figure 1First and second principal component analysis (PCA) components for 36,621 single nucleotide polymorphisms (SNPs) in a set of 377 diverse pepper accessions (283 . See Table S2 for a list of accessions and eigen values for respective positions of individual accessions in the figure.
Figure 2First and second PCA components for 13,129 SNPs within 283 . See Table S3 for a list of accessions and eigen values for respective positions of individual accessions in the figure.
Figure 3Manhattan plot of chromosome-wise overall fixation index (. Individual F-values are in Table S4.
Figure 4Frequency spectrum for chromosomal means for nucleotide diversity (π) and Tajima's D for .
Figure 5Genome-wide distribution of marker associations (.
Chromosome-wise distribution of LD blocks for .
| 1 | 721 | 71.861 | 3948.923 |
| 2 | 636 | 91.05 | 10856.72 |
| 3 | 756 | 67.397 | 3122.154 |
| 4 | 603 | 88.686 | 7216.827 |
| 5 | 670 | 110.614 | 6046.404 |
| 6 | 670 | 88.726 | 4043.506 |
| 7 | 390 | 90.642 | 4527.862 |
| 8 | 400 | 96.781 | 6962.39 |
| 9 | 401 | 169.852 | 4569.492 |
| 10 | 471 | 138.615 | 8694.085 |
| 11 | 494 | 117.027 | 13021.65 |
| 12 | 533 | 104.171 | 7352.753 |
| Overall | 6745 | 99.11 | 13021.65 |
Figure 6Manhattan plot of the genome-wide association study for peduncle length in Range of observed peduncle length. (B) Chromosome coordinates are on the X-axis, with the negative log-10 of the association P-value for each SNP on the Y-axis. High negative log-10 indicates strong association with the trait. Histograms show effects of significantly associated SNPs for peduncle length. (C) Four SNPs located in the intergenic space between the oxidoreductase family protein/arogenate dehydrogenase on chromosome 7 that explained 10.6% of the total variation for peduncle length.
Annotation of significantly associated SNPs for peduncle length in .
| S7_19145046 | 1.13E−06 | 5.947 | 0.015 | CA07g03460/CA07g03470 | Intergenic | G → C | – | Oxidoreductase family protein/Arogenate dehydrogenase |
| S7_19145048 | 1.13E−06 | 5.947 | 0.007 | ” | ” | C → T | ” | Oxidoreductase family protein/Arogenate dehydrogenase |
| S7_19145066 | 1.13E−06 | 5.947 | 0.005 | ” | ” | T → A | ” | Oxidoreductase family protein/Arogenate dehydrogenase |
| S7_19145073 | 1.13E−06 | 5.947 | 0.004 | ” | ” | C → T | ” | Oxidoreductase family protein/Arogenate dehydrogenase |
| S2_134518344 | 3.89E−06 | 5.410 | 0.010 | CA02g11490 | Exon | G → C | R → P | Phospho-n-acetylmuramoyl-pentapeptide-transferase |
| S11_725918 | 7.48E−06 | 5.126 | 0.016 | CA11g00270/CA11g00280 | Intergenic | C → G | – | GABA-specific permease/Unknown protein |
| S2_121116327 | 2.01E−05 | 4.697 | 0.038 | CA02g09090 | Exon | G → T | T → K | LON peptidase N-terminal domain and RING finger protein |
| S3_12740983 | 2.22E−05 | 4.653 | 0.036 | CA03g04980 | Intron | C → T | – | Eukaryotic translation initiation factor 2 subunit alpha |
| S11_190326151 | 3.12E−05 | 4.505 | 0.046 | CA11g12020 | Exon | G → A | S → S | Tho2 protein |
| S11_3937182 | 5.13E−05 | 4.289 | 0.061 | CA11g01740 | Exon | T → A | Q → L | Hydroxyproline-rich glycoprotein |
| S3_200716267 | 6.71E−05 | 4.173 | 0.073 | CA03g17680 | Intron | A → C | – | Pre-mRNA cleavage factor IM |
| S11_246730373 | 0.0001 | 3.916 | 0.122 | CA11g15960 | Intron | G → A | – | ATP-dependent RNA helicase |
| S4_137196865 | 0.0001 | 3.864 | 0.128 | CA04g10860/CA04g10870 | Intergenic | C → T | – | Amino acid transporter/UDP-glucose 6-dehydrogenase |
| S4_137196912 | 0.0001 | 3.864 | 0.120 | ” | ” | C → A | ” | Amino acid transporter/UDP-glucose 6-dehydrogenase |
| S10_223493543 | 0.0002 | 3.807 | 0.128 | CA10g17500/CA10g17510 | Intergenic | C → G | Cytochrome b559 subunit alpha/Aluminum-activated malate transporter | |
| S8_126682716 | 0.0002 | 3.730 | 0.144 | CA08g09170 | Exon | C → T | G → R | Ribosomal protein S11 |
| S8_126682746 | 0.0002 | 3.730 | 0.136 | ” | ” | C → A | A → S | Ribosomal protein S11 |
| S9_252073885 | 0.0004 | 3.449 | 0.195 | CA09g18340 | Promoter | G → A | – | Reticulon-like protein B21 |
| S9_252073890 | 0.0004 | 3.449 | 0.187 | ” | ” | G → A | ” | Reticulon-like protein B21 |
| S11_190326131 | 0.0004 | 3.449 | 0.180 | CA11g12020 | Exon | G → A | S → L | Tho2 protein |
| S10_229515157 | 0.0004 | 3.412 | 0.188 | CA10g19840 | Exon | G → A | S → F | Uncharacterized protein |
| S6_2635088 | 0.0004 | 3.381 | 0.195 | CA06g01230/CA06g01240 | Intergenic | T → C | – | Late blight resistance protein Rpi-blb2/Detected protein of confused Function |
| S8_142510499 | 0.0005 | 3.315 | 0.219 | CA08g18030 | Exon | A → T | M → L | Serine/Threonine-protein kinase SMG1 |
| S1_96976222 | 0.0005 | 3.285 | 0.227 | CA01g16010 | Exon | T → C | T → T | Phytochrome |
| S2_139076418 | 0.0005 | 3.263 | 0.231 | CA02g13050 | Intron | T → G | – | Ureidoglycolate hydrolase |
| S3_70295226 | 0.0006 | 3.220 | 0.247 | CA03g11420/CA03g11430 | Intergenic | C → T | – | Detected protein of confused Function/NADH dehydrogenase subunit |
| S3_257225287 | 0.0006 | 3.188 | 0.251 | CA03g36710/CA03g36720 | Intergenic | C → T | – | LRR receptor protein kinase/LRR receptor protein kinase |
| S11_257395608 | 0.0007 | 3.183 | 0.246 | CA11g19730 | Exon | C → T | H → H | ABC transporter |
| S11_257395610 | 0.0007 | 3.183 | 0.239 | ” | ” | C → A | A → D | ABC transporter |
| S2_130946711 | 0.0007 | 3.180 | 0.234 | CA02g10590/CA02g10600 | Intergenic | C → T | – | Nucleic acid binding protein/cleavage and polyadenylation specificity factor CPSF30 |
| S3_252341359 | 0.0008 | 3.078 | 0.289 | CA03g33810 | Intron | T → G | – | DNase I-like superfamily protein |
| S1_131644198 | 0.0009 | 3.046 | 0.303 | CA01g17480 | Promoter | A → G | – | Diacylglycerol kinase variant B |
Nonsynonymous mutation on amino acid due to minor/major allele SNP variation.