| Literature DB >> 27901114 |
Padma Nimmakayala1, Venkata L Abburi1, Thangasamy Saminathan1, Suresh B Alaparthi1, Aldo Almeida1, Brittany Davenport1, Marjan Nadimi1, Joshua Davidson1, Krittika Tonapi1, Lav Yadav1, Sridhar Malkaram1, Gopinath Vajja1, Gerald Hankins1, Robert Harris1, Minkyu Park2, Doil Choi2, John Stommel3, Umesh K Reddy1.
Abstract
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27901114 PMCID: PMC5128918 DOI: 10.1038/srep38081
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Principal component analysis (PCA) showing distribution of global Capsicum annuum accession collections with 7, 331 single nucleotide polymorphisms (SNPs).
(A) PCA with accessions grouped by capsaicin content. (B) PCA with accessions grouped by fruit weight. See Table S2 for a list of accessions and for respective eigen values to locate individual accessions on the graph.
Figure 2Admixture in subpopulations resolved using 7, 331 single nucleotide polymorphisms (SNPs) along with capsaicin content (high, medium and low) (B) and SNPs and Fruit weight (large, medium and small) among C. annuum accession collections by population structure, a model-based approach.
(A) K4 had the highest peak (based on Delta K distribution), so four clusters sufficiently define C. annuum population structure based on capsaicin content and fruit weight.
Figure 3Individuals are ordered by their Euclidean distance (identity by state) and clustering with the tree is shown on top.
Heat map key is presented on the top left corner (values rescaled from 0.6 to 1).
Figure 4Genome-wide window-based pairwise F values of low and high FW (blue) and overall FST values (red) across chromosomes.
Note F distribution on part of chromosome 11 showed distinct sweep.
Figure 5Genome-wide window-based pairwise FST values of small and large fruit types (blue) and overall FST values (red) across chromosomes.
Note FST distribution on part of chromosome 11 showed distinct sweep.
Figure 6Marker associations (r2) across various chromosomes.
EM analysis is carried for (A) adjacent individual SNPs, (B) adjacent haplotypes, and (C) SNPs located in individual genes.
Figure 7High and low linkage disequilibrium (LD) block regions interspersed across the 12 C. annuum chromosomes.
Figure 8LD distribution across the 177 Mb sweep region specific to fruit weight identified on chromosome 11.
Annotation of significantly associated SNPs for capsaicin and dihydrocapsaicin content.
| Trait/markers | Locus | Gene annotation | Location of SNP | Ma→Mi | Amino acid # | Molecular function | Biological process |
|---|---|---|---|---|---|---|---|
| S1_31111874 | CA01g09570 | F-box/FBD/LRR-repeat protein | Promoter | C→G | — | Unknown function | Regulation of axonogenesis |
| S3_211558976 | CA03g19170 | Mitochondrial carrier protein MTM1-like | Exon 1 | A→G | V→I# | Metallochaperone activity | Manganese and phosphate ion transport |
| S5_215972421 | CA05g15510 | Laccase cupredoxin | Exon 5 | C→G | T→S# | Ferroxidase activity | Seed germination and root elongation |
| S5_227837981 | CA05g18080 | Ankyrin repeat-containing protein | Exon 1 | A→G | D→G# | Acyltransferase, Transferase | Cell growth regulation |
| S5_229634509 | CA05g18740 | IKI3 family protein | Exon 3 | A→T | K→R# | Histone acetyltransferase, tRNA binding | Transcription and protein transport |
| S6_203416571 | CA06g14430 | ABC transporter G family | Exon 20 | C→T | A→V# | ATPase activity, coupled to transmembrane movement | Transmembrane transport of lipids |
| S10_156251204 | CA10g10170/CA10g10180 | Zinc-finger CCHC type/GRAS transcription factor | Intergenic | G→A | — | Nucleic acid binding/DNA binding | mRNA processing/Regulation of transcription |
| S10_172735351 | CA10g10850 | Nuclear transport factor 2 (NTF2) family protein | Exon 1 | T→C | N→N | Protein transport | Nucleo-cytoplasm transport |
| S10_221317647 | CA10g16820 | 4-hydroxy-tetrahydrodipicolinate synthase | Intron 4 | A→C | — | 4-hydroxy-tetrahydrodipicolinate synthase | Lysine biosynthesis, diaminopimelate bisynthesis |
| S10_225598553 | CA10g18180 | Short-chain type alcohol dehydrogenase | 3-UTR | C→T | — | Oxidoreductase | — |
| S11_83592400 | CA11g09150 | ABC transporter C family member 3-like | Intron 1 | A→T | — | ATPase activity, coupled to transmembrane movement | Transmembrane transport of lipids |
| S11_85543247 | CA11g09200/CA11g09210 | Transparent Testa 12/F-box protein | Intergenic | C→T | — | DNA binding transcription factor/Protein binding | Proanthocyanidin biosynthesis, seed development/- |
| S11_85543251 | CA11g09200/CA11g09210 | Transparent Testa 12/F-box protein | Intergenic | T→C | — | DNA binding transcription factor/Protein binding | Proanthocyanidin biosynthesis, seed development/- |
| S11_85543257 | CA11g09200/CA11g09210 | Transparent Testa 12/F-box protein | Intergenic | A→G | — | DNA binding transcription factor/Protein binding | Proanthocyanidin biosynthesis, seed development/- |
| S3_212689068 | CA03g19340 | Putative hydrolase | Promoter | T→C | — | Hydrolase activity | Regulation of cell proliferation |
| S10_213596026 | CA10g15030 | Syntaxin-71-like (t-SNARE family) | Intron 3 | A→T | — | Protein transporter, SNAP receptor | Vesicle transport, protein target to membrane |
| S2_78953537 | CA02g05900/CA02g05910 | ATP-dependent DNA helicase pcrA/Peroxidase | Intergenic | A→C | — | ATP binding, DNA binding/metal ion binding, peroxidase activity | DNA replication/lignin biosynthesis, H2O2 catabolism |
| S2_136027877 | CA02g11920 | Detected protein of unknown function | Exon 8 | C→G | H→D# | — | — |
| S5_6535912 | CA05g02820 | Detected protein of unknown function | Promoter | C→T | — | — | — |
| S5_30787082 | CA05g06630/CA05g06640 | Pentatricopeptide repeat protein/CCCH-type zinc finger protein | Intergenic | T→C | — | RNA binding/DNA binding | Transit peptide processing/Negative regulation of transcription |
| S5_213819539 | CA05g15030/CA05g15040 | Intracellular transport USO-1 protein/Basic 7 S globulin 2 | Intergenic | C→G | — | Protein transporter activity/aspartic-type endopeptidase activity | SNARE complex and vesicles docking/- |
| S6_107738639 | CA06g07610/CA06g07620 | F-Box protein/Elongation factor 1-alpha | Intergenic | C→G | — | Glycoprotein binding/translation elongation, GTPase activity | DNA repair, protein ubiquination/Protein biosynthesis |
| S8_142512369 | CA08g18030/CA08g18040 | Serine/threonine-protein kinase SMG1/Serine/threonine-protein kinase SMG1-like | Intergenic | G→A | — | ATP, metal ion binding/ATP, metal ion binding | mRNA transport, response to stress/mRNA transport, response to stress |
| S9_220486511 | CA09g12880 | Os09g0132600 protein | Exon 4 | C→G | H→D# | Uncharacterized protein | — |
| S11_249688869 | CA11g16730/CA11g16740 | Nectarin 1/Disease resistant protein BS2 | Intergenic | A→G | — | Manganese ion binding, SOD activity/Nucleotide binding | Nutrient reservoir activity/disease resistance |
1Korea genome locus number; + or −, direction of transcription on + or – strand; Ma→Mi - Major and minor alleles from mapping population; #nonsynonymous mutation.
Annotation of significantly associated SNPs for fruit weight.
| Trait/markers | Locus | Gene annotation | Location of SNP | Ma→Mi | Amino acid # | Molecular function | Biological process |
|---|---|---|---|---|---|---|---|
| S1_178148471 | CA01g23190 | Isopenicillin N epimerase (AAT_I superfamily) | Exon 1 | C→T | S→N# | ADP binding, catalytic activity | Penicillin biosynthesis |
| S1_178214095 | CA01g23200 | protein transport protein SEC23-like (zf, MIDAS domain) | Promoter | A→G | — | Zinc ion binding | Intracellular protein transport |
| S2_169874314 | CA02g30530/CA02g30540 | Na+/H+ antiporter/Glucose-6-phosphate 1-dehydrogenase | Intergenic | C→T | — | Sodium:proton antiporter activity/NADP binding | Multiple cellular processes/glucose metabolic process |
| S3_230322338 | CA03g24610 | SNF1-related protein kinase/RAD50-interacting protein | Intron 4 | C→T | — | ATP binding, protein kinase/membrane traffic between the Golgi and ER | Nitrate assimilation, phosphorylation, carbohydrate metabolism/protein transport and Golgi transport |
| S3_230372266 | CA03g24640 | Ubiquitin-like modifier-activating enzyme 5-like (thiamine synthesis) | Intron 10 | A→T | — | Cofactor binding | Small protein activating enzyme activity |
| S5_131824978 | CA05g10770 | Unknown protein | Promoter | C→T | — | — | — |
| S6_202147247 | CA06g14190/CA06g14200 | STYLOSA protein/Flavin monooxygenase | Intergenic | C→T | — | Auxin transporter inhibitor/NADP binding | Regulation of floral organ identity/glucosinolate biosynthesis |
| S6_202147285 | CA06g14190/CA06g14200 | STYLOSA protein/Flavin monooxygenase | Intergenic | G→A | Auxin transporter inhibitor/NADP binding | Regulation of floral organ identity/glucosinolate biosynthesis | |
| S6_202147337 | CA06g14190/CA06g14200 | STYLOSA protein/Flavin monooxygenase | Intergenic | C→→T | Auxin transporter inhibitor/NADP binding | Regulation of floral organ identity/glucosinolate biosynthesis | |
| S6_202147420 | CA06g14190/CA06g14200 | STYLOSA protein/Flavin monooxygenase | Intergenic | C→T | Auxin transporter inhibitor/NADP binding | Regulation of floral organ identity/glucosinolate biosynthesis | |
| S6_227195619 | CA06g22610 | Chloroplastic- FAF-like protein | Exon 1 | C→G | N→K# | — | — |
| S8_132459145 | CA08g12350 | DnaQ-like exonuclease | Intron 2 | C→T | — | DNA binding | Exonuclease activity |
| S9_250224149 | CA09g16860 | Mitochondrial-processing peptidase subunit alpha | Intron 11 | C→T | — | Metal ion binding | Processing proteins targeting to mitochondrion |
| S10_229225552 | CA10g19740 | Cell division control protein 45 (CDC45) | Exon 1 | A→G | S→S | Chromatin binding, helicase activity | Cell division, DNA replication |
| S11_94177155 | CA11g09530 | Clathrin assembly protein | Promoter | A→C | low-density lipoprotein particle receptor binding | Intracellular protein transport, endocytosis | |
| S12_72971688 | CA12g10020/CA12g10030 | CLAVATA1 receptor kinase/Pentatricopeptide repeat | Intergenic | G→T | Protein kinase, ATP binding/RNA binding | Regulation of meristem structural organization, cell differentiation/Transit peptide processing | |
| S6_204245052 | CA06g14620 | TRS120 isoform | Intron | G→A | Cell plate assembly | Cell division | |
| S6_204246361 | CA06g14620 | TRS120 isoform | Exon 6 | G→A | R→G# | Cell plate assembly | Cell division |
| S10_231176974 | CA10g20890/CA10g20900 | Unknown function/Transcription factor | Intergenic | T→C | -/DNA binding | -/Transcription | |
| S4_215751345 | CA04g19880/CA04g19890 | LIM domain containing protein/DNA damage-binding protein | Intergenic | A→G | DNA and zinc ion binding/damaged DNA binding | Regulation of transcription/protein ubiquitination, nucleotide excision repair | |
1Korea genome locus number; + or −, direction of transcription on + or − strand; Ma→Mi - Major and minor alleles from mapping population; #nonsynonymous mutation.
Figure 9Manhattan plot of the genome-wide association study for fruit weight and capsaicinoids (capsaicin and dihydrocapsaicin).
Chromosome coordinates are displayed along the X-axis, with the negative log-10 of the association P-value for each SNP on the Y-axis. Higher negative log-10 indicates stronger association with the trait. Venn diagrams are of the unique and common significantly associated SNPs for capsaicin and dihydrocapsaicin content and fruit weight in 2011 and 2012.