| Literature DB >> 36235493 |
Nayoung Ro1, Mesfin Haile1, Bitsam Kim1, Gyu-Taek Cho1, Jungro Lee1, Yoon-Jung Lee1, Do Yoon Hyun2.
Abstract
Eggplant is one of the most economically and nutritionally important vegetables worldwide. The study of the association of phenotypic traits with genetic factors is vital for the rapid and efficient identification and selection of eggplant genetic resources for breeding purposes with desired traits. The eggplant resources (587) collected from different countries, including Korea, were used for establishing the core collection. A total of 288 accessions were selected from 587 Solanum accessions based on 52 single nucleotide polymorphisms (SNPs) markers together with 17 morphological traits. This core collection was further used to analyze the genetic associations of eggplant morphological variations. A large variation was found among the evaluated eggplant accessions for some agro-morphological traits. Stem prickles and leaf prickles showed a significant positive correlation (r = 0.83***), followed by days to flowering and days to maturity (r = 0.64***). A total of 114,981 SNPs were filtered and used for phylogenetic tree analysis, population structure analysis, and genome-wide association study (GWAS). Among the agro-morphological traits, significantly associated SNPs were found for six traits. A total of 377 significantly associated SNPs with six agro-morphological traits were identified. These six traits and the number of SNPs were: days to maturity (51), flower size (121), fruit width (20), harvest fruit color (42), leaf prickles (38), and stem prickles (105). The largest fraction of significant SNPs (11.94%) was obtained on chromosome Ch01, followed by Ch07 and Ch06 with 11.67% and 10.08%, respectively. This study will help to develop markers linked to the most important agro-morphological traits of eggplant genetic resources and support the selection of desirable traits for eggplant breeding programs.Entities:
Keywords: GBS; GWAS; SNP marker; agro-morphological trait; eggplant
Year: 2022 PMID: 36235493 PMCID: PMC9571982 DOI: 10.3390/plants11192627
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
The summary of qualitative agro-morphological traits of 288 eggplant collection.
| Traits | Frequency | Percentage | Traits | Frequency | Percentage |
|---|---|---|---|---|---|
|
|
| ||||
| Absent | 58 | 20.14 | Upright | 15 | 5.21 |
| Present | 230 | 79.86 | Intermediate | 260 | 90.28 |
|
| Prostrate | 13 | 4.51 | ||
| Absent | 202 | 70.14 |
| ||
| Present | 86 | 29.86 | Small (<2 cm) | 33 | 11.46 |
|
| Medium (2–3 cm) | 250 | 86.81 | ||
| Absent | 266 | 92.36 | Large (> 3 cm) | 5 | 1.74 |
| Present | 22 | 7.64 | |||
|
|
| ||||
| Absent | 257 | 89.24 | White | 47 | 16.32 |
| Present | 31 | 10.76 | Purple | 176 | 61.11 |
|
| Light purple | 61 | 21.18 | ||
| Absent | 168 | 58.33 | White and purple (mixed) | 4 | 1.39 |
| Present | 120 | 41.67 | |||
|
|
| ||||
| Round | 80 | 27.78 | Green | 65 | 22.57 |
| Oval | 29 | 10.07 | White | 24 | 8.33 |
| Ovate | 15 | 5.21 | Green-purple | 35 | 12.15 |
| Pear type | 10 | 3.47 | Light purple (more white) | 16 | 5.56 |
| Club | 94 | 32.64 | Purple | 116 | 40.28 |
| Elliptical | 49 | 17.01 | Orange | 15 | 5.21 |
| Cylindrical | 11 | 3.82 | Yellow | 17 | 5.90 |
|
|
| ||||
| Green | 110 | 38.19 | Tan (pale brown) | 136 | 47.22 |
| White | 10 | 3.47 | Yellow | 108 | 37.50 |
| White-purple (advanced purple) | 20 | 6.94 | Green | 3 | 1.04 |
| Green-purple | 41 | 14.24 | Green purple | 6 | 2.08 |
| Purple | 104 | 36.11 | Light purple | 2 | 0.69 |
| Yellow | 2 | 0.69 | Purple | 9 | 3.13 |
| White-purple (advanced white) | 1 | 0.35 | Red | 24 | 8.33 |
The summary of quantitative agro-morphological traits of 288 eggplant collection.
| Traits | No. of Germplasm | Min | Max | Average | SD |
|---|---|---|---|---|---|
| Plant height (cm) | 288 | 13.20 | 209.10 | 87.76 | 24.20 |
| Fruit width (cm) | 288 | 0.10 | 25.05 | 5.77 | 2.28 |
| Fruit length (cm) | 288 | 0.60 | 37.22 | 16.80 | 7.54 |
| Days to flowering | 288 | 27 | 110 | 60.03 | 12.47 |
| Days to maturity | 288 | 88 | 156 | 111.57 | 9.24 |
Figure 1Pearson’s correlation of phenotypic traits of eggplant accessions in a clustered heatmap. (A: hypocotyl anthocyanin, B: growth habit, C: stem anthocyanin, D: stem prickles, E: leaf prickles, F: calyx prickles, G: flower size, H: flower color, I: fruit shape, J: immature fruit color, K: mature fruit color, L: fruit color at harvest, M: plant height, N: fruit length, O: fruit width, P: days to flowering, and Q: days to maturity). Significant correlation indicated with asterisks (* p < 0.05; ** p < 0.01; *** p < 0.001).
Figure 2PCA plot based on the phenotypic data of 288 eggplant accessions (A: variables and B: individuals; each dot represents a single accession). Letters (A–Q) on the PCA plots indicate phenotype traits; A: hypocotyl anthocyanin, B: growth habit, C: stem anthocyanin, D: stem prickles, E: leaf prickles, F: calyx prickles, G: flower size, H: flower color, I: fruit shape, J: immature fruit color, K: mature fruit color, L: fruit color at harvest, M: plant height, N: fruit length, O: fruit width, P: days to flowering, and Q: days to maturity.
Figure 3The distribution of SNPs generated from 288 eggplant genetic resources across 12 chromosomes.
Summary of sequencing raw data.
| Set | Sequencing File | No. of Barcode | No. of Sample | No. of Reads | Avg. Length (bp) | Total Length (bp) | GC(%)*1 | Q30(%)*2 | No. of Demultiplexed Reads (%) |
|---|---|---|---|---|---|---|---|---|---|
| Set1 | R1 | 96 | 96 | 372,814,189 | 151 | 56,294,942,539 | 45.37 | 92.99 | 692,370,340 (92.86%) |
| R2 | 372,814,189 | 151 | 56,294,942,539 | ||||||
| Set2 | R1 | 96 | 96 | 371,354,724 | 151 | 56,074,563,324 | 45.91 | 92.43 | 722,810,406 (97.32%) |
| R2 | 371,354,724 | 151 | 56,074,563,324 | ||||||
| Set3 | R1 | 96 | 96 | 371,510,483 | 151 | 56,098,082,933 | 45.22 | 92.5 | 700,525,094 (94.28%) |
| R2 | 371,510,483 | 151 | 56,098,082,933 | ||||||
| Total | 288 | 288 | 2,231,358,792 | 336,935,177,592 |
Raw and trimmed sequence statistics.
| Average/Plant | Total | |
|---|---|---|
| Sum of raw reads | 7,346,200.83 | 2,115,705,840 |
| Total length of raw reads | 1,109,276,325.83 | 3.19472 × 1011 |
| Sum of trimmed reads | 6,710,718.03 | 1,932,686,794 |
| Total length of trimmed reads (bp) | 761,869,536.23 | 2.19418 × 1011 |
| Avg. length of trimmed reads (bp) | 113.32 | |
| Trimmed/Raw (%) | 91.34% | |
| Sum of trimmed reads | 6,710,718.03 | 1,932,686,794 |
| No. of mapped reads | 6,358,092.97 | 1,831,130,776 |
| Percent of mapped reads (%) | 94.62% | |
| No. of mapped region | 84,273.83 | 24,270,863 |
| Avg. depth of mapped region (#) | 25.41 | |
| Median depth of mapped region (#) | 9.57 | |
| Total length of mapped region (bp) | 19,439,752.74 | 5,598,648,789 |
| Avg. length of mapped region (bp) | 228.16 | |
| Reference Genome coverage (%) | 1.7011% |
Statistics of SNP filtering process.
| Filter Stage | Filter Item | No. of SNPs |
|---|---|---|
| 1 | Total SNP | 1,859,683 |
| 2 | MAF (minor allele frequency) >5% *1 | 618,245 |
| 3 | Missing data <30% *2 | 692,147 |
| 4 | Missing data <30% & MAF >5% | 114,981 |
(*1) MAF (minor allele frequency) >5%: SNPs with a minor allele frequency greater than 5% are selected from all samples of the locus. (*2) Missing data <30%: SNPs with missing data less than 30% were selected from all samples of the left.
Figure 4Population structure of 288 eggplant germplasms based on SNPs markers (5000, randomly selected). At K = 3, the population were estimated to be three (Pop1, Pop2, Pop3) based on STRUCTURE analysis.
Figure 5The 3D PCA (A) and DAPC (B) of 288 eggplant genetic resources based on 114,981 SNPs. Each color represents the population inferred by the structure program. The first three principal components; PC1, PC2, and PC3 accounted for 45%, 24% and 26.6%, respectively.
Figure 6The evolutionary history was inferred using the Neighbor-Joining method. The optimal tree with the sum of branch length = 5.02 is shown. The evolutionary distances were computed using the Maximum Composite Likelihood method and are in the units of the number of base substitutions per site. Bootstrap percentages test of 1000 replicates are shown next to the branches. The analysis involved 288 nucleotide sequences. All ambiguous positions were removed for each sequence pair. There was a total of 114,981 positions in the final dataset. Evolutionary analyses were conducted in MEGA6. Each number/code outside the branch represents the eggplant accessions.
Figure 7Manhattan plot depicting the association of 17 agro-morphological traits using 288 eggplant genetic resources. Each dot represents a single SNP, with the X-axis showing genomic location (chromosomes: colored and labeled) and Y-axis showing association level. The horizontal, green line represents the cut-off of the significant association.
Top 10 significantly associated SNPs with leaf prickles, stem prickles, flower size, fruit width, fruit color at harvest and days to maturity.
| Chromosomes | Position (bp) | Ref. | Alt. | -Log ( | Genic/ Intergenic | Gene ID | Feature | Description | |
|---|---|---|---|---|---|---|---|---|---|
| Leaf prickle | Ch02 | 79092061 | T | C | 9.9 | Intergenic | - | - | - |
| Ch02 | 10028712 | A | C | 8.75 | Intergenic | - | - | - | |
| Ch03 | 80612399 | T | A | 9.54 | Genic | SMEL_003g184780.1.01 | exon,CDS | Protein of unknown function | |
| Ch03 | 35561623 | C | T | 8.14 | Genic | SMEL_003g176660.1.01 | Intron | ARF ADP-ribosylation factor 2 | |
| Ch04 | 98734254 | T | C | 8.54 | Genic | SMEL_004g217730.1.01 | Intron | Protein of unknown function | |
| Ch05 | 7207267 | C | T | 8.52 | Genic | SMEL_005g229060.1.01 | exon,CDS | DPMS1 Dolichol-phosphate mannosyltransferase subunit 1 | |
| Ch05 | 7552127 | G | A | 8.43 | Intergenic | - | - | - | |
| Ch06 | 102955264 | C | T | 8.11 | Genic | SMEL_006g264280.1.01 | exon,CDS | Protein of unknown function | |
| Ch08 | 54103350 | T | A | 7.96 | Genic | SMEL_008g305950.1.01 | exon,CDS | MAG5 Protein transport protein SEC16A homolog | |
| Ch09 | 556607 | C | T | 9.27 | Genic | SMEL_009g320070.1.01 | Intron | At5g64680 Mediator-associated protein 2 | |
| Stem prickle | Ch01 | 1564720 | T | G | 9.19 | Genic | SMEL_001g116710.1.01 | Intron | PPRD2 Polyprenol reductase 2 |
| Ch04 | 433115 | A | T | 8.65 | Intergenic | - | - | - | |
| Ch05 | 31730496 | A | G | 11.32 | Genic | SMEL_005g234070.1.01 | exon,CDS | GSTT1 Glutathione S-transferase T1 | |
| Ch08 | 4114368 | C | T | 10.33 | Genic | SMEL_008g300170.1.01 | Intron | PCMP-H12 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | |
| Ch08 | 13353668 | T | C | 8.87 | Genic | SMEL_008g302900.1.01 | Intron | APY2 Apyrase 2 (Arabidopsis thaliana OX = 3702) | |
| Ch10 | 8483799 | T | G | 10.92 | Genic | SMEL_010g341450.1.01 | Intron | ETFA Electron transfer flavoprotein subunit alpha, mitochondrial | |
| Ch10 | 6032590 | C | T | 10.05 | Genic | SMEL_010g340650.1.01 | exon,CDS | MAA3 Probable helicase MAGATAMA 3 | |
| Ch10 | 95813547 | C | T | 9.48 | Genic | SMEL_010g353430.1.01 | exon,CDS | ECI1 Enoyl-CoA delta isomerase 1, peroxisomal | |
| Ch10 | 914775 | G | T | 9.23 | Genic | SMEL_010g336590.1.01 | exon,CDS | BRO1 Vacuolar-sorting protein BRO1 | |
| Ch11 | 71457233 | A | T | 8.9 | Genic | SMEL_011g379480.1.01 | Intron | RPL35 60S ribosomal protein L35 | |
| Flower size | Ch01 | 12681936 | C | T | 8.15 | Genic | SMEL_001g126710.1.01 | exon,CDS | Pentatricopeptide repeat-containing protein At5g14770, mitochondrial |
| Ch02 | 59617555 | G | A | 8.31 | Genic | SMEL_002g159930.1.01 | exon,CDS | ASF1A Probable histone chaperone ASF1A | |
| Ch03 | 85061794 | T | C | 8.16 | Intergenic | - | - | ||
| Ch04 | 4151064 | T | C | 9.52 | Genic | SMEL_004g202900.1.01 | Intron | UVR8 Ultraviolet-B receptor UVR8 | |
| Ch05 | 4580830 | T | G | 8.93 | Genic | SMEL_005g227340.1.01 | exon,CDS | UVR8 Ultraviolet-B receptor UVR8 | |
| Ch09 | 34681620 | A | T | 8.67 | Genic | SMEL_009g335180.1.01 | exon,CDS | MACPF domain-containing protein At1g14780 | |
| Ch09 | 34681628 | A | C | 8.67 | Genic | SMEL_009g335180.1.01 | exon,CDS | MACPF domain-containing protein At1g14780 | |
| Ch10 | 103432323 | A | C | 8.69 | Genic | SMEL_010g358670.1.01 | Intron | G2/mitotic-specific cyclin-1 | |
| Ch11 | 64268771 | C | G | 8.56 | Genic | SMEL_011g374540.1.01 | exon,CDS | RR21 Two-component response regulator ORR21 | |
| Ch12 | 98996000 | G | T | 8.32 | Genic | SMEL_012g398260.1.01 | Intron | ARSA1 ATPase ARSA1 | |
| Fruit width | Ch01 | 759741 | C | T | 11.56 | Genic | SMEL_001g115700.1.01 | exon,CDS | UBC23 Probable ubiquitin-conjugating enzyme E2 23 |
| Ch01 | 771722 | G | T | 8.67 | Genic | SMEL_001g115720.1.01 | Intron | PIP2-7 Aquaporin PIP2-7 | |
| Ch01 | 769956 | G | C | 8.64 | Genic | SMEL_001g115710.1.01 | exon,CDS | Protein of unknown function | |
| Ch02 | 76635827 | T | A | 8.16 | Genic | SMEL_002g162940.1.01 | exon,CDS | MTERF6 Transcription termination factor MTERF6, chloroplastic/mitochondrial | |
| Ch07 | 89696723 | A | G | 9.69 | Genic | SMEL_007g282970.1.01 | exon,CDS | Protein of unknown function | |
| Ch07 | 101888117 | C | A | 8.51 | Genic | SMEL_007g283780.1.01 | exon,CDS | Protein of unknown function | |
| Ch07 | 25532159 | G | A | 7.62 | Genic | SMEL_007g277290.1.01 | Intron | Similar to Sucrose synthase | |
| Ch07 | 31284604 | A | T | 7.38 | Intergenic | - | - | - | |
| Ch09 | 15705642 | C | T | 8.13 | Genic | SMEL_009g325830.1.01 | exon,CDS | Protein of unknown function | |
| Ch09 | 15705607 | G | A | 8.1 | Genic | SMEL_009g325830.1.01 | exon,CDS | Protein of unknown function | |
| Fruit color at Harvest | Ch01 | 30200788 | T | A | 7.61 | Genic | SMEL_001g137250.1.01 | exon,CDS | Protein of unknown function |
| Ch05 | 4580830 | T | G | 7.35 | Genic | SMEL_005g227340.1.01 | exon,CDS | UVR8 Ultraviolet-B receptor UVR8 | |
| Ch05 | 10921833 | A | G | 7.26 | Intergenic | - | - | - | |
| Ch06 | 96640955 | C | T | 8.84 | Genic | SMEL_006g258500.1.01 | Intron | CLPX3 CLP protease regulatory subunit CLPX3, mitochondrial | |
| Ch06 | 103417425 | G | A | 7.17 | Genic | SMEL_006g264870.1.01 | exon,UTR | At1g51745 Uncharacterized protein At1g51745 | |
| Ch08 | 101047764 | C | T/C | 10.96 | Genic | SMEL_008g313940.1.01 | exon,CDS | Protein of unknown function | |
| Ch08 | 104699628 | G | G/A | 7.87 | Genic | SMEL_008g315650.1.01 | Intron | YPTM2 GTP-binding protein YPTM2 | |
| Ch08 | 2280526 | C | T | 7.27 | Genic | SMEL_008g298990.1.01 | exon,CDS | WRKY41 Probable WRKY transcription factor 41 | |
| Ch10 | 89790523 | A | A/G | 7.74 | Genic | SMEL_010g350710.1.01 | CDS,exon | Protein of unknown function | |
| Ch10 | 64792041 | G | G/T | 7.41 | Genic | SMEL_010g347400.1.01 | Intron | Protein of unknown function | |
| Days to Maturity | Ch01 | 48274449 | A | G | 7 | Intergenic | - | - | - |
| Ch03 | 25048685 | G | A | 10.94 | Genic | SMEL_003g175720.1.01 | Intron | LIG4 DNA ligase 4 | |
| Ch03 | 88779347 | A | G | 9.34 | Genic | SMEL_003g192220.1.01 | Intron | PPL1 PsbP-like protein 1, chloroplastic | |
| Ch05 | 38653250 | C | T | 12.47 | Genic | SMEL_005g238120.1.01 | CDS,exon | 4CLL5 4-coumarate--CoA ligase-like 5 | |
| Ch05 | 40876927 | C | G | 9.75 | Genic | SMEL_005g237880.1.01 | exon,CDS | ACT7 Actin-7 | |
| Ch06 | 97308270 | C | T | 8.62 | Genic | SMEL_006g259010.1.01 | exon,CDS | Protein of unknown function | |
| Ch07 | 126054497 | G | A | 9.31 | Genic | SMEL_007g286710.1.01 | exon,CDS | PHYC Phytochrome C | |
| Ch09 | 3651169 | G | T | 8.65 | Genic | SMEL_009g321780.1.01 | exon,CDS | PAL5 Phenylalanine ammonia-lyase | |
| Ch12 | 86643722 | T | C | 8.32 | Genic | SMEL_012g392350.1.01 | exon,CDS | Protein of unknown function | |
| Ch12 | 93119092 | C | T | 8 | Genic | SMEL_012g394590.1.01 | exon,CDS | Protein of unknown function |