| Literature DB >> 31645952 |
Aliki Xanthopoulou1, Javier Montero-Pau2, Ifigeneia Mellidou3, Christos Kissoudis4, José Blanca5, Belén Picó5, Aphrodite Tsaballa3, Eleni Tsaliki3, Athanasios Dalakouras3, Harry S Paris6, Maria Ganopoulou3, Theodoros Moysiadis7, Maslin Osathanunkul8,9, Athanasios Tsaftaris4, Panagiotis Madesis7, Apostolos Kalivas3, Ioannis Ganopoulos3.
Abstract
Cucurbita pepo contains two cultivated subspecies, each of which encompasses four fruit-shape morphotypes (cultivar groups). The Pumpkin, Vegetable Marrow, Cocozelle, and Zucchini Groups are of subsp. pepo and the Acorn, Crookneck, Scallop, and Straightneck Groups are of subsp. ovifera. Recently, a de novo assembly of the C. pepo subsp. pepo Zucchini genome was published, providing insights into its evolution. To expand our knowledge of evolutionary processes within C. pepo and to identify variants associated with particular morphotypes, we performed whole-genome resequencing of seven of these eight C. pepo morphotypes. We report for the first time whole-genome resequencing of the four subsp. pepo (Pumpkin, Vegetable Marrow, Cocozelle, green Zucchini, and yellow Zucchini) morphotypes and three of the subsp. ovifera (Acorn, Crookneck, and Scallop) morphotypes. A high-depth resequencing approach was followed, using the BGISEQ-500 platform that enables the identification of rare variants, with an average of 33.5X. Approximately 94.5% of the clean reads were mapped against the reference Zucchini genome. In total, 3,823,977 high confidence single-nucleotide polymorphisms (SNPs) were identified. Within each accession, SNPs varied from 636,918 in green Zucchini to 2,656,513 in Crookneck, and were distributed homogeneously along the chromosomes. Clear differences between subspecies pepo and ovifera in genetic variation and linkage disequilibrium are highlighted. In fact, comparison between subspecies pepo and ovifera indicated 5710 genes (22.5%) with Fst > 0.80 and 1059 genes (4.1%) with Fst = 1.00 as potential candidate genes that were fixed during the independent evolution and domestication of the two subspecies. Linkage disequilibrium was greater in subsp. ovifera than in subsp. pepo, perhaps reflective of the earlier differentiation of morphotypes within subsp. ovifera. Some morphotype-specific genes have been localized. Our results offer new clues that may provide an improved understanding of the underlying genomic regions involved in the independent evolution and domestication of the two subspecies. Comparisons among SNPs unique to particular subspecies or morphotypes may provide candidate genes responsible for traits of high economic importance.Entities:
Keywords: Natural variation in plants; Structural variation
Year: 2019 PMID: 31645952 PMCID: PMC6804688 DOI: 10.1038/s41438-019-0176-9
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Fig. 1A fruit from each of the eight accessions of Cucurbita pepo used in this study.
Top, left to right: “Tondo Chiaro di Nizza” (Pumpkin Group), “Bolognese” (Vegetable Marrow Group), “Tuffy” (Acorn Group), “Yellow Crookneck” (Crookneck Group). Bottom, left to right: “Chrysoulitsa” (Zucchini Group), “Black Beauty” (Zucchini Group), “Romanesco” (Cocozelle Group), and “Benning’s Green Tint” (Scallop Group). The accession information table is in the lower part, and subspecies pepo and ovifera are indicted by one (*) or two asterisks (**), respectively
Quantity and distribution of reads and SNPs for each morphotype
| Morphotype | No. of clean reads | Coverage | % Mapped | Variable sites | % Mis | Ho | ||
|---|---|---|---|---|---|---|---|---|
| No. | % Biallelic | % Monoallelic | ||||||
| Pumpkin | 107,814,560 | 37.8 | 94.1 | 1,077,816 | 28.38 | 71.62 | 1.79 | 0.08 |
| Marrow | 84,887,546 | 29.8 | 93.9 | 1,136,856 | 39.13 | 60.87 | 6.01 | 0.12 |
| Cocozelle | 96,385,370 | 33.8 | 94.0 | 1,194,461 | 43.76 | 56.24 | 5.44 | 0.14 |
| Zucchini (yellow) | 101,593,207 | 35.7 | 94.7 | 1,045,520 | 75.02 | 24.98 | 2.91 | 0.21 |
| Zucchini (green) | 86,968,548 | 30.5 | 95.1 | 636,918 | 53.30 | 46.70 | 3.83 | 0.09 |
| Subspecies | 95,529,846 | 33.5 | 94.4 | 1,018,314 | 47.92 | 52.08 | 4.00 | 0.13 |
| Acorn | 91,160,816 | 30.1 | 94.4 | 2,421,537 | 9.17 | 90.83 | 4.89 | 0.06 |
| Scallop | 93,992,800 | 32.9 | 94.9 | 2,620,106 | 27.61 | 72.39 | 5.34 | 0.20 |
| Crookneck | 106,734,784 | 37.4 | 95.0 | 2,656,513 | 22.60 | 77.40 | 2.84 | 0.16 |
| Subspecies | 97,296,133 | 33.5 | 94.8 | 2,566,052 | 19.79 | 80.21 | 4.36 | 0.14 |
Number of clean reads after cleaning, genome coverage obtained, percentage of mapped reads, number of variable sites compared to the reference genome (no. of variable sites), percentage of the variable sites that are biallelic (%, that is that have two alleles, one identical, and the other different from that of the reference genome) and monoallelic (%, that is that have only one allele, different to that of the reference genome), and percentage of missing data (Mis), and observed heterozygosity (Ho) for the 3,823,977 variable positions found along the genome for all the morphotypes. Average values for each subspecies are also shown. Notice that reference genome was developed from a C. pepo subps. pepo morphotype green Zucchini Spanish accession[18]
Fig. 2A circos diagram illustrating the genome-wide variations among 8 accessions.
Circos plot showing the genomic distribution of a number of single-nucleotide polymorphisms (SNPs), b frequency of heterozygous SNPs, c frequency of homozygous SNPs for the allele not present in the reference genome, d number of genes, e number of SNPs with a high gene impact like stop codon gaining or frameshifts as predicted by SNPEff, f number of SNPs with a moderate gene impact like non-synonymous aminoacid changes, or non-disruptive frameshift, g linkage disequilibrium, and h genetic diversity. In all cases, 100 kb windows have been used, but for linkage disequilibrium (LD) a 50 kb window was used
Genomic location of SNPs
| Location | No. of variants | % |
|---|---|---|
| Intergenic | 2,370,305 | 60.97 |
| Exon | 247,852 | 6.38 |
| Intron | 1,031,861 | 26.54 |
| Splice site region | 26,534 | 0.68 |
| 5′-UTR | 143,721 | 3.70 |
| 3′-UTR | 67,431 | 1.73 |
SNP single-nucleotide polymorphism, UTR untranslated region
Fig. 3Single-nucleotide polymorphisms (SNPs) and percentage of genes with a SNP for each morphotype.
Histograms of the number of SNPs obtained by comparison with reference genome (a) and percentage of genes with SNPs (b) for each morphotype classified according to their predicted effect. For each subspecies, the number of all and common SNPs/genes when considering all of the accessions. Venn diagrams (c) showing the number of genes with and SNPs with high or moderate effect when comparing both subspecies and when comparing morphotypes within subspecies
Fig. 4Genetic diversity and structure of eight accesions.
a Phylogenetic network showing the relationship among the eight Cucurbita pepo accessions. b Boxplot of genetic diversity (π) of 1 kb windows and c linkage disequilibrium (r2) for the species, using all accessions, and for each subspecies. Horizontal dashed lines show the background linkage disequilibrium
Fig. 5Genome wide differences between Cucurbita pepo subsp. pepo and subsp. ovifera.
Differences in a number of SNPs, b frequency of the alternative allele (allele not present in the reference genome), c genetic differentiation, d nucleotide diversity, and e linkage disequilibrium between Cucurbita pepo subsp. pepo and subsp. ovifera along the chromosomes. Vertical dashed lines indicate the end/beginning of a chromosome. Solid horizontal line marks the absence of differences
Fig. 6Genes and genetic diversity.
a Number of genes with different levels of diversity in Cucurbita pepo subsp. pepo (orange) and subsp. ovifera (purple). b Histogram of genetic differentiation (Fst) by gene among subsp. pepo and subsp. ovifera and c distribution along the genome of genes with a Fst = 1
Genes of horticultural interest
| Gene | Trait | GeneID | Genomic location | No. of variants in the gene with predited effect | No. of variants in the gene with HIGH effect | No. of variants in the gene with MODERATE effect | No. of variants upstream/downstream of the gene with predicted effect | |
|---|---|---|---|---|---|---|---|---|
| Cauliflower | Flesh color | Cp4.1LG13g00690 | Cp4.1LG13:628,094–633,068 | 60 | 1 | 2 | 97 | |
| Ethylene-insensitive 3 ( | Flowering | Cp4.1LG04g11790 | Cp4.1LG04:8,670,547–8,672,394 | 8 | 1 | 2 | 0 | |
| Flowering time control protein ( | Flowering | Cp4.1LG17g10910 | Cp4.1LG17:8,241,363–8,252,364 | 339 | 4 | 86 | 201 | |
| Squamosa promoter binding protein-like (2) | Flowering | Cp4.1LG17g10620 | Cp4.1LG17:8,083,301–8,086,452 | 81 | 1 | 7 | 167 | |
| Unusual floral organs | Flowering | Cp4.1LG20g02860 | Cp4.1LG20:1,553,934–1,555,718 | 56 | 2 | 7 | 117 | |
| Protein IQ-DOMAIN (2) | Fruit morphology | Cp4.1LG03g08410 | Cp4.1LG03:2,815,265–2,818,054 | 38 | 1 | 0 | 40 | |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG03g01040 | Cp4.1LG03:1,235,448–1,237,683 | 40 | 1 | 10 | 76 | |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG03g06270 | Cp4.1LG03:4,147,734–4,152,353 | 69 | 4 | 6 | 193 | |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG05g00950 | Cp4.1LG05:499,361–506,004 | 71 | 1 | 9 | 94 | * |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG08g10310 | Cp4.1LG08:7,948,336–7,951,922 | 65 | 2 | 11 | 65 | |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG10g04070 | Cp4.1LG10:1,593,038–1,598,092 | 54 | 1 | 4 | 119 | |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG12g10340 | Cp4.1LG12:9,512,443–9,518,623 | 46 | 2 | 5 | 46 | * |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG15g00200 | Cp4.1LG15:105,059–109,560 | 60 | 2 | 10 | 23 | * |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG15g01180 | Cp4.1LG15:901,114–905,856 | 40 | 2 | 12 | 57 | * |
| Tonneau recruiting motif | Fruit morphology | Cp4.1LG17g00460 | Cp4.1LG17:325,022–334,267 | 93 | 1 | 9 | 103 |
Selected genes, genomic locations, traits affected, number of changes predicted in total and with high and moderate effect, and number of changes 5 kb upstream or downstream of the genes that can have a potential effect are shown. For the TON1 recruiting motif proteins, asterisks identify those proteins that are phylogenetically close to TRM proteins of tomato known to interact with OVATE
Fig. 7Tonneau recruiting motif structure.
a Unrooted maximum-likelihood tree of Cucurbita pepo (purple), Arabidopsis thaliana (green), and Solanum lycorpersicum (red) orthologs containing a Tonneau recruiting motif. Node support expressed as Shimodaira–Hasegawa approximate likelihood -ratio (SH-aLRT) support (%)/a Bayes support/ultrafast bootstrap support (%). Nodes with a SH-aLRT lower than 70 have been collapsed with their parent node. Labels of nodes with a 100/1/100 support have been omitted for clarity. b Distribution of conserved motifs for each of the orthologs containing a Tonneau recruiting motif found by MEME. Sequences showing a M10 motif are likely to interact with OVATE