| Literature DB >> 33328432 |
Pietro Gramazio1, Leandro Pereira-Dias2, Santiago Vilanova2, Jaime Prohens2, Salvador Soler2, Javier Esteras3, Alfonso Garmendia4, María José Díez5.
Abstract
The wild Solanum pimpinellifolium (SP) and the weedy S. lycopersicum var. cerasiforme (SLC) are largely unexploited genetic reservoirs easily accessible to breeders, as they are fully cross-compatible with cultivated tomato (S. lycopersicum var. lycopersicum). We performed a comprehensive morphological and genomic characterization of four wild SP and four weedy SLC accessions, selected to maximize the range of variation of both taxa. These eight accessions are the founders of the first tomato interspecific multi-parent advanced generation inter-cross (MAGIC) population. The morphoagronomic characterization was carried out with 39 descriptors to assess plant, inflorescence, fruit and agronomic traits, revealing the broad range of diversity captured. Part of the morphological variation observed in SP was likely associated to the adaptation of the accessions to different environments, while in the case of SLC to both human activity and adaptation to the environment. Whole-genome resequencing of the eight accessions revealed over 12 million variants, ranging from 1.2 to 1.9 million variants in SLC and from 3.1 to 4.8 million in SP, being 46.3% of them (4,897,803) private variants. The genetic principal component analysis also confirmed the high diversity of SP and the complex evolutionary history of SLC. This was also reflected in the analysis of the potential footprint of common ancestors or old introgressions identified within and between the two taxa. The functional characterization of the variants revealed a significative enrichment of GO terms related to changes in cell walls that would have been negatively selected during domestication and breeding. The comprehensive morphoagronomic and genetic characterization of these accessions will be of great relevance for the genetic analysis of the first interspecific MAGIC population of tomato and provides valuable knowledge and tools to the tomato community for genetic and genomic studies and for breeding purposes.Entities:
Year: 2020 PMID: 33328432 PMCID: PMC7603519 DOI: 10.1038/s41438-020-00395-w
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Collection sites and GPS coordinates of the four S. pimpinellifolium (SP) and four S. lycopersicum var. cerasiforme (SLC) used in this study.
| Accession code | BGV006454 | BGV015382 | BGV013720 | BGV007145 | BGV006769 | BGV007931 | LA2251 | PI487625 |
| Study code | SP1 | SP2 | SP3 | SP4 | SLC1 | SLC2 | SLC3 | SLC4 |
| Collection site | Chulucanas, Piura, Peru | Jamalca, Department of Amazonas, Peru | Nazca, Ica, Peru | El Carmen, Manabí, Ecuador | Cayambe Coca National Park, Napo, Ecuador | Ahome, Sinaloa, Mexico | Soritor, San Martín, Peru | Los Diamantes, Costa Rica |
| Latitude | 5°08′51″S | 5°53′08″S | 14°34′49″S | 0°13′19″S | 0°01′52″S | 26°03′00″N | 6°08′00″S | 9°60′61″N |
| Longitude | 80°16′13″W | 78°10′34″W | 74°53′14″W | 79°29′20″W | 77°47′10″W | 109°22′00″W | 77° 05′ 00″ W | 84°14′03″W |
Fig. 1Pictures of leaves, stem, inflorescence, flower and fruit of the four S. pimpinellifolium (SP1–SP4) and four S. lycopersicum var. cerasiforme (SLC1–SLC4) accessions used in this study.
The yellow bars in the upper right of leaves and inflorescence pictures indicate 10 cm, while the yellow bars in the lower right of fruit pictures indicate 1 cm. The grid cells in fruit pictures have a size of 1 cm × 1 cm
List of the 39 traits with abbreviations and units used for the morphoagronomic characterization of the four S. pimpinellifolium (SP) and four S. lycopersicum var. cerasiforme (SLC) accessions of this study.
| Code | Descriptor name | Descriptor scale/unit |
|---|---|---|
| Plant descriptors | ||
| SPD | Stem pubescence density | 1–7 (1: absent; 7: dense) |
| SA | Stem anthocyanin coloration | 0–2 (0: absent; 2: high) |
| LT | Leaf type | 1–6 (1: dwarf; 2: potato; 3: standard; 4: |
| LN | Leaflets number | Number |
| SLN | Small leaflets number | Number |
| LB | Leaflet border | 1–4 (1: entire; 4: highly serrated) |
| LVA | Leaf vein anthocyanin presence | 0–1 (0: absent; 1: present) |
| 1TH | 1st truss height | cm |
| 3TL | 3rd truss length | cm |
| 5TL | 5th truss length | cm |
| Inflorescence descriptors | ||
| IT | Inflorescence type | 1–3 (1: uniparous; 2: both; 3: multiparous) |
| FPT | Flowers per truss (2nd and 3rd) | Number |
| LP | Presence of leaves in the inflorescence | 0–2 (0: absent; 2: presence in all) |
| SP | Presence of shoots in the inflorescence | 0–2 (0: absent; 2: presence in all) |
| SPOS | Style position | 0–7 (0: inserted; 7: highly exerted) |
| Fruit descriptors | ||
| FC | Exterior colour of immature fruit | 1–9 (1: greenish-white; 9: very dark green) |
| GI | Greenback intensity | 0–7 (0: absent; 7: strong) |
| FP | Fruit pubescence | 0–7 (0: absent; 7: dense) |
| FS | Fruit shape | 1–4 (1: flattened; 2: slightly flattened; 3: round; 4: elongated) |
| RCE | Ribbing at calyx end | 1–7 (1: very weak; 7: strong) |
| FCS | Fruit cross-sectional shape | 1–3 (1: round; 3: irregular) |
| SPS | Shape of pistil scar | 1–4 (1: dot; 2: stellate; 3: lineal; 4: irregular) |
| BES | Fruit blossom end shape | 1–3 (1: indented; 4: pointed) |
| FF | Fruit fasciation | 3–7 (3: slight; 7: severe) |
| FH | Fruit height | mm |
| FW | Fruit width | mm |
| W | Fruit weight | g |
| SS | Shoulder shape | 1–7 (1: flat; 7: strongly depressed) |
| PT | Pericarp thickness | mm |
| LOC | Locule number | Number |
| L | Exterior fruit colour lightness | L |
| A | Exterior fruit colour a | a |
| B | Exterior fruit colour b | b |
| Internal fruit quality descriptors | ||
| Acid | Acidity | % of citric acid (1 g/sample) |
| Brix | Sugar content | °Brix |
| AA | Ascorbic acid | mg/L (liquid extract) |
| pH | pH | |
| Agronomic descriptors | ||
| DM | Days to maturity of first fruita | Number of days |
| D50% | Days until 50% of plants with mature fruitsb | Number of days |
aFrom sowing until one plant has one ripe fruit
bFrom sowing until 50% of plants have at least one fruit ripened
Mean and range for the groups of four S. pimpinellifolium and four S. lycopersicum var. cerasiforme for the morphoagronomic descriptors assessed in this study.
| Code | ||||
|---|---|---|---|---|
| Mean | Range | Mean | Range | |
| Plant descriptors | ||||
| SPD | 2.30 | 0.00–5.00 | 5.67 | 3.00–7.00 |
| SA | 1.14 | 0.56–2.00 | 0.35 | 0.00–1.00 |
| LT | 3.00 | 3.00–3.00 | 5.00 | 5.00–5.00 |
| LN | 6.74 | 6.40–7.00 | 7.37 | 6.67–8.80 |
| SLN | 8.24 | 3.15–17.00 | 13.33 | 8.00–17.56 |
| LB | 1.54 | 1.00–2.00 | 2.79 | 2.17–3.00 |
| 1TH | 10.92 | 5.50–17.00 | 13.98 | 8.00–17.72 |
| 3TL | 20.57 | 11.35–28.44 | 16.39 | 10.75–19.75 |
| 5TL | 26.12 | 13.06–36.17 | 21.03 | 17.25–30.17 |
| Inflorescence descriptors | ||||
| IT | 1.15 | 1.00–1.60 | 1.63 | 1.00–2.00 |
| FPT | 15.84 | 10.00–20.67 | 9.41 | 8.40–11.33 |
| LP | 0.59 | 0.00–1.00 | 0.95 | 0.33–1.56 |
| SP | 0.23 | 0.00–0.60 | 0.89 | 0.00–1.78 |
| SPOS | 5.00 | 3.00–7.00 | 2.75 | 0.00–5.00 |
| Fruit descriptors | ||||
| FC | 2.00 | 1.00–3.00 | 1.50 | 1.00–3.00 |
| GI | 3.50 | 3.00–5.00 | 3.83 | 3.00–6.00 |
| FP | 2.25 | 0.00–3.00 | 3.88 | 3.00–5.00 |
| FS | 2.75 | 2.00–3.00 | 2.46 | 1.45–3.00 |
| RCE | 0.00 | 0.00–0.00 | 0.79 | 0.00–1.67 |
| FCS | 1.00 | 1.00–1.00 | 1.42 | 1.00–2.00 |
| SPS | 1.00 | 1.00–1.00 | 1.10 | 1.00–1.39 |
| BES | 2.00 | 2.00–2.00 | 2.00 | 2.00–2.00 |
| FF | 0.00 | 0.00–0.00 | 0.72 | 0.00–1.67 |
| FH | 1.51 | 1.46–1.60 | 2.24 | 1.73–2.80 |
| FW | 1.34 | 1.07–1.76 | 2.49 | 1.47–3.70 |
| W | 1.52 | 0.79–2.89 | 10.39 | 1.91–23.83 |
| SS | 1.50 | 1.00–3.00 | 2.08 | 1.00–5.00 |
| PT | 0.17 | 0.12–0.20 | 0.30 | 0.18–0.39 |
| LOC | 2.00 | 2.00–2.00 | 2.35 | 2.00–3.04 |
| L | 31.27 | 29.42–35.13 | 32.21 | 30.12–35.98 |
| A | 27.09 | 22.46–29.63 | 21.47 | 16.47–24.59 |
| B | 11.96 | 9.41–16.52 | 12.45 | 10.62–15.04 |
| Internal fruit quality descriptors | ||||
| Acid | 1.00 | 0.75–1.43 | 0.72 | 0.40–0.90 |
| Brix | 5.37 | 3.15–6.83 | 4.69 | 3.15–5.89 |
| AA | 30.77 | 23.00–43.10 | 32.03 | 26.53–40.67 |
| pH | 4.26 | 4.09–4.46 | 4.31 | 4.01–4.56 |
| Agronomic descriptors | ||||
| DM | 55.00 | 52.00–58.00 | 66.67 | 62.00–74.00 |
| DM50% | 56.25 | 55.00–58.00 | 69.63 | 65.00–75.00 |
Fig. 2Correlations among morphoagronomic traits and accessions.
Hierarchical clustering heatmap for the four S. pimpinellifolium (SP1–SP4) and S. lycopersicum var. cerasiforme (SLC1–SLC4) accessions for the 37 polymorphic morphoagronomic traits assessed in this study
Fig. 4Principal component analysis (PCA) comparison.
PCA similarities based on the morphological characterization of 39 traits (a) and on the whole set of SNPs identified in this study (9,106,964) (b) for the four S. pimpinellifolium (SP1–SP4) and four S. lycopersicum var. cerasiforme (SLC1–SLC4) accessions. The first and second principal coordinates (PC) are displayed. Study codes as in Table 1
Statistics of sequencing and mapping of the four S. pimpinellifolium and four S. lycopersicum var. cerasiforme accessions re-sequenced using Heinz 1706 SL4.0 as a reference genome[26] (RF).
| Mean | Total | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SP1 | SP2 | SP3 | SP4 | SLC1 | SLC2 | SLC3 | SLC4 | |||
| Raw reads (million) | 122.7 | 105.4 | 108.3 | 105.1 | 124.0 | 112.1 | 119.8 | 116.9 | 114.2 | 914.3 |
| Yield (Gb) | 18.5 | 15.9 | 16.3 | 15.8 | 18.7 | 16.9 | 18.0 | 17.6 | 17.2 | 137.7 |
| High-quality reads (million) | 122.3 | 105.3 | 108.3 | 105.0 | 123.9 | 112.1 | 119.7 | 116.8 | 114.1 | 913.4 |
| Total nt. (billion) | 18 | 16 | 16 | 16 | 19 | 17 | 17.9 | 17 | 17.0 | 136.4 |
| Reads mapped (million) | 116 | 100 | 102 | 101 | 120 | 110 | 116 | 114 | 109.8 | 879.1 |
| % reads mapped | 94.4 | 95.1 | 94.3 | 95.9 | 96.7 | 97.3 | 97.1 | 97.3 | 96.0 | – |
| Average depth of coverage | 20.2 | 17.1 | 18.0 | 17.0 | 21.0 | 18.5 | 19.0 | 19.4 | 18.7 | – |
| % average coverage RF | 96.8 | 96.8 | 95.5 | 97.0 | 98.1 | 98.7 | 98.7 | 98.2 | 97.4 | – |
Summary of the variants identified in the four S. pimpinellifolium and four S. lycopersicum var. cerasiforme accessions using using Heinz 1706 SL4.0 as a reference genome[26] (RF).
| Total | ||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SP1 | SP2 | SP3 | SP4 | Mean | SLC1 | SLC2 | SLC3 | SLC4 | Mean | |||||||||||||
| Count | % | Count | % | Count | % | Count | % | Mean | % | Count | % | Count | % | Count | % | Count | % | Count | % | Count | % | |
| Variants | ||||||||||||||||||||||
| Homozygous SNPs | 3,223,730 | 69.5 | 3,032,979 | 78.5 | 3,879,974 | 79.4 | 2,453,673 | 76.9 | 3,147,589 | 76.1 | 1,497,288 | 76.0 | 855,877 | 68.9 | 912,198 | 69.3 | 1,115,582 | 72.2 | 1,095,236 | 71.6 | 9,106,964 | 86.2 |
| Heterozygous SNPs | 647,201 | 14.0 | 184,801 | 4.8 | 209,810 | 4.3 | 189,876 | 6.0 | 307,922 | 7.3 | 129,834 | 6.6 | 125,920 | 10.1 | 143,599 | 10.9 | 149,794 | 9.7 | 137,287 | 9.3 | ||
| Homozygous InDels | 379,109 | 8.2 | 381,565 | 9.9 | 447,678 | 9.2 | 324,948 | 10.2 | 383,325 | 9.4 | 217,208 | 11.1 | 172,779 | 13.9 | 161,126 | 12.2 | 175,312 | 11.3 | 181,606 | 12.1 | 812,034 | 7.7 |
| Heterozygous InDels | 84,534 | 1.8 | 18,288 | 0.4 | 20,876 | 0.4 | 16,667 | 0.5 | 35,091 | 0.8 | 11,262 | 0.6 | 9,928 | 0.8 | 9,689 | 0.7 | 10,972 | 0.7 | 10,463 | 0.7 | ||
| Homozygous complex | 209,751 | 4.5 | 188,390 | 4.8 | 258,072 | 5.3 | 151,032 | 4.8 | 201,811 | 4.9 | 76,890 | 3.9 | 53,003 | 4.2 | 60,497 | 4.6 | 61,577 | 4.0 | 62,992 | 4.2 | 579,456 | 5.5 |
| Heterozygous complex | 57,559 | 1.2 | 26,997 | 0.7 | 32,071 | 0.6 | 24,840 | 0.8 | 35,367 | 0.8 | 18,838 | 0.9 | 15,018 | 1.2 | 18,388 | 1.4 | 18,824 | 1.2 | 17,767 | 1.2 | ||
| Homozygous MNP | 32,803 | 0.7 | 29,863 | 0.8 | 36,797 | 0.7 | 23,648 | 0.7 | 30,778 | 0.7 | 15,916 | 0.8 | 9,748 | 0.8 | 11,157 | 0.8 | 12,513 | 0.8 | 12,334 | 0.8 | 58,804 | 0.6 |
| Heterozygous MNP | 4906 | 0.1 | 2144 | 0.1 | 2469 | 0.1 | 1937 | 0.1 | 2864 | 0.1 | 1501 | 0.1 | 1,055 | 0.1 | 1,261 | 0.1 | 1,355 | 0.1 | 1,293 | 0.1 | ||
| Total homozygous | 3,845,393 | 82.9 | 3,632,797 | 94.0 | 4,622,521 | 94.6 | 2,953,301 | 92.7 | 3,763,503 | 91.1 | 1,807,302 | 91.8 | 1,091,407 | 87.8 | 1,144,978 | 86.9 | 1,364,984 | 88.3 | 1,352,168 | 88.7 | – | – |
| Total heterozygous | 794,200 | 17.1 | 232,230 | 6.0 | 265,226 | 5.4 | 233,320 | 7.3 | 381,244 | 9.0 | 161,435 | 8.2 | 151,921 | 12.2 | 172,937 | 13.1 | 180,945 | 11.7 | 166,810 | 11.3 | – | – |
| Total variants | 4,639,593 | 100.0 | 3,865,027 | 100 | 4,887,747 | 100.0 | 3,186,621 | 100.0 | 4,144,747 | 100.0 | 1,968,737 | 100.0 | 1,243,328 | 100.0 | 1,317,915 | 100.0 | 1,545,929 | 100.0 | 1,518,977 | 100.0 | 10,557,258 | 100.0 |
| % het. variants with RF | 0.103 | 0.030 | 0.034 | 0.030 | 0.049 | 0.021 | 0.020 | 0.022 | 0.023 | 0.022 | ||||||||||||
| % total variants with RF | 0.600 | 0.500 | 0.632 | 0.412 | 0.536 | 0.255 | 0.161 | 0.171 | 0.200 | 0.197 | ||||||||||||
| Private variants | 771,181 | 16.6 | 748,354 | 19.3 | 1,115,616 | 22.8 | 985,453 | 30.9 | 905,151 | 22.4 | 339,481 | 17.2 | 467,864 | 37.6 | 190,559 | 14.5 | 279,295 | 18.0 | 319,299.8 | 21.8 | 4,897,803 | 46.3 |
| Variants in coding regions | ||||||||||||||||||||||
| Homozygous SNPs | 307,824 | 58.6 | 321,285 | 71.8 | 375,090 | 72.3 | 253,723 | 69.1 | 314,481 | 68.0 | 124,716 | 65.0 | 92,841 | 60.0 | 78,951 | 56.0 | 80,019 | 55.2 | 94,132 | 59.1 | 996,772 | 82.4 |
| Heterozygous SNPs | 107,212 | 20.4 | 30,776 | 6.9 | 32,981 | 6.4 | 34,522 | 9.4 | 51,373 | 10.8 | 22,275 | 11.6 | 23,504 | 15.2 | 27,654 | 19.6 | 30,294 | 20.9 | 25,932 | 16.8 | ||
| Homozygous InDels | 58,463 | 11.1 | 61,945 | 13.9 | 70,754 | 13.7 | 51,374 | 14.0 | 60,634 | 13.2 | 29,821 | 15.6 | 26,057 | 16.9 | 22,305 | 15.8 | 22,568 | 15.5 | 25,188 | 15.9 | 131,056 | 10.8 |
| Heterozygous InDels | 16,675 | 3.2 | 3073 | 0.7 | 3411 | 0.6 | 2899 | 0.8 | 6515 | 1.3 | 1802 | 0.9 | 1,719 | 1.1 | 1,612 | 1.1 | 1,880 | 1.3 | 1,753 | 1.1 | ||
| Homozygous complex | 22,053 | 4.2 | 22,772 | 5.0 | 27,306 | 5.3 | 17,461 | 4.8 | 22,398 | 4.8 | 8323 | 4.3 | 6,548 | 4.2 | 5,826 | 4.1 | 5,476 | 3.8 | 6,543 | 4.1 | 76,511 | 6.3 |
| Heterozygous complex | 10,199 | 1.9 | 4550 | 1.0 | 5509 | 1.0 | 4551 | 1.2 | 6202 | 1.3 | 3326 | 1.7 | 2,864 | 1.9 | 3,763 | 2.7 | 3,835 | 2.6 | 3,447 | 2.2 | ||
| Homozygous MNP | 2831 | 0.5 | 2816 | 0.6 | 3228 | 0.6 | 2206 | 0.6 | 2770 | 0.6 | 1282 | 0.7 | 947 | 0.6 | 849 | 0.6 | 872 | 0.6 | 988 | 0.6 | 5,909 | 0.5 |
| Heterozygous MNP | 749 | 0.1 | 327 | 0.1 | 342 | 0.1 | 315 | 0.1 | 433 | 0.1 | 220 | 0.1 | 170 | 0.1 | 212 | 0.1 | 203 | 0.1 | 201 | 0.1 | ||
| Total homozygous | 391,171 | 74.3 | 408,818 | 91.3 | 476,378 | 91.8 | 324,764 | 88.5 | 400,283 | 86.5 | 164,142 | 85.6 | 126,393 | 81.7 | 107,931 | 76.4 | 108,935 | 75.0 | 126,850 | 79.7 | – | – |
| Total heterozygous | 134,835 | 25.7 | 38,726 | 8.7 | 42,243 | 8.2 | 42,287 | 11.5 | 64,523 | 13.5 | 27,623 | 14.4 | 28,257 | 18.3 | 33,241 | 23.6 | 36,212 | 25.0 | 31,333 | 20.3 | – | – |
| Total variants | 526,006 | 100.0 | 447,544 | 100.0 | 518,621 | 100.0 | 367,051 | 100.0 | 464,806 | 100.0 | 191,765 | 100.0 | 154,650 | 100.0 | 141,172 | 100.0 | 145,147 | 100.0 | 158,184 | 100.0 | 1,210,248 | 100.0 |
| % het. variants with RF | 0.017 | 0.005 | 0.005 | 0.005 | 0.008 | 0.003 | 0.004 | 0.004 | 0.005 | 0.004 | ||||||||||||
| % variants with RF | 0.068 | 0.058 | 0.067 | 0.047 | 0.060 | 0.024 | 0.020 | 0.018 | 0.019 | 0.020 | ||||||||||||
Fig. 3Distribution of homozygous variants along chromosome 4 for the four S. pimpinellifolium (SP1–SP4) and four S. lycopersicum var. cerasiforme (SLC1–SLC4) accessions.
The peaks represent high frequencies of variants in a window size of 10 Mbp. The dashed lines of the same colour indicate similar patterns of variant distribution