| Literature DB >> 21193580 |
Matthew D Robbins1, Sung-Chur Sim, Wencai Yang, Allen Van Deynze, Esther van der Knaap, Tarek Joobeur, David M Francis.
Abstract
The history of tomato (Solanum lycopersicum L.) improvement includes genetic bottlenecks, wild species introgressions, and divergence into distinct market classes. This history makes tomato an excellent model to investigate the effects of selection on genome variation. A combination of linkage mapping in two F(2) populations and physical mapping with emerging genome sequence data was used to position 434 PCR-based markers including SNPs. Three-hundred-and-forty markers were used to genotype 102 tomato lines representing wild species, landraces, vintage cultivars, and contemporary (fresh market and processing) varieties. Principal component analysis confirmed genetic divergence between market classes of cultivated tomato (P <0.0001). A genome-wide survey indicated that linkage disequilibrium (LD) decays over 6-8 cM when all cultivated tomatoes, including vintage and contemporary, were considered together. Within contemporary processing varieties, LD decayed over 6-14 cM, and decay was over 3-16 cM within fresh market varieties. Significant inter-chromosomal (gametic phase) LD was detected in both fresh market and processing varieties between chromosomes 2 and 3, and 2 and 4, but in distinct chromosomal locations for each market class. Additional LD was detected between chromosomes 3 and 4, 3 and 11, and 4 and 6 in fresh market varieties and chromosomes 3 and 12 in processing varieties. These results suggest that breeding practices for market specialization in tomato have led to a genetic divergence between fresh market and processing types.Entities:
Mesh:
Year: 2010 PMID: 21193580 PMCID: PMC3060673 DOI: 10.1093/jxb/erq367
Source DB: PubMed Journal: J Exp Bot ISSN: 0022-0957 Impact factor: 6.992
Number of markers polymorphic within different classes of tomato
| No. of entries | indel | SNP | SSR | Total | |
| Processing | 39 | 27 (22) | 104 (64) | 39 (27) | 170 (113) |
| Fresh market | 24 | 22 (16) | 101 (62) | 38 (26) | 161 (104) |
| Vintage cultivars | 21 | 22 (16) | 51 (34) | 33 (22) | 106 (72) |
| Latin American cultivars | 5 | 18 (13) | 57 (38) | 42 (28) | 117 (79) |
| All | 93 | 44 (34) | 154 (96) | 52 (37) | 250 (167) |
| Wild species | 9 | 63 (52) | 167 (117) | 65 (50) | 295 (219) |
| All entries | 102 | 70 (57) | 205 (135) | 65 (50) | 340 (242) |
The number of entries within each class.
The number in parentheses indicates the number of polymorphic markers with known genomic location either by either linkage or physical mapping.
All S. lycopersicum represents cultivated tomato and includes processing, fresh market, vintage, Latin American, and greenhouse cultivars as well as unimproved breeding lines.
Number of markers and size in cM of each chromosome in two linkage maps (Sun1642×LA1589 and Yellow Stuffer×LA1589) and the integrated map
| Chromosome | Framework markers | SNP and indel markers | Total markers | PCR-based markers | Markers with segregation distortion | Average cM between markers | Largest gap (cM) | Genome coverage (%) | Total cM |
| Sun1642×LA1589 | |||||||||
| 1 | 25 | 16 | 41 | 29 | 6 | 3.4 | 13.4 | 100.0 | 135.0 |
| 2 | 17 | 13 | 30 | 20 | 6 | 1.9 | 9.6 | 99.2 | 55.7 |
| 3 | 15 | 18 | 33 | 22 | 5 | 3.4 | 11.3 | 78.4 | 108.7 |
| 4 | 13 | 12 | 25 | 17 | 0 | 4.8 | 21.4 | 86.3 | 114.1 |
| 5 | 11 | 7 | 18 | 10 | 2 | 5.5 | 19.6 | 94.1 | 94.3 |
| 6 | 8 | 9 | 17 | 10 | 8 | 5.2 | 12.7 | 100.0 | 83.2 |
| 7 | 10 | 6 | 16 | 9 | 5 | 6.1 | 17.3 | 98.3 | 91.6 |
| 8 | 12 | 7 | 19 | 11 | 0 | 4.6 | 11.4 | 97.1 | 82.4 |
| 9 | 12 | 5 | 17 | 9 | 0 | 5.2 | 17.1 | 87.7 | 83.3 |
| 10 | 12 | 11 | 23 | 15 | 4 | 4.2 | 11.8 | 100.0 | 93.1 |
| 11 | 9 | 6 | 15 | 8 | 9 | 7.1 | 14.7 | 100.0 | 99.5 |
| 12 | 9 | 9 | 18 | 11 | 10 | 5.0 | 11.9 | 87.1 | 85.8 |
| Yellow Stuffer×LA1589 | |||||||||
| 1 | 10 | 6 | 16 | 6 | 3 | 7.7 | 15.5 | 95.2 | 115.8 |
| 2 | 10 | 7 | 17 | 7 | 8 | 5.8 | 10.9 | 99.3 | 93.4 |
| 3 | 9 | 10 | 19 | 10 | 2 | 5.4 | 12.8 | 79.9 | 97.5 |
| 4 | 6 | 4 | 10 | 4 | 1 | 12.8 | 26.1 | 74.4 | 114.8 |
| 5 | 8 | 7 | 15 | 7 | 0 | 6.3 | 30.8 | 86.3 | 88.3 |
| 6 | 7 | 3 | 10 | 3 | 0 | 9.0 | 21.7 | 61.5 | 80.9 |
| 7 | 4 | 3 | 7 | 3 | 7 | 8.8 | 23.5 | 41.3 | 52.6 |
| 8 | 8 | 5 | 13 | 5 | 1 | 5.5 | 19.7 | 51.7 | 66.3 |
| 9 | 7 | 4 | 11 | 4 | 4 | 9.5 | 19.5 | 96.6 | 95.3 |
| 10 | 6 | 4 | 10 | 4 | 0 | 9.8 | 33.2 | 47.9 | 88.4 |
| 11 | 7 | 6 | 13 | 6 | 9 | 7.1 | 17.6 | 71.7 | 84.9 |
| 12 | 8 | 1 | 9 | 1 | 0 | 11.7 | 20.7 | 29.3 | 93.5 |
| Integrated | |||||||||
| 1 | 26 | 22 | 48 | 35 | – | 2.9 | 14.7 | 100.0 | 137.2 |
| 2 | 19 | 20 | 39 | 27 | – | 2.8 | 11.4 | 98.7 | 105.2 |
| 3 | 15 | 29 | 44 | 33 | – | 2.4 | 9.1 | 96.4 | 99.2 |
| 4 | 13 | 16 | 29 | 21 | – | 4.0 | 16.9 | 100.0 | 107.0 |
| 5 | 12 | 14 | 26 | 17 | – | 3.5 | 15.9 | 95.0 | 86.6 |
| 6 | 8 | 12 | 20 | 13 | – | 5.0 | 12.4 | 91.2 | 89.5 |
| 7 | 11 | 9 | 20 | 12 | – | 4.5 | 16.4 | 100.0 | 84.9 |
| 8 | 12 | 12 | 24 | 16 | – | 3.6 | 12.7 | 100.0 | 82.7 |
| 9 | 12 | 9 | 21 | 13 | – | 5.0 | 18.4 | 89.7 | 100.1 |
| 10 | 12 | 15 | 27 | 19 | – | 3.4 | 12.6 | 100.0 | 81.8 |
| 11 | 9 | 12 | 21 | 14 | – | 4.4 | 12.2 | 100.0 | 88.5 |
| 12 | 9 | 10 | 19 | 12 | – | 4.9 | 17.0 | 78.6 | 88.0 |
Percentage of the genome within 10 cM of at least one PCR-based (SSR, SNP, or indel) marker.
Fig. 1.Integrated linkage map based on two F2 populations. Genetically mapped markers are on the right of the linkage groups while physically mapped markers are on the left. Marker names in grey text are framework markers used to merge the two F2 maps. Markers with the prefix ‘LEOH’ were previously developed [Yang (LEOH1-LEOH51), Francis (LEOH100-LEOH365)]. Markers with the prefix ‘SL’ were developed based on SNPs or indels (contains the suffix ‘i’) identified by Van Deynze . Markers in parentheses indicate that, although both a SNP and indel were created from the same locus, only one of the markers was mapped. Physically mapped markers are placed relative to the chromosome in their approximate position based on the framework markers. Markers on the far right of chromosome 9 were mapped using segmental substitution lines (Eshed and Zamir, 1995). Vertical lines indicate approximate boundaries of introgressions based on framework markers with the name of the segmental substitution line in vertical text to the left of the line. (Continued on page 6).
Fig. 3.Plots of linkage disequilibrium (LD) values (r) against genetic distance (cM) between pairs of markers in multiple classes of cultivated tomato. All possible pair-wise combinations of markers on the same chromosome were plotted to visualize LD decay within chromosomes over the entire genome. The r values were calculated separately for processing and fresh market cultivars (B and C, respectively) as well as processing, fresh market, and vintage cultivar classes combined (A). Curves were fit for each plot by second-degree LOESS. The horizontal dotted lines indicate the baseline r values based on the 95th percentile of the distribution of unlinked r values (black) and the fixed r value of 0.1 (grey).
Fig. 4.Heat maps of linkage disequilibrium (LD) values (r) throughout the tomato genome. Markers were ordered on the x and y axes based on genomic location so that each cell of the heat map represents a single marker pair. The r values for each marker pair are on the bottom half of the heat map and are represented by shades of grey from 0.0 (white) increasing in darkness in equal increments of 0.1 to 1.0 (black). The P-values of each r estimate are on the top half of the heat map and are represented by shades of grey from non-significant (P >0.05; white) to highly significant (P <0.0001; black). The combined analysis includes processing, fresh market, and vintage cultivars.
Fig. 2.Graph of the first three Principal Components based on marker frequencies. Fresh market (circles), processing (squares), and vintage (triangles) cultivar groups are indicated by unique symbols.
Summary of genome-wide linkage disequilibrium analysis
| Market class | No. marker pairs | Linkage disequilibrium decay (cM) | |||||||
| Median | St. Dev. | 95th percentile | LOESS | Moving means | |||||
| 95th percentile method | Fixed | 95th percentile method | Fixed | ||||||
| Combined | 5248 | 0.011 | 0.102 | 0.160 | 8.1% | 6.6 | 8.0 | 6 | 6 |
| Processing | 3294 | 0.037 | 0.131 | 0.248 | 5.5% | 6.9 | 14.2 | 6 | 9 |
| Fresh market | 2622 | 0.031 | 0.187 | 0.464 | 2.0% | 3.0 | 16.1 | 2 | 10 |
The number of marker pairs includes only markers polymorphic within each market class.
Linkage disequilibrium was estimated as r values for all possible marker pairs using TASSEL (Bradbury ) and GGT (van Berloo, 2008) software.
The 95th percentile of the distribution of r values for the unlinked markers. This value is the baseline r to estimate LD decay.
Percentage of r estimates with P value <0.01. P values of r estimates were calculated from 1000 permutations using TASSEL software (Bradbury ).
Linkage disequilibrium decay was estimated over genetic distance by the relationship of a baseline r estimate to linked marker pairs using two methods, LOESS and 1 cM moving means. The baseline r value was either fixed at 0.1 or estimated using the 95th percentile of the unlinked markers. Values for r that exceed the baseline are considered to be in linkage disequilibrium.
For the LOESS estimation of LD decay, genetic distance was estimated as the point where the LOESS curve first crosses the baseline r value.
For the means estimation of LD decay, the r values of linked markers were grouped into bins of 1 cM based on the distance between markers. LD decay was estimated as the first bin where the baseline r value was lower than the bin mean.
The combined analysis includes processing, fresh market, and vintage cultivars.
Comparison of inter-chromosomal linkage disequilibrium between processing and fresh market tomato germplasm
| Chromosome | Position | Chromosome | Position | No. | Processing | Fresh Market | |||
| Mean r2 | St. Dev. | Mean | St. Dev. | ||||||
| 2 | 36.3–47.3 | 3 | 71.2–87.9 | 33 | 0.0648 | 0.0682 | 0.5776 | 0.2813 | <0.0001 |
| 2 | 47.3–51.6 | 3 | 71.2–76.7 | 10 | 0.2094 | 0.0287 | 0.0203 | 0.0167 | <0.0001 |
| 2 | 36.3–45.2 | 4 | 100.0–105.7 | 10 | 0.2278 | 0.1610 | 0.0569 | 0.0525 | 0.0372 |
| 2 | 36.3–47.3 | 4 | 53.2–61.7 | 30 | 0.0294 | 0.0249 | 0.4362 | 0.2324 | <0.0001 |
| 3 | 76.7–87.9 | 4 | 53.2–61.7 | 17 | 0.0506 | 0.0407 | 0.4837 | 0.2346 | <0.0001 |
| 3 | 76.7–87.9 | 11 | 46.4–48.5 | 8 | 0.0581 | 0.0777 | 0.3346 | 0.1080 | 0.0009 |
| 3 | 52.5–94.9 | 12 | 49.7–65.8 | 13 | 0.1596 | 0.1496 | 0.0257 | 0.0196 | 0.0012 |
| 4 | 53.2–68.5 | 11 | 46.4–48.5 | 18 | 0.0249 | 0.0228 | 0.2358 | 0.1306 | <0.0001 |
Chromosomes being compared.
Genetic map position (cM) within the specified chromosomes. The position is derived from the integrated linkage map (Fig. 1).
Number of marker pairs in the comparison. Only marker pairs with r estimates in both classes were included.
Mean r values of all marker pairs between the two chromosomal regions.
P value of a paired t test of the mean r estimates of processing versus fresh market entries.