| Literature DB >> 22908034 |
Nicolas Ranc1, Stephane Muños, Jiaxin Xu, Marie-Christine Le Paslier, Aurélie Chauveau, Rémi Bounon, Sophie Rolland, Jean-Paul Bouchet, Dominique Brunel, Mathilde Causse.
Abstract
Genome-wide association mapping is an efficient way to identify quantitative trait loci controlling the variation of phenotypes, but the approach suffers severe limitations when one is studying inbred crops like cultivated tomato (Solanum lycopersicum). Such crops exhibit low rates of molecular polymorphism and high linkage disequilibrium, which reduces mapping resolution. The cherry type tomato (S. lycopersicum var. cerasiforme) genome has been described as an admixture between the cultivated tomato and its wild ancestor, S. pimpinellifolium. We have thus taken advantage of the properties of this admixture to improve the resolution of association mapping in tomato. As a proof of concept, we sequenced 81 DNA fragments distributed on chromosome 2 at different distances in a core collection of 90 tomato accessions, including mostly cherry type tomato accessions. The 81 Sequence Tag Sites revealed 352 SNPs and indels. Molecular diversity was greatest for S. pimpinellifolium accessions, intermediate for S. l. cerasiforme accessions, and lowest for the cultivated group. We assessed the structure of molecular polymorphism and the extent of linkage disequilibrium over genetic and physical distances. Linkage disequilibrium decreased under r(2) = 0.3 within 1 cM, and minimal estimated value (r(2) = 0.13) was reached within 20 kb over the physical regions studied. Associations between polymorphisms and fruit weight, locule number, and soluble solid content were detected. Several candidate genes and quantitative trait loci previously identified were validated and new associations detected. This study shows the advantages of using a collection of S. l. cerasiforme accessions to overcome the low resolution of association mapping in tomato.Entities:
Keywords: admixture; association mapping; linkage disequilibrium; tomato (Solanum lycopersicum)
Mesh:
Year: 2012 PMID: 22908034 PMCID: PMC3411241 DOI: 10.1534/g3.112.002667
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 Genetic and physical location of the polymorphic fragments sequenced on chromosome 2. Genetic distances on the EXPEN2000 reference map are indicated on the left of the chromosome. Physical contigs are drawn on the right of the scheme. Cloned QTL are indicated on the left of the chromosome. Gray shaded area indicates homology of contigs on chromosome 2 pseudo-molecule. Numbers of polymorphisms (SNPs and indels) found in noncoding and coding regions are indicated within bracket in the first and second position, respectively. Markers in italics show high LD when compared together.
Distribution and frequencies of polymorphisms (SNP and indel) across species and ratio of polymorphism in coding and noncoding region
| Number of Access. | Number of Total Polymorphic Sites | Number of Shared Polymorph. | Polymorph. Frequency for 1000 bp | Noncoding/Coding Polymorphisms Ratio | ||||
|---|---|---|---|---|---|---|---|---|
| coding | noncoding | |||||||
| 17 | 157 | 0 | 1.66 | 4.27 | 2.57 | |||
| 63 | 349 | 11 | 5 | 5.42 | 8.61 | 1.59 | ||
| 10 | 336 | 0 | 187 | 3 | 5.27 | 8.25 | 1.57 | |
All fragments (81) are taken into account.
Numbers in diagonal indicate species specific polymorphisms.
Figure 2 Molecular diversity of the three groups of tomato based on 352 polymorphisms. Molecular diversity was estimated by Watterson's θ and compared with the total number of polymorphisms (S) for S. pimpinellifolium, S. l. cerasiforme, and S. l. esculentum.
Figure 3 Distribution of polymorphism MAFs among tomato species. S. l. cerasiforme (n = 63) is represented in black, S. l. esculentum (n = 17) in dark gray, and S. pimpinellifolium (n = 10) in light gray. Polymorphisms with overall species MAF lower than 0.05 were previously discarded (see Materials and Methods).
Figure 4 Estimates of LD (r) vs. genetic and physical distance on chromosome 2 for the 63 S. l. cerasiforme accessions. Only polymorphic sites having MAF greater than 5% are indicated (see Materials and Methods). (A) Decay of r over genetic distance on chromosome 2. Plot of r over distance was fitted by nonlinear regression (red curve). (B) Decay of r over physical distance on the five major contigs. Plot of r over distance is fitted by nonlinear regression (red curve). The inset shows a more detailed view of the LD decay curve for markers located less than 20 Kb apart. (C) Matrix of pairwise LD P value between and within physical contigs. P values were calculated with 1000 permutations.
Figure 5 Cumulative density functions (CDF) using several alternative models of association. Model comparisons are performed for FW (A), LCN (B), and SSC (C). Associations are tested for all polymorphic sites with MAF >5% on 90 individuals. Naive GLM (black diamond) and K+Q models, with structure based on SSR markers (white squares), on 4 PCA axis (white circles) and on all STS markers (black squares) were tested. The diagonal indicates uniform distribution of P values under the expectation that random SNPs are unlinked to the polymorphisms controlling these traits (H0: no SNP effect).
Significant associations for fruit weight (FW), locule number (LCN), and soluble solid content (SSC) estimated with K+Q models on 90 accessions
| Trait | Locus | Location | Model A | MAF | Model B | |||
|---|---|---|---|---|---|---|---|---|
| Corrected | R2 | a | Corrected | |||||
| log(FW) | TD091-415 | 54cM | 0.0012 | 0.004 | 0.10 | 10.0 | 0.18 | ns |
| log(FW) | TD091-607 | 54cM | 8.12×10−04 | 0.003 | 0.10 | 9.2 | 0.24 | ns |
| log(FW) | TD049-528 | 72cM | 6.04×10−04 | 0.002 | 0.11 | 9.5 | 0.48 | ns |
| log(FW) | TD363-213 | 76cM | 0.0019 | 0.005 | 0.07 | 9.6 | 0.39 | ns |
| log(FW) | TD383-419 | 84cM-c2.13 | 7.56×10−04 | 0.003 | 0.12 | 12.1 | 0.11 | ns |
| log(FW) | TD383-558 | 84cM-c2.13 | 6.36×10−04 | 0.002 | 0.13 | 11.3 | 0.13 | ns |
| log(FW) | TD383-60 | 84cM-c2.13 | 6.36×10−04 | 0.002 | 0.13 | 11.3 | 0.13 | ns |
| log(FW) | TD375-573 | 84cM-c2.14 | 0.0011 | 0.003 | 0.10 | 9.0 | 0.25 | ns |
| log(FW) | TD133-115 | 84cM-c2.8 | 3.34×10−04 | 0.002 | 0.09 | 7.2 | 0.33 | ns |
| log(FW) | TD133-395 | 84cM-c2.8 | 5.57×10−04 | 0.002 | 0.09 | 7.3 | 0.33 | ns |
| log(FW) | TD387-452 | 84cM-c2.9 | 9.40×10−07 | 4.14×10−05 | 0.19 | 11.6 | 0.27 | 0.025 |
| log(FW) | lcn2.1-686 | 86cM-c2.3 | 2.86×10−05 | 0.001 | 0.12 | −11.7 | 0.38 | ns |
| log(FW) | lcn2.1-692 | 86cM-c2.3 | 8.95×10−06 | 2.63×10−04 | 0.15 | −12.7 | 0.37 | ns |
| log(FW) | TD274-17 | 87.5cM-c3.13 | 9.32×10−04 | 0.003 | 0.08 | 8.9 | 0.26 | ns |
| log(FW) | TD274-325 | 87.5cM-c3.13 | 4.76×10−04 | 0.002 | 0.10 | 9.8 | 0.23 | ns |
| log(FW) | TD377-96 | 87.5cM-c3.14 | 0.0014 | 0.004 | 0.09 | 8.3 | 0.17 | ns |
| log(FW) | TD377-97 | 87.5cM-c3.14 | 0.0023 | 0.005 | 0.08 | 8.5 | 0.16 | ns |
| log(FW) | TD377-98 | 87.5cM-c3.14 | 0.0014 | 0.004 | 0.09 | 8.3 | 0.17 | ns |
| log(FW) | TD377-91 | 87.5cM-c3.14 | 0.0013 | 0.004 | 0.09 | 8.2 | 0.17 | ns |
| log(FW) | TD379-326 | 88cM-c3.11 | 4.42×10−04 | 0.002 | 0.12 | 14.4 | 0.15 | 0.001 |
| log(FW) | TD380-256 | 89cM-c3.8 | 3.04×10−04 | 0.002 | 0.11 | 9.5 | 0.21 | ns |
| log(FW) | TD380-526 | 89cM-c3.8 | 6.13×10−08 | 5.39×10−06 | 0.22 | 13.2 | 0.36 | 0.002 |
| log(FW) | TD280-328 | 89cM-c3.9 | 4.54×10−04 | 0.002 | 0.10 | 10.5 | 0.48 | ns |
| log(FW) | TD055-469 | 89.5cM-c3.7 | 9.46×10−05 | 0.001 | 0.13 | 8.3 | 0.26 | ns |
| log(FW) | TD278-267 | 90cM-c3.3 | 1.73×10−04 | 0.002 | 0.11 | 12.0 | 0.21 | 0.023 |
| log(FW) | TD278-21 | 90cM-c3.3 | 0.003 ns | 0.02 ns | — | — | — | 0.048 |
| log(FW) | TD278-39 | 90cM-c3.3 | 5.23×10−04 | 0.002 | 0.10 | 15.0 | 0.15 | 0.030 |
| log(FW) | TD278-444 | 90cM-c3.3 | 2.30×10−04 | 0.002 | 0.12 | 12.4 | 0.22 | 0.025 |
| log(FW) | TD278-524 | 90cM-c3.3 | 3.81×10−04 | 0.002 | 0.12 | 11.9 | 0.20 | 0.030 |
| log(FW) | TD300-257 | 90cM-c3.5 | 1.95×10−04 | 0.002 | 0.12 | 11.6 | 0.20 | ns |
| log(FW) | TD300-41 | 90cM-c3.5 | 0.0011 | 0.003 | 0.11 | 9.2 | 0.33 | ns |
| log(FW) | TD108-347 | 90.1cM | 8.29×10−04 | 0.003 | 0.10 | 7.4 | 0.27 | ns |
| log(FW) | TD056-134 | 116cM-c4.7 | 3.49×10−04 | 0.002 | 0.12 | 10.8 | 0.35 | ns |
| log(FW) | TD369-493 | 116cM-c4.8 | 0.0025 | 0.005 | 0.09 | 11.1 | 0.26 | ns |
| log(FW) | TD116-707 | 120cM-c4.3 | 4.90×10−05 | 0.001 | 0.16 | 8.1 | 0.45 | 0.023 |
| log(FW) | TD117-164 | 120cM-c4.4 | 1.16×10−04 | 0.001 | 0.15 | 10.1 | 0.33 | ns |
| log(FW) | TD117-176 | 120cM-c4.4 | 1.16×10−04 | 0.001 | 0.15 | 10.1 | 0.33 | 0.029 |
| log(FW) | TD083-246 | 133cM | 0.0013 | 0.004 | 0.09 | 10.3 | 0.48 | 0.033 |
| log(LCN) | TD373-391 | 86cM-c2.12 | 2.14×10−05 | 0.002 | 0.21 | −0.68 | 0.49 | 0.037 |
| log(LCN) | lcn2.1-692 | 86cM-c2.3 | 5.93×10−13 | 1.85×10−10 | 0.44 | −1.16 | 0.37 | 4.57×10−09 |
| log(LCN) | lcn2.1-686 | 86cM-c2.3 | 5.32×10−12 | 8.30×10−10 | 0.44 | −1.21 | 0.38 | 1.34×10−08 |
| SSC | TD133-115 | 84cM-c2.8 | 1.87×10−05 | 7.12×10−04 | 0.16 | −0.63 | 0.33 | ns |
| SSC | TD133-395 | 84cM-c2.8 | 4.90×10−05 | 0.002 | 0.15 | −0.58 | 0.33 | ns |
| SSC | TD387-452 | 84cM-c2.9 | 3.88×10−07 | 5.89×10−05 | 0.24 | −0.86 | 0.27 | 0.018 |
| SSC | TD047-274 | 86cM-c2.5 | 3.96×10−06 | 2.01×10−04 | 0.19 | −1.00 | 0.12 | ns |
| SSC | TD120-212 | 86cM-c2.6 | 3.10×10−04 | 0.004 | 0.13 | −0.58 | 0.33 | ns |
| SSC | TD120-88 | 86cM-c2.6 | 2.22×10−04 | 0.003 | 0.13 | −0.59 | 0.32 | ns |
| SSC | TD140-180 | 87.5cM-c3.15 | 1.90×10−04 | 0.003 | 0.14 | −0.73 | 0.21 | ns |
| SSC | TD379-326 | 88cM-c3.11 | 0.008 ns | 0.04 ns | — | — | — | 0.045 |
| SSC | TD380-256 | 89cM-c3.8 | 2.57×10−04 | 0.003 | 0.13 | −0.65 | 0.21 | ns |
| SSC | TD380-526 | 89cM-c3.8 | 1.27×10−06 | 9.68×10−05 | 0.21 | −0.70 | 0.36 | 0.022 |
| SSC | TD280-328 | 89cM-c3.9 | 1.64×10−04 | 0.003 | 0.14 | −0.55 | 0.48 | ns |
| SSC | TD055-469 | 89.5cM-c3.7 | 8.93×10−05 | 0.002 | 0.15 | −0.67 | 0.26 | ns |
| SSC | TD117-164 | 120cM-c4.4 | 1.52×10−04 | 0.003 | 0.14 | −0.70 | 0.33 | ns |
| SSC | TD117-176 | 120cM-c4.4 | 1.52×10−04 | 0.003 | 0.14 | −0.70 | 0.33 | ns |
Model A: MLM model, with structure based on 20 SSR (only P values less than 0.005 are shown with indication on allele effect); model B: MLM model with structure based on all STS loci on chromosome 2 (P values less than 0.05 are shown). MAF, minimal allele frequencies; ns, nonsignificant.
Nomenclature for the location is as follows: “genetic distance on expen2000 reference map”-“the number of contig”.”the fragment number on this contig”.
P values are corrected following the Benjamini & Hochberg (2000) procedure (see Materials and Methods).
R2 were calculated using Q model.
Allele effects are indicated in grams for FW, mean number of locule for LCN, and °brix for SSC.
MAFs are shown for each polymorphism.
Figure 6 Plot of association P values over the chromosome 2. Associations are estimated for 90 accessions. K+Q model was used to screen for association between polymorphisms and (A) FW, (B) LCN, and (C) SSC. Stars indicate the associations detected with the structure assessed with all STS, and black dots the associations detected with 20 SSR markers. The upper part of each graph represents associations along genetic distance over the entire chromosome 2. The lower part shows associations for each physical contig. Arrows indicate the marker name of the most significant associations. Adjusted P values for multiple testing (see Materials and Methods) are shown.