| Literature DB >> 23903436 |
Kenta Shirasawa1, Hiroyuki Fukuoka, Hiroshi Matsunaga, Yuhko Kobayashi, Issei Kobayashi, Hideki Hirakawa, Sachiko Isobe, Satoshi Tabata.
Abstract
With the aim of understanding relationship between genetic and phenotypic variations in cultivated tomato, single nucleotide polymorphism (SNP) markers covering the whole genome of cultivated tomato were developed and genome-wide association studies (GWAS) were performed. The whole genomes of six tomato lines were sequenced with the ABI-5500xl SOLiD sequencer. Sequence reads covering ~13.7× of the genome for each line were obtained, and mapped onto tomato reference genomes (SL2.40) to detect ~1.5 million SNP candidates. Of the identified SNPs, 1.5% were considered to confer gene functions. In the subsequent Illumina GoldenGate assay for 1536 SNPs, 1293 SNPs were successfully genotyped, and 1248 showed polymorphisms among 663 tomato accessions. The whole-genome linkage disequilibrium (LD) analysis detected highly biased LD decays between euchromatic (58 kb) and heterochromatic regions (13.8 Mb). Subsequent GWAS identified SNPs that were significantly associated with agronomical traits, with SNP loci located near genes that were previously reported as candidates for these traits. This study demonstrates that attractive loci can be identified by performing GWAS with a large number of SNPs obtained from re-sequencing analysis.Entities:
Keywords: genome-wide association studies; linkage disequilibrium; single nucleotide polymorphism; tomato; whole-genome re-sequencing
Mesh:
Substances:
Year: 2013 PMID: 23903436 PMCID: PMC3859326 DOI: 10.1093/dnares/dst033
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of the re-sequenced genomes in the six tomato lines
| Line name | Number of read pairs (reads) | Total sequence length (bp) | Re-sequencing deptha (times) | % of genome coverageb | Coverage depth (times) |
|---|---|---|---|---|---|
| AIC | 104 913 343 | 11 540 467 730 | 12.1 | 93.5 | 8.3 |
| FRK | 91 995 911 | 10 119 550 210 | 10.7 | 93.2 | 7.2 |
| M82 | 107 226 071 | 11 794 867 810 | 12.4 | 93.0 | 8.3 |
| PL11 | 121 752 304 | 13 392 753 440 | 14.1 | 93.7 | 9.7 |
| PON | 94 404 895 | 10 384 538 450 | 10.9 | 93.4 | 7.7 |
| REG | 188 026 505 | 20 682 915 550 | 21.8 | 93.3 | 14.1 |
| Mean | 118 053 172 | 12 985 848 865 | 13.7 | 93.4 | 9.2 |
aRe-sequencing depth = total sequence length/tomato genome size (950 Mb).
bMapped percentage on the reference genome sequences (SL2.40, 760 Mb) at ≥1 coverage.
Figure 1.Density maps for SNPs detected in six tomato lines with respect to the reference tomato genome, SL2.40. The colours in each block represent a continuum of SNP densities: low-to-high SNP densities are represented by green to red. Left-side elliptic bars indicate the tomato chromosomes. Horizontal lines in each chromosome bar show mapped positions of SNPs used for the GoldenGate assay (black for intergenic SNPs, red for SNPs at splice sites and intron SNPs, blue for SNPs at UTRs and synonymous SNPs, and green for missense SNPs and nonsense SNPs). Heterochromatic regions are indicated by vertical lines on the right of the chromosomes. Names of genes identified by map-based cloning in previous studies are shown on the right of the chromosomes.
The number of SNPs categorized into seven classes
| Line | Total | Intergenic | Intragenic | |||||
|---|---|---|---|---|---|---|---|---|
| Intron | Exon | |||||||
| Splice site | Intron | UTR | CDS | |||||
| Synonymous | Non-synonymous | |||||||
| Missense | Nonsense | |||||||
| AIC | 85 721 | 70 707 | 26 | 9477 | 628 | 1769 | 3032 | 82 |
| FRK | 120 379 | 104 542 | 27 | 10 082 | 645 | 1799 | 3175 | 109 |
| M82 | 710 986 | 672 467 | 60 | 25 951 | 1529 | 4035 | 6708 | 236 |
| PL11 | 245 805 | 213 427 | 34 | 22 083 | 1438 | 3646 | 5054 | 123 |
| PON | 85 595 | 70 205 | 22 | 9847 | 661 | 1751 | 3017 | 92 |
| REG | 1 104 787 | 984 271 | 123 | 83 074 | 7102 | 13 157 | 16 647 | 413 |
| 6 lines | 1 475 688 | 1 316 332 | 156 | 110 159 | 8982 | 17 410 | 22 091 | 558 |
Figure 2.LD measures; r2 values against physical distance (Mb) between all pairs of SNPs located on the same chromosome.
Figure 3.SNPs associated with inflorescence branching identified by GWAS. Distribution of SNPs associated with inflorescence branching. SNPs that associated significantly (-logP of 4.36 at a significant level of 5%) are indicated by arrows.
Effects of associated SNPs on the traits
| Trait | Associating SNP | Chromosome | Position | -Log | Additive effectb | Dominant effectb | Candidate gene |
|---|---|---|---|---|---|---|---|
| Inflorescence branching | SL2.40ch02_41751976Y | SL2.40ch02 | 41 751 976 | 4.4* | 0.2 | 0.2 | |
| solcap_snp_sl_39457 | SL2.40ch09 | 4 904 111 | 5.4** | −0.3 | −0.4 | ||
| No. of leaves between inflorescences | SL2.40ch06_42601581W | SL2.40ch06 | 42 601 581 | 5.4** | −0.4 | 0.4 | |
| − | |||||||
| Plant habit determinate | SL2.40ch06_42601581W | SL2.40ch06 | 42 601 581 | 26.2** | 0.2 | −0.3 | |
| − | |||||||
| Plant height | SL2.40ch06_42601581W | SL2.40ch06 | 42 601 581 | 11.2** | −7.7 | 21.1 | |
| − | |||||||
| solcap_snp_sl_16654 | SL2.40ch09 | 2 135 101 | 7.5** | −1.0 | 31.6 | ||
| Fruit size | SL1_00sc6004_2094360_solcap_snp_sl_44897 | SL2.40ch11 | 52 280 215 | 4.6* | −10.9 | 9.1 | |
| − | |||||||
| Locule number | SL2.40ch02_41172086R | SL2.40ch02 | 41 172 086 | 4.9* | −0.4 | −0.6 | |
| SL1_00sc6004_2094360_solcap_snp_sl_44897 | SL2.40ch11 | 52 280 215 | 7.6** | −0.5 | −0.2 | ||
| − | |||||||
| Green shoulder | SL2.40ch01_89266983Y | SL2.40ch01 | 89 266 983 | 4.3* | −0.3 | 0.9 | |
| SL2.40ch10_1539862R | SL2.40ch10 | 1 539 862 | 5.5** | −0.4 | 1.2 | ||
| − | |||||||
| Colour of fruit epidermis | SL2.40ch01_71279371Y | SL2.40ch01 | 71 279 371 | 9.0** | 0.2 | Not detected |
Genes associating with the traits in the replicated GWAS are shown by bold.
a** and * indicate the significance level of 1 and 5%, respectively.
bEffect of ‘Heinz 1706’ allele.