| Literature DB >> 23227037 |
Erika Asamizu1, Kenta Shirasawa, Hideki Hirakawa, Shusei Sato, Satoshi Tabata, Kentaro Yano, Tohru Ariizumi, Daisuke Shibata, Hiroshi Ezura.
Abstract
A total of 93,682 BAC-end sequences (BESs) were generated from a dwarf model tomato, cv. Micro-Tom. After removing repetitive sequences, the BESs were similarity searched against the reference tomato genome of a standard cultivar, "Heinz 1706." By referring to the "Heinz 1706" physical map and by eliminating redundant or nonsignificant hits, 28,804 "unique pair ends" and 8,263 "unique ends" were selected to construct hypothetical BAC contigs. The total physical length of the BAC contigs was 495, 833, 423 bp, covering 65.3% of the entire genome. The average coverage of euchromatin and heterochromatin was 58.9% and 67.3%, respectively. From this analysis, two possible genome rearrangements were identified: one in chromosome 2 (inversion) and the other in chromosome 3 (inversion and translocation). Polymorphisms (SNPs and Indels) between the two cultivars were identified from the BLAST alignments. As a result, 171,792 polymorphisms were mapped on 12 chromosomes. Among these, 30,930 polymorphisms were found in euchromatin (1 per 3,565 bp) and 140,862 were found in heterochromatin (1 per 2,737 bp). The average polymorphism density in the genome was 1 polymorphism per 2,886 bp. To facilitate the use of these data in Micro-Tom research, the BAC contig and polymorphism information are available in the TOMATOMICS database.Entities:
Year: 2012 PMID: 23227037 PMCID: PMC3514829 DOI: 10.1155/2012/437026
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Figure 1Distribution of BAC clone insert size. The insert size was deduced by mapping BESs onto the reference “Heinz 1706” genome (SL2.40).
Figure 2Flow of the BES analysis. To eliminate repetitive sequences, 93,682 BESs were initially searched against the repeat dataset of ITAG 2.3 with a BLASTN cutoff value of E <10−50. Next, the remaining sequences were mapped onto the “Heinz 1706” pseudomolecule sequences (SL2.40) under the following criteria: identity >90%, coverage >50%; E <10−100; the inclusion of single hits only. Mapped BESs were classified as either unique pair ends, for which both ends were mapped, or unique ends, for which only one end was mapped.
Coverage of chromosomes by hypothetical Micro-Tom BAC contigs.
| SL2.40ch01 | SL2.40ch02 | SL2.40ch03 | SL2.40ch04 | SL2.40ch05 | SL2.40ch06 | SL2.40ch07 | SL2.40ch08 | SL2.40ch09 | SL2.40ch10 | SL2.40ch11 | SL2.40ch12 | Total (ch01–ch12) | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Euchromatin | Chromosome length | 27,903,720 | 24,734,122 | 16,423,960 | 13,871,288 | 10,836,573 | 17,576,248 | 17,480,118 | 15,552,430 | 10,522,300 | 9,129,273 | 11,175,203 | 12,034,427 |
|
| Euchromatin | no. of Contigs | 100 | 78 | 45 | 34 | 32 | 52 | 53 | 55 | 37 | 25 | 45 | 44 |
|
| Euchromatin | no. of BACs | 533 | 504 | 401 | 231 | 170 | 339 | 224 | 279 | 176 | 197 | 184 | 317 |
|
| Euchromatin | Covered bases | 17,310,734 | 14,644,412 | 11,678,941 | 6,261,540 | 5,377,576 | 10,621,719 | 8,336,310 | 9,541,847 | 6,047,365 | 5,473,701 | 6,855,876 | 8,119,058 |
|
| Euchromatin | Uncovered bases | 10,592,986 | 10,089,710 | 4,745,019 | 7,609,748 | 5,458,997 | 6,954,529 | 9,143,808 | 6,010,583 | 4,474,935 | 3,655,572 | 4,319,327 | 3,915,369 |
|
| Euchromatin | % Coverage | 62.0% | 59.2% | 71.1% | 45.1% | 49.6% | 60.4% | 47.7% | 61.4% | 57.5% | 60.0% | 61.3% | 67.5% |
|
|
| ||||||||||||||
| Heterochromatin | Chromosome length | 62,400,524 | 25,184,172 | 48,416,754 | 50,193,024 | 54,184,865 | 28,465,388 | 47,788,503 | 47,480,227 | 57,139,791 | 55,705,032 | 42,210,822 | 53,451,826 |
|
| Heterochromatin | no. of Contigs | 175 | 74 | 147 | 131 | 169 | 76 | 135 | 150 | 149 | 159 | 128 | 155 |
|
| Heterochromatin | no. of BACs | 1,000 | 391 | 903 | 1,022 | 752 | 544 | 959 | 856 | 1,209 | 1,056 | 746 | 992 |
|
| Heterochromatin | Covered bases | 39,941,033 | 15,405,507 | 32,458,031 | 34,993,238 | 31,099,727 | 19,672,865 | 35,988,229 | 33,980,702 | 40,376,427 | 37,964,128 | 27,534,109 | 36,150,348 |
|
| Heterochromatin | Uncovered bases | 22,459,491 | 9,778,665 | 15,958,723 | 15,199,786 | 23,085,138 | 8,792,523 | 11,800,274 | 13,499,525 | 16,763,364 | 17,740,904 | 14,676,713 | 17,301,478 |
|
| Heterochromatin | % Coverage | 64.0% | 61.2% | 67.0% | 69.7% | 57.4% | 69.1% | 75.3% | 71.6% | 70.7% | 68.2% | 65.2% | 67.6% |
|
|
| ||||||||||||||
| Total | Chromosome length | 90,304,244 | 49,918,294 | 64,840,714 | 64,064,312 | 65,021,438 | 46,041,636 | 65,268,621 | 63,032,657 | 67,662,091 | 64,834,305 | 53,386,025 | 65,486,253 |
|
| Total | no. of Contigs | 275 | 152 | 192 | 165 | 201 | 128 | 188 | 205 | 186 | 184 | 173 | 199 |
|
| Total | no. of BACs | 1,533 | 895 | 1,304 | 1,253 | 922 | 883 | 1,183 | 1,135 | 1,385 | 1,253 | 930 | 1,309 |
|
| Total | Covered bases | 57,251,767 | 30,049,919 | 44,136,972 | 41,254,778 | 36,477,303 | 30,294,584 | 44,324,539 | 43,522,549 | 46,423,792 | 43,437,829 | 34,389,985 | 44,269,406 |
|
| Total | Uncovered bases | 33,052,477 | 19,868,375 | 20,703,742 | 22,809,534 | 28,544,135 | 15,747,052 | 20,944,082 | 19,510,108 | 21,238,299 | 21,396,476 | 18,996,040 | 21,216,847 |
|
| Total | % Coverage | 63.4% | 60.2% | 68.1% | 64.4% | 56.1% | 65.8% | 67.9% | 69.1% | 68.6% | 67.0% | 64.4% | 67.6% |
|
Possible genome rearrangement events observed in the Micro-Tom and “Heinz 1706” genome.
| No. | BAC | End1 | Acc | Chr | Direction | From | To | End2 | Acc | Chr | Direction | From | To | Possible event |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | MTBAC102D20 | T7 | FT290741 | SL2.40ch02 | — | 29,374,874 | 29,375,640 | SP6 | FT290742 | SL2.40ch02 | — | 29,494,209 | 29,494,781 |
|
| 1 | MTBAC084K15 | T7 | FT278701 | SL2.40ch02 | — | 29,375,421 | 29,376,188 | SP6 | FT278702 | SL2.40ch02 | — | 29,462,866 | 29,463,675 | |
|
| ||||||||||||||
| 2 | MTBAC041L05 | T7 | FT251747 | SL2.40ch03 | — | 6,601,537 | 6,602,368 | SP6 | FT251748 | SL2.40ch03 | — | 55,664,754 | 55,665,559 |
|
| 2 | MTBAC077O14 | SP6 | FT274148 | SL2.40ch03 | — | 6,602,568 | 6,603,163 | T7 | FT274147 | SL2.40ch03 | — | 55,665,296 | 55,666,020 | |
Number of polymorphisms found in each chromosome.
| SL2.40ch01 | SL2.40ch02 | SL2.40ch03 | SL2.40ch04 | SL2.40ch05 | SL2.40ch06 | SL2.40ch07 | SL2.40ch08 | SL2.40ch09 | SL2.40ch10 | SL2.40ch11 | SL2.40ch12 | Total | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Euchromatin | no. of polymorphisms | 4,152 | 4,123 | 3,700 | 2,863 | 969 | 2,417 | 3,504 | 2,113 | 1,932 | 1,302 | 1,694 | 2,161 |
|
| Euchromatin | Covered bases | 17,310,734 | 14,644,412 | 11,678,941 | 6,261,540 | 5,377,576 | 10,621,719 | 8,336,310 | 9,541,847 | 6,047,365 | 5,473,701 | 6,855,876 | 8,119,058 |
|
| Euchromatin | kb/polymorphism | 4,169 | 3,552 | 3,156 | 2,187 | 5,550 | 4,395 | 2,379 | 4,516 | 3,130 | 4,204 | 4,047 | 3,757 |
|
|
| ||||||||||||||
| Heterochromatin | no. of polymorphisms | 12,319 | 10,694 | 10,408 | 9,995 | 30,951 | 5,134 | 8,347 | 8,562 | 10,231 | 9,209 | 14,937 | 10,075 |
|
| Heterochromatin | Covered bases | 39,941,033 | 15,405,507 | 32,458,031 | 34,993,238 | 31,099,727 | 19,672,865 | 35,988,229 | 33,980,702 | 40,376,427 | 37,964,128 | 27,534,109 | 36,150,348 |
|
| Heterochromatin | kb/polymorphism | 3,242 | 1,441 | 3,119 | 3,501 | 1,005 | 3,832 | 4,312 | 3,969 | 3,946 | 4,123 | 1,843 | 3,588 |
|
|
| ||||||||||||||
| Total | no. of polymorphisms | 16,471 | 14,817 | 14,108 | 12,858 | 31,920 | 7,551 | 11,851 | 10,675 | 12,163 | 10,511 | 16,631 | 12,236 |
|
| Total | Covered bases | 57,251,767 | 30,049,919 | 44,136,972 | 41,254,778 | 36,477,303 | 30,294,584 | 44,324,539 | 43,522,549 | 46,423,792 | 43,437,829 | 34,389,985 | 44,269,406 |
|
| Total | kb/polymorphism | 3,476 | 2,028 | 3,129 | 3,208 | 1,143 | 4,012 | 3,740 | 4,077 | 3,817 | 4,133 | 2,068 | 3,618 |
|
Number of polymorphisms found in genic and intergenic regions in each chromosome.
| SL2.40ch01 | SL2.40ch02 | SL2.40ch03 | SL2.40ch04 | SL2.40ch05 | SL2.40ch06 | SL2.40ch07 | SL2.40ch08 | SL2.40ch09 | SL2.40ch10 | SL2.40ch11 | SL2.40ch12 | Total | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Genic | Exon | 3′ UTR | 115 | 100 | 127 | 108 | 62 | 58 | 59 | 87 | 47 | 15 | 0 | 0 |
|
| 5′ UTR | 157 | 58 | 48 | 23 | 26 | 34 | 28 | 14 | 38 | 2 | 0 | 0 |
| ||
| CDS | 1,152 | 967 | 1,031 | 758 | 820 | 603 | 757 | 605 | 669 | 570 | 662 | 615 |
| ||
| Intron | Intron | 2,035 | 1,938 | 2,023 | 1,971 | 1,547 | 1,122 | 1,778 | 1,154 | 1,425 | 994 | 1,947 | 1,172 |
| |
| Splice junction | 19 | 15 | 12 | 13 | 14 | 15 | 13 | 8 | 8 | 7 | 12 | 12 |
| ||
|
| |||||||||||||||
| Intergenic | 12,993 | 11,739 | 10,867 | 9,985 | 29,451 | 5,719 | 9,216 | 8,807 | 9,976 | 8,923 | 14,010 | 10,437 |
| ||
|
| |||||||||||||||
| Total | 16,471 | 14,817 | 14,108 | 12,858 | 31,920 | 7,551 | 11,851 | 10,675 | 12,163 | 10,511 | 16,631 | 12,236 |
| ||
Figure 3Micro-Tom BAC coverage with respect to the “Heinz 1706” chromosomes and detected polymorphisms. Black boxes indicate covered regions, and white boxes indicate gaps. Bars represent heterochromatic regions. The scale bars for polymorphisms indicate the number of SNPs or Indels per megabase (200 polymorphisms/scale).