| Literature DB >> 24548308 |
Maria Raffaella Ercolano1, Adriana Sacco, Francesca Ferriello, Raffaella D'Alessandro, Paola Tononi, Alessandra Traini, Amalia Barone, Elisa Zago, Maria Luisa Chiusano, Genny Buson, Massimo Delledonne, Luigi Frusciante.
Abstract
BACKGROUND: Investigation of tomato genetic resources is a crucial issue for better straight evolution and genetic studies as well as tomato breeding strategies. Traditional Vesuviano and San Marzano varieties grown in Campania region (Southern Italy) are famous for their remarkable fruit quality. Owing to their economic and social importance is crucial to understand the genetic basis of their unique traits.Entities:
Mesh:
Year: 2014 PMID: 24548308 PMCID: PMC3936818 DOI: 10.1186/1471-2164-15-138
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Variants statistics
| SNPs | 206,867 (199,502) | 177,179 (169,704) | 3,343 (3,160) |
| Deletions | 46,433 | 44,561 | 25,208 |
| Insertions | 77,882 | 75,984 | 55,668 |
| Unbalanced insertions | 6,537 | 6,626 | 1,343 |
Number of sequence variants in Vesuviano (RSV) and San Marzano (SM) varieties compared to the reference genome (SL240) In brackets is reported the number of unambiguous SNPs In the last column the number of variants in common between the two varieties is shown.
Counts of identified SNPs and of genes affected by them
| | ||||
|---|---|---|---|---|
| Intergenic | 188,769 (181,758) | 16,705 (16,247) | 158,964 (151,971) | 14,530 (13,939) |
| Intronic | 11,907 (11,766) | 2,880 (2,836) | 12,232 (11,979) | 2,576 (2,490) |
| UTR | 624 (610) | 381 (376) | 686 (671) | 361 (353) |
| CDS | 5,567 (5,368) | 3,254 (3,132) | 5,297 (5,083) | 2,886 (2,752) |
Results are given distinguishing in which region the SNPs are found, if they are in intergenic regions, in genes, in exonic regions or in the coding regions of genes using the ITAG23 annotation of reference genome For variants that insists on a gene, a count of affected genes is also given In brackets are given the numbers of the unambiguous SNPs.
Insertion statistics
| | | | | | |
|---|---|---|---|---|---|
| | |||||
| Total | Intergenic | 69,615 | 18,730 | 67,850 | 18,432 |
| | Intronic | 7,187 | 4,069 | 7,096 | 4,010 |
| | UTR | 557 | 485 | 567 | 489 |
| | CDS | 523 | 418 | 471 | 400 |
| Length < 6 | Intergenic | 63,307 | 18,006 | 62,241 | 17,736 |
| | Intronic | 6,846 | 3,975 | 6,791 | 3,929 |
| | UTR | 535 | 466 | 553 | 478 |
| | CDS | 381 | 321 | 362 | 315 |
| Length > 100 | Intergenic | 97 | 54 | 91 | 54 |
| | Intronic | 4 | 4 | 2 | 2 |
| | UTR | 0 | 0 | 0 | 0 |
| CDS | 4 | 4 | 4 | 4 | |
Statistics of the number of insertions and the number of corresponding annotated genes.
Deletion statistics
| | | | | | |
|---|---|---|---|---|---|
| | |||||
| Total | Intergenic | 43,385 | 9,909 | 41,604 | 9459 |
| | Intronic | 2326 | 1384 | 2317 | 1354 |
| | UTR | 141 | 110 | 147 | 115 |
| | CDS | 581 | 472 | 493 | 404 |
| Length < 6 | Intergenic | 34,567 | 8103 | 32,974 | 7696 |
| | Intronic | 1898 | 1190 | 1914 | 1162 |
| | UTR | 114 | 88 | 114 | 87 |
| | CDS | 369 | 296 | 331 | 257 |
| Length > 100 | Intergenic | 2365 | 950 | 2337 | 920 |
| | Intronic | 80 | 77 | 80 | 78 |
| | UTR | 6 | 6 | 7 | 7 |
| CDS | 48 | 48 | 37 | 37 | |
Statistics of the number of deletions and the number of corresponding annotated gene.
Figure 1Statistics of annotation translation. Statistics of the ITAG23 annotation translated from the reference assembly to the Vesuviano (RSV) and San Marzano (SM) genomes.
Number of tomato genes mapped the reference tomato (SL240), the Vesuviano (RSV) and San Marzano (SM) genomes
| 1 match | 29,844 | 29,940 | 29,934 |
| >1 match | 4,755 | 4,621 | 4,638 |
| total | 34,600 | 34,561 | 34,572 |
The number of genes resulting in one match or in multiple matches are indicated.
Figure 2Main results from ITAG23 gene mapping on SL240, Vesuviano (RSV) and San Marzano (SM) tomato genomes. Grey bars represent the twelve tomato chromosomes and chromosome 0 The distribution of the genes within the SL240 genome is shown by black lines The distribution of genes mapped more than once is shown in the following circle with purple lines when the redundancy is equal or lower than 30 copies and with orange lines when genes re-mapped more than 30 times SM and RSV specific behaviours are indicated in green and in blue, respectively, in the whole picture, while the common behaviours among the two genotypes are indicated in red In particular, on each chromosome (grey bars) colored lines indicate the distribution of the S lycopersicum genes which are not mapped in the two SM and RSV genotypes The two inner circles indicate the distribution of genes that are specific to RSV (circle with a light blue background) and to SM (circle with a light green background) genotypes, along the respective pseudomolecules Common genes identified exclusively in chromosome positions from the two newly sequenced genotypes are in red and are linked by red connectors between the two inner circles to better highlight the conserved positions of these genes in the two newly sequenced genomes
Figure 3Venn diagram of loci that share the same relative position in the reference SL240,Vesuviano (RSV) and San Marzano (SM) genomes.
Number and percentage of polymorphic genes and number of variants identified for each fruit quality-related class of genes in Vesuviano (RSV) and San Marzano (SM) varieties Data are referred to the tomato reference genome (SL240)
| | | ||||||
|---|---|---|---|---|---|---|---|
| Ascorbate biosynthesis | 23 | 18 | 783 | 265 | 18 | 783 | 349 |
| MEP/carotenoid pathway | 46 | 35 | 761 | 337 | 34 | 739 | 351 |
| Ethylene-related genes | 52 | 48 | 923 | 466 | 44 | 846 | 448 |
| Cell wall | 718 | 589 | 820 | 5,303 | 562 | 783 | 4,471 |
| Transcription factors | 2,025 | 1,710 | 844 | 17,215 | 1,662 | 821 | 15,412 |
| Transcription regulators | 434 | 353 | 813 | 2,660 | 344 | 793 | 2,496 |
Common and specific genes related to fruit quality with variants in Vesuviano (RSV) and San Marzano (SM) varieties
| Ascorbate biosynthesis | 16 | 2 | 2 |
| MEP/carotenoid pathway | 33 | 2 | 1 |
| Ethylene-related genes | 44 | 4 | 0 |
| Cell wall | 1,600 | 110 | 62 |
| Transcription factors | 328 | 25 | 16 |
| Transcription regulators | 545 | 44 | 17 |
Figure 4Classification of variants in fruit quality and ripening related genes. Classifications of variants were based on the their gene location A) Vesuviano (RSV) sequence variants B) San Marzano (SM) sequence variants.
Fruit quality related genes affected by deleterious mutation in SM, RSV or both varieties
| | | | ||
| Solyc01g1062302 | B3 domain-containing protein | V79F | −3233 | |
| Solyc03g1182902 | Auxin response factor 2 | S220G | −3818 | |
| Solyc04g0094402 | NAC domain protein | Y212C | −3401 | |
| Solyc04g0647701 | Zinc finger CCCH domain-containing protein 38 | K687Q | −29 | |
| Solyc10g0804101 | BZIP transcription factor | T403I | −4147 | |
| | | | ||
| Solyc07g0519801 | Chromodomain-helicase-DNA-binding protein 1-like | A618T | −2833 | |
| Solyc09g0760102 | PHD finger family protein | Y390C/L988F | −3167/−6472 | |
| | | | ||
| Solyc02g0782301 | Glucan synthase like 1 | V1659G | −5286 | |
| Solyc05g0518702 | Pollen allergen Phl p 11 | A134T | −3344 | |
| Solyc05g0554901 | Laccase-22 | R472H | −49 | |
| | | | ||
| Solyc01g1038301 | Zinc finger-homeodomain protein 2 | R56Q | −4000 | |
| Solyc03g0971202 | Heat stress transcription factor A3 | W469L | −4,548 | |
| Solyc07g0550001 | Myb-related transcription factor | T110I | −5433 | |
| Solyc08g0052902 | BZIP transcription factor 3 | E357G | −5548 | |
| Solyc08g0062401 | B3 domain-containing protein | R299W | −4223 | |
| Solyc09g0080402 | RNA polymerase sigma-70 factor | G436R | −5900 | |
| | | | ||
| Solyc01g0497401 | DNA repair and recombination protein RAD54-like | R155T | −4767 | |
| Solyc02g0646902 | Acetyltransferase-like protein | I12T | −3386 | |
| | | | ||
| Solyc01g0087201 | Mannan endo-1 4-beta-mannosidase | G215S | −5733 | |
| Solyc03g1153101 | Expansin | E146G | −3595 | |
| Solyc11g0565901 | Cellulose synthase | D100N | −4472 | |
| | | | ||
| Solyc00g1983601 | Zinc finger CCCH domain-containing protein 30 | K188T | −4865 | |
| Solyc02g0773901 | WUSCHEL-related homeobox 11 | T291K | −2933 | |
| Solyc02g0815202 | Nibrin | F236L | −5433 | |
| Solyc03g0439102 | BSD domain containing protein | G54C | −8114 | |
| Solyc05g0502202 | G-box binding factor 3 | N335S | −4567 | |
| Solyc07g0418501 | Homeodomain-containing transcription factor FWA | I545T | −4584 | |
| Solyc09g0075502 | Zinc finger family protein | V111L | −3000 | |
| Solyc09g0075701 | MYB transcription factor | K112I | −6351 | |
| Solyc10g0181101 | MADS box transcription factor | L104P | −3739 | |
| Solyc12g0107601 | Transcription factor (Fragment) | A36V/G44C/L38P | −2844/−7506/−6344 | |
| Solyc12g0198201 | MYB transcription factor | K14R | −2942 | |
| | | | ||
| Solyc04g0086102 | Histone acetyltransferase | P1416L | −7522 | |
| Solyc06g0842502 | Kelch repeat and BTB domain-containing | A324T | −2700 | |
| Solyc08g0687701 | N-acetyltransferase | W198L | −12149 | |
| Solyc10g0062202 | Cell differentiation protein rcd1 | Q215E | −2957 | |
| | | | ||
| Solyc01g0791802 | Pectinesterase | T334I | −2612 | |
| Solyc01g0811801 | Beta-glucosidase | H83Y | −4364 | |
| Solyc01g0915302 | Fasciclin-like arabinogalactan protein 13 | S201Y | −3582 | |
| Solyc02g0676501 | Polygalacturonase 1 | G320S | −5337 | |
| Solyc05g0500102 | 1-aminocyclopropane-1-carboxylate synthase | T82A | −3776 | |
| Solyc05g0525301 | Endoglucanase 1 | A183V/I97T | −3085/−4045 | |
| Solyc08g0653202 | Transmembrane protein 222 | V157L | −2583 | |
| Solyc09g0076501 | Fasciclin-like arabinogalactan protein 7 | S104F | −5217 | |
| Solyc09g0102102 | Endoglucanase 1 | A306V | −3709 | |
Data are referred to non-synonymous amino acid variants in CDS regions PROVEAN score threshold = −25 (Variants with score equal or below −25 are considered “deleterious”).
Figure 5GO terms enrichment heat map. Significant GO term classes enriched Vesuviano (RSV) and San Marzano (SM) non-synonymous coding variants are reported Variants found in both varieties respect reference genome are indicated as common; variants found in each variety (RSV or SM) are indicated as specific.