| Literature DB >> 28717173 |
Irma Terracciano1, Concita Cantarella1, Carlo Fasano1, Teodoro Cardi1, Giuseppe Mennella1, Nunzio D'Agostino2.
Abstract
Tomato (Solanum lycopersicum L.) plants are characterized by having a variety of fruit colours that reflect the composition and accumulation of diverse carotenoids in the berries. Carotenoids are extensively studied for their health-promoting effects and this explains the great attention these pigments received by breeders and researchers worldwide. In this work we applied Agilent's SureSelect liquid-phase sequence capture and Illumina targeted re-sequencing of 34 tomato genes belonging to the methylerythritol phosphate (MEP) carotenoid pathway on a panel of 48 genotypes which differ for carotenoid content calculated as the sum of β-carotene, cis- and trans-lycopene. We targeted 230 kb of genomic regions including all exons and regulatory regions and observed ~40% of on-target capture. We found ample genetic variation among all the genotypes under study and generated an extensive catalog of SNPs/InDels located in both genic and regulatory regions. SNPs/InDels were also classified based on genomic location and putative biological effect. With our work we contributed to the identification of allelic variations possibly underpinning a key agronomic trait in tomato. Results from this study can be exploited for the promotion of novel studies on tomato bio-fortification as well as of breeding programs related to carotenoid accumulation in fruits.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28717173 PMCID: PMC5514110 DOI: 10.1038/s41598-017-06120-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Bowtie2 alignment statistics. Pre-processed reads from the sequencing of 48 tomato genotypes were mapped along the tomato reference genome SL2.40.
Figure 2Cumulative distribution of coverage depth across target regions in 47 tomato genotypes. The box in the graph highlights the fraction of bases captured in the target regions covered at a depth between 20x–50x.
Figure 3Per-base depth of coverage across gene Solyc01g005940 for the 47 tomato genotypes. Mean value for coverage depth is represented in black bold dashed line. Gene structure and bait positions are shown above the multi-line graph.
Figure 4Stacked bar chart showing the number of SNPs and InDels called by GATK across all 47 tomato genotypes under investigation.
Figure 5Stacked bar chart showing sequence changes grouped by SnpEff categories within gene regions.
Number and type of sequence changes identified within gene regions.
| Gene region | Mutation class | Mutation type | Number |
|---|---|---|---|
| UTRs | 5′UTR | 73 | |
| 3′UTR | 45 | ||
| CDS | Synonymous | 79 | |
| Non-synonymous | Missense | 52 | |
| Frameshift | 2 | ||
| In frame deletion | 1 | ||
| Stop lost | 1 | ||
| Stop gain | 1 | ||
| Introns | Intron variant | 399 | |
| Splice region | 16 | ||
| Splice acceptor | 1 | ||
| Splice donor | 1 |
List of missense mutations coupled with prediction on the effect of amino acid substitutions by PredictSNP.
| Gene | Enzyme | AA Position | Wild residue | Target residue | PredictSNP prediction | PredictSNP confidence | Genotypes | ||
|---|---|---|---|---|---|---|---|---|---|
| Low-CC | Medium-CC | High-CC | |||||||
|
| PSY3 | 82 | C | R | N | 0.74 | — | — | STR |
|
| CCD7 | 337 | I | M |
| 0.72 | E98 | — | E1, E41, E115 |
|
| ZDS | 449 | Q | H | N | 0.65 | — | E55 | — |
| 350 | I | T | N | 0.83 | — | — | STR | ||
| 581 | L | P |
| 0.51 | — | — | STR | ||
|
| ZEP | 667 | E | G | N | 0.83 | — | E16, E113 | STR, ZBR |
|
| PSY1 | 361 | V | I | N | 0.83 | — | E51 | — |
| 105 | K | N | N | 0.83 | E93 | — | — | ||
| 105 | K | M |
| 0.65 | E93 | — | — | ||
| 108 | I | M | N | 0.83 | E93 | — | — | ||
|
| DXR | 125 | T | I | N | 0.68 | — | — | E96 |
| 56 | P | L | N | 0.83 | — | — | E117 | ||
|
| LCBY1 | 405 | D | N | N | 0.83 | E54 | E55, E103 | E57, E115, STR, ZBR |
|
| VDE | 267 | S | N | N | 0.74 | E54 | E55, E103 | E57, E115, STR, ZBR |
|
| GGPPS2 | 41 | Q | K | N | 0.83 | — | E117 | BX, STR, URI, ZBR |
| 52 | V | L | N | 0.83 | — | E117 | URI | ||
| 200 | S | P | N | 0.74 | — | E117 | BX, STR, URI, ZBR | ||
|
| CrtISO_like | 142 | E | V | N | 0.63 | — | — | BX |
| 334 | V | I | N | 0.83 | — | — | BX, STR | ||
| 367 | K | E | N | 0.83 | — | — | BX,STR | ||
|
| CYP97B2 | 9 | I | N |
| 0.61 | — | E16 | — |
| 17 | R | W |
| 0.55 | — | E16 | — | ||
|
| CHY1 | 16 | F | I | N | 0.74 | — | — | STR, ZBR |
| 21 | S | T | N | 0.83 | — | — | STR, ZBR | ||
| 27 | K | I |
| 0.55 | — | — | STR, ZBR | ||
| 122 | V | I | N | 0.83 | — | — | STR, ZBR | ||
| 217 | A | P | N | 0.83 | — | — | STR, ZBR | ||
| 300 | I | K | N | 0.83 | — | — | STR, ZBR | ||
|
| CYCB | 20 | R | K | N | 0.83 | E71 | — | — |
| 23 | V | F | N | 0.63 | E71 | — | — | ||
| 229 | R | K | N | 0.83 | E71 | — | — | ||
| 289 | R | S | N | 0.83 | E71 | — | — | ||
| 290 | D | N | N | 0.83 | E71 | — | — | ||
| 335 | V | L | N | 0.83 | E71 | — | — | ||
| 473 | M | L | N | 0.68 | E71 | — | — | ||
| 484 | L | V | N | 0.83 | E71 | — | — | ||
|
| NCED | 424 | A | P | N | 0.75 | — | E51 | STR |
|
| NCED2 | 571 | F | L |
| 0.87 | E88 | — | — |
|
| CCD8 | 170 | V | A | N | 0.83 | E39, E40, E54, E83, E88, E99 | E8, E16, E34, E45, E51, E55, E64, E70, E76, E113, E115 | E1, E4, E32, E96, E119 |
|
| CCD_like | 267 | I | L |
| 0.51 | E21, E71, E82, E93, E98 | E103 | E30, E96, STR |
|
| CCD4B | 82 | E | D | N | 0.83 | — | E103 | — |
| 272 | F | Y | N | 0.65 | E71 | — | — | ||
| 295 | H | P | N | 0.83 | — | E55 | STR | ||
| 486 | G | D | N | 0.75 | — | E55 | STR | ||
| 488 | M | L | N | 0.83 | — | E55 | STR | ||
| 501 | V | I | N | 0.83 | — | E55 | STR | ||
| 518 | K | E | N | 0.83 | — | E55 | STR | ||
|
| CtrISO | 41 | I | R | N | 0.71 | — | E49 | STR, ZBR |
| 362 | V | A |
| 0.76 | — | E49 | STR, ZBR | ||
|
| PTOX | 89 | D | G | N | 0.83 | — | E16, E72, E113 | E105, URI, ZBR |
| 106 | N | S | N | 0.83 | — | E16, E72, E113 | E105, ZBR | ||
|
| HDS | 475 | I | T | N | 0.74 | — | E45 | — |
N = Neutral; D = Deleterious.
Figure 6Bar chart showing the copy number variation of cis-acting regulatory elements within the promoter region of the 34 candidate genes across all 47 tomato genotypes.