| Literature DB >> 27142449 |
Francisco J Herraiz1, José Blanca1, Pello Ziarsolo1, Pietro Gramazio1, Mariola Plazas1, Gregory J Anderson2, Jaime Prohens3, Santiago Vilanova4.
Abstract
BACKGROUND: Solanum sect. Basarthrum is phylogenetically very close to potatoes (Solanum sect. Petota) and tomatoes (Solanum sect. Lycopersicon), two groups with great economic importance, and for which Solanum sect. Basarthrum represents a tertiary gene pool for breeding. This section includes the important regional cultigen, the pepino (Solanum muricatum), and several wild species. Among the wild species, S. caripense is prominent due to its major involvement in the origin of pepino and its wide geographical distribution. Despite the value of the pepino as an emerging crop, and the potential for gene transfer from both the pepino and S. caripense to potatoes and tomatoes, there has been virtually no genomic study of these species.Entities:
Keywords: Candidate genes; Functional annotation; Molecular markers; Pepino; Phylogeny; Potato; S. caripense; Solanaceae; Solanum muricatum; Tomato; Transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27142449 PMCID: PMC4855764 DOI: 10.1186/s12864-016-2656-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of raw and clean reads after processing for S. muricatum and S. caripense
|
|
| |
|---|---|---|
| Total raw reads | 58,327,154 × 2 | 52,646,045 × 2 |
| Total raw reads data size (Gb) | 11.78 Gb | 10.63 Gb |
| % G/C | 42.0 | 42.3 |
| Total clean reads | 33,963,075 × 2 | 36,228,181 × 2 |
| Total clean reads data size (Gb) | 6.86 Gb | 7.32 Gb |
| % G/C | 41.7 | 42.5 |
Summary of the Solanum muricatum transcriptome assembly. After assembly in the first group (Transcripts), and after filtering by level of expression (Most expressed transcripts)
| Transcripts | Number | 91,949 |
| Total length | 82,353,960 | |
| Average length | 895.65 | |
| Maximum length | 11,491 | |
| Most expressed transcripts | Number | 75,832 |
| Total length | 53,411,734 | |
| Average length | 704.34 | |
| Maximum length | 11,491 |
Fig. 1Length distribution of the pepino (S. muricatum) transcriptome most expressed transcripts. Horizontal axis represents the size range of each unigene. Vertical axis represents the number of unigenes for every interval
Functional annotation summary of the pepino sequences over protein databases. First the most expressed transcripts were annotated in Swiss-Prot database. Then, unpaired transcripts in this annotation were evaluated in the next database, ITAG2.4. And finally, the unpaired at this level, were evaluated in the Uniref90 database
| Number of transcripts | % of total | |
|---|---|---|
| Annotated in Swiss-Prot | 26,688 | 53.7 % |
| Annotated in ITAG2.4 | 14,942 | 30.1 % |
| Annotated in UniRef90 | 2263 | 4.6 % |
| Total annotated in protein databases | 43.893 | |
| TOTAL Annotations | 49,662 |
Fig. 2GO level distribution in each category for the annotated pepino unigenes. X axis represent the GO level and Y axis the number of annotated unigenes. BP = Biological Process, MF = Molecular Function, CC = Cellular Component
Fig. 3Gene ontology classification by plant slim term for level 2. The graphic indicates the number of transcripts for every process and functional category, cellular components (left), biological process (middle) and molecular function and (right)
Fig. 4Number of unigenes for each enzyme commission (EC) category
Candidate genes studied affecting traits of importance in different Solanaceae. Traits and genes affecting inflorescence, fruit stripes, fruit shape, anthocyanins route, chlorogenic acid pathway, saponines pathway, and sucrose accumulation are included. More information the Candidate genes section of Material and Methods and in Additional file 8: Table S1
| Trait | Genes | Features |
|---|---|---|
| Inflorescence | Anantha | Gene - F-box protein |
| Compound inflorescence | Transcription factor | |
| Fruit stripes | Fruit Stripes | Transcription factor |
| Fruit shape | FAS | Intron-regulatory |
| Fw2.2 | Promoter-regulatory | |
| Fw3 | Promoter-regulatory | |
| Tonneau | Gene - microtubule cytoskeleton organization | |
| Wuschel (LC) | SNP in downstream-regulatory | |
| OVATE | Premature stop | |
| POS1 | Intron-regulatory | |
| SlCCS52A | Receptor activity | |
| Sl-IAA17 | Gene | |
| SUN | Transposon insertion-regulatory | |
| Wee | Gene - Kinase | |
| Anthocyanins pathway | F3’5’H | Gene - Hydroxylase |
| Acyltransferase-like | Gene - Acyltransferase | |
| 5GT | Gene - Glucosyltransferase | |
| ANS | Gene - Anthocyanidin synthase | |
| DFR | Gene - Dihydroflavonol 4-reductase | |
| F3H | Gene - Flavanone 3-hydroxylase | |
| CHI | Gene - Chalcone isomerase | |
| CHS2 | Gene - Chalcone synthase | |
| CHS3 | Gene - Chalcone synthase | |
| CHS1 | Gene - Chalcone synthase | |
| Acyltransferase-like | Gene - Acyltransferase | |
| Chlorogenic acid pathway | 4CL | Gene - 4-Coumarate-CoA ligase |
| C3H | Cytochrome P450 | |
| HTC | Gene - Transferase | |
| HQT | Gene - Transferase | |
| Saponines pathway | Egp#1–1 | Gene - Glycosyltransferase |
| Egp#1–4 | Gene - Glycosyltransferase | |
| Ptt#1–53 | Gene - Glycosyltransferase | |
| Ptt21 | Gene - Glycosyltransferase | |
| Sgt1-1 | Gene - Glycosyltransferase | |
| Ptt#5–30 | Gene - Glycosyltransferase | |
| Sa#6–15 | Gene - Glycosyltransferase | |
| Sk#7–4 | Gene - Glycosyltransferase | |
| Sk#7–4 | Gene - Glycosyltransferase | |
| Sk#7–5 | Gene - Glycosyltransferase | |
| SaGT4A | Gene - Glycosyltransferase | |
| SaGT4R | Gene - Glycosyltransferase | |
| SaGT6 | Gene - Glycosyltransferase | |
| Sucrose accumulator | TIV1 | Gene - Acid invertase |
Fig. 5Graphical representation of the pepino unigenes positions on the chromosomes of potato (top) and tomato (bottom) and nucleotide-level variants distribution/density found on these chromosomes. a Distribution of the pepino unigenes on chromosomes, (b) ideograms of the 12 chromosomes, black bar indicates the approximated position of the centromere, (c) variants distribution/density, every column indicates the number of variants per Mb
Fig. 6Phylogenetic relationship among Solanaceae species. The number in the nodes indicates the estimated time of divergence (in millions of years). The length of the branches is proportional to the divergence time. Bootstrap values are not shown as it is 100 % in all nodes
Single sequence repeat (SSR) statistics according to the type of motif, the percentage of each motif and the amount of unigenes with SSRs. Complete information about these markers is shown in Additional file 10
| Di-nucleotide motif | Number of Di-SSR | % | Unigenes |
| AG | 123 | 48.62 | |
| AT | 101 | 39.92 | |
| AC | 29 | 11.46 | |
| Total | 253 | 100 | 253 |
| Tri-nucleotide motif | Number of Tri-SSR | % | Unigenes |
| AAG | 191 | 27.02 | |
| AAC | 103 | 14.57 | |
| AAT | 101 | 14.29 | |
| ATC | 93 | 13.15 | |
| ACC | 68 | 9.61 | |
| AGG | 56 | 7.92 | |
| AGC | 50 | 7.07 | |
| ACT, ACG, CCG | 45 | 6.36 | |
| Total | 707 | 100 | 684 |
| Tetra-nucleotide motif | Number of Tetra-SSR | % | Unigenes |
| AAAT | 28 | 25.00 | |
| AAAC | 17 | 15.18 | |
| AAAG | 16 | 14.29 | |
| AGAT | 10 | 8.93 | |
| AATC | 8 | 7.14 | |
| Others tetra-nucleotides | 33 | 29.46 | |
| Total | 112 | 100 | 112 |
Single nucleotide variants statistics for the S. muricatum and S. caripense transcriptomes
| Species | SNPs | INDELs |
|---|---|---|
|
| 11,735 | 766 |
|
| 30,668 | 1494 |
|
| 84,972 | 4058 |
Single nucleotide polymorphism (SNPs) statistics. Type and number of transitions and transversions are shown for high quality SNPs identified in each species and between them
| SNPs | Number (%) | SNPs | Number (%) | |
|---|---|---|---|---|
| Transitions | Transversions | Complex | ||
| A<->G | 26,684 (31.4) | A<->T | 8514 (10.0) | 212 |
| C<->T | 26,806 (31.5) | G<->T | 8311 (9.8) | |
| C<->G | 6250 (7.3) | |||
| A<->C | 8288 (9.7) | |||
| Total | 53,490 (62.9 %) | Total | 31,363 (36.9 %) | 212 (0.3 %) |
Distribution of pepino nucleotide-level variants on chromosomes of tomato and potato
| Tomato | Potato | |
|---|---|---|
| Chromosome 1 | 2931 | 3031 |
| Chromosome 2 | 2529 | 2619 |
| Chromosome 3 | 2441 | 2360 |
| Chromosome 4 | 1984 | 2151 |
| Chromosome 5 | 1548 | 1678 |
| Chromosome 6 | 2086 | 2204 |
| Chromosome 7 | 1867 | 1845 |
| Chromosome 8 | 1628 | 1637 |
| Chromosome 9 | 1610 | 1717 |
| Chromosome 10 | 1636 | 1656 |
| Chromosome 11 | 1530 | 1698 |
| Chromosome 12 | 1608 | 1846 |
| Total | 23,501 | 24,442 |
Fig. 7Fruits of Solanum muricatum var. Sweet Long (SL) (left) and of one of the wild relatives, S. caripense (EC-40) (right)
Fig. 8Schematic representxation of the overall sequencing and annotation workflow for the pepino transcriptome