| Literature DB >> 29769290 |
Alexander Q Wixom1, N Carol Casavant1, Joseph C Kuhl1, Fangming Xiao1, Louise-Marie Dandurand2, Allan B Caplan3.
Abstract
Solanum sisymbriifolium, also known as "Litchi Tomato" or "Sticky Nightshade," is an undomesticated and poorly researched plant related to potato and tomato. Unlike the latter species, S. sisymbriifolium induces eggs of the cyst nematode, Globodera pallida, to hatch and migrate into its roots, but then arrests further nematode maturation. In order to provide researchers with a partial blueprint of its genetic make-up so that the mechanism of this response might be identified, we used single molecule real time (SMRT) sequencing to compile a high quality de novo transcriptome of 41,189 unigenes drawn from individually sequenced bud, root, stem, and leaf RNA populations. Functional annotation and BUSCO analysis showed that this transcriptome was surprisingly complete, even though it represented genes expressed at a single time point. By sequencing the 4 organ libraries separately, we found we could get a reliable snapshot of transcript distributions in each organ. A divergent site analysis of the merged transcriptome indicated that this species might have undergone a recent genome duplication and re-diploidization. Further analysis indicated that the plant then retained a disproportionate number of genes associated with photosynthesis and amino acid metabolism in comparison to genes with characteristics of R-proteins or involved in secondary metabolism. The former processes may have given S. sisymbriifolium a bigger competitive advantage than the latter did.Entities:
Keywords: R-genes; S. sisymbriifolium; SMRT; solanum; transcriptome
Mesh:
Year: 2018 PMID: 29769290 PMCID: PMC6027862 DOI: 10.1534/g3.118.200327
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of the SSI transcriptome derived using SMRT technology. Organ sub-transcriptomes were sequenced and combined from 33,170 root, 99,924 bud, 50,825 leaf, and 47,793 stem reads. See Supplemental Figure 2 and Supplemental Figure 3 for gene ontology bins of this transcriptome
| SMRT Assembly | SSI | |
|---|---|---|
| Total raw reads | 231,712 | |
| Read lengths | 300–7883 | |
| Total raw reads size (bp) | 362,086,346 | |
| GC content | 41.17 | |
| Transcripts | Number | 139,611 |
| Total length | 237,865,670 | |
| N50 | 2,050 | |
| Max length | 7,883 | |
| GC content | 40.97 | |
| Unigenes | Number | 41,189 |
| Total length | 74,642,518 | |
| N50 | 2,158 | |
| Max length | 7,883 | |
| GC content | 39.90 | |
BUSCO assessment for completeness of 3 transcriptomes and one genome. The SSI transcriptome appears to be nearly complete, but contains a disproportion number of duplicated sequences. See Supplementary Table 4 for sources of datasets. ATH, Arabidopsis thaliana
| SSI | Tomato | Potato | ATH (Genome) | |
|---|---|---|---|---|
| Complete BUSCOs | 93% | 96.2% | 86.7% | 97.7% |
| Complete Single-copy BUSCOs | 62.8% | 94.2% | 64.1% | N/A |
| Complete Duplicated BUSCOs | 30.2% | 2.0% | 22.6% | N/A |
| Fragmented BUSCOs | 2.0% | 0.8% | 4.7% | 0.6% |
| Missing BUSCOs | 4.1% | 3.0% | 8.6% | 1.7% |
Divergent gene assessment of allele and/or paralog number in the SSI transcriptome. 4-allele genes were mapped to tomato chromosomes, see Supplemental Table 2
| Allele or Paralog | # of genes |
|---|---|
| Homozygous | 17773 |
| 2 | 22098 |
| 3 | 1358 |
| 4 | 44 |
Figure 1Shared and restricted orthologous genes among 13 species. All species shown here shared 6067 core orthologs. Each petal shows the number of gene groups unique to each species. For visualization purposes, each group was differentially shaded according to the number of genes in the set, ranging from SLY (least) to ZMA (most). Not shown are groups shared by only 2–12 species. Solanum sisymbriifolium, SSI; Solanum tuberosum, STU; Solanum lycopersicum, SLY; Solanum melongena, SME; Arabidopsis thaliana, ATH; Carica papaya, CPA; Vitis vinifera, VVI; Prunus persica, PPE; Populus trichocarpa, PTR; Citrus sinensis, CSI; Medicago truncatula, MTR; Zea mays, ZMA; and Oryza sativa, OSA. See Supplemental Figure 4 for gene ontology bins for the SSI unique groups, and Supplemental Table 4 for sources of datasets.
Figure 2Final SMRT transcriptome sequences were backtracked through the de-redundification process to the organ sub-transcriptomes. A) Upset plot (Lex ) of genes expressed in one or more organs. Each vertical bar shows the number of genes expressed in the organ(s) indicated by the intersection matrix below it (where a single dot in the matrix is a single organ, 2 dots = 2 organs, etc.). The number of genes found to have a homolog in at least one of the organs is indicated by the horizontal bar graph extending to the left. B) Green-tissue specificity of sequences annotated as genes involved in the light harvesting complex-I pathway via Mercator. C) Sequences annotated as putative resistance genes because they contained a nucleotide binding domain (NB-ARC) and leucine rich repeat (LRR) domains show varied expression patterns. As shown on the scales on the right of (B) and (C), the darker the color, the more times the sequence was found in that organ.
R-gene profile of potato (STU), tomato (SLY), and the SSI transcriptome. The SSI database had fewer assigned R genes (based on the presence of nucleotide-binding domains and leucine-rich repeats within the same open reading frame) than either SLY or STU genomes. Refer to Supplemental Table 1 for full domain annotation statistics
| Transcriptome | R-genes |
|---|---|
| SSI (SMRT) | 67 |
| STU | 309 |
| SLY | 137 |
Figure 3Comparison of expression of 3 putative R-gene sequences in the SMRT database to semi-quantitative PCR from 2 cDNA preparations. A) The expression of three genes with LRR and NB-ARC domains characteristic of the R-genes and an actin isoform is shown in the heat map at the left and compared on the right to semi-quantitative PCR of those same genes in two independently prepared cDNA pools, one from the pool used to generate the transcriptome (Sequenced) and one prepared independently, and not used to make the transcriptome (Unsequenced). B) The expression of each PCR product from each pool was quantified and then normalized to the expression of an actin isoform (Ssi032526). Data (biological replicates, n = 2) are represented as mean ± STD. See Supplemental Table 3 for primers used.