| Literature DB >> 30650673 |
Aleksey A Penin1,2, Anna V Klepikova3,4, Artem S Kasianov5,6, Evgeny S Gerasimov7,8, Maria D Logacheva9,10,11.
Abstract
The knowledge of gene functions in model organisms is the starting point for the analysis of gene function in non-model species, including economically important ones. Usually, the assignment of gene functions is based on sequence similarity. In plants, due to a highly intricate gene landscape, this approach has some limitations. It is often impossible to directly match gene sets from one plant species to another species based only on their sequences. Thus, it is necessary to use additional information to identify functionally similar genes. Expression patterns have great potential to serve as a source of such information. An important prerequisite for the comparative analysis of transcriptomes is the existence of high-resolution expression maps consisting of comparable samples. Here, we present a transcriptome atlas of tomato (Solanum lycopersicum) consisting of 30 samples of different organs and developmental stages. The samples were selected in a way that allowed for side-by-side comparison with the Arabidopsis thaliana transcriptome map. Newly obtained data are integrated in the TraVA database and are available online, together with tools for their analysis. In this paper, we demonstrate the potential of comparing transcriptome maps for inferring shifts in the expression of paralogous genes.Entities:
Keywords: RNA-seq; Solanum lycopersicum; tomato; transcriptome atlas
Mesh:
Year: 2019 PMID: 30650673 PMCID: PMC6356586 DOI: 10.3390/genes10010050
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Hierarchical clustering of samples as represented by a clustering tree. Distance between samples is measured as 1 − Pearson r2 correlation coefficient.
Analysis of splicing sites based on newly obtained data.
| Without Filtering | Filter 1 (Identification in Two Samples) | Filter 2 (Identification in Two Replicates) | |
|---|---|---|---|
| Introns, total | 375,650 | 240,224 | 168,243 |
| Not annotated but found | 266,580 | 132,884 | 62,834 |
| Annotated but not found | 14,547 | 16,277 | 18,208 |
Figure 2Estimation of the completeness of the expression map. (a) The number of expressed genes at different thresholds in three publicly available datasets and in our dataset; (b) the number of detected splice sites under different filters in three publicly available datasets and in our dataset.
Figure 3Distribution of identity and expression distance for orthopairs, random pairs, interspecific pairs from ortho-triplets, and interspecific pairs from random triplets. (a) The distribution of expression distance; (b) the distribution of identity.
Medians of distributions of identity and expression distance.
| Expression Distance | Identity | ||
|---|---|---|---|
| Orthopairs | 3.68 | Orthopairs | 58.48 |
| Random pairs | 17.42 | Random pairs | 8.40 |
| Minimal distance in interspecific pairs from ortho-triplets | 4.12 | Maximal identity in interspecific pairs from ortho-triplets | 58.78 |
| Maximal distance in interspecific pairs from ortho-triplets | 8.52 | Minimal identity in interspecific pairs from ortho-triplets | 55.01 |
| Minimal distance in interspecific pairs from random triplets | 9.76 | Maximal identity in interspecific pairs from random triplets | 10.96 |
| Maximal distance in interspecific pairs from random triplets | 37.64 | Minimal identity in interspecific pairs from random triplets | 6.18 |
Figure 4Expression patterns in ortho-triplet AT5G36930, Solyc09g092410, and Solyc09g007710. Identities are almost equal, while expression distances differ ~20-fold. Note that while Solyc09g007710, similar to AT5G36930, is expressed in leaves, Solyc09g092410 has a drastically different expression profile, with a maximum expression level in roots.