| Literature DB >> 27338352 |
William Seffens1, Fisseha Abebe2, Chad Evans3, Xiao-Qian Wang4.
Abstract
RNAs have been shown to exhibit differential enrichment between nuclear, cytoplasmic, and exosome fractions. A current fundamental question asks why non-coding RNA partition into different spatial compartments. We report on the analysis of cellular compartment models with miRNA data sources for spatial-mechanistic modeling to address the broad area of multi-scalar cellular communication by miRNAs. We show that spatial partitioning of miRNAs is related to sequence similarity to the overall transcriptome. This has broad implications in biological informatics for gene regulation and provides a deeper understanding of nucleotide sequence structure and RNA language meaning for human pathologies resulting from changes in gene expression.Entities:
Keywords: exosome; miRNA; transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27338352 PMCID: PMC4926364 DOI: 10.3390/ijms17060830
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Gross properties of typical Human Transcriptome.
| Transcript Molecule | Size (nt) | Abundance (Copies) | Distinct Types | Notes |
|---|---|---|---|---|
| 28S rRNA | 5070 | 3.5 × 106 | 1 | Subunit in 80S ribosome |
| 18S rRNA | 1869 | 3.5 × 106 | 1 | Subunit in 80S ribosome |
| 5.8S rRNA | 156 | 3.5 × 106 | 1 | Subunit in 80S ribosome |
| 5S rRNA | 121 | 3.5 × 106 | 1 | Subunit in 80S ribosome |
| tRNA | ~85 | 3 × 107 | ~100 | 497 genes in 40 families, tissue specific |
| mRNA | 2 kb | 4 × 105 | 4 × 105 | Tissue specific, many isoforms |
| ncRNA | >200 | variable | >35,000 | Complex isoforms [ |
| miRNA | 22 | variable | 1000 |
Figure 1(a) Measures of number of unique and duplicate words in Simple Model transcriptome for various word size; (b) Correspondence to size of seed sequence for miRNAs. Red base pair is an allowed non-canonical matching.
Figure 2Resting miRNA cell tCount vs. Log fold change between exosome and cellular compartments. Word size was seven nt for calculation for tCount. Trendline added.
Figure 3Resting cell miRNA tWords vs. Log fold change between exosome and cellular compartments. Trendline added.
Figure 4Resting cell miRNA tWords minus randomized sequence tWord score. Trendline added.
Various measures of word similarity to simple model transcriptome from 4 data sources.
| Transcriptome Measures for Published Data Sets of miRNA | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Data Set | tCount | RANtCount | tWord | RANtWord | tW–tC | tC–RAN | tW–RAN | tC | tW | |
| has-miR- | 5.3 (2.9) | 5.7 | 9 (10.9) | 7.1 (1.9) | 3.7 (9.0) | −0.4 (3.2) | 1.9 (1.1) | −0.2 (2.4) | 1.1 (7.3) | 2588 |
| V-B All | 5.5 (2.9) | 6.1 (1.3) | 8.9 (8.8) | 7.8 (1.9) | 3.4 (6.8) | −0.6 (3.1) | 1.1 (9.0) | −0.3 (2.0) | 0.6 (4.6) | 151 |
| V-B EXO | 6.8 (2.9) | 6 | 12.5 (10.7) | 7.7 | 5.7 (8.8) | 0.8 (2.9) | 4.8 (10.7) | 0.4 (2.2) | 2.2 (5.8) | 75 |
| V-B CL | 4.3 (2.2) | 6.3 | 5.5 (4.0) | 8 | 1.2 (2.6) | −2.0 (2.7) | −2.4 (4.6) | −1.1 (1.5) | −1.0 (1.9) | 76 |
| Park All | 4.4 | 7.2 | 2.8 | −1.7 | −0.4 | −1.3 | −0.3 | 78 | ||
| Park NU | 4.0 (2.0) | 4.7 (2.6) | 0.7 (1.1) | −2.3 (2.3) | −3.2 (2.9) | −1.6 (1.8) | −1.4 (1.4) | 45 | ||
| Park CL | 5.0 (3.3) | 10.6 (16.4) | 5.6 (13.7) | −1.0 (3.7) | 3.2 (16.8) | −0.8 (2.4) | 1.1 (6.2) | 33 | ||
| G-F All | 5.4 (3.5) | 8.5 (8.6) | 3.1 (6.0) | 27 | ||||||
| G-F EXO | 7.9 (4.1) | 13.5 (11.5) | 5.6 (8.7) | 10 | ||||||
| G-F CL | 4.0 (2.2) | 6.1 (4.9) | 2.1 (3.2) | 10 | ||||||
Public database of miRNAs extracted 2588 human sequences. V-B from [12] with EXO exosome enriched or CL cytoplasmic miRNAs. Park from [13] with NUC nuclear enriched or CL cytoplasmic miRNAs. G-F from [16] with EXO exosome enriched or CL cytoplasmic miRNAs. Values in parenthesis are standard deviations. RANtCount and RANtWord are calculated from average of 4 randomized simple transcriptome words. tW-tC = tWord minus tCount and is a measure of the influence of frequent words in transcriptome. tC-RAN and tW-RAN are differences between tCount or tWord minus RANtCount or RANtWord, respectively. Z-scores from tCount and tWord calculated from RAN mean and SD of randomized simple transcriptome.
Figure 5Logical workflow of transcriptome modeling.
Construction of transcriptome cloud for word size of 7 and 8.
| Transcript | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | Total | Unique | Duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Transcript | nt length | 73 | 73 | 71 | 71 | 156 | 121 | 5034 | 1871 | 7470 | ||
| Word size | ||||||||||||
| Total words | 67 | 67 | 65 | 65 | 150 | 115 | 5028 | 1865 | 7422 | |||
| Unique words | 67 | 67 | 65 | 65 | 149 | 114 | 3355 | 1742 | 5625 | 4934 | 691 | |
| Duplicates | 0 | 0 | 0 | 0 | 1672 | 123 | 1797 | |||||
| Word size | ||||||||||||
| Total words | 66 | 66 | 64 | 64 | 149 | 114 | 5013 | 1864 | 7400 | |||
| Unique words | 66 | 66 | 64 | 64 | 149 | 114 | 4085 | 1831 | 6439 | 6439 | 288 | |
| Duplicates | 0 | 0 | 0 | 0 | 0 | 0 | 928 | 33 | 961 |
Simple transcriptome model 1 based on 4 tRNAs (transcripts 1-4) and 4 subunits of the ribosome (transcripts 5–8).