| Literature DB >> 30217147 |
Peter A Noble1, Alexander E Pozhitkov2.
Abstract
BACKGROUND: Our previous study found that more than 500 transcripts significantly increased in abundance in the zebrafish and mouse several hours to days postmortem relative to live controls. The current literature suggests that most mRNAs are post-transcriptionally regulated in stressful conditions. We rationalized that the postmortem transcripts must contain sequence features (3- to 9- mers) that are unique from those in the rest of the transcriptome and that these features putatively serve as binding sites for proteins and/or non-coding RNAs involved in post-transcriptional regulation.Entities:
Keywords: 3’UTR; 5’UTR; Chaos game representation; Molecular sponge; Motifs; Mouse; ORFs; Post-transcriptional regulation; Postmortem gene expression; Sequence features; Stress response; Zebrafish
Mesh:
Substances:
Year: 2018 PMID: 30217147 PMCID: PMC6137749 DOI: 10.1186/s12864-018-5042-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic representation of the study experimental design
Average ± standard deviation of 333 gene transcripts in the mouse and 230 transcripts in the zebrafish that contained unique mers by group (OP vs. CP). The absolute difference in unique mer counts by group and mer length is shown
| Animal | Mer length | Num transcripts (OP) | Num transcripts (CP) | Absolute Difference |
|---|---|---|---|---|
| Mouse | 2 | 333 ± 0.25 | 333 ± 0.4 | 0 ± 0.1 |
| 3 | 330 ± 6.9 | 329 ± 7.9 | 2 ± 3.0 | |
| 4 | 304 ± 30.3 | 304 ± 30.4 | 7 ± 7.2 | |
| 5 | 227 ± 56.0 | 226 ± 55.9 | 13 ± 9.5 | |
| 6 | 119 ± 52.5 | 117 ± 50.7 | 74 ± 45.2 | |
| 7 | 43 ± 27.8 | 42 ± 25.5 | 6 ± 7.3 | |
| 8 | 13 ± 10.8 | 12 ± 9.1 | 3 ± 4.2 | |
| 9 | 3 ± 4.0 | 2 ± 1.9 | 3 ± 3.3 | |
| Zebrafish | 2 | 230 ± 0.0 | 230 ± 0.1 | 0 ± 0.0 |
| 3 | 230 ± 0.6 | 229 ± 1.8 | 1 ± 1.5 | |
| 4 | 220 ± 11.8 | 211 ± 15.5 | 9 ± 6.1 | |
| 5 | 166 ± 35.7 | 148 ± 33.5 | 18 ± 8.1 | |
| 6 | 81 ± 34.9 | 70 ± 29.4 | 12 ± 8.9 | |
| 7 | 27 ± 17.5 | 23 ± 14.2 | 5 ± 5.2 | |
| 8 | 8 ± 6.6 | 7 ± 5.0 | 2 ± 2.5 | |
| 9 | 2 ± 2.4 | 2 ± 1.5 | 1 ± 1.2 |
Fig. 2Mer counts as a function of mer length. Hatched line, mouse; solid line, zebrafish. Panel a, Difference in average mer counts by group (OP vs. CP); Panel b, individual mer counts that were 5 time stdev of average of the CP; Panel c, is the same results as panel B except normalized to the number of possible mer combinations and shown as a percentage; Panel d, number of mer counts that were 5 times stdev of the average CP due to random chance; average ± stdev of 3 random selections (without replacement)
Number of unique mers in transcripts of the OP by mer length and organism
| Mer length | Zebrafish | Mouse |
|---|---|---|
| 4 | 1 | 0 |
| 5 | 31 | 1 |
| 6 | 67 | 62 |
| 7 | 118 | 279 |
| 8 | 356 | 819 |
| 9 | 1672 | 3956 |
| Sum | 2245 | 5117 |
Unique mers common to transcripts of the OP for the zebrafish and mouse
| Mer length | Mer |
|---|---|
| 6 | AAAUAC, AACGAA, ACAUAA |
| 7 | UGUGAAC, AUCUCCA, AAAUACA, UAGGUUA, CAUGAAA |
| 8 | CAGAAAGC, GUAAAGUC, GCACAAAG, ACGAAUAC, AGAAGAGU, CAUGUGAA, AAAUACAU, AUAGGUUA |
| 9 | CCAAUGUGG, CUAUGAAGG, AAGUCCCAG, CUGACAGUC, UUCUCUGUG, GUUUCUGUG, CUAUGUCUG, AUACAAGUG, GCAAGGUUC, CAUGUGAAC, UCUAUGAAG, AUAGGUUAC, UCUGGGGCA, CCUGCUGCU, UAUCAUCGA, AAAAGAUCA, AUUCAAUGU, AAGAAAUCA, ACAAAAUCA, CUUCUCCAU, CAGAACCAU, UUUAACCAA, CAUGCAGAA, CUGGAAGAA, AUACAUCAA, AAAGAUCAU, CAGUAUGAA, AGAAAUCAU, CCUACGAAU, GUCCUGAAA, AACAUGAAA |
Unique mers that were reverse complements by length and organism
| Organism | Mer length | Mers |
|---|---|---|
| Zebrafish | 5 | UGUAU, AUACA |
| 6 | GUAUUU, UCAAAA, UUUUGA, AAAUAC, AUUUUU, AUAUAU, AAAAAU | |
| 7 | UGUAUUU, CGUUGUU, CAUUUUG, CAAAAUG, ACAAAAU, AACAACG, AUUGUAU, AUACAAU, AUUUUGU, AAAUACA | |
| 8 | CAUUUUGA, UCAAAAUG | |
| 9 | GGCGGCAAG, CCAGGCUCA, CGUCUAGGU, GCUAGGGAC, GUCCCUAGC, CUUGCCGCC, ACCUAGACG, UGAGCCUGG, AGUAGGCUA, UAGCCUACU | |
| Mouse | 6 | CUAUAG, AUGCAU |
| 7 | ACCUAUA, AUGACUG, CAGUCAU, GUCUCUA, UAGAGAC, UAUAGGU, UCUAGAA, UUCUAGA | |
| 8 | CCAUGACU, GGUUACAU, CCUAUAGG, GUCUCUAC, GUAGAGAC, CUUCUAGA, CUAGAAGU, CUAUGACU, GUAUGAAU, CUAUAGGU, ACCUAUAG, UCUGCAGA, AGUCAUGG, AGUCAUAG, UCUAGAAG, ACUUCUAG, UUCUAGAA, AUGUAACC, UUUGCAAA, AAUGCAUU, AAAGCUUU, AUUCAUAC | |
| 9 | CGGAGAGAA, GCGAAGACA, CCCUUCUUC, GCUGCUGCU, CCUGGAACU, CCAGUGUGA, CCUGAGUUC, CCUCUUCUG, GGUCUUCAA, CCUUGAACU, CCAACAUCA, GGUUUCUCU, GGUUACAGU, GGUUACAUU, GAGGGCAUC, GUGGCUCAC, CAGGGAAGA, CUGCUGCUG, CAGCAGCAG, CUCCAGCAU, CUGCUCUCU, CUGCAGAAG, CAGGAGAAA, CUCCUUCCU, CAGGAAGCA, CAGGAAGGA, GUGAGCCAC, GUCUCCUGU, GAGUGGUAG, GUCUCCAAA, CACAGAGAA, CUGAGUUCA, CAGAGAAAA, GUCUUCGCU, CAGAAGAGG, CUGAAGACA, GUCUUCAGA, CUGAAGAUG, CAGAAGAUG, CAGAAAGCA, CAGUAUGAA, GAUGCCCUC, CUUCCCAUC, GAUGGGAAG, CUACCACUC, CUUCCUCUU, GAACCUUUU, CUUCUGCAG, CUUGAGGAA, GAACUCAGG, GAACACACA, GAAGACACA, CUUCACUUG, CAAGUGAAG, GAAGAAGGG, CAUCUUCUG, CAUCUUCAG, CUUCUAGAA, GAAGAUGAU, GAAGAAGAA, CAAAGCCUU, CAAAGACUU, CAAACUUCU, GUUACAUUU, GUAAAGACU, CAAAUGUAA, CUUUUAAAA, UCCCAGCAA, UGGGAAGGA, AGCGAAGAC, UGCUGGGAA, AGCAGCAGC, UGCUGCUGU, UCCUGCAAA, UGCAGAAGA, UCCUUCCCA, UCCUUCCUG, UGCUUCCUG, AGGAAGGAG, UGCUUUCUG, ACAGCAGCA, ACAGGAGAC, ACACCAACA, AGAGAGCAG, UCACACUGG, UCUCUGUGU, ACACAGAGA, UGUGUCUUC, UGUGUGUUC, ACAGAGAAA, UGUCUUCGC, UCUGAAGAC, UGUCUUCAG, ACUGUAACC, AGAGAAACC, ACUGUUUCU, ACACAUACA, AGUCAUAGU, AGAGUUUCU, AGUCUUUAC, UCUUCCCUG, AGUUCCAGG, UGUUGGUGU, UCUUCUGCA, UGAACUCAG, UCUUCACAA, UGAUGUGUU, UGAUGUUGG, AGUUCAAGG, UCUAGAAGU, UCAUCUUCU, UCAUCUUCA, UGUUCUUCA, UGAAGUUCU, UGAAGAUGA, UGAAGAACA, ACUUCUAGA, AGUUGUUCU, AGAACUUCA, AGAACAACU, AGAAGAUGA, AGAAGUUUG, UGUAUGUGU, ACUAUGACU, AGAAACUCU, AGAAACAGU, AGUAUGAAU, UCAAAAACA, UGUUUUUGA, UGUUAUAAA, UUCCCAGCA, AAGGCUUUG, UUGCUGGGA, AUGCUGGAG, UUCCUCAAG, UUGCAGAAA, AAGCAGUUA, UUCCUUCUU, AAGAGGAAG, UUCUCUCCG, UUCUCUGUG, UUGUGAAGA, AACACAUCA, AAGUCUUUG, AAGAAGGAA, UUGAAGACC, UUCUUCUUC, UUCUAGAAG, AUCAUCUUC, AUCAUGAAA, UUCAUACUG, UUGUUUUUU, AAGUUAUUA, AAAGCCUUU, AAAGGCUUU, UUUGCAGGA, UUUGGAGAC, AAAGCUUUU, UUUCUCCUG, UUUCUGCAA, UAACUGCUU, UUUCUCUGU, AAAGUCUUU, AAAGACUUU, UUUCAUGAU, AAUGUAACC, AUUCAUACU, UUACAUUUG, UUUCAAAAA, UAUGUAUUU, AAAAGGUUC, AAAAGCUUU, UUUUCUCUG, AAAUGUAAC, UUUUCAAAA, UUUUGUUUU, UUUUGAAAA, AAAACAAAA, UUUUUGAAA, AAAUACAUA, UAAUAACUU, AAAAAACAA, UUUAUAACA, UUUUAAAAG, UUUUUUUAA, UUAAAAAAA |
Fig. 3Distribution of unique mers per gene transcript in the zebrafish. a, unique mers in OP; b, multiple unique mers in OP; c, unique mers in CP (3 independent random selections; each as a different shade of grey); d, multiple unique mers in CP (3 independent random selections)
Fig. 4Distribution of unique mers per gene transcript in the mouse. a, unique mers in OP of the mouse; b, multiple unique mers in OP of the mouse; c, unique mers in CP of mouse (3 independent random selections; each displayed as a different shade of grey); d, multiple unique mers in CP of the mouse (3 independent random selections)
Fig. 5Heatmap of transcript groups and mer sets for the zebrafish. M, count of mers in group; N, count of transcripts in group. White, high count; yellow-orange, median count; red, low count
Fig. 6Heatmap of Transcript groups and mer sets for the mouse. M, count of mers in group; N, count of transcripts in group. White, high count; yellow-orange, median count; red, low count
Fig. 7Number of multiple unique mers in transcripts versus transcript length. a, zebrafish; b, mouse; Red, deviant transcripts. Red dots in the zebrafish correspond to Pimr transcripts; Red dots in the mouse represent 47 transcripts (see text)
Fig. 8Ordination plot of transcripts with high mer densities (a) and network of transcripts with shared mers (b). The ordination was based on the correlations among mouse brain transcript profiles. The network was based on the number of shared mers in subset of the transcript profiles with high R2 (> 0.95) to the transcripts with high mer densities. The network shows that the transcripts with high mer densities (i.e., molecular sponges) shared mers with many other transcripts
Number of unique mers by nucleotide (transcript length), region and organism. Two-way paired t-test across rows: a,b, p < 0.0001; c,d p < 0.01
| Organism | Regions | Number of transcripts | 5’UTR | ORF | 3’UTR | Non-coding |
|---|---|---|---|---|---|---|
| Zebrafish | 5’UTR, ORF, 3’UTR | 70 | 0.2 ± 0.26a | 0.2 ± 0.28a | 0.3 ± 0.35b | – |
| ORF, 3’UTR | 4 | – | 0.1 ± 0.01 | 0.1 ± 0.03 | – | |
| 5’UTR, ORF | 1 | 0.1 | 0.1 | – | ||
| ORF | 3 | – | 0.1 ± 0.03 | – | – | |
| Non-coding | 1 | – | – | – | 0.1 | |
| Mouse | 5’UTR, ORF, 3’UTR | 65 | 0.4 ± 0.60a | 0.6 ± 0.70b | 0.5 ± 0.80 | – |
| ORF, 3’UTR | 2 | – | 0.1 ± 0.00 | 0.1 ± 0.00 | – | |
| 5’UTR, ORF | 16 | 1.4 ± 0.80c | 2.0 ± 0.70d | – | – | |
| ORF | 11 | – | 2.3 ± 0.50 | – | – | |
| Non-coding | 10 | – | – | – | 0.4 ± 0.50 |
Number of transcripts by known protein binding site and organism. Hud binding site, YUNNYUY [21]; Rbfox binding site, UGCAUG [10]; and UAUUUAU, GAGAAAA, AGAGAAA, UUUGCAC, AUGUGAA, UUGCACA, GGGAAGA [22]
| Organism | Protein binding sites | ||||
|---|---|---|---|---|---|
|
|
| Jacobsen et al. | All three | ||
| Mouse | 333 | 287 | 126 | 258 | 314 |
| Zebrafish | 230 | 185 | 0 | 106 | 189 |
Fig. 9Number of known binding sites per transcripts for the mouse (a) and zebrafish (b). Total number of transcripts for the mouse, n = 333 and for the zebrafish, n = 230. The following binding sites were examined: Hud binding site, YUNNYUY [21]; Rbfox binding site, UGCAUG [10]; and UAUUUAU, GAGAAAA, AGAGAAA, UUUGCAC, AUGUGAA, UUGCACA, GGGAAGA [22]. Note: the zebrafish did not have Rbfox binding sites
Fig. 10Gene transcript abundances measured by a calibrated microarray [41, 42] (log transformed) by postmortem time. Abundances were normalized to flash frozen live controls (L). Black line, average. (a) Hud transcript in mouse; black dots, averaged abundance measured by probe A_55_P1990309 (n = 3 replicates for each dot except 48 h where n = 2 replicates); white dots, average abundance measured by probe A_55_P1990314; (b) Rbfox transcript in mouse; black dots, average abundance measured by probe A_55_P195339` (n = 3 replicates for each dot except last where n = 2 replicates); white dots, average abundance of probe A_55_P1953400; (c) Hud transcript in zebrafish; black dots, average abundance of probe A_15_P119510 (n = 2 replicates for each dot); white dots, average abundance of probe A_15_P120793. Data are from ref. [4]
Fig. 11Gene transcript abundances measured by a calibrated microarray [41, 42] (log transformed) by postmortem time. Abundances were normalized to flash frozen live controls (L). Black line, average. (a) Mouse: Open circle, represents Gm11007, Gm2007, Gm4631, Gm14434, Gm2026, Gm14305, Gm14399, Gm14325, Zfp969, Gm4724, Gm14326 transcripts; closed circle, Zfp967, Zfp969, Zfp968; open square, Gm14410; closed square, Gm14305; open triangle, Gm14322; closed triangle, Gm14308; closed diamond, Gm14412. All points are the average of 3 replicates per sample time except the 48 h, which is the average of 2 replicates. (b) Zebrafish: Pimr transcript. Each point in the zebrafish represents the average of two individuals per sample time. Data are from ref. [4]