| Literature DB >> 31120958 |
Ganesh Warthi1, Hervé Seligmann2,3.
Abstract
Raw transcriptomic data contain numerous RNA reads whose homology with template DNA doesn't match canonical transcription. Transcriptome analyses usually ignore such noncanonical RNA reads. Here, analyses search for noncanonical mitochondrial RNAs systematically deleting 1 to 12 nucleotides after each transcribed nucleotide triplet, producing deletion-RNAs (delRNAs). We detected delRNAs in the human whole cell and purified mitochondrial transcriptomes, and in Genbank's human EST database corresponding to systematic deletions of 1 to 12 nucleotides after each transcribed trinucleotide. DelRNAs detected in both transcriptomes mapped along with 55.63% of the EST delRNAs. A bias exists for delRNAs covering identical mitogenomic regions in both transcriptomic and EST datasets. Among 227 delRNAs detected in these 3 datasets, 81.1% and 8.4% of delRNAs were mapped on mitochondrial coding and hypervariable region 2 of dloop. Del-transcription analyses of GenBank's EST database confirm observations from whole cell and purified mitochondrial transcriptomes, eliminating the possibility that detected delRNAs are false positives matches, cytosolic DNA/RNA nuclear contamination or sequencing artefacts. These detected delRNAs are enriched in frameshift-inducing homopolymers and are poor in frameshift-preventing circular code codons (a set of 20 codons which regulate reading frame detection, over- and underrepresented in coding and other frames of genes, respectively) suggesting a motif-based regulation of non-canonical transcription. These findings show that rare non-canonical transcripts exist. Such non canonical del-transcription does increases mitochondrial coding potential and non-coding regulation of intracellular mechanisms, and could explain the dark DNA conundrum.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31120958 PMCID: PMC6532905 DOI: 10.1371/journal.pone.0217356
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Principle of systematic deletion transcription.
This principle is used for the construction of different deletion transformed versions of the mitogenome. To construct the delRNA3-1 transformation of the mitogenome, the 4th nucleotide following each transcribed trinucleotide is deleted. Similarly, every 4th and 5th nucleotides are deleted in delRNA3-2 transformations of the mitogenome. The nucleotide(s) highlighted are the nucleotide(s) missing in del-transcribed delRNAs. These principles work for any deletion size window of k (1 to 12) nucleotides. Where k is the number of nucleotides deleted after each transcribed nucleotide triplet.
Fig 2Mitogenomic versions of delRNA3-3 and delRNA3-4.
Nucleotides highlighted yellow are deleted during transcription. (A) Possible mitogenomic versions for del-transcription window size six, k = 3 (delRNA3-3.n). (B) Possible mitogenomic versions for del-transcription window size seven, k = 4 (delRNA3-4-n). ‘n’: nucleotides deleted before del-transcription initiation assumed to cover all possibilities (highlighted red).
Fig 3Number of delRNAs detected in whole cell and mitochondrial transcriptome for each systematic nucleotide deletions (k).
Fig 4Number of delRNAs detected in Genbank’s Human EST database and purified mitochondrial transcriptome for each systematic nucleotide deletions (k).
Fig 5Number of delRNAs detected in Genbank's human EST database and whole cell transcriptome transcriptome for each systematic nucleotide deletions (k).
Fig 6Numbers of overlapping delRNAs in three different datasets.
The Venn diagram shows the number of overlapping delRNAs across the three independent datasets and numbers of delRNAs not overlapping with any of the other two datasets. Total numbers of delRNAs detected in the whole cell transcriptome, the mitochondrial transcriptome and the human EST datasets are 968, 1767 and 1395, respectively.
Fig 7DelRNAs detected in three independent datasets mapped on coding and dloop hypervariable region.
Mitogenome composition (Protein coding genes: 68.44%, Dloop: 6.7%, rRNA region: 15.17%, tRNA region:9.35%).
Homopolymer frequencies of delRNAs detected in whole cell and purified mitochondrial transcriptome.
| delRNA3-k | No. of trinucleotides | No. of homopoly-mers in | Percent Homopolymers (%) | Ratio (A/B) | P of Chi-square | |||
|---|---|---|---|---|---|---|---|---|
| In detected delRNAs | In remaining del-transformed mitogenome | In detected delRNAs (A) | In remaining del-transformed mitogenome (B) | |||||
| Whole cell transcriptomes | delRNA3-1 | 607 | 15962 | 76 | 12.5 | 8.74 | 1.43 | 0.00 |
| delRNA3-2 | 723 | 15846 | 116 | 16.04 | 8.55 | 1.87 | 0.00 | |
| delRNA3-3 | 766 | 15803 | 90 | 11.77 | 8.74 | 1.34 | 0.00 | |
| delRNA3-4 | 879 | 15690 | 125 | 14.22 | 8.58 | 1.66 | 0.00 | |
| delRNA3-5 | 1078 | 15491 | 136 | 12.62 | 8.62 | 1.46 | 0.00 | |
| delRNA3-6 | 911 | 15658 | 92 | 10.1 | 8.81 | 1.15 | 0.18 | |
| delRNA3-7 | 963 | 15607 | 113 | 11.73 | 8.7 | 1.35 | 0.00 | |
| delRNA3-8 | 1019 | 15555 | 131 | 12.86 | 8.61 | 1.49 | 0.00 | |
| delRNA3-9 | 980 | 15589 | 143 | 14.59 | 8.52 | 1.71 | 0.00 | |
| delRNA3-10 | 1386 | 15189 | 181 | 13.06 | 8.49 | 1.54 | 0.00 | |
| delRNA3-11 | 1079 | 15497 | 137 | 12.7 | 8.61 | 1.47 | 0.00 | |
| delRNA3-12 | 1846 | 14729 | 233 | 12.62 | 8.41 | 1.5 | 0.00 | |
| Purified mitochondrial cell line transcriptomes | delRNA3-1 | 358 | 16211 | 43 | 12.01 | 8.81 | 1.36 | 0.02 |
| delRNA3-2 | 1581 | 14988 | 201 | 12.71 | 8.47 | 1.50 | 0.00 | |
| delRNA3-3 | 1309 | 15260 | 168 | 12.83 | 8.54 | 1.50 | 0.00 | |
| delRNA3-4 | 1694 | 14875 | 201 | 11.87 | 8.54 | 1.39 | 0.00 | |
| delRNA3-5 | 1885 | 14684 | 191 | 10.13 | 8.72 | 1.16 | 0.02 | |
| delRNA3-6 | 1612 | 14957 | 193 | 11.97 | 8.55 | 1.40 | 0.00 | |
| delRNA3-7 | 1395 | 15174 | 156 | 11.18 | 8.66 | 1.29 | 0.00 | |
| delRNA3-8 | 1812 | 14757 | 217 | 11.98 | 7.6 | 1.58 | 0.00 | |
| delRNA3-9 | 1624 | 14945 | 217 | 13.36 | 8.39 | 1.59 | 0.00 | |
| delRNA3-10 | 1400 | 15169 | 180 | 12.86 | 8.51 | 1.51 | 0.00 | |
| delRNA3-11 | 1240 | 15329 | 213 | 17.18 | 8.21 | 2.09 | 0.00 | |
| delRNA3-12 | 1317 | 15252 | 213 | 16.17 | 8.25 | 1.96 | 0.00 | |
| Mitogenomic regions covered by delRNAs in whole cell and purified mt RNA data | delRNA3-1 | 108 | 16461 | 14 | 12.96 | 8.85 | 1.46 | 0.09 |
| delRNA3-2 | 240 | 16329 | 47 | 19.58 | 8.72 | 2.24 | 0.00 | |
| delRNA3-3 | 266 | 16303 | 42 | 15.79 | 8.76 | 1.80 | 0.00 | |
| delRNA3-4 | 327 | 16242 | 47 | 14.37 | 8.77 | 1.64 | 0.00 | |
| delRNA3-5 | 370 | 16199 | 55 | 14.86 | 8.74 | 1.70 | 0.00 | |
| delRNA3-6 | 230 | 16339 | 28 | 12.17 | 8.88 | 1.38 | 0.06 | |
| delRNA3-7 | 269 | 16300 | 37 | 13.75 | 8.80 | 1.56 | 0.00 | |
| delRNA3-8 | 403 | 16166 | 57 | 14.14 | 7.87 | 1.80 | 0.00 | |
| delRNA3-9 | 320 | 16249 | 59 | 18.44 | 8.69 | 2.12 | 0.00 | |
| delRNA3-10 | 369 | 16200 | 65 | 17.62 | 8.68 | 2.03 | 0.00 | |
| delRNA3-11 | 272 | 16297 | 68 | 25.00 | 8.61 | 2.90 | 0.00 | |
| delRNA3-12 | 436 | 16133 | 78 | 17.89 | 8.63 | 2.07 | 0.00 | |
The table shows the percentage of homopolymers in delRNAs detected for each del-transformation in whole cell and purified mitochondrial cell line transcriptomes. It also shows the homopolymer percentage in mitogenomic regions covered by overlapping delRNAs detected in both whole cell and purified mitochondrial RNA datasets. Homopolymer frequencies in detected delRNAs are higher for all del transformations in the whole cell transcriptome, mitochondrial transcriptome and mitogenomic regions covered by delRNAs from both RNA datasets than in the remaining del-transformed mitogenome. Considering the frequencies of A, C, G and T in the human mitogenome, the expected frequency of homopolymer triplets is 0.077496. All results find more homopolymer triplets in del-transformed human mitogenome versions than expected using the mononucleotide frequencies: A = 0.303931257 C = 0.31269238, G = 0.13090712 and T = 0.24708794. This is also the case for the untransformed human mitogenome, for which the expected homopolymer triplet frequencies are AAA = 0.02959325; CCC = 0.03057397; GGG = 0.00224331 and TTT = 0.01508532 (total 0.07749586) and the observed frequency is 0.0887199.
Frequencies of codons belonging to the universal circular code X {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} in delRNAs detected in transcriptomic data.
| delRNA3-k | Whole cell | Mito-transcriptome | Overlapping delRNAs | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Percentage of X (%) | Ratio | P of Chi-square | Percentage of X (%) | Ratio | P of Chi-square | No. of X) | Percentage of X (%) | |||
| In del-RNAs | In remaining del-transformed mitogenome (B) | In del-RNAs | In remaining del-transformed mitogenome (D) | |||||||
| delRNA3-1 | 30.48 | 30.470 | 1.00 | 1.00 | 30.17 | 30.48 | 0.99 | 0.89 | 34 | 31.48 |
| delRNA3-2 | 28.77 | 30.55 | 0.94 | 0.31 | 28.97 | 30.63 | 0.95 | 0.17 | 62 | 25.83 |
| delRNA3-3 | 27.73 | 30.62 | 0.91. | 0.006 | 30.10 | 30.50 | 0.99 | 0.76 | 75 | 28.20 |
| delRNA3-4 | 26.73 | 30.68 | 0.87 | 0.01 | 28.39 | 30.71 | 0.92 | 0.05 | 88 | 26.91 |
| delRNA3-5 | 28.85 | 30.59 | 0.94 | 0.23 | 30.13 | 30.52 | 0.99 | 0.73 | 95 | 25.67 |
| delRNA3-6 | 32.49 | 30.36 | 1.07 | 0.17 | 25.25 | 30.86 | 0.82 | 0.00 | 58 | 25.22 |
| delRNA3-7 | 30.22 | 30.49 | 0.99 | 0.86 | 29.89 | 30.53 | 0.98 | 0.62 | 80 | 29.74 |
| delRNA3-8 | 30.62 | 30.45 | 1.01 | 0.92 | 29.80 | 30.55 | 0.96 | 0.51 | 128 | 31.76 |
| delRNA3-9 | 25.92 | 30.76 | 0.84 | 0.00 | 29.67 | 30.56 | 0.97 | 0.47 | 93 | 29.06 |
| delRNA3-10 | 28.93 | 30.60 | 0.95 | 0.19 | 34.21 | 30.13 | 1.14 | 0.00 | 118 | 31.97 |
| delRNA3-11 | 29.38 | 30.54 | 0.96 | 0.42 | 36.69 | 29.97 | 1.22 | 0.00 | 113 | 41.54 |
| delRNA3-12 | 27.20 | 30.87 | 0.88 | 0.00 | 37.05 | 29.90 | 1.24 | 0.00 | 152 | 34.86 |
Columns show their percentages in detected delRNAs and in the remaining transformed mitogenome for each del-transformed version, and the chi-square P value testing for difference in circular codon frequencies (Table 1). The last columns show their percentages in delRNAs detected in purified mitochondrial transcriptomes overlapping the same mitogenomic regions and their percentage ratio. Note: Total number of trinucleotides in the delRNAs and in remaining del-transformed mitogenome for whole cell transcriptome, mito-transcriptome and in overlapping delRNAs is given in Table 1.
Frequencies of codons belonging to the mitochondrial circular code X0(MIT) in delRNAs and in the remaining deletion-transformed mitogenome.
| delRNA3-k | Whole cell transcriptome | Mito-transcriptome | Overlapping delRNAs | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Percentage of X0 (MIT) (%) | Ratio | P of Chi-square | Percentage of X0 (MIT) (%) | Ratio | P of Chi-square | No. of X0(MIT) | Percentage of X0(MIT) (%) in del-RNAs | |||
| In del-RNAs | In remaining del-transformed mitogenome (B) | In del-RNAs | In remaining del-transformed mitogenome (D) | |||||||
| delRNA3-1 | 29.49 | 31.07 | 0.95 | 0.38 | 30.73 | 31.02 | 0.99 | 0.92 | 32 | 29.63 |
| delRNA3-2 | 28.77 | 31.11 | 0.93 | 0.16 | 27.70 | 31.36 | 0.88 | 0.00 | 57 | 23.75 |
| delRNA3-3 | 25.75 | 30.73 | 0.83 | 0.002 | 26.67 | 31.38 | 0.85 | 0.00 | 70 | 26.31 |
| delRNA3-4 | 29.47 | 31.10 | 0.95 | 0.31 | 29.52 | 31.18 | 0.95 | 0.16 | 94 | 28.75 |
| delRNA3-5 | 27.37 | 31.26 | 0.88 | 0.01 | 28.80 | 31.29 | 0.92 | 0.03 | 100 | 27.03 |
| delRNA3-6 | 30.73 | 31.03 | 0.99 | 0.86 | 25.56 | 31.44 | 0.81 | 0.00 | 50 | 21.46 |
| delRNA3-7 | 29.80 | 31.08 | 0.96 | 0.40 | 29.61 | 31.14 | 0.95 | 0.25 | 82 | 30.48 |
| delRNA3-8 | 27.77 | 31.12 | 0.89 | 0.02 | 28.59 | 31.31 | 0.91 | 0.02 | 106 | 26.30 |
| delRNA3-9 | 27.35 | 31.24 | 0.88 | 0.01 | 29.80 | 31.14 | 0.96 | 0.27 | 92 | 28.75 |
| delRNA3-10 | 29.87 | 31.11 | 0.96 | 0.34 | 35.43 | 30.60 | 1.16 | 0.00 | 112 | 30.35 |
| delRNA3-11 | 31.33 | 30.99 | 1.01 | 0.82 | 36.05 | 30.60 | 1.18 | 0.00 | 114 | 41.91 |
| delRNA3-12 | 28.60 | 31.31 | 0.91 | 0.02 | 36.14 | 30.57 | 1.18 | 0.00 | 175 | 40.13 |
The 20 codons belonging to the suspected mitochondrial circular code are {ACA, ACC, ATA, ATC, CTA, CTC, GAA, GAC, GAT, GCA, GCC, GCT, GGA, GGC, GGT, GTA, GTC, GTT, TTA, TTC}. Columns and description as in Table 2.