| Literature DB >> 31534643 |
Hervé Seligmann1, Ganesh Warthi2.
Abstract
Frameshifting protein translation occasionally results from insertion of amino acids at isolated mono- or dinucleotide-expanded codons by tRNAs with expanded anticodons. Previous analyses of two different types of human mitochondrial MS proteomic data (Fisher and Waters technologies) detect peptides entirely corresponding to expanded codon translation. Here, these proteomic data are reanalyzed searching for peptides consisting of at least eight consecutive amino acids translated according to regular tricodons, and at least eight adjacent consecutive amino acids translated according to expanded codons. Both datasets include chimerically translated peptides (mono- and dinucleotide expansions, 42 and 37, respectively). The regular tricodon-encoded part of some chimeric peptides corresponds to standard human mitochondrial proteins (mono- and dinucleotide expansions, six (AT6, CytB, ND1, 2xND2, ND5) and one (ND1), respectively). Chimeric translation probably increases the diversity of mitogenome-encoded proteins, putatively producing functional proteins. These might result from translation by tRNAs with expanded anticodons, or from regular tricodon translation of RNAs where transcription/posttranscriptional edition systematically deleted mono- or dinucleotides after each trinucleotide. The pairwise matched combination of adjacent peptide parts translated from regular and expanded codons strengthens the hypothesis that translation of stretches of consecutive expanded codons occurs. Results indicate statistical translation producing distributions of alternative proteins. Genetic engineering should account for potential unexpected, unwanted secondary products.Entities:
Keywords: Non-canonical transcription; Non-canonical translation; RNA editing; delRNAs; tRNA hopping
Year: 2019 PMID: 31534643 PMCID: PMC6742854 DOI: 10.1016/j.csbj.2019.08.006
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1RNA sequence and its chimeric translation according to regular tricodons and tetra- and pentacodons. Sequences corresponding to 90 codons (two groups of 30 regular tricodons, each at the 5′ and 3′ extremity of a group of 30 noncanonical codons expanded by mono- or dinucleotides (tetra- and pentacodons)) form running windows of 90 + 120 + 90 = 300 nucleotides (tetracodons) and 90 + 150 + 90 = 330 nucleotides (pentacodons). Hence for each of the 16,569 positions along the human mitogenome, chimeric peptides are translated from 30 regular, 30 noncanonical and 30 regular codons. These hypothetical peptides (lengths truncated in Fig. 1 for presentation purposes) are compared with actual MS mitoproteomic data.
Chimeric peptides transcribed in part according to regular, and in part according to expanded codons (tetra- and pentacodons) from the human mitogenome, detected in MS data from Guegneau et al. (2014). Columns are: 1. Regular tricodon translation frame (positive strand, 1–3; negative strand, 4–6), tetra- and pentacoded parts indicated by T and P; 2. Position of regular tricoded part on translated human mitogenome; 3. S, amino acid inserted at stop codons; 4. Detected peptide sequence, minor letters indicate translated stops, “|” separates regular tricoded from other part, underlined parts are tetra- and pentacoded. Ambiguous limits between tricoded and other part are also indicated when occurring, ambiguous part is considered parsimoniously as tricoded. 5. Xcorr between expected and observed MS; 6. PSM, counts observed MS matching expected MS; 7. q, false discovery rate; 8. PEP, posterior error probability, peptide specific; 9. Position-specific amino acid modifications; 10. Positions in regular mitogenome-encoded proteins matching regular tricoded part of detected chimeric peptide.
| T | Pos | S | Peptide | Xcorr | PSM | q | PEP | Modifications | Gene |
|---|---|---|---|---|---|---|---|---|---|
| 4 | 359–373 | A | 3.65 | 193 | 0 | 0.473 | |||
| 5 | 2140–2155 | D | 3.26 | 15 | 0 | 0.533 | K8(Lys- > PyrLys); C28(Carbamidomethyl); M34(Oxidation) | ||
| 6 | 1648–1661 | D | KSMQWAILGLFVVG| | 2.54 | 7 | 0 | 0.181 | M3(Oxidation) | |
| 4 | 527–537 | D | 3.59 | 299 | 0 | 1 | |||
| 3 | 1774–1783 | F | 2.21 | 31 | 0 | 0.449 | nd2 285–294 | ||
| 3 | 4102–4113 | G | KNFGATPNKSNN| | 2.28 | 1 | 0 | 0.907 | ||
| 5 | 4348–4356 | G | RWCgGWWg|M| | 3.87 | 221 | 0 | 0.942 | C3(Carbamidomethyl) | |
| 5 | 5417–5427 | G | KRGgGGLVE|IFL| | 3.13 | 1 | 0 | 0.513 | C15(Carbamidomethyl); C25(Carbamidomethyl) | |
| 2 | 378–397 | X | RQNTTSHSLKLKGPGGASY|P| | 2.48 | 1 | 0 | 0.219 | K10(Lys- > PyrLys); C25(Carbamidomethyl); C34(Carbamidomethyl) | |
| 4 | 936–945 | X | 2.36 | 10 | 0 | 0.744 | C12(Carbamidomethyl); C16(Carbamidomethyl) | ||
| 1 | 1762–1780 | X | KTMASSSPPSiPPSPSLT|S| | 3.99 | 1 | 0 | 0.696 | ||
| 6 | 2565–2578 | KPMITVPAHKGMA|M| | 2.13 | 27 | 0 | 1 | K1(Lys- > PyrLys); C22(Carbamidomethyl) | ||
| 4 | 1702–1712 | M | KSTAASTIDPA| | 4.18 | 286 | 0 | 0.532 | ||
| 4 | 3436–3445 | N | RPPLnQMRAG| | 3.84 | 81 | 0 | 0.735 | ||
| 1 | 2883–2890 | N | RLITTQQW| | 2.72 | 3 | 0 | 0.226 | K14(Lys- > PyrLys) | at6 41–48 |
| 6 | 1138–1148 | P | 4.41 | 542 | 0 | 1 | C2(Carbamidomethyl); M7(Oxidation) | ||
| 1 | 3854–3862 | R | rHNYNKLH|L | 3.45 | 17 | 0 | 0.551 | M17(Oxidation) | |
| 1 | 4324–4353 | R | PLLGLLLAAAGKSAQLGLHPWLPSAMEG|PT| | 2.08 | 2 | 0 | 1 | nd5 223–241 | |
| 4 | 564–575 | T | 4.14 | 2 | 0 | 1 | M9(Oxidation) | ||
| 6 | 1058–1076 | V | 2.05 | 1 | 0 | 0.909 | C17(Carbamidomethyl); M33(Oxidation) | ||
| 5 | 2583–2595 | W | 2.62 | 4 | 0 | 0.527 | |||
| 3 | 5404–5433 | Y | 2.65 | 11 | 0 | 0.493 | |||
| 4 | 2233–2247 | Y | 2.18 | 2 | 0 | 0.684 | K6(Lys- > PyrLys); M11(Oxidation); K23(Lys- > PyrLys) | ||
| 6 | 4397–4404 | S | RSVSIsNA| | 2.6 | 1 | 0 | 0.288 | M9(Oxidation) | |
| 2 | 378–397 | P | RQNTTSHSLKLKGPGGASY|P| | 2.36 | 1 | 0 | 0.319 | K10(Lys- > PyrLys); C25(Carbamidomethyl); M27(Oxidation); C34(Carbamidomethyl) | |
| 3 | 1909–1916 | 2 | 1 | 0 | 1 | K1(Lys- > PyrLys); M8(Oxidation); K19(Lys- > PyrLys) | |||
| 5 | 4085–4096 | K | kAPIIYSIKV|TL| | 2.93 | 65 | 0 | 0.34 | K1(Lys- > PyrLys) | |
| 5 | 3393–3403 | K | 3.51 | 228 | 0 | 0.64 | K1(Lys- > PyrLys); C4(Carbamidomethyl); M7(Oxidation); M8(Oxidation) | ||
| P | |||||||||
| 2 | 1441–1466 | A | 2.29 | 1 | 0 | 0.338 | K1(Lys- > PyrLys); M18(Oxidation) | ||
| 4 | 3294–3302 | D | RSSKLQYGd| | 2.92 | 69 | 0.046 | 0.257 | K4(Lys- > PyrLys); M14(Oxidation); K20(Lys- > PyrLys) | |
| 2 | 282–293 | D | KFNdAMLTPGL|V| | 3.56 | 4 | 0 | 1 | M6(Oxidation) | |
| 4 | 4257–4264 | E | KGGEVKGA| | 2.72 | 61 | 0 | 0.73 | M25(Oxidation) | |
| 4 | 460–473 | G | KFVITVAPQNDIW|P| | 2.44 | 2 | 0 | 0.412 | C37(Carbamidomethyl) | |
| 2 | 260–280 | G | 3.53 | 1 | 0 | 0.838 | |||
| 4 | 3176–3187 | G | 2.58 | 48 | 0 | 0.865 | K5(Lys- > PyrLys); K15(Lys- > PyrLys); C36(Carbamidomethyl) | ||
| 5 | 1645–1652 | I | 2.57 | 65 | 0 | 1 | C5(Carbamidomethyl); C7(Carbamidomethyl); M11(Oxidation); C17(Carbamidomethyl) | ||
| 2 | 948–959 | M | RAVHAKTSPVKA| | 2.02 | 1 | 0 | 0.296 | ||
| 6 | 3343–3360 | M | 2.18 | 1 | 0 | 0.774 | C19(Carbamidomethyl) | ||
| 2 | 282–292 | M | KFNmAMLTPGL| | 4 | 289 | 0 | 0.986 | ||
| 2 | 282–292 | N | KFNnAMLTPGL| | 3.46 | 80 | 0 | 0.531 | M6(Oxidation) | |
| 4 | 2272–2287 | N | 2.84 | 1 | 0 | 0.716 | C10(Carbamidomethyl) | ||
| 5 | 1645–1652 | Q | 2.11 | 9 | 0 | 1 | C5(Carbamidomethyl); C7(Carbamidomethyl); C17(Carbamidomethyl) | ||
| 3 | 3879–3889 | Y | 2.33 | 4 | 0 | 0.388 | |||
| 4 | 4257–4264 | I | KGGEVKG|A| | 2.64 | 65 | 0 | 0.618 | M18(Oxidation); M25(Oxidation) | |
| 5 | 1645–1652 | K | 2.11 | 10 | 0 | 1 | C5(Carbamidomethyl); C7(Carbamidomethyl); C17(Carbamidomethyl) | ||
| 3 | 5166–5176 | K | 2.98 | 128 | 0 | 0.538 | K1(Lys- > PyrLys); K15(Lys- > PyrLys) | ||
| 2 | 3153–3162 | K | 2.44 | 1 | 0 | 0.307 | C8(Carbamidomethyl); K9(Lys- > PyrLys) | ||
Chimeric peptides transcribed in part according to regular, and in part according to expanded codons (tetra- and pentacodons) from the human mitogenome, detected in MS data from Alberio et al. (2014). PLGS is the score estimating goodness of fit between observed and expected MS in the PLGS peptide detection software. Δ ppm is the difference between expected and observed MS total mass. Cl indicates cleavage expected by the MS/MS search that detected the specified peptide, C indicates cleavage at the carboxyl-, and N the amino-end of the amino acid. Chym and elas indicate cleavage by chymotrypsin and elastase.
| T | Pos | S | Peptide | PLGS | PSM | Δ ppm | Gene | Cl |
|---|---|---|---|---|---|---|---|---|
| 3 | 4428–4435 | T | NRHQPTTP| | 6.79 | 48 | −2.7085 | Chym | |
| 2 | 179–187 | E | 6.51 | 41 | −4.7043 | Hn | ||
| 2 | 1612–1620 | E | 6.51 | 37 | −6.2392 | Hn | ||
| 1 | 1340–1352 | D | 6.54 | 41 | 1.9161 | ND1, 234–250 | Sn | |
| 5 | 3210–3220 | H | KMNPhAQSTA|A | 6.49 | 54 | −3.1235 | Kn | |
| 1 | 2296–2304 | E | TEAMWNDL | 6.64 | 41 | −0.7789 | Tn | |
| 5 | 4157–4165 | I | 6.79 | 38 | 2.4082 | Cn | ||
| 2 | 3525–3531 | R | 6.74 | 22 | −2.4967 | Yn | ||
| 3 | 1639–1658 | Q | NVSLLLTLSILSIMAGSWGG| | 6.56 | 39 | −1.0766 | ND2, 150–169 | Nn |
| 1 | 1769–1779 | N | 6.59 | 58 | 1.8674 | Tc | ||
| 1 | 5433–5443 | Q | 6.72 | 31 | 0.5116 | Pc | ||
| 2 | 249–264 | P | 6.83 | 39 | −1.917 | Pn | ||
| 2 | 5240–5248 | R | YNPSLT | 6.75 | 74 | 1.0238 | CytB, 325–333 | Yn |
| 1 | 2182–2197 | K | YWLLAADL|L| | 6.66 | 35 | −0.3615 | Yn | |
| 1 | 853–861 | S | 6.6 | 33 | −1.8371 | Nc | ||
| 3 | 5074–5082 | F | TfFNESEEA| | 6.94 | 40 | −2.8644 | Qc | |
| 3 | 4612–4622 | D | 6.49 | 58 | 0.6481 | Tc | ||
| 2 | 399–415 | H | hGACSVIDKPRSTS|P| | 6.51 | 58 | 2.9743 | Hn | |
| 1 | 3776–3783 | Y | 6.47 | 35 | 0.8352 | Tc | ||
| 1 | 1287–1294 | H | 6.9 | 33 | 1.6579 | ND1, 185–192 | Ec | |
| 1 | 651–667 | S | LCSKMVGsFMGsGDKP|T| | 7.02 | 63 | −0.6754 | Ln | |
| 5 | 1151–1158 | A | WVaaFLL|Q| | 6.72 | 40 | −1.7249 | Wn | |
| 6 | 457–467 | C | RMVSLcLLWPLcM | 6.84 | 23 | −7.7922 | Chym | |
| 2 | 3143–3154 | F | 6.57 | 36 | −0.9708 | Elas | ||
| 6 | 5476–5483 | S | 6.8 | 71 | −6.8916 | Ac | ||
| 4 | 4033–4041 | E | 6.53 | 29 | 0.04081 | Qc | ||
| 1 | 2017–2024 | N | VTTTSTT|L| | 6.73 | 34 | −2.3462 | Nc | |
| 6 | 5475–5483 | M | 6.83 | 26 | 1.7577 | Ac | ||
| 2 | 925–933 | Y | 6.69 | 28 | −3.8878 | Kn | ||
| 1 | 1122–1136 | H | NAYRTKNShLYTTT|Q| | 6.62 | 32 | 0.2141 | Pc | |
| 4 | 2632–2643 | G | DgSLLGGDgSVV| | 6.63 | 31 | −1.6834 | Fc | |
| 2 | 4073–4081 | Y | 6.83 | 38 | −2.4583 | Sn |