| Literature DB >> 28185569 |
Maria Angela Diroma1, Paolo Lubisco1, Marcella Attimonelli2.
Abstract
BACKGROUND: The abundance of biological data characterizing the genomics era is contributing to a comprehensive understanding of human mitochondrial genetics. Nevertheless, many aspects are still unclear, specifically about the variability of the 22 human mitochondrial transfer RNA (tRNA) genes and their involvement in diseases. The complex enrichment and isolation of tRNAs in vitro leads to an incomplete knowledge of their post-transcriptional modifications and three-dimensional folding, essential for correct tRNA functioning. An accurate annotation of mitochondrial tRNA variants would be definitely useful and appreciated by mitochondrial researchers and clinicians since the most of bioinformatics tools for variant annotation and prioritization available so far cannot shed light on the functional role of tRNA variations.Entities:
Keywords: Annotation and prioritization tools; Bioinformatics analysis; Mitochondrial genomics; NGS; tRNA sequence variation
Mesh:
Substances:
Year: 2016 PMID: 28185569 PMCID: PMC5123245 DOI: 10.1186/s12859-016-1193-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Annotations by MToolBox pipeline
| Variant annotation | Status |
|---|---|
| Locus | Previously provided |
| HF | Previously provided |
| CI_lower;CI_upper | Previously provided |
| RSRS | Previously provided |
| MHCS | Previously provided |
| rCRS | Previously provided |
| Haplogroup | Previously provided |
| Other Haplogroups | Previously provided |
| Nt Variability | Updated |
| Codon Position | Previously provided |
| Aa Change | Previously provided |
| Aa variability | Updated |
| tRNA Annotation | New |
| Disease Score | Previously provided |
| RNA predictions | New |
| MutPred Pred | Previously provided |
| MutPred Prob | Previously provided |
| PolyPhen-2 HumDiv Pred | Previously provided |
| PolyPhen-2 HumDiv Prob | Previously provided |
| PolyPhen-2 HumVar Pred | Previously provided |
| PolyPhen-2 HumVar Prob | Previously provided |
| PANTHER Pred | Previously provided |
| PANTHER Prob | Previously provided |
| PhD-SNP Pred | Previously provided |
| PhD-SNP Prob | Previously provided |
| SNPs&GO Pred | Previously provided |
| SNPs&GO Prob | Previously provided |
| MITOMAP Associated Disease(s) | Updated |
| MITOMAP Homoplasmy | Updated |
| MITOMAP Heteroplasmy | Updated |
| Somatic Mutations | Updated |
| SM Homoplasmy | Updated |
| SM Heteroplasmy | Updated |
| ClinVar | New |
| OMIM | Updated |
| dbSNP | Updated |
| Mamit-tRNA | Previously provided |
| PhastCons20Way | New |
| PhyloP20Way | New |
| AC/AN 1000 Genomes | Previously provided |
| 1000 Genomes Homoplasmy | Previously provided |
| 1000 Genomes Heteroplasmy | Previously provided |
All the annotations provided by MToolBox pipeline are shown. In the latest update, new fields, mainly regarding tRNA gene variants, were added for a more accurate variant annotation in analyzed samples: structural information for tRNA variants (“tRNA annotation”), pathogenicity predictions for tRNA and rRNA genes (“RNA predictions”), disease reports in ClinVar database (“ClinVar”), conservation scores (“PhastCons20Way”, “PhyloP20Way”). tRNA annotation, in turn, includes five semi-colon separated annotations: position numbering in tRNA, tRNA type, cloverleaf secondary region, mature nucleotide and involvement of the specific position in tRNA folding (Y for yes or N for no). Moreover, data from HmtDB (“Nt variability”, “Aa variability”), MITOMAP (“MITOMAP Associated Disease(s)”, “MITOMAP Homoplasmy”, “MITOMAP Heteroplasmy”, “Somatic Mutations”, “SM Homoplasmy”, “SM Heteroplasmy”), OMIM links (“OMIM”) and dbSNP identifiers (“dbSNP”) were updated. All the remaining annotations were Previously provided by MToolBox
RNA pathogenicity predictions in MToolBox with corresponding scores
| rRNA prediction | rRNA Score | RNA pathogenicity score in MToolBox | tRNA Score | tRNA prediction |
|---|---|---|---|---|
| Proven pathogenic | 5 | 1.000 | 20 | Definitely pathogenic |
| 0.950 | 19 | Definitely pathogenic | ||
| 0.900 | 18 | Definitely pathogenic | ||
| 0.850 | 17 | Definitely pathogenic | ||
| Expectedly pathogenic | 4 | 0.800 | 16 | Definitely pathogenic |
| 0.750 | 15 | Definitely pathogenic | ||
| 0.700 | 14 | Definitely pathogenic | ||
| 0.650 | 13 | Possibly/definitely pathogenic | ||
| Likely pathogenic | 3 |
| 12 | Possibly/definitely pathogenic |
| 0.550 | 11 | Possibly/definitely pathogenic | ||
| 0.500 | 10 | Possibly pathogenic | ||
| 0.450 | 9 | Possibly pathogenic | ||
| Not enough evidence | 2 | 0.400 | 8 | Possibly pathogenic |
|
| 7 | Possibly pathogenic | ||
| 0.300 | 6 | Neutral | ||
| 0.250 | 5 | Neutral | ||
| Undetermined | 1 | 0.200 | 4 | Neutral |
| 0.150 | 3 | Neutral | ||
| 0.100 | 2 | Neutral | ||
| 0.050 | 1 | Neutral | ||
| Unlikely pathogenic | 0 | 0.000 | 0 | Neutral |
RNA pathogenicity scores provided by MToolBox pipeline, shown in the central column of the table, derived from two different scoring systems for rRNA and tRNA genes, respectively. Original predictions and scores, reported on the right and the left of MToolBox scores, were retrieved from the literature and normalized to a 0–1 range. Thresholds of 0.600 for rRNA and 0.350 for tRNA sequence variations (in bold) were set according to original scores. Damaging effects could be observed for variants with a score above or equal to the chosen thresholds, while neutral variants should be associated with lower values
Fig. 1Schematic representation of the four types of human mitochondrial tRNAs. The four types of human mt-tRNAs are shown. Green circles represent all the nucleotide positions involved in post-transcriptional modifications in each tRNA. Blue circles indicate nucleotide positions involved in tertiary folding with interactions represented by lines. Red circles represent nucleotide positions involved in tertiary folding and subject to post-transcriptional modifications. All the stems (A-stem, T-stem, C-stem, D-stem) and loops (T-loop, V-loop, C-loop, D-loop) of cloverleaf secondary regions are also shown
Post-transcriptional modifications in mt-tRNAs
| Position | Modified base name | Modomics symbol | Human mt-tRNAs | Bovine mt-tRNAs | Model organisms tRNAs |
|---|---|---|---|---|---|
| 9 | 1-methyladenosine | ” | Asp, Leu(CUN), Lys, Pro | Ala, Glu, Phe, Gly, His, Asn, Arg, Thr, Val, Trp | |
| 9 | 1-methylguanosine | K | Ile, Leu(UUR) | Cys, Gln, Tyr | |
| 10 | N2-methylguanosine | L | Leu(UUR), Leu(CUN), Lys, Asp | Ala, Phe, Gly, His, Asn, Val, Trp, Tyr | |
| 20 | dihydrouridine | D | Leu(UUR) | ||
| 26 | N2-methylguanosine | L | Ala, Glu | ||
| 26 | N2,N2-dimethylguanosine | R | Ile | ||
| 27 | pseudouridine | P | Asp, Ile, Leu(UUR), Leu(CUN), Lys, Met, Pro | Cys, His | |
| 28 | pseudouridine | P | Ile, Lys, Leu(CUN), Pro | Cys, Glu, Asn, Ser(UCN), Tyr | |
| 29 | pseudouridine | P | Ser(UCN) | ||
| 31 | pseudouridine | P | Leu(CUN) | ||
| 32 | 3-methylcytidine | ’ | Ser(UCN), Thr | ||
| 32 | pseudouridine | P | Pro | Cys | |
| 34 | 5-taurinomethyluridine | Ê | Leu(UUR) | Glu, Gln, Trp | |
| 34 | queuosine | Q | Asp | His, Asn, Tyr | |
| 34 | 5-taurinomethyl-2-thiouridine | Ǝ | Lys | Glu, Gln | |
| 34 | 5-formylcytidine | > | Met | ||
| 37 | N6-threonylcarbamoyladenosine | 6 | Ile, Lys, Ser(AGY) | Asn, Thr | |
| 37 | N6-isopentenyladenosine | + | Cys, Phe, Ser(UCN), Trp, Tyr | ||
| 37 | 2-methylthio-N6-isopentenyladenosine | * | Phe, Ser(UCN), Trp, Tyr | ||
| 37 | 1-methylguanosine | K | Leu(CUN), Pro | Gln | |
| 38 | pseudouridine | P | Pro | ||
| 39 | pseudouridine | P | Ala, Cys, Phe, Gly, His, Gln, Arg, Tyr | ||
| 40 | pseudouridine | P | Glu, Gln | ||
| 48 | 5-methylcytidine | ? | Leu(UUR) | ||
| 49 | 5-methylcytidine | ? | Glu | ||
| 50 | pseudouridine | P | Met | ||
| 54 | 5-methyluridine | T | Leu(UUR), Pro | ||
| 55 | pseudouridine | P | Leu(UUR), Pro | Glu, Gln, Ser(UCN), Tyr | |
| 58 | 1-methyladenosine | ” | Ser(AGY), Leu(UUR) | Cys, Ser(UCN) | Glu |
| 72 | 5-methylcytidine | ? | Thr |
All the post-transcriptional modifications confirmed or predicted in human mt-tRNAs are listed. The full name of modifications, Modomics symbols and positions affected are shown for each tRNA species. Modifications reported include those confirmed by crystallographic data in eight human mt-tRNAs, those predicted using bovine model, which has similar structure and sequence in mt-tRNAs, and those predicted based on model organisms, such as bacteria, yeast and nematode
Variant annotators comparison for a tRNA gene mutation
| MToolBox Annotation | Example |
| Locus | MT-TM |
| RSRS | Yes |
| MHCS | Yes |
| rCRS | Yes |
| Haplogroup | |
| Other Haplogroups | |
| Nt Variability | 0.00E+00 |
| tRNA Annotation | 53;II;TS;G;N |
| Patho-prediction RNA coding genes | 0.65 |
| MITOMAP Associated Disease(s) | Myopathy |
| MITOMAP Homoplasmy | N |
| MITOMAP Heteroplasmy | Y |
| Mamit-tRNA |
|
| PhastCons20Way | 0.889764 |
| PhyloP20Way | 0.797921 |
| SnpEff Annotation | Example |
| Annotation | non_coding_exon_variant |
| Annotation_Impact | MODIFIER |
| Gene_Name | MT-TM |
| Gene_ID | ENSG00000210112 |
| Feature_Type | Transcript |
| Feature_ID | ENST00000387377 |
| Transcript_BioType | Mt_tRNA |
| Rank | 1/1 |
| HGVS.c | n.49G>A |
| MitoBamAnnotator | Example |
| pos | 4450 |
| ref | G |
| from | G |
| to | A |
| gene | MT-TM |
| coding for | tRNA |
| coding region | 0 |
| is mutated | 1 |
| triplet position | NA |
| mutation type | NA |
| has_overlap | 0 |
| CI | NA |
| mit-o-matic | Example |
| Variant | 4450 |
| Ref allele | G |
| Alt allele | A |
| Gene | MT-TM |
| Disease | Myopathy |
Among tools providing annotations for a specific variant in a tRNA gene (m.4450G>A) chosen for its potential damaging effect, MToolBox showed the widest range of useful features provided in the final annotation step allowing users to prioritize the variant. Empty fields were omitted. Tested tools which do not provide annotations for tRNA variants were not reported
Fig. 2Overview of the usage of mitochondrial tracks at MSeqDR GBrowse. MSeqDR website provides access to a GBrowse useful to visualize genomics data. Users can upload the four tracks generated in this work in the “Custom Tracks” section of the browser (a). For the sake of simplicity, the only “MT-patho.RNA” track is here shown, including data about pathogenic variants in mt-tRNA and mt-rRNA genes. The custom track can be selected, totally or partially (only transitions, transversions, insertions or deletions, b) and then visualized in the browser (c) where users can search for a specific genomic region of interest. Eventually, detailed information can be shown by clicking on a specific variant site (d)