Literature DB >> 28185569

A comprehensive collection of annotations to interpret sequence variation in human mitochondrial transfer RNAs.

Maria Angela Diroma1, Paolo Lubisco1, Marcella Attimonelli2.   

Abstract

BACKGROUND: The abundance of biological data characterizing the genomics era is contributing to a comprehensive understanding of human mitochondrial genetics. Nevertheless, many aspects are still unclear, specifically about the variability of the 22 human mitochondrial transfer RNA (tRNA) genes and their involvement in diseases. The complex enrichment and isolation of tRNAs in vitro leads to an incomplete knowledge of their post-transcriptional modifications and three-dimensional folding, essential for correct tRNA functioning. An accurate annotation of mitochondrial tRNA variants would be definitely useful and appreciated by mitochondrial researchers and clinicians since the most of bioinformatics tools for variant annotation and prioritization available so far cannot shed light on the functional role of tRNA variations.
RESULTS: To this aim, we updated our MToolBox pipeline for mitochondrial DNA analysis of high throughput and Sanger sequencing data by integrating tRNA variant annotations in order to identify and characterize relevant variants not only in protein coding regions, but also in tRNA genes. The annotation step in the pipeline now provides detailed information for variants mapping onto the 22 mitochondrial tRNAs. For each mt-tRNA position along the entire genome, the relative tRNA numbering, tRNA type, cloverleaf secondary domains (loops and stems), mature nucleotide and interactions in the three-dimensional folding were reported. Moreover, pathogenicity predictions for tRNA and rRNA variants were retrieved from the literature and integrated within the annotations provided by MToolBox, both in the stand-alone version and web-based tool at the Mitochondrial Disease Sequence Data Resource (MSeqDR) website. All the information available in the annotation step of MToolBox were exploited to generate custom tracks which can be displayed in the GBrowse instance at MSeqDR website.
CONCLUSIONS: To the best of our knowledge, specific data regarding mitochondrial variants in tRNA genes were introduced for the first time in a tool for mitochondrial genome analysis, supporting the interpretation of genetic variants in specific genomic contexts.

Entities:  

Keywords:  Annotation and prioritization tools; Bioinformatics analysis; Mitochondrial genomics; NGS; tRNA sequence variation

Mesh:

Substances:

Year:  2016        PMID: 28185569      PMCID: PMC5123245          DOI: 10.1186/s12859-016-1193-4

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


Background

The abundance of biological data characterizing the genomics era is contributing to a comprehensive understanding of human mitochondrial genetics. To date more than 30,000 complete human mitochondrial genomes have been sequenced [1] and lots of tools and databases are publicly available allowing to gather large amounts of information about mitochondrial DNA (mtDNA). Nevertheless many aspects are still unclear, specifically about the 22 human mitochondrial transfer RNAs (mt-tRNA). Thanks to the “four-way wobble rule” and post transcriptional modifications at the first letters of tRNA anticodons [2], only 22 mt-tRNAs are sufficient in humans, as well as in other mammals, to translate all sense codons into 13 subunits of respiratory chain complexes encoded in each single copy of mtDNA [2]. mt-tRNAs could be considered hot spots of mutations [3]: among more than 600 disease associated mutations compiled to date, about 240 were mapped on mt-tRNA genes [4]. However, it is well known that clinical phenotypes appear only when the mutation load exceeds a certain threshold [5], considering the possible co-existence of different mtDNA genotypes within the same cell, tissue or individual, a condition known as heteroplasmy. Thus, if a mutation in an mt-tRNA gene has no consequences on mtDNA replication or transcription, it may instead affect biogenesis and functioning of tRNAs after their transcription [6]. For instance, post-transcriptional modifications by nuclear-encoded enzymes [7, 8] often occur in key positions for a correct tRNA functioning, including folding and codon-anticodon interaction [6, 9, 10]. As a consequence, the lack of a correct post-transcriptional process could cause pathological effects [11, 12]. Some features are shared among human and other mammalian mt-tRNAs, such as the low number of G–C pairs within stems of the 14 tRNAs encoded by the light DNA strand, due to a strong bias in nucleotide content (A, U and C-rich tRNAs), variable D-loop and T-loop sizes, and lack of conserved and semi-conserved signature motifs [13], thus the difficulties linked to the complex process of human tRNA purification and identification of modified nucleotides are often overpassed through predictions based on bovine models [2]. The availability of information about mt-tRNA genes and variants would support the interpretation of mtDNA variants and improve the understanding of molecular mechanisms of disease. However, most bioinformatics tools for variant annotation and prioritization available so far cannot shed light on the functional role of mt-tRNA variations, often focusing only on characterization of missense variants [14, 15]. To this aim, we updated our MToolBox pipeline [16] for mtDNA analysis of high throughput and Sanger sequencing data by integrating tRNA variants annotations in order to identify relevant variants not only in protein coding regions but also in tRNA genes. Pathogenicity predictions retrieved from the literature were added both for tRNA and rRNA gene variants, when available. These information were also provided as custom tracks which can be visualized in the GBrowse at the Mitochondrial Disease Sequence Data Resource (MSeqDR) website [17], conveniently allowing a deep insight into mitochondrial genomics.

Methods

Data collection from known databases, web-based resources and literature

All the information collected in this work and those previously collected and already implemented in the MToolBox pipeline [16], come from several resources and the literature about human mtDNA genomics and variation (Table 1). Nucleotide variability scores calculated by applying SiteVar algorithm [18] on 22,691 complete genomes from healthy individuals in the Human Mitochondrial Database, HmtDB (May 2014 update) [19], were reported for each position of the entire human mitochondrial genome; amino acid scores, calculated by MitVarProt algorithm [20] on the same dataset, were obtained for coding regions. Conservation scores calculated by PhyloP [21] and PhastCons [22] algorithms were retrieved from UCSC Genome Browser [23].
Table 1

Annotations by MToolBox pipeline

Variant annotationStatus
LocusPreviously provided
HFPreviously provided
CI_lower;CI_upperPreviously provided
RSRSPreviously provided
MHCSPreviously provided
rCRSPreviously provided
HaplogroupPreviously provided
Other HaplogroupsPreviously provided
Nt VariabilityUpdated
Codon PositionPreviously provided
Aa ChangePreviously provided
Aa variabilityUpdated
tRNA AnnotationNew
Disease ScorePreviously provided
RNA predictionsNew
MutPred PredPreviously provided
MutPred ProbPreviously provided
PolyPhen-2 HumDiv PredPreviously provided
PolyPhen-2 HumDiv ProbPreviously provided
PolyPhen-2 HumVar PredPreviously provided
PolyPhen-2 HumVar ProbPreviously provided
PANTHER PredPreviously provided
PANTHER ProbPreviously provided
PhD-SNP PredPreviously provided
PhD-SNP ProbPreviously provided
SNPs&GO PredPreviously provided
SNPs&GO ProbPreviously provided
MITOMAP Associated Disease(s)Updated
MITOMAP HomoplasmyUpdated
MITOMAP HeteroplasmyUpdated
Somatic MutationsUpdated
SM HomoplasmyUpdated
SM HeteroplasmyUpdated
ClinVarNew
OMIMUpdated
dbSNPUpdated
Mamit-tRNAPreviously provided
PhastCons20WayNew
PhyloP20WayNew
AC/AN 1000 GenomesPreviously provided
1000 Genomes HomoplasmyPreviously provided
1000 Genomes HeteroplasmyPreviously provided

All the annotations provided by MToolBox pipeline are shown. In the latest update, new fields, mainly regarding tRNA gene variants, were added for a more accurate variant annotation in analyzed samples: structural information for tRNA variants (“tRNA annotation”), pathogenicity predictions for tRNA and rRNA genes (“RNA predictions”), disease reports in ClinVar database (“ClinVar”), conservation scores (“PhastCons20Way”, “PhyloP20Way”). tRNA annotation, in turn, includes five semi-colon separated annotations: position numbering in tRNA, tRNA type, cloverleaf secondary region, mature nucleotide and involvement of the specific position in tRNA folding (Y for yes or N for no). Moreover, data from HmtDB (“Nt variability”, “Aa variability”), MITOMAP (“MITOMAP Associated Disease(s)”, “MITOMAP Homoplasmy”, “MITOMAP Heteroplasmy”, “Somatic Mutations”, “SM Homoplasmy”, “SM Heteroplasmy”), OMIM links (“OMIM”) and dbSNP identifiers (“dbSNP”) were updated. All the remaining annotations were Previously provided by MToolBox

Annotations by MToolBox pipeline All the annotations provided by MToolBox pipeline are shown. In the latest update, new fields, mainly regarding tRNA gene variants, were added for a more accurate variant annotation in analyzed samples: structural information for tRNA variants (“tRNA annotation”), pathogenicity predictions for tRNA and rRNA genes (“RNA predictions”), disease reports in ClinVar database (“ClinVar”), conservation scores (“PhastCons20Way”, “PhyloP20Way”). tRNA annotation, in turn, includes five semi-colon separated annotations: position numbering in tRNA, tRNA type, cloverleaf secondary region, mature nucleotide and involvement of the specific position in tRNA folding (Y for yes or N for no). Moreover, data from HmtDB (“Nt variability”, “Aa variability”), MITOMAP (“MITOMAP Associated Disease(s)”, “MITOMAP Homoplasmy”, “MITOMAP Heteroplasmy”, “Somatic Mutations”, “SM Homoplasmy”, “SM Heteroplasmy”), OMIM links (“OMIM”) and dbSNP identifiers (“dbSNP”) were updated. All the remaining annotations were Previously provided by MToolBox Somatic mutations and germline variants with reports of disease-associations were available in MITOMAP [4], with corresponding annotation of heteroplasmic/homoplasmic status (July 20, 2015 update of coding and control regions variants; July 29, 2015 update of somatic mutations and RNA genes variants). Other resources were exploited in order to facilitate clinical interpretation of variants, although they are not specialized for mitochondrial genome variant analysis, including OMIM [24], the Online Mendelian Inheritance in Man (August 4, 2015 update), dbSNP [25], a database for short genetic variations (release 144, May 26, 2015), and ClinVar [26], a public archive of reports of human variations and phenotypes reporting annotations of variants found in patient samples (January 21, 2015 update). Moreover, specific annotations for tRNA variants were gathered from databases, such as Mamit-tRNA [13], mitotRNAdb [27] and MODOMICS [28], as well as from the literature. Specifically, a scoring system developed for 207 variants in tRNA genes considering functional evidence, conservation, frequency and heteroplasmy status in mutations reported in MITOMAP as “pathogenic”, was retrieved [29, 30] and normalized to a 0–1 range (Table 2). Recently published predictions of pathogenicity for DNA variants involving 12S mitochondrial rRNA (mt-rRNA) [31] were considered and adapted, too.
Table 2

RNA pathogenicity predictions in MToolBox with corresponding scores

rRNA predictionrRNA ScoreRNA pathogenicity score in MToolBoxtRNA ScoretRNA prediction
Proven pathogenic51.00020Definitely pathogenic
0.95019Definitely pathogenic
0.90018Definitely pathogenic
0.85017Definitely pathogenic
Expectedly pathogenic40.80016Definitely pathogenic
0.75015Definitely pathogenic
0.70014Definitely pathogenic
0.65013Possibly/definitely pathogenic
Likely pathogenic3 0.600 12Possibly/definitely pathogenic
0.55011Possibly/definitely pathogenic
0.50010Possibly pathogenic
0.4509Possibly pathogenic
Not enough evidence20.4008Possibly pathogenic
0.350 7Possibly pathogenic
0.3006Neutral
0.2505Neutral
Undetermined10.2004Neutral
0.1503Neutral
0.1002Neutral
0.0501Neutral
Unlikely pathogenic00.0000Neutral

RNA pathogenicity scores provided by MToolBox pipeline, shown in the central column of the table, derived from two different scoring systems for rRNA and tRNA genes, respectively. Original predictions and scores, reported on the right and the left of MToolBox scores, were retrieved from the literature and normalized to a 0–1 range. Thresholds of 0.600 for rRNA and 0.350 for tRNA sequence variations (in bold) were set according to original scores. Damaging effects could be observed for variants with a score above or equal to the chosen thresholds, while neutral variants should be associated with lower values

RNA pathogenicity predictions in MToolBox with corresponding scores RNA pathogenicity scores provided by MToolBox pipeline, shown in the central column of the table, derived from two different scoring systems for rRNA and tRNA genes, respectively. Original predictions and scores, reported on the right and the left of MToolBox scores, were retrieved from the literature and normalized to a 0–1 range. Thresholds of 0.600 for rRNA and 0.350 for tRNA sequence variations (in bold) were set according to original scores. Damaging effects could be observed for variants with a score above or equal to the chosen thresholds, while neutral variants should be associated with lower values

MToolBox

MToolBox [16] is a bioinformatics pipeline recently developed for accurate and complete analysis of mitochondrial genome from high throughput sequencing. The tool includes several steps in the data analysis process, such as variant annotation and prioritization by exploiting several annotation resources, such as biological databases [4, 19] and pathogenicity prediction software [32-34], proving to be very useful especially in the characterization of missense variants (Table 1). The pipeline was also developed as a web-based tool, hosted at MSeqDR website [17], a portal recently developed for supporting mitochondrial disease studies by providing both data and user-friendly tools specifically for mtDNA analysis.

Variant annotators

Both generic and mitochondrial-oriented tools were used for a comparison of variant annotation processes. The command line tools ANNOVAR (version date 2015-03-22) [35], dbNSFP (version 3.0b1a) [14], and SnpEff (version 4.1b) [36], although not specific for mtDNA analysis, were used to provide annotations for three mitochondrial mutations involving genes coding for an rRNA, a tRNA and a protein, respectively. Web-based versions of mit-o-matic [37], MitoBamAnnotator [38] and MitImpact 2.0 [15] tools were also applied to the same mutations to compare their performance in variant annotation.

GBrowse tracks at MSeqDR website

GBrowse instance at MSeqDR website [17] allows visualization and analysis of variations and other genomics data in a classic genome browser interface by hosting mtDNA specific annotation tracks containing data from some of the major mtDNA genomics resources, such as HmtDB_rCRSvariants and HmtDB_RSRSvariants, provided by our group [17]. Data collection for new tracks generation was manually curated in order to produce tab-delimited text files, then converted in the required format (General Feature Format version 3, GFF3). Variants were reported using the Human Genome Variation Society (HGVS) nomenclature [39].

Results and discussion

Annotations for mitochondrial DNA variants in RNA genes by MToolBox pipeline and data update

The MToolBox pipeline [16] was updated and enhanced with specific annotations regarding tRNA genes, introduced for the first time in a tool specific for mtDNA analysis. New fields were added in the latest version of the MToolBox pipeline (Table 1): specific annotations for tRNA and rRNA genes, annotations from ClinVar database for disease-associated variants [26] and conservation scores for each site produced by PhyloP [21] and PhastCons [22] algorithms. Specifically, tRNA genes were characterized in each position with reports about tRNA structure including i) position in tRNA, following the Sprinzl standard nomenclature [27]; ii) tRNA type [40]; iii) cloverleaf-shaped secondary structure regions [27]; iv) mature nucleotide [2, 7, 28]; v) involvement of the specific position in tRNA folding [2, 7, 41] (Fig. 1). Each tRNA nucleotide was numbered from 1 to 73, CCA-ending excluded; the anticodon triplet was marked with nucleotides 34 to 36. The tRNA type indicates one of the four possible groups ranking human mt-tRNAs for their structural diversity and different tertiary interactions: type 0, the quasi-canonical cloverleaf structure, with standard D-loop/T-loop interaction; type II, the most common among mt-tRNAs, characterized by loss of D/T-loop interaction; type I and type III, each accounting one single tRNA with an atypical anticodon stem and lack of D-stem, respectively. The annotation of the typical cloverleaf pattern includes abbreviations of four loops (TL-TΨC Loop, VL-Variable Loop, CL-Anticodon Loop, DL-Dihydrouridine Loop), four stems (AS-Acceptor Stem, TS-TΨC Stem, CS-Anticodon Stem, DS-Dihydrouridine Stem), 3′ end (E) and junctions (-).
Fig. 1

Schematic representation of the four types of human mitochondrial tRNAs. The four types of human mt-tRNAs are shown. Green circles represent all the nucleotide positions involved in post-transcriptional modifications in each tRNA. Blue circles indicate nucleotide positions involved in tertiary folding with interactions represented by lines. Red circles represent nucleotide positions involved in tertiary folding and subject to post-transcriptional modifications. All the stems (A-stem, T-stem, C-stem, D-stem) and loops (T-loop, V-loop, C-loop, D-loop) of cloverleaf secondary regions are also shown

Schematic representation of the four types of human mitochondrial tRNAs. The four types of human mt-tRNAs are shown. Green circles represent all the nucleotide positions involved in post-transcriptional modifications in each tRNA. Blue circles indicate nucleotide positions involved in tertiary folding with interactions represented by lines. Red circles represent nucleotide positions involved in tertiary folding and subject to post-transcriptional modifications. All the stems (A-stem, T-stem, C-stem, D-stem) and loops (T-loop, V-loop, C-loop, D-loop) of cloverleaf secondary regions are also shown The mature nucleotide is meant as the nucleotide found in the tRNA molecule after post-transcriptional processes, predicted based on information of bovine and model organisms (bacteria, yeast, nematode) mt-tRNAs, and confirmed in 8 human mt-tRNAs [2, 8]. As a result of our data collection, we annotated 110 residues in the human mt-tRNA set involved in post-transcriptional modifications, with 16 different types of modified nucleotides. All the post-transcriptional modifications in mt-tRNAs and resulting mature nucleotides are listed in Table 3.
Table 3

Post-transcriptional modifications in mt-tRNAs

PositionModified base nameModomics symbolHuman mt-tRNAsBovine mt-tRNAsModel organisms tRNAs
91-methyladenosineAsp, Leu(CUN), Lys, ProAla, Glu, Phe, Gly, His, Asn, Arg, Thr, Val, Trp
91-methylguanosineKIle, Leu(UUR)Cys, Gln, Tyr
10N2-methylguanosineLLeu(UUR), Leu(CUN), Lys, AspAla, Phe, Gly, His, Asn, Val, Trp, Tyr
20dihydrouridineDLeu(UUR)
26N2-methylguanosineLAla, Glu
26N2,N2-dimethylguanosineRIle
27pseudouridinePAsp, Ile, Leu(UUR), Leu(CUN), Lys, Met, ProCys, His
28pseudouridinePIle, Lys, Leu(CUN), ProCys, Glu, Asn, Ser(UCN), Tyr
29pseudouridinePSer(UCN)
31pseudouridinePLeu(CUN)
323-methylcytidineSer(UCN), Thr
32pseudouridinePProCys
345-taurinomethyluridineÊLeu(UUR)Glu, Gln, Trp
34queuosineQAspHis, Asn, Tyr
345-taurinomethyl-2-thiouridineƎLysGlu, Gln
345-formylcytidine>Met
37N6-threonylcarbamoyladenosine6Ile, Lys, Ser(AGY)Asn, Thr
37N6-isopentenyladenosine+Cys, Phe, Ser(UCN), Trp, Tyr
372-methylthio-N6-isopentenyladenosine*Phe, Ser(UCN), Trp, Tyr
371-methylguanosineKLeu(CUN), ProGln
38pseudouridinePPro
39pseudouridinePAla, Cys, Phe, Gly, His, Gln, Arg, Tyr
40pseudouridinePGlu, Gln
485-methylcytidine?Leu(UUR)
495-methylcytidine?Glu
50pseudouridinePMet
545-methyluridineTLeu(UUR), Pro
55pseudouridinePLeu(UUR), ProGlu, Gln, Ser(UCN), Tyr
581-methyladenosineSer(AGY), Leu(UUR)Cys, Ser(UCN)Glu
725-methylcytidine?Thr

All the post-transcriptional modifications confirmed or predicted in human mt-tRNAs are listed. The full name of modifications, Modomics symbols and positions affected are shown for each tRNA species. Modifications reported include those confirmed by crystallographic data in eight human mt-tRNAs, those predicted using bovine model, which has similar structure and sequence in mt-tRNAs, and those predicted based on model organisms, such as bacteria, yeast and nematode

Post-transcriptional modifications in mt-tRNAs All the post-transcriptional modifications confirmed or predicted in human mt-tRNAs are listed. The full name of modifications, Modomics symbols and positions affected are shown for each tRNA species. Modifications reported include those confirmed by crystallographic data in eight human mt-tRNAs, those predicted using bovine model, which has similar structure and sequence in mt-tRNAs, and those predicted based on model organisms, such as bacteria, yeast and nematode Indication of the involvement of a specific residue in tRNA folding could be now recovered through variant annotation by our updated version of MToolBox. The three-dimensional structure of mt-tRNA has a typical L-shape, due to the molecule folding back in itself forming two double helix segments through base pairing between T and D loop. Triplet interactions also occur in position 10-25-45, 9-23-12 and 13-22-46 in order to increase stability [7]. The strength of folding is also affected by base stacking interactions, interesting almost all the nucleotides [42]. As expected, we observed a relatively low frequency of disease associated mutations within the anticodon triplet (11/394 mutations) since its high conservation is required for a correct recognition of the messanger RNA. Specifically, position 36, corresponding to the third base within anticodon, is more subject to pathogenic mutations (7/11). Moreover we observed a quite homogeneous distribution of mutations with a deleterious effect in other tRNA regions, in line with an almost consistent involvement of all the regions in the three-dimensional folding. Fortynine variants in rRNA genes [31] and 207 variants in tRNA genes [29, 30] were retrieved from the literature as validated mutations, hence inserted within the annotation mechanism used by MToolBox and integrated with pathogenicity predictions and scores. Original scores were normalized to a 0–1 range, with derived thresholds of 0.600 and 0.350 for rRNA and tRNA sequence variations, respectively (Table 2). Damaging effects could be observed for variants with a score above or equal to the chosen thresholds, while neutral variants should be associated with lower values. Finally, several annotations previously collected [16] were accurately revised to provide users the most possible up-to-date pipeline for mitochondrial genome analysis, including updated variability data from HmtDB database [19], dbSNP identifiers [25], OMIM links to known variants [24], novel disease associated variants and somatic mutations reported in MITOMAP [4] (Table 1). All the updates in MToolBox are available both in the command line version [43] and in the web-based resource at MSeqDR website [44]. New options to better manage input files are described in the readme file in the package. Moreover a summary is now produced reporting all the parameters chosen for the analysis and some basic statistics.

Annotation/prioritization tools comparison

In recent years lots of tools for variant prioritization were produced in order to help clinicians and researchers to recognize a few relevant mutations among the huge amount of variations detectable by NGS technologies. However, the annotation and prioritization processes carried out by these tools are often focused on missense variant characterization by providing pathogenicity predictions, dbSNP identifiers, frequency in known datasets such as the 1000 Genomes, conservation scores and region annotations (see Additional file 1). Among the most popular tools for variant prioritization, ANNOVAR [35], SnpEff [36] and dbNSFP [14] are commonly used both for nuclear DNA and mtDNA variations. Moreover mitochondrial-oriented tools have been recently developed, such as mit-o-matic [37], MitImpact [15] and MitoBamAnnotator [38] to ensure appropriate annotations mindful of mitochondrial genetics peculiarities, such as heteroplasmy. A comparison was performed among the aforementioned tools, showing pros and cons of each of them (Additional file 1). A few generic annotations regarding mt-tRNA variants were provided by some of the tested tools, while the MToolBox pipeline showed a wide range of annotations proving to be useful for any variant evaluation and not only missense variants (Table 4). Moreover, several input file formats can be used by MToolBox, proving a great efficiency for both high throughput sequencing and traditional FASTA data. Last but not least, the web-based version of the tool [44] ensures large usability also by non-expert users interested in mitochondrial genome analysis.
Table 4

Variant annotators comparison for a tRNA gene mutation

MToolBox AnnotationExample
 LocusMT-TM
 RSRSYes
 MHCSYes
 rCRSYes
 Haplogroup
 Other Haplogroups
 Nt Variability0.00E+00
 tRNA Annotation53;II;TS;G;N
 Patho-prediction RNA coding genes0.65
 MITOMAP Associated Disease(s)Myopathy
 MITOMAP HomoplasmyN
 MITOMAP HeteroplasmyY
 Mamit-tRNA http://mamit-trna.u-strasbg.fr/mutations.asp?idAA=19
 PhastCons20Way0.889764
 PhyloP20Way0.797921
SnpEff AnnotationExample
 Annotationnon_coding_exon_variant
 Annotation_ImpactMODIFIER
 Gene_NameMT-TM
 Gene_IDENSG00000210112
 Feature_TypeTranscript
 Feature_IDENST00000387377
 Transcript_BioTypeMt_tRNA
 Rank1/1
 HGVS.cn.49G>A
MitoBamAnnotatorExample
 pos4450
 refG
 fromG
 toA
 geneMT-TM
 coding fortRNA
 coding region0
 is mutated1
 triplet positionNA
 mutation typeNA
 has_overlap0
 CINA
mit-o-maticExample
 Variant4450
 Ref alleleG
 Alt alleleA
 GeneMT-TM
 DiseaseMyopathy

Among tools providing annotations for a specific variant in a tRNA gene (m.4450G>A) chosen for its potential damaging effect, MToolBox showed the widest range of useful features provided in the final annotation step allowing users to prioritize the variant. Empty fields were omitted. Tested tools which do not provide annotations for tRNA variants were not reported

Variant annotators comparison for a tRNA gene mutation Among tools providing annotations for a specific variant in a tRNA gene (m.4450G>A) chosen for its potential damaging effect, MToolBox showed the widest range of useful features provided in the final annotation step allowing users to prioritize the variant. Empty fields were omitted. Tested tools which do not provide annotations for tRNA variants were not reported

Mitochondrial variations tracks at MSeqDR

In order to facilitate the interpretation of genetic variants in a specific genomic context, four different custom tracks were produced in GFF3 file format displayable at MSeqDR GBrowse [45] (Fig. 2). The tracks included all the data used for the annotation step carried out by the MToolBox pipeline, providing users the possibility to analyze only variants or genomic positions with no need to provide input files. A track previously provided, called “Mitochondrial Pathogenicity Predictions” [17], was updated and split into two different tracks, “MT-patho.CDS” and “MT-patho.STOP” tracks. The first collects all the 24,202 possible non-synonymous variants within the 13 human mitochondrial protein encoding genes, identified using mtDNA-GeneSyn software [46]. Predictions and probabilities of pathogenicity were produced using five different software [16] and an overall disease score was also provided [47].
Fig. 2

Overview of the usage of mitochondrial tracks at MSeqDR GBrowse. MSeqDR website provides access to a GBrowse useful to visualize genomics data. Users can upload the four tracks generated in this work in the “Custom Tracks” section of the browser (a). For the sake of simplicity, the only “MT-patho.RNA” track is here shown, including data about pathogenic variants in mt-tRNA and mt-rRNA genes. The custom track can be selected, totally or partially (only transitions, transversions, insertions or deletions, b) and then visualized in the browser (c) where users can search for a specific genomic region of interest. Eventually, detailed information can be shown by clicking on a specific variant site (d)

Overview of the usage of mitochondrial tracks at MSeqDR GBrowse. MSeqDR website provides access to a GBrowse useful to visualize genomics data. Users can upload the four tracks generated in this work in the “Custom Tracks” section of the browser (a). For the sake of simplicity, the only “MT-patho.RNA” track is here shown, including data about pathogenic variants in mt-tRNA and mt-rRNA genes. The custom track can be selected, totally or partially (only transitions, transversions, insertions or deletions, b) and then visualized in the browser (c) where users can search for a specific genomic region of interest. Eventually, detailed information can be shown by clicking on a specific variant site (d) The second track collects all the 1740 possible stop-gain and 77 possible stop-loss mutations, which could be damaging in the generation of the 13 human mitochondrial proteins. The third track (“MT-patho.RNA”) is useful to show all the information currently available about pathogenicity of 392 variants in tRNA and 337 in rRNA genes, while the fourth track (“MT-RNA”) includes generic annotations reported for all the 1505 positions in genes encoding tRNAs and 2513 positions in genes encoding rRNAs, respectively. All the tracks were produced using the revised Cambridge Reference Sequence, rCRS (GenBank: J01415.2), as reference sequence. Additional information from MITOMAP [4], ClinVar [26], Mamit-tRNA [13] dbSNP [25] and OMIM [24] databases were shown, when available, for all the four tracks, as well as variability data from HmtDB database [19] and conservation scores from UCSC Genome Browser [21, 22]. The tracks, can be uploaded in the “Custom Tracks” section of the MSeqDR website, selected, totally or partially (only transitions, transversions, insertions or deletions) and visualized in the GBrowse (Fig. 2).

Conclusions

To the best of our knowledge, specific data regarding mitochondrial variants in tRNA genes were introduced for the first time in a tool for mitochondrial genome analysis and then reported in custom tracks, which could be displayed at MSeqDR GBrowse. The availability of such data could be useful to support the interpretation of genetic variants in specific genomic contexts.
  40 in total

1.  A Watson-Crick base-pair-disrupting methyl group (m1A9) is sufficient for cloverleaf folding of human mitochondrial tRNALys.

Authors:  M Helm; R Giegé; C Florentz
Journal:  Biochemistry       Date:  1999-10-05       Impact factor: 3.162

Review 2.  Human mitochondrial diseases caused by lack of taurine modification in mitochondrial tRNAs.

Authors:  Tsutomu Suzuki; Asuteka Nagao; Takeo Suzuki
Journal:  Wiley Interdiscip Rev RNA       Date:  2011-02-25       Impact factor: 9.957

Review 3.  Human mitochondrial tRNAs: biogenesis, function, structural aspects, and diseases.

Authors:  Tsutomu Suzuki; Asuteka Nagao; Takeo Suzuki
Journal:  Annu Rev Genet       Date:  2011-09-06       Impact factor: 16.830

4.  Mamit-tRNA, a database of mammalian mitochondrial tRNA primary and secondary structures.

Authors:  Joern Pütz; Bruno Dupuis; Marie Sissler; Catherine Florentz
Journal:  RNA       Date:  2007-06-21       Impact factor: 4.942

5.  dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations.

Authors:  Xiaoming Liu; Xueqiu Jian; Eric Boerwinkle
Journal:  Hum Mutat       Date:  2013-07-10       Impact factor: 4.878

Review 6.  Mitochondrial Disease Sequence Data Resource (MSeqDR): a global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities.

Authors:  Marni J Falk; Lishuang Shen; Michael Gonzalez; Jeremy Leipzig; Marie T Lott; Alphons P M Stassen; Maria Angela Diroma; Daniel Navarro-Gomez; Philip Yeske; Renkui Bai; Richard G Boles; Virginia Brilhante; David Ralph; Jeana T DaRe; Robert Shelton; Sharon F Terry; Zhe Zhang; William C Copeland; Mannis van Oven; Holger Prokisch; Douglas C Wallace; Marcella Attimonelli; Danuta Krotoski; Stephan Zuchner; Xiaowu Gai
Journal:  Mol Genet Metab       Date:  2014-12-04       Impact factor: 4.797

7.  MODOMICS: a database of RNA modification pathways.

Authors:  Stanislaw Dunin-Horkawicz; Anna Czerwoniec; Michal J Gajda; Marcin Feder; Henri Grosjean; Janusz M Bujnicki
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  MITOMAP: a human mitochondrial genome database--2004 update.

Authors:  Marty C Brandon; Marie T Lott; Kevin Cuong Nguyen; Syawal Spolim; Shamkant B Navathe; Pierre Baldi; Douglas C Wallace
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing.

Authors:  Claudia Calabrese; Domenico Simone; Maria Angela Diroma; Mariangela Santorsola; Cristiano Guttà; Giuseppe Gasparre; Ernesto Picardi; Graziano Pesole; Marcella Attimonelli
Journal:  Bioinformatics       Date:  2014-07-14       Impact factor: 6.937

10.  ClinVar: public archive of relationships among sequence variation and human phenotype.

Authors:  Melissa J Landrum; Jennifer M Lee; George R Riley; Wonhee Jang; Wendy S Rubinstein; Deanna M Church; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2013-11-14       Impact factor: 16.971

View more
  5 in total

1.  Specifications of the ACMG/AMP standards and guidelines for mitochondrial DNA variant interpretation.

Authors:  Elizabeth M McCormick; Marie T Lott; Matthew C Dulik; Lishuang Shen; Marcella Attimonelli; Ornella Vitale; Amel Karaa; Renkui Bai; Daniel E Pineda-Alvarez; Larry N Singh; Christine M Stanley; Stacey Wong; Anshu Bhardwaj; Daria Merkurjev; Rong Mao; Neal Sondheimer; Shiping Zhang; Vincent Procaccio; Douglas C Wallace; Xiaowu Gai; Marni J Falk
Journal:  Hum Mutat       Date:  2020-11-10       Impact factor: 4.878

2.  Renal oncocytoma characterized by the defective complex I of the respiratory chain boosts the synthesis of the ROS scavenger glutathione.

Authors:  Gerrit Kürschner; Qingzhou Zhang; Rosanna Clima; Yi Xiao; Jonas Felix Busch; Ergin Kilic; Klaus Jung; Nikolaus Berndt; Sascha Bulik; Hermann-Georg Holzhütter; Giuseppe Gasparre; Marcella Attimonelli; Mohan Babu; David Meierhofer
Journal:  Oncotarget       Date:  2017-11-11

3.  Papillary thyroid carcinoma tall cell variant shares accumulation of mitochondria, mitochondrial DNA mutations, and loss of oxidative phosphorylation complex I integrity with oncocytic tumors.

Authors:  Oleksiy Tsybrovskyy; Monica De Luise; Dario de Biase; Leonardo Caporali; Claudio Fiorini; Giuseppe Gasparre; Valerio Carelli; Dominik Hackl; Larisa Imamovic; Silke Haim; Manuel Sobrinho-Simões; Giovanni Tallini
Journal:  J Pathol Clin Res       Date:  2021-11-17

4.  A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information.

Authors:  Ornella Vitale; Roberto Preste; Donato Palmisano; Marcella Attimonelli
Journal:  Mol Genet Genomic Med       Date:  2019-12-10       Impact factor: 2.183

5.  HmtVar: a new resource for human mitochondrial variations and pathogenicity data.

Authors:  Roberto Preste; Ornella Vitale; Rosanna Clima; Giuseppe Gasparre; Marcella Attimonelli
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.