| Literature DB >> 32926138 |
Xiaolei Zhang1,2, Matthew Wakeling3, James Ware1,2, Nicola Whiffin1,2.
Abstract
SUMMARY: Current tools to annotate the predicted effect of genetic variants are heavily biased towards protein-coding sequence. Variants outside of these regions may have a large impact on protein expression and/or structure and can lead to disease, but this effect can be challenging to predict. Consequently, these variants are poorly annotated using standard tools. We have developed a plugin to the Ensembl Variant Effect Predictor, the UTRannotator, that annotates variants in 5'untranslated regions (5'UTR) that create or disrupt upstream open reading frames. We investigate the utility of this tool using the ClinVar database, providing an annotation for 31.9% of all 5'UTR (likely) pathogenic variants, and highlighting 31 variants of uncertain significance as candidates for further follow-up. We will continue to update the UTRannotator as we gain new knowledge on the impact of variants in UTRs.Entities:
Mesh:
Substances:
Year: 2021 PMID: 32926138 PMCID: PMC8150139 DOI: 10.1093/bioinformatics/btaa783
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Details of the annotations provided for different categories of uORF-perturbing variants.hmark results of the cascade oscillators model
| Consequence | uAUG-gained | uAUG-lost | uSTOP-lost | uSTOP-gained | uFrameshift |
|---|---|---|---|---|---|
| Number of existing uORFs | √ | √ | √ | √ | √ |
| KozakContext: sequence and strength | √ | √ | √ | √ | √ |
| Start distance to CDS | √ | √ | √ | √ | |
| Start distance to STOP | √ | √ | |||
| With translated evidence | √ | √ | √ | √ | |
| uORF subtype | √ | √ | √ | √ (ref and alt) | |
| Other annotations | Start distance from cap | Whether there is an alternative STOP, alternative stop distance to CDS, frame of disrupted uORF with CDS | New stop distance to CDS |
Fig. 1.5′UTR variants in ClinVar annotated by the UTRannotator. (a) A schematic showing the five distinct consequences of 5′UTR variants annotated by the tool: those that create an upstream AUG (uAUG_gained), those that disrupt the start site of an existing upstream open reading frame (uORF; uAUG_lost), those that cause a frameshift in the sequence of the uORF (uFrameShift), those that introduce a new stop codon into an existing uORF (uSTOP_gained) and those that disrupt the stop site of an existing uORF (uSTOP_lost). (b) The counts of each variant category that are classified as Pathogenic/Likely Pathogenic (teal) or Uncertain Significance (VUS; grey) in ClinVar