Literature DB >> 35289834

RevUP, an online scoring system for regulatory variants implicated in rare diseases.

Solenne Correard1, Brittany Hewitson1, Robin van der Lee1, Wyeth W Wasserman1.   

Abstract

SUMMARY: To address the difficulty in assessing the implication of regulatory variants in diseases, a scoring scheme published by Van der Lee et al., 2020 allows the calculation of the Regulatory Variant Evidence score (RVE-score). The score represents the accumulated evidence for a causative role of a regulatory variant in a disease. RevUP (Regulatory Evidence for Variants Underlying Phenotypes) was built to calculate the RVE-score of regulatory variants, based on the 24 criteria, with a hybrid approach combining information retrieved from public databases and user input. AVAILABILITY: RevUP is freely available at http://www.revup-classifier.ca. The source code is available at https://github.com/wassermanlab/revup. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Year:  2022        PMID: 35289834      PMCID: PMC9048665          DOI: 10.1093/bioinformatics/btac157

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

To date, several DNA variant classifiers are available and most are aligned to the American College of Medical Genetics and Genomics (ACMG) consensus recommendations (Richards ). Recently, recommendations were proposed for changes to the ACMG guidelines for clinical interpretation of variants to include variants found in non-coding regions of the genome, showing the relevance of such variants for genetic conditions (Ellingford ; Turner and Eichler, 2019). With the growing use of clinical whole-genome sequencing, there is a need for extension of the classification systems suitable for regulatory sequence variants. Diverse regulatory variant prediction methods have been developed, such as PRVCS (Li ) or GREEN-VARAN (Giacopuzzi ), have begun to aggregate predictions and prioritize variants expected to disrupt gene regulation. Such bioinformatics tools generally draw upon genomics databases for information, but often the strongest evidence is found in gene-specific publications. Furthermore, the focus on regulatory dysfunction does not necessarily relate to pathogenic impact, which is critical for clinical classification. In 2020, a review of 46 regulatory disease variants reported to disrupt the expression of 40 transcription factor genes generated a semiquantitative classification scheme that incorporates both functional and clinical evidence (van der Lee ). The scheme allows the calculation of the Regulatory Variant Evidence score (RVE-score), based on 24 criteria, which summarizes the accumulated evidence for a regulatory variant to have a causative role in a rare disease. Regulatory Evidence for Variants Underlying Phenotypes (RevUP, http://www.revup-classifier.ca), a web-based classifier, was built to calculate the RVE-score of regulatory variants, based on the 24 criteria presented in van der Lee . Some previously released classifiers (not regulation focused) rely on user input only (Kleinberger ), while others rely uniquely on information available in public databases (Li and Wang, 2017), RevUP relies on both. The result page is a user-friendly display, downloadable and suitable for inclusion in a scientific report, together with a FAQ on the use of RevUP.

2 Materials and methods

2.1 Architecture and hosting

RevUP is a web application composed of a front end written in React, and a backend framework written in Flask. It is hosted on Amazon Web Services using a t2.medium elastic compute (EC2) instance with Nginx being used as the web server to route traffic. All code for RevUP is available on GitHub https://github.com/wassermanlab/revup.

2.2 Scoring scheme

The scoring scheme presented in van der Lee is composed of a clinical component ‘Is there a causal link between genotype and phenotype?’ and a functional component ‘Does the variant have a damaging effect on the gene?’. Each evidence is given a score of 0 or 1 (0: The evidence does not apply to the variant or it is unknown; 1: The evidence applies to the variant, pondered based on the evidence weight). From the evidence, three scores are calculated: (i) the C-score, reflecting the clinical evidence available; (ii) the F-score, reflecting the functional evidence available and (iii) the RVE-score which is the accumulated evidence for a causative role of a regulatory variant in a disease. For some of the evidence, the information can be retrieved from public databases while other evidence requires user input (ex: C3.1, ‘Variant shows familial segregation with the disease’).

2.3 User input on the variant and target gene (Step 1) and databases queried by the webserver

For RevUP to query external databases, the user must input the variant details and the suspected target gene. Six external resources are then queried: PhyloP (Cooper ), PhastCons (Siepel ), gnomAD (Karczewski ), CADD (Rentzsch ), ReMap (Chèneby ) and ENCODE (Davis ). The details on the versions used and the information retrieved can be found in Supplementary Table S1.

2.4 User input for additional evidence (Step 2)

Some evidence cannot be found online. Others can be found using public databases but are too complicated to query automatically as they require interpretation by a human expert (ex: C2.1, ‘Suspected target gene has been implicated in the same or a similar disease phenotype, or is otherwise relevant’). Therefore, the user will have the ability to answer ‘Yes’ or ‘No’ to these questions in order for the tool to calculate the score.

2.5 Results modification and comments (Step 3)

Users may have additional evidence that was not available in the queried public databases, therefore, the user can modify the score for each evidence level as well as add comments. Comments were created to allow citations to relevant publications, or specific figures within publications (i.e. free text). It is not advised to change the scores unless the user has strong evidence to justify it.

3 Results

Based on the user input and on the values retrieved from the external databases, RevUP calculates the C-score, the F-score and the RVE-score for the submitted variant. The result page is composed of two parts. The top portion (Fig. 1A) presents the information concerning the variant, the RVE-score and the strength of this score relative to the distribution of a curated collection of 46 regulatory variants (van der Lee ). This can indicate to the user if the score is high or low compared to published variants. Then, the C- and F-scores are displayed separately for the user to assess the strength and weakness of their analysis. The bottom portion (Fig. 1B) indicates, for each evidence level, the score, the information outputted from public databases and the comments added by the user (if any). The results are downloadable in PDF or png formats to present as-is or in modified form.
Fig. 1.

RevUP report obtained for the scoring of the non-coding variant in the NOTCH1 gene reported by Wang in proband 1. (A) Summary information for the variant; (B) clinical and functional information compiled from user input and external databases used to generate the RevUP score

RevUP report obtained for the scoring of the non-coding variant in the NOTCH1 gene reported by Wang in proband 1. (A) Summary information for the variant; (B) clinical and functional information compiled from user input and external databases used to generate the RevUP score An example is shown in Figure 1, based on a recently published variant located in a regulatory region upstream of NOTCH1 and implicated in the tetralogy of Fallot (Wang ). We input the variant characteristics and the information found in the paper in RevUP. The tool allowed for a quick scoring of the variant, and the creation of the displayed figure.

4 Conclusions and outlook

RevUP is a strong addition to the available variant classifier tools, as it will allow users to assess properties specific to regulatory variants. This scoring system does not conflict with the ACMG classification guidelines; rather it can be used as additional information when studying regulatory variants. The tool will save time for the user as it is able to query databases rapidly. Click here for additional data file.
  14 in total

1.  Predicting regulatory variants with composite statistic.

Authors:  Mulin Jun Li; Zhicheng Pan; Zipeng Liu; Jiexing Wu; Panwen Wang; Yun Zhu; Feng Xu; Zhengyuan Xia; Pak Chung Sham; Jean-Pierre A Kocher; Miaoxin Li; Jun S Liu; Junwen Wang
Journal:  Bioinformatics       Date:  2016-06-06       Impact factor: 6.937

2.  Family-based whole-genome sequencing identifies compound heterozygous protein-coding and noncoding mutations in tetralogy of Fallot.

Authors:  Yifeng Wang; Tao Jiang; Pushi Tang; Yifei Wu; Zhu Jiang; Juncheng Dai; Yayun Gu; Jing Xu; Min Da; Hongxia Ma; Guangfu Jin; Xuming Mo; Qingguo Li; Xiaowei Wang; Zhibin Hu
Journal:  Gene       Date:  2020-03-09       Impact factor: 3.688

3.  InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines.

Authors:  Quan Li; Kai Wang
Journal:  Am J Hum Genet       Date:  2017-01-26       Impact factor: 11.025

4.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.

Authors:  Adam Siepel; Gill Bejerano; Jakob S Pedersen; Angie S Hinrichs; Minmei Hou; Kate Rosenbloom; Hiram Clawson; John Spieth; Ladeana W Hillier; Stephen Richards; George M Weinstock; Richard K Wilson; Richard A Gibbs; W James Kent; Webb Miller; David Haussler
Journal:  Genome Res       Date:  2005-07-15       Impact factor: 9.043

Review 5.  The Role of De Novo Noncoding Regulatory Mutations in Neurodevelopmental Disorders.

Authors:  Tychele N Turner; Evan E Eichler
Journal:  Trends Neurosci       Date:  2018-12-15       Impact factor: 13.837

6.  Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Authors:  Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm
Journal:  Genet Med       Date:  2015-03-05       Impact factor: 8.822

7.  The Encyclopedia of DNA elements (ENCODE): data portal update.

Authors:  Carrie A Davis; Benjamin C Hitz; Cricket A Sloan; Esther T Chan; Jean M Davidson; Idan Gabdank; Jason A Hilton; Kriti Jain; Ulugbek K Baymuradov; Aditi K Narayanan; Kathrina C Onate; Keenan Graham; Stuart R Miyasato; Timothy R Dreszer; J Seth Strattan; Otto Jolanki; Forrest Y Tanaka; J Michael Cherry
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

8.  GREEN-DB: a framework for the annotation and prioritization of non-coding regulatory variants from whole-genome sequencing data.

Authors:  Edoardo Giacopuzzi; Niko Popitsch; Jenny C Taylor
Journal:  Nucleic Acids Res       Date:  2022-03-21       Impact factor: 16.971

9.  ReMap 2020: a database of regulatory regions from an integrative analysis of Human and Arabidopsis DNA-binding sequencing experiments.

Authors:  Jeanne Chèneby; Zacharie Ménétrier; Martin Mestdagh; Thomas Rosnet; Allyssa Douida; Wassim Rhalloussi; Aurélie Bergon; Fabrice Lopez; Benoit Ballester
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

10.  The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors:  Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.