Literature DB >> 29340599

A benchmark study of scoring methods for non-coding mutations.

Damien Drubay1,2, Daniel Gautheret3, Stefan Michiels1,2.   

Abstract

Motivation: Detailed knowledge of coding sequences has led to different candidate models for pathogenic variant prioritization. Several deleteriousness scores have been proposed for the non-coding part of the genome, but no large-scale comparison has been realized to date to assess their performance.
Results: We compared the leading scoring tools (CADD, FATHMM-MKL, Funseq2 and GWAVA) and some recent competitors (DANN, SNP and SOM scores) for their ability to discriminate assumed pathogenic variants from assumed benign variants (using the ClinVar, COSMIC and 1000 genomes project databases). Using the ClinVar benchmark, CADD was the best tool for detecting the pathogenic variants that are mainly located in protein coding gene regions. Using the COSMIC benchmark, FATHMM-MKL, GWAVA and SOMliver outperformed the other tools for pathogenic variants that are typically located in lincRNAs, pseudogenes and other parts of the non-coding genome. However, all tools had low precision, which could potentially be improved by future non-coding genome feature discoveries. These results may have been influenced by the presence of potential benign variants in the COSMIC database. The development of a gold standard as consistent as ClinVar for these regions will be necessary to confirm our tool ranking. Availability and implementation: The Snakemake, C++ and R codes are freely available from https://github.com/Oncostat/BenchmarkNCVTools and supported on Linux. Contact: damien.drubay@gustaveroussy.fr or stefan.michiels@gustaveroussy.fr. Supplementary information: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2018        PMID: 29340599     DOI: 10.1093/bioinformatics/bty008

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  De novo pattern discovery enables robust assessment of functional consequences of non-coding variants.

Authors:  Hai Yang; Rui Chen; Quan Wang; Qiang Wei; Ying Ji; Guangze Zheng; Xue Zhong; Nancy J Cox; Bingshan Li
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

2.  Genetic variation in EPHA contributes to sensitivity to paclitaxel-induced peripheral neuropathy.

Authors:  Lauren A Marcath; Kelley M Kidwell; Kiran Vangipuram; Christina L Gersch; James M Rae; Monika L Burness; Jennifer J Griggs; Catherine Van Poznak; Daniel F Hayes; Ellen M Lavoie Smith; N Lynn Henry; Andreas S Beutler; Daniel L Hertz
Journal:  Br J Clin Pharmacol       Date:  2020-02-04       Impact factor: 4.335

3.  Whole Genome Sequence, Variant Discovery and Annotation in Mapuche-Huilliche Native South Americans.

Authors:  Elena A Vidal; Tomás C Moyano; Bernabé I Bustos; Eduardo Pérez-Palma; Carol Moraga; Eleodoro Riveras; Alejandro Montecinos; Lorena Azócar; Daniela C Soto; Mabel Vidal; Alex Di Genova; Klaus Puschel; Peter Nürnberg; Stephan Buch; Jochen Hampe; Miguel L Allende; Verónica Cambiazo; Mauricio González; Christian Hodar; Martín Montecino; Claudia Muñoz-Espinoza; Ariel Orellana; Angélica Reyes-Jara; Dante Travisany; Paula Vizoso; Mauricio Moraga; Susana Eyheramendy; Alejandro Maass; Giancarlo V De Ferrari; Juan Francisco Miquel; Rodrigo A Gutiérrez
Journal:  Sci Rep       Date:  2019-02-14       Impact factor: 4.379

4.  RegulationSpotter: annotation and interpretation of extratranscriptic DNA variants.

Authors:  Jana Marie Schwarz; Daniela Hombach; Sebastian Köhler; David N Cooper; Markus Schuelke; Dominik Seelow
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

5.  regBase: whole genome base-wise aggregation and functional prediction for human non-coding regulatory variants.

Authors:  Shijie Zhang; Yukun He; Huanhuan Liu; Haoyu Zhai; Dandan Huang; Xianfu Yi; Xiaobao Dong; Zhao Wang; Ke Zhao; Yao Zhou; Jianhua Wang; Hongcheng Yao; Hang Xu; Zhenglu Yang; Pak Chung Sham; Kexin Chen; Mulin Jun Li
Journal:  Nucleic Acids Res       Date:  2019-12-02       Impact factor: 16.971

6.  ncVarDB: a manually curated database for pathogenic non-coding variants and benign controls.

Authors:  Harry Biggs; Padmini Parthasarathy; Alexandra Gavryushkina; Paul P Gardner
Journal:  Database (Oxford)       Date:  2020-12-01       Impact factor: 3.451

7.  Calibrating variant-scoring methods for clinical decision making.

Authors:  Silvia Benevenuta; Emidio Capriotti; Piero Fariselli
Journal:  Bioinformatics       Date:  2021-01-25       Impact factor: 6.937

8.  Classification of non-coding variants with high pathogenic impact.

Authors:  Lambert Moyon; Camille Berthelot; Alexandra Louis; Nga Thi Thuy Nguyen; Hugues Roest Crollius
Journal:  PLoS Genet       Date:  2022-04-29       Impact factor: 5.917

9.  Whole-genome sequencing identifies complex contributions to genetic risk by variants in genes causing monogenic systemic lupus erythematosus.

Authors:  Jonas Carlsson Almlöf; Sara Nystedt; Dag Leonard; Maija-Leena Eloranta; Giorgia Grosso; Christopher Sjöwall; Anders A Bengtsson; Andreas Jönsen; Iva Gunnarsson; Elisabet Svenungsson; Lars Rönnblom; Johanna K Sandling; Ann-Christine Syvänen
Journal:  Hum Genet       Date:  2019-02-01       Impact factor: 4.132

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.