Literature DB >> 35098354

Evaluating the relevance of sequence conservation in the prediction of pathogenic missense variants.

Emidio Capriotti1, Piero Fariselli2.   

Abstract

Evolutionary information is the primary tool for detecting functional conservation in nucleic acid and protein. This information has been extensively used to predict structure, interactions and functions in macromolecules. Pathogenicity prediction models rely on multiple sequence alignment information at different levels. However, most accurate genome-wide variant deleteriousness ranking algorithms consider different features to assess the impact of variants. Here, we analyze three different ways of extracting evolutionary information from sequence alignments in the context of pathogenicity predictions at DNA and protein levels. We showed that protein sequence-based information is slightly more informative in the annotation of Clinvar missense variants than those obtained at the DNA level. Furthermore, to achieve the performance of state-of-the-art methods, such as CADD and REVEL, the conservation of reference and variant, encoded as frequencies of reference/alternate alleles or wild-type/mutant residues, should be included. Our results on a large set of missense variants show that a basic method based on three input features derived from the protein sequence profile performs similarly to the CADD algorithm which uses hundreds of genomic features. As expected, our method results in ~ 3% lower area under the receiver-operating characteristic curve (AUC). When compared with an ensemble-based algorithm (REVEL). Nevertheless, the combination of predictions of multiple methods can help to identify more reliable predictions. These observations indicate that for missense variants, evolutionary information, when properly encoded, plays the primary role in ranking pathogenicity.
© 2021. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35098354     DOI: 10.1007/s00439-021-02419-4

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   5.881


  30 in total

Review 1.  Bioinformatics for personal genome interpretation.

Authors:  Emidio Capriotti; Nathan L Nehrt; Maricel G Kann; Yana Bromberg
Journal:  Brief Bioinform       Date:  2012-01-13       Impact factor: 11.622

2.  Predicting functionally important residues from sequence conservation.

Authors:  John A Capra; Mona Singh
Journal:  Bioinformatics       Date:  2007-05-22       Impact factor: 6.937

3.  Functional annotations improve the predictive score of human disease-related mutations in proteins.

Authors:  Remo Calabrese; Emidio Capriotti; Piero Fariselli; Pier Luigi Martelli; Rita Casadio
Journal:  Hum Mutat       Date:  2009-08       Impact factor: 4.878

4.  Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information.

Authors:  E Capriotti; R Calabrese; R Casadio
Journal:  Bioinformatics       Date:  2006-08-07       Impact factor: 6.937

5.  Blind prediction of deleterious amino acid variations with SNPs&GO.

Authors:  Emidio Capriotti; Pier Luigi Martelli; Piero Fariselli; Rita Casadio
Journal:  Hum Mutat       Date:  2017-05-02       Impact factor: 4.878

Review 6.  A brief history of human disease genetics.

Authors:  Melina Claussnitzer; Judy H Cho; Rory Collins; Nancy J Cox; Emmanouil T Dermitzakis; Matthew E Hurles; Sekar Kathiresan; Eimear E Kenny; Cecilia M Lindgren; Daniel G MacArthur; Kathryn N North; Sharon E Plon; Heidi L Rehm; Neil Risch; Charles N Rotimi; Jay Shendure; Nicole Soranzo; Mark I McCarthy
Journal:  Nature       Date:  2020-01-08       Impact factor: 49.962

7.  PhD-SNPg: a webserver and lightweight tool for scoring single nucleotide variants.

Authors:  Emidio Capriotti; Piero Fariselli
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

Review 8.  Integrating molecular networks with genetic variant interpretation for precision medicine.

Authors:  Emidio Capriotti; Kivilcim Ozturk; Hannah Carter
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2018-12-12

9.  Calibrating variant-scoring methods for clinical decision making.

Authors:  Silvia Benevenuta; Emidio Capriotti; Piero Fariselli
Journal:  Bioinformatics       Date:  2021-01-25       Impact factor: 6.937

10.  Prediction and interpretation of deleterious coding variants in terms of protein structural stability.

Authors:  François Ancien; Fabrizio Pucci; Maxime Godfroid; Marianne Rooman
Journal:  Sci Rep       Date:  2018-03-14       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.