Literature DB >> 31144781

Predicting pathogenicity of missense variants with weakly supervised regression.

Yue Cao1, Yuanfei Sun1, Mostafa Karimi1, Haoran Chen1, Oluwaseyi Moronfoye1, Yang Shen1.   

Abstract

Quickly growing genetic variation data of unknown clinical significance demand computational methods that can reliably predict clinical phenotypes and deeply unravel molecular mechanisms. On the platform enabled by the Critical Assessment of Genome Interpretation (CAGI), we develop a novel "weakly supervised" regression (WSR) model that not only predicts precise clinical significance (probability of pathogenicity) from inexact training annotations (class of pathogenicity) but also infers underlying molecular mechanisms in a variant-specific manner. Compared to multiclass logistic regression, a representative multiclass classifier, our kernelized WSR improves the performance for the ENIGMA Challenge set from 0.72 to 0.97 in binary area under the receiver operating characteristic curve (AUC) and from 0.64 to 0.80 in ordinal multiclass AUC. WSR model interpretation and protein structural interpretation reach consensus in corroborating the most probable molecular mechanisms by which some pathogenic BRCA1 variants confer clinical significance, namely metal-binding disruption for p.C44F and p.C47Y, protein-binding disruption for p.M18T, and structure destabilization for p.S1715N.
© 2019 Wiley Periodicals, Inc.

Entities:  

Keywords:  clinical significance; genetic variation; genome medicine; machine learning; model interpretability; molecular mechanism; weak supervision

Year:  2019        PMID: 31144781      PMCID: PMC6744350          DOI: 10.1002/humu.23826

Source DB:  PubMed          Journal:  Hum Mutat        ISSN: 1059-7794            Impact factor:   4.878


  40 in total

1.  SIFT: Predicting amino acid changes that affect protein function.

Authors:  Pauline C Ng; Steven Henikoff
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Improved flexible refinement of protein docking in CAPRI rounds 22-27.

Authors:  Yang Shen
Journal:  Proteins       Date:  2013-10-17

3.  Functional analysis of BRCA1 C-terminal missense mutations identified in breast and ovarian cancer families.

Authors:  J Vallon-Christersson; C Cayanan; K Haraldsson; N Loman; J T Bergthorsson; K Brøndum-Nielsen; A M Gerdes; P Møller; U Kristoffersson; H Olsson; A Borg; A N Monteiro
Journal:  Hum Mol Genet       Date:  2001-02-15       Impact factor: 6.150

4.  Functional differences among BRCA1 missense mutations in the control of centrosome duplication.

Authors:  Z Kais; N Chiba; C Ishioka; J D Parvin
Journal:  Oncogene       Date:  2011-07-04       Impact factor: 9.867

5.  Benchmarking mutation effect prediction algorithms using functionally validated cancer-related missense mutations.

Authors:  Luciano G Martelotto; Charlotte Ky Ng; Maria R De Filippo; Yan Zhang; Salvatore Piscuoglio; Raymond S Lim; Ronglai Shen; Larry Norton; Jorge S Reis-Filho; Britta Weigelt
Journal:  Genome Biol       Date:  2014-10-28       Impact factor: 13.583

Review 6.  Objective assessment of the evolutionary action equation for the fitness effect of missense mutations across CAGI-blinded contests.

Authors:  Panagiotis Katsonis; Olivier Lichtarge
Journal:  Hum Mutat       Date:  2017-06-21       Impact factor: 4.878

7.  Identification of breast tumor mutations in BRCA1 that abolish its function in homologous DNA recombination.

Authors:  Derek J R Ransburgh; Natsuko Chiba; Chikashi Ishioka; Amanda Ewart Toland; Jeffrey D Parvin
Journal:  Cancer Res       Date:  2010-01-26       Impact factor: 12.701

8.  A formal perturbation equation between genotype and phenotype determines the Evolutionary Action of protein-coding variations on fitness.

Authors:  Panagiotis Katsonis; Olivier Lichtarge
Journal:  Genome Res       Date:  2014-09-12       Impact factor: 9.043

9.  Genome Landscapes of Disease: Strategies to Predict the Phenotypic Consequences of Human Germline and Somatic Variation.

Authors:  Rachel Karchin; Ruth Nussinov
Journal:  PLoS Comput Biol       Date:  2016-08-18       Impact factor: 4.475

10.  iCFN: an efficient exact algorithm for multistate protein design.

Authors:  Mostafa Karimi; Yang Shen
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

View more
  2 in total

1.  Assessment of blind predictions of the clinical significance of BRCA1 and BRCA2 variants.

Authors:  Melissa S Cline; Giulia Babbi; Sandra Bonache; Yue Cao; Rita Casadio; Xavier de la Cruz; Orland Díez; Sara Gutiérrez-Enríquez; Panagiotis Katsonis; Carmen Lai; Olivier Lichtarge; Pier L Martelli; Gilad Mishne; Alejandro Moles-Fernández; Gemma Montalban; Sean D Mooney; Robert O'Conner; Lars Ootes; Selen Özkan; Natalia Padilla; Kymberleigh A Pagel; Vikas Pejaver; Predrag Radivojac; Casandra Riera; Castrense Savojardo; Yang Shen; Yuanfei Sun; Scott Topper; Michael T Parsons; Amanda B Spurdle; David E Goldgar
Journal:  Hum Mutat       Date:  2019-08-23       Impact factor: 4.878

2.  Assessing the performance of in silico methods for predicting the pathogenicity of variants in the gene CHEK2, among Hispanic females with breast cancer.

Authors:  Alin Voskanian; Panagiotis Katsonis; Olivier Lichtarge; Vikas Pejaver; Predrag Radivojac; Sean D Mooney; Emidio Capriotti; Yana Bromberg; Yanran Wang; Max Miller; Pier Luigi Martelli; Castrense Savojardo; Giulia Babbi; Rita Casadio; Yue Cao; Yuanfei Sun; Yang Shen; Aditi Garg; Debnath Pal; Yao Yu; Chad D Huff; Sean V Tavtigian; Erin Young; Susan L Neuhausen; Elad Ziv; Lipika R Pal; Gaia Andreoletti; Steven E Brenner; Maricel G Kann
Journal:  Hum Mutat       Date:  2019-08-17       Impact factor: 4.700

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.