Incorporating molecular and functional context into the analysis and prioritization of human variants associated with cancer.

Thomas A Peterson, Nathan L Nehrt, Dohwan Park, Maricel G Kann.

Abstract

BACKGROUND AND OBJECTIVE: With recent breakthroughs in high-throughput sequencing, identifying deleterious mutations is one of the key challenges for personalized medicine. At the gene and protein level, it has proven difficult to determine the impact of previously unknown variants. A statistical method has been developed to assess the significance of disease mutation clusters on protein domains by incorporating domain functional annotations to assist in the functional characterization of novel variants.
METHODS: Disease mutations aggregated from multiple databases were mapped to domains, and were classified as either cancer- or non-cancer-related. The statistical method for identifying significantly disease-associated domain positions was applied to both sets of mutations and to randomly generated mutation sets for comparison. To leverage the known function of protein domain regions, the method optionally distributes significant scores to associated functional feature positions.
RESULTS: Most disease mutations are localized within protein domains and display a tendency to cluster at individual domain positions. The method identified significant disease mutation hotspots in both the cancer and non-cancer datasets. The domain significance scores (DS-scores) for cancer form a bimodal distribution: hotspots in oncogenes form a second peak at higher DS-scores than those in the non-cancer dataset, whereas hotspots in tumor suppressors have scores more similar to those in the non-cancer dataset. In addition, on an independent mutation benchmarking set, the DS-score method identified mutations known to alter protein function with very high precision.
CONCLUSION: By aggregating mutations with known disease association at the domain level, the method was able to discover domain positions enriched with multiple occurrences of deleterious mutations while incorporating relevant functional annotations. The method can be incorporated into translational bioinformatics tools to characterize rare and novel variants within large-scale sequencing studies.
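
The abstract does not state the DS-score formula, so the following is only a minimal illustrative sketch of the general idea described under METHODS: count disease mutations per domain position and ask whether any position carries more than expected by chance. It assumes a uniform binomial null (every domain position equally likely to be hit) and a Bonferroni correction over positions; both are assumptions made here for illustration, not the authors' method. The function names `binomial_tail`, `position_scores`, and `hotspots` are hypothetical.

```python
from collections import Counter
from math import comb, log10


def binomial_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def position_scores(mutated_positions, domain_length):
    """Score each mutated domain position as -log10 of a binomial tail p-value.

    Assumption (not from the paper): under the null, each of the n observed
    mutations lands on any of the domain_length positions with equal
    probability 1 / domain_length.
    """
    counts = Counter(mutated_positions)
    n = len(mutated_positions)
    p = 1.0 / domain_length
    scores = {}
    for pos, k in counts.items():
        pval = binomial_tail(k, n, p)
        # Guard against floating-point underflow before taking the log.
        scores[pos] = -log10(max(pval, 1e-300))
    return scores


def hotspots(scores, domain_length, alpha=0.05):
    """Return positions whose score passes a Bonferroni-corrected threshold."""
    threshold = -log10(alpha / domain_length)
    return sorted(pos for pos, s in scores.items() if s > threshold)


# Toy data: ten mutations mapped onto a hypothetical 25-position domain,
# six of which fall on position 12.
toy_positions = [12, 12, 12, 12, 12, 3, 7, 20, 12, 15]
toy_scores = position_scores(toy_positions, domain_length=25)
toy_hotspots = hotspots(toy_scores, domain_length=25)
```

In this toy case only the position hit repeatedly clears the corrected threshold, while singly mutated positions do not. A real analysis would replace the uniform null with one reflecting the comparison against randomly generated mutation sets that the abstract mentions.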

Year:  2012        PMID: 22319177      PMCID: PMC3277632          DOI: 10.1136/amiajnl-2011-000655

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


