Incorporating molecular and functional context into the analysis and prioritization of human variants associated with cancer.

Thomas A Peterson, Nathan L Nehrt, Dohwan Park, Maricel G Kann.

Abstract

BACKGROUND AND OBJECTIVE: With recent breakthroughs in high-throughput sequencing, identifying deleterious mutations is one of the key challenges for personalized medicine. At the gene and protein level, it has proven difficult to determine the impact of previously unknown variants. A statistical method has been developed to assess the significance of disease mutation clusters on protein domains by incorporating domain functional annotations to assist in the functional characterization of novel variants.
METHODS: Disease mutations aggregated from multiple databases were mapped to domains, and were classified as either cancer- or non-cancer-related. The statistical method for identifying significantly disease-associated domain positions was applied to both sets of mutations and to randomly generated mutation sets for comparison. To leverage the known function of protein domain regions, the method optionally distributes significant scores to associated functional feature positions.
RESULTS: Most disease mutations are localized within protein domains and display a tendency to cluster at individual domain positions. The method identified significant disease mutation hotspots in both the cancer and non-cancer datasets. The domain significance scores (DS-scores) for cancer form a bimodal distribution: hotspots in oncogenes form a second peak at higher DS-scores than non-cancer hotspots, whereas hotspots in tumor suppressors have scores more similar to those of non-cancer hotspots. In addition, on an independent mutation benchmarking set, the DS-score method identified mutations known to alter protein function with very high precision.
CONCLUSION: By aggregating mutations with known disease association at the domain level, the method was able to discover domain positions enriched with multiple occurrences of deleterious mutations while incorporating relevant functional annotations. The method can be incorporated into translational bioinformatics tools to characterize rare and novel variants within large-scale sequencing studies.
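
The abstract does not give the statistic behind the DS-score, so the following is only a minimal sketch of the general idea described under METHODS: count disease mutations per domain position and ask whether any position carries more than chance would predict. It assumes a uniform null (each mutation is equally likely to fall on any domain position), a one-sided binomial tail test, and a Bonferroni correction across positions. The function names `binomial_tail` and `hotspot_pvalues` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import comb


def binomial_tail(k: int, n: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def hotspot_pvalues(positions: list[int], domain_length: int) -> dict[int, float]:
    """Bonferroni-adjusted p-value for each mutated domain position.

    Null model (an assumption for illustration, not the published method):
    every mutation lands on any of the `domain_length` positions with
    equal probability.
    """
    counts = Counter(positions)          # mutations observed per position
    n = len(positions)                   # total mutations mapped to the domain
    p = 1.0 / domain_length              # per-position probability under the null
    return {
        pos: min(1.0, binomial_tail(k, n, p) * domain_length)
        for pos, k in counts.items()
    }


# Hypothetical toy data: 12 mutations mapped onto a 50-position domain.
toy_positions = [7, 7, 7, 7, 7, 7, 7, 7, 3, 19, 28, 41]
pvalues = hotspot_pvalues(toy_positions, domain_length=50)
hotspots = sorted(pos for pos, pv in pvalues.items() if pv < 0.05)
print(hotspots)
```

In this toy input only position 7, which carries eight of the twelve mutations, falls below the 0.05 threshold after correction. A comparison against randomly generated mutation sets, as the abstract mentions, could replace the analytic binomial tail with an empirical null.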

Substances: Proteins

Year: 2012 PMID: 22319177 PMCID: PMC3277632 DOI: 10.1136/amiajnl-2011-000655

Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497

References (52 in total)

1. Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms.

Authors: Robert J Clifford; Michael N Edmonson; Cu Nguyen; Kenneth H Buetow
Journal: Bioinformatics Date: 2004-01-29 Impact factor: 6.937

2. PupaSNP Finder: a web tool for finding SNPs with putative effect at transcriptional level.

Authors: Lucía Conde; Juan M Vaquerizas; Javier Santoyo; Fátima Al-Shahrour; Sergio Ruiz-Llorente; Mercedes Robledo; Joaquín Dopazo
Journal: Nucleic Acids Res Date: 2004-07-01 Impact factor: 16.971

Review 3. Pathogenic or not? And if so, then how? Studying the effects of missense mutations using bioinformatics methods.

Authors: Janita Thusberg; Mauno Vihinen
Journal: Hum Mutat Date: 2009-05 Impact factor: 4.878

4. The Protein Mutant Database.

Authors: T Kawabata; M Ota; K Nishikawa
Journal: Nucleic Acids Res Date: 1999-01-01 Impact factor: 16.971

5. Comprehensive statistical study of 452 BRCA1 missense substitutions with classification of eight recurrent substitutions as neutral.

Authors: S V Tavtigian; A M Deffenbaugh; L Yin; T Judkins; T Scholl; P B Samollow; D de Silva; A Zharkikh; A Thomas
Journal: J Med Genet Date: 2005-07-13 Impact factor: 6.318

6. Human non-synonymous SNPs: server and survey.

Authors: Vasily Ramensky; Peer Bork; Shamil Sunyaev
Journal: Nucleic Acids Res Date: 2002-09-01 Impact factor: 16.971

7. Structural and functional restraints on the occurrence of single amino acid variations in human proteins.

Authors: Sungsam Gong; Tom L Blundell
Journal: PLoS One Date: 2010-02-12 Impact factor: 3.240

8. CUPSAT: prediction of protein stability upon point mutations.

Authors: Vijaya Parthiban; M Michael Gromiha; Dietmar Schomburg
Journal: Nucleic Acids Res Date: 2006-07-01 Impact factor: 16.971

9. Edgetic perturbation models of human inherited disorders.

Authors: Quan Zhong; Nicolas Simonis; Qian-Ru Li; Benoit Charloteaux; Fabien Heuze; Niels Klitgord; Stanley Tam; Haiyuan Yu; Kavitha Venkatesan; Danny Mou; Venus Swearingen; Muhammed A Yildirim; Han Yan; Amélie Dricot; David Szeto; Chenwei Lin; Tong Hao; Changyu Fan; Stuart Milstein; Denis Dupuy; Robert Brasseur; David E Hill; Michael E Cusick; Marc Vidal
Journal: Mol Syst Biol Date: 2009-11-03 Impact factor: 11.429

10. SNAP: predict effect of non-synonymous polymorphisms on function.

Authors: Yana Bromberg; Burkhard Rost
Journal: Nucleic Acids Res Date: 2007-05-25 Impact factor: 16.971

Cited by (14 in total)

Review 1. Interpreting functional effects of coding variants: challenges in proteome-scale prediction, annotation and assessment.

Authors: Khader Shameer; Lokesh P Tripathi; Krishna R Kalari; Joel T Dudley; Ramanathan Sowdhamini
Journal: Brief Bioinform Date: 2015-10-22 Impact factor: 11.622

2. Empirical null estimation using zero-inflated discrete mixture distributions and its application to protein domain data.

Authors: Iris Ivy M Gauran; Junyong Park; Johan Lim; DoHwan Park; John Zylstra; Thomas Peterson; Maricel Kann; John L Spouge
Journal: Biometrics Date: 2017-09-22 Impact factor: 2.571

3. Pan-Cancer Analysis of Mutation Hotspots in Protein Domains.

Review 4. Towards precision medicine: advances in computational approaches for the analysis of human variants.

5. Comprehensive Analysis of Constraint on the Spatial Distribution of Missense Variants in Human Protein Structures.

6. Oncodomains: A protein domain-centric framework for analyzing rare variants in tumor samples.

7. A protein domain-centric approach for the comparative analysis of human and yeast phenotypically relevant mutations.

8. Recent trends in biomedical informatics: a study based on JAMIA articles.

Review 9. The common ground of genomics and systems biology.

10. MutationAligner: a resource of recurrent mutation hotspots in protein domains in cancer.