| Literature DB >> 30982079 |
E Joseph Jordan1, Keshav Patil2, Krishna Suresh3, Jin H Park4,5, Yael P Mosse6,7, Mark A Lemmon4,5, Ravi Radhakrishnan8,9,10.
Abstract
Methods to catalog and computationally assess the mutational landscape of proteins in human cancers are desirable. One approach is to adapt evolutionary or data-driven methods developed for predicting whether a single-nucleotide polymorphism (SNP) is deleterious to protein structure and function. In cases where understanding the mechanism of protein activation and regulation is desired, an alternative approach is to employ structure-based computational approaches to predict the effects of point mutations. Through a case study of mutations in kinase domains of three proteins, namely, the anaplastic lymphoma kinase (ALK) in pediatric neuroblastoma patients, serine/threonine-protein kinase B-Raf (BRAF) in melanoma patients, and erythroblastic oncogene B 2 (ErbB2 or HER2) in breast cancer patients, we compare the two approaches above. We find that the structure-based method is most appropriate for developing a binary classification of several different mutations, especially infrequently occurring ones, concerning the activation status of the given target protein. This approach is especially useful if the effects of mutations on the interactions of inhibitors with the target proteins are being sought. However, many patients will present with mutations spread across different target proteins, making structure-based models computationally demanding to implement and execute. In this situation, data-driven methods-including those based on machine learning techniques and evolutionary methods-are most appropriate for recognizing and illuminate mutational patterns. We show, however, that, in the present status of the field, the two methods have very different accuracies and confidence values, and hence, the optimal choice of their deployment is context-dependent.Entities:
Keywords: Driver mutations; Machine learning; Molecular dynamics; Passenger mutations; Structural bioinformatics
Mesh:
Substances:
Year: 2019 PMID: 30982079 PMCID: PMC6589134 DOI: 10.1007/s00018-019-03097-2
Source DB: PubMed Journal: Cell Mol Life Sci ISSN: 1420-682X Impact factor: 9.261