Literature DB >> 30010784

DMCM: a Data-adaptive Mutation Clustering Method to identify cancer-related mutation clusters.

Xinguo Lu1, Xin Qian1, Xing Li1, Qiumai Miao1, Shaoliang Peng1,2.   

Abstract

Motivation: Functional somatic mutations within coding amino acid sequences confer growth advantage in pathogenic process. Most existing methods for identifying cancer-related mutations focus on the single amino acid or the entire gene level. However, gain-of-function mutations often cluster in specific protein regions instead of existing independently in the amino acid sequences. Some approaches for identifying mutation clusters with mutation density on amino acid chain have been proposed recently. But their performance in identification of mutation clusters remains to be improved.
Results: Here we present a Data-adaptive Mutation Clustering Method (DMCM), in which kernel density estimate (KDE) with a data-adaptive bandwidth is applied to estimate the mutation density, to find variable clusters with different lengths on amino acid sequences. We apply this approach in the mutation data of 571 genes in over twenty cancer types from The Cancer Genome Atlas (TCGA). We compare the DMCM with M2C, OncodriveCLUST and Pfam Domain and find that DMCM tends to identify more significant clusters. The cross-validation analysis shows DMCM is robust and cluster cancer type enrichment analysis shows that specific cancer types are enriched for specific mutation clusters. Availability and implementation: DMCM is written in Python and analysis methods of DMCM are written in R. They are all released online, available through https://github.com/XinguoLu/DMCM. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2019        PMID: 30010784     DOI: 10.1093/bioinformatics/bty624

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

2.  Identifying Potential miRNAs-Disease Associations With Probability Matrix Factorization.

Authors:  Junlin Xu; Lijun Cai; Bo Liao; Wen Zhu; Peng Wang; Yajie Meng; Jidong Lang; Geng Tian; Jialiang Yang
Journal:  Front Genet       Date:  2019-12-11       Impact factor: 4.599

3.  Anomaly Detection Based Latency-Aware Energy Consumption Optimization For IoT Data-Flow Services.

Authors:  Yuansheng Luo; Wenjia Li; Shi Qiu
Journal:  Sensors (Basel)       Date:  2019-12-24       Impact factor: 3.576

4.  Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network.

Authors:  Zixuan Meng; Linai Kuang; Zhiping Chen; Zhen Zhang; Yihong Tan; Xueyong Li; Lei Wang
Journal:  Front Genet       Date:  2021-03-17       Impact factor: 4.599

5.  sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.

Authors:  Bin Liu; Zhihua Luo; Juan He
Journal:  Mol Ther Nucleic Acids       Date:  2020-01-31       Impact factor: 8.886

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.