Literature DB >> 31115714

A Novel Method for Identifying the Potential Cancer Driver Genes Based on Molecular Data Integration.

Wei Zhang1, Shu-Lin Wang2.   

Abstract

The identification of the cancer driver genes is essential for personalized therapy. The mutation frequency of most driver genes is in the middle (2-20%) or even lower range, which makes it difficult to find the driver genes with low-frequency mutations. Other forms of genomic aberrations, such as copy number variations (CNVs) and epigenetic changes, may also reflect cancer progression. In this work, a method for identifying the potential cancer driver genes (iPDG) based on molecular data integration is proposed. DNA copy number variation, somatic mutation, and gene expression data of matched cancer samples are integrated. In combination with the method of iKEEG, the "key genes" of cancer are identified, and the change in their expression levels is used for auxiliary evaluation of whether the mutated genes are potential drivers. For a mutated gene, the concept of mutational effect is defined, which takes into account the effects of copy number variation, mutation gene itself, and its neighbor genes. The method mainly includes two steps: the first step is data preprocessing. First, DNA copy number variation and somatic mutation data are integrated. Then, the integrated data are mapped to a given interaction network, and the diffusion kernel is used to form the mutation effect matrix. The second step is to obtain the key genes by using the iKGGE method, and construct the connection matrix by means of the gene expression data of the key genes and mutation impact matrix of the mutated genes. Experiments on TCGA breast cancer and Glioblastoma multiforme datasets demonstrate that iPDG is effective not only to identify the known cancer driver genes but also to discover the rare potential driver genes. When measured by functional enrichment analysis, we find that these genes are clearly associated with these two types of cancers.

Entities:  

Keywords:  DNA copy numbers variation data; Diffusion kernel; Driver genes; Gene expression data; Somatic mutation data

Mesh:

Year:  2019        PMID: 31115714     DOI: 10.1007/s10528-019-09924-2

Source DB:  PubMed          Journal:  Biochem Genet        ISSN: 0006-2928            Impact factor:   1.890


  4 in total

1.  Scalable analysis of multi-modal biomedical data.

Authors:  Jaclyn Smith; Yao Shi; Michael Benedikt; Milos Nikolic
Journal:  Gigascience       Date:  2021-09-11       Impact factor: 6.524

2.  An Effective Graph Clustering Method to Identify Cancer Driver Modules.

Authors:  Wei Zhang; Yifu Zeng; Lei Wang; Yue Liu; Yi-Nan Cheng
Journal:  Front Bioeng Biotechnol       Date:  2020-04-07

3.  Feature Selection for Breast Cancer Classification by Integrating Somatic Mutation and Gene Expression.

Authors:  Qin Jiang; Min Jin
Journal:  Front Genet       Date:  2021-02-26       Impact factor: 4.599

4.  Identification of Early Warning Signals at the Critical Transition Point of Colorectal Cancer Based on Dynamic Network Analysis.

Authors:  Lei Liu; Zhuo Shao; Jiaxuan Lv; Fei Xu; Sibo Ren; Qing Jin; Jingbo Yang; Weifang Ma; Hongbo Xie; Denan Zhang; Xiujie Chen
Journal:  Front Bioeng Biotechnol       Date:  2020-05-29
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.