| Literature DB >> 31850075 |
Haidan Yan1,2, Xusheng Deng1,2, Haifeng Chen3, Jun Cheng1,2, Jun He1,2, Qingzhou Guan1,2, Meifeng Li1,2, Jiajing Xie1,2, Jie Xia1,2, Yunyan Gu4, Zheng Guo1,2.
Abstract
The heterogeneity of cancer is a big obstacle for cancer diagnosis and treatment. Prioritizing combinations of driver genes that mutate in most patients of a specific cancer or a subtype of this cancer is a promising way to tackle this problem. Here, we developed an empirical algorithm, named PathMG, to identify common and subtype-specific mutated sub-pathways for a cancer. By analyzing mutation data of 408 samples (Lung-data1) for lung cancer, three sub-pathways each covering at least 90% of samples were identified as the common sub-pathways of lung cancer. These sub-pathways were enriched with mutated cancer genes and drug targets and were validated in two independent datasets (Lung-data2 and Lung-data3). Especially, applying PathMG to analyze two major subtypes of lung cancer, lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LSCC), we identified 13 subtype-specific sub-pathways with at least 0.25 mutation frequency difference between LUAD and LSCC samples in Lung-data1, and 12 of the 13 sub-pathways were reproducible in Lung-data2 and Lung-data3. Similar analyses were done for colorectal cancer. Together, PathMG provides us a novel tool to identify potential common and subtype-specific sub-pathways for a cancer, which can provide candidates for cancer diagnoses and sub-pathway targeted treatments.Entities:
Keywords: cancer genes; common sub-pathways; drug targets; mutation; subtype-specific sub-pathways
Year: 2019 PMID: 31850075 PMCID: PMC6892778 DOI: 10.3389/fgene.2019.01228
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Description of mutation data used in this study.
| Data | Cancer type | Samples | References |
|---|---|---|---|
| Lung-data1 | LUAD | 230 | ( |
| LSCC | 178 | ( | |
| Lung-data2 | LUAD | 562 | ( |
| LSCC | 469 | ||
| Lung-data3 | LUAD | 438 | ( |
| LSCC | 308 | ||
| CRC-data1 | CRC | 619 | ( |
| CRC-data2 | 536 | ( | |
| CRC-data3 | 224 | ( | |
| CRC-data4 | 13 | – |
Figure 1The schematic diagram of the algorithm to identify common mutation sub-pathways. Orange nodes denote genes remained in the sub-pathway.
Figure 2The mutation frequencies of the common sub-pathways across different datasets for lung cancer (A) and CRC (B), respectively. The figures on the bars represent the number of genes within the identified sub-pathways. (C) The sub-pathway identified from PI3K-Akt signaling pathway. The genes with red font were genes in the sub-pathway and the genes in squares filled with red color were cancer genes.
Figure 3The top five most significant subtype-specific sub-pathways with the largest differences of mutation frequencies. The heatmap shows the p values of the sub-pathways calculated by Fisher’s exact test, and the figures on the heatmap represent the mutation frequency differences between lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LSCC). The mutation frequency difference was calculated as the mutation frequency of the sub-pathway in LUAD minus the mutation frequency of the sub-pathway in LSCC. When the figure on the heatmap was positive (negative), the sub-pathway was LUAD-specific (LSCC-specific) sub-pathway.
Figure 4Kaplan–Meier estimates of overall survival according to whether lung squamous cell carcinoma-specific sub-pathway of cell cycle mutated in the patients.