| Literature DB >> 24462714 |
Xiaolei Zhao1, Shouqiang Zhong2, Xiaoyu Zuo3, Meihua Lin1, Jiheng Qin1, Yizhao Luan1, Naizun Zhang2, Yan Liang4, Shaoqi Rao5.
Abstract
Many cancers apparently showing similar phenotypes are actually distinct at the molecular level, leading to very different responses to the same treatment. It has been recently demonstrated that pathway-based approaches are robust and reliable for genetic analysis of cancers. Nevertheless, it remains unclear whether such function-based approaches are useful in deciphering molecular heterogeneities in cancers. Therefore, we aimed to test this possibility in the present study. First, we used a NCI60 dataset to validate the ability of pathways to correctly partition samples. Next, we applied the proposed method to identify the hidden subtypes in diffuse large B-cell lymphoma (DLBCL). Finally, the clinical significance of the identified subtypes was verified using survival analysis. For the NCI60 dataset, we achieved highly accurate partitions that best fit the clinical cancer phenotypes. Subsequently, for a DLBCL dataset, we identified three hidden subtypes that showed very different 10-year overall survival rates (90%, 46% and 20%) and were highly significantly (P=0.008) correlated with the clinical survival rate. This study demonstrated that the pathway-based approach is promising for unveiling genetic heterogeneities in complex human diseases.Entities:
Keywords: Cancer; Enrichment analysis; Genetic heterogeneity; Pathway-based approach; Sample partitioning; Survival analysis
Mesh:
Year: 2014 PMID: 24462714 PMCID: PMC4411334 DOI: 10.1016/j.gpb.2013.12.001
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Signature pathways for NCI60
| hsa05222: small cell lung cancer | 19 | 7.83E−06 | 9.82E−03 | 0.83 | 2 | <0.001 |
| hsa04512: ECM–receptor interaction | 21 | 3.03E−07 | 3.81E−04 | 0.69 | 8 | <0.001 |
| hsa04510: focal adhesion | 36 | 1.54E−10 | 1.93E−07 | 0.78 | 3 | <0.001 |
Note: Signature pathways for NCI60 were identified by using FDR for multiple tests correction (adjusted α = 0.01). Details of the NCI60 dataset were described previously [18]. a Modified Fisher Exact P value. b FDR stands for false positive rate, which is used for adjustment of multiple tests for 201 pathways. c Statistical significance of ARI for the selected pathway. ARI stands for adjusted Rand index.
Figure 1Decision tree based on three signature pathways for five cancer types The internal nodes of the tree are the signature pathways. The leaf nodes represent the classification for five types of cancer (renal cancer, central nervous system cancer, melanoma, colon cancer and leukemia). Included in the leaf nodes are the total number of samples over the number of the incorrectly predicted samples for the specific type of cancer indicated.
Signature pathways for DLBCL
| hsa04640: hematopoietic cell lineage | 22 | 3.80E−10 | 4.76E−07 |
| hsa04060: cytokine receptor interaction | 24 | 1.00E−06 | 1.26E−03 |
Note: Signature pathways for DLBCL were identified by using FDR for multiple tests correction (adjusted α = 0.01). a Modified Fisher Exact P value. b FDR stands for false positive rate, which is used for adjustment of multiple tests for 201 pathways. DLBCL stands for diffuse large B-cell lymphoma.
Figure 2Clinically distinct DLBCL subtypes defined by gene expression profiling of two signature pathways Kaplan–Meier plot of the overall survival of the three molecular subtypes of DLBCL, partitioned using the expression profiles of the genes contained in two signature pathways, hsa04640 and hsa04060.
Multivariate Cox proportional-hazard model built using the genes in the two signature pathways
| CD10 | −0.762 | 10.635 | 0.001 | 0.530 (0.295–0.738) |
| CD21 | −0.735 | 6.210 | 0.013 | 0.467 (0.269–0.855) |
| IL2RB | −0.630 | 6.377 | 0.012 | 0.479 (0.327–0.869) |
Note: CI stands for confidence interval.
Figure 3A detailed procedure chart for pathway-based analysis of genetic heterogeneities