| Literature DB >> 25802847 |
Tao Huang1, Jing Yang2, Yu-Dong Cai3.
Abstract
The mechanisms of lung cancer are highly complex. Not only mRNA gene expression but also microRNAs, DNA methylation, and copy number variation (CNV) play roles in tumorigenesis. It is difficult to incorporate so much information into a single model that can comprehensively reflect all these lung cancer mechanisms. In this study, we analyzed the 129 TCGA (The Cancer Genome Atlas) squamous cell lung carcinoma samples with gene expression, microRNA expression, DNA methylation, and CNV data. First, we used variance inflation factor (VIF) regression to build the whole genome integrative network. Then, we isolated the lung cancer subnetwork by identifying the known lung cancer genes and their direct regulators. This subnetwork was refined by the Bayesian method, and the directed regulations among mRNA genes, microRNAs, methylations, and CNVs were obtained. The novel candidate key drivers in this refined subnetwork, such as the methylation of ARHGDIB and HOXD3, microRNA let-7a and miR-31, and the CNV of AGAP2, were identified and analyzed. On three large public available lung cancer datasets, the key drivers ARHGDIB and HOXD3 demonstrated significant associations with the overall survival of lung cancer patients. Our results provide new insights into lung cancer mechanisms.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25802847 PMCID: PMC4352729 DOI: 10.1155/2015/358125
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
TCGA sample IDs of 129 squamous cell lung carcinoma patients with gene expression, microRNA expression, DNA methylation, and CNV data.
| Sample IDs | Sample IDs | Sample IDs | Sample IDs | Sample IDs |
|---|---|---|---|---|
| TCGA-18-3414 | TCGA-33-4566 | TCGA-66-2795 | TCGA-39-5021 | TCGA-39-5037 |
| TCGA-18-3411 | TCGA-60-2713 | TCGA-66-2744 | TCGA-51-4080 | TCGA-37-4141 |
| TCGA-18-3410 | TCGA-18-3417 | TCGA-66-2742 | TCGA-21-1075 | TCGA-33-4547 |
| TCGA-18-3412 | TCGA-66-2786 | TCGA-66-2771 | TCGA-39-5031 | TCGA-33-4582 |
| TCGA-22-0944 | TCGA-33-4583 | TCGA-66-2787 | TCGA-21-1077 | TCGA-34-5240 |
| TCGA-46-3766 | TCGA-66-2785 | TCGA-46-3767 | TCGA-37-4135 | TCGA-66-2791 |
| TCGA-18-3406 | TCGA-66-2780 | TCGA-66-2754 | TCGA-21-1072 | TCGA-66-2789 |
| TCGA-22-4604 | TCGA-66-2781 | TCGA-18-3415 | TCGA-37-4133 | TCGA-66-2788 |
| TCGA-60-2706 | TCGA-18-4721 | TCGA-18-3419 | TCGA-33-4533 | TCGA-66-2792 |
| TCGA-60-2696 | TCGA-22-4613 | TCGA-46-3765 | TCGA-33-4538 | TCGA-22-4607 |
| TCGA-60-2710 | TCGA-22-4601 | TCGA-18-3421 | TCGA-37-4130 | TCGA-22-4596 |
| TCGA-60-2698 | TCGA-66-2782 | TCGA-18-3408 | TCGA-66-2790 | TCGA-33-4532 |
| TCGA-60-2711 | TCGA-43-3920 | TCGA-18-3416 | TCGA-66-2737 | TCGA-22-4595 |
| TCGA-60-2712 | TCGA-46-3769 | TCGA-66-2793 | TCGA-66-2753 | TCGA-22-4591 |
| TCGA-60-2708 | TCGA-43-3394 | TCGA-66-2783 | TCGA-66-2734 | TCGA-22-4594 |
| TCGA-66-2758 | TCGA-51-4079 | TCGA-66-2794 | TCGA-34-2596 | TCGA-37-3789 |
| TCGA-60-2722 | TCGA-37-3792 | TCGA-66-2800 | TCGA-34-2608 | TCGA-46-3768 |
| TCGA-60-2721 | TCGA-39-5039 | TCGA-56-1622 | TCGA-43-2581 | TCGA-43-2578 |
| TCGA-60-2724 | TCGA-63-5131 | TCGA-60-2725 | TCGA-60-2695 | TCGA-66-2727 |
| TCGA-60-2716 | TCGA-39-5036 | TCGA-51-4081 | TCGA-34-2600 | TCGA-39-5011 |
| TCGA-66-2759 | TCGA-39-5034 | TCGA-22-1012 | TCGA-66-2766 | TCGA-34-5241 |
| TCGA-60-2723 | TCGA-63-5128 | TCGA-22-1011 | TCGA-66-2767 | TCGA-39-5029 |
| TCGA-66-2755 | TCGA-39-5035 | TCGA-21-1079 | TCGA-66-2770 | TCGA-39-5028 |
| TCGA-60-2719 | TCGA-39-5030 | TCGA-21-1078 | TCGA-66-2765 | TCGA-18-3407 |
| TCGA-60-2720 | TCGA-66-2768 | TCGA-21-1080 | TCGA-66-2763 | TCGA-18-4086 |
| TCGA-60-2714 | TCGA-33-4586 | TCGA-21-1076 | TCGA-66-2777 |
Figure 1The refined key Bayesian subnetwork of lung cancer. The grey, green, red, and pink nodes represent mRNA genes, microRNAs, methylations, and copy number variations (CNVs), respectively. The one-arrow edges represent directed regulation, while the two-arrow edges represent undirected regulation.
Figure 2The Kaplan-Meier plots of key drivers ARHGDIB and HOXD3 on three large lung cancer datasets. The log-rank P values of ARHGDIB on GSE4573 (a), GSE30219 (b), and GSE41271 (c) were 0.042, 0.0781, and 0.0021, respectively. The patients with high expression of ARHGDIB had good prognoses. The log-rank P values of HOXD3 on GSE4573 (d), GSE30219 (e), and GSE41271 (f) were 0.0441, 0, and 0.0888, respectively. The patients with high expression of HOXD3 had poor prognoses.