| Literature DB >> 25452685 |
Yen-Tsung Huang1, Thomas Hsu2, David C Christiani3.
Abstract
The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of X (2) distributions that can be obtained using permutation with scaled X (2) approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed a genome-wide copy number data from 264 patients of non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (<2.8 × 10(-5)), including the PTEN pathway (7.8 × 10(-7)), the gene set up-regulated under heat shock (3.6 × 10(-6)), the gene sets involved in the immune profile for rejection of kidney transplantation (9.2 × 10(-6)) and for transcriptional control of leukocytes (2.2 × 10(-5)), and the ganglioside biosynthesis pathway (2.7 × 10(-5)). In conclusion, we present a new method for pathway analyses of copy number data, and causal mechanisms of the five pathways require further study.Entities:
Keywords: cancer genomics; copy numbers; gene set analyses; pathway analyses; variance component test
Year: 2014 PMID: 25452685 PMCID: PMC4218657 DOI: 10.4137/CIN.S13978
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Power curves of TEGS-CN varying the magnitude of associations β (A) and the proportion of non-zero associations (B).
Figure 2Power curves of TEGS-CN and TEGS, with CN-smoking associations occurring in small genes.
Characteristics of the 264 non-small-cell lung cancer patients.
| TOTAL | HEAVY SMOKERS | LIGHT- OR NONSMOKER | |
|---|---|---|---|
| Sample Size | 264 | 132 | 132 |
| Male (%) | 61.4 | 67.4 | 55.3 |
| Age | |||
| Mean +/− SD | 67.4 +/− 8.3 | 67.7 +/− 8.0 | 67.1 +/− 8.5 |
| Cigarette Smoking in Pack-Years | |||
| Median +/− IQR | 35.6 +/− 38.72 | 58.8 +/− 40.5 | 19.8 +/− 18.7 |
| Clinical Stage | |||
| Stage 1 (%) | 73.1 | 73.5 | 72.7 |
| Stage 2 (%) | 17.0 | 15.9 | 18.2 |
| Stage 3 or 4 (%) | 9.08 | 10.1 | 7.6 |
| Cigarette Smoking Status | |||
| Never Smoked (%) | 6.8 | 0 | 13.6 |
| Ex-Smokers (%) | 48.5 | 49.2 | 47.7 |
| Current Smokers (%) | 44.7 | 50.8 | 38.6 |
| Adenocarcinoma (%) | 66.3 | 64.4 | 68.2 |
Notes:
Heavy smokers are defined as pack-years of cigarette smoking >35.6 (the median of the smoking pack-years in 264 subjects), and light- or non-smokers are those with ≤35.6 smoking pack-years.
Figure 3Distribution of numbers of genes (A) and copy number probes (B) in pathways. Note that the number of genes (A) and copy number probes (B) per pathway was truncated at 500 and 4000, respectively, due to skewness of the distribution, and the entire range was described in the text.
The five pathways with P value <0.05 after Bonferroni adjustment.
| GENE PATHWAY (MSigDB ID) | NUMBER OF GENES | NUMBER OF COPY NUMBER PROBES | NOMINAL P-VALUES |
|---|---|---|---|
| PTENPATHWAY | 10 | 54 | 7.8 × 10−7 |
| HEATSHOCK_YOUNG_UP | 5 | 21 | 3.6 × 10−6 |
| FLECHNER_PBL_KIDNEY_TRANSPLANT_REJECTED_VS_OK_DN | 31 | 103 | 9.2 × 10−6 |
| SCHRAETS_MLL_TARGETS_UP | 21 | 120 | 2.2 × 10−5 |
| GANGLIOSIDE_BIOSYNTHESIS | 6 | 37 | 2.7 × 10−5 |
P values of genes in PTENPATHWAY.
| GENES | WORKING INDEPENDENCE | SAMPLE COVARIANCE |
|---|---|---|
| 6.52 × 10−7 | 0.0592 | |
| 0.0391 | 0.0131 | |
| 0.0726 | 0.0606 | |
| 0.0891 | 0.135 | |
| 0.106 | 0.106 | |
| 0.239 | 0.255 | |
| 0.295 | 0.295 | |
| 0.270 | 0.270 | |
| 0.457 | 1.65 × 10−4 | |
| 0.539 | 0.539 |
P values of genes in GANGLIOSIDE_BIOSYNTHESIS.
| GENES | WORKING INDEPENDENCE | SAMPLE COVARIANCE |
|---|---|---|
| 0.000441 | 0.118 | |
| 0.000489 | 0.176 | |
| 0.00527 | 0.00578 | |
| 0.0721 | 0.0725 | |
| 0.402 | 0.316 | |
| 0.740 | 0.740 |
P values of genes in HEATSHOCK_YOUNG_UP.
| GENES | WORKING INDEPENDENCE | SAMPLE COVARIANCE |
|---|---|---|
| 1.42 × 10−6 | 5.92 × 10−7 | |
| 0.214 | 0.492 | |
| 0.230 | 0.177 | |
| 0.360 | 0.360 | |
| 0.767 | 0.404 |
P values of genes in FLECHNER_PBL_KIDNEY_ TRANSPLANT_REJECTED_VS_OK_DN.
| GENES | WORKING INDEPENDENCE | SAMPLE COVARIANCE |
|---|---|---|
| 9.86 × 10−5 | 0.245 | |
| 0.00064 | 0.00064 | |
| 0.00588 | 0.00588 | |
| 0.0363 | 0.0363 | |
| 0.0453 | 0.0495 | |
| 0.0519 | 0.0519 | |
| 0.0733 | 0.0897 | |
| 0.0987 | 0.113 | |
| 0.104 | 0.0810 | |
| 0.203 | 0.203 | |
| 0.206 | 0.1623 | |
| 0.208 | 0.1477 | |
| 0.239 | 0.255 | |
| 0.287 | 0.288 | |
| 0.295 | 0.295 | |
| 0.335 | 0.335 | |
| 0.367 | 0.0444 | |
| 0.372 | 0.261 | |
| 0.381 | 0.187 | |
| 0.438 | 0.438 | |
| 0.438 | 0.438 | |
| 0.470 | 0.470 | |
| 0.488 | 0.488 | |
| 0.506 | 0.0411 | |
| 0.524 | 0.548 | |
| 0.554 | 0.554 | |
| 0.555 | 0.576 | |
| 0.607 | 0.653 | |
| 0.634 | 0.101 | |
| 0.656 | 0.656 | |
| 0.700 | 0.429 |
P values of genes in SCHRAETS_MLL_TARGETS_UP.
| GENES | WORKING INDEPENDENCE | SAMPLE COVARIANCE |
|---|---|---|
| 5.40 × 10−5 | 0.00177 | |
| 7.98 × 10−5 | 0.0395 | |
| 0.000145 | 9.61 × 10−5 | |
| 0.00199 | 0.000263 | |
| 0.00952 | 0.0954 | |
| 0.0243 | 0.00311 | |
| 0.0272 | 0.0265 | |
| 0.0529 | 0.0559 | |
| 0.0869 | 0.106 | |
| 0.161 | 0.171 | |
| 0.215 | 0.00658 | |
| 0.281 | 0.233 | |
| 0.295 | 0.295 | |
| 0.466 | 0.466 | |
| 0.476 | 0.425 | |
| 0.491 | 0.0850 | |
| 0.492 | 0.492 | |
| 0.625 | 0.625 | |
| 0.686 | 0.686 | |
| 0.775 | 0.405 | |
| 0.994 | 0.994 |