| Literature DB >> 30872752 |
Huan-Ming Hsu1,2, Chi-Ming Chu3,4,5,6,7,8, Yu-Jia Chang9, Jyh-Cherng Yu10, Chien-Ting Chen3, Chen-En Jian3, Chia-Yi Lee3, Yueh-Tao Chiang11,12, Chi-Wen Chang11,12, Yu-Tien Chang13,14.
Abstract
Gene co-expression network analysis (GCNA) can detect alterations in regulatory activities in case/control comparisons. We propose a framework to detect novel genes and networks for predicting breast cancer recurrence. Thirty-four prognosis candidate genes were selected based on a literature review. Four Gene Expression Omnibus Series (GSE) microarray datasets (n = 920) were used to create gene co-expression networks based on these candidates. We applied the framework to four comparison groups according to node (+/-) and recurrence (+/-). We identified a sub-network containing two candidate genes (LST1 and IGHM) and six novel genes (IGHA1, IGHD, IGHG1, IGHG3, IGLC2, and IGLJ3) related to B cell-specific immunoglobulin. These novel genes were correlated with recurrence under the control of node status and were found to function as tumor suppressors; higher mRNA expression indicated a lower risk of recurrence (hazard ratio, HR = 0.87, p = 0.001). We created an immune index score by performing principle component analysis and divided the genes into low and high groups. This discrete index significantly predicted relapse-free survival (RFS) (high: HR = 0.77, p = 0.019; low: control). Public tool KM Plotter and TCGA-BRCA gene expression data were used to validate. We confirmed these genes are correlated with RFS and distal metastasis-free survival (DMFS) in triple-negative breast cancer (TNBC) and general breast cancer.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30872752 PMCID: PMC6418134 DOI: 10.1038/s41598-019-40826-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Highly co-expressed genes correlated with BC recurrence.
| Recurrence | Non-recurrence |
|---|---|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
| |
|
| |
P values were calculated using Cox proportional hazard ratio regression for breast cancer recurrence controlled by node (+/−) and *means p value < 0.05.
Figure 1Gene co-expression networks. Co-expression networks of six subgroups, (a) Recurrence (+), (b) Recurrence (−), (c) Node (+) and Recurrence (+), (d) Node (+) and Recurrence (−), (e) Node (−) and Recurrence (+), (f) Node (−) and Recurrence (−). The width of the gene connection indicates the degree of correlation between genes. Colors of the gene icons and connecting lines denote similar gene expression patterns for genes in the same color, which were analyzed by hierarchical clustering. Connection lines in green denote neighboring genes that do not belong to the same cluster. Size of the gene icon reflects the absolute value of cv of gene expression. The 34 candidate genes are represented by diamonds; co-expressed genes are represented by circles, and significant recurrence associated co-expressed genes are represented by stars. The gene icon frame is shown in red if 0.01 ≤ p < 0.05, and it is shown in yellow if p < 0.01. Up-regulated genes are shown by a dashed line, whereas down-regulated genes are shown by a solid line.
Highly co-expressed genes correlated with BC recurrence grouped by node status.
| Group | Recurrence | Non-recurrence |
|---|---|---|
| Node+ | None |
|
|
| ||
|
| ||
|
| ||
|
| ||
|
| ||
| Node− |
|
|
|
|
| |
|
| ||
|
|
| |
|
|
|
P values were calculated using Cox proportional hazard ratio regression for breast cancer recurrence controlled by node (+/−). *Means p value < 0.05, **means p value < 0.01.
Univariable Cox proportional hazard ratio regression of novel co-expressed genes for breast cancer recurrence.
| Gene | B | HR | P |
|---|---|---|---|
|
| −0.17 | 0.85 | 0.02 |
|
| −0.13 | 0.87 | 0.03 |
|
| −0.24 | 0.78 | 0.02 |
|
| −0.17 | 0.84 | 0.02 |
|
| −0.16 | 0.86 | 0.01 |
|
| −0.16 | 0.85 | 0.02 |
|
| 0.22 | 1.24 | 0.02 |
|
| 0.10 | 1.10 | 0.13 |
|
| −0.04 | 0.96 | 0.50 |
|
| −0.02 | 0.98 | 0.81 |
|
| 0.30 | 1.35 | 0.05 |
|
| −0.03 | 0.97 | 0.65 |
|
| 0.02 | 1.02 | 0.79 |
P values (p) were calculated using Cox proportional hazard ratio regression for breast cancer recurrence controlled by node (+/−). *Means p < 0.05. HR: hazard ratio, B: the coefficient of predictors in the Cox proportional hazard ratio regression.
Cox proportional hazard ratio regression of the immune index for breast cancer recurrence.
| Model 1a | B | HR | P |
|---|---|---|---|
| Node | 0.53 | 1.70 | 0.001 |
| Immune index | −0.14 | 0.87 | 0.014 |
|
| |||
| Node | 0.461 | 1.586 | 0.004 |
| Low immune index (n = 355) | ref | ||
| High immune index (n = 552) | −0.256 | 0.774 | 0.019 |
aImmune index used in this model is a continuous variable.
bImmune index in this model was divided into high and low immune index groups by cutoff point −0.5.
B: the coefficients of predictors, HR: hazard ratio and ref: reference group in the Cox proportional hazard ratio regression.
Figure 2Cox proportional hazard ratio regression prediction model of a relapse-free survival curve based on the immune index.
The validation of immune-related genes using TNBC samples from KM Plotter online cancer survival analysis tool (http://kmplot.com/analysis/).
| RFS | DMFS |
|---|---|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
Multivariable Cox Proportional-Hazards Regression Models of immunoglobulin-related genes and node status on RFS and DMFS using TGCA-BRCA data sets.
| Gene | Univariable Cox Proportional-Hazards Regression | RFS | DMFS | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TNBC = 0 | TNBC = 1 | ALL | TNBC = 0 | TNBC = 1 | ALL | ||||||||||||||||
| B | HR | P value | B | HR | P value | B | HR | P value | B | HR | P value | B | HR | P value | B | HR | P value | B | HR | P value | |
| IGDCC3 | −0.00108 | 0.998921 | 0.063262 | −0.00657 | 0.993451 | 0.000488*** | −0.0008 | 0.999199 | 0.040908* | −0.00135 | 0.998651 | 0.017673* | |||||||||
| IGJ | −0.00056 | 0.999437 | 0.244451 | −0.00168 | 0.998319 | 0.000325*** | −0.00106 | 0.998939 | 0.003212** | −0.00157 | 0.99843 | 0.026129* | |||||||||
| IGSF22 | −0.00119 | 0.998815 | 0.013396* | −0.00162 | 0.99838 | 0.004776** | −0.00151 | 0.998488 | 0.002168** | ||||||||||||
| IGSF6 | 0.000771 | 1.000771 | 0.097379 | 0.001365 | 1.001366 | 0.022637* | |||||||||||||||
| IGSF9 | −0.00052 | 0.999482 | 0.223937 | −0.00124 | 0.998759 | 0.031277* | |||||||||||||||
| IGSF9B | 0.000988 | 1.000988 | 0.047992* | 0.000972 | 1.000972 | 0.036841* | 0.001386 | 1.001387 | 0.017102* | ||||||||||||
| IGSF3 | −0.00037 | 0.999633 | 0.428202 | 0.0022 | 1.002203 | 0.005787** | |||||||||||||||
Multivariable Cox Proportional-Hazards Regression analysis of relapse-free survival (RFS) and distal metastasis-free survival (DMFS) were under controlled by node status of negative/positive and N0-N3 respectively. P stands for p value. *Means p value < 0.05, **means p value < 0.01, ***means p < 0.001. HR: Hazard ratio.
Descriptive statistics of the four microarray datasets.
| Variables | Data sets, n(%) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Wang | Sotiriou | Ivshina | Desmedt | ||||||
| Recurrence at the end of follow-up# | 0 | 179 | 62.6 | 120 | 64.2 | 160 | 64.3 | 107 | 54.0 |
| 1 | 107 | 37.4 | 67 | 35.8 | 89 | 35.7 | 91 | 46.0 | |
| Node | Negative | 286 | 100.0 | 153 | 83.6 | 159 | 66.3 | 198 | 100.0 |
| Positive | 0 | 0 | 30 | 16.4 | 81 | 33.8 | 0 | 0 | |
| Follow-up* time, mean(sd) | 6.46(3.52) | 6.62(3.95) | 7.14(4.30) | 9.31(5.56) | |||||
#Chi-square: for analysis of the difference between recurrence status and data sets, p = 0.104.
*ANOVA: for analysis of the difference in the follow-up time among the data sets, p < 0.001.
Figure 3Study flowchart.