| Literature DB >> 28623901 |
Reiko Nishihara1,2,3,4, Kimberly Glass5, Kosuke Mima6, Tsuyoshi Hamada6, Jonathan A Nowak7, Zhi Rong Qian6, Peter Kraft8, Edward L Giovannucci8,5, Charles S Fuchs5,6,9,10,11, Andrew T Chan5,12, John Quackenbush13,14, Shuji Ogino15,16,17,18, Jukka-Pekka Onnela19.
Abstract
BACKGROUND: Colorectal carcinoma evolves through a multitude of molecular events including somatic mutations, epigenetic alterations, and aberrant protein expression, influenced by host immune reactions. One way to interrogate the complex carcinogenic process and interactions between aberrant events is to model a biomarker correlation network. Such a network analysis integrates multidimensional tumor biomarker data to identify key molecular events and pathways that are central to an underlying biological process. Due to embryological, physiological, and microbial differences, proximal and distal colorectal cancers have distinct sets of molecular pathological signatures. Given these differences, we hypothesized that a biomarker correlation network might vary by tumor location.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28623901 PMCID: PMC5474023 DOI: 10.1186/s12859-017-1718-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Tumor molecular and pathological features in colorectal carcinomas
| Biomarker | Status measured | % or mean (SD) of the status in proximal colon ( | % or mean (SD) of the status in distal colorectum | % availability in proximal colon ( | % availability in distal colorectum ( |
|---|---|---|---|---|---|
| Somatic oncogenic mutations | |||||
|
| Mutation | 25.1% | 5.6% | 86.1% | 87.4% |
|
| Mutation | 43.0% | 36.6% | 86.2% | 87.5% |
|
| Mutation | 18.4% | 14.5% | 79.7% | 80.9% |
| Methylation status | |||||
|
| Methylation | 25.3% | 3.9% | 83.5% | 82.5% |
|
| Methylation | 44.3% | 21.8% | 83.5% | 82.5% |
|
| Methylation | 40.3% | 9.9% | 83.5% | 82.3% |
|
| Methylation | 52.8% | 20.9% | 83.5% | 82.5% |
|
| Methylation | 44.3% | 12.1% | 83.5% | 82.5% |
|
| Methylation | 53.1% | 21.0% | 83.5% | 82.3% |
|
| Methylation | 40.1% | 8.8% | 83.5% | 82.5% |
|
| Methylation | 30.4% | 7.9% | 83.3% | 82.5% |
| LINE-1 | Degree of methylation (%) | 64.7 (SD, 9.7) | 62.6 (SD, 9.9) | 85.1% | 84.5% |
|
| Methylation | 22.0% | 29.6% | 65.1% | 65.2% |
| Microsatellite instability (MSI) | |||||
| MSI | MSI-high | 28.2% | 4.9% | 85.9% | 85.7% |
| Protein expression | |||||
| CDH1 | Loss | 52.6% | 50.9% | 44.9% | 42.5% |
| CDKN1A | Loss | 72.3% | 89.2% | 52.8% | 55.2% |
| CDKN1B | Loss | 47.9% | 34.8% | 51.7% | 53.8% |
| CDKN2A | Loss | 35.3% | 17.9% | 46.8% | 47.8% |
| CDX2 | Loss | 36.6% | 17.2% | 43.2% | 39.7% |
| MGMT | Loss | 34.7% | 38.9% | 42.6% | 43.9% |
| AURKA | Overexpression | 22.2% | 14.3% | 32.6% | 28.4% |
| CCND1 | Overexpression | 82.6% | 73.5% | 64.9% | 65.5% |
| CD274 | Overexpression | 85.6% | 83.7% | 43.2% | 40.0% |
| CDK8 | Overexpression | 72.6% | 69.4% | 30.7% | 26.5% |
| CTNNB1 (nuclear) | Overexpression | 32.7% | 58.2% | 75.8% | 74.2% |
| CTSB | Overexpression | 82.7% | 82.9% | 45.2% | 42.3% |
| DNMT3B | Overexpression | 19.7% | 10.3% | 45.5% | 49.3% |
| EPAS1 | Overexpression | 47.0% | 44.5% | 46.2% | 42.0% |
| FASN | Overexpression | 61.1% | 60.0% | 64.8% | 65.9% |
| HGF | Overexpression | 45.8% | 47.9% | 39.6% | 37.2% |
| HIF1A | Overexpression | 17.0% | 20.1% | 47.7% | 42.6% |
| IGF2BP3 | Overexpression | 36.8% | 32.7% | 43.8% | 41.2% |
| IRS1 | Overexpression | 29.5% | 30.6% | 44.6% | 41.7% |
| IRS2 | Overexpression | 31.6% | 33.2% | 44.9% | 41.9% |
| PPARG | Overexpression | 24.1% | 19.2% | 33.6% | 29.4% |
| PTGER2 | Overexpression | 27.8% | 26.3% | 45.8% | 41.3% |
| PTGS2 | Overexpression | 53.5% | 69.7% | 81.7% | 79.9% |
| SIRT1 | Overexpression | 38.6% | 37.9% | 39.4% | 35.9% |
| STAT3 | Overexpression | 54.7% | 54.0% | 46.4% | 43.2% |
| TP53 | Overexpression | 33.0% | 53.2% | 66.7% | 67.2% |
| VDR | Overexpression | 37.0% | 38.2% | 47.4% | 42.5% |
| YAP1 (cytoplasmic) | Overexpression | 21.3% | 17.7% | 44.2% | 41.0% |
| JC Virus T-Antigen (JCVT) | Overexpression | 28.4% | 40.7% | 43.9% | 47.7% |
| Immune reactions | |||||
| Peritumoral lymphocytic reaction | Greater reaction | 19.4% | 12.9% | 95.8% | 91.2% |
| Intratumoral periglandular reaction | Greater reaction | 17.2% | 9.0% | 95.9% | 91.6% |
| Tumor infiltrating lymphocytes (TIL) | Greater reaction | 16.3% | 4.4% | 95.9% | 91.6% |
| Crohn’s-like reaction | Greater reaction | 9.9% | 5.0% | 80.4% | 73.2% |
| CD3+ in cancer area | Density (cells/mm2) | 716.6 (SD, 1501.8) | 696.5 (SD, 1560.7) | 47.1% | 42.9% |
| CD8+ in cancer area | Density (cells/mm2) | 814.7 (SD, 1741.0) | 700.9 (SD, 1582.1) | 46.7% | 41.7% |
| CD45RO+ in cancer area | Density (cells/mm2) | 711.4 (SD, 1113.4) | 634.4 (SD, 1244.7) | 48.1% | 43.6% |
| FOXP3+ in cancer area | Density (cells/mm2) | 40.0 (SD, 47.0) | 35.2 (SD, 37.9) | 45.2% | 41.9% |
| miRNA expression | |||||
| MIR21 | Normalized expression level | 8.3 (SD, 13.0) | 5.6 (SD, 4.0) | 53.2% | 48.6% |
| MIR155 | Normalized expression level | 0.01 (SD, 0.01) | 0.005 (SD, 0.006) | 53.2% | 48.6% |
| Microorganism | |||||
|
| Presence | 16.0% | 9.1% | 71.7% | 67.1% |
aPercentage (%) indicates the proportion of the status measure for a binary biomarker, and the mean (SD) was calculated for a continuous biomarker
SD standard deviation
Demographic, clinical and pathologic features of colorectal cancers in the network analysis dataset by tumor location
| Total ( | Proximal colon cancer ( | Distal colorectal cancer ( |
| |
|---|---|---|---|---|
| Age, mean (SD) | 69.9 (8.6) | 70.0 (8.8) | 69.7 (8.5) | 0.56 |
| Sex | ||||
| Female | 758 (54.9%) | 379 (54.9%) | 379 (54.9%) | 0.99 |
| Male | 622 (45.1%) | 311 (45.1%) | 311 (45.1%) | |
| Tumor location | ||||
| Cecum | 253 (18.3%) | 253 (36.7%) | 0 (0%) | <0.0001 |
| Ascending colon | 303 (22.0%) | 303 (43.9%) | 0 (0%) | |
| Hepatic flexure | 45 (3.3%) | 45 (6.5%) | 0 (0%) | |
| Transverse colon | 89 (6.4%) | 89 (12.9%) | 0 (0%) | |
| Splenic flexure | 31 (2.2%) | 0 (0%) | 31 (4.5%) | |
| Descending colon | 66 (4.8%) | 0 (0%) | 66 (9.6%) | |
| Sigmoid colon | 296 (21.4%) | 0 (0%) | 296 (42.9%) | |
| Rectosigmoid junction | 100 (7.2%) | 0 (0%) | 100 (14.5%) | |
| Rectum | 197 (14.3%) | 0 (0%) | 197 (28.6%) | |
| TNM Stage | ||||
| I | 324 (26.2%) | 146 (22.9%) | 178 (29.7%) | 0.002 |
| II | 404 (32.7%) | 232 (36.4%) | 172 (28.7%) | |
| III | 332 (26.8%) | 159 (24.9%) | 173 (28.9%) | |
| IV | 177 (14.3%) | 101 (15.8%) | 76 (12.7%) | |
| Tumor differentiation | ||||
| Well to moderate | 1231 (89.2%) | 577 (83.6%) | 654 (94.8%) | <0.0001 |
| Poor | 149 (10.8%) | 113 (16.4%) | 36 (5.2%) | |
| Family history of colorectal cancer in first-degree relative(s) | ||||
| No | 1074 (78.4%) | 532 (77.9%) | 542 (78.9%) | 0.65 |
| Yes | 296 (21.6%) | 151 (22.1%) | 145 (21.1%) | |
The % numbers indicate the fraction of cases with a given feature among total cases, proximal colon cancer cases, or distal colorectal cancer cases
a P value was calculated using a t-test for age and chi-squared tests for categorical variables
Network characteristics by tumor location
| Total ( | Proximal colon cancer ( | Distal colorectal cancer ( | |
|---|---|---|---|
| Correlation network based on original variables | |||
| Number of nodes | 54 | 54 | 54 |
| Number of edges | 268 | 173 | 95 |
| Median degree | 8.0 | 3.0 | 2.0 |
| Average clustering coefficient | 0.52 | 0.50 | 0.30 |
| Hubsa (degree centrality) |
| MSI (0.39) | - |
| Correlation network in non-MSI-high cancer | |||
| Number of nodes | 53 | 53 | 53 |
| Median degree | 4.0 | 1.0 | 1.0 |
| Number of edges | 120 | 64 | 56 |
| Average clustering coefficient | 0.33 | 0.23 | 0.26 |
| Hubsa (degree centrality) |
|
|
|
| Correlation network based on the same number of edges | |||
| Number of nodes | 54 | 54 | 54 |
| Median degree | 1.5 | 4.0 | 1.0 |
| Number of edges | 100 | 100 | 100 |
| Average clustering coefficient | 0.27 | 0.29 | 0.27 |
| Hubsa (degree centrality) |
|
|
|
aMarkers with degree centrality at or above the 80th percentile in the colorectal cancer network
b CDKN2A promoter hypermethylation
MSI microsatellite instability, TIL Tumor infiltrating lymphocytes
Fig. 1Biomarker networks by tumor location; proximal colon cancer network (a), and distal colorectal cancer network (b). A node represents a molecular feature, and an edge specifies a statistically significant Spearman correlation between two markers (nodes) with a significance level of 3.5 × 10−5 (0.05/1431, based on the Bonferroni correction). The red line indicates a positive correlation, and the blue line indicates a negative correlation; line width is proportional to correlation coefficient. CDKN2A (IHC), protein expression of CDKN2A; CDKN2A, methylation level of CDKN2A; LINE-1, methylation level of long interspersed nucleotide element 1; MSI, microsatellite instability; TIL, lymphocytes on top of neoplastic epithelial cells
Fig. 2a Degree distribution in biomarker networks by tumor location. b Cumulative degree distribution in biomarker networks by tumor location. The solid (the distal colorectal cancer network) and dashed lines (the proximal colon cancer network) indicate the median degree values
Markers with differential connectivity by tumor location based on Cook’s distance among highly-connected markersa
| Highly-connected markersa | Degree centrality | Degree in proximal colon cancer | Degree in distal colorectal cancer | Cook’s distance |
|---|---|---|---|---|
| MSI | 0.45 | 21 | 9 | 0.12 |
|
| 0.38 | 18 | 9 | 0.028 |
|
| 0.38 | 16 | 8 | 0.015 |
|
| 0.45 | 18 | 10 | 0.0071 |
| TIL | 0.43 | 18 | 10 | 0.0071 |
aHighly-connected markers which were at or above the 80th percentile of the degree distribution in the colorectal cancer network. The table shows markers with Cook’s distance higher than 0.005
MSI microsatellite instability, TIL tumor infiltrating lymphocytes
Fig. 3Total number of edges in biomarker networks as a function of the significance levels in Spearman correlation analyses