| Literature DB >> 35306505 |
Roberto Herrero1, Dario A Leon2,3, Augusto Gonzalez4.
Abstract
A small portion of a tissue defines a microstate in gene expression space. Mutations, epigenetic events or external factors cause microstate displacements which are modeled by combining small independent gene expression variations and large Levy jumps, resulting from the collective variations of a set of genes. The risk of cancer in a tissue is estimated as the microstate probability to transit from the normal to the tumor region in gene expression space. The formula coming from the contribution of large Levy jumps seems to provide a qualitatively correct description of the lifetime risk of cancer in 8 tissues, and reveals an interesting connection between the risk and the way the tissue is protected against infections.Entities:
Mesh:
Year: 2022 PMID: 35306505 PMCID: PMC8934350 DOI: 10.1038/s41598-022-08502-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
TCGA abbreviations for the studied cancer types.
| Abbreviation | Cancer type |
|---|---|
| BLCA | Bladder Urothelial Carcinoma |
| Breast invasive carcinoma | |
| Colon adenocarcinoma | |
| Esophageal carcinoma | |
| Head and and neck squamous cell carcinoma | |
| KIRC | Kidney clear cell carcinoma |
| KIRP | Kidney papillary cell carcinoma |
| Liver hepatocellular carcinoma | |
| Lung adenocarcinoma | |
| LUSC | Lung squamous cell carcinoma |
| PRAD | Prostate adenocarcinoma |
| READ | Rectum adenocarcinoma |
| STAD | Stomach adenocarcinoma |
| Thyroid carcinoma | |
| UCEC | Uterine corpus endometrial carcinoma |
Figure 1(a) PC analysis of the GE data for adenocarcinoma of the colon. Normal (blue circles) and tumor samples (red circles) are shown. Ellipses illustrating the centers and r.m.s. radii of both clouds of points are drawn. (b) Schematics of the fitness landscape. The fitness is normalized to the homeostatic value. The tumor region exhibits the deepest well (highest fitness).
Figure 2The 30 genes with most significant contributions to the vector in COAD. The x axis is the sequence number of a given gene in the TCGA data. CST1 is highlighted among the over-expressed and AQP8 among the silenced genes.
A set of parameters compiled for a group of tumors.
| BLCA | 140.61 | 57.53 | 34.68 | 48.40 | 0.0512 | . | . | . | . |
| 137.37 | 20.97 | 31.66 | 84.74 | 0.0450 | 4.3 | 0.09228 | 0.03427 | ||
| 155.89 | 11.71 | 28.53 | 115.65 | 0.0526 | 73 | 0.04264 | 0.01504 | ||
| 138.70 | 64.28 | 35.79 | 38.63 | 0.0710 | 33.18 | 0.00412 | 0.01378 | ||
| 123.50 | 27.74 | 23.54 | 72.22 | 0.0549 | 21.15 | 0.01527 | 0.00578 | ||
| KIRC | 171.81 | 28.70 | 36.01 | 107.09 | 0.0679 | . | . | . | . |
| KIRP | 163.42 | 19.90 | 27.78 | 115.74 | 0.0768 | . | . | . | . |
| 134.67 | 20.48 | 45.23 | 68.96 | 0.0461 | 0.9125 | 0.00397 | 0.00310 | ||
| 145.33 | 13.52 | 32.06 | 99.75 | 0.0581 | 0.07 | 0.01610 | 0.00847 | ||
| LUSC | 194.49 | 11.62 | 36.65 | 146.22 | 0.0522 | . | . | . | . |
| PRAD | 91.33 | 31.31 | 32.17 | 27.85 | 0.0523 | 3 | 0.13712 | 0.07730 | |
| READ | 168.05 | 22.90 | 28.81 | 116.34 | 0.0521 | . | . | . | . |
| STAD | 136.97 | 27.14 | 43.24 | 66.59 | 0.0455 | . | . | . | . |
| 112.55 | 20.02 | 39.85 | 52.67 | 0.0532 | 0.087 | 0.00649 | 0.00442 | ||
| UCEC | 171.38 | 38.24 | 22.14 | 111.00 | 0.0439 | . | . | . | . |
The geometry of the normal and tumor regions, i.e. the parameters , and come from Ref.[10]. The minimal distance between both regions is . The D value is estimated as the maximum of [11]. On the other hand, the number of tissue stem cells, and the stem cell turnover rate, , are borrowed from Refs.[27,28]. The lifetime risk of cancer and its deviation (when available) is computed from Ref.[28] as the mean value and the standard deviation of the cumulative risk at a maximum age of 80 years. Bold marked tissues correspond to cancer types for which all the data is available.
Figure 3A test of how Eq. (6) describes the risk of cancer in 8 tissues. Data from Table 2 is used to this end. A very small slope is obtained in both, the full fit and a fit without LUAD and THCA, thus small amplitude fluctuations in gene expression space may not account for the risk of cancer in these tissues.
Figure 4Testing the ability of Eq. (10) to describe the cancer risk in 8 tissues. The error bars were estimated by means of the last column of Table 2. The slope of the linear fit is near one, as expected. 72 % of the data dispersion is explained by the linear dependence (p-value = 0.04).
Figure 5Lifetime risk of cancer per stem cell in a tissue vs the number of stem cell generations. The analysis is based on Eq. (12). The band delimited by the red dashed lines contains the group of tissues qualified as normal. See detailed explanations in the main text.
The Extra Risk Score (ERS) index of Eq. (12) for cancer in different tissues.
| Cancer type | ERS |
|---|---|
| Hepatocellular C | 1.13 |
| Melanoma | 1.16 |
| Pancreatic endocrine C | 1.23 |
| Pancreatic ductal AC | 1.45 |
| Medulloblastoma | 1.49 |
| Myeloid leukemia | 1.54 |
| Duodenal AC | 1.93 |
| Lymphocytic leukemia | 1.95 |
| Colorectal AC | 2.04 |
| Basal Cell C | 4.02 |
| Lung AC (non-smokers) | 5.15 |
| Esophageal SCC | 5.44 |
| Hepatocellular C with HCV | 11.29 |
| Colorectal AC with Lynch | 21.30 |
| Colorectal AC with FAP | 42.61 |
| Head and Neck SCC with HPV | 122.96 |
| Duodenal AC with FAP | 225.29 |
| Small intestinal AC | 0.12 |
| Glioblastoma | 30.03 |
| Testicular germinal cell | 52.78 |
| Osteosarcomas Head | 70.03 |
| Ovarian germinal cell | 79.87 |
| Thyroid medullary C | 84.22 |
| Osteosarcomas Arms | 124.72 |
| Osteosarcomas Pelvis | 138.09 |
| Osteosarcomas | 153.05 |
| Thyroid papillary and follicular C | 239.78 |
| Osteosarcomas Legs | 266.49 |
| Gallbladder non papillary AC | 1299.58 |
| Head and Neck SCC | 21.38 |
| Lung AC (smokers) | 92.77 |