| Literature DB >> 32900359 |
Kyung Hyun Lee1, Marek Kimmel2,3.
Abstract
*: Background Telomeres, which are composed of repetitive nucleotide sequences at the end of chromosomes, behave as a division clock that measures replicative senescence. Under the normal physiological condition, telomeres shorten with each cell division, and cells use the telomere lengths to sense the number of divisions. Replicative senescence has been shown to occur at approximately 50-70 cell divisions, which is termed the Hayflick's limit. However, in cancer cells telomere lengths are stabilized, thereby allowing continual cell replication by two known mechanisms: activation of telomerase and Alternative Lengthening of Telomeres (ALT). The connections between the two mechanisms are complicated and still poorly understood. *: Results In this research, we propose that two different approaches, G-Networks and Stochastic Automata Networks, which are stochastic models motivated by queueing theory, are useful to identify a set of genes that play an important role in the state of interest and to infer their previously unknown correlation by obtaining both stationary and joint transient distributions of the given system. Our analysis using G-Network detects five statistically significant genes (CEBPA, FOXM1, E2F1, c-MYC, hTERT) with either mechanism, contrasted to normal cells. A new algorithm is introduced to show how the correlation between two genes of interest varies in the transient state according not only to each mechanism but also to each cell condition. *: Conclusions This study expands our existing knowledge of genes associated with mechanisms of telomere maintenance and provides a platform to understand similarities and differences between telomerase and ALT in terms of the correlation between two genes in the system. This is particularly important because telomere dynamics plays a major role in many physiological and disease processes, including hematopoiesis.Entities:
Keywords: Correlation analysis; G-networks; Gene regulatory networks; Queueing network theory; Stochastic automata networks; Telomeres
Mesh:
Substances:
Year: 2020 PMID: 32900359 PMCID: PMC7488072 DOI: 10.1186/s12864-020-06937-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of the notation in Eq. 1
| Notation | Definition |
|---|---|
| Number of automata in the network | |
| Number of states, where the state space of each automaton is {0,1,⋯, | |
| Number of synchronizing events and an index of each event, respectively | |
| Local transition rate matrix (normalized) of the automaton | |
| Effect matrix corresponding to the | |
| Normalizing matrix needed for the tensor product of |
Fig. 1Four activities for gene regulation in a G-network model
Vocabulary and notation for application of G-Networks to Gene Regulatory Networks
| Biology | G-Networks | Notation |
|---|---|---|
| Gene | Node where customers are stored | |
| Signals/notch in the scale of the gene activity information [ | Positive/negative customers | |
| mRNA expression level | The number of positive customers | |
| Translation / protein bursting | Arrival rate of positive customers from the outside of the system | |
| Degradation | Arrival rate of negative customers from the outside of the system | |
| Activation / transcription | Transition probabilities of positive customers | |
| Repression | Transition probabilities of negative customers | |
| Signals not influencing the gene activity | Customers that exit the system | |
| Protein-protein interaction | Service rate |
The first and second columns contain biological and G-Networks terminologies of gene regulation, respectively. The third column includes the corresponding notations used in this study. The subscript i indicates the ith gene
Fig. 2The correlation of a pair of genes. The correlation of a pair of genes can be either positive (green solid line) or negative (red dashed line). The thicker line represents the stronger correlation of two connected genes, while the thinner line indicates the weaker correlation of them
Fig. 3A network with 5 automata (genes) and 7 synchronizing transitions
Values of the parameters needed to determine the infinitesimal generator (Q) using 4 normal cell lines, where
| Translation ( | Degradation ( | ||||
| CEBPA | 4 | 1.18489 | 3 | 0.333 | |
| E2F1 | 4 | 1.79118 | 3 | 0.333 | |
| FOXM1 | 4 | 2.78510 | 2 | 0.5 | |
| c-MYC | 5 | 6.70809 | 3 | 0.333 | |
| hTERT | 5 | 6.07682 | 1 | 1 | |
| Normal: Activation/transcription processes ( | |||||
| CEBPA | E2F1 | FOXM1 | c-MYC | hTERT | |
| CEBPA | 0 | 0 | 0 | 0 | 0 |
| E2F1 | 0 | 0 | 0 | 0.333 | 0 |
| FOXM1 | 0 | 0 | 0 | 0.5 | 0 |
| c-MYC | 0 | 0 | 0 | 0 | 0.333 |
| hTERT | 0 | 0 | 0 | 0 | 0 |
| Normal: Repression process ( | |||||
| CEBPA | E2F1 | FOXM1 | c-MYC | hTERT | |
| CEBPA | 0 | 0.333 | 0.333 | 0 | 0 |
| E2F1 | 0 | 0 | 0 | 0 | 0.333 |
| FOXM1 | 0 | 0 | 0 | 0 | 0 |
| c-MYC | 0.333 | 0 | 0 | 0 | 0 |
| hTERT | 0 | 0 | 0 | 0 | 0 |
The “service” (or firing) rate of gene i, denoted by μ, represents the protein–protein interactions, e.g., phosphorylation and ubiquitination. Gene i activates and inhibits gene j with probability and , respectively. Genes in the rows correspond to the “starting” genes, while those in columns to the ending genes
Values of the parameters that are needed to determine the infinitesimal generator (Q) using 18 ALT cell lines
| Translation ( | Degradation ( | ||||
| CEBPA | 4 | 1.18489 | 3 | 0.5433 | |
| E2F1 | 4 | 1.79118 | 3 | 0.1258 | |
| FOXM1 | 4 | 2.78510 | 2 | 0.2417 | |
| c-MYC | 5 | 6.70809 | 3 | 0.0280 | |
| hTERT | 5 | 6.07682 | 1 | 1 | |
| ALT: Activation/transcription processes ( | |||||
| CEBPA | E2F1 | FOXM1 | c-MYC | hTERT | |
| CEBPA | 0 | 0 | 0 | 0 | 0 |
| E2F1 | 0 | 0 | 0 | 0.5574 | 0 |
| FOXM1 | 0 | 0 | 0 | 0.7583 | 0 |
| c-MYC | 0 | 0 | 0 | 0 | 0.3855 |
| hTERT | 0 | 0 | 0 | 0 | 0 |
| ALT: Repression process ( | |||||
| CEBPA | E2F1 | FOXM1 | c-MYC | hTERT | |
| CEBPA | 0 | 0.2268 | 0.2230 | 0 | 0 |
| E2F1 | 0 | 0 | 0 | 0 | 0.3168 |
| FOXM1 | 0 | 0 | 0 | 0 | 0 |
| c-MYC | 0.5865 | 0 | 0 | 0 | 0 |
| hTERT | 0 | 0 | 0 | 0 | 0 |
The definitions and representations of and are the same as in Table 3
Fig. 4ECDF plots of hTERT under the normal condition (top) and ALT (malignant) condition (bottom). Orange and green straight and thick lines represent theoretical (logarithmically transformed) CDF of normal and ALT conditions, respectively. Purple, blue (thin) and red (thick) lines serve as the logarithmically transformed ECDF of hTERT after 10 steps, 100 steps and 500 steps, respectively
Fig. 9ECDF plots of E2F1 under the normal condition (top) and ALT (malignant) condition (bottom). Orange and green straight and thick lines represent theoretical (logarithmically transformed) CDF of normal and ALT conditions, respectively. Top: Orange, blue, purple, red (thick) lines serves as the logarithmically transformed ECDF of E2F1 after 10 steps, 100 steps, 300 steps and 500 steps, respectively. Bottom: Orange, blue, purple, red (thick) lines serves as the logarithmically transformed ECDF of E2F1 after 10 steps, 100 steps, 500 steps and 1000 steps, respectively
Fig. 5Trend of correlation between each pair of genes in the system over time in ALT (malignant) cells. The general relationship of two genes in the system is similar regardless of the mechanism by which telomeres maintain their sufficient lengths. However, within the same condition of cells, the magnitude of correlation and the speed to the steady state vary by interactions
Fig. 7Trend of correlation between a pair of genes in the system over time in (top) normal cells and (bottom) in malignant cells from 16 telomerase-positive cell lines
Fig. 8Trend of correlation between a pair of genes in the system over time in normal cells with the assumption that a pair of genes are initially negatively correlated
Mean mRNA expression levels (normalized and scaled) of genes in normal and ALT cells
| Normal | ALT | Mean diff. | ||
|---|---|---|---|---|
| CEBPA (Gene 1) | 4.724 | 2.617 | 2.107 | 0.003* |
| E2F1 (Gene 2) | 2.475 | 3.117 | -0.642 | 0.029* |
| FOXM1 (Gene 3) | 2.484 | 3.115 | -0.631 | 0.027* |
| c-MYC (Gene 4) | 1.957 | 3.232 | -1.275 | 0.000* |
| hTERT (Gene 5) | 2.662 | 3.075 | -0.414 | 0.128 |
P-values were obtained from the Student’s t–test based on the mean differences of the expression levels. An asterisk indicates statistical significance of p<0.05
Fig. 6A box plot displaying the five number summary of the five significant genes in ALT cells. Red triangle dots represent the mRNA expression levels of the corresponding gene in 4 normal cells
Possible movements of mRNA expression levels
| Case 1 | When |
| Case 2 | When |
| Case 3 | When |
| Case4 | When both |
Correlation coefficients at for each type of cells (normal, ALT, and telomerase-positive cells)
| Time | |||
|---|---|---|---|
| Normal | ALT | Telomerase | |
| Gene 1 and Gene 2 | 0.3321 | 0.3219 | 0.3414 |
| Gene 1 and Gene 3 | 0.3258 | 0.3140 | 0.3317 |
| Gene 1 and Gene 4 | 0.2190 | 0.2097 | 0.2046 |
| Gene 1 and Gene 5 | 0.2905 | 0.2271 | 0.2234 |
| Gene 2 and Gene 3 | 0.2943 | 0.3095 | 0.3055 |
| Gene 2 and Gene 4 | 0.2028 | 0.2047 | 0.1951 |
| Gene 2 and Gene 5 | 0.2398 | 0.2454 | 0.2260 |
| Gene 3 and Gene 4 | 0.1995 | 0.2007 | 0.1936 |
| Gene 3 and Gene 5 | 0.2254 | 0.2289 | 0.2188 |
| Gene 4 and Gene 5 | 0.1583 | 0.1519 | 0.1396 |
Mean mRNA expression levels (normalized and scaled) of genes in normal and telomerase-active (Telom.) cells
| Normal | Telom. | Mean diff. | ||
|---|---|---|---|---|
| CEBPA (Gene 1) | 4.643 | 2.589 | 2.054 | 0.003* |
| E2F1 (Gene 2) | 3.428 | 2.893 | 0.535 | 0.190 |
| FOXM1 (Gene 3) | 2.918 | 3.020 | -0.102 | 0.754 |
| c-MYC (Gene 4) | 1.842 | 3.290 | -1.448 | 0.000* |
| hTERT (Gene 5) | 1.874 | 3.282 | -1.408 | 0.000* |
P-values were obtained from the Student’s t–test based on the mean differences of the expression levels. An asterisk indicates statistical significance of p<0.05
Parameters of stationary distribution, q in Eq. 3, of normal and malignant (either ALT or telomerase) cells
| CEBPA | E2F1 | FOXM1 | c-MYC | hTERT | |
|---|---|---|---|---|---|
| Normal (for ALT) | 0.8253 | 0.7122 | 0.7130 | 0.6618 | 0.7269 |
| ALT | 0.7235 | 0.7571 | 0.7570 | 0.7637 | 0.7546 |
| Normal (for telomerase) | 0.8228 | 0.7741 | 0.7448 | 0.6481 | 0.6521 |
| Telomerase | 0.7478 | 0.7431 | 0.7513 | 0.7669 | 0.7429 |
Note that the APDA [18] differently estimates λ− (degradation) of normal cells depending on the cell type to be compared