| Literature DB >> 24371277 |
Qin Ma1, Jaxk H Reeves, David A Liberles, Lili Yu, Zheng Chang, Jing Zhao, Juan Cui, Ying Xu, Liang Liu.
Abstract
As biotechnology advances rapidly, a tremendous amount of cancer genetic data has become available, providing an unprecedented opportunity for understanding the genetic mechanisms of cancer. To understand the effects of duplications and deletions on cancer progression, two genomes (normal and tumor) were sequenced from each of five stomach cancer patients in different stages (I, II, III and IV). We developed a phylogenetic model for analyzing stomach cancer data. The model assumes that duplication and deletion occur in accordance with a continuous time Markov Chain along the branches of a phylogenetic tree attached with five extended branches leading to the tumor genomes. Moreover, coalescence times of the phylogenetic tree follow a coalescence process. The simulation study suggests that the maximum likelihood approach can accurately estimate parameters in the phylogenetic model. The phylogenetic model was applied to the stomach cancer data. We found that the expected number of changes (duplication and deletion) per gene for the tumor genomes is significantly higher than that for the normal genomes. The goodness-of-fit test suggests that the phylogenetic model with constant duplication and deletion rates can adequately fit the duplication data for the normal genomes. The analysis found nine duplicated genes that are significantly associated with stomach cancer.Entities:
Mesh:
Year: 2013 PMID: 24371277 PMCID: PMC3950708 DOI: 10.1093/nar/gkt1320
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The number of duplicated genes on the genomes of five stomach cancer patients
| Subject | Stage | No of duplicated genes in tumor genomes |
|---|---|---|
| S1 | I | 64 |
| S2 | II | 84 |
| S3 | II | 57 |
| S4 | III | 75 |
| S5 | IV | 72 |
Figure 1.The tree in the phylogenetic model. The normal genomes (N) at the tips of the tree are attached with five extended branches leading to the tumor genomes (T). Ni and Ti are the normal and tumor genomes of patient i. The tree (above normal genomes) represents the history of five normal genomes, while the extended branches represent the process leading to the five tumor genomes. The normal genomes are the ancestral genomes of the tumor genomes.
Figure 2.Simulation results. The square root of the mean square error for (a) estimating parameter m, and for (b) estimating parameter θ.
The maximum likelihood estimates of m and θ for normal and tumor genomes
| Normal | 0.0028 (0.0002) | 0.7134 (0.1024) |
| Tumor | 0.0037 (0.0003) | 1.4750 (0.1881) |
The values within parentheses are standard errors of the estimates.
The chi-square goodness-of-fit test for the phylogenetic model
| No of patients with duplication | Normal | Tumor | ||
|---|---|---|---|---|
| Observed counts | Expected counts | Observed counts | Expected counts | |
| 0 | 20 885 | 20 880 | 20 803 | 20 798 |
| 1 | 46 | 44.6 | 129 | 103.2 |
| 2 | 23 | 24.1 | 26 | 46.4 |
| 3 | 18 | 17.8 | 16 | 26.9 |
| 4 | 10 | 15.6 | 7 | 16.4 |
| 5 | 18 | 17.6 | 19 | 8.9 |
The estimates of relative duplication and deletion rates on the extended branches
| Duplication rate | Deletion rate | |
|---|---|---|
| T1 | 0.0013 | 0.2626 |
| T2 | 0.0024 | 0.2079 |
| T3 | 0.0007 | 0.2412 |
| T4 | 0.0009 | 0.1443 |
| T5 | 0.0017 | 0.2817 |
Identification of duplicated genes associated with cancer
| Number of duplications | Number of genes | Cumulative | Significance | |
|---|---|---|---|---|
| 0 | 20 853 | 1.0 | 1.0 | |
| 1 | 109 | 0.006 | >0.5 | |
| 2 | 7 | 1.4e-05 | 1e-04 | * |
| 3 | 1 | 1.7e-8 | 1.7e-08 | * |
| 4 | 0 | 1e-11 | 2.4e-15 | * |
| 5 | 1 | 2.4e-15 | 2.4e-15 | * |
The significant genes are indicated by *. The cumulative P-value was adjusted with Bonferroni correction for multiple comparisons.