| Literature DB >> 26779818 |
Fan Zhang1, Li Cui2, Michael D Kuo3.
Abstract
Understanding tumor diversity has been a long-lasting and challenging question for researchers in the field of cancer heterogeneity or tumor evolution. Studies have reported that compared to normal cells, there is a higher genetic diversity in tumor cells, while higher genetic diversity is associated with higher progression risks of tumor. We thus hypothesized that tumor diversity also holds true at the gene expression level. To test this hypothesis, we used t-test to compare the means of Simpson's diversity index for gene expression (SDIG) between tumor and non-tumor samples. We found that the mean SDIG in tumor tissues is significantly higher than that in the non-tumor or normal tissues (P<0.05) for most datasets. We also combined microarrays and next-generation sequencing data for validation. This cross-platform and cross-experimental validation greatly increased the reliability of our results. Production and hosting by Elsevier Ltd.Entities:
Keywords: Diversity; Gene expression; Hepatocellular carcinoma cell; Microarray; Next-generation sequencing
Mesh:
Year: 2016 PMID: 26779818 PMCID: PMC4747664 DOI: 10.1016/j.gpb.2015.07.003
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Main characteristics of datasets examined in the current study
| GSE5093 | Microarray | 40 | 20 | 20 | 1.24 | 38 (3–40) | Tumor samples and the corresponding non-cancerous adjacent hepatic tissues came from two HCC patient groups, | |
| GSE3500 | Microarray | 181 | 105 | 76 | 9.86 | 172 (3–180) | The dataset included expression data in more than 200 samples. 105 tumor samples from 82 HCC patients with primary HCC and 76 non-tumor samples from 72 controls were analyzed in this study for genes that were shared by all the 181 samples | |
| GSE4024 | Microarray | 196 | 98 | 98 | 9.83 | 192 (3–196) | RNA from 19 normal liver samples was pooled as reference for all microarray experiments. At least two hybridizations were carried out to obtain gene expression profile data for each of the 49 HCC tissues | |
| GSE1898 | Microarray | 364 | 182 | 182 | 8.31 | 360 (3–364) | RNA from 18 normal liver samples was pooled as reference for gene expression profiles from 91 human HCC tissues. Two hybridizations were performed for each of the 91 HCC tissues | |
| GSE65484 | aCGH | 28 | 14 | 14 | 0 | The dataset included 50 HCC patients and 14 paired adjacent tissues We only used the 14 pairs of HCC and normal tissues for our study | ||
| GSE65485 | NGS | 55 | 50 | 5 | 0 | Whole transcriptome sequencing profiling was performed for 50 HCC samples and 5 adjacent normal samples | ||
Note: aCGH, array-based comparative genomic hybridization; NGS, next-generation sequencing; MIM, missing in metastasis; MAM, metachronous adrenal metastasis.
Figure 1Gene expression diversity of HCC tumor and non-tumor samples in the six datasets examined
Boxplot of gene expression diversity for GSE5093 (A), GSE3500 (B), GSE4024 (C), GSE1898 (D), GSE65484 (E), and GSE65485 (F), respectively. The median expression diversity is indicated with horizontal line. The boxplot shows minimum, first quartile, median, third quartile, and maximum from bottom to top. The gene expression diversity was calculated using equations (3), (4), (5).
Gene expression diversity in HCC and normal samples of datasets examined in the current study
| GSE5093 | Paired | 100 | 343 | 86 | 0.9996 | 0.9997 | 3.756E−05∗ |
| GSE3500 | Non-paired | NA | 5150 | 65 | 0.9894 | 0.9918 | 0.0178∗ |
| GSE4024 | Paired | 61.2 | 5283 | 55 | 0.9996 | 0.9996 | 0.1034 |
| GSE1898 | Paired | 67.2 | 19,543 | 59 | 0.9995 | 0.9995 | 0.0032∗ |
| GSE65484 | Paired | 85.7 | 146 | 74.5 | 0.9420 | 0.9861 | 0.0495∗ |
| GSE65485 | Non-paired | NA | 238 | 95.2 | 0.9862 | 0.9927 | 0.0018∗ |
| Mean | 72 | ||||||
Note: GSE3500 and GSE65485 are not paired and PPS can’t be calculated. Q indicates the positive pair number of SDIG, and T indicates the proportion of positive pair number (PPPN) of SDIG relative to the total number of tumor and normal SDIG pairs. HCC, hepatocellular carcinoma; PPS, proportion of positive sample; NA, not available; SDIG, Simpson’s diversity index for gene expression. t-test was performed for statistical analysis and differences between HCC and normal samples are considered significant with P < 0.05 (∗).