| Literature DB >> 20454660 |
Octavio Martínez1, M Humberto Reyes-Valdés, Luis Herrera-Estrella.
Abstract
A central goal of cancer biology is to understand how cells from this family of genetic diseases undergo specific morphological and physiological changes and regress to a de-regulated state of the cell cycle. The fact that tumors are unable to perform most of the specific functions of the original tissue led us to hypothesize that the degree of specialization of the transcriptome of cancerous tissues must be less than their normal counterparts. With the aid of information theory tools, we analyzed four datasets derived from transcriptomes of normal and tumor tissues to quantitatively test the hypothesis that cancer reduces transcriptome specialization. Here, we show that the transcriptional specialization of a tumor is significantly less than the corresponding normal tissue and comparable with the specialization of dedifferentiated embryonic stem cells. Furthermore, we demonstrate that the drop in specialization in cancerous tissues is largely due to a decrease in expression of genes that are highly specific to the normal organ. This approach gives us a better understanding of carcinogenesis and offers new tools for the identification of genes that are highly influential in cancer progression.Entities:
Mesh:
Year: 2010 PMID: 20454660 PMCID: PMC2862708 DOI: 10.1371/journal.pone.0010398
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Scatter plot of H (Diversity) and δ (Specialization) in transcriptomes of normal tissues (blue), cancerous tissues (red), and stem cells (black).
Comparable data sets are linked by a discontinuous line. - Human data from 53 libraries of 13 distinct tissues with a total of 671,197 tags for 28,087 genes; grouped analyses. - Mouse data from 29 libraries of 5 distinct tissues and with a total of 541,453 expressed tags for 25,044 distinct genes; grouped analyses. Data for and are from the “Cancer Genome Anatomy Project” (http://cgap.nci.nih.gov/). Approximate 95% confidence intervals for diversity and specialization are plotted as continuous colored lines. See Supporting Text S1 as well as Figure S1, Figure S2, Figure S3, Figure S4, Figure S5 and Figure S6 that illustrate individual comparisons and details.
Figure 2Scatter plot of H (Diversity) and δ (Specialization) in transcriptomes of normal (blue) and tumor (red) tissues in dataset C.
Human expression data are from the “Human Transcriptome Map” project (http://bioinfo.amc.uva.nl/HTMseq/controller), datasets “All tissues normal” and “All tissues tumor”. Data consist of 18,609,073 tags for a total of 62,916 loci by chromosome. See Figure S7, Figure S8, Figure S9 and Figure S10 that amplify the boxes of this figure presenting the 95% confidence intervals for the estimates.
Genes exclusively detected by the fully grouped analysis in cancer tissues in dataset A (human data) at high expression frequency (>0.0001).
| Gene Symbol | Gene Description | Frequency | Tissues | P-value |
|
| Silver homolog (mouse) | 0.00109391 | 3 | <2.2e–16 |
|
| Dehydrogenase/reductase (SDR family) member 2 | 0.00024536 | 4 | 5.838e–13 |
|
| SRY (sex determining region Y)-box 10 | 0.00023105 | 2 | 3.732e–12 |
|
| TNF receptor-associated factor 7 | 0.00022083 | 10 | 9.017e–12 |
|
| Chromosome 10 open reading frame 2 (Twinkle) | 0.00020038 | 8 | 1.467e–10 |
|
| Phosphoribosyl pyrophosphate synthetase 1 | 0.00016971 | 10 | 5.846e–09 |
|
| Melan-A | 0.00015949 | 1 | 1.311e–08 |
|
| Aryl hydrocarbon receptor interacting protein-like 1 | 0.00015949 | 2 | 1.311e–08 |
|
| Melanoma antigen family A, 6 | 0.00015949 | 5 | 1.311e–08 |
|
| Kelch-like 21 (Drosophila) | 0.00014517 | 9 | 8.442e–08 |
|
| Guanine nucleotide binding protein (G protein), beta polypeptide 3 | 0.00014313 | 2 | 8.137e–08 |
|
| Kinesin family member C1 | 0.00014108 | 10 | 1.322e–07 |
|
| S100 calcium binding protein B | 0.00013904 | 2 | 1.287e–07 |
|
| Chromatin licensing and DNA replication factor 1 | 0.00013086 | 11 | 3.23e–07 |
|
| ZW10 interactor antisense | 0.00012677 | 10 | 8.912e–07 |
|
| XPA binding protein 2 | 0.00011655 | 9 | 1.997e–06 |
|
| Solute carrier family 45, member 2 | 0.00011041 | 2 | 5.299e–06 |
Frequency p – Relative average frequency of expression in cancer tissues. Tissues – Number of cancerous tissues where the gene was expressed in the 12 tissues studied in dataset . P-value for the Fisher's Exact Test for the frequency of expression of the gene in normal vs. cancer tissues; significant at an α≈0.05 experiment-wise confidence level by Bonferroni correction.
Examples of genes highly expressed in only one type of cancer in dataset A.
| Organ | Gen symbol | Gen description |
| P-value |
|
|
| Orthodenticle homeobox 2 | 0.00074 | 0.00131448 |
|
|
| Asialoglycoprotein receptor 2 | 0.00063 | 0.01024596 |
|
|
| T, brachyury homolog (mouse) | 0.00050 | 0.00692187 |
|
|
| Chromosome 4 open reading frame 7 | 0.00065 | 0.01260259 |
|
|
| Interleukin 9 receptor | 0.00052 | 0.001396 |
|
|
| DNA (cytosine-5-)-methyltransferase 3-like | 0.00084 | 4.0955e–07 |
|
|
| Melan-A | 0.00066 | 0.00964147 |
Frequency p – Relative average frequency of expression in cancer tissues. P-value for the Fisher's Exact Test for the frequency of expression of the gene in the normal vs. cancer tissues