| Literature DB >> 24160471 |
Tzu-Hung Hsiao, Hung-I Harry Chen, Stephanie Roessler, Xin Wei Wang, Yidong Chen1.
Abstract
Copy number alterations (CNAs) can be observed in most of cancer patients. Several oncogenes and tumor suppressor genes with CNAs have been identified in different kinds of tumor. However, the systematic survey of CNA-affected functions is still lack. By employing systems biology approaches, instead of examining individual genes, we directly identified the functional hotspots on human genome. A total of 838 hotspots on human genome with 540 enriched Gene Ontology functions were identified. Seventy-six aCGH array data of hepatocellular carcinoma (HCC) tumors were employed in this study. A total of 150 regions which putatively affected by CNAs and the encoded functions were identified. Our results indicate that two immune related hotspots had copy number alterations in most of patients. In addition, our data implied that these immune-related regions might be involved in HCC oncogenesis. Also, we identified 39 hotspots of which copy number status were associated with patient survival. Our data implied that copy number alterations of the regions may contribute in the dysregulation of the encoded functions. These results further demonstrated that our method enables researchers to survey biological functions of CNAs and to construct regulation hypothesis at pathway and functional levels.Entities:
Year: 2013 PMID: 24160471 PMCID: PMC3833309 DOI: 10.1186/1687-4153-2013-14
Source DB: PubMed Journal: EURASIP J Bioinform Syst Biol ISSN: 1687-4145
Figure 1Schematic diagram illustrating the enrichment analysis of spatial functional hotspots in human genome. (A) The indicator matrix B was generated by sliding the detection window along the genome. It contains the information of genes located in each segment. The matrix P records the gene sets of Gene Ontology. (B) By comparing the two matrices assessed with Fisher's exact test, the p values of gene sets in each segment were generated. The enriched functions of each segment were then identified if passed the selection criteria. (C) The nearby segments with the same enriched function were merged to a detection window. The enrichment analysis between the function and the subset of genes in the windows were performed. The subset was constructed by excluding gene by gene along the left side or right side of the genome coordinate. The position with the smallest p value of left side and right side excluding subset was defined as the boundaries of the optimal region of the functional hotspot.
Figure 2The summary of the spatial functional hotspots. (A) The histogram of the length of the SFHs. (B) The histogram of the gene numbers in the SFHs. (C) The SFH number in each chromosome. (D) The SFH density (numbers of SFHs per million base pair) in each chromosome. (E) The most significant p value and (F) the number of SFHs in the position of human genome. In (E) and (F), the chromosomes were separated by red broken lines.
Figure 3The top 20 enriched function in human genome. The top 20 significant SFHs, which contain 18 functions, were showed in the figure. The function of 'intermediate filament’ was enriched at both chromosomes 12 and 17. The 'response to stimulus’ function was enriched at chromosomes 7 and 14.
Top ten gain/loss spatial functional hotspots (SFHs) of HCC with enriched functions
| Gain | | | | | | |
| 1 | 153.33 | 153.59 | 59 | bp | Response to lipopolysaccharide | 4 |
| 1 | 153.27 | 153.43 | 59 | bp | 4 | |
| 1 | 153.33 | 153.36 | 59 | bp | Response to zinc ion | 2 |
| 1 | 152.27 | 153.43 | 59 | bp | Epidermis development | 13 |
| 1 | 152.27 | 153.23 | 59 | mf | Structural molecule activity | 8 |
| 1 | 152.88 | 153.43 | 58 | bp | Keratinocyte differentiation | 12 |
| 1 | 152.88 | 153.23 | 58 | bp | Peptide cross-linking | 6 |
| 1 | 152.88 | 153.23 | 58 | mf | Protein binding, bridging | 4 |
| 1 | 153.50 | 153.60 | 57 | mf | Protein homodimerization activity | 5 |
| 1 | 153.50 | 153.60 | 57 | cc | Perinuclear region of cytoplasm | 5 |
| Loss | | | | | | |
| 8 | 6.84 | 6.88 | 42 | bp | 3 | |
| 4 | 190.39 | 191.01 | 40 | mf | Sequence-specific DNA binding | 6 |
| 4 | 190.39 | 191.01 | 40 | mf | Sequence-specific DNA binding transcription Factor activity | 6 |
| 8 | 6.35 | 6.91 | 40 | cc | Extracellular space | 7 |
| 4 | 90.80 | 91.76 | 40 | cc | Platelet alpha granule lumen | 2 |
| 4 | 90.80 | 91.76 | 40 | bp | Platelet degranulation | 2 |
| 8 | 26.61 | 27.47 | 39 | bp | Response to stress | 3 |
| 17 | 10.35 | 10.56 | 38 | bp | Actin filament-based movement | 2 |
| 8 | 26.37 | 27.31 | 37 | bp | Response to cocaine | 2 |
| 8 | 22.88 | 23.08 | 37 | mf | Caspase activator activity | 2 |
Immune-related functions are in italics.
The SFHs of which copy number status were associated with patient survival
| Gain | | | | | | |
| 19 | 41.38 | 41.63 | 2.6E-04 | mf | Oxygen binding | 2 |
| 19 | 43.23 | 44.29 | 0.001 | bp | Defense response | 4 |
| 19 | 54.72 | 55.11 | 0.002 | bp | Defense response | 6 |
| 19 | 54.72 | 55.11 | 0.002 | bp | Cell surface receptor linked Signaling pathway | 6 |
| 19 | 40.09 | 40.23 | 0.003 | mf | Lysophospholipase activity | 2 |
| 19 | 40.09 | 40.23 | 0.003 | mf | Carboxylesterase activity | 2 |
| 19 | 39.41 | 39.52 | 0.004 | cc | SCF ubiquitin ligase complex | 2 |
| 19 | 39.41 | 39.52 | 0.004 | mf | Glycoprotein binding | 2 |
| 19 | 51.63 | 52.15 | 0.007 | mf | Sugar binding | 9 |
| 19 | 54.78 | 55.38 | 0.007 | bp | Regulation of immune response | 8 |
| 19 | 51.63 | 52.27 | 0.007 | bp | Cell adhesion | 11 |
| 19 | 42.18 | 44.32 | 0.007 | cc | Anchored to membrane | 9 |
| 19 | 54.72 | 55.55 | 0.007 | mf | Transmembrane receptor activity | 8 |
| 19 | 54.80 | 55.30 | 0.007 | mf | Antigen binding | 5 |
| 19 | 54.78 | 55.42 | 0.007 | bp | Cellular defense response | 5 |
| 19 | 54.72 | 55.55 | 0.007 | cc | Integral to plasma membrane | 12 |
| 19 | 58.55 | 59.08 | 0.007 | bp | Viral reproduction | 7 |
| 19 | 54.78 | 55.40 | 0.007 | bp | Immune response | 6 |
| 19 | 50.86 | 51.59 | 0.008 | mf | Peptidase activity | 17 |
| 19 | 45.41 | 45.45 | 0.008 | mf | Lipid transporter activity | 2 |
| Loss | | | | | | |
| 8 | 11.83 | 12.18 | 2.4E-04 | bp | Defense response to bacterium | 4 |
| 4 | 55.10 | 55.99 | 0.001 | bp | Vascular endothelial growth Factor receptor signaling pathway | 2 |
| 4 | 55.10 | 55.99 | 0.001 | mf | Growth factor binding | 2 |
| 8 | 22.88 | 23.08 | 0.001 | mf | Caspase activator activity | 2 |
| 8 | 22.30 | 23.02 | 0.002 | bp | Apoptosis | 5 |
| 4 | 74.61 | 74.97 | 0.003 | bp | Inflammatory response | 5 |
| 8 | 26.37 | 27.47 | 0.004 | cc | Growth cone | 3 |
| 8 | 26.37 | 27.32 | 0.004 | bp | Response to cocaine | 2 |
| 8 | 22.01 | 23.08 | 0.005 | bp | Cellular response to mechanical stimulus | 3 |
| 4 | 68.69 | 69.36 | 0.006 | mf | Serine-type endopeptidase activity | 5 |
| 4 | 68.69 | 69.36 | 0.006 | bp | Proteolysis | 5 |
| 4 | 68.69 | 69.36 | 0.006 | mf | Peptidase activity | 5 |
| 4 | 70.86 | 71.40 | 0.007 | bp | Biomineral tissue development | 5 |
| 4 | 71.06 | 71.47 | 0.007 | bp | Odontogenesis of dentine-containing tooth | 3 |
| 4 | 74.26 | 74.85 | 0.007 | cc | Platelet alpha granule lumen | 3 |
| 4 | 74.26 | 74.85 | 0.007 | bp | Platelet degranulation | 3 |
| 4 | 76.92 | 76.94 | 0.008 | bp | Defense response to virus | 2 |
| 4 | 74.70 | 75.32 | 0.008 | bp | Cell-cell signaling | 4 |
| 8 | 38.13 | 38.33 | 0.008 | bp | Cell growth | 2 |
Figure 4Patient survival correlated to copy number status of tumor. (A) Kaplan-Meier plots of patient disease-free survival were separated by the status of copy number in the SFH located at 41.1 to 41.9 Mbp at chromosome 19. The SFH has enrichment of 'oxygen binding’. Samples were assigned to two groups, copy number gain (red line) and neutral (blue line). The p values was statistically significant (<0.01). (B) Kaplan-Meier plots of the copy number status in SFH at 11.8 to 12.2 Mbp at chromosome 8. Samples were assigned to two groups, copy number loss (green line) and neutral (blue line). The SFH had enrichment of 'defense response to bacterium’. The p value of test was also significant.