| Literature DB >> 16981993 |
Angela Re1, Davide Cora, Alda Maria Puliti, Michele Caselle, Isabella Sbrana.
Abstract
BACKGROUND: Common fragile sites (cfs) are specific regions in the human genome that are particularly prone to genomic instability under conditions of replicative stress. Several investigations support the view that common fragile sites play a role in carcinogenesis. We discuss a genome-wide approach based on graph theory and Gene Ontology vocabulary for the functional characterization of common fragile sites and for the identification of genes that contribute to tumour cell biology.Entities:
Mesh:
Year: 2006 PMID: 16981993 PMCID: PMC1601973 DOI: 10.1186/1471-2105-7-413
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Visualization of the network based on correlated expression patterns for fragile sites at α = 1%.
Clustered fragile sites as they are detected by means of the Hoshen-Kopelman algorithm at the three significance thresholds (see text).
| Threshold | Cluster | Nodes | Links | Fragile sites | |||||||||
| 1% | A1 | 18 | 40 | FRA1C | FRA1D | FRA1E | FRA1G | FRA2D | FRA2H | FRA3C | FRA5E | FRA6E | FRA7G |
| FRA9C | FRA11D | FRA13A | FRA18A | FRA20B | FRA22B | FRAXB | FRAXC | ||||||
| B1 | 6 | 5 | FRA3B | FRA7F | FRA7H | FRA14C | FRA16C | FRA16D | |||||
| C1 | 2 | 1 | 14q21 | 18p11 | |||||||||
| 5% | A5 | 21 | 57 | FRA1C | FRA1D | FRA1E | FRA1G | FRA2D | FRA2H | FRA3C | FRA5D | FRA5E | FRA6E |
| 7p14 | FRA7G | FRA9C | FRA10F | FRA11D | FRA13A | FRA18A | FRA20B | FRA22B | FRAXB | ||||
| FRAXC | |||||||||||||
| B5 | 6 | 5 | FRA3B | FRA7F | FRA7H | FRA14C | FRA16C | FRA16D | |||||
| C5 | 2 | 1 | 14q21 | 18p11 | |||||||||
| 10% | ABC10 | 31 | 83 | FRA1C | FRA1D | FRA1E | FRA1G | 1q43 | FRA2D | FRA2H | FRA3B | FRA3C | FRA5D |
| FRA5E | FRA6E | 7p14 | FRA7B | FRA7F | FRA7G | FRA7H | FRA9C | FRA10F | FRA11D | ||||
| FRA13A | FRA14C | 14q21 | FRA16C | FRA16D | 18p11 | FRA18A | FRA20B | FRA22B | FRAXB | ||||
| FRAXC | |||||||||||||
| D10 | 2 | 1 | FRA5C | 7p15 | |||||||||
Capital letters label connected components at each significance level α that is reported in the left-most column. Labels including multiple letters reflect connected components merging. Numbers in labels indicate the significance level at which connected components are detected. The third and fourth columns include respectively the number of fragile sites and of internal links for each connected component. Fragile sites are reported in the right-most column.
Figure 2Visualization of the network based on correlated expression patterns for fragile sites at α = 5%.
Figure 3Visualization of the network based on correlated expression patterns for fragile sites at α = 10%.
Deviations of fragile sites' degrees in the correlation-based network from the Erdos-Renyi random model.
| α | Degree | Erdos-Renyi p-value | Fragile sites | |||||
| 1% | 11 | 8.50e-10 | FRA1C | FRA22B | ||||
| 9 | 1.50e-07 | FRA20B | ||||||
| 8 | 1.71e-06 | FRA2H | ||||||
| 6 | 1.53e-04 | FRA7G | ||||||
| 5 | 0.001 | FRA11D | FRA13A | |||||
| 4 | 0.007 | FRA1G | ||||||
| 3 | 0.037 | FRA1E | FRA3C | FRA5E | FRA18A | FRAXB | ||
| 2 | 0.142 | FRA1D | FRA3B | FRA7H | FRA14C | FRA16C | ||
| 5% | 13 | 1.48e-10 | FRA22B | |||||
| 11 | 1.98e-08 | FRA1C | FRA2H | |||||
| 10 | 2.02e-07 | FRA20B | ||||||
| 9 | 1.87e-06 | FRA13A | ||||||
| 8 | 1.56e-05 | FRA7G | FRA11D | |||||
| 7 | 1.15e-04 | FRA18A | ||||||
| 6 | 7.48e-04 | FRA3C | ||||||
| 5 | 0.004 | FRA1G | ||||||
| 4 | 0.019 | FRA1E | FRA5E | FRA6E | FRAXB | |||
| 3 | 0.071 | FRA1D | ||||||
| 2 | 0.198 | FRA3B | FRA5D | FRA7H | FRA14C | FRA16C | ||
| 10% | 14 | 4.89e-10 | FRA1C | FRA22B | ||||
| 12 | 4.23e-08 | FRA20B | ||||||
| 11 | 3.50e-07 | FRA2H | FRA18A | |||||
| 10 | 2.66e-06 | FRA11D | ||||||
| 9 | 1.83e-05 | FRA1G | FRA7G | FRA13A | ||||
| 7 | 6.27e-04 | FRA3C | ||||||
| 6 | 0.003 | FRA1E | FRA5E | |||||
| 5 | 0.013 | FRA6E | FRA14C | FRAXB | ||||
| 4 | 0.043 | FRA1D | FRAXC | |||||
| 3 | 0.119 | FRA3B | FRA16C | |||||
| 2 | 0.247 | FRA5D | 7p14 | FRA7B | FRA7F | FRA7H | 14q21 | |
| FRA16D | ||||||||
Fragile sites with degree higher than 2 are reported at each significance level α. Probabilities of fragile sites having observed degrees in the random model are reported as well.
Fragile site's betweenness and degree measures at each significance level α.
| Fragile site | Betweenness | Degree | Betweenness | Degree | Betweenness | Degree |
| α = 1% | α = 5% | α = 10% | ||||
| FRA1C | 0.121 | 11 | 0.055 | 11 | 0.170 | 14 |
| FRA22B | 0.103 | 11 | 0.104 | 13 | 0.076 | 14 |
| FRA20B | 0.078 | 9 | 0.030 | 10 | 0.060 | 12 |
| FRA2H | 0.086 | 8 | 0.089 | 11 | 0.082 | 11 |
| FRA7G | 0.058 | 6 | 0.077 | 8 | 0.038 | 9 |
| FRA11D | 0.004 | 5 | 0.011 | 8 | 0.080 | 10 |
| FRA13A | 0.004 | 5 | 0.025 | 9 | 0.028 | 9 |
| FRA1G | 0.004 | 4 | 0.011 | 5 | 0.135 | 9 |
| FRA1E | 0.000 | 3 | 0.000 | 4 | 0.103 | 6 |
| FRA3C | 0.002 | 3 | 0.052 | 6 | 0.028 | 7 |
| FRA5E | 0.000 | 3 | 0.000 | 4 | 0.115 | 6 |
| FRA18A | 0.053 | 3 | 0.114 | 7 | 0.170 | 11 |
| FRAXB | 0.005 | 3 | 0.004 | 4 | 0.005 | 5 |
| FRA1D | 0.000 | 2 | 0.001 | 3 | 0.003 | 4 |
| FRA3B | 0.013 | 2 | 0.011 | 2 | 0.017 | 3 |
| FRA7H | 0.013 | 2 | 0.011 | 2 | 0.000 | 2 |
| FRA14C | 0.020 | 2 | 0.016 | 2 | 0.120 | 5 |
| FRA16C | 0.020 | 2 | 0.016 | 2 | 0.088 | 3 |
| FRA6E | _ | _ | 0.000 | 4 | 0.003 | 5 |
| FRA5D | _ | _ | 0.004 | 2 | 0.001 | 2 |
| FRAXC | _ | _ | _ | _ | 0.000 | 4 |
| 7p14 | _ | _ | _ | _ | 0.001 | 2 |
| FRA7B | _ | _ | _ | _ | 0.064 | 2 |
| FRA7F | _ | _ | _ | _ | 0.000 | 2 |
| 14q21 | _ | _ | _ | _ | 0.058 | 2 |
| FRA16D | _ | _ | _ | _ | 0.043 | 2 |
Comparison between clustering coefficients for the whole real network and for the Erdos-Renyi model at each significance level α.
| α | C | Erdos-Renyi estimate |
| 1% | 0.349 | 0.007 |
| 5% | 0.327 | 0.009 |
| 10% | 0.380 | 0.013 |
Comparison between community organization at α = 10% and connected component organization at α = 5%.
| A5 | FRA1C | FRA1D | FRA1E | FRA1G | FRA2D | FRA2H | FRA3C | FRA5D | FRA5E | FRA6E | 7p14 | FRA7G | FRA9C |
| FRA10F | FRA11D | FRA13A | FRA18A | FRA20B | FRA22B | FRAXB | FRAXC | ||||||
| B5 | FRA3B | FRA7F | FRA7H | FRA14C | FRA16C | FRA16D | |||||||
| C5 | 14q21 | 18p11 | |||||||||||
| com. no 1 | FRA1C | FRA1D | FRA1E | FRA1G | 1q43 | FRA2D | FRA2H | FRA3C | FRA5D | FRA5E | FRA6E | 7p14 | FRA7B |
| FRA7G | FRA9C | FRA10F | FRA11D | FRA13A | FRA18A | FRA20B | FRA22B | FRAXB | FRAXC | ||||
| com. no 2 | FRA3B | FRA7F | FRA7H | FRA14C | FRA16C | FRA16D | |||||||
Communities identified at α = 10% appear as connected components at more stringent significance levels. The two communities at α = 10% coincide respectively with A5 and B5 at α = 5% except for C5 which is enclosed in the larger community. C5's fragile sites are highlighted by italic font typing.
Functional characterization of fragile sites' connected components.
| GO id | GO term description | Cluster | -Log10 (p value) | ||
| α = 1% | α = 5% | α = 10% | |||
| GO:0005126 | hematopoietin/interferon-class | A1, A5, ABC10 | 15.090 | 15.408 | 11.807 |
| (D200-domain) cytokine receptor | A1, A5, ABC10 | ||||
| GO:0005132 | interferon-alpha/beta receptor binding | A1, A5, ABC10 | 12.394 | 11.902 | 9.704 |
| GO:0009615 | response to virus | A1, A5, ABC10 | 11.857 | 10.889 | 6.811 |
| GO:0005125 | cytokine activity | A1, A5, ABC10 | 6.252 | 5.843 | 4.919 |
| GO:0005615 | extracellular space | A1, A5 | 5.390 | 4.627 | < threshold |
| GO:0042272 | nuclear RNA export factor complex | A1, A5 | 4.713 | 4.538 | < threshold |
| GO:0005515 | protein binding | A1 | 4.698 | < threshold | < threshold |
| GO:0008320 | protein carrier activity | ABC10 | < threshold | < threshold | 4.645 |
| GO:0004091 | carboxylesterase activity | B1, B5 | 7.133 | 7.133 | _ |
| GO:0004759 | serine esterase activity | B1, B5 | 7.133 | 7.133 | _ |
| GO:0006805 | xenobiotic metabolism | B1, B5 | 4.038 | 4.038 | _ |
| GO:0003700 | transcription factor activity | D10 | _ | _ | 5.853 |
| GO:0030528 | transcription regulator activity | D10 | _ | _ | 4.758 |
| GO:0003676 | nucleic acid binding | D10 | _ | _ | 4.149 |
Overrepresented GO terms are reported along with their Gene Ontology identifiers, full descriptions, enriched connected component(s) for those GO terms and negative logarithm of their hypergeometric p-values.
KEGG pathways for genes annotated to overrepresented fragile sites.
| MAPK signaling pathway | Notch signaling pathway | Wnt signaling pathway | Calcium signaling pathway | TGF-beta signaling pathway | ||||||||||
| PLA2G3 | MAP3K13 | JAG1 | DVL3 | SENP2 | TCF7 | DVL3 | PDE1A | SMAD5 | ||||||
| Jak-STAT signaling pathway: | ||||||||||||||
| IFNA1 | IFNA2 | IFNA4 | IFNA5 | IFNA6 | IFNA8 | IFNA10 | IFNA14 | IFNA17 | IFNA21 | IFNB1 | IFNK | IFNW1 | LIF | OSM |
| EPO | L12RB2 | |||||||||||||
| ECM-receptor interaction | Cell adhesion molecules (CAMs) | Neuroactive ligand-receptor interaction | ||||||||||||
| LAMC2 | LAMC1 | ITGAV | ITGAV | VCAM1 | NEGR1 | SST | ||||||||
| Cytokine-cytokine receptor interaction: | ||||||||||||||
| IFNA1 | IFNA2 | IFNA4 | IFNA5 | IFNA6 | IFNA8 | IFNA10 | IFNA14 | IFNA17 | IFNA21 | IFNB1 | IFNK | IFNW1 | LIF | OSM |
| EPO | TNFSF4 | TNFSF18 | L12RB2 | MET | ||||||||||
| Natural killer cell-mediated cytotoxicity | Leukocyte transendothelial migration | Complement-coagulation cascades | ||||||||||||
| IFNA1 | IFNA2 | IFNA4 | IFNA5 | IFNA6 | IFNA8 | NCF2 | VCAM1 | PLG SERPINC1 SERPINE1 KNG1 | ||||||
| Toll-like receptor signaling pathway | T/B cell receptor signaling pathway | |||||||||||||
| IFNA1 | IFNA2 | IFNA4 | IFNA5 | IFNA6 | IFNA8 | IFNA10 | IFNA14 | IFNA17 | IFNA21 | IFNB1 | BCL10 | |||
| Focal adhesion | Adherens junction | Tight junctin | Regulation of actin cytoskeleton | |||||||||||
| TNR | LAMC2 | MET | LAMC1 | ITGAV | MET | INADL | ITGAV | NCKAP1 | LIMK2 | |||||
| cell cycle | Apoptosis | |||||||||||||
| CDC7 | CDKN2A | FASLG | ||||||||||||
| SNARE interactions in vesicular transport | Basal transcription factors | Ribosome | RNA polymerase | Proteasome | ||||||||||
| SNAP25 | STX6 | TAF1L | GTF2B | RPL5 | POLR2H | NBEA | ||||||||
| Glycerophospholipid metabolism | Purine | Pyramidine | by cytochrome P450 | Glycosphingolipid metabolism | ||||||||||
| DBT | PLA2G3 | ACHE | PDE1A | POLR2H | POLR2H | CYP3A43 | SMPD3 | |||||||
GO annotated genes are associated with their specific KEGG pathways. KEGG pathways are assembled based on the pathways' map that the KEGG database itself provides.
Candidate fragile genes in tumours.
| Hugo ids | Gene description | Fra. site | Involvement in cancer | |
| Notes | References | |||
| GLMN | glomulin, FKBP associated protein | FRA1D | familial glomangioma | GCd |
| CLCA3 | chloride channel, calcium activated | FRA1D | strictly linked to CLCA2, tumor suppressor gene for breast cancer | Oncogene 2004 (19): 1471-80 |
| EVI-5 | EVI-5 homolog | FRA1D | Cell 2006 Jan 27; 124(2): 367-80 | |
| TNFSF4 | TNF(ligand) superfamily, member 4 | FRA1G | leukemia T-cell, neoplasms | AGCOH, GCd |
| TNFSF18 | TNF(ligand) superfamily, member 18 | FRA1G | neoplasms | AGCOH |
| TNR | tenascin R | FRA1G | neoplasms | AGCOH |
| ANGPTL1 | Angiopoietin-related protein 1 precursor | FRA1G | neoplasms | Cancer Cell 2004 Nov; 6(5):507-16 |
| ITGAV | CD51 antigen | FRA2H | neoplasm metastasis, colon, breast and pancreatic carcinoma | Cancer Metastasis Rev 2003 Mar;22(1):103-15 |
| FAM3D | family with sequence similarity 3, member D | FRA3B | located 900 kb, with no intervening genes, distal from FHIT gene | |
| THPO | thrombopoietin | FRA3C | myeloproliferative disorders | AGCOH, GCd |
| ADIPOQ | adiponectin, C1Q and collagen domain containing | FRA3C | myeloblastic leukemia | Br J Cancer 2006 Jan 16; 94(1):156-60 |
| AHSG | alpha-2-HS-glycoprotein | FRA3C | liver carcinoma | GCd |
| BCL6 | B-cell CLL/lymphoma 6 | FRA3C | lymphoma b-cell, lymphoma non-hodgkins, burkitt lymphoma | Oncogene 2006 Jul 3 |
| PLG | plasminogen precursor | FRA6E | neoplasm metastasis, neoplasms vascular tissue | J Cell Biochem 2005 Oct; 96(2):242-61 |
| HOXA5 | homeo box A5 | 7p15.2 | breast carcinoma | J Biol Chem 2000 Aug ; 275(34):26551-5 |
| HOXA7,9 | homeo box A7, A9 | 7p15.2 | myeloid leukemia | Leuk Res 2000 Oct; 24(10): 849-55 |
| EPO | erythropoietin | FRA7F | myelodyplastic syndromes | AGCOH, GCd |
| AZGP1 | alpha-2-glycoprotein 1, zinc | FRA7F | cancer of breast, prostate (GCd) | AGCOH |
| PBEF1 | pre-B-cell colony enhancing factor 1 | FRA7F | myeloid leukemia | Leukemia 2005 Jun; 19(6): 998-1004 |
| CAV-1,-2 | caveolin 1, caveolin 2 | FRA7G | carcinoma of colon, adenocarcinoma, ovarian epithelial carcinoma | Genomics 2003 Feb; 81(2):105-7 |
| MET | HGF receptor | FRA7G | gastric carcinoma, neoplasm metastasis, carcinoma renal cell | Oncogene 2006 Jan 19; 25(3):409-18 |
| INFB1 | interferon beta 1 | FRA9C | neoplasms, melanoma | AGCOH, GCd |
| IFNE1 | interferon epsilon 1 | FRA9C | located 350 kb distal from MTAP, CDKN2A and CGKN2B tumor genes | |
| IFNA1 | interpheron alpha 1 | FRA9C | leukemia | AGCOH, GCd |
| IFNA2 | interpheron alpha 2 | FRA9C | myeloid leukemia, renal cell cancer | GCd, TGD |
| SMPD3 | sphingomyelin phosphodiesterase 3 | FRA16C | strictly linked to CDH3 (P-cadherin tumour gene) | |
| CKLF | chemokine-like factor | FRA16C | strictly linked to CDH5 (P-cadherin tumour gene) | |
| CES2 | Carboxylesterase 2 precursor | FRA16C | malignant neoplasms, carcinoma of colon | Neoplasia 2005 Apr;7(4): 407-16 |
| NQO1 | NAD(P)H dehydrogenase, quinone 1 | FRA16C | carcinoma of colon, malignant neoplasm of lung, cancer of bladder | Clin Cancer Res 2005 Dec; 11(24):8866-71 |
| LIF | leukemia inhibitory factor | FRA22B | leukemia, myeloma, neuroepitelioma, stricltly linked to HORMAD2 | AGCOH, TGD |
| OSM | oncostatin M | FRA22B | leukemia, myeloma, kaposi sarcoma, melanoma, plasmacitoma | AGCOH, TGD |
| TCN2 | transcobalamin II, macrocytic anemia | FRA22B | colon adenocarcinoma | GCd |
| SEC14L2 | SEC14-like 2 | FRA22B | neoplasms | Cancer Res 2005 Nov 1; 65(21):9807-16 |
We report genes localized at the observed fragile sites which are assessed to play a role in tumour development in public databases or PubMed articles. Abbreviations for cited databases: AGCOH, Atlas of Genetics and Cytogenetics in Oncology and Haematology; TGD, Tumor Gene Database; GCd, Genecards database.
Figure 4The theoretical t-Student distribution is superimposed on the histogram of t coefficients that we calculate from fragile site expression data.