| Literature DB >> 21714896 |
Jiang Li1, Binsheng Gong, Xi Chen, Tao Liu, Chao Wu, Fan Zhang, Chunquan Li, Xiang Li, Shaoqi Rao, Xia Li.
Abstract
BACKGROUND: The construction of the Disease Ontology (DO) has helped promote the investigation of diseases and disease risk factors. DO enables researchers to analyse disease similarity by adopting semantic similarity measures, and has expanded our understanding of the relationships between different diseases and to classify them. Simultaneously, similarities between genes can also be analysed by their associations with similar diseases. As a result, disease heterogeneity is better understood and insights into the molecular pathogenesis of similar diseases have been gained. However, bioinformatics tools that provide easy and straight forward ways to use DO to study disease and gene similarity simultaneously are required.Entities:
Mesh:
Year: 2011 PMID: 21714896 PMCID: PMC3150296 DOI: 10.1186/1471-2105-12-266
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Figure 1Example of a sub-DO DAG. Example of a sub-DO DAG starting with leaves of DOID:114 (heart disease) and DOID:1086 (congenital chromosomal disease).
Figure 2Distribution of the Pearson correlation coefficient of gene similarity scores between parameter combinations.
Figure 3Hierarchical clustering of 128 cancer terms. The distance between two diseases is defined to be 1- the Wang's similarity of the two diseases. The tree was constructed using the average method of hierarchical clustering. The red line corresponds to a p-value of 0.01. Disease correlations below this line are considered significant. The different colours represent the various categories of significant disease correlations.
Figure 4The network of all the 128 cancer terms. The colours correspond to the significant disease correlation categories in Figure 3. The nodes coloured in grey are not grouped in Figure 3. The thickness of the edges between two diseases represents the strength of their correlation.
Figure 5Hierarchical clustering result of the obesity related genes. The grey bar indicates the genes that could not be grouped into a certain module.
Gene modules of the obesity related genes
| Cluster | Size | Average similarity | ||||
|---|---|---|---|---|---|---|
| M1 | 92 | 0.43 | <1.0E-05 | <1.0E-04 | cholesterol homeostasis; high-density lipoprotein particle remodelling; triglyceride catabolic process | Insulin signaling pathway; Type II diabetes mellitus |
| M2 | 60 | 0.30 | 0.25 | 0.28 | N/A$ | Pyruvate metabolism; Galactose metabolism; |
| M3 | 55 | 0.30 | 0.29 | 0.29 | feeding behavior; photoreceptor cell maintenance | Neuroactive ligand-receptor interaction; Circadian rhythm - mammal; |
| M4 | 31 | 0.50 | <1.0E-05 | <1.0E-04 | response to estrogen stimulus; response to cytokine stimulus; cell aging | Pathways in cancer; Colorectal cancer; Endometrial cancer; |
| M5 | 30 | 0.62 | <1.0E-05 | <1.0E-04 | response to lipopolysaccharide; response to glucocorticoid stimulus | Cytokine-cytokine receptor interaction; Toll-like receptor signaling pathway; |
| M6 | 23 | 0.55 | <1.0E-05 | <1.0E-04 | positive regulation of phosphoinositide 3-kinase cascade; positive regulation of cholesterol esterification | Renin-angiotensin system; Prostate cancer |
| M7 | 15 | 0.34 | 0.12 | 0.16 | N/A | Insulin signaling pathway |
| M8 | 15 | 0.43 | 6.0E-04 | 6.0E-03 | blood coagulation; STAT protein nuclear translocation | Complement and coagulation cascades; Regulation of actin cytoskeleton |
| M9 | 15 | 0.53 | <1.0E-05 | <1.0E-04 | response to interleukin-1; response to glucocorticoid stimulus | Hematopoietic cell lineage; Cytokine-cytokine receptor interaction |
| M10 | 12 | 0.40 | 1.5E-02 | 2.2E-02 | N/A | N/A |
# the original p-value calculated by permutation
* FDR using Benjamini and Hochberg multiple testing correlations
§ Refer to Additional file 3 for complete GO and KEGG annotations.
$ N/A indicates that there are no enriched GO or KEGG annotation for this module.