| Literature DB >> 21347123 |
Ranga C Gudivada1, Yun Fu, Anil G Jegga, Xiaoyan A Qu, Eric K Neumann, Bruce J Aronow.
Abstract
A principal goal for biomedical research is to improve our understanding of factors that control clinical disease phenotypes. Among genetically-determined diseases, identical mutations may exhibit substantial phenotype variance by individual and background strain, suggesting both environmental and genetic mutant allele interactions. Moreover, different diseases can share phenotypic features extensively. To test the hypothesis that phenotypic similarities and differences among diseases and disease subvariants may represent differential activation of correlated feature "disease phenotype modules", we systematically parsed Online Mendelian Inheritance in Man (OMIM) and Syndrome DB databases using the UMLS to construct a disease - clinical phenotypic feature matrix suitable for various clustering algorithms. Using Cardiovascular Syndromes as a model, our results demonstrate a critical role for representing both phenotypic generalization and specificity relationships for the ability to retrieve non-trivial associations among disease entities such as shared protein domains and pathway and ontology functions of associated causal genes.Entities:
Year: 2008 PMID: 21347123 PMCID: PMC3041520
Source DB: PubMed Journal: Summit Transl Bioinform ISSN: 2153-6430
Figure 1:Reduction in Clinical features after semantic normalization. Count above each bar indicates the total number of features
Figure 2:Plot of normalized cumulative sum versus dimension
Figure 3:Phenotype similarity versus gene annotation similarities (a) proteins associated with similar phenotypes and sharing at least one CDD domain (b) genes associated with similar phenotypes and sharing three or more GOA at the sixth or more detailed level. The average signal of 10 randomized phenomaps is at the lowest level. Disallow same gene analysis skips the disease pairs having same implicated gene.