| Literature DB >> 25420641 |
Rocío Rodríguez-López1,2, Armando Reyes-Palomares3,4, Francisca Sánchez-Jiménez5,6, Miguel Ángel Medina7,8.
Abstract
BACKGROUND: Several types of genetic interactions in humans can be directly or indirectly associated with the causal effects of mutations. These interactions are usually based on their co-associations to biological processes, coexistence in cellular locations, coexpression in cell lines, physical interactions and so on. In addition, pathological processes can present similar phenotypes that have mutations either in the same genomic location or in different genomic regions. Therefore, integrative resources for all of these complex interactions can help us prioritize the relationships between genes and diseases that are most deserving to be studied by researchers and physicians.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25420641 PMCID: PMC4260198 DOI: 10.1186/s12859-014-0375-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1PhenUMA knowledge base. A: Schematic representation of the PhenUMA knowledge base contents. Three types of relationships are included in the knowledge base: i) “known relationships” (solid lines) were extracted from the databases OMIM, Orphanet and STRING and also include the metabolic interactions from Veeramani and Bader [15]; ii) “inferred relationships” (dashed lines) were taken from the OMIM and Orphanet known relationships; and iii) “semantic similarity relationships” (dotted lines). For the semantic similarity relationships, scores were calculated using the HPO and GO. Genes (red triangles), OMIM diseases (yellow circles) and Orphanet diseases (blue octagons) are the components of these relationships. This schematic describes how inferred relationships were determined from known relationships; that is, how the dashed lines were deduced from the solid lines. B: Illustrative example of integration between phenotypic and functional gene-gene relationships as retrieved in PhenUMA for ornithine transcarbamylase (OTC; MIM# 300461) at a medium confidence level.
Summary of main relationships in the knowledge base
|
|
|
|
|
|---|---|---|---|
|
| |||
| OMIM-OMIM | Inferred by Genes (OMIM) | 1843 | 2885 |
| OMIM-OMIM | Phenotypic Similarity (HPO) | 4627 | 149689a |
| Orphan Disease-Orphan Disease | Inferred by Genes (Orphanet) | 1655 | 3568 |
| Orphan Disease-Orphan Disease | Phenotypic Similarity (HPO) | 3068 | 75924a |
| Gene-Gene | Inferred by OMIM (OMIM) | 784 | 3217 |
| Gene-Gene | Inferred by Orphan Disease (Orphanet) | 1641 | 8292 |
| Gene-Gene | Phenotypic Similarity (HPO) | 1681 | 24902a |
|
| |||
| Gene-Gene | Functional Similarity (GO Biological Process) | 9123 | 486982a |
| Gene-Gene | Functional Similarity (GO Cellular Component) | 6046 | 565739a |
| Gene-Gene | Functional Similarity (GO Molecular Function) | 8087 | 397683a |
| Gene-Gene | Protein-protein interactions (STRING) | 10316 | 96856 |
| Gene-Gene | Metabolic interactions [Veeramani and Bader[ | 535 | 9812 |
aResulting relationships to apply the respective cutoff for low confidence level.
Figure 2Effects of phenotypic similarity cutoff variations on the number of elements and Jaccard coefficients. Computed phenotypic similarities for gene pairs (blue squares), OMIM disease pairs (red circles) and Orphanet disease pairs (green triangles) were filtered at the 95th percentile, and different cutoff scores corresponding to the 95th, 98th, 99th and 99.5th percentiles were used. The Resnik and Robinson measurements are shown as solid and dashed lines, respectively. A: Variations in the number of genes and diseases that are involved in phenotypic similarities at increasing values of the similarity score. B: Variations of the Jaccard’s similarity coefficients calculated from the resulting intersection between the phenotypic similarity-based networks and their respective inferred networks is represented as the distinct similarity scores.
Figure 3Building network process. A: Input provided by the user of PhenUMA. Gene-Gene network building allows a set of genes or diseases as input (OMIM or Orphan diseases). In case of providing a disease list, genes associated with each disease (OMIM or Orphanet associations) are use to create the gene-gene network in the building network stage. Disease-Disease network can relate OMIM diseases or Orphan diseases and in both cases the input type are similar: a list of diseases or a set of genes. Phenotype query network building require of a set of phenotypes (HPO) as input, which is taken as a phenotype profile. B: Building network stage is divided in two parts: the seed network building that contains the relationships between de input set (genes, diseases or phenotypes) and the rest of elements included in the database and the network enrichment that consist in the addition of the rest of relationships included in the knowledge base (see Figure 1) between the elements related in each network.
Figure 4Subsets of inferred and phenotypically similar gene pairs. Venn diagram showing the distribution of gene pairs between a dataset of inferred relationships (from the union of OMIM and Orphanet) and the phenotypic similarity gene network at a low level of confidence corresponding to the 98th percentile.
Figure 5Phenotypically similar disorders associated with SSADH deficiency at different confidence levels. PhenUMA results of the query for SSADH deficiency (MIM# 271980) at different levels of confidence A: Low, B: Medium and C: High. All panels are screenshots of the PhenUMA results that were edited to highlight the main clinical features associated with each OMIM disease cluster.
Phenotypic enrichment of SSADHD and high confidence similar disorders
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| HP:0002197 | Generalized seizures | 70 | 13 | 4.87E-19 | (607628, 607681, 611364, 600669, 608096, 607631, 607208, 300423, 608217, 600131, 271980, 604827, 300088) |
| HP:0002123 | Generalized myoclonic seizures | 27 | 6 | 2.62E-08 | (611364, 600669, 607631, 607208, 271980, 604827) |
| HP:0002133 | Status epilepticus | 11 | 4 | 3.53E-06 | (608096, 607208, 271980, 300088) |
| HP:0002392 | EEG with polyspike wave complexes | 4 | 3 | 1.35E-05 | (607681, 600669, 600131) |
| HP:0000717 | Autism | 35 | 4 | 5.29E-04 | (606053, 238350, 209800, 271980) |
| HP: 0000708 | Behavioural/Psychiatric Abnormality | 406 | 8 | 4.47E-03 | (143465, 606053, 238350, 167870, 209800, 271980, 300088, 190100) |
| HP:0001311 | Neurophysiological abnormality | 83 | 4 | 1.65E-02 | (607681, 600669, 600131, 271980) |
| HP:0000739 | Anxiety | 33 | 3 | 1.71E-02 | (167870, 271980, 190100) |
Comparison of PhenUMA with other tools
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| PhenUMA | Yes | IC-based | Yes | Yes | Yes | Yes | Yes |
| Phenomizer | Yes | IC-based | No | Yes | No | Yes | No |
| GeneMania | No | - | Yes | No | Yes | Yes | Yes |
| PhenomeNET | Yes | Jaccard’s Index | Yes | Yes | Yes | Yes | No |
| MalaCards | No* | MCRDS | Yes | Yes | Yes | No | Yes |
*Mouse Phenotypes (from Mammalian Phenotype Ontology) are related with the disease queried but not Human Phenotypes.
Figure 6ROC curve and false discovery rates (FDR) for phenotypic similarities between diseases provided by PhenUMA and PhenomeNET. A: ROC curves for phenotypic similarities between OMIM diseases. For all the cases we used the same reference dataset. This dataset are all inferred OMIM disease pairs that are those diseases associated with the same gene/s. It is noteworthy that the results from Robinson and Resnik are equivalent to those in Additional file 1: Figure S1A and S1B, respectively, B: FDR for increasing values of phenotypic similarity scores.
Phenotypic enrichment of OMIM diseases similar to SSADH Deficiency (OMIM 271980)
|
| |||||||
|---|---|---|---|---|---|---|---|
|
|
|
| |||||
| Phenotypes | IC | Top 10 | Top 50 | Top 10 | Top 50 | Top 10 | Top 50 |
| Status epilepticus | 0,709 |
|
| 6,81E-01 | 5,83E-01 | 1 | 1,39E-01 |
| Absence seizures | 0,681 |
|
|
|
| 2,99E-01 | |
| Hyperkinesis | 0,658 | 1 | 1 | 1 | |||
| Hallucinations | 0,613 | 7,55E-01 | 1 | 1 | |||
| Generalized myoclonic seizures | 0,604 |
|
|
|
| 1 | 1 |
| Anxiety | 0,581 | 6,90E-02 |
| 5,78E-02 | 4,79E-01 | ||
| Autism | 0,574 | 7,76E-02 | 1,51E-01 | 5,67E-01 | 1 | ||
| Psychosis | 0,565 | 1 |
| 1 | 1 | 1 | |
| Generalized tonic-clonic seizures | 0,562 |
|
|
|
|
|
|
| Delayed speech and language development | 0,543 | 1 | 1 | 1 | 1 | ||
| Aggressive behavior | 0,540 | 1 |
| 1 | 1 | 1 | |
| Hypokinesia | 0,491 | 1 | 1 | ||||
| EEG abnormality | 0,489 | 1 |
| 1 |
| 1 | 1 |
| Increased body weight | 0,486 | 1 | |||||
| Hyperactivity | 0,484 |
| 1 | 1 | 1 | ||
| Hyporeflexia | 0,437 | 1 | 1 | 1,14E-01 | |||
| Motor delay | 0,420 | 1 | 1 | 1 | |||
| Ataxia | 0,317 | 1 | 1 | 1 |
| ||
| Abnormality of eye movement | 0,307 | 8,75E-01 | 1,54E-01 |
| |||
| Muscular hypotonia | 0,281 | 1 | 1 |
| |||
| Intellectual disability | 0,214 | 1 | 1 | 8,65E-01 |
| ||
| Abnormality of metabolism/homeostasis | 0,123 | 1 | 1 | 1 | 1 | 1 | 1 |
In bold, Bonferroni corrected P-values ≤0.05, hypergeometric tests.