| Literature DB >> 25093073 |
Anika Oellrich1, Sebastian Koehler2, Nicole Washington3, Chris Mungall1, Suzanna Lewis3, Melissa Haendel4, Peter N Robinson2, Damian Smedley1.
Abstract
BACKGROUND: The molecular etiology is still to be identified for about half of the currently described Mendelian diseases in humans, thereby hindering efforts to find treatments or preventive measures. Advances, such as new sequencing technologies, have led to increasing amounts of data becoming available with which to address the problem of identifying disease genes. Therefore, automated methods are needed that reliably predict disease gene candidates based on available data. We have recently developed Exomiser as a tool for identifying causative variants from exome analysis results by filtering and prioritising using a number of criteria including the phenotype similarity between the disease and mouse mutants involving the gene candidates. Initial investigations revealed a variation in performance for different medical categories of disease, due in part to a varying contribution of the phenotype scoring component.Entities:
Year: 2014 PMID: 25093073 PMCID: PMC4108905 DOI: 10.1186/2041-1480-5-S1-S4
Source DB: PubMed Journal: J Biomed Semantics
Figure 1Performance of PHIVE score in Exomiser by Orphanet disease category, together with just the phenotype-based scores. The number of diseases tested for each category are shown in parentheses. Note that many diseases belong to multiple disease categories. The overall PHIVE performance and the contribution of the phenotype score used in Exomiser is seen to vary with respect to the disease category. The highest contribution of the phenotype score is in the category of gastroenterological diseases, the smallest in the category of abdominal surgical diseases.
PhenoDigm performs best for urogenital diseases for mouse and cardiac malformation for fish out of 31 disease categories.
| disease category* | diseases(mouse)† | AUC(mouse)‡ | diseases(fish)† | AUC(fish)‡ |
|---|---|---|---|---|
| abdominal surgical | 104 | 0.856 (0.336) | 67 | |
| Allergic | 5 | - | 0 | - |
| Bone | 368 | 185 | 0.650 (0.110) | |
| cardiac | 128 | 0.857 (0.138) | 58 | |
| cardiac malformations | 34 | 0.822 (0.221) | 23 | |
| circulatory system | 63 | 0.825 (0.239) | 31 | 0.658 (0.417) |
| developmental anomalies in embryogenesis | 943 | 0.852 (0.177) | 475 | |
| endocrine | 307 | 128 | 0.629 (0.382) | |
| eye | 582 | 269 | 0.646 (0.147) | |
| gastroenterological | 74 | 0.842 (0.391) | 36 | |
| haematological | 151 | 0.816 (0.151) | 53 | 0.603 (0.215) |
| hepatic | 41 | 8 | - | |
| immunological | 134 | 0.843 (0.391) | 36 | |
| inborn errors of metabolism | 384 | 91 | 0.646 (0.103) | |
| infectious | 3 | - | 2 | - |
| infertility | 41 | 0.817 (0.154) | 18 | 0.635 (0.496) |
| neurological | 777 | 328 | 0.630 (0.486) | |
| odontological | 44 | 0.899 (0.078) | 18 | 0.693 (0.161) |
| otorhinolaryngological | 150 | 74 | ||
| renal | 277 | 0.846 (0.479) | 130 | |
| respiratory | 65 | 0.808 (0.126) | 35 | 0.594 (0.135) |
| skin | 418 | 0.852 (0.161) | 154 | 0.636 (0.442) |
| surgical maxillo facial | 89 | 0.836 (0.367) | 56 | |
| surgical thoracic | 33 | 0.816 (0.176) | 12 | 0.641 (0.364) |
| systematic and rheumatological | 68 | 0.832 (0.297) | 14 | 0.592 (0.311) |
| teratologic | 1 | - | 1 | - |
| tumors | 239 | 0.835 (0.388) | 130 | |
| urogenital | 62 | 31 | 0.608 (0.207) | |
| all diseases | 3728 | 0.845 | 1558 | 0.630 |
* disease categories according to Orphanet [19]; † number of diseases falling into this category with phenotype data for the orthologue(s) of the associated gene; ‡ AUC, measured on disease-gene associations from OMIM's MorbidMap and Orphanet curation. Value in brackets shows the p-value of obtained this result compared to those obtained from randomly selecting the same number of diseases. Significant results (p value <0.05) are shown in bold. Note that one disease may fall into different categories due to multiple systems affected by disease.
Figure 2Relationships between common HPO clinical phenotypes for the hepatic disease class and MP terms. (a) The HPO term for Hepatomegaly is identified as being equivalent to the MP term for enlarged liver via their logical definitions. This MP term and its children are identified as the best scoring matches. (b) Lack of a logical definition for Pruritis leads to the best scoring MP matches being to the higher level term of abnormal skin physiology and all of its multiple child terms, many of which having no relationship to pruritis.