| Literature DB >> 29297376 |
Jiajie Peng1, Qianqian Li1, Xuequn Shang2.
Abstract
BACKGROUND: Although disease diagnosis has greatly benefited from next generation sequencing technologies, it is still difficult to make the right diagnosis purely based on sequencing technologies for many diseases with complex phenotypes and high genetic heterogeneity. Recently, calculating Human Phenotype Ontology (HPO)-based phenotype semantic similarity has contributed a lot for completing disease diagnosis. However, factors which affect the accuracy of HPO-based semantic similarity have not been evaluated systematically.Entities:
Keywords: Biological ontology; Human phenotype ontology; Semantic similarity
Mesh:
Year: 2017 PMID: 29297376 PMCID: PMC5763495 DOI: 10.1186/s13326-017-0144-y
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1-The workflow of HPOFactor
Fig. 2-The rank of disease by changing the size of phenotype annotation set. The x-axis is the number of HPO annotations. The y-axis is the rank of disease associated with the query patient
Fig. 3-The rank of causative gene by changing the size of phenotype annotation set. The x-axis is the number of HPO annotations. The y-axis is the rank of causative gene of the query patient
Fig. 4-The rank of disease by the phenotype with different evidence code. The x-axis is the ranking threshold for the disease. The y-axis is the ratio of patients satisfying the ranking threshold
Fig. 5-The rank of disease (a) and causative gene (b) by varying the quality of phenotype annotations. The x-axis is the ranking threshold for the disease/causative gene. The y-axis is the ratio of patients satisfying the ranking threshold
Fig. 6-The rank of disease (a) and causative gene (b) by changing the coverage of phenotype annotations. The x-axis is the ranking threshold for the disease/causative gene. The y-axis is the ratio of patients satisfying the ranking threshold