| Literature DB >> 32315351 |
Aditya Rao1, Thomas Joseph1, Vangala G Saipradeep1, Sujatha Kotte1, Naveen Sivadasan1, Rajgopal Srinivasan1.
Abstract
INTRODUCTION: Phenotype-driven rare disease gene prioritization relies on high quality curated resources containing disease, gene and phenotype annotations. However, the effectiveness of gene prioritization tools is constrained by the incomplete coverage of rare disease, phenotype and gene annotations in such curated resources.Entities:
Mesh:
Year: 2020 PMID: 32315351 PMCID: PMC7173875 DOI: 10.1371/journal.pone.0231728
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1PRIORI-modules: (a) Rare Disease Dictionary Curation module, (b) Rare Disease Annotator (c) Rare Disease MEDLINE Processor (d) Network Creation module (e) Gene Prioritization module.
Examples of overlaps within and across dictionaries.
| Overlap Type | Entity | ID1 | ID2 |
|---|---|---|---|
| HGNC | cox2 | HGNC:7421 | HGNC:9605 |
| nat3 | HGNC:14679 | HGNC:15908 | |
| Orphanet-HPO | submucosal cleft palate | ORPHA:155878 | HP:0000176 |
| fulminant hepatic failure | ORPHA:90062 | HP:0004448 | |
| HGNC-HPO | Paroxysmal nocturnal hemoglobinuria | HGNC:8957 | HP:0004818 |
| Warts | HGNC:6514 | HP:0200043 | |
| HGNC-Orphanet | cayman ataxia | HGNC:779 | ORPHA:94122 |
| spg71 | HGNC:17277 | ORPHA:401840 |
Dictionary term counts before and after curation.
| Dictionary | Main Resource | Additional Resource(s) | Before Curation | After Curation |
|---|---|---|---|---|
| Disease | 22,546 | 20,294 | 42,840 | 45,856 |
| Phenotype | 30,812 | 17,915 | 48,727 | 55,183 |
| Gene | 80,832 | 5 | 80,832 | 82,387 |
Fig 2(a) Comparison of the quality of associations of PRIORI-T with DisGeNET and Open Targets using Orphanet associations. (b) Comparison of the quality of associations of PRIORI-T with rare-disease related DisGeNET and Open Targets using Orphanet associations.
Fig 3The cumulative percentage of causal genes found in Top-k when gene prioritization was performed on 230 real-world rare disease clinical cases using HANRD, PRIORI-T, PRIORI-T(HANRDINITIAL+ICN) and Orphamizer.
Fig 4The Top-k distribution of the causal genes plotted for each the three time-series datasets–up to 1994, 2004 and 2018 for both ICN and ASN.