Literature DB >> 35998640

OARD: Open annotations for rare diseases and their phenotypes based on real-world data.

Cong Liu1, Casey N Ta1, Jim M Havrilla2, Jordan G Nestor3, Matthew E Spotnitz1, Andrew S Geneslaw2, Yu Hu2, Wendy K Chung4, Kai Wang2, Chunhua Weng5.   

Abstract

Diagnosis for rare genetic diseases often relies on phenotype-driven methods, which hinge on the accuracy and completeness of the rare disease phenotypes in the underlying annotation knowledgebase. Existing knowledgebases are often manually curated with additional annotations found in published case reports. Despite their potential, real-world data such as electronic health records (EHRs) have not been fully exploited to derive rare disease annotations. Here, we present open annotation for rare diseases (OARD), a real-world-data-derived resource with annotation for rare-disease-related phenotypes. This resource is derived from the EHRs of two academic health institutions containing more than 10 million individuals spanning wide age ranges and different disease subgroups. By leveraging ontology mapping and advanced natural-language-processing (NLP) methods, OARD automatically and efficiently extracts concepts for both rare diseases and their phenotypic traits from billing codes and lab tests as well as over 100 million clinical narratives. The rare disease prevalence derived by OARD is highly correlated with those annotated in the original rare disease knowledgebase. By performing association analysis, we identified more than 1 million novel disease-phenotype association pairs that were previously missed by human annotation, and >60% were confirmed true associations via manual review of a list of sampled pairs. Compared to the manual curated annotation, OARD is 100% data driven and its pipeline can be shared across different institutions. By supporting privacy-preserving sharing of aggregated summary statistics, such as term frequencies and disease-phenotype associations, it fills an important gap to facilitate data-driven research in the rare disease community.
Copyright © 2022 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  electronic health records; human phenotype ontology; knowledge graph; natural language processing; open data sharing; phenotype association; rare disease

Mesh:

Year:  2022        PMID: 35998640      PMCID: PMC9502051          DOI: 10.1016/j.ajhg.2022.08.002

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.043


  52 in total

1.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

2.  A novel tool for the identification of correlations in medical data by faceted search.

Authors:  Danilo Schmidt; Klemens Budde; Daniel Sonntag; Hans-Jürgen Profitlich; Matthias Ihle; Oliver Staeck
Journal:  Comput Biol Med       Date:  2017-04-14       Impact factor: 4.589

3.  Phevor combines multiple biomedical ontologies for accurate identification of disease-causing alleles in single individuals and small nuclear families.

Authors:  Marc V Singleton; Stephen L Guthery; Karl V Voelkerding; Karin Chen; Brett Kennedy; Rebecca L Margraf; Jacob Durtschi; Karen Eilbeck; Martin G Reese; Lynn B Jorde; Chad D Huff; Mark Yandell
Journal:  Am J Hum Genet       Date:  2014-04-03       Impact factor: 11.025

Review 4.  Rare childhood diseases: how should we respond?

Authors:  Y Zurynski; K Frith; H Leonard; E Elliott
Journal:  Arch Dis Child       Date:  2008-08-06       Impact factor: 3.791

5.  Next-generation diagnostics and disease-gene discovery with the Exomiser.

Authors:  Damian Smedley; Julius O B Jacobsen; Marten Jäger; Sebastian Köhler; Manuel Holtgrewe; Max Schubach; Enrico Siragusa; Tomasz Zemojtel; Orion J Buske; Nicole L Washington; William P Bone; Melissa A Haendel; Peter N Robinson
Journal:  Nat Protoc       Date:  2015-11-12       Impact factor: 13.491

6.  Deep Phenotyping on Electronic Health Records Facilitates Genetic Diagnosis by Clinical Exomes.

Authors:  Jung Hoon Son; Gangcai Xie; Chi Yuan; Lyudmila Ena; Ziran Li; Andrew Goldstein; Lulin Huang; Liwei Wang; Feichen Shen; Hongfang Liu; Karla Mehl; Emily E Groopman; Maddalena Marasa; Krzysztof Kiryluk; Ali G Gharavi; Wendy K Chung; George Hripcsak; Carol Friedman; Chunhua Weng; Kai Wang
Journal:  Am J Hum Genet       Date:  2018-06-28       Impact factor: 11.025

7.  ClinPhen extracts and prioritizes patient phenotypes directly from medical records to expedite genetic disease diagnosis.

Authors:  Cole A Deisseroth; Johannes Birgmeier; Ethan E Bodle; Jennefer N Kohler; Dena R Matalon; Yelena Nazarenko; Casie A Genetti; Catherine A Brownstein; Klaus Schmitz-Abe; Kelly Schoch; Heidi Cope; Rebecca Signer; Julian A Martinez-Agosto; Vandana Shashi; Alan H Beggs; Matthew T Wheeler; Jonathan A Bernstein; Gill Bejerano
Journal:  Genet Med       Date:  2018-12-05       Impact factor: 8.822

8.  Linking common human diseases to their phenotypes; development of a resource for human phenomics.

Authors:  Şenay Kafkas; Sara Althubaiti; Georgios V Gkoutos; Robert Hoehndorf; Paul N Schofield
Journal:  J Biomed Semantics       Date:  2021-08-23

9.  Australian families living with rare disease: experiences of diagnosis, health services use and needs for psychosocial support.

Authors:  Matilda Anderson; Elizabeth J Elliott; Yvonne A Zurynski
Journal:  Orphanet J Rare Dis       Date:  2013-02-11       Impact factor: 4.123

10.  eRAM: encyclopedia of rare disease annotations for precision medicine.

Authors:  Jinmeng Jia; Zhongxin An; Yue Ming; Yongli Guo; Wei Li; Yunxiang Liang; Dongming Guo; Xin Li; Jun Tai; Geng Chen; Yaqiong Jin; Zhimei Liu; Xin Ni; Tieliu Shi
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.