Literature DB >> 29295162

Phenotypic Analysis of Clinical Narratives Using Human Phenotype Ontology.

Feichen Shen1, Liwei Wang1, Hongfang Liu1.   

Abstract

Phenotypes are defined as observable characteristics and clinical traits of diseases and organisms. As connectors between medical experimental findings and clinical practices, phenotypes play vital roles in translational medicine. To facilitate the translation between genotype and phenotype, Human Phenotype Ontology (HPO) was developed as a semantically computable vocabulary to capture phenotypic abnormalities found in human diseases discovered through biomedical research. The use of HPO in annotating phenotypic information in clinical practice remains unexplored. In this study, we investigated the use of HPO to annotate phenotypic information in clinical domain by leveraging a corpus of 12.8 million clinical notes created from 2010 to 2015 for 729 thousand patients at Mayo Clinic Rochester campus and assessed the distribution information of HPO terms in the corpus. We also analyzed the distributional difference of HPO terms among demographic groups. We further demonstrated the potential application of the annotated corpus to support knowledge discovery in precision medicine through Wilson's Disease.

Entities:  

Keywords:  Human Phenotype Ontology; Phenotypic Analysis; Semantic Annotation

Mesh:

Year:  2017        PMID: 29295162      PMCID: PMC7466871     

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


Introduction

Phenotypes, defined as observable characteristics and clinical traits of diseases and organisms, have attracted increasing attention in the area of translational medicine by serving as the connectors between medical experimental findings and clinical practices. For example, in genomic medicine, phenotypes provide evidence to stratify and differentiate various groups of patients in order to discover specific hidden genotype-phenotype associations [1]. As a tool for annotating human phenotypic abnormalities, Human Phenotype Ontology (HPO) [2] has been developed as a controlled vacubulary for phenotypes by mining and integrating phenotype knowledge from medical literatures, Orphanet [3], Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources (DECIPHER) [4], and Online Mendelian Inheritance in Man (OMIM) [5]. HPO also provides associations with other biomedical resources such as Gene Ontology [6]. Multiple studies have utilized HPO for data annotation. For example, Taboada [7] developed a semantic annotation system to automatically extract rare disease information from PubMed literature using HPO. In addition, Westbury and his colleagues [8] used a HPO-based clustering approach to identify group of heritable bleeding and platelet disorders (BPD) cases. Zhu and her colleagues [9] leveraged HPO to develop a genetic testing knowledge base. Here, we used HPO to annotate a large collection of clinical narratives consisting of all clinical notes generated from 2010 to 2015 at Mayo Clinic Rochester campus and assessed the distributional information of HPO terms. A case study was performed to demonstrate the potential application of the annotated data. In the following, we describe the methods used for generating the annotated corpus. Next, we present and discuss our results. We then analyze the distributional differences among demographic groups and detail the case study followed by the conclusion and potential future directions.

Background and Materials

Clinical Data Collection

We collected all clinical notes created between 2010 and 2015 at Mayo Clinic Rochester campus with research authorization. The resulting corpus contains 12.8 million clinical notes corresponding to 729 thousand patients. Specifically, we limited our annotation to the diagnosis section of clinical notes for better extracting phenotype related terms.

The Unified Medical Language System and MetaMap

The Unified Medical Language System (UMLS) [10] is an integrated database that contains all key medical terminologies and their related resources. Each term in the UMLS has a Concept Unique Identifier (CUI) and belongs to a specific semantic type. MetaMap [11] is a configurable application that is able to map biomedical terms to UMLS Metathesaurus. In this study, we applied MetaMap with the UMLS version 2015 on free-text clinical notes to extract related terms as a preprocessing step.

Human Phenotype Ontology

We used the latest version of Human Phenotype Ontology (HPO) released on September 2016. The HPO contains four sub-ontologies as shown in Table 1, and each sub-ontology has its different focus on annotating area.
Table 1–

HPO Sub-ontologies

Sub-ontologyDescription
Phenotypic AbnormalityMain ontology of the HPO and covers most of the clinical abnormalities
Mode of InheritanceSub-ontology to describe mode of inheritance
Clinical ModifierSub-ontology with description of typical modifiers of clinical symptoms
Mortality/AgingSub-ontology that indicates time of death
HPO package contains an ontology file that records all HPO phenotype terms with their synonyms and resource references (e.g., the UMLS). As shown in Figure 1, in the ontology hierarchical structure, Menorrhagia (HP_0000132) belongs to sub-ontology Phenotypic Abnormality and is listed as a subclass of abnormal bleeding. Two synonyms abnormally heaving bleeding during menstruation and hypermenorrhea are connected to it through object property has_exact_synonym. Moreover, Menorrhagia relates to a UMLS CUI C0025323 through Web Ontology Language (OWL) axiom and object property annotatedTarget. The corresponding Resource Description Framework (RDF) representation in Figure 1 indicates such triple relationships. Although HPO has link to the UMLS, the current version of HPO has not been fully incorporated into the UMLS version after 2013. Due to this limitation, the combination of both HPO and the UMLS is able to make a more comprehensive annotation than using only one of them.
Figure 1–

An Example of HPO Phenotype

In addition, there is another clinical annotation file wrapped in HPO package that captures around 119,000 phenotype-disease associations where a majority of them (115,000) correspond to hereditary conditions. Take Wilson’s Disease as an example, the HPO annotation file records 33 relevant phenotypes, e.g., Proteinuria, Dementia, and Coma etc.

Methods

Our annotation workflow is designed to leverage the UMLS and the corresponding NLP tool MetaMap. Figure 2 shows the overview of our annotation workflow, a combination of natural language processing (NLP) and semantic processing techniques. It includes two modules: i) the preprocessing module which leverages NLP techniques to extract key words from text; and ii) the semantic annotation module which maps the lexicon entries acquired from text to HPO terms and applies a greedy approach for each entry.
Figure 2–

Annotation Work Flow

NLP Preprocessing Module

We first identify UMLS concepts in the diagnosis section of the clinical notes using MetaMap. We only keep UMLS concepts with the following phenotype-related semantic types: Disease or Syndrome (dsyn), Neoplastic Process (neop), Mental or Behavioral Dysfunction (mobd), Anatomical Abnormality (anab), Congenital Abnormality (cgab), Injury or Poisoning (inpo), Finding (fndg) and Sign or Symptom (sosy).

Semantic Annotation Module

Our annotation focuses on the main HPO sub-ontology Phenotypic Abnormality (HP_0000118) and its descendants. It contains 11,721 phenotype terms and 19,358 related synonyms defined in HPO or inferred from the linked UMLS CUIs. Our mapping includes two steps: we map UMLS concepts derived from the NLP preprocessing module to HPO terms using exact match, in case of ambiguity, we select the one with the smallest CUI number; and for HPO terms that failed to be mapped to the UMLS, the module then conducts another round of string matching refinement. Specifically, it reviews tokenized original text to check if there exists any possible candidate missed by MetaMap.

Results

The proposed annotation system was developed with Eclipse Standard/SDK version Luna 4.4.0. The interface to access MetaMap, the UMLS, and HPO were coded in Java programming language. In addition, we deployed the whole system on the Open Grid Scheduler (OGS) framework running on 64 bit Linux CentOS 6.8 servers hosted by Mayo Clinic to improve processing speed on large amounts of clinical notes. In the following, to protect patients’ privacy, any phenotypes with counts less than 11 were marked as <0.0015% or <11.

HPO Phenotype Coverage in Clinical Data

Overall, 3,241 out of 11,721 (27.7%) HPO phenotypes were extracted from 6,771,409 (52.8%) clinical notes. HPO has 23 abnormality categories. In our cohort, we counted the number of unique phenotypes and computed phenotype coverage for each category as shown in Figure 3. For example, Abnormality of the nervous system has the most phenotypes involved (599), e.g., Anomia, Anxiety, Bone pain and Dysesthesia while Abnormality of the thoracic cavity has no phenotype contained. Meanwhile, Abnormality of the voice holds a small group of phenotypes (11), including Dysphonia, High pitched voice, Hoarse cry, and Laryngeal dystonia etc. However, in terms of phenotype coverage, 78.6% (11 of 14) phenotypes under Abnormality of the voice were found. Abnormality of the breast also holds the second highest phenotype coverage as 55.6% (15 of 27). Since one phenotype can belong to more than one abnormality categories, in clinical notes, we found that 846 phenotypes are marked by 2 categories, 157 phenotypes dropped in 3 categories, 15 phenotypes are depicted by 4 categories, and only 1 phenotype is described by 5 categories.
Figure 3–

Distribution and Coverage for HPO Categories

Table 2 shows the top 10 frequent phenotypes as well as 10 phenotypes with relatively less frequent occurrences in clinical notes. We can see that many HPO common phenotype terms were found in cohort, such as Hypertension, Hyperlipidemia, Apnea, or Anxiety, etc.
Table 2–

Selected HPO Annotated Common/Rare Phenotypes

Common PhenotypesPatients’ PercentileRare PhenotypesPatients’ Percentile
Hypertension19.8%Alacrima<0.0015%
Hyperlipidemia15.2%Intercostal retractions<0.0015%
Depression9.1%Lattice retinal degeneration<0.0015%
Apnea8.6%Hemifacial hypertrophy<0.0015%
Back pain8.5%Decreased lacrimation<0.0015%
Obesity8.4%Episodic tachypnea<0.0015%
Sleep apnea8.2%Alobar holoprosencephaly<0.0015%
Anxiety8.2%Barrel-shaped chest<0.0015%
Headache8%Chemodectoma<0.0015%
Arthritis7.7%Gastrointestinal carcinoma<0.0015%
We also noted our clinical notes contain phenotypes that related to rare diseases. For example, Alacrima is one of the features for Triple A syndrome. Hemifacial hypertrophy is a rare congenital disease. Episodic tachypnea often comes along with Joubert syndrome. Alobar holoprosencephaly is associated with Holoprosencephaly. Barrel-shaped chest is one of the symptoms of Dyggve Melchior Clausen (DMC) syndrome. Chemodectoma contributes to one of the signs caused by Abdominal Chemodectomas with Cutaneous Angiolipomas.

Demographic Phenotype Analysis

We retrieved patients’ associated demographic information and assessed the phenotype distributional difference among demographic groups. We analyzed the distribution of HPO abnormality categories among different age groups. Basically, we grouped age 0-17, 18-35, 36-54 and >55 as children and teenagers, young adults, mid adults, and old adults age groups. As shown in Figure 4, Abnormality of the nervous system is popular in each age group. Abnormality of immune system occurs more frequently in children and teenagers. Abnormality of the digestive system often comes with young adult patients. While Abnormality of the cardiovascular, skeletal, metabolism/homeostasis and respiratory systems are common problems for mid adults but more common problems for old adults. In addition, Immune system disorder, Abnormality of genitourinary system, and Neoplasm have a high prevalence in old patients.
Figure 4–

Distribution of Phenotypes across 4 Age Groups

In total, there are 2,864 unique phenotypes found in male and 2,907 unique phenotypes detected in female. Based on patients’ percentile for both genders, we found that Hypertension, Apnea, Hyperlipidemia, Back pain, and Obesity are 5 most frequently occurred common phenotypes. We also conducted two case-control studies on male and female respectively to discover dominant phenotypes for each gender. We ranked phenotypes by descending order of odds ratios and filtered out phenotypes that are only associated with one specific gender (e.g., Cryptorchidism for male and Amenorrhea for female). With the odds ratio of 13.6, Hypergonadotropic hypogonadism is the phenotype that closely related to male (Table 3). Meanwhile, Hirsutism shows the highest odds ratio of 94.3 with female (Table 4). In general, top phenotypes in female held a relative higher average odds ratio.
Table 3–

Top 5 Male-Specific Phenotypes

Phenotype#Patientsodds ratiop-value
Hypergonadotropic hypogonadism13613.62.0e-31
Violent behavior20113.9e-05
Dissecting aortic aneurysm189.90.1e-03
Calf muscle hypertrophy<119.90.9e-02
Sinusitis99759.43.7e-158
Table 4–

Top 5 Female-Specific Phenotypes

Phenotype#Patientsodds ratiop-value
Hirsutism114094.36.4e-298
Hepatocellular adenoma114264.9e-27
Bulimia41723.73.4e-94
Choroid plexus cyst12616.44.6e-27
Neural tube defect5512.51.8e-11
We also extracted 29 races from clinical notes and picked 2 with the most population (excluding other and unknown races), which are White (519,098) and Black or Africa (8,835). Based on phenotype frequencies, we found top 5 common phenotypes for both races are Hypertension, Apnea, Hyperlipidemia, Depression, and Back pain, which highly overlapped with top 5 common phenotypes for male and female except Depression. We also conducted two case-control studies on White and Black or Africa patients to acquire significant phenotypes for each race. Table 5 shows top 5 significant White-specific phenotypes based on odds ratio. Based on literature review, we validated that Basal cell carcinoma, Melanoma, and Macular degeneration are highly related to patients in white race [12-14]. Table 6 shows top 5 significant Black or Africa-specific phenotypes based on odds ratio with the number of affected patients greater than or equal to 3. We found that 4 out of 5 top Black or Africa-specific phenotypes are related to genetic and rare diseases. Scarring alopecia of scalp is a rare disorder that destroys hair follicle. Clitoral hypertrophy is a congenital malformation and a rare condition. Intellectual disability, profound is related to Lissencephaly. Spastic ataxia is a symptom of Autosomal Dominant Hereditary Ataxia. Except those rare phenotypes, Status asthmaticus is a severe asthma. Literature indicated that Scarring alopecia of scalp and Intellectual disability are related to Black or Africa patients [15-16]. For other 3 phenotypes, even they occur in few patients but it may be worth to explore more as little evidence has been found about the relationship between them and Black or Africa patients in literature.
Table 5–

Top 5 White-Specific Phenotypes

Phenotype#Patientsodds ratiop-value
Basal cell carcinoma12969113.23.2e-92
Melanoma2023544.85.6e-136
Barrett esophagus289916.51.9e-17
Telangiectasia188416.11.6e-11
Macular degeneration133011.38.5e-08
Table 6–

Top 5 Black or Africa-Specific Phenotypes

Phenotype#Patientsodds ratiop-value
Scarring alopecia of scalp<1125.20.5e-03
Clitoral hypertrophy<1114.70.3e-03
Intellectual disability, profound<1113.60.2e-02
Status asthmaticus<1111.80.1e-03
Spastic ataxia<1111.80.3e-02

A Case Study of Phenotypic Analysis

Wilson’s Disease (WD) is an autosomal recessive rare inherited disorder that makes copper accumulate in organs, and its diagnosis can be challenging as its signs and symptoms are often indistinguishable from those of other liver diseases, such as hepatitis. In this section, we conducted two case-control studies to demonstrate the potential of HPO-annotated data in supporting disease diagnosis. Among a total of 615,590 patients, we extracted 39 patients with confirmed diagnosis of WD and 63 patients suspected for WD but negative by genetic testing, leaving 615,488 patients with no clinical assertions or doubts for WD. Based on the three groups, two case-control studies were designed as follows: 1) To reveal significant phenotypes of WD, odds ratios of phenotypes were computed for 39 confirmed WD patients compared to 615,488 Non-WD patients; 2) To investigate the phenotypes significant for similar disease differentiation, odds ratios of phenotypes were computed for 39 confirmed WD patients compared to 63 patients suspected for WD. Table 7 and 8 shows top 10 phenotypes ranked by odds ratio (with the number of affected patients greater than or equal to 3) representing significant clinical features of WD patients against the general population and the specific cohort with similar symptoms, respectively. Overall, odds ratio of phenotypes in the WD cohort against the general population were higher. Although the number of patients was relatively low, manual evaluation showed that all phenotypes are related to WD [17-19]. In contrast, odds ratio of phenotypes in the WD cohort against the similar cohort were relative lower, reflecting the reality that differential difficulties exist in the diagnosis of WD and WD similar diseases. However, according to [17], 7 phenotypes Hypoalbuminemia, Nevus, Osteopenia, Cognitive impairment, Cirrhosis, Renal insufficiency, and Fever are common symptoms of WD, of which only Osteopenia and Cirrhosis are covered by WD-related phenotypes recorded in HPO annotation file. The remaining are possible comorbidities highly occurring with WD. This analysis revealed the potential of these phenotypes in differentiating WD from similar diseases.
Table 7–

Top 10 Phenotypes with Odds Ratio for Confirmed WD Patients to Non-WD related Patients

Phenotype#Patientsodds ratiop-value
Hepatic failure<1122.50.4e-03
Hypoalbuminemia<1112.10.2e-02
Cirrhosis1110.92.0e-08
Elevated hepatic transaminases<117.290.9e-02
Splenomegaly<117.270.9e-02
Ascites<115.520.7e-02
Encephalopathy<114.540.6e-02
Hepatitis<114.020.5e-02
Lipoma<113.510.6e-01
Thrombocytopenia<113.180.9e-02
Table 8–

Top 10 Phenotypes with Odds Ratio for Confirmed WD Patients to Suspected Patients

Phenotype#Patientsodds ratiop-value
Hyperglycemia<115.20.2
Hypoalbuminemia<115.20.2
Nevus<113.50.2
Esophagitis<112.50.4
Osteopenia<112.50.4
Cognitive impairment<112.50.4
Hyperlipidemia<112.540.2
Cirrhosis112.080.2
Renal insufficiency<112.060.3
Fever<111.710.5

Discussion

Our study showed that almost half of the clinical notes contain no HPO terms. To investigate further, we extracted top 10 frequent phenotypes from those notes including Fatigue, Pharyngitis, Dyslipidemia, Dermatoheliosis, Insomnia, Diabetes, Fibromyalgia, Snoring, Chronic Obstructive Airway Disease, and Rash. In general, most of these phenotypes are not hereditary ones, and some of them are more generic than HPO terms (e.g., SNOMED CT terms [20]). For example, Fatigue covers HPO annotated phenotype Exercise-induced muscle fatigue, Pharyngitis gives a more general description of HPO annotated phenotype Recurrent Pharyngitis, and Diabetes is the super set of HPO annotated phenotype Insulin-resistant diabetes mellitus. Note that in our case study, since Wilson’s Disease is a rare disease, we were not able to provide a large sample size, which results in relative low odds ratio and high p-value when compared confirmed patients to the general population and suspected patients. In addition, in this research, to reduce the confounding factor of comorbidity, we picked phenotypes at the same year with WD diagnosis. In the future, to further address this issue, a more specific timestamp can be added to put restriction on diagnosed period. Currently, HPO records 33 phenotypes specifically related to WD. Although HPO coverage for WD is not sufficient, phenotypes in HPO can help on extracting all symptoms of diseases, making it possible to reveal significant phenotypes of WD by conducting case-control analysis. Therefore, it is possible to combine clinical notes with HPO to provide empirical evidences to further help diagnose and differentiate WD from similar diseases. The integration of genotypic and phenotypic knowledge is an essential step to facilitate knowledge discovery in translational medicine. HPO, as a tool with linkage to gene databases and statistics of patients’ percentile of phenotypes for each disease (e.g., 1% as very rare, 5% as rare), can be leveraged for this purpose. Especially for rare phenotypes, HPO makes it convenient for researchers to explore the potential genotype-phenotype associations.

Conclusions and Future Work

In this study, we used HPO to annotate phenotypes in clinical text for conducting phenotypic analysis. We demonstrated its potential in facilitating knowledge discovery. In the future, we will try different ontologies and dictionaries to detect phenotypes from clinical notes and compare performances among them. Furthermore, a more specific diagnosis time range will be considered to largely reduce mischaracterization caused by complication and comorbidity. Moreover, we will use HPO with NLP and machine learning techniques to link various phenotypes with genes and drugs to perform knowledge discovery on gene disease and drug repositioning.
  18 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

3.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  [Orphanet: a European database for rare diseases].

Authors:  S S Weinreich; R Mangon; J J Sikkens; M E en Teeuw; M C Cornel
Journal:  Ned Tijdschr Geneeskd       Date:  2008-03-01

5.  Race, iris color, and age-related macular degeneration.

Authors:  R N Frank; J E Puklin; C Stock; L A Canter
Journal:  Trans Am Ophthalmol Soc       Date:  2000

6.  Wilson disease with thrombocytopenia (case report).

Authors:  M Zhvania; K Gogberashvili; M Gagoshidze; E Uberi
Journal:  Georgian Med News       Date:  2014-12

7.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.

Authors:  Peter N Robinson; Sebastian Köhler; Sebastian Bauer; Dominik Seelow; Denise Horn; Stefan Mundlos
Journal:  Am J Hum Genet       Date:  2008-10-23       Impact factor: 11.025

8.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations.

Authors:  Joshua C Denny; Marylyn D Ritchie; Melissa A Basford; Jill M Pulley; Lisa Bastarache; Kristin Brown-Gentry; Deede Wang; Dan R Masys; Dan M Roden; Dana C Crawford
Journal:  Bioinformatics       Date:  2010-03-24       Impact factor: 6.937

9.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.

Authors:  Ada Hamosh; Alan F Scott; Joanna S Amberger; Carol A Bocchini; Victor A McKusick
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  Human phenotype ontology annotation and cluster analysis to unravel genetic defects in 707 cases with unexplained bleeding and platelet disorders.

Authors:  Sarah K Westbury; Ernest Turro; Daniel Greene; Claire Lentaigne; Anne M Kelly; Tadbir K Bariana; Ilenia Simeoni; Xavier Pillois; Antony Attwood; Steve Austin; Sjoert Bg Jansen; Tamam Bakchoul; Abi Crisp-Hihn; Wendy N Erber; Rémi Favier; Nicola Foad; Michael Gattens; Jennifer D Jolley; Ri Liesner; Stuart Meacham; Carolyn M Millar; Alan T Nurden; Kathelijne Peerlinck; David J Perry; Pawan Poudel; Sol Schulman; Harald Schulze; Jonathan C Stephens; Bruce Furie; Peter N Robinson; Chris van Geet; Augusto Rendon; Keith Gomez; Michael A Laffan; Michele P Lambert; Paquita Nurden; Willem H Ouwehand; Sylvia Richardson; Andrew D Mumford; Kathleen Freson
Journal:  Genome Med       Date:  2015-04-09       Impact factor: 11.117

View more
  10 in total

1.  Incorporating Knowledge-Driven Insights into a Collaborative Filtering Model to Facilitate the Differential Diagnosis of Rare Diseases.

Authors:  Feichen Shen; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  A clinical text classification paradigm using weak supervision and deep representation.

Authors:  Yanshan Wang; Sunghwan Sohn; Sijia Liu; Feichen Shen; Liwei Wang; Elizabeth J Atkinson; Shreyasee Amin; Hongfang Liu
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-07       Impact factor: 2.796

3.  HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.

Authors:  Feichen Shen; Suyuan Peng; Yadan Fan; Andrew Wen; Sijia Liu; Yanshan Wang; Liwei Wang; Hongfang Liu
Journal:  J Biomed Inform       Date:  2019-06-27       Impact factor: 6.317

4.  Detection of Surgical Site Infection Utilizing Automated Feature Generation in Clinical Notes.

Authors:  Feichen Shen; David W Larson; James M Naessens; Elizabeth B Habermann; Hongfang Liu; Sunghwan Sohn
Journal:  J Healthc Inform Res       Date:  2018-11-06

5.  Modeling seizures in the Human Phenotype Ontology according to contemporary ILAE concepts makes big phenotypic data tractable.

Authors:  David Lewis-Smith; Peter D Galer; Ganna Balagura; Hugh Kearney; Shiva Ganesan; Mahgenn Cosico; Margaret O'Brien; Priya Vaidiswaran; Roland Krause; Colin A Ellis; Rhys H Thomas; Peter N Robinson; Ingo Helbig
Journal:  Epilepsia       Date:  2021-05-05       Impact factor: 6.740

6.  Detecting rare diseases in electronic health records using machine learning and knowledge engineering: Case study of acute hepatic porphyria.

Authors:  Aaron M Cohen; Steven Chamberlin; Thomas Deloughery; Michelle Nguyen; Steven Bedrick; Stephen Meninger; John J Ko; Jigar J Amin; Alex J Wei; William Hersh
Journal:  PLoS One       Date:  2020-07-02       Impact factor: 3.240

7.  Rare disease knowledge enrichment through a data-driven approach.

Authors:  Feichen Shen; Yiqing Zhao; Liwei Wang; Majid Rastegar Mojarad; Yanshan Wang; Sijia Liu; Hongfang Liu
Journal:  BMC Med Inform Decis Mak       Date:  2019-02-14       Impact factor: 2.796

8.  Computational analysis of 10,860 phenotypic annotations in individuals with SCN2A-related disorders.

Authors:  Katherine Crawford; Julie Xian; Katherine L Helbig; Peter D Galer; Shridhar Parthasarathy; David Lewis-Smith; Michael C Kaufman; Eryn Fitch; Shiva Ganesan; Margaret O'Brien; Veronica Codoni; Colin A Ellis; Laura J Conway; Deanne Taylor; Roland Krause; Ingo Helbig
Journal:  Genet Med       Date:  2021-03-17       Impact factor: 8.822

9.  Utilization of Electronic Medical Records and Biomedical Literature to Support the Diagnosis of Rare Diseases Using Data Fusion and Collaborative Filtering Approaches.

Authors:  Feichen Shen; Sijia Liu; Yanshan Wang; Andrew Wen; Liwei Wang; Hongfang Liu
Journal:  JMIR Med Inform       Date:  2018-10-10

10.  Clinical genome sequencing in an unbiased pediatric cohort.

Authors:  Isabelle Thiffault; Emily Farrow; Lee Zellmer; Courtney Berrios; Neil Miller; Margaret Gibson; Raymond Caylor; Janda Jenkins; Deb Faller; Sarah Soden; Carol Saunders
Journal:  Genet Med       Date:  2018-07-16       Impact factor: 8.822

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.