
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience.

Jyotishman Pathak, Janey Wang, Sudha Kashyap, Melissa Basford, Rongling Li, Daniel R Masys, Christopher G Chute.

Abstract

BACKGROUND: Systematic study of clinical phenotypes is important for a better understanding of the genetic basis of human diseases and for more effective gene-based disease management. A key requirement for such studies is standardized representation of the phenotype data using common data elements (CDEs) and controlled biomedical vocabularies. In this study, the authors analyzed how a limited subset of phenotypic data is amenable to common definition and standardized collection, and how adopting such definitions in large-scale epidemiological and genome-wide studies can significantly facilitate cross-study analysis.
METHODS: The authors mapped phenotype data dictionaries from five different eMERGE (Electronic Medical Records and Genomics) Network sites studying multiple diseases such as peripheral arterial disease and type 2 diabetes. For mapping, standardized terminological and metadata repository resources, such as the caDSR (Cancer Data Standards Registry and Repository) and SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms), were used. The mapping process comprised both lexical (via searching for relevant pre-coordinated concepts and data elements) and semantic (via post-coordination) techniques. Where feasible, new data elements were curated to enhance the coverage during mapping. A web-based application was also developed to uniformly represent and query the mapped data elements from different eMERGE studies.
RESULTS: Approximately 60% of the target data elements (95 out of 157) could be mapped using simple lexical analysis techniques on pre-coordinated terms and concepts before eMERGE investigators initiated any additional curation of terminology and metadata resources. After curating 54 new caDSR CDEs and nine new NCI Thesaurus concepts and applying post-coordination, the authors were able to map the remaining 40% of data elements to caDSR and SNOMED CT. A web-based tool was also implemented to assist in semi-automatic mapping of data elements.
CONCLUSION: This study emphasizes the requirement for standardized representation of clinical research data using existing metadata and terminology resources and provides simple techniques and software for data element mapping using experiences from the eMERGE Network.
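
The lexical step described in METHODS can be illustrated with a minimal Python sketch. It is not the authors' tool: the function names, the identifiers (CDE-0001 and so on), and the sample labels are hypothetical placeholders, and the matching rule (exact match after simple normalization) is an assumed stand-in for "simple lexical analysis". It shows how a local data dictionary might be matched against pre-coordinated standard names and how a coverage fraction like the one reported in RESULTS is computed.

```python
import re


def normalize(label: str) -> str:
    """Lowercase the label and collapse punctuation/underscores into single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", label.lower()).strip()


def lexical_map(local_elements, standard_terms):
    """Map each local label to a standard identifier by exact normalized match.

    standard_terms maps identifier -> preferred name (hypothetical data).
    Unmatched labels map to None; these would be candidates for curation
    or post-coordination.
    """
    index = {normalize(name): ident for ident, name in standard_terms.items()}
    return {label: index.get(normalize(label)) for label in local_elements}


def coverage(mapping) -> float:
    """Fraction of local elements that received a mapping."""
    mapped = sum(1 for target in mapping.values() if target is not None)
    return mapped / len(mapping)


# Hypothetical repository entries; the identifiers are placeholders, not real caDSR IDs.
standard_terms = {
    "CDE-0001": "Body Mass Index",
    "CDE-0002": "Systolic Blood Pressure",
    "CDE-0003": "Date of Birth",
}

# Hypothetical labels from a site's phenotype data dictionary.
local_elements = ["body_mass_index", "Systolic blood pressure", "Ankle-brachial index"]

mapping = lexical_map(local_elements, standard_terms)
print(mapping)
print(f"coverage: {coverage(mapping):.1%}")

# The coverage figure reported in RESULTS is the same kind of ratio: 95 of 157.
print(f"reported: {95 / 157:.1%}")
```

In this toy data the third label has no pre-coordinated match and stays unmapped, which corresponds to the elements that in the study required new curated CDEs or post-coordinated expressions.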

Year: 2011; PMID: 21597104; PMCID: PMC3128396; DOI: 10.1136/amiajnl-2010-000061

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


