Literature DB >> 28269930

SMASH: A Data-driven Informatics Method to Assist Experts in Characterizing Semantic Heterogeneity among Data Elements.

William Brown1, Chunhua Weng2, David K Vawdrey3, Alex Carballo-Diéguez4, Suzanne Bakken5.   

Abstract

Semantic heterogeneity (SH) is detrimental to data interoperability and integration in healthcare. Assessing SH is difficult, yet fundamental to addressing the problem. Using expert-based and data-driven methods we assessed SH among HIV-associated data elements (DEs). Using Clinicaltrials.gov, we identified and obtained eight data dictionaries, and created a DE inventory. We vectorized DEs by study, and developed a new method, String Metric-assisted Assessment of Semantic Heterogeneity (SMASH), to find DEs: similar in An and Bn, unique to An, and unique to Bn. An HIV expert assessed pairs for semantic equivalence. Heterogeneous DEs were either semantically-equivalent/syntactically-different (HIV-positive/HIV+/Seropositive), or syntactically-equivalent/semantically-different ("Partner" [sexual]/"Partner"[relationship]). Context of usage was considered. SMASH aided identification of SH. Of 1,175 DE from pairs, 1,048 (87%) were semantically heterogeneous and 127 (13%) were homogeneous. Most heterogeneous pairs (97%) were semantically-equivalent/syntactically-different. Expert-based and data-driven methods are complementary for assessing SH, especially among semantically-equivalent/syntactically-different DE. Similar expert-based/data-driven solutions are recommended for resolving SH.

Mesh:

Year:  2017        PMID: 28269930      PMCID: PMC5333258     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  12 in total

1.  HIV cohort collaborations: proposal for harmonization of data exchange.

Authors:  Jesper Kjaer; Bruno Ledergerber
Journal:  Antivir Ther       Date:  2004-08

2.  A call for collaborative semantics harmonization.

Authors:  Chunhua Weng; Douglas B Fridsma
Journal:  AMIA Annu Symp Proc       Date:  2006

3.  User-centered semantic harmonization: a case study.

Authors:  Chunhua Weng; John H Gennari; Douglas B Fridsma
Journal:  J Biomed Inform       Date:  2007-03-21       Impact factor: 6.317

4.  A method to map heterogeneity between near but non-equivalent semantic attributes in multiple health data registries.

Authors:  Nadine Schuurman; Agnieszka Leszczynski
Journal:  Health Informatics J       Date:  2008-03       Impact factor: 2.681

5.  NIH's Big Data to Knowledge initiative and the advancement of biomedical informatics.

Authors:  Lucila Ohno-Machado
Journal:  J Am Med Inform Assoc       Date:  2014 Mar-Apr       Impact factor: 4.497

6.  News from the NIH: leveraging big data in the behavioral sciences.

Authors:  Robert M Kaplan; William T Riley; Patricia L Mabry
Journal:  Transl Behav Med       Date:  2014-09       Impact factor: 3.046

7.  A Collaborative Framework for Representation and Harmonization of Clinical Study Data Elements Using Semantic MediaWiki.

Authors:  Guoqian Jiang; Harold R Solbrig; Dave Iberson-Hurst; Rebecca D Kush; Christopher G Chute
Journal:  Summit Transl Bioinform       Date:  2010-03-01

8.  Conducting research using the electronic health record across multi-hospital systems: semantic harmonization implications for administrators.

Authors:  Kathryn H Bowles; Sheryl Potashnik; Sarah J Ratcliffe; Melissa Rosenberg; Nai-Wei Shih; Maxim Topaz; John H Holmes; Mary D Naylor
Journal:  J Nurs Adm       Date:  2013-06       Impact factor: 1.737

9.  High heterogeneity of HIV-related sexual risk among transgender people in Ontario, Canada: a province-wide respondent-driven sampling survey.

Authors:  Greta R Bauer; Robb Travers; Kyle Scanlon; Todd A Coleman
Journal:  BMC Public Health       Date:  2012-04-20       Impact factor: 3.295

10.  KaBOB: ontology-based semantic integration of biomedical databases.

Authors:  Kevin M Livingston; Michael Bada; William A Baumgartner; Lawrence E Hunter
Journal:  BMC Bioinformatics       Date:  2015-04-23       Impact factor: 3.169

View more
  1 in total

1.  Automated Identification of Common Disease-Specific Outcomes for Comparative Effectiveness Research Using ClinicalTrials.gov: Algorithm Development and Validation Study.

Authors:  Joseph Finkelstein; Anas Elghafari
Journal:  JMIR Med Inform       Date:  2021-02-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.