Literature DB >> 33290878

Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction.

Kevin Lybarger1, Mari Ostendorf2, Meliha Yetisgen3.   

Abstract

Social determinants of health (SDOH) affect health outcomes, and knowledge of SDOH can inform clinical decision-making. Automatically extracting SDOH information from clinical text requires data-driven information extraction models trained on annotated corpora that are heterogeneous and frequently include critical SDOH. This work presents a new corpus with SDOH annotations, a novel active learning framework, and the first extraction results on the new corpus. The Social History Annotation Corpus (SHAC) includes 4480 social history sections with detailed annotation for 12 SDOH characterizing the status, extent, and temporal information of 18K distinct events. We introduce a novel active learning framework that selects samples for annotation using a surrogate text classification task as a proxy for a more complex event extraction task. The active learning framework successfully increases the frequency of health risk factors and improves automatic extraction of these events over undirected annotation. An event extraction model trained on SHAC achieves high extraction performance for substance use status (0.82-0.93 F1), employment status (0.81-0.86 F1), and living status type (0.81-0.93 F1) on data from three institutions.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Active learning; Machine learning; Natural language processing; Social determinants of health

Mesh:

Year:  2020        PMID: 33290878      PMCID: PMC7856628          DOI: 10.1016/j.jbi.2020.103631

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  25 in total

Review 1.  Extent of illicit drug use and dependence, and their contribution to the global burden of disease.

Authors:  Louisa Degenhardt; Wayne Hall
Journal:  Lancet       Date:  2012-01-07       Impact factor: 79.321

2.  Annual smoking-attributable mortality, years of potential life lost, and productivity losses--United States, 1997-2001.

Authors: 
Journal:  MMWR Morb Mortal Wkly Rep       Date:  2005-07-01       Impact factor: 17.586

3.  Towards the Inference of Social and Behavioral Determinants of Sexual Health: Development of a Gold-Standard Corpus with Semi-Supervised Learning.

Authors:  Daniel J Feller; Jason Zucker; Oliver Bear Don't Walk; Bharat Srikishan; Roxana Martinez; Henry Evans; Michael T Yin; Peter Gordon; Noémie Elhadad
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

4.  Using Neural Multi-task Learning to Extract Substance Abuse Information from Clinical Notes.

Authors:  Kevin Lybarger; Meliha Yetisgen; Mari Ostendorf
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

5.  Addressing Social Determinants to Improve Patient Care and Promote Health Equity: An American College of Physicians Position Paper.

Authors:  Hilary Daniel; Sue S Bornstein; Gregory C Kane; Jan K Carney; Heather E Gantzer; Tracey L Henry; Joshua D Lenchus; Joseph M Li; Bridget M McCandless; Beth R Nalitt; Lavanya Viswanathan; Caleb J Murphy; Ayeetin M Azah; Lianne Marks
Journal:  Ann Intern Med       Date:  2018-04-17       Impact factor: 25.391

Review 6.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

7.  Exploring Representativeness and Informativeness for Active Learning.

Authors:  Bo Du; Zengmao Wang; Lefei Zhang; Liangpei Zhang; Wei Liu; Jialie Shen; Dacheng Tao
Journal:  IEEE Trans Cybern       Date:  2015-11-17       Impact factor: 11.448

8.  Active Deep Learning-Based Annotation of Electroencephalography Reports for Cohort Identification.

Authors:  Ramon Maldonado; Travis R Goodwin; Sanda M Harabagiu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26

9.  Leveraging the Learning Health Care Model to Improve Equity in the Age of Genomic Medicine.

Authors:  Katherine D Blizinsky; Vence L Bonham
Journal:  Learn Health Syst       Date:  2017-11-27

10.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more
  10 in total

1.  A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models.

Authors:  Zehao Yu; Xi Yang; Chong Dang; Songzi Wu; Prakash Adekkanattu; Jyotishman Pathak; Thomas J George; William R Hogan; Yi Guo; Jiang Bian; Yonghui Wu
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

2.  Extracting Patient-level Social Determinants of Health into the OMOP Common Data Model.

Authors:  Jimmy Phuong; Elizabeth Zampino; Nicholas Dobbins; Juan Espinoza; Daniella Meeker; Heidi Spratt; Charisse Madlock-Brown; Nicole G Weiskopf; Adam Wilcox
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

3.  Development and validation of a prediction model for actionable aspects of frailty in the text of clinicians' encounter notes.

Authors:  Jacob A Martin; Andrew Crane-Droesch; Folasade C Lapite; Joseph C Puhl; Tyler E Kmiec; Jasmine A Silvestri; Lyle H Ungar; Bruce P Kinosian; Blanca E Himes; Rebecca A Hubbard; Joshua M Diamond; Vivek Ahya; Michael W Sims; Scott D Halpern; Gary E Weissman
Journal:  J Am Med Inform Assoc       Date:  2021-12-28       Impact factor: 4.497

4.  Advancing Interoperability of Patient-level Social Determinants of Health Data to Support COVID-19 Research.

Authors:  Jimmy Phuong; Stephanie Hong; Matvey B Palchuk; Juan Espinoza; Daniella Meeker; David A Dorr; Galina Lozinski; Charisse Madlock-Brown; William G Adams
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23

Review 5.  A scoping review of publicly available language tasks in clinical natural language processing.

Authors:  Yanjun Gao; Dmitriy Dligach; Leslie Christensen; Samuel Tesch; Ryan Laffin; Dongfang Xu; Timothy Miller; Ozlem Uzuner; Matthew M Churpek; Majid Afshar
Journal:  J Am Med Inform Assoc       Date:  2022-09-12       Impact factor: 7.942

6.  Extracting COVID-19 diagnoses and symptoms from clinical text: A new annotated corpus and neural event extraction framework.

Authors:  Kevin Lybarger; Mari Ostendorf; Matthew Thompson; Meliha Yetisgen
Journal:  J Biomed Inform       Date:  2021-03-26       Impact factor: 8.000

7.  Sensitivity and Specificity of Real-World Social Factor Screening Approaches.

Authors:  Joshua R Vest; Wei Wu; Eneida A Mendonca
Journal:  J Med Syst       Date:  2021-11-12       Impact factor: 4.460

8.  Extracting social determinants of health from electronic health records using natural language processing: a systematic review.

Authors:  Braja G Patra; Mohit M Sharma; Veer Vekaria; Prakash Adekkanattu; Olga V Patterson; Benjamin Glicksberg; Lauren A Lepow; Euijung Ryu; Joanna M Biernacka; Al'ona Furmanchuk; Thomas J George; William Hogan; Yonghui Wu; Xi Yang; Jiang Bian; Myrna Weissman; Priya Wickramaratne; J John Mann; Mark Olfson; Thomas R Campion; Mark Weiner; Jyotishman Pathak
Journal:  J Am Med Inform Assoc       Date:  2021-11-25       Impact factor: 7.942

9.  Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding.

Authors:  Yanjun Gao; Dmitriy Dligach; Timothy Miller; Samuel Tesch; Ryan Laffin; Matthew M Churpek; Majid Afshar
Journal:  LREC Int Conf Lang Resour Eval       Date:  2022-06

10.  Event-Based Clinical Finding Extraction from Radiology Reports with Pre-trained Language Model.

Authors:  Wilson Lau; Kevin Lybarger; Martin L Gunn; Meliha Yetisgen
Journal:  J Digit Imaging       Date:  2022-10-17       Impact factor: 4.903

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.