Literature DB >> 31329882

Underserved populations with missing race ethnicity data differ significantly from those with structured race/ethnicity documentation.

Evan T Sholle1, Laura C Pinheiro2, Prakash Adekkanattu1, Marcos A Davila1, Stephen B Johnson3, Jyotishman Pathak3, Sanjai Sinha2, Cassidie Li2, Stasi A Lubansky2, Monika M Safford2, Thomas R Campion1,3,4.   

Abstract

OBJECTIVE: We aimed to address deficiencies in structured electronic health record (EHR) data for race and ethnicity by identifying black and Hispanic patients from unstructured clinical notes and assessing differences between patients with or without structured race/ethnicity data.
MATERIALS AND METHODS: Using EHR notes for 16 665 patients with encounters at a primary care practice, we developed rule-based natural language processing (NLP) algorithms to classify patients as black/Hispanic. We evaluated performance of the method against an annotated gold standard, compared race and ethnicity between NLP-derived and structured EHR data, and compared characteristics of patients identified as black or Hispanic using only NLP vs patients identified as such only in structured EHR data.
RESULTS: For the sample of 16 665 patients, NLP identified 948 additional patients as black, a 26%increase, and 665 additional patients as Hispanic, a 20% increase. Compared with the patients identified as black or Hispanic in structured EHR data, patients identified as black or Hispanic via NLP only were older, more likely to be male, less likely to have commercial insurance, and more likely to have higher comorbidity. DISCUSSION: Structured EHR data for race and ethnicity are subject to data quality issues. Supplementing structured EHR race data with NLP-derived race and ethnicity may allow researchers to better assess the demographic makeup of populations and draw more accurate conclusions about intergroup differences in health outcomes.
CONCLUSIONS: Black or Hispanic patients who are not documented as such in structured EHR race/ethnicity fields differ significantly from those who are. Relatively simple NLP can help address this limitation.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  electronic health record; ethnicity; natural language processing; race

Mesh:

Year:  2019        PMID: 31329882      PMCID: PMC6696506          DOI: 10.1093/jamia/ocz040

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  17 in total

1.  Barriers to collecting patient race, ethnicity, and primary language data in physician practices: an exploratory study.

Authors:  Romana Hasnain-Wynia; Kevin Van Dyke; Mara Youdelman; Christian Krautkramer; Susan L Ivey; Robert Gilchick; Erin Kaleba; Matthew K Wynia
Journal:  J Natl Med Assoc       Date:  2010-09       Impact factor: 1.798

2.  Unequal treatment: confronting racial and ethnic disparities in health care.

Authors:  Alan Nelson
Journal:  J Natl Med Assoc       Date:  2002-08       Impact factor: 1.798

3.  Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure.

Authors:  Jennifer H Garvin; Scott L DuVall; Brett R South; Bruce E Bray; Daniel Bolton; Julia Heavirland; Steve Pickard; Paul Heidenreich; Shuying Shen; Charlene Weir; Matthew Samore; Mary K Goldstein
Journal:  J Am Med Inform Assoc       Date:  2012-03-21       Impact factor: 4.497

4.  The "meaningful use" regulation for electronic health records.

Authors:  David Blumenthal; Marilyn Tavenner
Journal:  N Engl J Med       Date:  2010-07-13       Impact factor: 91.245

5.  Automated identification of patients with a diagnosis of binge eating disorder from narrative electronic health records.

Authors:  Brandon K Bellows; Joanne LaFleur; Aaron W C Kamauu; Thomas Ginter; Tyler B Forbush; Stephen Agbor; Dylan Supina; Paul Hodgkins; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2013-11-07       Impact factor: 4.497

Review 6.  Natural language processing: an introduction.

Authors:  Prakash M Nadkarni; Lucila Ohno-Machado; Wendy W Chapman
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

7.  Tracking health disparities through natural-language processing.

Authors:  Mark L Wieland; Stephen T Wu; Vinod C Kaggal; Barbara P Yawn
Journal:  Am J Public Health       Date:  2013-01-17       Impact factor: 9.308

8.  Obtaining data on patient race, ethnicity, and primary language in health care organizations: current challenges and proposed solutions.

Authors:  Romana Hasnain-Wynia; David W Baker
Journal:  Health Serv Res       Date:  2006-08       Impact factor: 3.402

9.  Accuracy of race, ethnicity, and language preference in an electronic health record.

Authors:  Elissa V Klinger; Sara V Carlini; Irina Gonzalez; Stella St Hubert; Jeffrey A Linder; Nancy A Rigotti; Emily Z Kontos; Elyse R Park; Lucas X Marinacci; Jennifer S Haas
Journal:  J Gen Intern Med       Date:  2014-12-20       Impact factor: 5.128

10.  Longitudinal analysis of pain in patients with metastatic prostate cancer using natural language processing of medical record text.

Authors:  Norris H Heintzelman; Robert J Taylor; Lone Simonsen; Roger Lustig; Doug Anderko; Jennifer A Haythornthwaite; Lois C Childs; George Steven Bova
Journal:  J Am Med Inform Assoc       Date:  2012-11-09       Impact factor: 4.497

View more
  18 in total

1.  Challenges with quality of race and ethnicity data in observational databases.

Authors:  Fernanda C G Polubriaginof; Patrick Ryan; Hojjat Salmasian; Andrea Wells Shapiro; Adler Perotte; Monika M Safford; George Hripcsak; Shaun Smith; Nicholas P Tatonetti; David K Vawdrey
Journal:  J Am Med Inform Assoc       Date:  2019-08-01       Impact factor: 4.497

2.  Health informatics and health equity: improving our reach and impact.

Authors:  Tiffany C Veinot; Jessica S Ancker; Suzanne Bakken
Journal:  J Am Med Inform Assoc       Date:  2019-08-01       Impact factor: 4.497

3.  Diversity in Machine Learning: A Systematic Review of Text-Based Diagnostic Applications.

Authors:  Lane Fitzsimmons; Maya Dewan; Judith W Dexheimer
Journal:  Appl Clin Inform       Date:  2022-05-25       Impact factor: 2.762

4.  A Method to Improve Availability and Quality of Patient Race Data in an Electronic Health Record System.

Authors:  Marika M Cusick; Evan T Sholle; Marcos A Davila; Joseph Kabariti; Curtis L Cole; Thomas R Campion
Journal:  Appl Clin Inform       Date:  2020-11-25       Impact factor: 2.342

5.  Addressing Challenges and Implications of National Surveillance for Racial/Ethnic Disparities in Diabetes.

Authors:  Kunihiro Matsushita; Olive Tang; Elizabeth Selvin
Journal:  JAMA       Date:  2019-12-24       Impact factor: 56.272

6.  Development and validation of a machine learning algorithm for predicting the risk of postpartum depression among pregnant women.

Authors:  Yiye Zhang; Shuojia Wang; Alison Hermann; Rochelle Joly; Jyotishman Pathak
Journal:  J Affect Disord       Date:  2020-09-30       Impact factor: 4.839

Review 7.  Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors:  Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal:  Seizure       Date:  2021-01-13       Impact factor: 3.184

8.  Automated Extraction of Tumor Staging and Diagnosis Information From Surgical Pathology Reports.

Authors:  Sajjad Abedian; Evan T Sholle; Prakash M Adekkanattu; Marika M Cusick; Stephanie E Weiner; Jonathan E Shoag; Jim C Hu; Thomas R Campion
Journal:  JCO Clin Cancer Inform       Date:  2021-10

9.  Barriers to identifying residents with dementia for embedded pragmatic trials: A call to action.

Authors:  Ellen McCreedy; Andrea Gilmore-Bykovskyi; David A Dorr; Julie Lima; Ellen P McCarthy; David J Meyers; Richard Platt; V G Vinod Vydiswaran; Julie P W Bynum
Journal:  J Am Geriatr Soc       Date:  2021-11-02       Impact factor: 5.562

10.  Assessing the impact of social determinants of health on predictive models for potentially avoidable 30-day readmission or death.

Authors:  Yongkang Zhang; Yiye Zhang; Evan Sholle; Sajjad Abedian; Marianne Sharko; Meghan Reading Turchioe; Yiyuan Wu; Jessica S Ancker
Journal:  PLoS One       Date:  2020-06-25       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.