Literature DB >> 27896978

DEVELOPMENT AND PERFORMANCE OF TEXT-MINING ALGORITHMS TO EXTRACT SOCIOECONOMIC STATUS FROM DE-IDENTIFIED ELECTRONIC HEALTH RECORDS.

Brittany M Hollister1, Nicole A Restrepo, Eric Farber-Eger, Dana C Crawford, Melinda C Aldrich, Amy Non.   

Abstract

Socioeconomic status (SES) is a fundamental contributor to health, and a key factor underlying racial disparities in disease. However, SES data are rarely included in genetic studies due in part to the difficultly of collecting these data when studies were not originally designed for that purpose. The emergence of large clinic-based biobanks linked to electronic health records (EHRs) provides research access to large patient populations with longitudinal phenotype data captured in structured fields as billing codes, procedure codes, and prescriptions. SES data however, are often not explicitly recorded in structured fields, but rather recorded in the free text of clinical notes and communications. The content and completeness of these data vary widely by practitioner. To enable gene-environment studies that consider SES as an exposure, we sought to extract SES variables from racial/ethnic minority adult patients (n=9,977) in BioVU, the Vanderbilt University Medical Center biorepository linked to de-identified EHRs. We developed several measures of SES using information available within the de-identified EHR, including broad categories of occupation, education, insurance status, and homelessness. Two hundred patients were randomly selected for manual review to develop a set of seven algorithms for extracting SES information from de-identified EHRs. The algorithms consist of 15 categories of information, with 830 unique search terms. SES data extracted from manual review of 50 randomly selected records were compared to data produced by the algorithm, resulting in positive predictive values of 80.0% (education), 85.4% (occupation), 87.5% (unemployment), 63.6% (retirement), 23.1% (uninsured), 81.8% (Medicaid), and 33.3% (homelessness), suggesting some categories of SES data are easier to extract in this EHR than others. The SES data extraction approach developed here will enable future EHR-based genetic studies to integrate SES information into statistical analyses. Ultimately, incorporation of measures of SES into genetic studies will help elucidate the impact of the social environment on disease risk and outcomes.

Entities:  

Mesh:

Year:  2017        PMID: 27896978      PMCID: PMC5147499          DOI: 10.1142/9789813207813_0023

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  19 in total

1.  Education, genetic ancestry, and blood pressure in African Americans and Whites.

Authors:  Amy L Non; Clarence C Gravlee; Connie J Mulligan
Journal:  Am J Public Health       Date:  2012-06-14       Impact factor: 9.308

Review 2.  The social determinants of health: it's time to consider the causes of the causes.

Authors:  Paula Braveman; Laura Gottlieb
Journal:  Public Health Rep       Date:  2014 Jan-Feb       Impact factor: 2.792

3.  Education, income and ethnic differences in cumulative biological risk profiles in a national sample of US adults: NHANES III (1988-1994).

Authors:  Teresa Seeman; Sharon S Merkin; Eileen Crimmins; Brandon Koretz; Susan Charette; Arun Karlamangla
Journal:  Soc Sci Med       Date:  2007-10-24       Impact factor: 4.634

Review 4.  Using electronic health records to drive discovery in disease genomics.

Authors:  Isaac S Kohane
Journal:  Nat Rev Genet       Date:  2011-05-18       Impact factor: 53.242

5.  A new initiative on precision medicine.

Authors:  Francis S Collins; Harold Varmus
Journal:  N Engl J Med       Date:  2015-01-30       Impact factor: 91.245

6.  Principles of human subjects protections applied in an opt-out, de-identified biobank.

Authors:  Jill Pulley; Ellen Clayton; Gordon R Bernard; Dan M Roden; Daniel R Masys
Journal:  Clin Transl Sci       Date:  2010-02       Impact factor: 4.689

7.  Gene-education interactions identify novel blood pressure loci in the Framingham Heart Study.

Authors:  Jacob Basson; Yun Ju Sung; Karen Schwander; Rezart Kume; Jeannette Simino; Lisa de las Fuentes; Dabeeru Rao
Journal:  Am J Hypertens       Date:  2014-01-28       Impact factor: 2.689

8.  The modifying effect of socioeconomic status on the relationship between traffic, air pollution and respiratory health in elementary schoolchildren.

Authors:  Sabit Cakmak; Christopher Hebbern; Jasmine D Cakmak; Jennifer Vanos
Journal:  J Environ Manage       Date:  2016-04-08       Impact factor: 6.789

9.  Socioeconomic status and lung cancer: unraveling the contribution of genetic admixture.

Authors:  Melinda C Aldrich; Steve Selvin; Margaret R Wrensch; Jennette D Sison; Helen M Hansen; Charles P Quesenberry; Michael F Seldin; Lisa F Barcellos; Patricia A Buffler; John K Wiencke
Journal:  Am J Public Health       Date:  2013-08-15       Impact factor: 9.308

10.  Assessing the accuracy of observer-reported ancestry in a biorepository linked to electronic medical records.

Authors:  Logan Dumitrescu; Marylyn D Ritchie; Kristin Brown-Gentry; Jill M Pulley; Melissa Basford; Joshua C Denny; Jorge R Oksenberg; Dan M Roden; Jonathan L Haines; Dana C Crawford
Journal:  Genet Med       Date:  2010-10       Impact factor: 8.822

View more
  13 in total

1.  Using Electronic Health Records To Generate Phenotypes For Research.

Authors:  Sarah A Pendergrass; Dana C Crawford
Journal:  Curr Protoc Hum Genet       Date:  2018-12-05

2.  Assessing the capacity of social determinants of health data to augment predictive models identifying patients in need of wraparound social services.

Authors:  Suranga N Kasthurirathne; Joshua R Vest; Nir Menachemi; Paul K Halverson; Shaun J Grannis
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

3.  Identification of social determinants of health using multi-label classification of electronic health record clinical notes.

Authors:  Rachel Stemerman; Jaime Arguello; Jane Brice; Ashok Krishnamurthy; Mary Houston; Rebecca Kitzmiller
Journal:  JAMIA Open       Date:  2021-02-09

4.  Enhancing diversity to reduce health information disparities and build an evidence base for genomic medicine.

Authors:  Lucia A Hindorff; Vence L Bonham; Lucila Ohno-Machado
Journal:  Per Med       Date:  2018-09-13       Impact factor: 2.512

Review 5.  Social Determinants of Mental Health: Where We Are and Where We Need to Go.

Authors:  Margarita Alegría; Amanda NeMoyer; Irene Falgàs Bagué; Ye Wang; Kiara Alvarez
Journal:  Curr Psychiatry Rep       Date:  2018-09-17       Impact factor: 5.285

Review 6.  Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors:  Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal:  Seizure       Date:  2021-01-13       Impact factor: 3.184

7.  Extracting Country-of-Origin from Electronic Health Records for Gene- Environment Studies as Part of the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) Study.

Authors:  Eric Farber-Eger; Robert Goodloe; Jonathan Boston; William S Bush; Dana C Crawford
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26

8.  Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning.

Authors:  Haohan Wang; Xiang Liu; Yifeng Tao; Wenting Ye; Qiao Jin; William W Cohen; Eric P Xing
Journal:  Pac Symp Biocomput       Date:  2019

9.  Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UK.

Authors:  Natasha Chilman; Xingyi Song; Angus Roberts; Esther Tolani; Robert Stewart; Zoe Chui; Karen Birnie; Lisa Harber-Aschan; Billy Gazard; David Chandran; Jyoti Sanyal; Stephani Hatch; Anna Kolliakou; Jayati Das-Munshi
Journal:  BMJ Open       Date:  2021-03-25       Impact factor: 2.692

10.  Leveraging the Learning Health Care Model to Improve Equity in the Age of Genomic Medicine.

Authors:  Katherine D Blizinsky; Vence L Bonham
Journal:  Learn Health Syst       Date:  2017-11-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.