Chengyi Zheng1, Yi Luo1, Cheryl Mercado1, Lina Sy1, Steven J Jacobsen1, Brad Ackerson2, Bruno Lewin3, Hung Fu Tseng1. 1. Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, California, USA. 2. South Bay Medical Center, Kaiser Permanente Southern California, Harbor City, California, USA. 3. Los Angeles Medical Center, Kaiser Permanente Southern California, Los Angeles, California, USA.
Abstract
IMPORTANCE: Diagnosis codes are inadequate for accurately identifying herpes zoster (HZ) ophthalmicus (HZO). There is significant lack of population-based studies on HZO due to the high expense of manual review of medical records. BACKGROUND: To assess whether HZO can be identified from the clinical notes using natural language processing (NLP). To investigate the epidemiology of HZO among HZ population based on the developed approach. DESIGN: A retrospective cohort analysis. PARTICIPANTS: A total of 49 914 southern California residents aged over 18 years, who had a new diagnosis of HZ. METHODS: An NLP-based algorithm was developed and validated with the manually curated validation data set (n = 461). The algorithm was applied on over 1 million clinical notes associated with the study population. HZO versus non-HZO cases were compared by age, sex, race and co-morbidities. MAIN OUTCOME MEASURES: We measured the accuracy of NLP algorithm. RESULTS: NLP algorithm achieved 95.6% sensitivity and 99.3% specificity. Compared to the diagnosis codes, NLP identified significant more HZO cases among HZ population (13.9% vs. 1.7%). Compared to the non-HZO group, the HZO group was older, had more males, had more Whites and had more outpatient visits. CONCLUSIONS AND RELEVANCE: We developed and validated an automatic method to identify HZO cases with high accuracy. As one of the largest studies on HZO, our finding emphasizes the importance of preventing HZ in the elderly population. This method can be a valuable tool to support population-based studies and clinical care of HZO in the era of big data.
IMPORTANCE: Diagnosis codes are inadequate for accurately identifying herpes zoster (HZ) ophthalmicus (HZO). There is significant lack of population-based studies on HZO due to the high expense of manual review of medical records. BACKGROUND: To assess whether HZO can be identified from the clinical notes using natural language processing (NLP). To investigate the epidemiology of HZO among HZ population based on the developed approach. DESIGN: A retrospective cohort analysis. PARTICIPANTS: A total of 49 914 southern California residents aged over 18 years, who had a new diagnosis of HZ. METHODS: An NLP-based algorithm was developed and validated with the manually curated validation data set (n = 461). The algorithm was applied on over 1 million clinical notes associated with the study population. HZO versus non-HZO cases were compared by age, sex, race and co-morbidities. MAIN OUTCOME MEASURES: We measured the accuracy of NLP algorithm. RESULTS: NLP algorithm achieved 95.6% sensitivity and 99.3% specificity. Compared to the diagnosis codes, NLP identified significant more HZO cases among HZ population (13.9% vs. 1.7%). Compared to the non-HZO group, the HZO group was older, had more males, had more Whites and had more outpatient visits. CONCLUSIONS AND RELEVANCE: We developed and validated an automatic method to identify HZO cases with high accuracy. As one of the largest studies on HZO, our finding emphasizes the importance of preventing HZ in the elderly population. This method can be a valuable tool to support population-based studies and clinical care of HZO in the era of big data.
Authors: Chengyi Zheng; Jonathan Duffy; In-Lu Amy Liu; Lina S Sy; Ronald A Navarro; Sunhea S Kim; Denison S Ryan; Wansu Chen; Lei Qian; Cheryl Mercado; Steven J Jacobsen Journal: JMIR Public Health Surveill Date: 2022-05-24
Authors: Sally L Baxter; Adam R Klie; Bharanidharan Radha Saseendrakumar; Gordon Y Ye; Michael Hogarth Journal: J Med Internet Res Date: 2020-08-14 Impact factor: 5.428