Literature DB >> 31361300

Development of a global infectious disease activity database using natural language processing, machine learning, and human expertise.

Joshua Feldman1, Andrea Thomas-Bachli2,3, Jack Forsyth2,3, Zaki Hasnain Patel2,3, Kamran Khan2,3,4.   

Abstract

OBJECTIVE: We assessed whether machine learning can be utilized to allow efficient extraction of infectious disease activity information from online media reports.
MATERIALS AND METHODS: We curated a data set of labeled media reports (n = 8322) indicating which articles contain updates about disease activity. We trained a classifier on this data set. To validate our system, we used a held out test set and compared our articles to the World Health Organization Disease Outbreak News reports.
RESULTS: Our classifier achieved a recall and precision of 88.8% and 86.1%, respectively. The overall surveillance system detected 94% of the outbreaks identified by the WHO covered by online media (89%) and did so 43.4 (IQR: 9.5-61) days earlier on average. DISCUSSION: We constructed a global real-time disease activity database surveilling 114 illnesses and syndromes. We must further assess our system for bias, representativeness, granularity, and accuracy.
CONCLUSION: Machine learning, natural language processing, and human expertise can be used to efficiently identify disease activity from digital media reports.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  communicable diseases; health information systems; internet; machine learning; public health surveillance

Mesh:

Year:  2019        PMID: 31361300      PMCID: PMC7647217          DOI: 10.1093/jamia/ocz112

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  12 in total

1.  Comparison of web-based biosecurity intelligence systems: BioCaster, EpiSPIDER and HealthMap.

Authors:  A Lyon; M Nunn; G Grossel; M Burgman
Journal:  Transbound Emerg Dis       Date:  2011-12-20       Impact factor: 5.005

Review 2.  An overview of internet biosurveillance.

Authors:  D M Hartley; N P Nelson; R R Arthur; P Barboza; N Collier; N Lightfoot; J P Linge; E van der Goot; A Mawudeku; L C Madoff; L Vaillant; R Walters; R Yangarber; J Mantero; C D Corley; J S Brownstein
Journal:  Clin Microbiol Infect       Date:  2013-06-21       Impact factor: 8.067

3.  Big Data for Infectious Disease Surveillance and Modeling.

Authors:  Shweta Bansal; Gerardo Chowell; Lone Simonsen; Alessandro Vespignani; Cécile Viboud
Journal:  J Infect Dis       Date:  2016-12-01       Impact factor: 5.226

4.  Timeliness of nongovernmental versus governmental global outbreak communications.

Authors:  Luke Mondor; John S Brownstein; Emily Chan; Lawrence C Madoff; Marjorie P Pollack; David L Buckeridge; Timothy F Brewer
Journal:  Emerg Infect Dis       Date:  2012-07       Impact factor: 6.883

5.  Electronic event-based surveillance for monitoring dengue, Latin America.

Authors:  Anne G Hoen; Mikaela Keller; Aman D Verma; David L Buckeridge; John S Brownstein
Journal:  Emerg Infect Dis       Date:  2012-07       Impact factor: 6.883

6.  Quantifying the effect of media limitations on outbreak data in a global online web-crawling epidemic intelligence system, 2008-2011.

Authors:  David Scales; Alexei Zelenev; John S Brownstein
Journal:  Emerg Health Threats J       Date:  2013-11-08

7.  Evaluation of local media surveillance for improved disease recognition and monitoring in global hotspot regions.

Authors:  Jessica S Schwind; David J Wolking; John S Brownstein; Jonna A K Mazet; Woutrina A Smith
Journal:  PLoS One       Date:  2014-10-15       Impact factor: 3.240

8.  Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks.

Authors:  Saurav Ghosh; Prithwish Chakraborty; Elaine O Nsoesie; Emily Cohn; Sumiko R Mekaru; John S Brownstein; Naren Ramakrishnan
Journal:  Sci Rep       Date:  2017-01-19       Impact factor: 4.379

Review 9.  Digital disease detection: A systematic review of event-based internet biosurveillance systems.

Authors:  Jesse O'Shea
Journal:  Int J Med Inform       Date:  2017-02-08       Impact factor: 4.046

10.  Evaluation of epidemic intelligence systems integrated in the early alerting and reporting project for the detection of A/H5N1 influenza events.

Authors:  Philippe Barboza; Laetitia Vaillant; Abla Mawudeku; Noele P Nelson; David M Hartley; Lawrence C Madoff; Jens P Linge; Nigel Collier; John S Brownstein; Roman Yangarber; Pascal Astagneau
Journal:  PLoS One       Date:  2013-03-05       Impact factor: 3.240

View more
  3 in total

Review 1.  Advancing Artificial Intelligence in Health Settings Outside the Hospital and Clinic.

Authors:  Nakul Aggarwal; Mahnoor Ahmed; Sanjay Basu; John J Curtin; Barbara J Evans; Michael E Matheny; Shantanu Nundy; Mark P Sendak; Carmel Shachar; Rashmee U Shah; Sonoo Thadaney-Israni
Journal:  NAM Perspect       Date:  2020-11-30

Review 2.  Public Health and Epidemiology Informatics: Recent Research Trends Moving toward Public Health Data Science.

Authors:  Sébastien Cossin; Rodolphe Thiébaut
Journal:  Yearb Med Inform       Date:  2020-08-21

3.  Automated Classification of Online Sources for Infectious Disease Occurrences Using Machine-Learning-Based Natural Language Processing Approaches.

Authors:  Mira Kim; Kyunghee Chae; Seungwoo Lee; Hong-Jun Jang; Sukil Kim
Journal:  Int J Environ Res Public Health       Date:  2020-12-17       Impact factor: 3.390

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.