| Literature DB >> 30349632 |
Mark Abraham Magumba1, Peter Nabende1, Ernest Mwebaze2.
Abstract
The social web has emerged as a dominant information architecture accelerating technology innovation on an unprecedented scale. The utility of these developments to public health use cases like disease surveillance, information dissemination, outbreak prediction and so forth has been widely investigated and variously demonstrated in work spanning several published experimental studies and deployed systems. In this paper we provide an overview of automated disease surveillance efforts based on the social web characterized by their different high level design choices regarding functional aspects like user participation and language parsing approaches. We briefly discuss the technical rationale and practical implications of these different choices in addition to the key limitations associated with these systems within the context of operable disease surveillance. We hope this can offer some technical guidance to multi-disciplinary teams on how best to implement, interpret and evaluate disease surveillance programs based on the social web.Entities:
Keywords: Crowd sourced Disease surveillance; Data Mining; Knowledge Engineering; Participatory Epidemiology; The social web
Year: 2018 PMID: 30349632 PMCID: PMC6194101 DOI: 10.5210/ojphi.v10i2.9312
Source DB: PubMed Journal: Online J Public Health Inform ISSN: 1947-2579
Figure 1Analytical Pipeline for Disease Surveillance applications on the social web
Figure 2Activity cycle for textual Analytical pipeline for disease surveillance applications on the Social Web
Figure 3Disease forecasting, nowcasting and real-time monitoring vs. disease progression timeline
Figure 4Proportion of worldwide internet users by access mode. Source: gs.statcounter.com [102]
Figure 5Mobile OS market share for different platforms. Source: Statista.com [103]
Figure 6Yearly global sales of smart wearables from 2014 to 2017 by category from 2014 to2017. Source: Statista.com [104].
Figure 7District level epidemic trajectories for the 2014-2015 Ebola outbreak in Sierra Leone. Weekly incidence records for each district are shown as colored ‘x’, solid line in the corresponding color is the approximate average incidence. Dates shown on the x-axis (dd/mm/yy) are endings of epidemic weeks. Source: Yang et al [105]
A comparison of the merits and demerits of different design choices for automated disease surveillance on the social web
|
|
| | |
|
| User Participation Model | Explicit User Participation | -Higher quality data | Limits potential pool of participants | |
| Implicit user Participation | All social media users are a potential source of data | | | |
| Language Parsing Methodology | Rule Based Parsing | Computationally inexpensive | Commonest approach of text matching is susceptible to false positives | |
| Machine learning approaches | Better accuracy | | | |
| Multiplicity of data sources | Single Data Source | low operational overhead in terms of data acquisition | Resulting models may be less robust than models built from multiple data sources | |
| Multiple Data Sources | -Models are likely to be more robust than those from a single data source | High operational overhead regarding data acquisition for instance maintaining multiple APIs | | |
| System Deployment Options | Web deployment | -Single deployment may cater for multiple devices since the application is accessible to any device with a browser | -Application may not perform optimally
across different devices and | |
| Mobile deployment | -Enjoy deeper platform integration on mobile devices allowing for a more intuitive user interface for instance on Android developers can take advantage of phone features like the speaker, accelerometer, vibration and GPS location | -Unlike web deployments each platform requires its own version of the application and therefore there can be considerable development effort if it is intended to serve multiple platforms |