| Literature DB >> 30022056 |
Kate A Timmins1, Mark A Green2, Duncan Radley3, Michelle A Morris4, Jamie Pearce5.
Abstract
There has been growing interest in the potential of 'big data' to enhance our understanding in medicine and public health. Although there is no agreed definition of big data, accepted critical components include greater volume, complexity, coverage and speed of availability. Much of these data are 'found' (as opposed to 'made'), in that they have been collected for non-research purposes, but could include valuable information for research. The aim of this paper is to review the contribution of 'found' data to obesity research to date, and describe the benefits and challenges encountered. A narrative review was conducted to identify and collate peer-reviewed research studies. Database searches conducted up to September 2017 found original studies using a variety of data types and sources. These included: retail sales, transport, geospatial, commercial weight management data, social media, and smartphones and wearable technologies. The narrative review highlights the variety of data uses in the literature: describing the built environment, exploring social networks, estimating nutrient purchases or assessing the impact of interventions. The examples demonstrate four significant ways in which 'found' data can complement conventional 'made' data: firstly, in moving beyond constraints in scope (coverage, size and temporality); secondly, in providing objective, quantitative measures; thirdly, in reaching hard-to-access population groups; and lastly in the potential for evaluating real-world interventions. Alongside these opportunities, 'found' data come with distinct challenges, such as: ethical and legal questions around access and ownership; commercial sensitivities; costs; lack of control over data acquisition; validity; representativeness; finding appropriate comparators; and complexities of data processing, management and linkage. Despite widespread recognition of the opportunities, the impact of 'found' data on academic obesity research has been limited. The merit of such data lies not in their novelty, but in the benefits they could add over and above, or in combination with, conventionally collected data.Entities:
Mesh:
Year: 2018 PMID: 30022056 PMCID: PMC6291419 DOI: 10.1038/s41366-018-0153-7
Source DB: PubMed Journal: Int J Obes (Lond) ISSN: 0307-0565 Impact factor: 5.095
Summary of the implementation, value and challenges of ‘found’ data in obesity research
| Data type | Examples of data | Research aims | Value added over conventional data | Challenges |
|---|---|---|---|---|
| Retail sales | •Retailer data sets [ | •Monitoring nutrient availability | •Wider population coverage | •Sampling: |
| Transport | •Road/transport cameras/sensors [ | •Investigating ecological association between motorised transport and obesity | •Enhanced detail—times, spatial locations, frequencies, duration, cost, multiple modes | •Lack of complementary data on other relevant behaviours and outcomes |
| Commercial weight management programmes | •NHS referred patients to commercial programmes [ | •Real-world service evaluations | •Scale | •Data accessibility, quality, completeness and representativeness |
| Geospatial | •Web mapping platforms [ | •Measuring features of the built environment | •Finer resolution data: | •Lack of complementary data on other relevant behaviours and outcomes |
| Social media | •Twitter [ | •Geographically locating posts about certain behaviours | •Offers social perspective | •Sample bias and representativeness |
| Smartphones and wearable technologies | •Step count apps [ | •Exploring activity patterns across populations | •Wide-scale coverage, international | •Sample bias and representativeness |