| Literature DB >> 24499879 |
Lora E Fleming1, Andy Haines2, Brian Golding3, Anthony Kessel4, Anna Cichowska5, Clive E Sabel6, Michael H Depledge7, Christophe Sarran8, Nicholas J Osborne9, Ceri Whitmore10, Nicola Cocksedge11, Daniel Bloomfield12.
Abstract
Linking environmental, socioeconomic and health datasets provides new insights into the potential associations between climate change and human health and wellbeing, and underpins the development of decision support tools that will promote resilience to climate change, and thus enable more effective adaptation. This paper outlines the challenges and opportunities presented by advances in data collection, storage, analysis, and access, particularly focusing on "data mashups". These data mashups are integrations of different types and sources of data, frequently using open application programming interfaces and data sources, to produce enriched results that were not necessarily the original reason for assembling the raw source data. As an illustration of this potential, this paper describes a recently funded initiative to create such a facility in the UK for use in decision support around climate change and health, and provides examples of suitable sources of data and the purposes to which they can be directed, particularly for policy makers and public health decision makers.Entities:
Mesh:
Year: 2014 PMID: 24499879 PMCID: PMC3945564 DOI: 10.3390/ijerph110201725
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Exemplar database linkages/mashups of climatic and environmental with human health data.
| Institution/Project | Brief Description and Links |
|---|---|
| EVO | The Environmental Virtual Observatory (EVO) is a proof of concept project with NERC funding that has been created to demonstrate that linking data, models and expert knowledge will provide cost effective answers to vital wide-ranging environmental issues, initially in the soil-water system. The project exploits cloud computing to develop new applications for accessing, filtering and synthesising data to develop new knowledge and evaluation tools. It investigates possible structures for the cloud environment and develops exemplars at a local and national scale to demonstrate how the EVO could make environmental monitoring and decision making more efficient, effective and transparent to the whole community. |
| ECDC E3 Geoportal | The objective of European Centres for Disease Control (ECDC) E3 Geoportal is to promote geospatial infectious disease modelling in Europe and its integration in public health. There are many different determinants of infectious disease transmission but they are often highly dispersed and/or difficult to obtain. The E3 Geoportal will facilitate the collection and exchange of these datasets in a user-friendly manner. It is an inventory of information and resources which are collected, maintained, and managed by a collaborative effort under the European Environment and Epidemiology Network. |
| SAIL (Wales) | The Secure Anonymised Information Linkage (SAIL) Databank is a large scale data warehouse technology. The SAIL system links together the widest possible range of person-based data using robust anonymisation techniques on the College of Medicine’s IBM supercomputer and bespoke data transportation fabric to a wide range of NHS systems in Wales, allowing for future data mashups. SAIL is continually expanding, both in types of dataset and in geographical coverage, and many additional organisations have since provided, or agreed to provide, their datasets. Through the robust processes that have been developed and implemented, this growing databank represents a valuable resource for health-related research and service development, whilst complying with the requirements of data protection legislation and confidentiality guidelines. |
| URGENCHE Project | Urban Reduction of GHG Emissions in China and Europe (URGENCHE) is a FP7 funded project bringing together a team of internationally recognised scientists to develop and apply a methodological framework for the assessment of the overall risks and benefits of alternative greenhouse gas (GHG) emission reduction policies for health and well-being in China and Europe. |
| NOAA MATCH | NOAA Metadata Access Tool for Climate Change and Health (MATCH) is a publicly accessible, online tool for researchers that offers centralized access to metadata (standardized contextual information) about thousands of government-held datasets related to health, the environment, and climate-science. |
| PULSE-Brazil | NERC-funded project involving the University of Exeter, the Met Office and Brazilian partners. PULSE-Brazil brings together health data (especially respiratory health) and environmental data. It uses different kinds of data (e.g. satellite records on fires in the Amazon) and it has a different main output (a tool to support decision makers, rather than a platform to aid researchers). Both projects can learn from each other across a range of technical, methodological and theoretical issues. |
| ESCAPE | EU funded project on long-term health risks to air pollution exposure. ESCAPE concentrates on respiratory, cardiovascular, cancer and pregnancy-related risks. The project’s communications strategy concentrates on producing material for use with patient groups. |
| AVOID | The Met Office Hadley Centre has datasets produced under the DECC/Defra funded Avoiding Danger Climate Change (AVOID) Programme. Includes observations programme to measure salinity, current velocity and temperature in the upper oceans. |
| EO2HEAVEN | EO2HEAVEN (Earth Observation and Environmental Modelling for the Mitigation of Health Risks) was a research project co-funded by the European Commission as part of the 7th Framework Programme (FP7) Environmental theme. EO2HEAVEN contributed to a better understanding of the complex relationships between environmental changes and their impact on human health. The project monitored changes induced by human activities, with emphasis on atmospheric, river, lake and coastal marine pollution. The result of this collaboration was the design and development of a GIS-based system upon an open and standards-based Spatial Information Infrastructure (SII) envisaged as a helpful tool for research of human exposure and early detection of infections. |
| EXPOSOMICS/HELIX | EXPOSOMICS is an EU funded project, led by Imperial College, and involving institutions from six other countries. It aims to predict individual disease risk from examining drinking water and air-borne contaminants; health data (long-term cohorts) and environmental data will be analysed together. HELIX project is an EU funded project, led by the Centre for Research in Environmental Epidemiology (CREAL) involving institutions from eight other countries. It is focused on the early life exposome since pregnancy and the early years of life are well recognized to be periods of high susceptibility to environmental damage with lifetime consequences. |
Summary table of health databases the MED MI Partnership has identified, as well as other databases for potential future collaborations (more details at http://bit.ly/OZwgxo).
| Health | CPRD | UK Biobank | ONS Mortality | Million Women | DSSS | RCGP WRS | LABBASE PHE | ELS PHE | Vec S PHE |
|---|---|---|---|---|---|---|---|---|---|
| From | 2012 | 2010 | 1836 | 1996 | 1999 | 1967 | 1975 * | 1980 | 1900 ** |
| Cohort | 20M patients registered with general practitioners (projected) | 503,316 | n/a | 1.36 m | n/a | n/a | n/a | n/a | n/a |
| Area | England | UK | England + Wales | UK | England + Wales | England + Wales | England | England | England |
| Info. | Underpins a comprehensive interventional research service. Extremely comprehensive and vital to this kind of linking research. | Age 40–69 Very broad range of genetic variables, phenotypic and exposure data. | All causes of death | Women 50–64. Special focus on HRT and breast health. | Self-reported, including cold, flu, fever, rash, heat | GP-diagnosis | Lab diagnosis | Clinical diagnosis | Vector distr. |
| Geo-ref | Postcode | Postcode | Postcode | Postcode | SHA *** | SHA *** | Postcode | Postcode | Grid Ref |
DSSS: Direct Syndromic Surveillance System (Public Health England (PHE)/NHS Direct); PHE has access for surveillance, use for research to be negotiated with NHS Direct); RCGP WRS: Royal College of General Practitioners Weekly Returns Service; LABBASE: Laboratory confirmed diagnoses; ELS: Enhanced Legionella Surveillance; Vec S: Vectors Surveillance (ticks and mosquitoes); * Giardia (1975), Campylobacter (1976), Legionella (1977), Salmonella (1980), Cryptosporidium (1983), Lyme disease (1986); ** More intensive surveillance from 2005; *** Possible to use first part of postcode.
Summary table of potential health databases the MED MI Partnership has identified, as well as other databases for potential future collaborations (more details at http://bit.ly/OZwgxo).
| Health (Possible Future Collaborations) | ARS | 1958 Birth Cohort | ALSPAC | ELSA | CFAS I, II | White-Hall II |
|---|---|---|---|---|---|---|
| From | 2000 | 1958 | 1991 | 2002 | 1989 | 1985 |
| Cohort | n/a | 17,416 | 14,000 | 13,500 | 18 k | 10,308 |
| Area | UK | GB | Avon | UK | UK/London | |
| Information | Response times to 999 calls (weather-related) | Single week, with follow-ups | Strong environmental and genetic data | Age > 50. Health and social. Ongoing study with new recruits. | Age 65+. Genetic and other data. | Age 35–55 in 1985–1988 Civil Service staff |
| Geo-ref | Postcode | Wards | Postcode | Postcode | Postcode | Postcode |
ARS: Ambulance Response Data (12 regional trusts); ALSPAC: Avon Longitudinal Study Parents and Children; ELSA: English Longitudinal Study of Ageing; CFAS: Cognitive Function and Ageing Studies.
Summary table of environmental databases the MED MI Partnership has identified, as well as other databases for potential future collaborations (more details at http://bit.ly/OZwgxo).
| Environment | MIDAS | Pollen | Daily Land | Monthly Land | Daily Sea Surface Temperature Gridded 5 km | Marine Biotoxins |
|---|---|---|---|---|---|---|
| From | 1961 (a few further back to 1850) | <1950 | 1961 | 1961 | 1985 | 2001: England & Wales; 2005: Scotland |
| Area | UK and coastal ships | UK land | UK land | UK land | Global Ocean | UK coastal locations |
| Availability | Research License via BADC or from Met Office | Owned by MAARA/Pollen UK (see letter of support) | Research License from Met Office | Research License from Met Office | Freely available through MyOcean. | Owned by CEFAS on behalf of Food Standards Authority |
| Information | 450 stations supply daily: mean, maximum & minimum temperatures; sunshine amount; snow depth at 09:00 UTC | Over three decades of data on airborne pollen and fungal spores. | Daily mean temperature, daily max temperature, daily minimum temperature, precipitation amount all provided for each 5 km grid square. | Precipitation amount, weather type, sunshine amount, provided for each month for each 5 km grid square. | Sea Surface Temperature retrieved from a combination of remote and in situ measurement at a resolution of 1/20 degree (~5 km). | Sampling records for a variety of sites around the UK coastline. Changes in sampling practice make year-to-year trends difficult to extract, but case study comparison with simulated results should be possible. |
| Geo-ref | Latitude & longitude; height above mean sea level | Latitude & longitude; height above mean sea level | Latitude & Longitude | Latitude & Longitude |
Potential challenges for and with data mashups.
| Potential Challenges |
|---|
Mashup access, governance, and ownership |
Access to and ownership of original data |
Training of personnel and users |
Rapidly changing hardware and software |
Funding and resources (including long term secure data storage and appropriate staffing) to ensure longevity |
Confidentiality of data |
International standardization of data |
Different types of complex data with issues of variable granularity, time spans, “richness”, certainty, |
Creation and maintenance of data documentation |
Understanding of the uncertainty of the data |
Need for and understanding of new methods of modeling and statistics |
Interpretation of data, analyses and findings |
Interpretation and evaluation of new associations for validity and strength |
Use of real time data to make decisions |
Evaluation of use and effectiveness of the mashup |
Ability to look at big picture without obscuring smaller issues (such as effects on subpopulations) |
Communication of the uncertainty of data and findings |
Interactions with wide variety of stakeholders |
Maintenance of the mashup and its resources over long periods of time |