| Literature DB >> 35789918 |
Andrew G Rundle1, Michael D M Bader2, Stephen J Mooney3.
Abstract
Purpose of review: Innovations in information technology, initiatives by local governments to share administrative data, and growing inventories of data available from commercial data aggregators have immensely expanded the information available to describe neighborhood environments, supporting an approach to research we call Urban Health Informatics. This review evaluates the application of machine learning to this new wealth of data for studies of the effects of neighborhood environments on health. Recent findings: Prominent machine learning applications in this field include automated image analysis of archived imagery such as Google Street View images, variable selection methods to identify neighborhood environment factors that predict health outcomes from large pools of exposure variables, and spatial interpolation methods to estimate neighborhood conditions across large geographic areas. Summary: In each domain, we highlight successes and cautions in the application of machine learning, particularly highlighting legal issues in applying machine learning approaches to Google's geo-spatial data.Entities:
Keywords: Google Street View; Machine learning; Neighborhood environments; Neighborhood-wide association studies; Spatial interpolation; Urban health informatics
Year: 2022 PMID: 35789918 PMCID: PMC9244309 DOI: 10.1007/s40471-022-00296-7
Source DB: PubMed Journal: Curr Epidemiol Rep
Summary of machine learning applications
| Application | Uses | Issues |
|---|---|---|
| Automated image analysis | Accelerate virtual systematic social observation (VSSO) methods. Increase the density of sampled locations and expand the geographic coverage of VSSO studies | Legal issues with Google’s terms of use for Google Maps and Street View |
| Variable selection | Application of GWAS and EWAS-style studies to large pools of neighborhood-level variables | Choosing from the array of possible selection algorithms: results from different algorithms may disagree |
| Spatial interpolation | Characterize neighborhood conditions over large areas using data from a sample of locations | Researchers using “off-the-shelf” existing data versus researchers setting their own sampling plan for collecting data: need to account for uncertainty in the estimation of the neighborhood-level data |
Fig. 1A common workflow for machine learning applied to Google Street View Images and its relationship to activities prohibited by Google’s terms of use [26]
Fig. 2Schematic diagram of a neighborhood environment-wide association study as applied to selecting variables that predict physical activity levels in older adults [41]