Literature DB >> 33215073

Using electronic health records for population health sciences: a case study to evaluate the associations between changes in left ventricular ejection fraction and the built environment.

Yiye Zhang1,2, Mohammad Tayarani3, Subhi J Al'Aref4, Ashley N Beecy5, Yifan Liu1, Evan Sholle1, Arindam RoyChoudhury1, Kelly M Axsom6, Huaizhu Oliver Gao3, Jyotishman Pathak1, Jessica S Ancker1.   

Abstract

OBJECTIVE: Electronic health record (EHR) data linked with address-based metrics using geographic information systems (GIS) are emerging data sources in population health studies. This study examined this approach through a case study on the associations between changes in ejection fraction (EF) and the built environment among heart failure (HF) patients.
MATERIALS AND METHODS: We identified 1287 HF patients with at least 2 left ventricular EF measurements that are minimally 1 year apart. EHR data were obtained at an academic medical center in New York for patients who visited between 2012 and 2017. Longitudinal clinical information was linked with address-based built environment metrics related to transportation, air quality, land use, and accessibility by GIS. The primary outcome is the increase in the severity of EF categories. Statistical analyses were performed using mixed-effects models, including a subgroup analysis of patients who initially had normal EF measurements.
RESULTS: Previously reported effects from the built environment among HF patients were identified. Increased daily nitrogen dioxide concentration was associated with the outcome while controlling for known HF risk factors including sex, comorbidities, and medication usage. In the subgroup analysis, the outcome was significantly associated with decreased distance to subway stops and increased distance to parks.
CONCLUSIONS: Population health studies using EHR data may drive efficient hypothesis generation and enable novel information technology-based interventions. The availability of more precise outcome measurements and home locations, and frequent collection of individual-level social determinants of health may further drive the use of EHR data in population health studies.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association.

Entities:  

Keywords:  built environment; cardiovascular diseases; electronic health records; geographic information system; public health informatics

Year:  2020        PMID: 33215073      PMCID: PMC7660965          DOI: 10.1093/jamiaopen/ooaa038

Source DB:  PubMed          Journal:  JAMIA Open        ISSN: 2574-2531


Lay summary

Electronic health record (EHR) data linked with address-based metrics using geographic information systems (GIS) are emerging data sources in population health studies. We examined the relationship between the built environment and the changes in ejection fraction (EF) using EHR data at an urban academic medical center. Longitudinal clinical information of 1287 heart failure (HF) patients was linked with address-based built environment metrics related to transportation, air quality, land use, and accessibility by GIS. The primary outcome is the increase in the severity of EF categories. Statistical analyses were performed using mixed-effects models, including a subgroup analysis of patients who initially had normal EF measurements. Increased daily nitrogen dioxide concentration was associated with the outcome while controlling for known HF risk factors including sex, comorbidities, and medication usage. In the subgroup analysis that was performed among patients with initially normal EF measurements, the outcome was significantly associated with decreased distance to subway stops and increased distance to parks. The found association on air quality is consistent with known literature, whereas the accessibility to the subway and parks present new evidence to be validated with bigger datasets in the future. EHR data may drive efficient hypothesis generation for population health studies.

INTRODUCTION

Population health studies have commonly been defined by cohort identification and follow-up in the last decades. The success of population health studies is largely determined by the available funding to define and follow-up patient cohorts as well as the theories and hypotheses that drive the study designs. Today, researchers across the domains in medicine and healthcare are increasingly drawn to electronic health records (EHRs), a source of routinely collected observational health data that potentially reduces the burden of cohort identification and follow-up while accelerating the hypothesis generation process. EHR data have seen a wide variety of use cases, ranging from disease prediction, computational phenotyping, drug discovery, to personalized treatment strategies. Cited for its routine availability, large volume, and rich details, EHR data are expected to drive automation and innovation for operational tasks and research studies that are traditionally knowledge- and labor-intensive. In this study, we aimed to evaluate the feasibility of using EHR data in population health studies to examine the associations of clinical outcomes with environmental exposures., In particular, this case study focused on the built environment, which refers to the human-made environment through urban planning, such as buildings to provide food and shelter, infrastructure for public transportation, and space for social activities. The built environment is considered to influence public health through several mechanisms, including air quality, noise level, and access to healthy lifestyles. Notably, significant health effects by the built environment on heart failure (HF) patients have been reported by previous research. HF is among the leading causes of morbidity, mortality, and substantial healthcare expenditure in the United States. Its global prevalence is estimated to be more than 26 million, a figure projected to increase further as the global population continues to age. Previous population health and epidemiologic studies have identified risk factors of HF incidence and mortality, including male sex, high blood pressure, coronary artery disease, diabetes, valvular heart disease, tobacco use, obesity, low education level, and socioeconomic deprivation. Among the built environment factors, the effects from air quality and roadway proximity have been reported in multiple studies. HF incidence has been associated with exposure to particulate matter ≤2.5 μm in aerodynamic diameter (PM2.5) in a 4-year prospective cohort study of women across the United States, and an 11.5-year prospective cohort study in Europe. HF mortality was associated with exposure to PM2.5 in the Cancer Prevention Study II of 1.2 million adults over a 16-year follow-up. HF mortality was also associated with roadway proximity and noise volume in 5-year follow-up studies in Worcester, Massachusetts, a 9-year cohort study in the Netherlands, and a cross-sectional survey in Toronto, Canada, respectively. Observational data such as national Medicare claims and registries have been used in studies to identify associations between HF hospital admission rates and air pollutants,, and between socioeconomic deprivation and the cardiovascular events. Compared to claims data, EHRs contain richer clinical information from structured and unstructured laboratory and imaging test results without requiring the upfront investment of creating registries. Through geocoding of patient residential location information in EHRs and further linking it to publicly available environmental data sources, recent studies have identified associations such as air pollution and cardiovascular events during labor and delivery, air pollution and asthma, among others. Limitation of using EHR data in observational studies has been discussed in previous and recent literature., Particularly, it is known that EHR-derived cohorts potentially lead to erroneous and biased association estimates due to incomplete data collection and censoring especially in fragmented healthcare markets. Nevertheless, evaluating the ability to detect known population health associations in the EHR data may pave the path for more rigorous data collection efforts using the EHR, potentially starting to address current limitations., Thus, we sought to contribute to the existing literature on EHR-based population health studies by conducting a case study of whether previously reported effects of the built environment on HF patients’ cardiovascular functions could be identified from the EHR data combined with address-based metrics. Furthermore, we explored whether additional associations would be found using EHR data for future hypothesis-generating studies.

MATERIALS AND METHODS

Study setting

This study was performed at an academic medical center in a dense, urban environment in New York City. Study data were extracted from a commercial EHR and transformed to the Observational Medical Outcomes Partnership (OMOP) common data model maintained by the Observational Health Data Sciences and Informatics consortium. Weill Cornell Medicine Internal Review Board approved the study design and its use of protected health information (Protocol#: 1711018789).

Participants

We included patients if they had an encounter at the studied medical center between 2012 and 2017 and had a primary or secondary diagnosis of acute or chronic HF. A diagnosis of HF was defined as ICD-9-CM: 428.* or ICD-10-CM: I50*. Patients must have had at least 2 transthoracic echocardiograms with EF measurement that were more than 1 year apart, or have died more than 1 year after the baseline EF measurements. Patients were excluded if they died within 1 year of the baseline EF measurement, if their addresses were not recorded in the EHR, or if address changes were recorded from 2012 to 2017. Lastly, due to the reduced levels of physical activity and subsequent lower exposure to the built environment, patients with severely abnormal EF measurements as defined in the “Outcome Measurement” section at baseline were excluded. The inclusion and exclusion criteria are described in Figure 1.
Figure 1.

Patient inclusion and exclusion criteria.

Patient inclusion and exclusion criteria.

EHR data

Data elements extracted from patients’ EHR data were age, sex, race, average body mass index (BMI), EF, binary indicators for whether patients have ever smoked, binary indicators for whether patients have received at least one prescription of beta blockers (carvedilol, metoprolol, bisoprolol), or renin-angiotensin inhibitors (angiotensin-converting enzyme inhibitor or angiotensin receptor blocker), binary indicators for whether patients have had at least one diagnostic code for hypertension (ICD-9-CM: 401.X-405.X, 437.2 or ICD-10-CM: I10.*, I15.0, I15.8, I67.4), diabetes mellitus (ICD-9-CM: 250.* or ICD-10-CM: E10.*, E11.*), valvular heart disease (ICD-9-CM: 394.*-397.*, V42.2, V43.3 or ICD-10-CM: I05.0, I05.1, I05.2, I05.8, I06.0, I08.0, I08.8, I08.9, I07.1, I07.2, I07.8, Z95.2, Z95.3), coronary artery disease (ICD-9-CM: 410.* -414.*,429.2, V45.81 or ICD-10-CM: I21.09, I21.19, I21.11, I21.29, I21.4, I21.3, I21.9, I21.A1, I21.A9, H18.411, I25.10, I25.2, I20.8, I20.1, I20.8, I20.9, Z95.1), primary care locations, and mortality. Diagnostic codes were extracted from billing diagnoses. Dates of death were obtained through the EHR and the Social Security Death Index. EF measurements were extracted from the unstructured notes of the patients’ EHR using a rule-based natural language processing method described by Johnson et al. Patients’ residential locations were passed to an application programming interface offered by the United States Census Bureau which allowed us to derive both the latitude/longitude pairs and the US census tracts which equal to the 11-digit Federal Information Processing Standard (FIPS) codes.

Public data

The built environment factors on accessibility, traffic, land use, and air quality were extracted at the individual patient level using the aforementioned latitude/longitude pairs in the EHR. In addition, FIPS-level social determinants were extracted based on the 11-digit FIPS code in the EHR. Four indicators were defined to measure accessibility to public and active transportation and green spaces: the distance to the nearest bus stops, the distance to the nearest subway stops, the distance to the nearest parks, and the distance to the nearest bike facilities. Data on accessibility were obtained from the NYC Department of Planning public data repository. Parks were defined as areas designated as a park, ball field, playground, or public space in NYC Zoning Districts. Figure 2 displays the distances to the park across the 5 boroughs in NYC.
Figure 2.

Distances to the nearest parks from patients’ home locations.

Distances to the nearest parks from patients’ home locations. The traffic data were obtained from the New York Best Practice Model, which is an activity-based travel demand model that includes traffic volume on highways, major arterials, and collector’s links along with several other transportation measures. The model predicts daily traffic volume in each roadway link for the different types of vehicles including passenger vehicles, buses, taxi, and trucks. We grouped the traffic volumes into 2 groups, respectively, namely, light-duty vehicles such as passenger cars and taxies, and heavy-duty vehicles such as buses and trucks. The stratification controls for the varying environmental impacts by the light- and heavy-duty vehicles.Figure 3 displays heavy-duty vehicle activity within 250-m buffers across the 5 boroughs of NYC.
Figure 3.

The heavy-duty vehicle activity within 250-m buffer (right) in the studied environment.

The heavy-duty vehicle activity within 250-m buffer (right) in the studied environment. We measured the walkability and availability of a variety of resources for retail, commercial, facility, and residential purposes within 500 m of each patient’s home location using the land use mix index and the floor area ratios. The floor area ratio measures the building floor area divided by land area. For example, the areas with a higher share of parking space have lower retail floor area ratio values while areas with smaller setbacks from the street have higher values. Four types of floor area ratios were computed: retail floor area ratio, residential floor area ratio, commercial floor area ratio, and facility floor area ratio. Higher floor area ratios are considered to promote more walkability, an important built environment indicator as our study focuses on an urban environment. Land use data were extracted from the NYC Department of Planning public data repository which includes information about land use type at the parcel level. A measure for the heterogeneity of land use, higher land use mix indices indicates a higher walkability of the area. For air quality, we estimated patients’ exposure to nitrogen dioxide (NO2) using the Land Use Regression model obtained from the Center for Air, Climate and Energy Solutions. This air pollutant model estimates the daily NO2 concentration at the block group level using land use regression models and covers both regional and local air pollution hotspots. Lastly, we obtained census-tract level estimates of social determinants of health including poverty rates, percentages of college degrees, and median home values from the FACETS dataset. While not at the individual-level, these estimates allowed us to control for socioeconomic risk factors identified in previous studies.

Outcome measurement

Left ventricular ejection fraction (EF), the portion of blood pumped out by the left ventricle with each contraction, is one of the most important measurements in diagnosing and defining stages of HF. Under the definitions provided by the American Society of Echocardiography and the European Association of Cardiovascular Imaging, EF measurements are classified into 4 categories: normal (EF >51% in men and EF >53% in women), mildly abnormal (EF between 41%–51% in men and 41%–53% in women), moderately abnormal (EF within 30%–40% in men and women), and severely abnormal (EF <30% in men and women). EF severity may be reflected in HF patients’ changing conditions such as shortness of breath and reduced ability for physical activity. EF measurements are most commonly taken with transthoracic echocardiograms and recorded in the EHRs repeatedly following patients’ routine care. The outcome in the study is a composite outcome of EF change defined as a deteriorated EF category or mortality within 1 year of a baseline EF measurement. Deteriorated EF category includes a shift from normal to mildly/moderately/severely abnormal, from mildly abnormal to moderately/severely abnormal, or from moderately abnormal to severely abnormal. The composite outcome was not treated as a time-to-event outcome as we assumed that the recorded dates of EF measurements do not equal to the actual time of EF change. Although a prognostic marker such as New York Heart Association (NYHA) Functional Classification would be a strong indicator of HF, this marker was not reliably available in the EHR data, and therefore we defined the study outcomes on the basis of EF measurements.

Statistical methods

Bivariate associations between the exposure and outcome were assessed with chi-squared tests for categorical variables and analysis of variance for continuous variables. Two hypotheses were tested. H1: The built environment is significantly associated (vs not associated) with a reduction in EF among patients with HF. Mixed-effects logistic regression with fixed and random effects was used to analyze the associations while controlling for previously reported HF risk factors. HF risk factors that were considered as the fixed effect variables are age, sex, race, BMI, smoking (yes/no), diabetes (yes/no), valvular heart disease (yes/no), coronary artery disease (yes/no), average poverty level at the census tract. The model also contained multiple built environment variables as fixed effects, including floor area ratio for residential use, floor area ratio for facility use, floor area ratio for commercial use, floor area ratio for retail use, land use mix index, average daily NO2 concentration (μg/m3), light-duty vehicle in 250 m buffer in kilometer, heavy-duty vehicle in 250 m buffer in kilometer, distance (km) to nearest bus stops, distance (km) to nearest parks, distance (km) to nearest subway stops, and distance (km) to nearest bike paths. The primary care locations were treated as the random effects in the model to control for the possible care variations across clinics within the health system. Backward elimination was performed for variable selection among the aforementioned variables. Tests for correlations and multicollinearity among variables were tested using the variance inflation factor (VIF). The models were constructed using Stata 14’s generalized structural equation model. Since the majority of the patients were age 60 and above, we did not create matched cases and controls by age. H2: The built environment is significantly associated (vs not associated) with a reduction in EF among HF patients with baseline normal EF. It is known that the amount of physical activity that may be tolerated by HF patients decreases as the disease progresses. Since reduced physical activity likely leads to different levels of environmental exposure, we examined the effects of the built environment on the outcome among patients whose initial EF measurements were normal as a subgroup analysis. Sensitivity analyses were conducted. In the first analysis, we strictly used only deterioration in the EF category as the primary outcome of a change in EF. In this analysis, if patients were recorded to have died after at least 1 year following the baseline EF measurement but had no EF measurements that are at least 1 year apart, they were included in the analysis as no-change. Additionally, we performed a second sensitivity analysis limiting the study sample to only patients with both initial and final EF measurements. In the second analysis, if patients were recorded to have died after at least 1 year following the baseline EF measurement but had no EF measurements that are at least 1 year following baseline, they were excluded from the analysis. In addition, on a subset of patients we were able to obtain values of B-type natriuretic peptide (BNP) which is known to be a marker for the severity of acute and chronic HF. BNP values were compared between the group with initial EF measurements studied in the subgroup analysis versus the rest of the study sample whose initial EF measurements were abnormal as defined by the American Society of Echocardiography and the European Association of Cardiovascular Imaging.

RESULTS

A total of 1287 adult patients who met the study criteria were identified. Table 1 lists the variables and their bivariate associations with the outcome. We imputed 215 missing values in BMI using multiple imputation.
Table 1.

Descriptive patient characteristics

VariableOutcome (percentage/standard deviation)
NoYes
Number of patients887400
Initial EF category
 Normal747 (84.22%)326 (81.50%)
 Mildly abnormal86 (9.70%)45 (11.25%)

 Moderately abnormal

 

se

54 (6.09%)29 (7.25%)
Last EF category/all-cause mortality*
 Normal832 (93.80%)0 (0.00%)
 Mildly abnormal36 (4.06%)119 (29.75%)
 Moderately abnormal19 (2.14%)84 (21.00%)
 Severely abnormal0 (0.00%)81 (20.25%)
 All-cause mortality0 (0.00%)116 (29.00%)
Sex*
 Female443 (49.94%)148 (37.00%)
 Male444 (50.06%)252 (63.00%)
Race
 Asian55 (6.20%)21 (5.25%)
 Black or African American182 (20.52%)73 (18.25%)
 White321 (36.19%)150 (37.50%)
 Unknown131 (14.77%)73 (18.25%)
 Other198 (22.32%)83 (20.75%)
Age*68.03 (sd=10.82)66.44 (sd=12.14)
BMI29.17 (sd=7.47)27.88 (sd=6.87)
Smoking (smoker and ex-smoker)
 No392 (44.19%)155 (38.75%)
 Yes495 (55.81%)245 (61.25%)
Valvular heart disease
 No235 (26.49%)113 (28.25%)
 Yes652 (73.51%)287 (71.75%)
Coronary artery disease
 No89 (10.03%)34 (8.50%)
 Yes798 (89.97%)366 (91.50%)
Hypertension
 No37 (4.17%)18 (4.5%)
 Yes850 (95.83%)382 (95.5%)
Diabetes
 No395 (44.53%)170 (42.50%)
 Yes492 (55.47%)230 (57.50%)
Medication*
 No147 (16.57%)37 (9.25%)
 Yes740 (83.43%)363 (90.75%)
Census-tract level poverty rate18.92% (SD = 0.145)18.93% (SD = 0.137)
Standardized area for residential use3.130 (SD = 3.397)3.373 (SD = 3.591)
Standardized area for commercial use2.002 (SD = 3.618)2.111 (SD = 3.711)
Standardized area ratio for retail use2.132 (SD = 2.754)2.346 (SD = 3.217)
Standardized land use mix index8.446 (SD = 10.458)8.823 (SD = 10.619)
Distance (km) to nearest bus stops0.103 (SD = 0.117)0.099 (SD = 0.083)
Distance (km) to nearest subway stops*0.595 (SD = 0.746)0.499 (SD = 0.547)
Distance (km) to nearest parks0.212 (SD = 0.153)0.222 (SD = 0.163)
Distance (km) to nearest bike paths0.191 (SD = 0.293)0.189 (SD = 0.273)
Daily NO2 concentration (μg/m3)*9.19 (SD = 0.50)9.27 (SD = 0.51)
Light-duty vehicles in 250-m buffer28141.99 (SD = 40348.13)23827.40 (SD = 32150.47)
Heavy-duty vehicles in 250-m buffer3470.27 (SD = 4492.35)3284.22 (SD = 4291.09)

P value <0.05.

Descriptive patient characteristics Moderately abnormal se P value <0.05. Results from the mixed-effects logistic regression for H1 are shown in Table 2. As in previous literature, male sex (OR = 1.093, P value < 0.001) and daily NO2 concentration (OR = 1.071, P value < 0.001) were significantly associated with increased odds of the outcome. In addition, medication prescription (OR = 1.137, P value < 0.001), age (OR = 0.997, P value < 0.017), BMI (OR = 0.999, P value = 0.001), and Asian race (OR = 0.915, P value < 0.041) were significant in the model. The daily NO2 concentration remained significant in the sensitivity analyses (see Supplementary Tables SA2 and SA4), and in larger models that included other built environment variables (data not shown). We did not find other risk factors previously reported to be significantly associated. The model had no significant multicollinearity based on VIF (<10).
Table 2.

Mixed-effects logistic regression for reduction of EF (N = 1287)

Odds ratio P value95% confidence interval
Diabetes1.0460.0900.9931.101
Medication1.137<0.001*1.0761.201
Valvular heart disease0.9570.0830.9101.006
Hypertension0.9960.9480.8871.119
Smoking1.0540.1010.9901.122
Male (vs Female)1.093<0.001*1.0491.139
Race (Base: White)
 Asian0.9150.041*0.8400.996
 Black0.9540.2920.8731.041
 Declined1.0230.5600.9481.104
 Other0.9520.0880.8991.007
BMI0.9990.001*1.0001.000
Census-tract poverty rate1.0660.4400.9061.255
Age0.9970.017*0.9950.999
Coronary artery disease1.0060.8850.9241.096
Daily NO2 concentration (μg/m3)1.071<0.001*1.0361.107

P value < 0.05.

Mixed-effects logistic regression for reduction of EF (N = 1287) P value < 0.05. For H2, a subgroup analysis of the study cohort (N = 1073) whose initial EF was normal is shown in Table 3. Male sex (OR = 1.117, P value < 0.001), BMI (OR = 0.999, P value = 0.037), and medication prescription (OR = 1.158, P value < 0.001) were significantly associated with the outcome. Unlike our findings from the main analysis, increased distance (km) to nearest parks (OR = 1.166, P value = 0.049) and decreased distance (km) to subway stops (OR = 0.947, P value = 0.001) were found to be significantly associated with the outcome. The daily NO2 concentration was no longer significantly associated with the outcome in the subgroup analysis. Similar results were obtained in the 2 sensitivity analyses (Supplementary Tables SA3 and SA5).
Table 3.

Subgroup analysis of the study cohort whose initial EF was normal: mixed-effects logistic regression for reduction of EF (N = 1073)

Odds ratio P value95% confidence interval
Diabetes1.0350.2150.9801.092
Medication1.158<0.001*1.0991.221
Valvular heart disease0.9710.3730.9091.036
Hypertension0.9550.4350.8501.073
Smoking1.0700.0620.9971.148
Male (vs female)1.117<0.001*1.0651.173
Race (Base: White)
 Asian0.9290.1870.8331.036
 Black0.9550.2910.8781.040
 Declined0.9990.9900.9151.092
 Other0.9740.4070.9141.037
BMI0.9990.037*1.0001.000
Census-tract poverty rate1.1890.0530.9981.416
Age0.9970.0540.9951.000
Coronary artery disease0.9780.6450.8911.074
Distance (km) to nearest parks1.1660.049*1.0011.358
Distance (km) to nearest subway stops0.947<0.001*0.9270.967

P value < 0.05.

Subgroup analysis of the study cohort whose initial EF was normal: mixed-effects logistic regression for reduction of EF (N = 1073) P value < 0.05. Results from the sensitivity analysis are shown in Supplementary Tables SA1–SA5 in the Appendix for the main cohort and the subgroup analysis cohort. In all analyses, the daily concentration of NO2 remained significantly associated with the outcome in the main cohort. The associations between the outcome and the distances to parks and subway stations also remained significant in the subgroup analysis cohort. Among the patients who were included in the subgroup analysis, 143 patients had BNP values that were within 2 months of the initial EF measurements used to decide inclusion for subgroup analysis. The distribution of the BNP across EF category is shown in Table 4. The normal group, used for the subgroup analysis, has significantly lower BNP levels compared to the patients whose initial EF measurements were categorized as mild or moderately abnormal (P value = 0.001).
Table 4.

BNP values in the patients with normal versus abnormal initial EF measurements

EF categoryMean (SD)Median
Normal552.1 (662.2)352
Abnormal (mild + moderate)1061.9 (1133.0)679
BNP values in the patients with normal versus abnormal initial EF measurements

DISCUSSION

The goal of this case study was to identify built environment factors that are associated with a reduction in EF among a cohort of HF patients using EHR data linked with address-based metrics. The daily concentration (μg/m3) of NO2, and accessibility to nearest parks and subway stops, was found in the main and subgroup analysis to be significantly associated with the outcome, respectively. Our finding on the daily concentration (μg/m3) of NO2 agreed with previous studies that examined air quality and cardiovascular events. The outcome’s association with the distance to the parks in the subgroup analysis has also been reported in previous literature. Given the urban study environment, the associations may be an indicator of increased opportunities in staying physically active. Exercise training has been increasingly reported in recent years to benefit long-term health in HF patients across all ages, gender, and HF severity groups. For example, a recent multicenter randomized clinical trial found exercise training to be associated with modest significant reductions in cardiovascular mortality and HF hospitalization among patients with chronic HF. Specifically related to parks, a randomized crossover study in an urban environment in London, United Kingdom found that walking near a park led to an improvement in lung function, while significant effect of the same exercise was not observed when subjects were walking along a densely populated area. In addition, previous studies have reported that access to parks alleviates stress, improves mental health, and increases subjective well-being, and associated with lower medical expenditures., Both stress and poor mental health have well-documented correlative and causative associations with cardiovascular morbidity including HF. The outcome’s association with the distance to nearest subway stops contrasts against previous studies where lack of transportation has been identified as a barrier to healthcare access and thus a risk factor. However, similar to the distance to parks, in the urban setting we studied, our finding may actually reflect the increased likelihood for routine physical activity through walking. As public transportation is by far the most common form of commute in this urban setting, it is possible that longer walks required to reach the subway stops contributed to an increased level of physical activity. It may also explain why the associations from proximity to park and subway stops were only observed in the subgroup analysis since the main analysis included patients with different EF categories and subsequently possible varying levels of physical activity. We aim to explore this association further in future studies. Additionally, while we only studied NO2 in this study as an indicator of air quality, future studies will also examine the exposure from other air pollutants such as PM2.5 in the urban environment. The use of structured EHR data in our study faced a number of limitations. First, although the healthcare organization had clinics around the city, it is possible that our study data missed EF measurements and other comorbid conditions that were recorded outside the study setting in constructing the models. To address this limitation, we excluded patients who only had 1 EF measurement from our study to better ensure that patients had continuous care within the health system. Additionally, our study data had information on medication but they were limited to prescription and not the actual usage. BMI was significantly associated with the outcome in both the main and subgroup analyses but with very small effects, possibly due to the number of missing values and the resulting imputations. Moreover, while we used natural language processing to extract EF measurements from the imaging reports and clinical notes, diagnoses in the study data were extracted mainly using structured diagnostic codes. The diagnostic codes used to define HF included chronic and acute HF, as well as both HF with preserved and reduced EF. Therefore, an important limitation of the study is the lack of documentation for NYHA Functional Classification and other biomarkers for HF severity in the EHR. While EF is a measure of cardiac function, it is limited in defining the severity of patient symptoms and not always correlated with physical activity especially in patients with HF with preserved EF. This study conducted a sensitivity analysis using available BNP values near the initial EF measurements to further investigate the patient characteristics in the subgroup. Future studies will need to leverage unstructured data for more accurate data extraction, and also to conduct subgroup analysis on patients with preserved and reduced EF separately. Specifically to this study, in both main and subgroup analyses, we found that decreased age and more medication prescription were significantly associated with an increased odds of reduction in EF. These findings may reflect the characteristics of a patient cohort under treatment in a health system identified from the EHR data in comparison to a general population. This study excluded patients who had recorded address changes during the study period of 2012–2017. As a result, the sample size was a limiting factor in the identification of additional built environment factors that may be significantly associated with the outcome. Despite this effort, there likely still are unrecorded changes in the home locations that were not captured in the data, in addition to exposures prior to 2012 that could have contributed to the study outcome. Future studies may explore large datasets combining EHR data from multiple health systems and insurance claims data to address this challenge. Lastly, our study data did not capture detailed social determinants of health such as individual income, family support, occupation, stress level, and altitude of the apartment buildings that may contribute to the outcome. Future efforts in better tracking social determinants of health in the EHR may alleviate this limitation.

CONCLUSION

Using EHRs linked with address-based metrics on the built environment, we found that air quality, proximity to subway stops, and proximity to parks were associated with a reduction in EF among HF patients in an urban environment. Our findings confirm previous findings on the effects of clean air quality and physical activity for enhanced cardiovascular health. More importantly, findings from this study may help pave the path for promoting future integration of public data sources with EHR data, and more rigorous and precise data collection of patient-level exposure from the built environment in routine patient visits. Clinical decision support may be built within the EHR to provide built environment-related real-time alerts and reminders to care providers for personalized management of HF. Furthermore, collected data may enable larger observational studies on the effects of the built environment on cardiovascular health, thus potentially expediting longitudinal environmental health studies.

FUNDING

This work was supported by the Center for Transportation, Environment, and Community Health New Research Initiatives Fund (79841-10984). This work was partially supported by Weill Cornell Medicine Dean’s Diversity and Healthcare Disparities Award, National Library of Medicine (K01LM013257-01), the Clinical and Translational Science Center (UL1 TR000457), Joint Clinical Trials Office, and the University Transportation Research Center September 11th grant (55606-08-28).

AUTHOR CONTRIBUTIONS

Y.Z. designed the overall study in consultation with J.S.A., J.P., and O.G. Y.Z., M.T., E.S. obtained the study data. Y.Z., M.T., E.S., and Y.L. performed data analysis in consultation with A.R.C. S.A., A.B., and K.M.A. provided clinical inputs and interpretation. Y.Z. and M.T. wrote the paper with input from all authors.

SUPPLEMENTARY MATERIAL

Supplementary material is available at Journal of the American Medical Informatics Association online. Click here for additional data file.
  39 in total

1.  JAMA patient page. Acute emotional stress and the heart.

Authors:  Janet M Torpy; Alison E Burke; Richard M Glass
Journal:  JAMA       Date:  2007-07-18       Impact factor: 56.272

2.  Using electronic health record alerts to provide public health situational awareness to clinicians.

Authors:  Joseph Lurio; Frances P Morrison; Michelle Pichardo; Rachel Berg; Michael D Buck; Winfred Wu; Kwame Kitson; Farzad Mostashari; Neil Calman
Journal:  J Am Med Inform Assoc       Date:  2010 Mar-Apr       Impact factor: 4.497

Review 3.  Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review.

Authors:  Benjamin A Goldstein; Ann Marie Navar; Michael J Pencina; John P A Ioannidis
Journal:  J Am Med Inform Assoc       Date:  2016-05-17       Impact factor: 4.497

4.  Using electronic health record data for environmental and place based population health research: a systematic review.

Authors:  Leah H Schinasi; Amy H Auchincloss; Christopher B Forrest; Ana V Diez Roux
Journal:  Ann Epidemiol       Date:  2018-03-21       Impact factor: 3.797

5.  FACETS: using open data to measure community social determinants of health.

Authors:  Michael N Cantor; Rajan Chandras; Claudia Pulgarin
Journal:  J Am Med Inform Assoc       Date:  2018-04-01       Impact factor: 4.497

6.  Global Public Health Burden of Heart Failure.

Authors:  Gianluigi Savarese; Lars H Lund
Journal:  Card Fail Rev       Date:  2017-04

Review 7.  Traveling towards disease: transportation barriers to health care access.

Authors:  Samina T Syed; Ben S Gerber; Lisa K Sharp
Journal:  J Community Health       Date:  2013-10

8.  Long-term exposure to air pollution and incidence of cardiovascular events in women.

Authors:  Kristin A Miller; David S Siscovick; Lianne Sheppard; Kristen Shepherd; Jeffrey H Sullivan; Garnet L Anderson; Joel D Kaufman
Journal:  N Engl J Med       Date:  2007-02-01       Impact factor: 91.245

9.  From Sour Grapes to Low-Hanging Fruit: A Case Study Demonstrating a Practical Strategy for Natural Language Processing Portability.

Authors:  Stephen B Johnson; Prakash Adekkanattu; Thomas R Campion; James Flory; Jyotishman Pathak; Olga V Patterson; Scott L DuVall; Vincent Major; Yindalon Aphinyanaphongs
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2018-05-18

Review 10.  Exercise-based rehabilitation for heart failure.

Authors:  Rod S Taylor; Viral A Sagar; Ed J Davies; Simon Briscoe; Andrew J S Coats; Hayes Dalal; Fiona Lough; Karen Rees; Sally Singh
Journal:  Cochrane Database Syst Rev       Date:  2014-04-27
View more
  1 in total

1.  Social Determinants of Health Factors for Gene-Environment COVID-19 Research: Challenges and Opportunities.

Authors:  Jimmy Phuong; Naomi O Riches; Charisse Madlock-Brown; Deborah Duran; Luca Calzoni; Juan C Espinoza; Gora Datta; Ramakanth Kavuluru; Nicole G Weiskopf; Cavin K Ward-Caviness; Asiyah Yu Lin
Journal:  Adv Genet (Hoboken)       Date:  2022-03-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.