Literature DB >> 36105559

The relationship between the built environment and respiratory health: Evidence from a longitudinal study in Indonesia.

Witri Indriyani1,2, Muhammad Halley Yudhistira1,3, Prani Sastiono1,3, Djoni Hartono2.   

Abstract

Multiple studies have discussed the relationship between the built environment and non-infectious diseases, but research involving infectious diseases and the built environment is scarce. How the built environment is associated with infectious diseases varies across areas, and previous literature produces mixed results. This study investigated the relationship between the built environment and infectious diseases in Indonesia, which has different settings compared to developed countries. We combined the longitudinal panel data, Indonesian Family Life Survey (IFLS), and land cover data to examine the relationship between the built environment and the likelihood of contracting respiratory infectious diseases. We focused on the sprawl index to measure the built environment. The study confirmed that a sprawling neighbourhood is linked to lower respiratory infection symptoms by employing a fixed effect method. The association is more evident in urban areas and for females. The results also suggested that the linkage works through housing quality, such as housing crowdedness and ventilation, and neighbourhood conditions like neighbourhood transportation modes and air pollution levels. Thus, our results underlined the need to consider the health consequences of the densification policy and determine the direction of landscape planning and policy.
© 2022 The Authors.

Entities:  

Keywords:  Built environment; Communicable disease; Fixed effect; Health; Respiratory infection; Sprawl index

Year:  2022        PMID: 36105559      PMCID: PMC9464964          DOI: 10.1016/j.ssmph.2022.101193

Source DB:  PubMed          Journal:  SSM Popul Health        ISSN: 2352-8273


Introduction

Where people live could directly affect their health. Some places offer a healthier built environment than other places. Generally, the built environment refers to the human-made physical environment, including land use patterns, transportation systems, physical infrastructure and urban design (Saelens & Handy, 2008). The linkage between the built environment and health outcomes started from ecology theory (Stokols, 1996) and neighbourhood effect theory (Galster, 2012) which find that the built environment could be a health improvement or stressor depending on its characteristics and the human's health-related behaviour. Whether the built environment is related to health outcomes is an ongoing discussion. On the one hand, a compact built environment might offer higher walkability and lead to active behaviour, thus decreasing the risk of non-communicable diseases like obesity, heart diseases and cancer prevalence (Ewing, Meakins, Hamidi, & Nelson, 2014; Zhao & Kaestner, 2010). On the other hand, a compact built environment is also associated positively with infectious diseases since compact areas encourage high contact intensity and amplify the risk of disease contagion (Glaeser, 2011). How the built environment is associated with infectious diseases also varies across areas. Recent empirical studies on the built environment and infectious disease show mixed results (Bhadra, Mukherjee, & Sarkar, 2021; Hamidi, Ewing, & Sabouri, 2020; B. Li, Ma, & Zhang, 2021; S. Li, Ma, & Zhang, 2021). While research discovered negative and no significant relationship between a compact built environment and the rate of respiratory infection disease, Covid-19 (Carozzi, Provenzano, & Roth, 2020; Hamidi, Ewing, & Sabouri, 2020; Hamidi, Sabouri, & Ewing, 2020), other studies report a positive and significant association between compact built environment and Covid-19 (Bhadra et al., 2021; Emeruwa et al., 2020; B. Li, Ma, & Zhang, 2021; S. Li, Ma, & Zhang, 2021). One explanation for the mixed results was the different characteristics in the backdrop of the studies. In developed countries, even though compactly built environments allow for close physical contact, such areas also benefit more from better health facilities and health education, which enable them to prevent disease infection (Hamidi, Sabouri, & Ewing, 2020). Meanwhile, in developing and less developed countries with limited health facilities, a compact built environment induces a higher contagion due to intense contact. Coupled with overcrowded housing, poor sanitation, and pollution will aggravate the disease transmission and prevalence in developing and less developed countries (Antunes & Waldman, 2001; Ciencewicki & Jaspers, 2007; Mondal & Paul, 2020; Nasim, El-Zein, & Thomas, 2022). Furthermore, the extent to which the built environment is linked to infectious diseases in urban and rural areas could be different. In rural areas, the built environment is less compact which could prevent the villagers from disease transmission. However, the lack of access to health facilities and education increases the disease's prevalence (S. Wang et al., 2015). The higher social capital in rural areas (Ziersch, Baum, Darmawan, Kavanagh, & Bentley, 2009) also has advantages and disadvantages regarding infectious diseases. Higher social capital can accelerate health information exchange. Still, social capital can also increases human contact, such as secondhand cigarette smoking or shared food or drink, potentially accelerating disease transmission (Villalonga-Olives & Kawachi, 2017). This study investigates the built environment and the likelihood of individuals contracting respiratory infectious diseases in Indonesia, a developing country with a constant rate of respiratory infectious diseases (Ministry of Health, 2018) and experiencing built environment changes (Yudhistira, Indriyani, Pratama, Sofiyandi, & Kurniawan, 2019). Specifically, the study investigates whether the built environment correlates with non-specific acute respiratory infection symptoms, pneumonia symptoms, and tuberculosis (TB). However, measuring the built environment in a small neighbourhood is challenging when the data is limited. Fortunately, technology development provides an alternative approach by utilizing various digital and photographic impressions, such as location tracking by cell phone, satellite imagery, and a street photograph. The analysis employs the combination of social economy and health data from three rounds of the Indonesian Family Life Survey (IFLS) 2000, 2007, and 2014, and land cover data from European Space Agency Climate Change Initiatives (ESA CCI) in the respective years. We measure the built environment with sprawl index using state-of-the-art Geographic Information Systems (GIS) to accommodate an individual's neighbourhood situation at the sub-district level following Burchfield, Overman, Puga, and Turner (2006). The study hypothesizes that sprawled built environments might correlate with the probability of infecting respiratory diseases, and the correlation is statistically significant. The relationship between the built environment and respiratory infectious diseases could happen due to self-selection. Individuals with certain health statuses self-select into densely populated or sprawled area living conditions. For example, individuals with lower health status might self-select into higher compact living conditions to benefit from better health facilities in densely populated areas. They could also self-select into less dense areas for better environments, such as cleaner air. Hence, the study utilizes a panel fixed-effect method to reduce the bias from time-invariant unobservable due to self-selection. We use the wave-island fixed effect to handle heterogeneity at the island level that might varies across years. Then we restrict our sample to non-migrant individuals to control unobservable factors in sub-districts that might confound the relationship between the built environment and individual health outcomes. Moreover, the longitudinal data, IFLS, also partially deals with individual time-variant confounders. The more refined controls and the data form could accommodate the self-selection problems and make our analysis closer to causality than a simple ordinary least square in cross-section data. This study is related to the recent literature linking built environments and health outcomes (Bhadra et al., 2021; Hamidi, Ewing, & Sabouri, 2020; Hamidi, Sabouri, & Ewing, 2020). Most studies limit themselves by using qualitative explanations in justifying the underlying mechanism of how built environments might associate with health outcomes. We complement these studies by providing a parametric estimation. The estimation utilizes the IFLS data to identify the channels, i.e., housing crowdedness, housing ventilation, and neighbourhood transportation modes and Village Potential (Podes) data to describe the health facilities at the sub-district level. The study also uses pollution data from the TROPOspheric Monitoring Instrument (TROPOMI) onboard European Space Agency Copernicus Sentinel-5 Precursor (S5P) satellite to detect the neighbourhood pollution level. We include three airborne particles, nitrogen dioxide (NO2), sulfur dioxide (SO2), and carbon monoxide (CO), to define air pollution. Secondly, the study applies the sprawl index developed by Burchfield et al. (2006) as an approach to the built environment using GIS methods. The sprawl index allows us to capture better the relationship between the built environment and health outcomes than the administrative approach (Drewnowski et al., 2019). Thirdly, we examine a nearly causal relationship between the built environment and an individual's respiratory infectious disease with longitudinal data, while the earlier studies mainly discuss the link to non-communicable diseases or employ cross-sectional data (Ewing et al., 2014; Hamidi, Sabouri, & Ewing, 2020; Zhao & Kaestner, 2010). The longitudinal analysis data could partially absorb time-variant unobservables. Additionally, we enhance the model specification by working on the panel fixed-effect method. The method could reduce the endogeneity by dealing with time-invariant unobservables. Finally, the results of this study will increase the awareness of the implication of the densification policy on respiratory infectious diseases in Indonesia. The structure of the study is as follows. Section 2 provides the context of the study. Section 3 describes the data and empirical strategy. Section 4 presents the results, and section 5 discusses the findings. Last, section 6 concludes the study.

Context: respiratory disease prevalence and built environment in Indonesia

The area's characteristics could affect how the built environment relates to infectious diseases. Indonesia has been facing challenges of limited health facilities, overcrowding housing, and poor sanitation that might influence the relationship between the built environment and respiratory infectious diseases. Statistics of Indonesia 2021 show that 39% of Indonesian households live in indecent housing. They are not meeting the four criteria, i.e., sufficient living space (7.2 m2 per capita), having access to clean drinking water, having access to proper sanitation, and building resilience or durable housing. Indonesia also has issues with health inequality and inadequacy. We note three critical points from the statistics descriptive of Indonesia's health facilities in Table 1. First, the health facilities do not change substantially from 2006 to 2020. Second, the table indicates health facilities' inequality between sub-districts. We still found several sub-districts without Community health care (Puskesmas), whereas the regulation requires at least one Puskesmas as primary health care in each sub-district (Regulation of the Minister of Health of Republic of Indonesia Number 75 the Year 2014 about Community Health Center, 2014). Third, the number of Puskesmas is still inadequate compared to the average sub-district population (Based on Podes, 2007, the average population in each sub-district is around 40 thousand people). Table 1 shows that on average, each sub-district has one to two Puskesmas, serving approximately 40 thousand people, or the ratio is one Puskesmas for more than 20.000 people. The ratio is lower than the government minimum target, 1:16.000 (Regulation of the Minister of Law and Human Rights of Republic of Indonesia Number 34 the Year 2016 about District/City of Human Rights Care, 2016). Indonesia was also the second lowest among Southeast Asia countries, with the ratio of physicians per person, at 0.46 per 1000 people (World Bank, 2019).
Table 1

Number of health facilities at the sub-district level.

2006
2014
2020
meanmin.max.meanmin.max.meanmin.max.
Hospital0.3090.30100.4016
Community health care (Puskesmas)1.50111.50181.5014
Polyclinic and infirmary2.00711.701012.0071
Supporting Puskesmas4.20383.70353.6032
Doctor's private practice5.602184.701455.20125
Pharmacy1.80882.80954.1079
Number of health facilities at the sub-district level. The basic health research (RISKESDAS) exhibits that the rate of respiratory infection in Indonesia remains unchanged and tends to increase in more compact neighbourhoods, e.g., urban areas. Table 2 provides the prevalence of Indonesia's respiratory infectious diseases, including non-specific acute respiratory infections, pneumonia, and TB, which are common in Indonesia. We need to read the data carefully since the data in 2018 was collected differently than those in 2007 and 2013. The TB prevalence might still be comparable across the years, but pneumonia and non-specific acute respiratory infections are only comparable in 2007 and 2013. The prevalence of TB nationally was constant and increased slightly in 2018, showing that the health program designed to combat TB had not yet been successful. In the early years, the TB prevalence was higher in rural areas, but the number in urban areas has recently been more evident. Among genders, the TB prevalence was consistently higher for males. In 2021 Indonesia ranked 3rd globally in terms of TB incidence (World Health Organization, 2021).
Table 2

The prevalence of respiratory infectious diseases in Indonesia.

200720132018
Pneumonia
National2.11.84.0
Urban1.61.63.8
Rural2.42.04.3
Male1.61.94.2
Female2.41.73.9
Non-specific acute respiratory infections
National25.525.09.3
Urban23.324.19.0
Rural26.926.09.7
Male25.625.19.0
Female25.524.99.7
Tuberculosis
National0.40.40.4
Urban0.40.40.4
Rural0.40.30.4
Male0.40.40.5
Female0.40.30.4

Source: Indonesia Basic Health Research, Ministry of Health (2008, 2013, 2018).

Notes: The prevalence of pneumonia and non-specific acute respiratory infections was calculated based on the diagnosis of health workers (doctor, nurse, midwife) and the symptoms. The TB prevalence was estimated from the health workers' diagnosis only. Specifically, in 2018, the TB prevalence was counted if diagnosed by a doctor. The period of pneumonia and non-specific acute respiratory infection symptoms was similar in data between 2007 and 2013, during the last month when surveyed. In 2018, the duration of the pneumonia check was extended up to the last year when surveyed, and the period of non-specific acute respiratory infections was restricted only to the last two weeks when surveyed. The diagnosis of TB in the three surveys was asked during the last year when surveyed.

The prevalence of respiratory infectious diseases in Indonesia. Source: Indonesia Basic Health Research, Ministry of Health (2008, 2013, 2018). Notes: The prevalence of pneumonia and non-specific acute respiratory infections was calculated based on the diagnosis of health workers (doctor, nurse, midwife) and the symptoms. The TB prevalence was estimated from the health workers' diagnosis only. Specifically, in 2018, the TB prevalence was counted if diagnosed by a doctor. The period of pneumonia and non-specific acute respiratory infection symptoms was similar in data between 2007 and 2013, during the last month when surveyed. In 2018, the duration of the pneumonia check was extended up to the last year when surveyed, and the period of non-specific acute respiratory infections was restricted only to the last two weeks when surveyed. The diagnosis of TB in the three surveys was asked during the last year when surveyed. The prevalence of pneumonia decreased from 2007 to 2013 by more than 15% or around 2% annually. The decrease mainly occurred in rural areas, whereas the decline in urban areas was only 1.8% or about 0.3% annually. It is also noticeable that pneumonia rose substantially (16.6%) for males while the female prevalence dropped drastically (30%) in 2013. The non-specific acute respiratory infection prevalence was steady between the two survey periods, but the prevalence increased in urban areas. The number was also consistently higher for males, though the gap is small between gender. From the data, we could see the paradox that the prevalence of non-specific acute respiratory infections in the urban area is growing, whereas the national prevalence is declining. At the same time, population growth and infrastructure development have changed Indonesia's built environment (Pratama, Yudhistira, & Koomen, 2022; Yudhistira et al., 2019). Table 3 present the average sprawl index in a 5 km radius and developed areas in Indonesia. The sprawl index has decreased, indicating that the built areas are more compact nationally. In contrast, the proportion of the developed areas increases by about 2% annually. The data show the changes in the built environment in Indonesia, regardless of the small changes.
Table 3

The built environment of Indonesia.

200020072014
Sprawl index88.587.486.4
Developed areas (%)7.58.49.6
The built environment of Indonesia. Despite Indonesia's socioeconomic and health challenges, the empirical study that linked health, particularly infectious respiratory disease and the built environment in Indonesia is limited. The recent research mainly focused on the socioeconomic and health behaviour determinants of respiratory infectious disease and neglected the role of the built environment (Fathmawati, Rauf, & Indraswari, 2021; Oktaria et al., 2021; Windi et al., 2021). They found that age, parent's occupation, household wealth, place of residence, vitamin D deficiency, and water sources are associated with respiratory infection among children under five years in Indonesia. Our study complements the existing literature by investigating the association between respiratory infectious disease and the built environment in Indonesia and providing potential channels, such as housing, neighbourhood condition and health facilities.

Data and identification

Data

The study employs socioeconomic and health data from IFLS3 (2000), IFLS4 (2007), and IFLS5 (2014). We do not include IFLS1 as IFLS1 does not provide information related to runny nose and difficulty breathing, which is important to define our main interest variable (respiratory infection symptom). We also find incomparable variables in IFLS1 with other waves, such as the breakdown of education level. Also, we exclude the IFLS2 since the data were collected when a severe economic crisis started in Indonesia might affect the socio-health-economic condition, including health. The IFLS data provides longitudinal panel data, covers over 30,000 respondents from 13 out of 34 provinces in Indonesia, and represents 83% of the Indonesian population with detailed sub-district information. The first wave survey was conducted in 1993, followed by the second and third in 1997 and 2000. The total number of households interviewed was about 10.000 households (with around 39,000 individuals) in wave 3; 13,000 (with around 43,000 individuals) in wave 4; and 16,000 households (with around 50,000 individuals) in wave 5. The IFLS produces detailed information at the individual, household, and community level covering socioeconomics and demographic condition, and health to risk-averse behaviour. Since the self-health assessment data in the IFLS is only available for individuals aged 15+ and the TB information is only available for individuals aged 40+, thus our sample was 15 years old or older interviewed in the three waves (waves 3–5). We use sub-district ID to combine IFLS and sprawl index data. Thus, the individual data is nested in sub-district and time (wave). We combine sprawl index, pollution, and health facilities data similarly. The decision to link the sprawl index with IFLS data at the subdistrict level is due to the data availability, as data on the household location is only available at the subdistrict level. After combining the IFLS data with sprawl index data and taking only observations with no missing values, we get 9758 individuals' and 7021 households' panels and 1070 sub-districts in our samples. We restricted the sample to non-migrant individuals to accommodate the unobservable at the sub-district level, affecting the individual location choice. After the restriction, our observations reduced considerably to around 9279 individuals, 5728 households, and 733 sub-districts. We also observed the community and facilities data in the corresponding waves for further explanation. The community questionnaires were asked of the community leader. In total, we have 170 communities in those three waves. Like the individual data, the household and neighbourhood data are nested in sub-district and time (wave). Previous research uses various ways when assessing health outcomes, among others conducting a self-reported assessment, medical record, and prescription record analysis (Barbara et al., 2012; Brinkhues et al., 2018). This study employs a self-reported assessment of respiratory infectious diseases from IFLS since formal or medical report data with socioeconomics information in Indonesia is unavailable. For the TB, we use a proxy of self-reported TB medication consumption and medical diagnosis in the related year. The TB information is only available in IFLS4 (2007) and IFLS5 (2014) for respondents aged 40+. Since the IFLS does not provide information on pneumonia and non-specific acute respiratory infectious diagnosis, we adopt the symptom identifications in IFLS following the RISKESDAS report. In the RISKESDAS report, individuals are suspected of having pneumonia symptoms if they have had a high fever, cough, and difficulty breathing during the last year when surveyed. Individuals are suspected of having non-specific acute respiratory infection symptoms when they had a high fever, runny nose, cough, or a sore throat during the previous two weeks when surveyed. Thus, the study utilizes similar criteria of the RISKESDAS to identify pneumonia and the non-specific acute respiratory infection symptoms in the IFLS. Some literature also applied a similar approach (Brinkhues et al., 2018; Windi et al., 2021). We include individual and household characteristics as control variables, such as age, working status, marital status, education level, gender, household members, and the head of the household. Since gender is time-invariant, we exclude gender in the fixed effect estimation. This study's main variable of interest is the built environment, measured by the sprawl index following Burchfield et al. (2006) at the sub-district level. When constructing the sprawl index, we use land cover maps from ESA CCI in the related year with a spatial resolution of 300mx300 m. The data is available yearly; thus, we can combine it with the IFLS data. We classify the urban and non-urban settlements to define the developed and undeveloped areas within a 5 km radius of the individual grid cells. Forty years timespan (2000–2014) allow a considerable change in the sprawl index that should lead to sufficient variation in the data. The study applies a moderate radius, 5 km, to gain the variability of the sprawl index, especially in the sub-district dominated by undeveloped areas, and to capture the neighbourhood effects, such as transportation modes and pollution. Previous studies also use a similar radius when measuring the built environment (Surya et al., 2021; Zhang, Ning, Chen, & Yang, 2021).Fig. A2 escribes how we calculate the sprawl index.

Empirical strategy

Our empirical strategy is as follows. First, we measure the sprawl index at the sub-district level where the individuals live when surveyed. Then, we observe the same individuals in three waves. We propose an individual fixed effect and wave fixed effect to account for time-invariant unobservable and time trends. Thus, our main estimation: For household channels, we also use panel fixed effect as follows For neighbourhood channels, we use ordinary least squares (OLS) due to small observations as follows

Dependent variable

The dependent variable of this study is . is an indicator variable of getting the respiratory infectious disease symptoms of individual i in sub-district j at time or wave t. The is a binary variable, assign a value for 1 if an individual has a positive diagnosis or symptoms of respiratory infectious diseases, and 0 otherwise. is an indicator variable of housing crowdedness or housing ventilation adequacy of household h in sub-district j at time or wave t. Housing crowdedness is measured by dividing household members by housing floor size. Ventilation adequacy is a binary variable (1 for good ventilation and 0 otherwise). is an indicator variable of the existence of transportation modes in community c, in sub-district j at time or wave t. consist of three indicators, the existence of public transport, transport mode to the station, bus stop, terminal, and transport mode to market. The existence of public transportation with three or four wheels or motorboat service is defined as 1 if available and 0 otherwise. Similarly, the transport mode was defined as 1 if use two-wheeled motor vehicles, pedicabs, bicycles, and on foot and 0 otherwise.

Variable of interest and controls

Our variable of interest in this study is . is a sprawl index of sub-district j in a certain radius at time or wave t. We use a 5 km radius for our baseline estimate. The sprawl index ranged from 0 to 100, with a higher index indicating a higher sprawl or scatterer neighbourhood. Coefficient captures the relationship between the sprawl index and the likelihood of having respiratory infectious diseases. Coefficient describes the relationship between the sprawl index and housing crowdedness or ventilation adequacy. Coefficient accounts for the relationship between the sprawl index and the transportation facility in the community. By employing the fixed effect, we can analyze how the change in sprawl index correlates with the health outcomes of the same individual over time. This study also employs , a vector control variable for individual i in sub-district j at time t that might relate to the diseases such as age, working status, marital status, education level, and households condition following the existing research (Hamidi, Sabouri, & Ewing, 2020; Miandad, Anwar, Ahmed, Rahman, & Khan, 2019; Murad et al., 2019; L. Wang, Chen, Sun, Yang, & Li, 2020). is an individual fixed effect to control individual time-invariant unobservable factors that could confound the relationship between health outcomes and built environment form, and is a wave-island fixed effect or wave-island interaction to absorb heterogeneity at the island level that varies across years. We also include the survey wave fixed effect, , to account for any possible changes over time, and is an error term. Similarly, , is a vector control variable for household h in sub-district j at time t, i.e., age of the household head, gender of the household head, marital status of the household head, number of household members, and years of schooling of the household head. is a household fixed effect and is an error term. is a vector control variable for community c in sub-district j at time t. consists of road condition (1 if asphalt or cement roads, 0 otherwise) and percentage of households with electricity, and the existence of industry or factory or plant (1 if any, 0 otherwise) and is an error term. We use a fixed effect approach in our main estimation to reduce the endogeneity problems associated with individual self-selection. Individuals with certain health statuses might choose their residential locations. For instance, individuals with respiratory diseases might consciously choose to live in cleaner neighbourhoods that are located in sprawled areas. The study also restricts the sample to non-migrants to ensure the non-existence of self-selection across sub-districts. All standard errors in this model are clustered at the sub-district level to deal with the potential serial correlation due to exogenous shock across sub-district that is uncorrelated with sprawl index but explains the likelihood of getting infectious respiratory diseases in all sub-districts. We also conduct several robustness checks in our model to test the consistency of our baseline results. First, the study applies different radii of the sprawl index. We estimate the model in 1 km and 10 km radii. Second, since we have a binary variable of we employ fixed-effect logistic regression. Third, underlining the possibility of endogeneity problems, we test the estimate using a two-stage least square (TSLS) with internal instruments adopted from Baum and Lewbel (2019). Last, we omit 10% of the oldest observations to absorb the possibility of the extreme value of observations. According to the background and the empirical results of the relationship between the built environment and the likelihood of getting respiratory infectious diseases, we hypothesize that the built environment and the possibility of getting respiratory infectious diseases are correlated and statistically significant.

Results

The relationship between the built environment and respiratory diseases

Fig. 1 illustrates the negative trend of sprawl index and respiratory infectious diseases. The figures suggest that the sprawler areas are associated with a lower rate of respiratory infectious diseases. The point-biserial correlation (r) also shows negative correlations between sprawl index and respiratory infectious diseases. Fig. 1c notes that non-specific respiratory infection has the highest correlation with sprawl index than other mentioned respiratory infectious diseases. Conversely, TB has the lowest correlation with sprawl index among the diseases. The descriptive statistics in Tables A1 and A2 also clarify that the proportion of individuals with infectious respiratory symptoms is higher in a more compact sub-district (the sprawl index is less than the median) than in a sprawling sub-district (the sprawl is higher than the median). The table also indicates the difference in individual characteristics between compact and sprawl sub-district. The more compact sub-district has a lower proportion of married and not working individuals, higher years of education, and higher household members.
Fig. 1

The relationship between sprawl index and respiratory infectious disease

Notes: Fig. 1 is a binned scatter plot of sprawl index and individuals' respiratory infectious diseases at the sub-district level. The binned scatter group the sprawl index into 20 equally sized bins. r is the point biserial correlation coefficient.

The relationship between sprawl index and respiratory infectious disease Notes: Fig. 1 is a binned scatter plot of sprawl index and individuals' respiratory infectious diseases at the sub-district level. The binned scatter group the sprawl index into 20 equally sized bins. r is the point biserial correlation coefficient. Table 4 presents the results of fixed effect estimation. We start the analysis by identifying the link of sprawl index on the likelihood of having respiratory infectious diseases from a 5 km radius around the neighbourhood with individual characteristics' control in the first column. The result shows that the sprawl index is negatively correlated with the likelihood of respiratory infectious illness symptoms by 0.2% per one percentage point of the sprawl index. The coefficient is statistically significant at the 1% level. Then, we gradually introduce the baseline controls in columns (2)–(4) based on equation (1). The coefficients decrease slightly to 0.17% with lower significance.
Table 4

The relationship between sprawl and respiratory infectious disease, baseline estimates.

FE: respiratory infection
(1)(2)(3)(4)(5)
Sprawl index 5 km radius−0.0020∗∗∗−0.0020∗∗∗−0.0012∗−0.0017∗∗−0.0025∗∗∗
(0.0007)(0.0007)(0.0007)(0.0007)(0.0009)
Age−0.0029∗∗−0.0028∗∗−0.0029−0.0032−0.0048
(0.0014)(0.0013)(0.0029)(0.0029)(0.0038)
Age-squared0.00004∗∗0.00004∗∗0.000020.000020.00003
(0.00001)(0.00001)(0.00002)(0.00002)(0.00002)
Years of schooling0.00220.00230.00280.00280.0044
(0.0020)(0.0021)(0.0021)(0.0021)(0.0029)
Marital = married−0.0143−0.0155∗−0.0135−0.0133−0.0048
(0.0091)(0.0091)(0.0091)(0.0091)(0.0122)
Working = yes0.00690.00660.00890.00910.0049
(0.0067)(0.0067)(0.0066)(0.0066)(0.0089)
Number of household members−0.0020∗−0.0003−0.0004−0.0015
(0.0011)(0.0011)(0.0011)(0.0017)
Years of schooling of the household head−0.00020.00010.0001−0.0017
(0.0009)(0.0009)(0.0009)(0.0013)
Constant0.3010∗∗∗0.3040∗∗∗0.2660∗∗0.3410∗∗∗0.4380∗∗∗
(0.0685)(0.0684)(0.1080)(0.1140)(0.1540)
Wave fixed effectNoNoYesYesYes
Wave-island fixed effectNoNoNoYesYes



SampleAllAllAllAllNon-migrant
Mean of dep variable0.1120.1120.1120.1120.111
Adjusted R20.0020.0020.0070.0080.008
Observation2644226442264422644219748

Notes: The sample in columns (1)–(4) is individuals aged 15+ and in column (5) is non-migrant individuals aged 15+. The study defined non-migrants if the individual did not move to another sub-district across waves. In other words, the individual had the same sub-district address during the three survey waves. All samples were obtained from the three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individuals showed respiratory infectious disease symptoms or diagnosis. For comparison, the study provides baseline estimation with cross-section person weight 2000 and longitudinal person weight in Table A3, Appendix. The inverse probability weighting estimates have similar directions and slightly higher coefficients. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

The relationship between sprawl and respiratory infectious disease, baseline estimates. Notes: The sample in columns (1)–(4) is individuals aged 15+ and in column (5) is non-migrant individuals aged 15+. The study defined non-migrants if the individual did not move to another sub-district across waves. In other words, the individual had the same sub-district address during the three survey waves. All samples were obtained from the three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individuals showed respiratory infectious disease symptoms or diagnosis. For comparison, the study provides baseline estimation with cross-section person weight 2000 and longitudinal person weight in Table A3, Appendix. The inverse probability weighting estimates have similar directions and slightly higher coefficients. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10. The model in columns (1)–(4) might still suffer from the sub-district unobserved heterogeneous variables that might influence individuals' location preferences, for instance, the eviction of residential areas for local infrastructure development. Hence, we restrict our sample to non-migrant individuals in column (5), resulting in a higher coefficient. With the non-migrant sample, one percentage point expansion of the sprawl index corresponds to the lower likelihood of being ill by 0.25%. The coefficient is statistically significant at the 1% level. Ignoring the unobservable at the sub-district level could underestimate the results. We explain that health facilities in each sub-district might confound the relationship. To hold our argument, the study correlates the sprawl index with health facilities, such as the number of hospitals, community health centres, clinics, practitioners, and pharmacies in Fig. A1. The figures depict that denser (less sprawl) areas experience better health facilities, supporting our discussion of a potential confounder between sprawl index and health outcome. Based on the empirical strategy, we will focus on the non-migrant sample with full controls as suggested in column (5) to minimize the potential omitted variable biased. We also examine the link between the sprawl index and the possibility of having infectious diseases by disease type, non-specific acute respiratory diseases, pneumonia, and tuberculosis in Table 5. The non-specific acute respiratory infections and pneumonia coefficient are consistent with baseline results when applying the 5 km radius. Columns (1) and (2) report that a one percentage point addition of the sprawl index corresponds to a lower probability of getting non-specific acute respiratory infections and pneumonia by 0.26% and 0.07%. Based on the coefficient estimate, the sprawl index contributes around 7% of the national non-specific acute respiratory infection prevalence and 0.12% of national pneumonia prevalence when simulating the coefficients to the national prevalence.
Table 5

The relationship between sprawl and respiratory infectious disease by disease's type.

non-specific acute respiratory infection
pneumonia
tuberculosis
(1)(2)(3)
Sprawl index 5 km radius−0.0026∗∗∗−0.0007∗0.00003
(0.0008)(0.0004)(0.0003)
Age−0.0049−0.0001−0.0004
(0.0039)(0.0023)(0.0027)
Age-squared0.000030.000020.000003
(0.00002)(0.00001)(0.000008)
Years of schooling0.00380.00060.0004
(0.0028)(0.0014)(0.0010)
Marital = married−0.00410.0026−0.0003
(0.0121)(0.0056)(0.0033)
Working = yes0.0059−0.00270.0009
(0.0087)(0.0041)(0.0029)
Number of household members−0.0021(0.0016)0.00001(0.0008)0.0011(0.0007)
Years of schooling of the household head−0.0015(0.0013)0.0003(0.0006)0.0001(0.0003)
Constant0.4410∗∗∗0.0481−0.0020
(0.1540)(0.0805)(0.1220)
Wave fixed effectYesYesYes
Wave-island fixed effectYesYesYes
SampleNon-migrantNon-migrantNon-migrant
Mean of dep variable0.1060.0240.004
Adjusted R20.0070.0050.004
Observation197451974514643

Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, or 2014), with an outcome equal to 1 if the individual showed non-specific acute respiratory infection or pneumonia symptoms or consumed TB medication or diagnoses with TB. The TB information is only available in IFLS4 and IFLS5 and was asked for individuals aged 40+. The sprawl index was calculated with a 5 ​km radius. We also control for health facilities (e.g.number of hospitals, number of community health care (Puskesmas), and number of Supporting Puskesmas) to account for access to health facilities that might be associated with the sprawl index. The health facilities data is obtained from Potential Village data in 1999, 2006, and 2014 (the closest year of IFLS) since the IFLS data do not provide detailed information on health facilities at the sub-district level. The result is similar to the model without health facilities control estimation, suggesting that this problem might be less prevalent in our model. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denote significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

The relationship between sprawl and respiratory infectious disease by disease's type. Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, or 2014), with an outcome equal to 1 if the individual showed non-specific acute respiratory infection or pneumonia symptoms or consumed TB medication or diagnoses with TB. The TB information is only available in IFLS4 and IFLS5 and was asked for individuals aged 40+. The sprawl index was calculated with a 5 ​km radius. We also control for health facilities (e.g.number of hospitals, number of community health care (Puskesmas), and number of Supporting Puskesmas) to account for access to health facilities that might be associated with the sprawl index. The health facilities data is obtained from Potential Village data in 1999, 2006, and 2014 (the closest year of IFLS) since the IFLS data do not provide detailed information on health facilities at the sub-district level. The result is similar to the model without health facilities control estimation, suggesting that this problem might be less prevalent in our model. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denote significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10. Meanwhile, the association of the sprawl index with the TB disease in column 3 is insignificant. We argue that the smaller sample size reduces the statistical power. Compared to other infectious diseases, TB's observation numbers are only three-quarters, and the average number of respondents with the disease is only 0.4% of the total sample.

Sample heterogeneity

Our models suggest that sprawl could reduce the likelihood of getting respiratory infectious diseases. But the results might be heterogeneous between sub-samples, so we checked some sub-samples, the urban-rural sub-sample and the male-female sub-sample.

Urban-rural heterogeneity

We examine the relationship between the sprawl index and the likelihood of contracting respiratory infectious diseases in urban-rural areas. The different characteristics and amenities between urban and rural areas might explain the mixed relationship. Table 6, columns (1)–(2) infer that the link differs between urban and rural areas. The coefficient of the urban sample in column (1) indicates the 0.27% reduction in the probability of being ill for a one percentage point increase in the sprawl index, and the coefficient is statistically significant at a 1% level. In other words, the compact built environment is associated with respiratory disease symptoms in urban areas. In comparison, the rural areas' coefficient in column (2) is positive but insignificant. The results support our earlier argument that the relationship between built environment forms and health outcomes in more compact areas, like urban areas, is more significant.
Table 6

Sample heterogeneity: areas and gender.

Areas
Gender
urban
rural
male
female
(1)(2)(3)(4)
Sprawl index 5 km radius−0.0027∗∗∗0.0028−0.0013−0.0034∗∗∗
(0.0010)(0.0029)(0.0012)(0.0012)
Age−0.0061−0.0055−0.0048−0.0047
(0.0055)(0.0062)(0.0061)(0.0052)
Age-squared0.000020.00005∗0.000020.00003
(0.00003)(0.00003)(0.00003)(0.00003)
Years of schooling0.0059∗0.0029−0.00150.0109∗∗∗
(0.0035)(0.0054)(0.0044)(0.0035)
Marital = married0.0027−0.0061−0.00610.0007
(0.0152)(0.0263)(0.0188)(0.0145)
Working = yes0.0060−0.0039−0.00470.0071
(0.0124)(0.0141)(0.0181)(0.0094)
Number of household members−0.0014(0.0021)−0.0020(0.0036)−0.0036(0.0026)−0.0001(0.0020)
Years of schooling of the household head−0.0021(0.0017)−0.0005(0.0025)−0.0044∗(0.0024)0.0001(0.0016)
Constant0.1950−0.01460.29000.6340∗∗∗
(0.2470)(0.3330)(0.2270)(0.2310)
Wave fixed effectYesYesYesYes
Wave-island fixed effectYesYesYesYes
SampleNon-migrantNon-migrantNon-migrantNon-migrant
Mean of dep variable0.1180.1000.1120.111
Adjusted R20.0080.0140.0090.010
Observation123317417859911149

Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individual showed respiratory infectious disease symptoms or diagnosis. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

Sample heterogeneity: areas and gender. Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individual showed respiratory infectious disease symptoms or diagnosis. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

Gender heterogeneity

Additionally, we estimate the relationship between gender in Table 6, columns (3)–(4), to account for the variation of biological characteristics between males and females that might affect the probability of getting respiratory diseases (Groeneveld et al., 2020). Our results show that the sprawl and health outcomes linkage between gender is similar but seem more susceptible for females as the coefficient for the female category in column (4) is higher than the coefficient for the male in column (3). In column (4), a one percentage point of sprawl index is related to the lower probability of contracting the diseases by 0.34% for females. The association is insignificant for the male sub-sample.

Channels

This section provides two general potential mechanisms to what extent sprawl index is associated with respiratory infectious diseases. We test the land rent theory by Mills (1972) and Muth (1969) and transportation modes choice by McFadden (1974) to explain the mediators.

Household effect: housing crowdedness and housing ventilation

The land rent in dense areas is higher than in less dense areas since the land is economically more productive (Mills, 1972; Muth, 1969). As a result, a moderate income is only able to afford smaller and lower-quality housing in dense areas. The higher housing crowdedness could speed up the transmission of infectious diseases (Hu, Roberts, Azevedo, & Milner, 2021). Poor housing ventilation also contributes to disease transmission (Luongo et al., 2016). Reversely, people could choose a bigger and better housing quality by living further away from dense areas and having a lower risk of contracting diseases. To support the argument of housing-related changes, we assess the linkage of the sprawl index on housing crowdedness and housing ventilation in different areas (urban vs rural) following Equation (2). We use the housing conditions as a potential channel to explain whether the built environment is associated with illness due to respiratory infectious diseases by area in Table 7, columns (1)–(4). Table 7 shows some evidence of the link between the sprawl index and household conditions. The results present that the sprawl index is negatively correlated with the housing crowdedness in urban and rural areas (columns 1 and 2) but insignificant. The coefficients indicate that the sprawled neighbourhood in urban areas is associated with lower housing crowdedness, but the ventilation adequation is better in rural areas than in urban areas with a higher sprawl index (columns 2 and 4). The mechanism results partly support our baseline estimates.
Table 7

Channels: Household effects.

Household effect
Urban
Rural
crowded
ventilation
crowded
ventilation
(1)(2)(3)(4)
Sprawl index 5 km radius−0.0004(0.0009)0.0004(0.0065)−0.0006(0.0022)0.0269∗∗(0.0137)
Age of the household head−0.0018(0.0027)0.0224∗(0.0135)−0.0029(0.0021)0.0037(0.0170)
Age of the household head-squared0.00002(0.00003)−0.0002(0.0001)0.00002(0.00002)−0.00004(0.0002)
Gender of the household head = male0.0134(0.0167)0.1210(0.0939)−0.0346∗∗∗(0.0132)0.1770(0.1540)
Marital status of the household head = married0.0016(0.0178)−0.1110(0.1080)0.0373∗∗(0.0155)−0.2070(0.1490)
Number of household members0.0053∗∗(0.0022)0.0036(0.0086)0.0209∗∗∗(0.0038)0.0096(0.0098)
Years of schooling of the household head0.0006(0.0017)0.0063(0.0068)−0.0025(0.0016)−0.0002(0.0111)
Constant0.08840.35200.1310−1.8200
(0.0953)(0.5960)(0.1880)(1.3340)
Wave fixed effectYesYesYesYes
Wave-island fixed effectYesYesYesYes
SampleHouseholdHouseholdHouseholdHousehold
Mean of dep variable0.1030.8010.0890.812
Adjusted R20.0550.0560.4490.204
Observation3631363324812484

Notes: The household sample is the household where our individual sample is located. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

Channels: Household effects. Notes: The household sample is the household where our individual sample is located. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

Neighbourhood effect: neighbourhood transportation modes and air pollution

We also explain the channel of sprawl index and the likelihood of having respiratory infectious diseases via transport mode choice following Equation (3). The rationale is residents in the densely built environment have several options of mass public transport or high occupancy vehicles, allowing intense physical contact among the passengers. Since a respiratory virus or pathogen could transmit from humans to humans by direct body contact or droplet (Xu, Tian, Sabel, & Xu, 2019), public mass transport or high occupancy vehicles increases the risk of disease contamination. Unfortunately, the individual transport modes choice data is unavailable in the IFLS; thus, we used some approaches at the community level or equal to the village. From community and facility data, we could identify the villagers' transportation modes for daily activities. Table 8 columns (1)–(6) illustrate the association between the sprawl index and neighbourhood situations. Columns (1) and (4) show the link between the sprawl index and the availability of public transport with three or four wheels or motorboat service. The results suggest the higher sprawl index in urban areas correlates with less public (mass) transport with three or four-wheel or motorboat service. We also categorize transportation modes into single and high-occupancy vehicles. We assume that high occupancy vehicles have a greater risk of infection than single-occupancy vehicles as they could hold two or more passengers. To be more specific, we estimate the relationship between the sprawl index and the type of transportation modes to reach the nearest bus stop, terminal, station, or pier in columns (2) and (5) and the nearest market in columns (3) and (6). We define two categories of transportation modes, 1 for single-occupancy vehicles and 0 for high occupancy vehicles, privately or publicly owned. In urban areas, all coefficients are positive and significant in 5 km, implying that the sprawl index is positively related to the use of single-occupancy vehicles, such as two-wheeled motor vehicles, pedicabs, bicycles, and on foot to reach sub-district centres. The findings support our baseline estimates where the relationship is more evident in urban areas.
Table 8

Channel: Neighbourhood effect.

Neighbourhood effect
Urban
Rural
public transport
transport mode to the station, bus stop, terminal
transport mode to market
public transport
transport mode to the station, bus stop, terminal
transport mode to market
(1)(2)(3)(4)(5)(6)
Sprawl index 5 km radius−0.0014∗∗∗(0.0005)0.0029∗∗∗(0.0006)0.0019∗∗∗(0.0007)−0.0046∗∗(0.0021)0.0010(0.0021)−0.0015(0.0039)
Road = asphalt or cement0.4300∗∗∗(0.1500)−0.1570(0.1410)0.0019(0.1570)0.2450∗∗(0.1040)−0.1780(0.1200)−0.2110∗(0.1080)
Electricity (%)0.00160.0019−0.0059−0.0043∗∗0.0059∗∗∗0.0044∗∗∗
(0.0037)(0.0034)(0.0047)(0.0019)(0.0018)(0.0016)
Industry/factory/plant = yes0.0310(0.0425)−0.0451(0.0447)0.0217(0.0487)0.0399(0.0675)−0.1300∗(0.0739)−0.0068(0.0671)
Constant0.32400.07450.69201.1830∗∗∗0.01510.2350
(0.3750)(0.3590)(0.4740)(0.2470)(0.2510)(0.4020)
SampleCommunityCommunityCommunityCommunityCommunityCommunity
Mean of dep variable0.8470.2370.2460.6130.3930.307
Adjusted R20.0500.0510.0300.0320.0310.017
Observation452452452163163163

Notes: The community sample is obtained from the community and facilities data of IFLS. The questionnaires were asked of the community leader. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

Channel: Neighbourhood effect. Notes: The community sample is obtained from the community and facilities data of IFLS. The questionnaires were asked of the community leader. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10. In addition, previous studies show a linkage between air pollution and the transmission and severity of respiratory infectious diseases. They suggest that the interaction between pollution concentration and respiratory viruses could negatively affect the respiratory system (Bereitschaft & Debbage, 2013; Domingo & Rovira, 2020). As planned compactly built environments might have lower pollution levels than unplanned compactly built environments (Hankey & Marshall, 2017; F. Li & Zhou, 2019; She et al., 2017), the impact of pollution on diseases could be different. Hence, we evaluate the association between the sprawl index and outdoor pollution at the sub-district level as a potential mediator. In Fig. A3 – A5, we provide the scatter plot and Pearson correlation coefficients of sprawl index and three airborne particulates, i.e., NO2, SO2, and CO. Due to data unavailability in the same study year, we exploit the data at the sub-district level in the closest year, 2018. The air particulates are calculated in mol per m2 unit. Unfortunately, we could not directly define the urban-rural sub-sample at the sub-district since the categorization is conducted at the village level. So, we match the pollution data at the sub-district level with the IFLS sample and classify it into an urban-rural subsample. Fig. A3 shows that the sprawl index generally has a negative trend with the three air particulates, which implies that the areas with higher sprawl indexes are linked to lower airborne particulates or higher air quality. The patterns are relatively similar between urban and rural areas (Fig. A4 – A.5). The NO2 seems to have the highest negative linkage with the sprawl index. The findings could explain how the sprawl index is correlated with the probability of having infectious respiratory symptoms.

Robustness check: different radii, method alternatives, observations omission

We perform a robustness check to see whether our results are affected in any way by identifying the relationship between the sprawl index and the likelihood of contracting respiratory infectious diseases in multiple aspects, which indicates that our results consistently maintain the negative association. Table 9, columns (1) and (2) present the estimation within a 1 km and 10 km radius. The coefficients' direction is similar to the baseline results, with a lower coefficient and statistical power in a 1 km radius and a higher coefficient in a 10 km radius than the baseline estimates.
Table 9

The relationship between sprawl and respiratory infectious disease on various radii, methods, and sample.

Dependent variable: respiratory infectious diseasesRadii
Method alternative
Dropping Sample
1 km
10 km
panel logistic FE
Lewbels internal instrument
drop 10% the eldest
(1)(2)(3)(4)(5)
Sprawl index 1 km radius−0.0009
(0.0006)
Sprawl index 5 km radius−0.0302∗∗∗−0.0006∗∗∗−0.0028∗∗∗
(0.0105)(0.0002)(0.0009)
Sprawl index 10 km radius−0.0042∗∗∗
(0.0011)
Marginal effect−0.0011



Age−0.0045−0.0050−0.0558−0.0016−0.0044
(0.0038)(0.0038)(0.0402)(0.0012)(0.0044)
Age-squared0.000030.000030.00030.000010.00001
(0.00002)(0.00002)(0.0002)(0.00001)(0.00002)
Years of schooling0.00440.00440.0565−0.0016∗0.0048
(0.0029)(0.0029)(0.0349)(0.0009)(0.0029)
Gender = male0.0044
(0.0054)
Marital = married−0.0041−0.0047−0.0661−0.0117∗−0.0114
(0.0121)(0.0122)(0.1310)(0.0066)(0.0132)
Working = yes0.00460.00490.03800.00050.0059
(0.0089)(0.0089)(0.1010)(0.0064)(0.0094)
Number of household members−0.0015−0.0015−0.02080.0006−0.0015
(0.0017)(0.0017)(0.0208)(0.0009)(0.0016)
Years of schooling of the household head−0.0017−0.0016−0.0160−0.0005−0.0019
(0.0013)(0.0013)(0.0159)(0.0007)(0.0014)
Constant0.2960∗∗0.5940∗∗∗0.2520∗∗∗0.4540∗∗∗
(0.1410)(0.1690)(0.0352)(0.1670)
Wave fixed effectYesYesYesYesYes
Wave-island fixed effectYesYesYesNoYes



SampleNon-migrantNon-migrantNon-migrantNon-migrantNon-migrant
Mean of dep variable0.1110.1110.1110.112
Adjusted R20.0070.0080.0090.008
Observation197481974837951974817977

Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individual showed non-specific acute respiratory infection or pneumonia symptoms or consumed TB medication or diagnosed with TB. Standard errors clustered at the sub-district level were reported in parentheses. A. Asterisks denote significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

The relationship between sprawl and respiratory infectious disease on various radii, methods, and sample. Notes: The sample is non-migrant individuals aged 15+ from three waves of the IFLS survey (2000, 2007, and 2014), with an outcome equal to 1 if the individual showed non-specific acute respiratory infection or pneumonia symptoms or consumed TB medication or diagnosed with TB. Standard errors clustered at the sub-district level were reported in parentheses. A. Asterisks denote significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10. We also test our estimation with a fixed-effect logistic model. The result in column (3) shows that the logistic fixed effect approach has a negative and significant coefficient comparable with our baseline estimates, despite the lower marginal effect value. The lower effect might be due to the unchanged outcome across waves causing considerable observation omission in panel logistic estimation, thus affecting the marginal effect value. Considering the potential endogeneity issue, we also estimate our model using the Lewbel internal instrument following Baum and Lewbel (2019). The result in column (4) gives similar evidence with a smaller coefficient than the baseline estimate. It makes sense as the generated instrument yields higher standard errors affecting the coefficient's magnitude (Baum & Lewbel, 2019). Additionally, we observe the model by dropping the eldest age in each wave by 10% in column (5). The results are comparable to our baseline estimates that indicate the sprawler area has a lower rate of respiratory infectious diseases.

Discussion

Several studies identify the empirical evidence of how built environments are linked to health outcomes (Ewing et al., 2014; Zhao & Kaestner, 2010). But the study investigating the relationship between the built environments and the likelihood of contracting respiratory infectious diseases are scarce, particularly in Indonesia, with considerably built environment changes and persistent infectious diseases. The prior research on respiratory infectious diseases in Indonesia mainly focused on the socioeconomic and health behaviour determinants of respiratory infectious disease and neglected the role of the neighbourhood. The previous study is also limited in providing the mediator on how the built environment could correlate with health outcomes and identifying the finer built environment measurement. Thus, this study resolves those limitations by investigating the quantitative relationship between the built environment and respiratory infectious diseases and measuring quantitative calculations of potential channels on how the built environment relates to respiratory infectious diseases. Our research estimates the built environments via a sprawl index calculated from land cover satellite data. We apply a fixed effect method to accommodate time-invariant unobservable factors and use the IFLS longitudinal data to deal with time-varying confounders. Controlling time-invariant and time-variant unobservable variables makes our results closer to causal identification than simple OLS on cross-sectional data. The results find that a 1% increase in sprawl index generally is related to a lower likelihood of contracting respiratory infectious diseases, about 0.25%. The magnitudes vary across areas and gender, with urban areas and female sub-samples showing more evidence than rural areas and male sub-samples. The higher relationship in urban areas could be caused by the compactness level between urban and rural areas, with urban areas being more compact than rural areas. Also, the differences in commuting duration and the number of modes of transportation between gender might explain the higher association between sprawl and infectious diseases in the female sample. Commuter survey data of Jakarta Metropolitan Areas in 2014 shows that the average commuting duration between males and females aged 15+ is similar, 62 ​min for males and 60 ​min for females. When estimating the commuting's average number of transportation modes, female commuters have a higher average number of transportation modes than male commuters, almost 3 for females and 2 for males (Statistics of Indonesia, 2014). In disease transmission, the number of transportation modes taken might explain why females have a higher prevalence of respiratory infectious diseases. The findings also correspond to Groeneveld et al. (2021), revealing that females have higher respiratory symptoms. Despite the lower coefficient, the result is consistent with previous research, which found that a compact built environment is positively related to a higher rate of respiratory infectious diseases (B. Li, Ma, & Zhang, 2021; Yip, Huang, & Liang, 2021). The study provides several robustness tests to check the result's reliability by applying different radii of the sprawl index, employing alternative methods, and omitting the eldest 10% of observations. The results suggest that the relationship is more robust with a wider radius and lower age range. The study also reveals that compact neighbourhoods benefit from better health facilities but are insufficient compared to the population. The current Puskesmas-population ratio (1:20000) is under the government minimum standard (1:16000), and among Southeast Asia countries, Indonesia has the second lowest ratio of physicians and population (World Bank, 2019). However, the study does not suggest that the sprawling-built environment is better than the compactly built environment in the context of contamination of infectious diseases. Instead, densification policies should consider health effects. Our results could be subjects of public health intervention. The built environment, population characteristics, neighbourhood infrastructure, individual characteristics, and socioeconomic status must be simultaneously considered when deploying non-pharmacological public health intervention. Non-pharmaceutical interventions, such as improving housing quality, particularly sufficient housing ventilation and capacity. Promoting better neighbouring quality by changing the population behaviour that produces fewer pollutants, such as using electricity wisely and using or riding eco-friendly transportation, particularly in compact neighbourhoods, could prevent people from respiratory infectious diseases. Also, the densification policy should be accompanied by adequate and equal health facilities to prevent and control diseases. The government should consistently implement a minimum housing floor size policy, particularly in dense areas. So far, the government of Indonesia has issued the regulation related minimum floor size policy, but unfortunately, the implementation is still weak. The government could improve public transportation policies that support cleaner air and fewer physical contact. This study also has limitations. Firstly, the study might contain measurement errors when identifying the relationship between sprawl index and TB prevalence. TB could be systematically undiagnosed due to limited access to health facilities which might be associated with the sprawl index. Secondly, we could not provide individual transportation mode choices, impacting individuals' respiratory diseases at the sub-district level since the data is available at the district level. The cross-district movement behaviour might differ from sub-district movement behaviour; thus, including the mobility across districts could give misleading results. Thirdly, self-reported health data used in this study might be less accurate than medical or prescription reports. Those limitations are beyond our study, and we left them for future research improvement.

Conclusion

The study confirms that a sprawling neighbourhood is linked to lower respiratory infection symptoms. The link is more evident in urban areas and for females. The study also suggested that the linkage works through housing quality, such as housing crowdedness and ventilation, and neighbourhood conditions like neighbourhood transportation modes and air pollution levels. Sprawl neighbourhood is associated with less crowded housing and better housing ventilation, but the association is insignificant. Sprawl neighbourhoods are also related to higher air quality and single-occupancy vehicles used. Those sprawl neighbourhood characteristics could prevent people from contracting respiratory infectious diseases. Even though compact neighbourhood benefits from better health facilities, the quantity is still limited compared to the population. Thus, our results underline the need to consider the health consequences of the densification policy and determine the direction of landscape planning and policy.

Ethical statement

The authors used publicly available deidentified data for this analysis, thus this study has no ethical issues.

Funding and acknowledgements

Indriyani acknowledges the Indonesia Endowment Fund for Education/, Ministry of Finance, the Republic of Indonesia, for providing financial support for this study. The funding source had no role in the study design, data collection, data analysis, data interpretation or preparation of the manuscript.

Author statement

Witri Indriyani: conceptualization and design, methodology, analysis and interpretation, writing-original draft. Muhammad Halley Yudhistira: conceptualization and design, methodology, analysis and interpretation, writing-original draft. Prani Sastiono: conceptualization and design, methodology, analysis and interpretation. Djoni Hartono: conceptualization and design, methodology, analysis and interpretation.

Declaration of competing interest

None.
Table A.1

Descriptive statistics for all samples

All
≤ median of sprawl index 5 km radius
> median of sprawl index 5 km radius
obs.mean/%s.d.obs.mean/%s.d.obs.mean/%s.d.
Dependent variables
 Respiratory infectious diseases

Not showing symptom

2349088.84%1158287.60%1190890.08%

Showing symptom

295211.16%164012.40%13129.92%

Pneumonia

Not showing symptom

2581597.66%1286997.36%1294697.96%

Showing symptom

6192.34%3492.64%2702.04%
 Acute respiratory infection = yes

Not showing symptom

2362489.37%1165488.17%1197090.57%

Showing symptom

281010.63%156411.83%12469.43%
 Tuberculosis

Not showing symptom

1791699.54%908199.51%883599.58%

Showing symptom

820.46%450.49%370.42%
Independent variables
 Sprawl radius 5 km2644275.0830.001322253.1228.751322097.043.02
Individual characteristics
 Marital status = yes

Unmarried

590622.34%304723.04%285921.63%

Married

2053677.66%1017576.96%1036178.37%
 Working status = yes

Not working

723127.35%403330.50%319824.19%

Working

1921172.65%918969.50%1002275.81%
 Age2644242.0614.611322241.1414.281322042.9814.88
 Years of schooling264427.814.68132228.594.59132207.034.64
Household characteristics
 Number of household members264425.852.83132225.982.99132205.712.65
 Years of schooling of the household head264427.394.84132228.174.80132206.614.76
Table A.2

Descriptive statistics for the non-migrant sample

All
≤ median of sprawl index 5 km radius
> median of sprawl index 5 km radius
obs.mean/%s.d.obs.mean/%s.d.obs.mean/%s.d.
Dependent variables
 Respiratory infectious diseases

Not showing symptom

1755288.88%864487.45%890890.32%

Showing symptom

219611.12%124112.55%9559.68%
 Pneumonia

Not showing symptom

1927797.63%961697.29%966197.97%

Showing symptom

4682.37%2682.71%2002.03%
 Acute respiratory infection = yes

Not showing symptom

1764989.38%870288.04%894790.73%

Showing symptom

209610.62%118211.96%9149.27%
 Tuberculosis

Not showing symptom

1458199.58%732699.51%725599.64%

Showing symptom

620.42%360.49%260.36%
Independent variables
 Sprawl radius 5 km1974876.0329.88988554.6929.41986397.422.69
Individual characteristics
 Marital status = yes

Unmarried

402820.40%210721.32%192119.48%

Married

1572079.60%777878.68%794280.52%
 Working status = yes

Not working

486624.64%280328.36%206320.92%

Working

1488275.36%708271.64%780079.08%
 Age1974843.9813.67988543.1213.44986344.8413.84
 Years of schooling197487.494.6898858.314.6198636.674.60
Household characteristics
 Number of household members197486.102.8098856.282.9698635.912.63
 Years of schooling of the household head197487.114.7998857.904.7598636.334.70
Table A.3

The relationship between sprawl and respiratory infections, weighted baseline estimates

All sample
Non-migrant sample
respiratory infection
respiratory infection
non-specific acute respiratory infection
pneumonia
TB
cross-section person weight
longitudinal person weight
cross-section person weight
longitudinal person weight
cross-section person weight
longitudinal person weight
cross-section person weight
longitudinal person weight
cross-section person weight
longitudinal person weight
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)
Sprawl index 5 km radius−0.0024∗∗∗(0.0008)−0.0024∗∗(0.0010)−0.0034∗∗∗(0.0009)−0.0034∗∗∗(0.0010)−0.0034∗∗∗(0.0009)−0.0033∗∗∗(0.0010)−0.0009∗∗(0.0004)−0.0009∗∗(0.0004)−0.0001(0.0003)−0.0002(0.0004)
Age−0.0030−0.0012−0.0043−0.0039−0.0043−0.0035−0.00001−0.0006−0.0009−0.0023
(0.0032)(0.0045)(0.0041)(0.0054)(0.0041)(0.0055)(0.0025)(0.0034)(0.0028)(0.0028)
Age-squared0.00002−0.000010.000020.000010.000030.000010.000020.00002−0.0000010.000003
(0.00002)(0.00002)(0.00002)(0.00003)(0.00002)(0.00003)(0.00001)(0.00001)(0.00001)(0.00001)
Years of schooling0.0025−0.00020.00440.00130.00420.00100.0005−0.0002−0.00010.0004
(0.0022)(0.0027)(0.0029)(0.0039)(0.0028)(0.0038)(0.0014)(0.0017)(0.0010)(0.0011)
Marital = married−0.0123−0.0226∗∗−0.0069−0.0178−0.0047−0.01560.00060.0023−0.0045−0.0045
(0.0097)(0.0105)(0.0128)(0.0138)(0.0128)(0.0136)(0.0055)(0.0057)(0.0045)(0.0048)
Working = yes0.00890.00510.00920.01080.00930.0111−0.0037−0.00570.0009−0.0024
(0.0082)(0.0094)(0.0105)(0.0113)(0.0101)(0.0111)(0.0043)(0.0048)(0.0042)(0.0050)
Number of household members−0.0009(0.0011)−0.0003(0.0014)−0.0022(0.0017)−0.0017(0.0020)−0.0028∗(0.0016)−0.0025(0.0020)−0.0001(0.0009)0.00001(0.0008)0.0005(0.0006)0.0006(0.0007)
Years of schooling of the household head0.00005(0.0010)−0.0006(0.0012)−0.0019(0.0013)−0.0026(0.0017)−0.0019(0.0013)−0.0026(0.0017)0.0003(0.0006)0.0002(0.0006)0.0003(0.0003)0.0004(0.0004)
Constant0.4420∗∗∗0.4330∗∗0.5390∗∗∗0.5710∗∗∗0.5300∗∗∗0.5470∗∗∗0.06400.08930.05240.1070
(0.1310)(0.1720)(0.1650)(0.2060)(0.1640)(0.2070)(0.0974)(0.1230)(0.1310)(0.1270)
Wave fixed effectYesYesYesYesYesYesYesYesYesYes
Wave-island fixed effectYesYesYesYesYesYesYesYesYesYes
Mean of dep variable0.1080.1110.1060.1100.1020.1050.0230.0220.0040.004
Adjusted R20.0090.0110.0090.0110.0080.0090.0060.0050.0030.004
Observation26436207691974316148197401614519740161451464011444

Notes: The estimated use inverse probability weighting and use cross-section person weight year 2000 for all years (waves) and longitudinal person weight. Cross-section person weight in IFLS3 is constructed so that the estimates will represent the Indonesian population living in the 13 IFLS provinces at the time of IFLS3 in 2000. The longitudinal person weight is constructed so that the IFLS3 panel sample is representative of the Indonesian population living in the 13 IFLS provinces in 1993. Standard errors clustered at the sub-district level were reported in parentheses. Asterisks denoted significance: ∗∗∗p ​< ​0.01, ∗∗p ​< ​0.05, ∗p ​< ​0.10.

  31 in total

1.  The impact of AIDS, immigration and housing overcrowding on tuberculosis deaths in São Paulo, Brazil, 1994-1998.

Authors:  J L Antunes; E A Waldman
Journal:  Soc Sci Med       Date:  2001-04       Impact factor: 4.634

2.  Agreement between self-report and medical records on signs and symptoms of respiratory illness.

Authors:  Angela M Barbara; Mark Loeb; Lisa Dolovich; Kevin Brazil; Margaret Russell
Journal:  Prim Care Respir J       Date:  2012-06

Review 3.  The dark side of social capital: A systematic review of the negative health effects of social capital.

Authors:  E Villalonga-Olives; I Kawachi
Journal:  Soc Sci Med       Date:  2017-10-21       Impact factor: 4.634

4.  Pneumococcal carriage, density, and co-colonization dynamics: A longitudinal study in Indonesian infants.

Authors:  Chrysanti Murad; Eileen M Dunne; Sunaryati Sudigdoadi; Eddy Fadlyana; Rodman Tarigan; Casey L Pell; Emma Watts; Cattram D Nguyen; Catherine Satzke; Jason Hinds; Mia Milanti Dewi; Meita Dhamayanti; Nanan Sekarwana; Kusnandi Rusmil; E Kim Mulholland; Cissy Kartasasmita
Journal:  Int J Infect Dis       Date:  2019-06-25       Impact factor: 3.623

Review 5.  A review of rural and peri-urban sanitation infrastructure in South-East Asia and the Western Pacific: Highlighting regional inequalities and limited data.

Authors:  Nabeela Nasim; Abbas El-Zein; Jacqueline Thomas
Journal:  Int J Hyg Environ Health       Date:  2022-06-22       Impact factor: 7.401

6.  Impact of population density on Covid-19 infected and mortality rate in India.

Authors:  Arunava Bhadra; Arindam Mukherjee; Kabita Sarkar
Journal:  Model Earth Syst Environ       Date:  2020-10-14

7.  Rural-urban differences in the prevalence of chronic disease in northeast China.

Authors:  Shibin Wang; Changgui Kou; Yawen Liu; Bo Li; Yuchun Tao; Carl D'Arcy; Jieping Shi; Yanhua Wu; Jianwei Liu; Yingli Zhu; Yaqin Yu
Journal:  Asia Pac J Public Health       Date:  2014-09-22       Impact factor: 1.399

8.  The Moving to Health (M2H) approach to natural experiment research: A paradigm shift for studies on built environment and health.

Authors:  A Drewnowski; D Arterburn; J Zane; A Aggarwal; S Gupta; P M Hurvitz; A V Moudon; J Bobb; A Cook; P Lozano; D Rosenberg
Journal:  SSM Popul Health       Date:  2018-12-28

9.  Association of built environment attributes with the spread of COVID-19 at its initial stage in China.

Authors:  Shuangjin Li; Shuang Ma; Junyi Zhang
Journal:  Sustain Cities Soc       Date:  2021-02-03       Impact factor: 7.587

10.  The incidence of acute respiratory infection in Indonesian infants and association with vitamin D deficiency.

Authors:  Vicka Oktaria; Margaret Danchin; Rina Triasih; Yati Soenarto; Julie E Bines; Anne-Louise Ponsonby; Michael W Clarke; Stephen M Graham
Journal:  PLoS One       Date:  2021-03-23       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.