Literature DB >> 27579289

The examination of relationship between socioeconomic factors and number of tuberculosis using quantile regression model for count data in Iran 2010-2011.

Fatemeh Sarvi1, Somayeh Momenian2, Mahmoud Khodadost3, Bagher Pahlavanzadeh4, Mahshid Nasehi5, Eghbal Sekhavati6.   

Abstract

BACKGROUND: Poverty and low socioeconomic status are the most important reasons of increasing the global burden of tuberculosis, not only in developing countries but also in developed countries for particular groups. The purpose of this study was to assess the association between socioeconomic factors and the number of tuberculosis patients using quantile regression for count data.
METHODS: This cross-sectional study was conducted on 11,320 tuberculosis patients from March 2010 to March 201 in Iran. Data was gathered from the 345 sections of Iran by Ministry of Health and Medical Education and Statistical Center of Iran. The jittering method was applied for smoothing, and then, the quantile regression for count data was fitted. The AIC was used to compare the fitness of quantile regression for count data model and Poisson log-linear model. The R (3.0.1) software and Quantreg and AER packages were used for all analysis and modeling of the data.
RESULTS: The results of fitting the quantile regression for count data showed that in all percentiles, the more increase in immigration rate, illiteracy rate, unemployment and urbanization rates, the more tuberculosis morbidity rate was increased. The maximum increase of tuberculosis due to immigration rate, urbanization rate, unemployment rate, and illiteracy rate was in 95th percentile (β^=0.315), 85'Th percentile (β^=0.162), 75'Th percentile (β^=0.114 ), and 95'Th percentile (β^=0.304), respectively. For 50th percentiles and higher percentiles, with increasing the sum of physicians to the number of population, the tuberculosis morbidity rate was decreased, and the maximum decrease was in 95'Th percentile ( β^=-0.1). For all percentiles, the AIC showed that quantile regression for count data had been a better fit to data.
CONCLUSION: With respect to the relationship between socioeconomic factors and TB rate, health care observers should pay close attention to improving these factors in Iran to reduce the TB mortality and morbidity.

Entities:  

Keywords:  Iran; Poisson regression; Quantile regression; Tuberculosis

Year:  2016        PMID: 27579289      PMCID: PMC5004564     

Source DB:  PubMed          Journal:  Med J Islam Repub Iran        ISSN: 1016-1430


Introduction

TB that is caused by the bacillus Mycobacterium tuberculosis is an infectious disease. It can affect the lungs (pulmonary TB) and also other sites of the body (extrapulmonary TB). Tuberculosis is a most important global health problem, According to the report of WHO, “In 2012, an estimated 8.6 million people developed TB and 1.3 million died from the disease” (1). In developing countries that have lower socioeconomic status, TB is one of the most killer diseases and also in developed countries TB has historically been correlated with poverty and low socioeconomic status (2,3). Families with lower socioeconomic status have more children and frequently reside in more crowed quarters and usually live in undesirable environmental conditions (poor environmental sanitation and poor hygienic training). These factors may increase the probability of exposure to infection among poorer and less educated groups too because they have lack information about vaccination and lack access to the medical care (4,5). With increased urbanization, the rate of TB is increased too, because there are more risk factors for TB in cities like overcrowded housing, high contacts (6). Bhunu et al. studied the effects of poverty on TB transmission. The results showed that transmission rate is higher in poor communities than in the rich ones due to overcrowding, poor nutrition, reduced treatment uptake and lower socioeconomic status (7). Santos M, et al. evaluated the epidemiological status of Tuberculosis concerning to the socioeconomic characteristics of Sao Jose between 1998 and 2004. The socioeconomic factors that were studied were Schooling, Income, and Number of Residents. The results showed that in poorest areas the disease prevalence was higher (8). The higher rate of this disease is among migrant groups. If migrant groups move from areas with low TB rates to one of the high rates, the risk of the face of primary infections is increased. If migration was occurred from areas with high rate of TB to areas with a low rate of TB, the migrant group transmissions this high rate of TB to those areas. Even if the migration occurs between areas with similar incidence, due to the change in the living environment, and exposure to undesirable situations, the probability of contact with TB for the first time increases, so it is likely that TB is prevalent among migrant groups (9). Jose Leopold et al. investigated the relationship between TB with several population characteristics in Sao Paulo, Brazil, from 1994 to 1998. In their study, the relationship between immigration rate for foreigners and mortality rate of TB and also the relationship between immigration rates for the other Brazilian States with a mortality rate of TB was significant (10). The counts data are typically modeling with log-linear Poisson model. With this model, the relationship between covariates and the mean of the response can be estimated (11). However, this model cannot be applied to evaluate the effects of covariates on the other aspects of the response distribution, such as the quartiles. Therefore based on the log-linear model the picture of the relationship is incomplete, on the other hand, the assumption of the log-linear model is the equality of mean and variance of the response distribution, whereas in real, the variance of the data may be bigger than the mean. This issue is named overdispersion. The overdispersion may lead to the significant relationship between a response variable and covariates that is not significant in fact and also with overdispersion the variance of the data may be an underestimate (12). To solve this problem of the log-linear model, we can use the quantile regression for count data that investigate how covariates affect the entire distribution of the count responses and also unlike the log-linear model, quantile regression model for count data, has a separate scale parameter, and overdispersion does not occur (13). The aim of this study was to investigate the relationship between the socioeconomic factors such as immigration rate, unemployment rate, urbanization rate, the sum of physicians to the number of population points and illiteracy rate with the number of tuberculosis patients.

Methods

This cross-sectional study was conducted on 11,320 TB Patients from March 2010 to March 2011 in 345 cities of Iran. Data was gathered from the Ministry of Health and Medical Education and Statistical Center of Iran for every city. We used the Quantile regression for count data for the skewness nature of response variable and to study the effect of socioeconomic factors on entire distribution, The estimation of quantile regression that studied by Koenker and Bassett (1978) was offered to continuous data (14). Since the count data have not continuous Quantiles, to fit the model, Machado and Santos Silva (2005) offered the jittering approach for smoothing (15). Using this approach, the response variable becomes continuous, and we can fit the quantile regression for data, so we produced data from uniform distribution formand replace the response variable YK with “jittered” responses K+UK where UK~uniform (0,1). These jittered responses have continuous real quantiles. The form of model for these data is, where x is explanatory variable and k is the number of city, is the quantile that we fitted the model for them, is the estimated coefficient for urbanization and T is the transpose of the explanatory vector. We can implement this model as the equivalent model with this form where, these two models are equivalent because we applied a monotonic transformation (15). So to apply the quantile regression for count data, we first jittered the response data that was the number of TB patients in 345 cities. The quantile regression model for count data was fitted after jittered the data and applied the transformation. The quantiles that fitted the model was 0.05, 0.1, 0.15, 0.25, 0.5, 0.75, 0.85, 0.9 and 0.95. Since the number of TB patients in every city is dependent on population of the city, we used the population of the city as an offset term. The AIC (Akaike Information Criterion) was used to compare the fitness of quantile regression for count data model and Poisson log-linear model. As the part of the information may be loosed due to fitting the model, we must apply the model that have a lower loss of information. One of the criterions for considering this issue is AIC. The lower AIC shows, the better model. We used software for all computations. We applied the Quantreg and AER packages to fitting the quantile regression for count data and testing the existence of overdispersion, respectively. For all computations, the P value less than 0.05 were considered significant.

Results

The total number of TB patients from 345 cities of Iran during March to March 2011 were enrolled in this study. The mean±SD age of patients was Years), and the mean±SD for the number of TB patients were. Figure 1 shows the histogram graph of TB patients in every city. This figure displayed the right skewed of the data, and the value of skewness was yield 7.28 with a standard error of 0.13.
Fig. 1
Histogram graph for number of TB patients in every city Descriptive statistics of variables are shown in Table 1.
Table 1

Descriptive statistics of study subjects

VariableMean (Std. Deviation)MaximumMinimum
Immigration rate-0.03 (6.68)-17.833.70
Urbanization rate0.52 (.20)0.990.08
Unemployment rate14.72 (9.84)59.501.05
Illiteracy rate19.41 (6.17)47.463.96
Sum of physicians to number of population points4.25(58.5)1081.2000.005
For every quantile, the quantile regression model for count data was fitted, and the parameters were estimated. The poisson regression model was fitted to data. The results of estimation using fitting the poisson regression model and quantile regression model for count data was shown in Tables 2, 3.
Table 2

The results of Poisson regression model for assessing the associations between socio-economic factors and tuberculosis morbidity

Variable β^(τ) exp (β^(τ)) Std. Errorp
intercept-0.9140.4011.5580.713
Immigration rate0.1531.1650.0096<0.001
Urbanization rate0.1691.1840.013<0.001
Unemployment rate0.1271.1350.012<0.001
Illiteracy rate0.2931.3410.0134<0.001
Sum of physicians to number of population points0.0270.9730.003<0.001
Table 3

The results of quantile regression model for count data for assessing the associations between socio-economic factors and tuberculosis morbidity

Variabl Results for the following percentiles
5%10%15%25%50%75%85%90%95%
Immigration rate β^(τ) 0.0770.1960.170.1410.1650.1530.0980.1730.315
exp (β^(τ)) 1.081.221.191.151.181.171.1031.191.37
Urbanization rate β^(τ) 0.0580.1090.0520.040.080.1130.1620.1420.126
exp (β^(τ)) 1.061.1151.051.041.081.121.181.1651.134
Unemployment rate β^(τ) 0.0890.0720.0830.0890.0790.1140.0560.1080.05
exp (β^(τ)) 1.0931.081.091.091.081.131.0571.1141.05
Illiteracy rate β^(τ) 0.2610.300.120.120.1760.2350.2710.2750.304
exp (β^(τ)) 1.31.381.131.131.191.261.311.321.26
Sum of physicians to number of population points β^(τ) 0.0490.0440.0390.024-0.005-0.027-0.05-0.076-0.1
exp (β^(τ)) 1.051.0451.0391.0240.9950.9730.9510.9270.9
In Poisson model, there was a significant association between socioeconomic factors and TB morbidity rate. With increasing one unit of these factors, the morbidity rate of TB was increased. For example with increasing one unit in the unemployment rate, the TB morbidity rate was increased by 0.135. The AER package in software was used for testing the dispersion and the existence of overdispersion with fitting the Poisson model was confirmed (p<0.001). The interpret coefficients of the quantile regression for count data model was similar to the Poisson regression model. The results of the quantile regression model for count data showed that in all percentiles, with increasing the immigration rate, the TB morbidity rate was increased. The relationship in higher percentiles was significant. It means that in counties with high TB morbidity rate, immigration and TB have the strongest relationship. For all percentiles of TB morbidity rate, with increasing the urbanization rate, unemployment rate, and illiteracy rate, the TB morbidity rate was increased. The maximum increase for immigration rate was in 95th percentile (=0.315), for urbanization rate was in 85th percentile (=0.162), for unemployment rate was in 75th percentile (=0.114) and for illiteracy rate was in 95th percentile(=0.304). In 50th percentiles and higher percentiles, with increasing the sum of physicians to the number of population, the tuberculosis morbidity rate was decreased, and the maximum decrease was in 95th percentile (=-0.1) (table 3). In higher percentiles of TB morbidity rate, with increasing the number of physicians to population points, TB morbidity rate was decreased and for lower percentiles with increasing average annual income, the TB morbidity rate was decreased, although this relationship was not significant (Fig. 2).
Fig. 2
Coefficients of quantile regression for count data for every quantiles (the red line show the Poisson’s estimate) Also, the AIC was used to compare the fitness of quantile regression for count data model and Poisson log-linear model. The smaller AIC showed the better model. The related AIC was computed for every percentile, and the quantile regression was the best-fitted model (Table 4).
Table 4

Results of AIC for assessing the fitness of the models

Percentiles5%10%15%25%50%75%85%90%95%Poisson model
AIC964.9924.6901.6860.7850.7937.61003.81070.21130.66722.9
For all percentiles, the AIC was smaller than one for Poisson regression, and it showed that in all percentiles, the quantile regression for count data was better fitted to data in comparison of Poisson regression model.

Discussion

As the aim of the study is to consider the effect of covariates on entire distribution of the data and because of skewness of the data, we used the quantile regression for count data and also fitted the Poisson regression on data to compared two models. In Poisson regression model, the relationship between socioeconomic factors and TB was significant. This can be due to the existence of overdispersion, but quantile regression has a separate scale parameter and prevent of occurring the overdispersion. So, we used this model for more accurate estimation of parameters in this study. Also, the results of comparing the fitness of two models showed that the AIC was smaller for the quantile regression for count data in all percentiles compared to the Poisson regression model. The results of the quantile regression showed that with increasing the immigration rate, the TB morbidity rate was increased. This result is consistent with the studies of Jose Leopold et al. Talbot et al. and Gilbert et al. (10,16,17). In their studies, the TB morbidity and mortality rates had an increasing trend with increase the immigration. The foreign immigrants in Iran are almost from Pakistan and Afghanistan countries that are the countries with a high incidence rate of TB and then, immigration from these countries to Iran may increase the morbidity rate of TB. Overcrowded housing and high probability of contacts in cities lead to increasing the TB morbidity. Also in this study, with increasing the urbanization rate, the TB morbidity rate was increased and is consistent with the study of Kwan CK, et al. (6) that reported the urbanization as one of the reasons of increasing the mortality and morbidity of TB. Other socioeconomic factors in this study that affected the TB morbidity rate were unemployment and illiteracy rate. In poorer societies with worse conditions like higher unemployment rate or lower schooling or high illiteracy rate, the rate of TB morbidity and mortality is high. The results of this study showed that with increasing the unemployment rate, and illiteracy rate, the rate of TB morbidity was increased. Santos and Bhunu et al. reported similar results (7,8). One of the reasons for this relationships is that the illiterate peoples have low information about the vaccination, hygiene, disease prevention and the risk factors. Also, the results of Richard Coke et al. is consistent with results of our study (18). In this study, with increasing the access to medical care, the TB morbidity was decreased. For higher percentiles, with increasing the sum of physicians to the number of population point’s factor, the TB morbidity rate was decreased that is consistent with the study of Santos et al. (8). Thus, the results of our study confirmed that the factors related to the low socioeconomic status lead to high morbidity of tuberculosis and with improving these factors we can prevent the TB morbidity and mortality.

Conclusion

As a result, the quantile regression for count data provided a suitable measurement to examine the relationship between socioeconomic factors and TB morbidity. These findings demonstrated that with increasing the immigration rate, illiteracy rate, unemployment and urbanization rates, the TB morbidity rate was increased. Also, with increasing the sum of physicians to the number of population, the tuberculosis morbidity rate was decreased. So it seems necessary to improve these factors such as increase the proportion of physicians to the number of population to reduce the TB morbidity and mortality and to reach the TB control goals.
  9 in total

1.  The impact of AIDS, immigration and housing overcrowding on tuberculosis deaths in São Paulo, Brazil, 1994-1998.

Authors:  J L Antunes; E A Waldman
Journal:  Soc Sci Med       Date:  2001-04       Impact factor: 4.634

2.  The impact of immigration on tuberculosis rates in the United Kingdom compared with other European countries.

Authors:  R L Gilbert; D Antoine; C E French; I Abubakar; J M Watson; J A Jones
Journal:  Int J Tuberc Lung Dis       Date:  2009-05       Impact factor: 2.373

3.  Tuberculosis control in vulnerable groups.

Authors:  Jose I Figueroa-Munoz; Pilar Ramon-Pardo
Journal:  Bull World Health Organ       Date:  2008-09       Impact factor: 9.408

Review 4.  HIV and tuberculosis: a deadly human syndemic.

Authors:  Candice K Kwan; Joel D Ernst
Journal:  Clin Microbiol Rev       Date:  2011-04       Impact factor: 26.132

5.  Risk factors for pulmonary tuberculosis in Russia: case-control study.

Authors:  Richard Coker; Martin McKee; Rifat Atun; Boika Dimitrova; Ekaterina Dodonova; Sergei Kuznetsov; Francis Drobniewski
Journal:  BMJ       Date:  2005-12-08

6.  Tuberculosis among foreign-born persons in the United States.

Authors:  Kevin P Cain; Stephen R Benoit; Carla A Winston; William R Mac Kenzie
Journal:  JAMA       Date:  2008-07-23       Impact factor: 56.272

7.  Poverty: socioeconomic characterization at tuberculosis.

Authors:  Maria de Lourdes Sperli Geraldes Santos; Silvia Helena Figueiredo Vendramini; Claudia Eli Gazetta; Sonia Aparecida Cruz Oliveira; Tereza Cristina Scatena Villa
Journal:  Rev Lat Am Enfermagem       Date:  2007 Sep-Oct

8.  A national study of socioeconomic status and tuberculosis rates by country of birth, United States, 1996-2005.

Authors:  Nicole A Olson; Amy L Davidow; Carla A Winston; Michael P Chen; Julie A Gazmararian; Dolores J Katz
Journal:  BMC Public Health       Date:  2012-05-18       Impact factor: 3.295

9.  Persistent high incidence of tuberculosis in immigrants in a low-incidence country.

Authors:  Troels Lillebaek; Ase B Andersen; Asger Dirksen; Else Smith; Lene T Skovgaard; Axel Kok-Jensen
Journal:  Emerg Infect Dis       Date:  2002-07       Impact factor: 6.883

  9 in total
  1 in total

Review 1.  Biosynthetic and Synthetic Strategies for Assembling Capuramycin-Type Antituberculosis Antibiotics.

Authors:  Ashley L Biecker; Xiaodong Liu; Jon S Thorson; Zhaoyong Yang; Steven G Van Lanen
Journal:  Molecules       Date:  2019-01-25       Impact factor: 4.411

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.