Ilke Onur, Malathi Velamuri.
Abstract
Researchers interested in the effect of health on various life outcomes (such as employment, earnings and life satisfaction) often use self-reported health and disease status as an indicator of true, underlying health status. Self-reports appear to be reasonable measures of overall health. For example, self-assessed overall health has been found to be a reliable predictor of mortality. However, the validity of self-reports is questionable when investigating specific diseases such as diabetes and hypertension. A small and nascent body of research comparing self-reported status on certain diseases with the true status based on clinical diagnoses has found significant gaps. These validation exercises predominantly use data from high-income countries. In this paper, we use survey data from India to compare self-reports of disease prevalence to diagnostic tests conducted on the same individuals. We focus on hypertension and lung disease, two of the primary causes of death in India. We find that self-reported measures substantially understate the true disease burden for both conditions. The attenuation bias from using self-reports is over 80 percent for both diseases, and bigger than estimates from high-income countries. We test and reject the hypothesis that self-reports of the disease status are identical to the true disease status in expectation. We identify characteristics associated with false negative reporting (reporting not having the disease but testing positive for it) for both diseases. The large awareness gap between self-reports and true disease burden indicates multiple deficiencies in India's public health policy. The survey data depicts limited access to medical facilities, high levels of health illiteracy, low rates of health insurance, and other barriers related to poverty and lack of equity in the delivery of health services. 
These factors prevent timely intervention for managing health and controlling disease, invariably leading to morbidity and often to premature death.
Year: 2018 PMID: 30148894 PMCID: PMC6110485 DOI: 10.1371/journal.pone.0202786
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the sample.
| Variable | Full sample | Hypertension (H): H = 1 | H = 0 | Difference | Lung disease (L): L = 1 | L = 0 | Difference |
|---|---|---|---|---|---|---|---|
| Age structure (%) | |||||||
| Age 45-54 | 47.3 | 39.2 | 53.1 | -0.14 | 53.8 | 53.2 | 0.01 [0.05] |
| Age 55-64 | 27.2 | 28.6 | 26.6 | 0.02 [0.03] | 24.1 | 28.3 | -0.04 [0.04] |
| Age 65-74 | 17.9 | 22.1 | 14.9 | 0.07 | 16.0 | 13.3 | 0.03 [0.03] |
| Age 75-85 | 7.5 | 10.1 | 5.4 | 0.05 | 6.0 | 5.2 | 0.01 [0.02] |
| Females (%) | 51.9 | 52.9 | 51.3 | 0.02 [0.04] | 35.8 | 53.6 | -0.18 |
| Urban residence (%) | 25.0 | 27.9 | 23.2 | 0.05 [0.04] | 28.6 | 28.3 | 0.0 [0.05] |
| Caste (%) | |||||||
| SC | 13.7 | 11.7 | 15.5 | -0.03 [0.03] | 11.8 | 13.6 | -0.02 [0.02] |
| ST | 14.8 | 16.3 | 13.1 | 0.03 [0.03] | 13.0 | 10.8 | 0.02 [0.05] |
| OBC | 38.6 | 38.5 | 38.7 | -0.0 [0.03] | 42.2 | 41.5 | 0.01 [0.05] |
| None/Other | 33.0 | 33.4 | 32.7 | 0.0 [0.03] | 33.0 | 34.1 | -0.01 [0.04] |
| Religion (%) | |||||||
| Hindu | 76.3 | 73.6 | 78.1 | -0.05 | 82 | 77 | 0.05 [0.04] |
| Muslim | 7.3 | 9.1 | 5.9 | 0.03 | 6.1 | 8.0 | -0.02 [0.02] |
| Christian | 6.8 | 5.7 | 7.7 | -0.02 [0.02] | 7.2 | 8.5 | -0.01 [0.02] |
| Sikh | 8.2 | 9.9 | 7.1 | 0.03 | 4.2 | 5.8 | -0.02 [0.02] |
| Other | 1.4 | 1.7 | 1.2 | 0.01 [0.01] | 1.0 | 1.0 | -0.0 [0.01] |
| Currently married (%) | 79.3 | 74.4 | 82.3 | -0.08 | 83.3 | 83.3 | 0.0 [0.04] |
| # children | 3.14 | 3.19 | 3.08 | 0.11 [0.17] | 3.07 | 3.05 | 0.02 [0.15] |
| Any children dead | 14.2 | 17.0 | 12.4 | 0.05 | 14.0 | 13.5 | 0.01 [0.03] |
| Fully literate (%) | 49.8 | 50.6 | 49.6 | 0.01 [0.04] | 58.7 | 61.6 | -0.03 [0.05] |
| Schooling (Years) | 4.33 | 4.21 | 4.47 | -0.27 [0.43] | 5.61 | 5.31 | 0.3 [0.51] |
| Education (%) | |||||||
| No education | 47.3 | 45.5 | 48.2 | -0.03 [0.04] | 36.9 | 35.9 | 0.01 [0.04] |
| < 6 years | 14.9 | 18.1 | 12.4 | 0.06 | 15.5 | 15.8 | -0.0 [0.03] |
| 6–12 years | 33.2 | 33.5 | 33.4 | 0.0 [0.03] | 39.0 | 42.8 | -0.04 [0.05] |
| > 12 years | 4.6 | 2.9 | 6.0 | -0.03 [0.03] | 8.6 | 5.5 | 0.03 [0.03] |
| Ever worked (%) | 32.4 | 31.9 | 33.7 | -0.02 [0.03] | 41.5 | 36.5 | 0.05 [0.04] |
| Log expenditure pc | 10.36 | 10.3 | 10.41 | -0.11 [0.21] | 10.48 | 10.43 | 0.05 [0.2] |
| No health insurance (HI) | 92.2 | 92.1 | 92.0 | 0.0 [0.02] | 86.4 | 95.6 | -0.09 |
| Don’t know what HI is | 48.7 | 49.5 | 48.3 | 0.01 [0.04] | 45.9 | 51.2 | -0.05 [0.05] |
| Good cooking fuel | 41.8 | 45.0 | 40.6 | 0.04 [0.04] | 39.5 | 45.0 | -0.06 [0.04] |
| Good cooking fuel-1 | 38.4 | 42.8 | 35.9 | 0.07 | 36.2 | 40.5 | -0.04 [0.04] |
| Indoor plumbing | 37.5 | 41.7 | 35.4 | 0.06 [0.05] | 36.0 | 38.4 | -0.02 [0.05] |
| Electricity | 84.4 | 85.1 | 85.1 | -0.0 [0.04] | 86.1 | 86.5 | -0.0 [0.05] |
| Toilet inside house | 65.8 | 67.1 | 65.6 | 0.02 [0.04] | 69.6 | 73.7 | -0.04 [0.05] |
| Ever smoked | 21.4 | 20.4 | 22.1 | -0.02 [0.03] | 29.8 | 19.9 | 0.1 |
| Current smoker | 17.2 | 17.1 | 17.1 | -0.0 [0.03] | 23.7 | 16.3 | 0.07 |
| Former smoker | 4.4 | 3.2 | 5.0 | -0.02 [0.1] | 6.1 | 3.6 | 0.03 [0.02] |
| Ever drank | 13.8 | 13.0 | 14.4 | -0.01 [0.02] | 21.1 | 11.8 | 0.09 |
| Current drinker | 9.1 | 8.6 | 9.6 | -0.01 [0.02] | 12.4 | 8.3 | 0.04 [0.03] |
| Former drinker | 4.5 | 4.4 | 4.7 | -0.0 [0.01] | 8.7 | 3.4 | 0.05 |
| Any exercise | 34.5 | 33.3 | 36 | -0.03 [0.02] | 38.6 | 39.5 | -0.01 [0.05] |
| Heavy exercise | 22.7 | 21.9 | 23.9 | -0.02 [0.02] | 24.0 | 30.4 | -0.06 |
| Moderate exercise | 11.6 | 11.4 | 12.0 | -0.01 [0.02] | 14.6 | 9.0 | 0.06 |
| Passive smoking | 27.2 | 26.9 | 27.2 | -0.0 [0.03] | 30.2 | 24.9 | 0.05 [0.04] |
| Overweight | 16.6 | 18 | 15.7 | 0.02 [0.03] | 11.5 | 19.9 | -0.08 |
| Obese | 6.4 | 7.9 | 5.3 | 0.03 | 4.9 | 5.7 | -0.01 [0.02] |
| CVD risk | 31.5 | 35.3 | 28.5 | 0.07 | 28.1 | 29.3 | -0.01 [0.04] |
| Pulse/heart rate | 80.3 | 83.4 | 78.0 | 5.37 | 78.9 | 78.9 | -0.01 [1.5] |
| EBV levels | 113.0 | 113.1 | 113.9 | -0.8 [4.27] | 106.1 | 118.1 | -12.0 |
| Episodic Memory | 8.69 | 8.72 | 8.65 | 0.07 [0.26] | 9.08 | 9.02 | 0.06 [0.36] |
| Observations (N) | 1,149 | 483 | 638 | | 242 | 335 | |
Note: Standard deviations in ( ) parentheses, standard errors in [ ] brackets;
@ Good cooking fuel: household uses either coal, charcoal, natural gas, LPG, kerosene or electricity for cooking;
@@ Good cooking fuel-1: household uses either natural gas, LPG, kerosene or electricity for cooking;
# Overweight: BMI in [25–30] range;
## Obese: BMI > 30;
+ CVD risk: C-reactive protein concentration in blood <3 mg/L;
++ EBV levels: Epstein-Barr virus antibody levels.
* significant at the 90% level;
** significant at the 95% level;
*** significant at the 99% level.
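The Difference columns in the table above appear to be the gap between the two group shares, expressed as a proportion (for example, 39.2% versus 53.1% gives -0.14). The Python sketch below reproduces that arithmetic. The function names are hypothetical, and the standard error uses the textbook formula for two independent proportions, which is an assumption: the published standard errors may reflect survey weights or clustering and need not match it.

```python
from math import sqrt


def diff_in_shares(pct_1, pct_0):
    """Difference between two group shares given in percent, as a proportion."""
    return (pct_1 - pct_0) / 100.0


def naive_se(pct_1, n_1, pct_0, n_0):
    """Textbook standard error for a difference of two independent proportions.

    Assumption: simple random sampling, no weights or clustering.
    """
    p1, p0 = pct_1 / 100.0, pct_0 / 100.0
    return sqrt(p1 * (1 - p1) / n_1 + p0 * (1 - p0) / n_0)


# Age 45-54 share among test-positive (H = 1) and test-negative (H = 0) respondents.
age_gap = diff_in_shares(39.2, 53.1)

# Age 55-64 shares, with group sizes taken from the Observations row (483 and 638).
age_55_64_se = naive_se(28.6, 483, 26.6, 638)

print(round(age_gap, 2), round(age_55_64_se, 3))
```

Where the naive standard error differs from the bracketed value in the table, the likeliest explanation is a survey-design adjustment that this sketch does not attempt.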
Distribution of test-diagnosed and self-reported disease, by State.
| | All States | Punjab | Rajasthan | Kerala | Karnataka |
|---|---|---|---|---|---|
| #Households | 807 | 199 | 191 | 234 | 183 |
| 1. Hypertension* | |||||
| Self-reported (S) | 0.17 [0.02] | 0.20 [0.05] | 0.03 [0.01] | 0.33 [0.03] | 0.16 [0.04] |
| Test | 0.43 [0.02] | 0.54 [0.02] | 0.47 [0.05] | 0.33 [0.03] | 0.41 [0.03] |
| False negative (S = 0/T = 1) | 0.77 [0.03] | 0.75 [0.05] | 0.94 [0.02] | 0.57 [0.06] | 0.74 [0.06] |
| False positive (S = 1/T = 0) | 0.13 [0.01] | 0.15 [0.05] | 0.01 [0.01] | 0.28 [0.03] | 0.09 [0.03] |
| Observations (S) | 1,144 | 285 | 279 | 325 | 255 |
| Observations (T) | 1,121 | 279 | 266 | 326 | 250 |
| 2. Lung disease@ | |||||
| Self-reported (S) | 0.04 [0.01] | 0.01 [0.01] | 0.05 [0.01] | 0.09 [0.02] | 0.02 [0.01] |
| Test | 0.43 [0.03] | 0.31 [0.05] | 0.46 [0.06] | 0.43 [0.03] | 0.44 [0.05] |
| False negative (S = 0/T = 1) | 0.96 [0.01] | 0.99 [0.01] | 0.94 [0.03] | 0.89 [0.04] | 0.95 [0.02] |
| False positive (S = 1/T = 0) | 0.03 [0.01] | 0.02 [0.01] | 0.04 [0.01] | 0.07 [0.03] | 0.01 [0.01] |
| Observations (S) | 1,136 | 284 | 280 | 322 | 250 |
| Observations (T) | 577 | 88 | 127 | 206 | 156 |
Note: Standard errors in brackets.
* Hypertension: systolic pressure > 140 mm Hg or diastolic pressure > 90 mm Hg, based on the average of 3 readings;
@ Lung disease: ratio of forced expiratory volume (FEV1) to forced vital capacity (FVC) below 70 percent, based on the average of 3 readings.
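The diagnostic rules in the notes above, and the conditional error rates reported in the table, can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' code: the function names are hypothetical, the readings and the self-report and test vectors are made-up toy values, and "S = 0/T = 1" is read as the share of test-positive respondents who report not having the disease.

```python
from statistics import mean


def is_hypertensive(systolic, diastolic):
    """Flag hypertension from repeated readings (mm Hg), using their averages."""
    return mean(systolic) > 140 or mean(diastolic) > 90


def has_lung_disease(fev1_fvc_pct):
    """Flag lung disease when the average FEV1/FVC percentage is below 70."""
    return mean(fev1_fvc_pct) < 70


def false_negative_rate(self_report, test):
    """Share of test-positive respondents (T = 1) who self-report S = 0."""
    among_positive = [s for s, t in zip(self_report, test) if t == 1]
    return sum(1 for s in among_positive if s == 0) / len(among_positive)


def false_positive_rate(self_report, test):
    """Share of test-negative respondents (T = 0) who self-report S = 1."""
    among_negative = [s for s, t in zip(self_report, test) if t == 0]
    return sum(1 for s in among_negative if s == 1) / len(among_negative)


# Toy respondent: three blood pressure readings and three spirometry readings.
print(is_hypertensive([150, 138, 145], [85, 88, 80]))
print(has_lung_disease([65.0, 68.0, 72.0]))

# Toy sample of six respondents (hypothetical values).
S = [0, 0, 1, 0, 1, 0]
T = [1, 1, 1, 0, 0, 0]
print(false_negative_rate(S, T), false_positive_rate(S, T))
```

Averaging the readings before applying the threshold follows the wording of the notes; applying the threshold to each reading separately would be a different rule.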
Error decomposition.
| Condition | Self-report (S) (1) | Test (T) (2) | Mean error (S-T) (3) | Attenuation bias (4) |
|---|---|---|---|---|
| 1. BP/Hypertension | 0.17 [0.016] | 0.43 [0.02] | -0.26 [0.024] | 0.83 [0.04] |
| 2. Lung disease | 0.04 [0.007] | 0.43 [0.026] | -0.4 [0.028] | 0.87 [0.109] |
Note: Standard errors in brackets.
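The mean error is the self-reported prevalence minus the test-based prevalence. For a binary indicator, the same gap equals the false positive rate times the share testing negative, minus the false negative rate times the share testing positive. The sketch below checks that identity against the rounded All States figures reported in the tables above. The function names are hypothetical, and agreement is only approximate because the inputs are rounded and the self-report and test samples differ in size.

```python
def mean_error(self_report_prev, test_prev):
    """Mean reporting error: E[S] - E[T]."""
    return self_report_prev - test_prev


def mean_error_from_rates(test_prev, fn_rate, fp_rate):
    """Same quantity, rebuilt from the conditional error rates.

    E[S] - E[T] = P(S=1 | T=0) * P(T=0) - P(S=0 | T=1) * P(T=1)
    """
    return fp_rate * (1 - test_prev) - fn_rate * test_prev


# Rounded All States values: S and T prevalence, false negative and false positive rates.
conditions = {
    "hypertension": {"S": 0.17, "T": 0.43, "FN": 0.77, "FP": 0.13},
    "lung disease": {"S": 0.04, "T": 0.43, "FN": 0.96, "FP": 0.03},
}

for name, c in conditions.items():
    direct = mean_error(c["S"], c["T"])
    implied = mean_error_from_rates(c["T"], c["FN"], c["FP"])
    print(f"{name}: direct {direct:.2f}, implied {implied:.2f}")
```

Because false negatives dominate for both conditions, the implied error is strongly negative in each case, which is the understatement of disease burden described in the abstract.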
Tests of hypothesis that error in reporting = 0.
| Condition | Test based on OLS (+) | Test based on bivariate probit (#) |
|---|---|---|
| 1. BP/Hypertension | 45.11 | 51.75 |
| 2. Lung disease | 10.54 | 1,594.45 |
+ Test statistics are F-tests for the joint significance of the independent variables (and a constant) in a linear regression with dependent variable equal to (S-T), the difference between self-reports (S) and test diagnosis (T) for each condition;
# Test statistics are F-tests for the equality of independent variables in the two probit equations, one using the self-reports (S) and the other using the test diagnosis (T) indicators as dependent variables. The vector of control variables includes age, age-squared, female, indicators for state of residence, caste, religion, urban status, marital status, number of children, an indicator for whether any child of the respondent died, the logarithm of household expenditure per capita, literacy status, years of schooling, whether the respondent ever worked for pay, indicators for smoking, drinking alcohol and exercise activity, whether the respondent is overweight or obese, whether the household has electricity, indoor plumbing, an indoor toilet and uses good cooking fuel, and the test score for memory function.
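The OLS-based test described in the first note regresses the reporting error (S-T) on the controls plus a constant and asks whether all coefficients are jointly zero. A minimal pure-Python sketch of that kind of F-test is given below. It is not the authors' estimation code: the function names are hypothetical, the data are toy values with a single made-up regressor, and the statistic is the classical homoskedastic F, whereas the published statistics may rely on robust or survey-adjusted variances.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def f_test_zero_error(d, X):
    """F-statistic for H0: every coefficient, constant included, is zero in d = X b + e."""
    n, k = len(X), len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xtd = [sum(row[i] * di for row, di in zip(X, d)) for i in range(k)]
    beta = solve(xtx, xtd)
    rss = sum((di - sum(b * x for b, x in zip(beta, row))) ** 2 for row, di in zip(X, d))
    # Under H0 the model predicts zero error for everyone, so the restricted RSS is sum(d^2).
    rss_restricted = sum(di ** 2 for di in d)
    return ((rss_restricted - rss) / k) / (rss / (n - k))


# Toy data: d = S - T for eight respondents, regressed on a constant and a female dummy.
toy_d = [-1, -1, 0, 0, -1, 0, 1, -1]
toy_X = [[1.0, float(i % 2)] for i in range(8)]
f_stat = f_test_zero_error(toy_d, toy_X)
print(f_stat)
```

Including the constant in the null hypothesis matters: it makes the test reject when self-reports are wrong on average, not only when the error varies with respondent characteristics.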
Estimates of false negative reporting: Hypertension.
| Variable | Marginal Effect | S.E. | Marginal Effect | S.E. | Marginal Effect | S.E. | Marginal Effect | S.E. |
|---|---|---|---|---|---|---|---|---|
| Age | -0.042 | 0.03 | -0.044 | 0.012 | -0.035 | 0.022 | -0.038 | 0.011 |
| Female | -0.017 | 0.054 | -0.027 | 0.023 | -0.03 | 0.054 | -0.057 | 0.03 |
| Married | 0.011 | 0.061 | 0.03 | 0.029 | 0.033 | 0.035 | 0.035 | 0.025 |
| Log(HH expenditure pc) | -0.087 | 0.03 | 0.002 | 0.006 | -0.045 | 0.034 | 0.005 | 0.005 |
| Fully literate | -0.097 | 0.056 | -0.066 | 0.025 | ||||
| Schooling (Yrs) | -0.004 | 0.007 | -0.001 | 0.003 | ||||
| Ever worked | 0.012 | 0.043 | -0.014 | 0.023 | ||||
| Has health insurance | -0.09 | 0.064 | -0.036 | 0.032 | ||||
| Overweight | -0.057 | 0.042 | -0.072 | 0.022 | ||||
| Obese | -0.096 | 0.073 | -0.03 | 0.039 | ||||
| Waist-to-hip ratio | -0.254 | 0.159 | -0.19 | 0.083 | ||||
| Smoking: | ||||||||
| Current smoker | -0.033 | 0.06 | -0.027 | 0.033 | ||||
| Former smoker | -0.167 | 0.113 | -0.092 | 0.053 | ||||
| Drinking: | ||||||||
| Drinks alcohol | 0.019 | 0.056 | -0.01 | 0.036 | ||||
| Formerly drank | 0.036 | 0.078 | 0.004 | 0.047 | ||||
| Exercise: | ||||||||
| Exercise often | 0.074 | 0.061 | 0.027 | 0.028 | ||||
| Exercise moderately | 0.024 | 0.034 | 0.026 | 0.029 | ||||
| HH has electricity | -0.097 | 0.09 | -0.015 | 0.054 | ||||
| Uses good fuel | -0.107 | 0.067 | -0.083 | 0.031 | ||||
| Indoor plumbing | 0.137 | 0.056 | -0.053 | 0.021 | ||||
| Toilet inside | 0.017 | 0.068 | -0.044 | 0.035 | ||||
| 0.428 | - | 0.01 | - | |||||
| 0.391 | - | 0.649 | - | |||||
| Wald test for joint significance of instrumental variables (IVs) | ||||||||
| 12.52 | - | 12.99 | - | |||||
| 0.0 | - | 0.0 | - | |||||
| # Observations | 1,233 | 1,265 | 1,176 | 1,207 | ||||
Note: All regressions also control for state of residence, urban status, caste, religion, # children, whether any of respondent’s children died, and memory function score.
*** significant at the 99% level;
** significant at the 95% level;
* significant at the 90% level.
Estimates of false negative reporting: Lung disease.
| Variable | Marginal Effect | S.E. | Marginal Effect | S.E. | Marginal Effect | S.E. | Marginal Effect | S.E. |
|---|---|---|---|---|---|---|---|---|
| Age | -0.001 | 0.001 | -0.002 | 0.001 | -0.002 | 0.001 | -0.002 | 0.001 |
| Female | 0.043 | 0.022 | 0.018 | 0.013 | 0.052 | 0.018 | 0.008 | 0.017 |
| Log(HH expenditure pc) | 0.002 | 0.003 | 0.003 | 0.002 | 0.007 | 0.003 | 0.002 | 0.002 |
| Fully literate | 0.001 | 0.015 | -0.023 | 0.023 | ||||
| Schooling (Yrs) | -0.001 | 0.001 | 0.0 | 0.002 | ||||
| Overweight | -0.044 | 0.017 | 0.007 | 0.014 | ||||
| Obese | 0.132 | 0.039 | -0.022 | 0.023 | ||||
| Ever smoked | 0.005 | 0.01 | -0.026 | 0.013 | ||||
| Any exercise | 0.033 | 0.015 | 0.016 | 0.013 | ||||
| Uses good cooking fuel | -0.005 | 0.011 | -0.0 | 0.01 | ||||
| -0.482 | - | -0.525 | - | |||||
| 0.736 | - | 0.511 | - | |||||
| Wald test for joint significance of instrumental variables (IVs) | ||||||||
| 4.28 | - | 3.55 | - | |||||
| 0.02 | - | 0.037 | - | |||||
| # Observations | 711 | 1,257 | 712 | 1,255 | ||||
Note: All regressions also control for state of residence, urban status, caste, religion and memory function score. The expanded specification (Columns 5 and 7), in addition to these variables and those listed, also controls for whether respondent drinks or ever drank alcohol.
***—significant at the 99% level;
**—significant at the 95% level;
*—significant at the 90% level.