Paul Novosad1, Radhika Jain2, Alison Campion3, Sam Asher4. 1. Economics, Dartmouth College, Hanover, New Hampshire, USA paul.novosad@dartmouth.edu. 2. Stanford University, Stanford, California, USA. 3. Development Data Lab, Washington, DC, USA. 4. International Economics, Johns Hopkins University Paul H Nitze School of Advanced International Studies, Washington, DC, USA.
This study modelled the extent to which demographics and the population prevalence of COVID-19mortality risk factors are likely to affect mortality in India, a major lower-middle-income country.This study distinguished between controlled and uncontrolled health conditions (such as diabetes), a difference that may significantly affect population risk.This study used multivariate HRs from the largest sample analysis of risk factors for COVID-19mortality to date (N=17 278 392), which is essential because bivariate HRs may overestimate the effect of pre-existing health conditions that increase with age.In addition to modelling total mortality, this study modelled the age distribution of deaths, which has been very different in low- and middle-income countries (LMICs) and high-income countries.The key limitation of this study is that there are almost no large sample analyses describing the additional risk of COVID-19mortality associated with health conditions that are more common in LMICs than in high-income countries, such as protein calorie malnutrition, micronutrient deficiency and HIV/AIDS.
Introduction
The number of cases of COVID-19 continues to rise around the world. A growing share of cases is coming from low- and middle-income countries (LMICs) in Asia, Africa and the Americas that were largely spared in the initial stages of the pandemic.1 Because the severity of infection increases substantially with age, forecasts have projected much lower aggregate mortality rates in LMICs than in high-income countries.2–4However, the reported fatality numbers from LMICs to date have suggested a much greater share of COVID-19deaths among the young. As of May 2020, 30.5% of deaths in Brazil occurred in those under age 60, whereas 27% of deaths in Mexico occurred in those under age 50.5–7 As of July 2020, 47% of COVID-19deaths in India occurred in those under age 60, whereas a particular study of Tamil Nadu and Andhra Pradesh found 54% of deaths occurred in those under age 65.8 9 In contrast, individuals under age 65 have accounted for only 5%–13% of deaths in 10 European countries and Canada and 8%–24% in US states.10 It is not presently known whether the different age pattern of deaths in LMICs is driven by erroneous reporting, differences in infection patterns, younger populations or different rates of COVID-19 comorbidities in the underlying populations.Many modelling studies have presumed that worse population health in LMICs will lead to excess mortality or else have ignored differential population health as a factor entirely.4 11 12 To date, there has been limited analysis of the prevalence in LMICs of the specific conditions associated with increased COVID-19 severity, such as diabetes, obesity, cardiovascular disease, hypertension and chronic kidney disease, or of how they change the expected level and age distribution of mortality.4 13–16 Some studies have adjusted mortality estimates for population comorbidities by treating all comorbidities as equivalent or by multiplying the mortality rate by a fixed amount to adjust for population health.17–20 One study combined condition-specific prevalence and HRs from a sample of hospitalisations, but excluded obesity and uncontrolled diabetes, and did not examine mortality or the age distribution of mortality as outcomes.21Using England as a benchmark, this study examines how comorbidities understood to increase COVID-19mortality are likely to affect COVID-19mortality rates in aggregate and across the age distribution in India, identifying the specific risk factors with the largest mortality effects. We further study the extent to which accounting for differences in demographics and underlying health conditions can explain the increased share of deaths among the young in India relative to England.Our analysis focuses on India and on the COVID-19 risk factors that are currently documented. At the time of writing, India has the second highest number of cumulative COVID-19infections in the world and one of the highest growth rates in infections of any major country, making it an essential population to study.1 The methodology is readily adjusted to account for new risk factors or data from other countries and may be useful for modelling the epidemic in a range of LMICs.
Methods
Our approach requires three types of data: (1) the relative risk of COVID-19mortality associated with gender, age and each health condition; (2) the age-specific prevalence of each health condition in England and India; and (3) the age and gender distributions for the two countries.
Estimates of relative risk of COVID-19 mortality from comorbidities
We obtained estimates of COVID-19mortality risk for a wide range of comorbidities from the OpenSAFELY study, a closed cohort study of 17 278 392 adults from England.22 This was the largest analysis of comorbidities associated with COVID-19mortality to date and one of the few studies that estimates risk factors in a multivariate model adjusting for age, sex and other health conditions. This adjustment is important because many COVID-19 comorbidities are increasing in age and their HRs are thus biased upwards in analyses not adjusting for age.The OpenSAFELY study enrolled all individuals registered with a general practice within The Phoenix Partnership system on 1 February 2020, who were aged 18 years or older on enrolment, who had at least 1 year of medical history within the system and who had recorded age and sex. The underlying data set represents 40% of the population of England, and the prevalence of health conditions in the study cohort is similar to estimates of population prevalence in England (online supplemental appendix p 4). Patients were followed through 6 May. The outcome was in-hospital death among people with confirmed COVID-19infections. HRs for mortality from a Cox proportional-hazards model were estimated for a comprehensive list of risk factors described in other studies, adjusted for sex, age and all other risk factors. As patient-level data from OpenSAFELY are not publicly available, we extracted HRs from the paper reporting results of the analysis.22 In the absence of patient-level COVID-19mortality data with comorbidity information in India, we assumed that comorbidity-associated HRs were the same in India as in England.Ideally, HRs would measure mortality risk conditional on infection, rather than on registration with a general practice (as in OpenSAFELY) or hospitalisation (as in prior work).21 23 The HRs in this study therefore reflect combined mortality and infection risk; the analysis assumes that pre-existing health conditions are not significant predictors of infection. Reassuringly, HRs measuring mortality risk conditional on hospitalisation in other studies are similar to those used here.11
Demographics and risk factor prevalence in India and England
Age distributions and age-specific sex ratios for India and England were obtained from official censuses.We obtained data on age-specific prevalence of health risk factors for India and England from multiple sources, prioritising biomarker data where available and matching definitions as closely as possible to the conditions for which HRs are available. We restrict samples to ages 18–99 for consistency with the HRs.For India, we aggregated individual-level biomarker data from two public population health surveys for obesity, diabetes and hypertension. The fourth round of the Indian District Level Household Survey and the second round of the Annual Health Survey were conducted between 2012 and 2014; they jointly cover 94% of the Indian population and provide the most recent nationwide direct measures of height, weight, fasting plasma glucose (FPG) and blood pressure (BP) for adults of all ages in India. Details of data set construction are provided in online supplemental appendix p 1. For England, age-specific prevalences of obesity, hypertension and diabetes were obtained from the nationally representative 2018 Health Survey for England, which collected symptoms and medical diagnoses for a range of illnesses, as well as direct measures of height, weight, BP and glycated haemoglobin (HbA1c).24Body mass index was classified into no evidence of obesity (<30 kg/m2), obese class 1 or 2 (30–39.9 kg/m2) and obese class 3 (40+ kg/m2). Hypertension was defined as systolic BP ≥140 mm Hg or diastolic BP ≥90 mm Hg (uncontrolled) or a medical diagnosis of hypertension with BP below the thresholds (controlled). The prevalence of controlled and uncontrolled hypertension was reported separately but combined in the risk estimation for consistency with OpenSAFELY. OpenSAFELY classified controlled diabetes as HbA1c >51 mmol/mol and HbA1c <58 mmol/mol, and uncontrolled diabetes as HbA1c ≥58 mmol/mol. Corresponding thresholds for the one-time FPG measures in the Indian data set are not well defined. In England, the prevalence was reported based on a threshold of HbA1c ≥48 mmol/mol (6.5%). Therefore, we followed the standard screening and diagnosis thresholds recommended by the WHO and International Diabetes Federation and defined uncontrolled diabetes in India as a plasma glucose reading ≥126 mg/dL (7.0 mmol/L) if fasting or ≥200 mg/dL (11.1 mmol/L) if not fasting. We used the corresponding recommended threshold of HbA1c ≥48 mmol/mol (6.5%) for uncontrolled diabetes in England.25 In both countries, we classified individuals with biomarkers below the thresholds but with a diagnosis of diabetes as having controlled diabetes.Age-specific prevalence for asthma, chronic heart disease, kidney disease, stroke, dementia, haematological malignancies and all other cancers was drawn from the Global Burden of Diseases, Risk Factors and Injuries Studies (GBD) for India and England.26 OpenSAFELY reports separate HRs for cancers diagnosed <1 year ago, 1–4.9 years ago and ≥5 years ago; because the year of diagnosis is unavailable in GBD, we used a single classification for each class of cancers and the HR for diagnosis <1 year ago. For chronic respiratory disease, we used chronic obstructive pulmonary disease (COPD) prevalence from the GBD for India and modelled COPD prevalence from the Clinical Practice Research Datalink cohort database for England.27 GBD prevalence of Parkinson’s disease, epilepsy, multiple sclerosis and motor neuron disease was combined and classified as neurological disorders.The following risk factors were not available for India and were excluded from the analysis for both England and India for comparability: fibrosing lung disease, bronchiectasis or cystic fibrosis, lupus, asthma with no recent oral corticosteroid use, cancers diagnosed more than a year ago, organ transplant and spleen disease. Given that the relationship between smoking and COVID-19mortality remains under debate, we excluded it from the analysis.28 We also excluded ethnicity and socioeconomic status, which cannot be measured comparably across England and India and are unlikely to have similar relative risk in the two countries. Where aggregate data were available only in coarse age bins, we used non-linear interpolation to reaggregate the data into age bins matching the HR data.
Estimating the contribution of health conditions to population COVID-19 mortality risk
The OpenSAFELY study reports HRs for each age group, sex and health condition with women aged 50–59 years with no conditions as the reference group.22 We transform the HR for each health condition c into a relative risk () assuming a population mortality rate r of 1%:To obtain continuous relative risk for age, we used a polynomial interpolation for the log HR at each age, renormalising with age 50 as the reference group (online supplemental appendix p 5).The additional population mortality risk associated with a given health condition increases with the condition’s relative risk for COVID-19mortality and with its prevalence. We defined the age-specific and condition-specific population relative risk of condition c at age a as follows:where is the prevalence of condition c at age a, and describes the proportional increase in mortality at age a associated with health condition c.We combined PRRs to obtain an age-specific population relative risk of mortality arising from the combined prevalence of all the health conditions:isolates the expected mortality difference at each age between India and England that is driven by the combined prevalence of all the health conditions studied. This approach implicitly assumes that the health conditions are uncorrelated with each other. Without microdata on the full set of health conditions, this assumption is unavoidable, but will bias the England versus India comparison only if the correlation of health conditions is substantially different in the two countries. We explore the possible extent of this bias in the online supplemental appendix. Using age-specific prevalence, our analysis fully accounts for the substantial correlations between age and health conditions.We next calculated the increase in population mortality from each health condition across all ages, taking into account the age-specific prevalence of each health condition, its relative risk and the population share at each age. The condition-specific population relative risk of each health condition across the full population () is given by the following equation:is greater when the relative risk of condition c is higher and when its prevalence is higher at ages with higher population. The combined effect on population mortality of all the health conditions is given by the product of each condition-specific .Finally, we aggregated the population relative risks across health conditions to model the age distribution of deaths in each country. The number of deaths at each age is the product of the mortality rate of the reference group (50-year-old women with no other risk factors), the population at age a, the age-specific population relative risk of the full set of health conditions (), the PRR of gender and the direct relative risk of COVID-19mortality for an individual at age a ():We plotted the age distribution of deaths as shares of all deaths rather than in levels, eliminating the need to assume a reference group mortality rate. We summarised the shape of the distribution by reporting the share of expected deaths in each country that are under the age of 60. We present results from three models: (1) England’s demographics and health distribution, (2) India’s demographics and health distribution and (3) India’s demographics but England’s age-specific prevalence of health risk factors. The third model allowed us to examine the mortality shift that comes from differences in population health alone.
Role of the funding source
The funder of the study had no role in study design, data collection, data analysis, data interpretation or writing. The corresponding author had full access to the data in the study and had final responsibility for the decision to submit for publication.
Patient and public involvement
Because this study uses existing epidemiological data, it was not appropriate to involve patients or the public in the research.
Results
Prevalence of COVID-19 risk factors in India and England
Demographic characteristics and overall prevalence of risk factors are substantially different in India relative to England (table 1). In all, 83.7% of Indian adults are below the age of 60, compared with 69.9% of English adults. Indians have substantially lower rates of obesity and cancer (4.4% and 0.3% in India compared with 27.9% and 2.8% in England), but higher rates of uncontrolled diabetes, kidney disease and chronic liver disease (8.9%, 9.7% and 5.3% in India compared with 2.1%, 5.6% and 2.6% in England).
Table 1
Prevalence of COVID-19 risk factors in India and England
Prevalence (%)
India
England
Ages 18–39
50.2
36.6
Ages 40–49
19.2
16.3
Ages 50–59
14.3
17.0
Ages 60–69
10.3
13.3
Ages 70–79
4.6
10.4
Ages 80–99
1.5
6.3
Male
47.1
48.9
Diabetes (controlled)
1.7
6.4
Diabetes (uncontrolled)
8.9
2.1
Hypertension
28.2
28.0
Obese (classes 1 and 2)
4.0
24.8
Obese (class 3)
0.4
3.1
Chronic heart disease
4.4
5.9
Chronic respiratory disease
4.8
2.5
Asthma
2.5
9.2
Kidney disease
9.7
5.6
Chronic liver disease
5.3
2.6
Haematological cancer
0.0
0.2
Non-haematological cancer
0.3
2.6
Stroke, dementia
1.3
1.5
Other neurological conditions
0.0
0.1
Psoriasis, rheumatoid
1.0
2.4
Other immunosuppressive conditions
0.1
0.1
Prevalence of COVID-19 risk factors in India and EnglandWe show differences in age-specific prevalence between India and England for the conditions for which we have biomarkers in India and are more precisely estimated (figure 1), as well as age-specific prevalence of all conditions for both countries (online supplemental appendix p 3). The overall rates of diabetes are higher in India at all ages, but diabetes in India is overwhelmingly uncontrolled, whereas three-quarters of diabetes is controlled in England. Hypertension (the sum of controlled and uncontrolled) is higher in India at young ages (31.3% for ages 40–49 in India and 18.3% in England) but lower at higher ages (52.3% at ages 70–79 in India and 61.3% in England) and is overwhelmingly uncontrolled. Conversely, obesity rates are higher at all ages in England.
Figure 1
Prevalence of (A) diabetes, (B) hypertension and (C) obesity in India and England.
Prevalence of (A) diabetes, (B) hypertension and (C) obesity in India and England.
Relative risk of COVID-19 mortality from combined risk factors in India and England
The age-specific population relative risk of COVID-19mortality from all health conditions combined () is higher in India than in England at nearly all ages, but the difference in between the two countries is below 15% at every age and highest between ages 40 and 80 (figure 2).
Figure 2
Age-specific population relative risk of COVID-19 mortality from all health conditions (PRR).
Age-specific population relative risk of COVID-19mortality from all health conditions (PRR).reflects the age-specific prevalence and associated COVID-19mortality risk from all health conditions combined. Taking risk, prevalence and population at every age into account provides the full population relative risk of COVID-19mortality from each health condition ()—or the proportional increase in population mortality across all ages associated with each health condition (table 2).
Table 2
Population relative risk of COVID-19 mortality from each health condition (PRR
).
Individualrelative risk
Populationrelative risk (PRRc)
India
England
Diabetes (controlled)
1.31
1.004
1.020
Diabetes (uncontrolled)
1.94
1.078
1.020
Hypertension
0.89
0.971
0.969
Obese (classes 1 and 2)
1.15
1.006
1.037
Obese (class 3)
1.91
1.004
1.028
Chronic heart disease
1.17
1.008
1.021
Chronic respiratory disease
1.62
1.035
1.015
Asthma
1.13
1.003
1.011
Kidney disease
1.42
1.050
1.046
Chronic liver disease
1.73
1.042
1.024
Haematological cancer
2.79
1.000
1.007
Non-haematological cancer
1.71
1.002
1.033
Stroke, dementia
2.15
1.016
1.041
Other neurological conditions
2.56
1.002
1.002
Psoriasis, rheumatoid
1.19
1.002
1.007
Other immunosuppressive conditions
1.69
1.001
1.001
Population relative risk of COVID-19mortality from each health condition (PRR
).Uncontrolled diabetes, which is associated with substantial mortality risk (=1.94), increases total mortality by 7.8% in India (=1.078), but only by 2.0% in England ( =1.020), reflecting its significantly higher prevalence in India at all ages. In contrast, controlled diabetes, more common in England than in India, is associated with 2.0% higher mortality in England and only 0.4% in India (=1.020 vs 1.004). In addition to uncontrolled diabetes, the health conditions associated with the largest increases in mortality in India are kidney disease (=1.050) and chronic respiratory disease (=1.035). In England, the most consequential health conditions are obesity (combined across all obesity classes=1.065) and kidney disease (=1.046).Comparing the percentage difference between of each health condition between India and England (figure 3), the condition with the largest differential impact on mortality between the two countries is uncontrolled diabetes, which is associated with 5.67% higher population mortality in India relative to England. Mortality in India relative to England is also increased by chronic respiratory disease (+1.88%) and chronic liver disease (+1.73%), but decreased by the differential prevalence of obesity (combined −5.47%), cancer (−3.65%) and stroke/dementia (−2.47%). No other risk factor has an effect of greater than ±2% on India’s relative mortality. The combined effect of health conditions leads to 6.26% higher mortality in England than in India, reflecting England’s higher age-specific prevalence of certain conditions like obesity and cancer, as well as its older age structure that increases population share at ages with worse health. This differential mortality risk does not include the direct effect of older age, which is associated with substantial risk (=6.08 for ages 70–80 and 20.61 for age >80) and magnifies England’s mortality disadvantage substantially.
Figure 3
Percent change in population relative risk of COVID-19 mortality from each health condition (PRR
) in India versus England.
Percent change in population relative risk of COVID-19mortality from each health condition (PRR
) in India versus England.Combining the population relative risk from health conditions with the direct effect of demographics on mortality, we modelled the density of deaths across the age distribution (figure 4). In England, 9.4% of modelled deaths are below age 60, closely matching the 6.5% observed in England through May 2020 and reported in the OpenSAFELY data set. In India, 23.5% of modelled deaths are below age 60, which is substantially lower than the 50% observed in case reports. Applying England’s age-specific prevalence of health conditions to India’s demographic distribution, to isolate the effect of health conditions from demographics, results in a distribution nearly identical to the India model. In other words, differences in health conditions between India and England have almost no effect on mortality, indicating that the higher share of deaths in younger populations in India comes from the demographic distribution alone.
Figure 4
Modelled age distribution of COVID-19 mortality.
Modelled age distribution of COVID-19mortality.In the online supplemental appendix, we test sensitivity to uncertainty in prevalences and HRs (online supplemental appendix pp 6–8). The latter estimates cover alternate HRs estimated from other studies.21 29 We also test sensitivity to alternate assumptions about covariance of health conditions (online supplemental appendix p 9). In all cases, we find that the population relative risk from health conditions in England is greater than that in India, and that accounting for health conditions cannot explain any of the higher incidence of mortality among the young in India relative to England.
Discussion
We used the best publicly available data on population health to examine the extent to which demographics and pre-existing health conditions known to be associated with COVID-19mortality can account for the disproportionately high share of COVID-19deaths in younger populations observed in India relative to England. We show that differences in population health do not significantly shift the relative age distribution of disease severity and slightly lower aggregate mortality in India relative to England. Higher prevalence of diabetes and respiratory illness increases mortality risk relative to England, but these effects are offset by lower rates of obesity, heart disease and cancer. While the Indian age distribution substantially shifts expected mortality towards the young, it explains only a third of the difference in the share of deaths under 60 compared with England.Epidemiological models have typically assumed that comorbidities will exacerbate the mortality of COVID-19 in India and other LMICs relative to high-income countries. We found that comorbidities identified as key risk factors in high-income countries are not associated with higher expected mortality in India relative to England, in aggregate or among the young. This suggests that understanding the other factors that may explain the differential mortality among the young observed in lower income contexts, such as different patterns of infection, under-resourced health systems or comorbidities unique to LMICs, should be a priority for further research.This study improves on prior work by examining the extent to which comorbidities can explain the younger incidence of COVID-19mortality in LMICs, by estimating mortality effects of specific comorbidities and by calibrating a model with a comprehensive set of comorbidity HRs drawn from a large-sample multivariate analysis of COVID-19mortality. Models calibrated with bivariate HRs or raw prevalences of comorbidities among severe cases are likely to overestimate the effect of pre-existing health conditions because of the significant increase in all comorbidities with age alongside the direct effect of age on COVID-19mortality.The key limitation of this study is that there are virtually no data on the COVID-19mortality risks associated with health conditions that are more common in LMICs than in high-income countries, such as protein calorie malnutrition, micronutrient deficiency and HIV/AIDS.4 If these conditions make individuals more susceptible to severe infections, then population health may indeed exacerbate the severity of COVID-19 in LMICs. Understanding the extent to which health conditions endemic to LMICs affect COVID-19 severity is an urgent priority, particularly as policy responses increasingly focus on identifying and isolating high-risk individuals.30Our analysis is also constrained by the limited and changing evidence on risk factors for COVID-19 severity. Based on the availability of existing measures, our model assumed that health condition relative risks are age-invariant. However, data from New York’s epidemiological surveillance system suggest that hypertension and diabetes may contribute more to mortality at younger ages,31 which would exacerbate the burden of illness among the young in LMICs. Furthermore, if illness severity and the quality of prior medical management of pre-existing health conditions change mortality risk for the same diagnosis across contexts, applying HRs from England may understate mortality risk in India. Finally, HRs which are not conditioned on infection may reflect infection risk in addition to disease severity risk and thus may not translate directly to the Indian context.Recognising these limitations, we have posted our analysis on github (see data availability statement), allowing estimates to be calibrated with different risk factors, HRs and data from other countries, as more research on the virus emerges.
Authors: Hafiz Muhammad Salman; Javaria Syed; Atif Riaz; Zouina Sarfraz; Azza Sarfraz; Syed Hashim Abbas Ali Bokhari; Ivan Cherrez Ojeda Journal: BMC Public Health Date: 2022-03-07 Impact factor: 3.295