Philip Arevalo1, Huong Q McLean2, Edward A Belongia2, Sarah Cobey1. 1. Department of Ecology and Evolutionary Biology, University of Chicago, Chicago, United States. 2. Center for Clinical Epidemiology and Population Health, Marshfield Clinic Research Institute, Marshfield, United States.
Abstract
Seasonal variation in the age distribution of influenza A cases suggests that factors other than age shape susceptibility to medically attended infection. We ask whether these differences can be partly explained by protection conferred by childhood influenza infection, which has lasting impacts on immune responses to influenza and protection against new influenza A subtypes (phenomena known as original antigenic sin and immune imprinting). Fitting a statistical model to data from studies of influenza vaccine effectiveness (VE), we find that primary infection appears to reduce the risk of medically attended infection with that subtype throughout life. This effect is stronger for H1N1 compared to H3N2. Additionally, we find evidence that VE varies with both age and birth year, suggesting that VE is sensitive to early exposures. Our findings may improve estimates of age-specific risk and VE in similarly vaccinated populations and thus improve forecasting and vaccination strategies to combat seasonal influenza.
Seasonal variation in the age distribution of influenza A cases suggests that factors other than age shape susceptibility to medically attended infection. We ask whether these differences can be partly explained by protection conferred by childhood influenza infection, which has lasting impacts on immune responses to influenza and protection against new influenza A subtypes (phenomena known as original antigenic sin and immune imprinting). Fitting a statistical model to data from studies of influenza vaccine effectiveness (VE), we find that primary infection appears to reduce the risk of medically attended infection with that subtype throughout life. This effect is stronger for H1N1 compared to H3N2. Additionally, we find evidence that VE varies with both age and birth year, suggesting that VE is sensitive to early exposures. Our findings may improve estimates of age-specific risk and VE in similarly vaccinated populations and thus improve forecasting and vaccination strategies to combat seasonal influenza.
Seasonal influenza is a serious public health concern, resulting in approximately 100,000–600,000 hospitalizations and 5000–27,000 deaths per year in the United States despite extensive annual vaccination campaigns (Reed et al., 2015). The rapid evolution of the virus to escape preexisting immunity contributes to the relatively high incidence of influenza, including in previously infected older children and adults. How susceptibility arises and changes over time in the host population has been difficult to quantify.A pathogen’s rate of antigenic evolution should affect the mean age of the hosts it infects, and differences in the rate of antigenic evolution have been proposed to explain differences in the age distributions of the two subtypes of influenza A. Compared to H3N2, H1N1 disproportionately infects children (Gagnon et al., 2018b; Caini et al., 2018; Khiabanian et al., 2009). It also evolves antigenically more slowly (Bedford et al., 2015). Thus, compared to H3N2, H1N1 is slower to escape immunity in individuals who have experienced prior infection (namely older children and adults), making them less susceptible to reinfection (Bedford et al., 2015; Beauté et al., 2015; Caini et al., 2018; Khiabanian et al., 2009). H3N2, in contrast, exhibits well known changes in antigenic phenotype that are expected to drive cases toward adults (Smith et al., 2004; Cobey and Hensley, 2017). Under this simple model, hosts previously infected with a subtype face equal risk of reinfection (on challenge) with an antigenic variant of that subtype.The age distributions of influenza cases in exceptional circumstances—pandemics and spillovers of avian influenza—have shown unexpected variation that suggests important effects of prior infection. Excess mortality in some adult cohorts during the 1918 and 2009 H1N1 pandemics correlates with childhood infection with other subtypes (Gagnon et al., 2013; Worobey et al., 2014; Gagnon et al., 2018a). In the post-2009 pandemic period, excess mortality and hospitalization were observed among cohorts first exposed to H2N2 or H3N2 during H1N1pdm-dominated seasons (Budd et al., 2019). Similarly, the subtypes circulating in childhood predict individuals’ susceptibility to severe zoonotic infections with avian H5N1 and H7N9, regardless of later exposure to other seasonal subtypes (Gostic et al., 2016). These patterns suggest that early influenza infections, and not prior infection per se, strongly shape susceptibility.Early infections might also affect the protection conferred by influenza vaccination. Foundational work on the theory of original antigenic sin demonstrated that an individual’s immune response to influenza vaccination is biased toward antigens similar to those encountered in childhood (Davenport and Hennessy, 1956). In some cases, this may result in a narrow antibody response focused on a single epitope (Davis et al., 2018). This phenomenon has been suggested to explain an unexpected decrease in vaccine effectiveness (VE) in the middle-aged in the 2015–2016 influenza season (Skowronski et al., 2017b; Flannery et al., 2018). More generally, it has been hypothesized that biases in immune memory can arise from both past infections and vaccinations and lead to variation in VE that is sensitive to the precise history of exposures (Smith et al., 1999; Skowronski et al., 2017a).To measure the effect of early exposures on medically attended infection risk and VE, we fitted statistical models to 3493 PCR-confirmed influenza cases identified through seasonal studies of influenza VE from the 2007–2008 to 2017–2018 seasons in the Marshfield Epidemiologic Study Area (MESA) in Marshfield, Wisconsin (Belongia et al., 2009; Belongia et al., 2011; Griffin et al., 2011; Treanor et al., 2012; Ohmit et al., 2016; McLean et al., 2014; Gaglani et al., 2016; Zimmerman et al., 2016; Jackson et al., 2017; Flannery et al., 2018, Figure 1—figure supplement 1). Each influenza season, individuals in a defined community cohort were recruited and tested for influenza when seeking outpatient care for acute respiratory infection. Eligibility was restricted to individuals >6 months of age living in MESA who received routine care from the Marshfield Clinic and who presented in an outpatient setting.
Figure 1—figure supplement 1.
Sample collection and final study population.
Flowchart of sample collection (A) and final study population stratified by season, age, test status, and vaccination status (B). ‘Test-positive’ is defined as testing positive for the dominant circulating influenza A subtype in that season.
We sought to explain the variation in the age distribution of these cases by subtype and over time. Our model predicted the relative number of cases of influenza in each birth year each season as a function of the age structure of the population, age-specific differences in the risk of medically attended influenzaA infection, early influenza infection, and vaccination. Despite the extensive antigenic evolution in both subtypes over the study period, we found strong evidence of protection from the subtype to which a birth cohort was likely first infected (the imprinting subtype) and variation in VE by birth cohort.
Materials and methods
Study cohort
Cases of PCR-confirmed, medically attended influenza were identified from annual community cohorts based on residency in MESA. MESA is a contiguous geographic area surrounding Marshfield, Wisconsin, where nearly all 61,000 residents receive outpatient and inpatient care from the Marshfield Clinic Health System (Kieke et al., 2015). For each influenza season from 2007 to 2008 through 2017–2018, we identified MESA residents >6 months of age who received routine care from the Marshfield Clinic. These individuals were eligible for recruitment into that season’s VE study if they sought care for acute respiratory infection. Trained research coordinators recruited patients during clinical encounters in primary care departments, including urgent care, pediatrics, combined internal medicine and pediatrics, internal medicine, and family practice. Patients were enrolled on weekdays, evenings, and weekends when clinical services were provided. Research staff used an electronic appointment system to screen the chief complaints for respiratory or febrile illness. Patients were then approached in-person to assess eligibility based on specific respiratory symptoms and duration of illness. The proportion of patients with medically attended acute respiratory infection (MAARI) who were screened for enrollment varied by season and was largely determined by the volume of patients each day and staffing capacity. Only symptoms and illness duration were used to determine eligibility among those patients who were in the predefined cohort. Patients were also assessed for the presence of medical conditions that put them at high risk for complications from influenza infection, as defined by the Advisory Committee on Immunization Practice (Smith et al., 2006). These conditions included cardiovascular disease, diabetes, pulmonary disease, cancer, kidney disease, liver disease, blood disorders, immunosuppressive disorders, metabolic disorders, and neurological/musculoskeletal disorders. We considered subjects vaccinated if they received that season’s influenza vaccine ≥14 days before enrollment. For the 2009–2010 season, we only considered receipt of the 2009 monovalent vaccine. The Marshfield Clinic generally does not capture MAARI in nursing facilities with dedicated medical staff, causing undersampling of the oldest age groups. We adjusted for this (Appendix 1: ‘Age-specific rates of approachment, enrollment, and nursing home residence’).Each season, recruitment began when influenza activity was detected in the community and usually continued for 12–15 weeks. Symptom eligibility criteria varied by season but included fever/feverishness or cough during most seasons. We retroactively standardized symptom eligibility criteria to only require cough as a symptom. Individuals with illness duration >7 days or presenting in an inpatient (hospital) setting were excluded. After obtaining informed consent, a mid-turbinate swab was obtained for influenza detection. RT-PCR was performed using CDC primers and probes to identify influenza cases, including type and subtype.
Calculating differences in the age distribution between seasons
We defined the age distribution of each season as the number of cases of the dominant (more common) subtype in each of nine age groups (0–4 year-olds, 5–9 year-olds, 10–14 year-olds, 15–19 year-olds, 20–29 year-olds, 30–39 year-olds, 40–49 year-olds, 50–64 year-olds, and ≥65 years old). We excluded the subdominant subtype in each season due to concerns that short-term interference between the subtypes (Laurie et al., 2015; Goldstein et al., 2011) would affect the age distribution of the rarer subtype. The G-test of independence was used to measure differences in seasons’ age distributions.
Calculating relative risk
To evaluate relative infection risk in different age groups, we measured their relative risk of infection in the first versus second half of each season. This risk is a combination of the chance of infection, conditional on infection (susceptibility), and the rate of contact with infectedpeople. Attack rates should be higher in populations that experience more risk, and therefore these populations should be infected earlier in the epidemic (Worby et al., 2015). To calculate relative risk we used an approach similar to Worby et al., 2015. We defined the midpoint of each season as the week in which the cumulative number of cases of the dominant subtype among all people exceeded half the total for that season. Weeks before and after this point were assigned to the first and second half of the season, respectively. We assigned each case to one of the five age groups used by Worby et al., 2015 (0-4 year-olds, 5–17 year-olds, 18–49 year-olds, 50–64 year olds, and ≥65 years old). For each age group , we defined relative risk aswhere and are the fraction of cases of the dominant subtype during influenza season that occurred during the first or second half of the season, respectively. A relative risk >1 indicates that cases in an age group were more likely to occur during the first half of the season.
Calculating imprinting probabilities
We hypothesized that the subtype of a person’s first influenzaA infection affects their future susceptibility to that subtype. Testing this hypothesis requires knowing the probability that a person’s primary influenzaA infection was with a particular subtype. To calculate these probabilities, we emulated the approach of Gostic et al., 2016, which assumes these probabilities are determined by a person’s year of birth and subsequent exposure to each subtype.First, we calculated the probability that an individual born in year received their first influenza A exposure in influenza season . Assuming a constant per-season rate of infection , the probability of infection in one season (i.e., the attack rate) is given byBy assuming that the average probability that a naive individual is infected in a single season is 0.28 (Bodewes et al., 2011; Gostic et al., 2016), we calculated the expected per-season infection rate () asHowever, because the intensity of epidemics varies between seasons (, Appendix 1: ‘Seasonal intensity’) and the fraction of the epidemic experienced by a person depends on their birth year (, Appendix 1: ‘Fraction of season experienced’), we considered the time-varying per-season infection rate,Therefore, the probability that a naive individual born in year is infected in season isWe used to calculate the fraction of a birth cohort that received their first influenzaA infection in season . Let be the fraction of people born in year who were unexposed at the beginning of season (Appendix 1: ‘Calculating the fraction unexposed’). The probability that a person born in year has their first infection in season isWe calculated , the probability that a person born in year had their first influenzaA infection with subtype in season , by multiplying by the frequency of subtype in season , (Figure 3—figure supplement 1),
Figure 3—figure supplement 1.
Intensity and subtype frequencies of influenza A.
The intensity (top panel) and subtype frequencies (bottom panel) of influenza A seasons in the United States. Intensity is measured as the product of influenza-like illness (ILI) and the fraction of respiratory specimens testing positive for influenza A in national surveillance data (Appendix 1: ‘Seasonal intensity’). This is normalized to the average intensity value between 1977 and 2017–2018. Seasons before 1977 where United States ILI surveillance data are unavailable are assumed to have an intensity score of 1 (i.e., the average score over all other seasons). Subtype frequencies were obtained from national surveillance data before the 2007–2008 season and directly from the MESA studies afterwards.
Modeling approach
We aimed to predict , the fraction of cases of subtype in season among people born in year with vaccination status . Our models assume that this is proportional to a combination of the following factors:Demography. The age distribution of our study cohort is not static over the study period. All models adjusted for the changing fractions of the population in each birth cohort and season (Figure 1—figure supplement 2; Mathematical expressions for model components: ‘Demography’).
Figure 1—figure supplement 2.
Birth year distribution of population.
Each panel shows the population distribution of all individuals in the study area who met the age criteria for study enrollment. People under 6 months old at the start of the sampling period in a season were not eligible to participate.
Age-specific effects. We considered that age itself may be associated with differences in medically attended influenzaA infection risk stemming from differences in susceptibility and/or rates of contact with infectious people. Additionally, we expect that age groups may intrinsically differ in their healthcare-seeking behaviors. These factors are inseparable in our data, and all models represent their combined effects with a static age-specific parameter shared by both subtypes that describes the risk of age-specific medically attended influenzaA infection (Mathematical expressions for model components: ‘Age-specific factors’). We assumed no intrinsic differences in the age-specific virulence of the two subtypes. These age-specific parameters were fitted. We also adjusted for other potential sources of age-specific bias, including age-specific differences in study approachment and enrollment rates (Appendix 1: ‘Age-specific rates of approachment, enrollment, and nursing home residence’).Imprinting. We tested several hypotheses of how primary exposures could affect the risk of medically attended infection with H1N1 and H3N2. In each version, we estimated fractional reductions in risk of medically attended H1N1 and H3N2infection due to primary (i.e., imprinting) exposure to the same type:Subtype-specific imprinting: Influenza has two main antigens, hemagglutinin (HA) and neuraminadase (NA). Imprinting could in theory derive from responses to either or both antigens. Because H1N1 is the only seasonal subtype of influenza with N1, we cannot separate the effects of initial N1 exposure from initial H1 exposure. However, since N2 appears in both H3N2 and H2N2 viruses, we can estimate protection against H3N2infection from initial N2 exposure separately from protection from initial H3 exposure (Mathematical expressions for model components: ‘HA subtype imprinting’ and ‘N2 imprinting’).Group-level imprinting: Influenza A viruses fall into two groups (I and II) corresponding to the two phylogenetic clades of HA. Gostic et al., 2016 found that primary infection by a virus belonging to one group protected against severe infection by another subtype in the same group. If group-level imprinting were influential, we would see primary infection with H2N2 conferring protection against H1N1, another group I virus, as well as H1N1 protecting against H1N1, and H3N2 against H3N2. We considered a separate class of models that assumes group-level protection instead of subtype-specific protection (Mathematical expressions for model components: ‘HA group imprinting’).Vaccination. Approximately 45% of the MESA population was vaccinated against influenza each year (Figure 1—figure supplement 3; Appendix 1: ‘Vaccination coverage’). We estimated cases in vaccinated and unvaccinated individuals of each birth year separately. Naively, we expect that vaccinated individuals should seek medical attention for acute respiratory infection proportionally to the fraction of their cohort vaccinated that season. However, vaccinated individuals may seek medical attention for acute respiratory infection more frequently than non-vaccinees due to correlations between the decision to vaccinate, healthcare-seeking behavior, and underlying medical conditions (Jackson et al., 2006a; Jackson et al., 2006b; Belongia et al., 2011). Indeed, we generally observed higher rates of high-risk medical conditions among vaccinated people compared to unvaccinated people (Figure 1—figure supplement 4). We attempted to adjust for this by calculating the fraction of vaccinated people among those who had MAARI and tested negative for influenza (i.e., the test-negative controls, ‘Mathematical expressions for model components: Vaccination’). We found that the vaccinated fraction exceeds vaccination coverage for most age groups, suggesting vaccinated individuals are overrepresented among cases for reasons unrelated to influenza (Figure 1—figure supplement 5). We also assumed that vaccination is not perfectly effective, and defined VE as the fractional reduction in cases expected in vaccinated compared to unvaccinated individuals after controlling for the effects described above. We estimated subtype-specific VE under five scenarios: (i) constant across age groups and seasons; (ii) constant across age groups but season-specific; (iii) age-specific but constant across seasons; (iv) imprinting-specific; and (v) birth-cohort-specific. We assumed that vaccination affects risk only in the current season, i.e, vaccination in a prior season confers no residual protection (Mathematical expressions for model components: ‘Vaccination’; Ohmit et al., 2014; Ohmit et al., 2016; Jackson et al., 2017; Skowronski et al., 2016; Pebody et al., 2013; McLean et al., 2018).
Figure 1—figure supplement 3.
Vaccination coverage.
We estimated monovalent vaccination coverage in 2009–2010 by measuring vaccination coverage among enrolled people and fitting a smoothing spline to the data (solid line).
Figure 1—figure supplement 4.
Age distribution of high-risk medical status.
High-risk medical status (Materials and methods, ‘Study cohort’) varies with age and vaccination status but stays relatively consistent across seasons. Each plot shows the fraction of enrolled people who had a high-risk medical condition for each season stratified by age, vaccination status, and test status. High-risk medical condition data was not collected for the 2009 pandemic season.
Figure 1—figure supplement 5.
Rate of MAARI in vaccinated and unvaccinated controls.
Vaccinated individuals seek healthcare for MAARI at a higher rate than predicted by vaccination coverage. We measured the fraction of vaccinated people among all who presented with MAARI and tested negative for influenza (; Materials and methods: ‘Vaccination’). This is plotted against vaccination coverage by season for different age groups. The dashed grey line shows where and vaccination coverage are equal. Vaccination coverage for the 2009–2010 season uses monovalent vaccination coverage estimated directly from all individuals with MAARI. We do not show the 2009 pandemic season because the monovalent vaccine was not distributed until the second wave of the pandemic.
We defined models as specific combinations of the above factors. We tested a set of 10 models by pairing each of the possible implementations of HA imprinting with each implementation of VE (Figure 1). Demography, age-specific effects, and N2 imprinting were included in all these models. To test whether more complex models truly improved model fit, we also tested a simple model with constant VE and no effect of imprinting. We evaluated these 11 models by maximum likelihood and compared their performance using the corrected Akaike information criterion (cAIC, ‘Model likelihood’) and leave-one-out cross-validation.
Figure 1.
Summary of models tested.
Ten different models result from considering different combinations of HA imprinting and VE. We also tested one additional model excluding the effects of N2 and HA imprinting (Materials and methods: ‘Modeling approach’).
Flowchart of sample collection (A) and final study population stratified by season, age, test status, and vaccination status (B). ‘Test-positive’ is defined as testing positive for the dominant circulating influenza A subtype in that season.
Each panel shows the population distribution of all individuals in the study area who met the age criteria for study enrollment. People under 6 months old at the start of the sampling period in a season were not eligible to participate.
We estimated monovalent vaccination coverage in 2009–2010 by measuring vaccination coverage among enrolled people and fitting a smoothing spline to the data (solid line).
High-risk medical status (Materials and methods, ‘Study cohort’) varies with age and vaccination status but stays relatively consistent across seasons. Each plot shows the fraction of enrolled people who had a high-risk medical condition for each season stratified by age, vaccination status, and test status. High-risk medical condition data was not collected for the 2009 pandemic season.
Vaccinated individuals seek healthcare for MAARI at a higher rate than predicted by vaccination coverage. We measured the fraction of vaccinated people among all who presented with MAARI and tested negative for influenza (; Materials and methods: ‘Vaccination’). This is plotted against vaccination coverage by season for different age groups. The dashed grey line shows where and vaccination coverage are equal. Vaccination coverage for the 2009–2010 season uses monovalent vaccination coverage estimated directly from all individuals with MAARI. We do not show the 2009 pandemic season because the monovalent vaccine was not distributed until the second wave of the pandemic.
Each bar shows the fraction of individuals who were vaccinated in that season who also received at least one influenza vaccination in the previous two seasons.
Most vaccinated study participants received the inactivated influenza vaccine. The fraction of vaccinated people who received the standard-dose inactivated influenza vaccine (IIV-SD), the high-dose inactivated influenza vaccine (IIV-HD), or the live attenuated influenza vaccine (LAIV) is shown for all participants (A), children < 18 years old (B), and adults ≥65 years old (C).
Summary of models tested.
Ten different models result from considering different combinations of HA imprinting and VE. We also tested one additional model excluding the effects of N2 and HA imprinting (Materials and methods: ‘Modeling approach’).
Sample collection and final study population.
Flowchart of sample collection (A) and final study population stratified by season, age, test status, and vaccination status (B). ‘Test-positive’ is defined as testing positive for the dominant circulating influenza A subtype in that season.
Birth year distribution of population.
Each panel shows the population distribution of all individuals in the study area who met the age criteria for study enrollment. People under 6 months old at the start of the sampling period in a season were not eligible to participate.
Vaccination coverage.
We estimated monovalent vaccination coverage in 2009–2010 by measuring vaccination coverage among enrolled people and fitting a smoothing spline to the data (solid line).
Age distribution of high-risk medical status.
High-risk medical status (Materials and methods, ‘Study cohort’) varies with age and vaccination status but stays relatively consistent across seasons. Each plot shows the fraction of enrolled people who had a high-risk medical condition for each season stratified by age, vaccination status, and test status. High-risk medical condition data was not collected for the 2009 pandemic season.
Rate of MAARI in vaccinated and unvaccinated controls.
Vaccinated individuals seek healthcare for MAARI at a higher rate than predicted by vaccination coverage. We measured the fraction of vaccinated people among all who presented with MAARI and tested negative for influenza (; Materials and methods: ‘Vaccination’). This is plotted against vaccination coverage by season for different age groups. The dashed grey line shows where and vaccination coverage are equal. Vaccination coverage for the 2009–2010 season uses monovalent vaccination coverage estimated directly from all individuals with MAARI. We do not show the 2009 pandemic season because the monovalent vaccine was not distributed until the second wave of the pandemic.
Repeat vaccination by age group and season.
Each bar shows the fraction of individuals who were vaccinated in that season who also received at least one influenza vaccination in the previous two seasons.
Vaccine type received.
Most vaccinated study participants received the inactivated influenza vaccine. The fraction of vaccinated people who received the standard-dose inactivated influenza vaccine (IIV-SD), the high-dose inactivated influenza vaccine (IIV-HD), or the live attenuated influenza vaccine (LAIV) is shown for all participants (A), children < 18 years old (B), and adults ≥65 years old (C).
Mathematical expressions for model components
Demography
We expect that the fraction of cases in each birth cohort should be proportional to the underlying demographic birth year distribution of the population. To calculate the demographic birth year distribution, we used MESA-specific data on the age distribution for each season (Kieke et al., 2015). Because people ≥90 years old were grouped into a single age class, we estimated the number of people in each age ≥90 years old by assuming a geometric decline in population with age. We converted the age distribution for each season into a distribution by birth year by assigning people of a specific age into the two possible birth years of that age (Appendix 1: ‘Birth year distribution of the study population’). Therefore,where is the fraction of the population in season who were born in year .
Age-specific factors
We modeled intrinsically age-specific differences in medically attended influenzaA infection risk and healthcare-seeking behavior by using parameters that represent the relative risk of medically attended influenzaA infection in each age group. These parameters combine the effects of underlying age-specific differences in influenza A medically attended infection risk as well as age-specific differences in healthcare-seeking behavior. We considered the same age groups as before (0–4 year-olds, 5–9 year-olds, 10–14 year-olds, 15–19 year-olds, 20–29 year-olds, 30–39 year-olds, 40–49 year-olds, 50–64 year-olds, and ≥65 years old). We chose 20–29 year-olds as our reference age group. All age groups aside from 20 to 29 year-olds had an associated parameter () that scaled their risk of medically attended influenzaA infection relative to 20–29 year-olds. These parameters can take on any positive value.Since our models describe the distribution of cases by birth year and not by age, we mapped the age-group-specific parameters () to birth cohorts in each season (). We considered that each birth cohort has two possible ages in each season ( and ). Let be a function that specifies the age group of a given age . Then , the age-specific relative risk in season of medically attended influenzaA infection for a person born in year , iswhere and are the fractions of birth cohort who are age or in influenza season (Appendix 1: ‘Fraction of birth cohort with specific age’), and and are the age-group-specific parameters for and .Our models also included age-specific approachment rates (), enrollment rates (), and nursing home enrollment () as covariates, all of which bias the age distribution of medically attended influenza infections (Appendix 1: ‘Age-specific rates of approachment, enrollment, and nursing home residence’). The combination of estimated age-specific effects and age-specific covariates was modeled as
HA subtype imprinting
We considered that imprinting to HA reduces a birth cohort’s risk of future infection from the same HA subtype. Therefore,where is the strength of HA imprinting for subtype and is the imprinting probability in season of birth cohort to subtype (‘Calculating imprinting probabilities’).
HA group imprinting
We considered that imprinting to HA reduces a birth cohort’s risk of future infection with viruses from the same HA group. Therefore,where is the strength of HA imprinting for group one viruses; is the strength of HA imprinting for group two viruses; and , , and are the imprinting probabilities in season of birth cohort to H1N1, H2N2, and H3N2.
N2 imprinting
We considered that imprinting to N2 reduces a birth cohort’s risk of H3N2infection. Therefore,where is the strength of N2 imprinting, and and are the imprinting probabilities of birth cohort in season to H3N2 and H2N2.
Vaccination
We assumed that vaccination decreases the risk of medically attended infection. However, vaccinated individuals may seek healthcare for symptomatic influenza at a different rate than unvaccinated individuals. Moreover, because vaccines are routinely recommended for individuals with underlying health conditions, pre-existing susceptibility to MAARI among vaccinated individuals may also differ from unvaccinated individuals. Let represent the fraction of vaccinated individuals in age group in season that present with MAARI. We use test-negative controls to estimate this aswhere and are the number of vaccinated or unvaccinated individuals born in year presenting with MAARI and testing negative for influenza in season . We converted to (i.e., to a covariate indexed by birth cohort) using the same method described in ‘Age-specific factors.’ We tested five different VE schemes: subtype-specific VE that remained constant across seasons and cohorts (two parameters), subtype-specific VE that varied between the age groups described above (18 parameters), VE that varied between seasons (12 parameters), VE for each possible imprinting subtype (six parameters), and birth-cohort-specific VE (18 parameters). These VE parameters () reduced the probability of medically attended influenzaA infection among vaccinated individuals in a birth cohort, i.e,where depends on the specific implementation of VE used.Constant VE only varies with the infecting subtype, thusSeason-specific VE varies with subtype and season, thusFor age-specific VE, we used the same age classes described above for ‘Age-specific factors’ but did not consider a reference age class, so that each age group had an associated VE for each subtype. We used these age-specific VE parameters to calculate the VE against subtype in birth cohort during season using the same procedure described in ‘Age-specific factors’ (Equation 9). Therefore,where and are age-specific VE parameters for and .For imprinting-specific VE, we used the imprinting probabilities for each birth cohort described in ‘Calculating imprinting probabilities’ to scale V such thatwhere is the VE among people imprinted to subtype against infection by dominant subtype , and is the imprinting probability for subtype in season for birth cohort .For birth-cohort-specific VE, we defined nine birth cohorts corresponding to the nine age groups we used for the 2017–2018 season: 1918–1952, 1953–1967, 1968–1977, 1978–1987, 1988–1997, 1998–2002, 2003–2007, 2008–2012, and 2013–2017. Let be the birth cohort of people born in year . Thenwhere is the VE among people in cohort against infection by dominant subtype .
Model likelihood
Recall that our aim is to predict , the fraction of all PCR-confirmed influenza cases of dominant subtype in influenza season among people born in year with vaccination status . These fractions can also be interpreted as multinomial parameters that describe the probability that in season , a medically attended influenza infection of subtype occurs among people born in year with vaccination status . Each model assumes that is proportional to a collection of model components described above (demography, age, imprinting, and vaccination). Thus,where is a multinomial probability under model , indicates whether model contains component , and is the mathematical expression for model component given , , , and (e.g., for HA subtype imprinting, ).To obtain proper multinomial probabilities, we calculated a normalizing constant for each season such that all probabilities in that season sum to 1. For convenience, let be the unnormalized multinomial probability for model . Then for a specific season , the normalized multinomial probability iswhere is the maximum birth year possible for a specific season .To calculate the likelihood of a given model, we used the multinomial probabilities and the observed birth year distribution of cases. Let be the number of PCR-confirmed cases of dominant subtype in influenza season among people born in year with vaccination status . The total number of PCR-confirmed cases of dominant subtype in season isFor models fitted to a restricted set of ages, we limited the cases for each season to the birth cohorts that were guaranteed to meet the age requirements in that season.Then, the likelihood of model in season is given by the multinomial likelihood,Finally, the full model likelihood for model over all observed seasons isWe fitted the model to case data using the L-BFGS-B algorithm implemented in the R package optimx. We estimated 95% confidence intervals for parameters of the best-fitting model by evaluating likelihood profiles at 14 evenly spaced points and interpolating the entire profile using a smoothing spline.
Results
The age distribution of cases varies between seasons and subtypes
The age distribution of cases varies between subtypes. The relative burden of cases is consistently higher in people ≥65 years old during H3N2-dominated seasons compared to H1N1-dominated seasons (Figure 2). The age distribution tends to vary more between subtypes than within either over time (Figure 2—figure supplement 1, off-diagonal quadrants). This is consistent with recent work showing that the ratios of H3N2 to H1N1 cases differ between age groups (Gagnon et al., 2018b).
Figure 2.
The age distribution of cases.
(A) The age distributions of cases from the 2007–2008 through the 2017–2018 influenza seasons in MESA. Dark lines with open circles indicate the average fraction of cases in each age group. Lighter-colored lines show the age distribution for individual seasons. (B) The age distribution of cases in H1N1-dominated seasons. (C) The age distribution of cases in H3N2-dominated seasons.
Seasons differ in their age distributions. The color intensity of each cell shows the observed G-test statistic, which measures how much the age distributions of two seasons differ from the null expectation that they are drawn from the same distribution (Materials and methods: ‘Calculating differences in the age distribution between seasons'.). The text in each cell shows the Bonferroni-corrected p-value for each G-test. The dominant subtype of each season is indicated by the label color.
(A). Each point shows the rank of an age group’s relative risk of infection during the first half compared to the second half of an epidemic period (x-axis) and the rank of the fraction of cases belonging to that age group in the same epidemic period (y-axis) (Appendix 1: ‘Correlation of relative risk and fraction of cases’). Points are colored by the dominant subtype of the season and x-axis values are offset to facilitate visualization. Points with the same x and y values overlap and are indicated by darker shading. (B). To account for potential undersampling of cases at the beginning and end of specific seasons, we simulated 1000 replicate epidemics (Appendix 1 : ‘Sensitivity to sampling effort’) and calculated the same correlation as in panel A. The range is indicated by a vertical line and the median by a square.
Each panel shows the relative risk of infection in the first versus the second half of an epidemic for different age groups in each season (Materials and methods: ‘Calculating relative risk’). Relative risk greater than 1 (indicated by the grey dashed line) means that an age group was more likely to be infected at during the first rather than second half of an epidemic. Age groups with no cases in the latter half of a season are indicated by asterisks and no bar. The dominant subtype of each subtype is indicated by the bar color. 95% binomial confidence intervals are indicated by grey vertical lines. Bars with asterisks over them indicate that the 95% confidence interval includes the scenario where all cases occur in the first half of the season.
Figure 2—figure supplement 1.
Statistical analysis of age distribution of cases.
Seasons differ in their age distributions. The color intensity of each cell shows the observed G-test statistic, which measures how much the age distributions of two seasons differ from the null expectation that they are drawn from the same distribution (Materials and methods: ‘Calculating differences in the age distribution between seasons'.). The text in each cell shows the Bonferroni-corrected p-value for each G-test. The dominant subtype of each season is indicated by the label color.
The age distribution of cases.
(A) The age distributions of cases from the 2007–2008 through the 2017–2018 influenza seasons in MESA. Dark lines with open circles indicate the average fraction of cases in each age group. Lighter-colored lines show the age distribution for individual seasons. (B) The age distribution of cases in H1N1-dominated seasons. (C) The age distribution of cases in H3N2-dominated seasons.
Statistical analysis of age distribution of cases.
Seasons differ in their age distributions. The color intensity of each cell shows the observed G-test statistic, which measures how much the age distributions of two seasons differ from the null expectation that they are drawn from the same distribution (Materials and methods: ‘Calculating differences in the age distribution between seasons'.). The text in each cell shows the Bonferroni-corrected p-value for each G-test. The dominant subtype of each season is indicated by the label color.
Correlation of relative risk and fraction of cases within an age group.
(A). Each point shows the rank of an age group’s relative risk of infection during the first half compared to the second half of an epidemic period (x-axis) and the rank of the fraction of cases belonging to that age group in the same epidemic period (y-axis) (Appendix 1: ‘Correlation of relative risk and fraction of cases’). Points are colored by the dominant subtype of the season and x-axis values are offset to facilitate visualization. Points with the same x and y values overlap and are indicated by darker shading. (B). To account for potential undersampling of cases at the beginning and end of specific seasons, we simulated 1000 replicate epidemics (Appendix 1 : ‘Sensitivity to sampling effort’) and calculated the same correlation as in panel A. The range is indicated by a vertical line and the median by a square.
Relative risk among different age groups across seasons.
Each panel shows the relative risk of infection in the first versus the second half of an epidemic for different age groups in each season (Materials and methods: ‘Calculating relative risk’). Relative risk greater than 1 (indicated by the grey dashed line) means that an age group was more likely to be infected at during the first rather than second half of an epidemic. Age groups with no cases in the latter half of a season are indicated by asterisks and no bar. The dominant subtype of each subtype is indicated by the bar color. 95% binomial confidence intervals are indicated by grey vertical lines. Bars with asterisks over them indicate that the 95% confidence interval includes the scenario where all cases occur in the first half of the season.The age distribution also varies within subtypes over time (Figure 2—figure supplement 1, diagonal quadrants). The seven H3N2-dominated seasons display three types of age distributions (Figure 2—figure supplement 1, clusters of lighter-colored cells in the upper left-hand quadrant), and two correspond to major antigenic clusters (2007–2008, Fonville et al., 2016; 2010–2012, Ann et al., 2012). These differences sometimes coincide with significant shifts in the age distribution between seasons. For instance, the highest fraction of H3N2 cases occurs in 20–29 year olds in the 2007–2008 season, but this age group has the lowest fraction of cases in the next H3N2-dominated season (2010–2011, Figure 2C). In H1N1, the shift from seasonal to pandemic strains is associated with large changes in the age distribution (Figure 2—figure supplement 1, lower right-hand quadrant).We found further evidence that age groups differed in their susceptibility across seasons by examining the relative risk of infection during the first versus second half of each epidemic period (Materials and methods: ‘Calculating relative risk’). Individuals at greater risk of infection should be infected disproportionately early rather than late in an epidemic (Worby et al., 2015). We confirmed that an age group’s relative risk correlates with the fraction of cases within that age group in the same season (Pearson’s r = 0.58, 95% CI 0.38–0.73; Figure 2—figure supplement 2A; Appendix 1: ‘Correlation of relative risk and fraction of cases’). This trend is evident for H1N1 (Pearson’s r = 0.73, 95% CI 0.45–0.88; Figure 2—figure supplement 2A) and H3N2 seasons separately (Pearson’s r = 0.52, 95% CI 0.30–0.69; Figure 2—figure supplement 2A). The positive correlation in all seasons is robust to undersampling of cases at the start and end of seasons (Appendix 1: ‘Sensitivity to sampling effort’, Figure 2—figure supplement 2B). This provides supporting evidence that the different numbers of cases in each age group reflect underlying differences in infection risk.
Figure 2—figure supplement 2.
Correlation of relative risk and fraction of cases within an age group.
(A). Each point shows the rank of an age group’s relative risk of infection during the first half compared to the second half of an epidemic period (x-axis) and the rank of the fraction of cases belonging to that age group in the same epidemic period (y-axis) (Appendix 1: ‘Correlation of relative risk and fraction of cases’). Points are colored by the dominant subtype of the season and x-axis values are offset to facilitate visualization. Points with the same x and y values overlap and are indicated by darker shading. (B). To account for potential undersampling of cases at the beginning and end of specific seasons, we simulated 1000 replicate epidemics (Appendix 1 : ‘Sensitivity to sampling effort’) and calculated the same correlation as in panel A. The range is indicated by a vertical line and the median by a square.
Just as the age distribution of cases varies over time, the age groups with high relative risks of infection change over time. If people contact one another similarly from one season to the next, these shifting relative risks imply that age groups’ relative susceptibilities change over time. For instance, 5–17 year olds had the highest relative risk of early infection in the 2008–2009 season, whereas 50–64 year-olds had the highest relative risk in the 2013–2014 season (Figure 2—figure supplement 3). Relative risks in MESA vary more than national estimates, which show that 5–17 year-olds had the highest relative risk in all but one season from the 2009 pandemic to 2013–2014 (Worby et al., 2015). These differences may partly be due to the fact that our measurements of relative risk use outpatient visits, whereas the national estimates use hospitalizations.
Figure 2—figure supplement 3.
Relative risk among different age groups across seasons.
Each panel shows the relative risk of infection in the first versus the second half of an epidemic for different age groups in each season (Materials and methods: ‘Calculating relative risk’). Relative risk greater than 1 (indicated by the grey dashed line) means that an age group was more likely to be infected at during the first rather than second half of an epidemic. Age groups with no cases in the latter half of a season are indicated by asterisks and no bar. The dominant subtype of each subtype is indicated by the bar color. 95% binomial confidence intervals are indicated by grey vertical lines. Bars with asterisks over them indicate that the 95% confidence interval includes the scenario where all cases occur in the first half of the season.
Taken together, these findings suggest that the risk of influenza infection is not a simple function of age alone. Other factors, such as past influenza infections and vaccination, might explain the changing age distributions of cases in time.
Imprinting probabilities of age groups change over time
We hypothesized that variation in the age distribution of cases could be explained by the aging of birth cohorts with similar early exposure histories. This would cause the early exposure history of an age group, and thus potentially its susceptibility, to change in time. To calculate the probability that people in a particular age group had their first influenzaA infection with a particular subtype, we adapted the approach from Gostic et al., 2016. Briefly, we calculated the probability that an individual born in a specific year had a primary infection with H1N1, H2N2, or H3N2 using data on relative epidemic sizes and the frequencies of circulating subtypes (Figure 3—figure supplement 1; Materials and methods: ‘Calculating imprinting probabilities’).As expected, age groups’ early exposures are not static and change over time (Figure 3). Older people nonetheless tend to be imprinted to H1N1 or H2N2, whereas younger people have higher probabilities of imprinting to H3N2. The effects of the 2009 H1N1 pandemic are evident in the three youngest age groups as a transient increase (from 2009 to approximately 2013) in their H1N1 imprinting probability. These imprinting probabilities are relatively well-constrained even after for accounting for uncertainty in epidemic size (Figure 3—figure supplement 2; Appendix 1: ‘Sensitivity to uncertainty in ILI and the frequency of influenza A’).
Figure 3.
Imprinting probabilities by age group across seasons.
Each panel shows the imprinting probabilities of an age group from the 2007–2008 season through the 2017–2018 season. The color of each bar corresponds to the imprinting subtype or naive individuals, who have not yet been infected.
The intensity (top panel) and subtype frequencies (bottom panel) of influenza A seasons in the United States. Intensity is measured as the product of influenza-like illness (ILI) and the fraction of respiratory specimens testing positive for influenza A in national surveillance data (Appendix 1: ‘Seasonal intensity’). This is normalized to the average intensity value between 1977 and 2017–2018. Seasons before 1977 where United States ILI surveillance data are unavailable are assumed to have an intensity score of 1 (i.e., the average score over all other seasons). Subtype frequencies were obtained from national surveillance data before the 2007–2008 season and directly from the MESA studies afterwards.
Uncertainty in ILI and the frequency of A have a small impact on imprinting probabilities. We simulated 10000 datasets to represent the range of possible epidemic sizes for seasons where we did not have data on either ILI or the frequency of influenza A (Appendix 1: ‘Sensitivity to uncertainty in ILI and the frequency of influenza A’). The vertical dashed line shows the point at which data on ILI and the frequency of influenza A are available while the vertical dotted line shows the point at which data on only the frequency of A is available. The median imprinting probabilities for those simulations is shown as a solid line with the maximum and minimum imprinting probabilities shown by the bounds of the shaded area.
Figure 3—figure supplement 2.
Imprinting probabilities with random sampling of seasonal intensity.
Uncertainty in ILI and the frequency of A have a small impact on imprinting probabilities. We simulated 10000 datasets to represent the range of possible epidemic sizes for seasons where we did not have data on either ILI or the frequency of influenza A (Appendix 1: ‘Sensitivity to uncertainty in ILI and the frequency of influenza A’). The vertical dashed line shows the point at which data on ILI and the frequency of influenza A are available while the vertical dotted line shows the point at which data on only the frequency of A is available. The median imprinting probabilities for those simulations is shown as a solid line with the maximum and minimum imprinting probabilities shown by the bounds of the shaded area.
Imprinting probabilities by age group across seasons.
Each panel shows the imprinting probabilities of an age group from the 2007–2008 season through the 2017–2018 season. The color of each bar corresponds to the imprinting subtype or naive individuals, who have not yet been infected.
Intensity and subtype frequencies of influenza A.
The intensity (top panel) and subtype frequencies (bottom panel) of influenza A seasons in the United States. Intensity is measured as the product of influenza-like illness (ILI) and the fraction of respiratory specimens testing positive for influenza A in national surveillance data (Appendix 1: ‘Seasonal intensity’). This is normalized to the average intensity value between 1977 and 2017–2018. Seasons before 1977 where United States ILI surveillance data are unavailable are assumed to have an intensity score of 1 (i.e., the average score over all other seasons). Subtype frequencies were obtained from national surveillance data before the 2007–2008 season and directly from the MESA studies afterwards.
Imprinting probabilities with random sampling of seasonal intensity.
Uncertainty in ILI and the frequency of A have a small impact on imprinting probabilities. We simulated 10000 datasets to represent the range of possible epidemic sizes for seasons where we did not have data on either ILI or the frequency of influenza A (Appendix 1: ‘Sensitivity to uncertainty in ILI and the frequency of influenza A’). The vertical dashed line shows the point at which data on ILI and the frequency of influenza A are available while the vertical dotted line shows the point at which data on only the frequency of A is available. The median imprinting probabilities for those simulations is shown as a solid line with the maximum and minimum imprinting probabilities shown by the bounds of the shaded area.
Age-specific differences in medically attended influenza A infection risk affect epidemic patterns
We fitted models to estimate the underlying effects of age, early infections, and vaccination on the age distributions of cases. As expected, the cases reveal age-specific differences in the risk of medically attended influenzaA infection (Figure 4; Figure 4—figure supplement 1; Appendix 2—table 1). This risk is roughly threefold higher among children <4 years old compared to adults 20–29 years old, after adjusting for other effects (Figure 4). The decline in risk through middle age is generally consistent with attack rates estimated from serology (Monto et al., 1985; Bodewes et al., 2011; Wu et al., 2010; Huang et al., 2019) and clinical infections (Wu et al., 2017). We recently observed smaller differences in the attack rates of school-aged children and their parents when estimating infections serologically (Ranjeva et al., 2019). We hypothesize that the attack rates estimated from clinical infections might show larger differences by age due to age-related changes in infection severity and healthcare-seeking behavior. Indeed, rates of healthcare-seeking behavior have been shown to decline with age before rising in adults ≥65 years old (Biggerstaff et al., 2014; Brooks-Pollock et al., 2011; Van Cauteren et al., 2012), consistent with our results. Finally, the increased risk of medically attended influenzaA infection among people ≥65 years old compared to other adults may be related to the increasing prevalence of high-risk medical conditions with age (Figure 1—figure supplement 4).
Figure 4.
Estimates of relative age-specific medically attended influenza infection risk.
Open circles represent the maximum likelihood estimates of parameters describing age-specific differences in the relative risk of medically attended influenza A infection. Lines show the 95% confidence interval.
The best-fitting model includes age-specific risk of medically attended influenza A infection, HA subtype imprinting, and age-specific VE. The 11 main models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
Figure 4—figure supplement 1.
Ranking of models fitted to all ages.
The best-fitting model includes age-specific risk of medically attended influenza A infection, HA subtype imprinting, and age-specific VE. The 11 main models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
Appendix 2—table 1.
Estimates of parameters shared by the age-specific VE and birth-cohort-specific VE models.
Model with age-specific VE, age ≥6 months (MLE, 95% CI)
Model with age-specific VE, age ≥15 years (MLE, 95% CI)
Model with age-specific VE, age < 65 years (MLE, 95% CI)
Model with age-specific VE, age 15–64 years (MLE, 95% CI)
Model with birth-cohort-specific VE, age ≥15 years (MLE, 95% CI)
Imprinting protection (%)
H1
66 (53, 77)
48 (25, 66)
64 (47, 77)
43 (11, 66)
49 (24, 67)
H3
33 (17, 46)
41 (20, 56)
34 (18, 47)
36 (13, 52)
41 (20, 56)
N2
0 (0, 7)
0 (0, 11)
0 (0, 8)
0 (0, 10)
0 (0, 11)
Age-specific risk of medically attended influenza A infection
0–4 years
3.0 (2.5, 3.6)
N.A.
3.0 (2.5, 3.6)
N.A.
N.A.
5–9 years
2.6 (2.2, 3.0)
N.A.
2.5 (2.2, 3.0)
N.A.
N.A.
10–14 years
1.7 (1.4, 2.0)
N.A.
1.7 (1.4, 2.0)
N.A.
N.A.
15–19 years
1.2 (1.0, 1.5)
1.2 (1.0, 1.5)
1.2 (1.0, 1.5)
1.2 (1.0, 1.5)
1.2 (1.0, 1.5)
30–39 years
1.1 (0.9, 1.3)
1.1 (0.9, 1.3)
1.1 (0.9, 1.3)
1.1 (0.9, 1.3)
1.1 (0.9, 1.3)
40–49 years
0.9 (0.7, 1.1)
0.9 (0.8, 1.1)
0.9 (0.7, 1.1)
0.9 (0.8, 1.1)
0.9 (0.8, 1.1)
50–64 years
1.0 (0.8, 1.3)
1.0 (0.8, 1.2)
1.0 (0.8, 1.3)
1.0 (0.8, 1.2)
0.9 (0.7, 1.1)
65+ years
1.6 (1.2, 2.1)
1.4 (1.0, 1.9)
N.A
N.A.
1.5 (1.1, 1.9)
Estimates of relative age-specific medically attended influenza infection risk.
Open circles represent the maximum likelihood estimates of parameters describing age-specific differences in the relative risk of medically attended influenzaA infection. Lines show the 95% confidence interval.
Ranking of models fitted to all ages.
The best-fitting model includes age-specific risk of medically attended influenzaA infection, HA subtype imprinting, and age-specific VE. The 11 main models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
Initial infection confers long-lasting, subtype-specific protection against future infection
Our best-fitting model supports subtype-specific imprinting for H1N1 and H3N2 (Figure 5, top row; Appendix 2—table 1). This model also provides the best predictive power compared to other models in a leave-one-out cross-validation analysis (Figure 5—figure supplement 1; Figure 5—figure supplement 2; Appendix 1: ‘Evaluation of predictive power’). The risk of future medically attended infection by H1N1 is reduced by 66% (95% CI 53–77%) in people imprinted to H1N1, whereas the risk of future medically attended infection by H3N2 is reduced by 33% (95% CI 17–46%) in people imprinted to H3N2. We found no evidence of a protective effect from imprinting to N2 (0%, 95% CI 0–7%). These estimates of imprinting protection are insensitive to:
Figure 5.
Estimates of imprinting strength.
Imprinting is more protective against H1N1 infection than H3N2 infection. Open circles represent the maximum likelihood estimates of imprinting parameters from the model including HA subtype imprinting and age-specific VE fitted to the indicated age group (y-axis). Black lines show 95% confidence intervals.
The model which best-predicts excluded seasons includes HA subtype imprinting and age-specific VE. Models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their MSE in predicting excluded seasons (Appendix 1: ‘Evaluation of predictive power’).
Each panel shows the number of observed and predicted cases by birth year among unvaccinated (A) and vaccinated (B) study participants. Predictions and 95% prediction intervals were generated by fitting the model including age-specific risk of medically attended influenza A infection, HA subtype imprinting, and age-specific VE fitted to all seasons except the season in the panel (Appendix 1: ‘Evaluation of predictive power’).
Each panel shows the number of cases per sampling day (green circles). We extrapolated cases at the start and end of the season (orange dashed line) if the observed number of cases per day exceeded 1 (black line) at the start and end of that season (Appendix 1: ‘Sensitivity to sampling effort’).
We fitted the model including HA subtype imprinting and age-specific VE to simulated cases in seasons where the enrollment period does not fully overlap the epidemic period and recorded the maximum likelihood estimates for H1N1 and H3N2 imprinting protection (Appendix 1: ‘Sensitivity to sampling effort’). The distributions of these values are shown as violin plots and the medians are shown as squares. Estimates of imprinting protection from the best-fitting model without simulated data with a 95% confidence interval are shown as circles with error bars.
We tested whether excess cases in each birth cohort were negatively correlated with excess cases in the same birth cohort in the next season of the same subtype (Appendix 1: ‘Calculating excess cases’). We find a weak positive correlation for cases of H1N1 (Spearman’s =0.12, 95% CI 0.02–0.22) and H3N2 (Spearman’s =0.05, 95% CI −0.03–0.14).
Figure 5—figure supplement 1.
Ranking of models by predictive power.
The model which best-predicts excluded seasons includes HA subtype imprinting and age-specific VE. Models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their MSE in predicting excluded seasons (Appendix 1: ‘Evaluation of predictive power’).
Figure 5—figure supplement 2.
Model performance on excluded seasons.
Each panel shows the number of observed and predicted cases by birth year among unvaccinated (A) and vaccinated (B) study participants. Predictions and 95% prediction intervals were generated by fitting the model including age-specific risk of medically attended influenza A infection, HA subtype imprinting, and age-specific VE fitted to all seasons except the season in the panel (Appendix 1: ‘Evaluation of predictive power’).
uncertainty in imprinting probabilities due to uncertainty in past epidemic sizes (Figure 3—figure supplement 2; Appendix 1: ‘Sensitivity to uncertainty in ILI and the frequency of influenza A’; Appendix 2—table 3),
Appendix 2—table 3.
Estimates of imprinting protection fitted to datasets representing upper and lower bounds of imprinting probabilities.
Dataset
Best-fitting model
H1 imprinting protection (%, 95% CI)
H3 imprinting protection (%, 95% CI)
Lower bound
Demography, age, HA imprinting, age-specific VE
72 (57, 84)
32 (17, 44)
Upper bound
Demography, age, HA imprinting, age-specific VE
61 (48, 72)
37 (20, 51)
choice of age groups for medically attended influenzaA infection risk and VE (Appendix 1: ‘Sensitivity to age groups’; Appendix 2—table 4), and
Appendix 2—table 4.
Estimates of imprinting protection for models with different age groups.
Age groups (years)
Best-fitting model
H1 imprinting protection (%, 95% CI)
H3 imprinting protection (%, 95% CI)
0–4, 5–17, 18–64, 65+
Demography, age, HA imprinting, age-specific VE
56 (40, 68)
36 (25, 46)
0–8, 9–17, 18–49, 50–64, 65+
Demography, age, HA imprinting, age-specific VE
62 (47, 74)
35 (21, 48)
undersampling of influenza cases in some seasons (Figure 5—figure supplement 3).
Figure 5—figure supplement 3.
Cases per sampling day.
Each panel shows the number of cases per sampling day (green circles). We extrapolated cases at the start and end of the season (orange dashed line) if the observed number of cases per day exceeded 1 (black line) at the start and end of that season (Appendix 1: ‘Sensitivity to sampling effort’).
Estimates of imprinting strength.
Imprinting is more protective against H1N1infection than H3N2infection. Open circles represent the maximum likelihood estimates of imprinting parameters from the model including HA subtype imprinting and age-specific VE fitted to the indicated age group (y-axis). Black lines show 95% confidence intervals.
Ranking of models by predictive power.
The model which best-predicts excluded seasons includes HA subtype imprinting and age-specific VE. Models are shown as rows with colored squares indicating whether that model included parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e., the CI includes 0). Models are sorted by their MSE in predicting excluded seasons (Appendix 1: ‘Evaluation of predictive power’).
Model performance on excluded seasons.
Each panel shows the number of observed and predicted cases by birth year among unvaccinated (A) and vaccinated (B) study participants. Predictions and 95% prediction intervals were generated by fitting the model including age-specific risk of medically attended influenzaA infection, HA subtype imprinting, and age-specific VE fitted to all seasons except the season in the panel (Appendix 1: ‘Evaluation of predictive power’).
Cases per sampling day.
Each panel shows the number of cases per sampling day (green circles). We extrapolated cases at the start and end of the season (orange dashed line) if the observed number of cases per day exceeded 1 (black line) at the start and end of that season (Appendix 1: ‘Sensitivity to sampling effort’).
Estimates of imprinting protection with added simulated cases.
We fitted the model including HA subtype imprinting and age-specific VE to simulated cases in seasons where the enrollment period does not fully overlap the epidemic period and recorded the maximum likelihood estimates for H1N1 and H3N2 imprinting protection (Appendix 1: ‘Sensitivity to sampling effort’). The distributions of these values are shown as violin plots and the medians are shown as squares. Estimates of imprinting protection from the best-fitting model without simulated data with a 95% confidence interval are shown as circles with error bars.
Correlation of excess cases between seasons.
We tested whether excess cases in each birth cohort were negatively correlated with excess cases in the same birth cohort in the next season of the same subtype (Appendix 1: ‘Calculating excess cases’). We find a weak positive correlation for cases of H1N1 (Spearman’s =0.12, 95% CI 0.02–0.22) and H3N2 (Spearman’s =0.05, 95% CI −0.03–0.14).In theory, the estimated protective effects of imprinting could be influenced by cross-protection rather than the impact of first infection per se. Because first infections are also recent infections in children, we reasoned that the observed imprinting effects might arise from confounding with recent infections in these ages. Based on an estimated 7 year half-life of homologous protection after H1N1pdm infection in children (Ranjeva et al., 2019) and the fact that most children experience primary influenzaA infection by 5 years of age (Bodewes et al., 2011), we reasoned that excluding children <15 years old would diminish the impact of protection from recent infection on our results. When we excluded the youngest age groups, our estimates of H1N1 imprinting protection decreased while H3N2 imprinting protection increased (Figure 5, second row). However, initial infection by H1N1 was still more protective than initial infection by H3N2, both imprinting effects remained positive, and there was no significant change in the values of other estimated parameters (Appendix 2—table 1 and Appendix 2—table 2).
Appendix 2—table 2.
Estimates of age-specific VE parameters in models fitted to different age groups.
Model with age-specific VE, age ≥6 months (MLE, 95% CI)
Model with age-specific VE, age ≥15 years (MLE, 95% CI)
Model with age-specific VE, age < 65 years (MLE, 95% CI)
Model with age-specific VE, age 15–64 years (MLE, 95% CI)
Age-specific VE against H1N1 (%)
0–4 years
69 (56, 84)
N.A.
68 (55, 83)
N.A.
5–9 years
26 (0, 48)
N.A.
24 (0, 47)
N.A.
10–14 years
92 (80, 96)
N.A.
92 (80, 96)
N.A.
15–19 years
86 (62, 95)
89 (66, 97)
86 (61, 95)
89 (65, 97)
20–29 years
84 (65, 91)
86 (69, 91)
83 (63, 90)
85 (67, 91)
30–39 years
8 (0, 37)
22 (0, 47)
5 (0, 35)
19 (0, 45)
40–49 years
18 (0, 45)
28 (0, 47)
14 (0, 42)
24 (0, 49)
50–64 years
32 (7, 51)
39 (16, 56)
28 (2, 48)
37 (14, 55)
65+ years
50 (16, 71)
64 (39, 83)
N.A.
N.A.
Age-specific VE against H3N2 (%)
0–4 years
58 (48, 67)
N.A.
58 (48, 67)
N.A.
5–9 years
45 (31, 58)
N.A.
45 (30, 57)
N.A.
10–14 years
23 (0, 41)
N.A.
22 (0, 41)
N.A.
15–19 years
31 (3, 53)
33 (4, 55)
30 (2, 53)
32 (1, 54)
20–29 years
34 (11, 51)
37 (15, 53)
33 (11, 51)
36 (14, 53)
30–39 years
10 (0, 31)
15 (0, 35)
9 (0, 30)
12 (0, 33)
40–49 years
36 (15, 52)
42 (24, 57)
36 (15, 52)
42 (23, 57)
50–64 years
47 (35, 56)
49 (37, 58)
47 (35, 57)
48 (36, 58)
65+ years
41 (24, 54)
38 (20, 52)
N.A.
N.A.
The effects of recent infection should also manifest in the difference between the observed and estimated numbers of cases (i.e., the excess cases, Appendix 1: ‘Calculating excess cases’), since unlike typical transmission models, our model does not take prior-season infections into account when estimating cases for the current season. More infections in a birth cohort in one season should reduce susceptibility in that birth cohort at the start of the next season. We thus expect that excess cases in one season will be followed by missing cases in the next season dominated by that subtype (i.e., a negative correlation in excess cases). Instead, we observed that excess cases for each birth cohort are weakly positively correlated from season to season, suggesting that immunity from recent infections is not a major driver of temporal variation in the age distribution of cases (Figure 5—figure supplement 5).
Figure 5—figure supplement 5.
Correlation of excess cases between seasons.
We tested whether excess cases in each birth cohort were negatively correlated with excess cases in the same birth cohort in the next season of the same subtype (Appendix 1: ‘Calculating excess cases’). We find a weak positive correlation for cases of H1N1 (Spearman’s =0.12, 95% CI 0.02–0.22) and H3N2 (Spearman’s =0.05, 95% CI −0.03–0.14).
Since older adults have the highest probability of primary infection with H1N1, we also reasoned that older adults might disproportionately drive the strong protection from H1N1 imprinting we observe. People born before 1947 were likely exposed to H1N1 strains that are antigenically similar to the post-pandemic H1N1 strains that comprise most of our H1N1infection data (Manicassamy et al., 2010; O'Donnell et al., 2012), creating the possibility that strain-specific cross-immunity drives the pattern we attribute to subtype-specific imprinting. These people nearly all fall into the ≥65 year-old age group in the study period. The study also underenrolled medically attended infections among people in nursing facilities, which would artificially lower the case count in this age group and may affect estimates of imprinting protection. Therefore, we excluded adults ≥65 years old and refitted our models. Excluding the oldest adults does not significantly change estimated imprinting protection or other parameters (Appendix 2—table 1 and Appendix 2—table 2).When we exclude both the youngest and oldest age groups, initial infections by H1N1 and H3N2 have similar protective effects (Figure 5, bottom row). This shows that the combined effects of cross-protection in both the youngest and oldest individuals contribute to the signal of imprinting protection we observe, but they are not its sole drivers.
VE varies by birth cohort in older children and adults
The best-fitting model includes age-specific VE (Figure 4—figure supplement 1; Appendix 2—table 2). While serological responses to influenza vaccination are weakest in the young (Englund et al., 2005; Neuzil et al., 2006) and old (Lee et al., 2018; DiazGranados et al., 2014), it is unclear what age-related factors would drive variation in VE in other age groups. We hypothesized that VE in these ages varies with early exposure history, which correlates with birth year, rather than age.To test this hypothesis, we fitted a model with birth-cohort-specific VE to the cases, excluding either children <15 years old or adults ≥65 years old. We chose birth cohorts that corresponded to the age groups of the original model in 2017–2018 (Materials and methods: ‘Vaccination’), keeping the number of parameters the same (e.g., VE in the 20–29 age group became VE in the 1988–1997 birth year cohort). We find that age-specific VE still outperforms all other models after we exclude the oldest age group (≥65 years old). In contrast, birth-cohort-specific VE performs better when we exclude children <15 years old (Figure 6—figure supplement 1). Estimates of imprinting protection and age-specific risk of medically attended influenza in the birth-cohort-specific VE models are not significantly different from estimates from the best-fitting model fitted to all ages (Appendix 2—table 1). Taken together, these results suggest that birth-cohort-specific VE best explains the case distribution in older children and adults, who have likely experienced their first influenza infection, whereas age-specific VE best explains cases in younger children, who have less influenza exposure.
Figure 6—figure supplement 1.
Ranking of models fitted to people ≥15 years old.
A model including age-specific risk of medically attended influenza A infection, HA subtype imprinting, and birth-cohort-specific VE best fits cases of people ≥15 years old. The 11 main models are shown as rows with colored squares indicating whether that model uses parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e, the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
VE differs between birth cohorts that have similar imprinting by subtype (Figure 6; Appendix 2—table 5). For example, the 1968–1977 and 1988–1997 cohorts have similar probabilities of primary exposure to H1N1 and H3N2, but they differ substantially in their VE to both subtypes (Figure 6). The 1988–1997 and 1998–2002 cohorts also have similar probabilities of primary exposure to each subtype and have similar H1N1 VEs, but have significantly different H3N2 VEs (Figure 6). Antigenic differences within each subtype might explain this variation.
Figure 6.
Estimates of birth-cohort-specific VE.
Birth-cohort-specific VE differs significantly between subtypes and birth cohorts. The location of each pie chart represents the H3N2 (x-axis) and H1N1 (y-axis) VE estimates for a birth cohort (indicated by text) obtained from our model fitted to people ≥15 years old. Pie charts are colored by the probability of first infection by each subtype (i.e, imprinting probability). 95% confidence intervals of the VE estimates are indicated by light grey solid lines. The dashed grey line shows the diagonal where the VE estimate for H1N1 is equal to the VE estimate for H3N2.
A model including age-specific risk of medically attended influenza A infection, HA subtype imprinting, and birth-cohort-specific VE best fits cases of people ≥15 years old. The 11 main models are shown as rows with colored squares indicating whether that model uses parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e, the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
The birth-cohort-specific VE model predicts observed cases better than the age-specific VE model for people ≥15 years old. Bars show the excess cases in vaccinated individuals relative to the birth-cohort-specific VE model (dark colors) and the age-specific VE model (light colors) for age groups ≥15 years old. Colors indicate the dominant subtype of a given season. 95% prediction intervals are shown as grey error bars.
Appendix 2—table 5.
Estimates for VE from model with birth-cohort-specific VE fitted to people ≥15 years old.
Birth cohort
H1N1 VE (%, MLE, 95% CI)
H3N2 VE (%, MLE, 95% CI)
1998–2002
100 (22, 100)
0 (0, 36)
1988–1997
89 (74, 93)
62 (45, 76)
1978–1987
59 (35, 76)
17 (0, 35)
1968–1977
23 (0, 47)
25 (2, 44)
1953–1967
28 (4, 46)
43 (32, 53)
1918–1952
61 (38, 76)
45 (32, 55)
Estimates of birth-cohort-specific VE.
Birth-cohort-specific VE differs significantly between subtypes and birth cohorts. The location of each pie chart represents the H3N2 (x-axis) and H1N1 (y-axis) VE estimates for a birth cohort (indicated by text) obtained from our model fitted to people ≥15 years old. Pie charts are colored by the probability of first infection by each subtype (i.e, imprinting probability). 95% confidence intervals of the VE estimates are indicated by light grey solid lines. The dashed grey line shows the diagonal where the VE estimate for H1N1 is equal to the VE estimate for H3N2.
Ranking of models fitted to people ≥15 years old.
A model including age-specific risk of medically attended influenzaA infection, HA subtype imprinting, and birth-cohort-specific VE best fits cases of people ≥15 years old. The 11 main models are shown as rows with colored squares indicating whether that model uses parameters indicated by the columns. Orange squares indicate covariates that were not estimated. Light green squares mean that a given estimated parameter was supported. Dark green squares mean that the model did not support the inclusion of the parameters indicated by the column (i.e, the CI includes 0). Models are sorted by their cAIC relative to the best-fitting model.
Excess cases for models using birth-cohort-specific VE and age-specific VE.
The birth-cohort-specific VE model predicts observed cases better than the age-specific VE model for people ≥15 years old. Bars show the excess cases in vaccinated individuals relative to the birth-cohort-specific VE model (dark colors) and the age-specific VE model (light colors) for age groups ≥15 years old. Colors indicate the dominant subtype of a given season. 95% prediction intervals are shown as grey error bars.
Discrepancies partly explained by antigenic evolution
The best-fitting model accurately reproduces the age distributions of vaccinated and unvaccinated cases of each subtype, aggregated across seasons (Figure 7A). The only exception is that it underestimates aggregate H1N1 cases in unvaccinated 5–9 year-olds. By examining the differences between predicted and observed cases for each season, we see that this is largely driven by infection during the 2009 H1N1 pandemic (Figure 7B). Such a large antigenic change may have negated any protection from previous infection in 5–9 year-olds and made them particularly susceptible to pandemic infection.
Figure 7.
Model predictions compared to observed case counts.
(A) The model including age-specific VE and subtype-specific HA imprinting accurately predicts the overall age distribution of cases across seasons and age groups. Each row depicts the age distribution of cases among unvaccinated (top) and vaccinated (bottom) individuals over all sampled seasons (2007–2008 through 2017–2018). Each column indicates H1N1 cases (left, blue) and H3N2 cases (right, red). Open circles represent observed cases, solid lines represent the predicted number of cases from the best-fitting model, the shaded area represents the 95% prediction interval of the best-fitting model. (B) Excess cases of dominant subtype for each season. Excess cases are defined as the predicted number of cases from the best-fitting model - observed cases (Appendix 1: ‘Calculating excess cases’). Each panel shows the excess cases of the dominant subtype for each season for each age group among unvaccinated (dark bars) and vaccinated (light bars) individuals. Grey error bars show the 95% prediction interval.
Model predictions compared to observed case counts.
(A) The model including age-specific VE and subtype-specific HA imprinting accurately predicts the overall age distribution of cases across seasons and age groups. Each row depicts the age distribution of cases among unvaccinated (top) and vaccinated (bottom) individuals over all sampled seasons (2007–2008 through 2017–2018). Each column indicates H1N1 cases (left, blue) and H3N2 cases (right, red). Open circles represent observed cases, solid lines represent the predicted number of cases from the best-fitting model, the shaded area represents the 95% prediction interval of the best-fitting model. (B) Excess cases of dominant subtype for each season. Excess cases are defined as the predicted number of cases from the best-fitting model - observed cases (Appendix 1: ‘Calculating excess cases’). Each panel shows the excess cases of the dominant subtype for each season for each age group among unvaccinated (dark bars) and vaccinated (light bars) individuals. Grey error bars show the 95% prediction interval.The model underestimates cases in unvaccinated individuals who were 30–39 years old and over 50 years old in the 2013–2014 season (Figure 7B), as indicated by the many excess cases in these age groups in that season. This is further evidence that subtype-specific imprinting cannot explain all age variation. As mentioned before, this season provided one of the first examples that original antigenic sin could affect protection: middle-aged adults had been targeting a familiar site on the pandemic strain that then mutated, rendering them susceptible. Other age groups were effectively blind to these changes, owing to their different exposure histories (Linderman et al., 2014; Huang et al., 2015; Arriola et al., 2014; Dávila et al., 2014; Petrie et al., 2016).
Discussion
The distribution of influenza cases by birth year is consistent with subtype-level imprinting, whereby initial infection with a subtype protects against future medically attended infections by the same subtype. The stronger protective effect observed from primary H1N1infection compared to primary H3N2infection may be caused by stronger cross-protective responses to conserved epitopes in the more slowly evolving H1N1 (Bedford et al., 2015). This is in line with previous work showing that protection conferred by H1N1infection lasts longer than protection conferred by H3N2infection (Ranjeva et al., 2019). Another recent study found stronger imprinting protection from primary H1N1 compared to primary H3N2infection (Gostic et al., 2019). Subtype-specific protection observed in seasonal influenza is narrower than the previously reported HA-group-level imprinting protection against avian influenza (Gostic et al., 2016), but in both cases, the protection correlates strongly with primary infection rather than any prior exposure.Examining cases of seasonal influenza over a 20 year period in Arizona, Gostic et al., 2019 find evidence of imprinting protection not only from HA but also NA, which we do not. We speculate that this discrepancy may be due to increasing vaccination coverage over time in middle-aged adults. During the period of the Arizona study (1993–1994 through 2014–2015), vaccination coverage in U.S. adults increased most rapidly in this age group (NHIS, 2009), which corresponds to the H2N2-imprinted cohorts near the end of the study. Without adjustment for vaccination, the apparently increased protection in the middle aged might resemble N2 imprinting. Accounting for vaccination in the MESA population, including the relatively stable vaccination coverage in each age group over time (Figure 1—figure supplement 3), suggests imprinting protection is driven by HA.In contrast to the clear role of the imprinting subtype in protection against medically attended infection, the model implicates the imprinting strain or other attributes of early exposure history in VE. We expect that people born around the same time were likely exposed to similar strains, not just subtypes, of influenza A early in life, and our results support the idea that biases in immune memory from these early exposures (i.e., original antigenic sin; Davenport and Hennessy, 1957; Francis, 1960; Fazekas de St Groth and Webster, 1966) influence VE. Specifically, we observe that our model is consistent with previous suggestions of birth-cohort-specific VE. The model with birth-cohort-specific VE better estimates cases in vaccinated 50–64 year-olds (born 1953–1967) in the 2015–2016 season than the model with age-specific VE, as indicated by the fewer excess cases predicted in that age group and an improved fit of 1.1 log-likelihood units (Figure 6—figure supplement 2; Appendix 1: ‘Calculating excess cases’). Reduced VE in this group during the 2015–2016 season has been attributed to the exacerbation of antigenic mismatch by the vaccine in adults whose antibody responses were focused on a non-protective site (Skowronski et al., 2017b; Flannery et al., 2018). The improved performance of birth-cohort-specific VE relative to age-specific VE suggests other seasons and age groups where original antigenic sin might have influenced VE, such as 20–29 year-olds in the 2007–2008 influenza season.
Figure 6—figure supplement 2.
Excess cases for models using birth-cohort-specific VE and age-specific VE.
The birth-cohort-specific VE model predicts observed cases better than the age-specific VE model for people ≥15 years old. Bars show the excess cases in vaccinated individuals relative to the birth-cohort-specific VE model (dark colors) and the age-specific VE model (light colors) for age groups ≥15 years old. Colors indicate the dominant subtype of a given season. 95% prediction intervals are shown as grey error bars.
Although seasonal estimates of VE routinely stratify by age, shifts in VE from one season to the next might thus be easier to interpret in light of infection history (e.g., Skowronski et al., 2017b; Flannery et al., 2018). The results suggest this effect may be subtle, i.e, influenced by strains’ specific identities rather than merely their subtype. Our model cannot distinguish between the possibility that the precise identity of the imprinting strain primarily determines later VE, or if individuals’ responses to vaccination are shaped by a particular succession of exposures, which would be common to others in the same birth cohort. Regardless, variation in VE between birth cohorts appears substantial and presents a challenge for vaccination strategies (Erbelding et al., 2018).The use of different influenza vaccines in MESA during this period is unlikely to affect the results. Most people enrolled in the study received the standard-dose inactivated influenza vaccine (IIV-SD) (Figure 1—figure supplement 7). However, between 9–26% of vaccinated children <18 years old received the live attenuated influenza vaccine (LAIV) between the 2008–2009 and 2015–2016 seasons (Figure 1—figure supplement 7B). A separate study of LAIV VE in the United States found that LAIV and IIV-SD recipients who were repeat vaccinees (as most children were) had similar VE, and thus we do not expect that LAIV receipt should affect VE estimates (McLean et al., 2018). Similarly, 1–15% of adults ≥65 years old received the high-dose inactivated influenza vaccine (IIV-HD) between 2009–2010 and 2017–2018 (Figure 1—figure supplement 7C). This vaccine is 20% more effective than IIV-SD (Lee et al., 2018). Therefore, the changing ratio of IIV-HD to IIV-SD recipients over time might bias results toward cohort-specific VE in models that include people ≥65 years old. However, when we fitted to cases between 15–64 years old, we found that cohort-specific VE still performed best. Thus, we conclude that changes in IIV-HD coverage do not substantially influence results.
Figure 1—figure supplement 7.
Vaccine type received.
Most vaccinated study participants received the inactivated influenza vaccine. The fraction of vaccinated people who received the standard-dose inactivated influenza vaccine (IIV-SD), the high-dose inactivated influenza vaccine (IIV-HD), or the live attenuated influenza vaccine (LAIV) is shown for all participants (A), children < 18 years old (B), and adults ≥65 years old (C).
Potential methodological biases and the vaccination history of the study population nonetheless suggest caution in interpreting VE estimates. Selection and misclassification biases can arise when using influenza test-negative controls to control for differences in healthcare-seeking behavior (Lewnard et al., 2018; Sullivan et al., 2016). Because we also use test-negative controls to set our null expectation for the distribution of cases among birth cohorts, our VE estimates are subject to these biases as well. Moreover, since 45% of the study population is vaccinated, and most participants are frequent vaccinees (Figure 1—figure supplement 6), we are limited in our ability to generalize the VE results to populations with much lower vaccination coverage and/or a shorter history of vaccination. Frequent vaccination has been associated with reduced VE (McLean et al., 2014; Saito et al., 2018; Skowronski et al., 2016). Therefore, the model may underestimate VE in less vaccinated populations. Underestimation of VE could also occur if unvaccinated people are protected by vaccination in the preceding season. Inference might also be distorted if vaccination has large indirect effects, which our model does not consider. Finally, our analysis is worth repeating in a larger population to reduce stochastic influences. We observed an unusually high H1N1 VE in the 1998–2002 birth cohort. Because we restricted cases in this analysis to people ≥15 years old, this VE estimate included data from only the 2013–2014 and 2015–2016 influenza seasons. No H1N1 cases among vaccinated or unvaccinated individuals were observed in this birth cohort in those seasons, which led to the high VE. This might have been due to particular epidemic dynamics in MESA.
Figure 1—figure supplement 6.
Repeat vaccination by age group and season.
Each bar shows the fraction of individuals who were vaccinated in that season who also received at least one influenza vaccination in the previous two seasons.
Incorporating differences in susceptibility based on early exposures might improve methods to forecast influenza seasons. The analysis of the relative risk of infection during the first half of each season shows more variation in the susceptible age groups from season to season than previously estimated (Worby et al., 2015). While the smaller sample sizes in MESA introduce uncertainty, the correlation between the relative risk and total fraction of cases indicates that the age groups driving epidemics indeed change from season to season. Because the contact structure of the population is probably constant over influenza seasons, variation in the driving age group may be determined by fluctuating susceptibility, which is partly determined by early infections. Therefore, incorporating information on early exposure history into epidemic models may allow for more accurate identification of at-risk populations and fine-scale epidemic timing.While the rate of antigenic evolution affects the rate at which different populations become susceptible to infection, we propose that the heterogeneity in susceptibility observed here may also drive antigenic evolution. Heterogeneity in susceptibility implies that influenza viruses face different selective pressures in groups with different exposure histories (Cobey and Koelle, 2008; Nakajima et al., 2000). Recent research consistent with this hypothesis has shown that sera isolated from different individuals can select for distinct escape mutants (Lee et al., 2019). More careful study of how immune memory to influenza evolves from infection and vaccination might improve understanding of influenza’s evolution.
Code and data availability
The code and data used to perform the analyses for this project are available at https://github.com/cobeylab/FluAImprinting (Arevalo et al., 2019; copy archived at https://github.com/elifesciences-publications/FluAImprinting).In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.Acceptance summary:Rapid evolution of influenza A presents the immune system with a moving target and allows the virus to infect the same human host multiple times. This is because changes in the virus mean that acquired immunity to a strain causing one infection will not, in general, be able to prevent subsequent infections with different strains. The first influenza infection a person experiences may have a particularly strong influence on how the immune system responds to subsequent influenza infections. In this study, the effects of these early influenza exposures (determined indirectly by birth year) are examined with a thorough and elegant analysis of high quality influenza data. The findings provide strong evidence that the first infection does indeed have an important influence, explaining variation in the age distribution of influenza cases in different influenza seasons and variation in vaccine effectiveness. The evidence that vaccine effectiveness varies with birth cohort and not just with age, in particular, has important implications for how we think about vaccine effectiveness.Decision letter after peer review:Thank you for submitting your article "Earliest infections predict the age distribution of seasonal influenza A cases" for consideration by eLife. Your article has been reviewed by four peer reviewers, including Ben S Cooper as the Reviewing Editor and Reviewer #2, and the evaluation has been overseen by Neil Ferguson as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Marc Baguelin (Reviewer #3).The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.Summary:This manuscript presents an analysis of 10 years of clinical testing of respiratory illness cases for influenza from a single population in the US. Data include PCR-confirmed influenza cases including type and sub-type, age and vaccine history. The analysis aims to quantify the effects of earliest influenza infections in childhood on subsequent risk of clinical infection with different subtypes, a question of considerable scientific interest. The analysis provides evidence that susceptibility to different subtypes is determined by the subtype of the first infection (as proxied by the birth cohort) and that vaccine responses are driven by precise infection history.Essential revisions:1) Fairly substantial re-organisation of the paper is needed in order to improve readability. In particular, although the Results and Discussion are separate sections, the Results sections tend to stray into discussion, and we advise you to change this. Furthermore, this manuscript would profit from putting the Materials and methods before the Results. Although it is normal to structure eLife articles with the Materials and methods section following the Discussion, we can accommodate different organisation if the editors feel the paper requires this for readability.2) In some places more precise terminology is needed and the almost interchangeable use of infection and case is potentially confusing. These data are all obtained from people seeking care and the primary results are about subtype differences. At the very least, the language describing the results has to be very tightly constrained to cases with any discussion of possible implications for infections reserved for the discussion. Although there is a paucity of good data describing the rate at which H1 and H3 result in clinical cases, there is no suggestion that this is a uniform process. It likely varies from year to year and from age group to age group.3) The formal description of the models was in some places ambiguous or confusing and this section needs careful attention.4) We strongly suggest that the results should be about cases as a function of birth cohort (as these are the data available) and then the idea that birth cohort may be a proxy for infection history should be reserved for the discussion. Even in older people, recent infection history will vary among people the same age and will likely to be a dominant factor in both risk of infection and risk of clinical disease. Especially, given other publications from the group looking at epitope-level histories, mapping infection history so strongly onto birth cohort seems unusual.5) The results are likely to be of wide interest because of the duration of the study and the consistency of the study design. However, the population from which they come is not representative. Levels of vaccination in the US are extremely high and, given how vaccination efficacy is measured by reduced rates of clinical disease rather than by reduced rates of infection, and these data are from clinical cases, caution is needed when generalizing these results. Even though this is mentioned in the discussion, the characteristics of the population are not given sufficient prominence in the Abstract and Discussion.6) Please provide a visualisation of the raw data showing how many tests were done in each year, and what was their outcome, stratified by reported vaccination status. How did the control population compare?7) Throughout the manuscript the authors dichotomise results into significant/non-significant based on whether p-values are below or above the arbitrary threshold of 0.05 (and in some cases it is only reported whether p-values are above or below, the actual values are not given). While this is common practice, it goes against mainstream statistical thinking which advises against the use of such "bright lines" when reporting or interpreting results (see the recent consensus statement on p-values from the American Statistical Association which "is intended to steer research into a post p<0.05 era." https://www.amstat.org/asa/files/pdfs/P-ValueStatement.pdf). While p-values clearly have a role (though the authors should bear in mind that with enough data, an arbitrarily small difference of no clinical significance can have an arbitrarily small p-value) dichotomising results into significant /non-significant is rarely helpful. We strongly encourage the authors to consider whether this approach is justified in light of the ASA statement and to consider revising the manuscript accordingly.8) While not an essential revision, out-of-sample validation of the models (rather than just model comparison) would greatly strengthen the conclusions.Essential revisions:1) Fairly substantial re-organisation of the paper is needed in order to improve readability. In particular, although the Results and Discussion are separate sections, the Results sections tend to stray into discussion, and we advise you to change this. Furthermore, this manuscript would profit from putting the Materials and methods before the Results. Although it is normal to structure eLife articles with the Materials and methods section following the Discussion, we can accommodate different organisation if the editors feel the paper requires this for readability.We have reorganized the paper. The Materials and methods section is now before the Results and we moved less central methods to a Supplementary Methods section to improve readability. We moved the “Modeling approach” section to the Materials and methods directly before the section now titled “Mathematical expressions for model components” to clarify the logic of our approach.We also increased the distinction between the Results and Discussion sections. For instance, we moved interpretation of the results in the subsection titled “VE varies by birth cohort in older children and adults” to the Discussion. More description is in the responses to Major Point 4.2) In some places more precise terminology is needed and the almost interchangeable use of infection and case is potentially confusing. These data are all obtained from people seeking care and the primary results are about subtype differences. At the very least, the language describing the results has to be very tightly constrained to cases with any discussion of possible implications for infections reserved for the discussion. Although there is a paucity of good data describing the rate at which H1 and H3 result in clinical cases, there is no suggestion that this is a uniform process. It likely varies from year to year and from age group to age group.This is an excellent point. We now refer to the risk of “medically attended infections” instead of “infections” where appropriate.3) The formal description of the models was in some places ambiguous or confusing and this section needs careful attention.We have made the changes suggested. In brief, we added equation numbers and more detail to the sections on calculating the attack rate, imprinting probabilities, and model likelihood.4) We strongly suggest that the results should be about cases as a function of birth cohort (as these are the data available) and then the idea that birth cohort may be a proxy for infection history should be reserved for the discussion. Even in older people, recent infection history will vary among people the same age and will likely to be a dominant factor in both risk of infection and risk of clinical disease. Especially, given other publications from the group looking at epitope-level histories, mapping infection history so strongly onto birth cohort seems unusual.We have shifted most interpretation to the Discussion:“We hypothesized that VE in these ages varies with early exposure history, which correlates with birth year, rather than age.”“VE differs between birth cohorts that have similar imprinting by subtype.”Paragraph beginning “Our results support the idea that biases in immune memory from early exposures (i.e., original antigenic sin) influence VE…” has been moved from Results to Discussion.5) The results are likely to be of wide interest because of the duration of the study and the consistency of the study design. However, the population from which they come is not representative. Levels of vaccination in the US are extremely high and, given how vaccination efficacy is measured by reduced rates of clinical disease rather than by reduced rates of infection, and these data are from clinical cases, caution is needed when generalizing these results. Even though this is mentioned in the discussion, the characteristics of the population are not given sufficient prominence in the Abstract and Discussion.We agree that the 45% vaccination coverage of the Marshfield population, although similar to national coverage, limits our ability to generalize some results to much less vaccinated populations. We have revised the Abstract:“Seasonal variation in the age distribution of influenza A cases suggests that factors other than age shape susceptibility to medically attended infection. We ask whether these differences can be partly explained by protection conferred by childhood influenza infection, which has lasting impacts on immune responses to influenza and protection against new influenza A subtypes (phenomena known as original antigenic sin and immune imprinting). Fitting a statistical model to data from studies of influenza vaccine effectiveness (VE), we find that primary infection appears to reduce the risk of medically attended infection with that subtype throughout life. This effect is stronger for H1N1 compared to H3N2. Additionally, we find evidence that VE varies with both age and birth year, suggesting that VE is sensitive to early exposures. Our findings may improve estimates of age-specific risk and VE in similarly vaccinated populations and thus improve forecasting and vaccination strategies to combat seasonal influenza.”We have also added the following line to the Discussion:“Moreover, since 45% of the study population is vaccinated, and most participants are frequent vaccinees (Figure 1—figure supplement 6), we are limited in our ability to generalize the VE results to populations with much lower vaccination coverage and/or a shorter history of vaccination. Frequent vaccination has been associated with reduced VE (McLean et al., 2014; Saito et al., 2018; Skowronski et al., 2016). Therefore, the model may underestimate VE in less vaccinated populations. Underestimation of VE could also occur if unvaccinated people are protected by vaccination in the preceding season. Inference might also be distorted if vaccination has large indirect effects, which our model does not consider.”6) Please provide a visualisation of the raw data showing how many tests were done in each year, and what was their outcome, stratified by reported vaccination status. How did the control population compare?We now show the data of all enrolled (i.e., tested) individuals for each season, stratified by age, vaccination status, and test outcome as Figure 1—figure supplement 1.The proportion of test-positive cases changes between age groups and seasons, consistent with susceptibility of each age group changing over time. The proportion of vaccinated cases among test-positive cases is generally smaller than the proportion of vaccinated cases among test-negative cases, consistent with the protective effect of vaccination.We also stratified our comorbidity analysis by age, season, vaccination status, and test status (Figure 1—figure supplement 4). Regardless of test outcome, vaccinated people tend to have more high-risk comorbidities than unvaccinated people. We used test-negative cases to adjust for this effect when estimating VE and have clarified this in our overview of the modeling method:“However, vaccinated individuals may seek medical attention for acute respiratory infection more frequently than non-vaccinees due to correlations between the decision to vaccinate, healthcare-seeking behavior, and underlying medical conditions (Jackson et al., 2005a,b; Belongia et al., 2009). Indeed, we generally observe higher rates of high-risk medical conditions among vaccinated people compared to unvaccinated people (Figure 1—figure supplement 4). We attempted to adjust for this by calculating the fraction of vaccinated people among those who had MAARI and tested negative for influenza (i.e., the test-negative controls, Mathematical expressions for model components: "Vaccination").”7) Throughout the manuscript the authors dichotomise results into significant/non-significant based on whether p-values are below or above the arbitrary threshold of 0.05 (and in some cases it is only reported whether p-values are above or below, the actual values are not given). While this is common practice, it goes against mainstream statistical thinking which advises against the use of such "bright lines" when reporting or interpreting results (see the recent consensus statement on p-values from the American Statistical Association which "is intended to steer research into a post p<0.05 era." https://www.amstat.org/asa/files/pdfs/P-ValueStatement.pdf). While p-values clearly have a role (though the authors should bear in mind that with enough data, an arbitrarily small difference of no clinical significance can have an arbitrarily small p-value) dichotomising results into significant /non-significant is rarely helpful. We strongly encourage the authors to consider whether this approach is justified in light of the ASA statement and to consider revising the manuscript accordingly.We agree such dichotomies should be avoided and thank the reviewers for bringing to our attention opportunities for more nuance. For the modelling results, we report 95% confidence intervals for parameter estimates in an effort to focus on effect sizes and windows of uncertainty. Here, we believe that it is reasonable to use terms such as “not significantly different” to denote parameter estimates under different models that cannot be statistically distinguished from each other.In other places, we have revised the text and figures to report our results more accurately:Materials and methods text changed from “The G-test of independence was used to determine whether each pair of seasons had significantly different age distributions. We considered differences significant if the Bonferroni-corrected p-value was <0.05” to “The G-test of independence was used to measure differences in seasons' age distributions.”. In Figure 2—figure supplement 1, we now report both p-values and the G-statistic (a measure of effect size) for all pairs of seasons.We now report the correlation between an age group’s rank in relative risk in a season and its rank in relative size (the fraction of cases) in that season (Figure 2—figure supplement 2). We also report confidence intervals for Pearson’s r instead of p-values.We removed the dashed line that indicated the value of Spearman’s ρ corresponding to a p-value of 0.05.We removed p-values and replaced them with 95% confidence intervals of Spearman’s ρ.8) While not an essential revision, out-of-sample validation of the models (rather than just model comparison) would greatly strengthen the conclusions.We performed a leave-one-out cross-validation analysis, which yielded the same results. Briefly, we excluded a single season of data, refitted all models to this subset, and evaluated how well each model predicted the birth year distribution of cases for the excluded season (subsection “Evaluation of predictive power”).Leave-one-out cross-validation gave the same main result as our original findings based on cAIC. The model with the lowest mean-squared prediction error included age-specific medically attended infection risk, age-specific vaccine effectiveness, and HA imprinting by subtype. We have added this to the Results section:“Our best-fitting model supports subtype-specific imprinting for H1N1 and H3N2 (Figure 5, top row; Appendix 2: Table 1). This model also provides the best predictive power compared to other models in a leave-one-out cross-validation analysis (Figure 5—figure supplement 1; Figure 5—figure supplement 2; Appendix 1: “Evaluation of predictive power”).”
Authors: Edward A Belongia; Burney A Kieke; James G Donahue; Laura A Coleman; Stephanie A Irving; Jennifer K Meece; Mary Vandermause; Stephen Lindstrom; Paul Gargiullo; David K Shay Journal: Vaccine Date: 2011-07-19 Impact factor: 3.641
Authors: Carlos A DiazGranados; Andrew J Dunning; Murray Kimmel; Daniel Kirby; John Treanor; Avi Collins; Richard Pollak; Janet Christoff; John Earl; Victoria Landolfi; Earl Martin; Sanjay Gurunathan; Richard Nathan; David P Greenberg; Nadia G Tornieporth; Michael D Decker; H Keipp Talbot Journal: N Engl J Med Date: 2014-08-14 Impact factor: 91.245
Authors: Amy L Kieke; Burney A Kieke; Sarah L Kopitzke; David L McClure; Edward A Belongia; Jeffrey J VanWormer; Robert T Greenlee Journal: Clin Med Res Date: 2014-12-08
Authors: Jason K H Lee; Gary K L Lam; Thomas Shin; Jiyeon Kim; Anish Krishnan; David P Greenberg; Ayman Chit Journal: Expert Rev Vaccines Date: 2018-05-16 Impact factor: 5.217
Authors: Karen L Laurie; Teagan A Guarnaccia; Louise A Carolan; Ada W C Yan; Malet Aban; Stephen Petrie; Pengxing Cao; Jane M Heffernan; Jodie McVernon; Jennifer Mosse; Anne Kelso; James M McCaw; Ian G Barr Journal: J Infect Dis Date: 2015-05-05 Impact factor: 5.226
Authors: Sylvia Ranjeva; Rahul Subramanian; Vicky J Fang; Gabriel M Leung; Dennis K M Ip; Ranawaka A P M Perera; J S Malik Peiris; Benjamin J Cowling; Sarah Cobey Journal: Nat Commun Date: 2019-04-10 Impact factor: 14.919
Authors: Haley L Dugan; Jenna J Guthmiller; Philip Arevalo; Min Huang; Yao-Qing Chen; Karlynn E Neu; Carole Henry; Nai-Ying Zheng; Linda Yu-Ling Lan; Micah E Tepora; Olivia Stovicek; Dalia Bitar; Anna-Karin E Palm; Christopher T Stamper; Siriruk Changrob; Henry A Utset; Lynda Coughlan; Florian Krammer; Sarah Cobey; Patrick C Wilson Journal: Sci Transl Med Date: 2020-12-09 Impact factor: 17.956
Authors: Rachel J Oidtman; Philip Arevalo; Qifang Bi; Lauren McGough; Christopher Joel Russo; Diana Vera Cruz; Marcos Costa Vieira; Katelyn M Gostic Journal: Trends Microbiol Date: 2021-07-01 Impact factor: 17.079
Authors: Kevin R McCarthy; Tarra A Von Holle; Laura L Sutherland; Thomas H Oguin; Gregory D Sempowski; Stephen C Harrison; M Anthony Moody Journal: Proc Natl Acad Sci U S A Date: 2021-06-08 Impact factor: 11.205
Authors: Eric P Griggs; Brendan Flannery; Ivo M Foppa; Manjusha Gaglani; Kempapura Murthy; Michael L Jackson; Lisa A Jackson; Edward A Belongia; Huong Q McLean; Emily T Martin; Arnold S Monto; Richard K Zimmerman; Goundappa K Balasubramani; Jessie R Chung; Manish Patel Journal: Am J Epidemiol Date: 2022-02-19 Impact factor: 5.363