Abstract
Influenza pandemics typically occur in multiple waves of infection, often associated with initial emergence of a novel virus, followed (in temperate regions) by a later resurgence accompanying the onset of the annual influenza season. Here, we examined whether data collected from an initial pandemic wave could be informative, for the need to implement non-pharmaceutical measures in any resurgent wave. Drawing from the 2009 H1N1 pandemic in 10 states in the USA, we calibrated simple mathematical models of influenza transmission dynamics to data for virologically confirmed hospitalisations during the initial ‘spring’ wave. We then projected pandemic outcomes (cumulative hospitalisations) during the fall wave, and compared these projections with data. Model results show reasonable agreement for all states that reported a substantial number of cases in the spring wave. Using this model we propose a probabilistic decision framework that can be used to determine the need for pre-emptive measures such as postponing school openings, in advance of a fall wave. This work illustrates how model-based evidence synthesis, in real-time during an early pandemic wave, could be used to inform timely decisions for pandemic response.
Introduction
Even as the current coronavirus pandemic causes morbidity, mortality and societal disruption on a global scale, the threat of similar disruptions from pandemic influenza continues unabated [10]. The 1918 H1N1 pandemic caused an estimated 50 million deaths worldwide, with a similar case fatality rate to COVID-19 [1,3]. Today, highly pathogenic strains of avian influenza continue to cause sporadic infections in humans, with occasional instances of human-to-human transmission [13]: preparedness for a future influenza pandemic thus remains as important as ever.
A marked feature of influenza epidemiology is its pronounced seasonality, with already-established influenza viruses causing epidemics each winter in temperate regions of the world. Such seasonality is likely to arise from a combination of factors, and has been associated with environmental conditions including absolute humidity [19], as well as increased transmission amongst schoolchildren with the post-holiday opening of school terms [7,12]. These seasonal drivers have played a strong role in shaping the dynamics of pandemic, as well as seasonal, influenza. For example, in the USA in 2009, the novel H1N1 virus caused a ‘spring wave’ from April to July, during which an estimated 1.8m-5.7m people experienced symptomatic infection and 9,000-21,000 were hospitalised [17]. The subsequent onset of the influenza season in October 2009 was accompanied by a strong resurgence of the virus (the ‘fall wave’), resulting an estimated 60.8m total infections and 274,304 hospitalisations by April 2010 [20]. Similar multi-wave behaviour emerged in other temperate countries, with the UK experiencing three successive waves [16,6]. If novel influenza viruses can emerge at any time of year in temperate countries, it is more likely than not that they would do so outside the normal influenza season, which typically spans a 4 month period (late October to late February) in the Northern Hemisphere.
For any future influenza pandemic, population surveillance data collected from any initial, out-of-season wave could therefore give important information for characterising severity and transmissibility. Could such data be used to predict the potential health impact of a subsequent fall/winter wave? If so, could such projections be used in real time, to trigger pre-emptive control measures in advance of that wave? In this work we addressed these questions using a mathematical model of influenza transmission dynamics, with a focus on pandemic spread in the USA. We present examples of how this framework can be used to guide real-time decision-making for future pandemics.
Results
Figure 1 shows FluSurv-NET data for virologically confirmed hospitalisations with pH1N1 during the spring and fall waves of the 2009 pandemic, for 10 states in the USA in which this data was available. The figure illustrates the difference in size between the two waves. Notably, the geographically southern states of Georgia, New Mexico and Tennessee reported only sparse data in the spring/summer period: these same states also showed substantially fewer cases being reported in the fall wave per capita, when compared to other states.
Shown is weekly, age-specific data collected by the US Centers for Disease Control and Prevention (CDC), for hospitalisations that were virologically confirmed as being pandemic H1N1. Each colour denotes a different age group, as indicated by the legend. Panels show data from the different states reporting FluSurv-NET data. Weeks are numbers along the x-axis according to MMWR numbering with, for example, week 35 corresponding to the week beginning on Sunday 30th August. Note that the y-axis varies between states. As described in the main text, several of these states (e.g. California) show clear signs of distinct spring and fall waves. For the purpose of the model, we used these data in combination with CDC estimates for the proportion of symptomatic cases that are hospitalised, virologically tested, and reported through this dataset.
To model these data, we calibrated a deterministic, compartmental model of influenza transmission dynamics, with five age groups: <4 years old (yo), 5-19yo, 20-49yo, 50-64yo, and >65yo (see Materials and Methods). We calibrated this model to data from the spring wave illustrated in Figure 1, i.e. from months April-August. For each location we then used the calibrated model to project the fall wave dynamics. Figure 2 shows results in the example of California, illustrating both spring wave calibration, and fall wave projections. Figure S2 shows outputs disaggregated by state, and Figure S3 illustrates the performance of the Bayesian Markov Chain Monte Carlo used to perform these calibrations. Although the model tends to estimate fall wave peak timing somewhat earlier than actually occurred in the data (week 41 vs week 42 for aggregate results, week 40 vs week 44 for Connecticut), there is reasonable agreement for the size of the fall wave (comparing areas under the curve for model projections and data for incidence). For the remainder of this analysis, we focus on the cumulative burden in the fall wave rather than peak timing.
For each state shown in Fig.1, we calibrated the model to the epidemic data from the spring wave (black line, to the left of the vertical dashed line, with aggregated model projections shown in grey shaded area). Using this calibrated model, we projected simulations forward into the fall, taking account of the effect of school openings and environmental forcing (blue shaded area). Although the model projection for epidemic peak timing varied in accuracy across states, our subsequent analysis concentrates on cumulative burden (area under the curve). See Figure S1 for results for other states.
Figure 3 shows a state-wise disaggregation for the cumulative projected hospitalisations in the fall wave. Model projections show good agreement in 7 of the 10 states studied. However, model projections appear less accurate in Georgia, New Mexico and Tennessee, where the model substantially overestimates cumulative hospitalisations in the fall wave. As noted above, these are the states with only sparse data reported in both spring and fall waves.
Each panel shows a different state. Crosses in black show data, while coloured points show model-based projections, with each point representing the result of a single sample from the posterior density.
For those states showing reasonable model performance, we next examined how this framework could be operationalised, to trigger pre-emptive interventions in advance of the fall wave. Such interventions could involve, for example, physical distancing orders; pre-emptive school closures; or other non-pharmaceutical measures aimed at reducing opportunities for transmission. As such measures are typically costly and disruptive, any decision to implement them must carefully balance these disruptions against the risks of widespread morbidity and mortality.
As an illustrative example in the current analysis, we concentrated on pre-emptive school closures (i.e. postponing the start of the school term), until a vaccine becomes available. To inform our assumptions for vaccine roll-out, we assumed the same trajectory as in the H1N1 pandemic, when a vaccination programme was in October, ultimately to cover over 25% of the population (see Figure S4). In a hypothetical scenario where pre-emptive school closures are implemented for a 2009-like pandemic, Figure 4 illustrates the reduction in fall wave burden, as a function of different durations of intervention. The figure illustrates, for example, that a 10-week postponement in school opening, together with the impact of the vaccine rollout, could reduce cumulative hospitalisations in the fall wave by 72% (95% CrI 38-90%).
Each colour shows a different age group as indicated by the legend, while shaded areas show 25-75th percentiles, with 2009 vaccination coverage/efficacy. The vertical dashed line represents the candidate delay of 10 weeks usedto illustrate our decision framework in figure 5.
As a decision tool for when to trigger such measures, we defined the ‘probabilistic risk score’ (PRS) as the probability that cumulative hospitalisations in the fall wave will exceed a threshold of h per capita. We assumed that this risk score would be evaluated at the end of the spring wave, and that pre-emptive school closures would be triggered if PRS exceeds a given threshold p. In practice, both h and p would be determined by a policymaker, and can be interpreted as reflecting considerations of healthcare capacity (cumulative hospitalisations, h), alongside tolerance of uncertainty (p). As an illustrative example, we assumed a scenario where h = 1, 500 and P = 0.1. Figure 5 illustrates this decision tool being applied to the H1N1 pandemic in California, as well as an alternative scenario with a hypothetical virus having twice the severity (i.e. the same infectivity, but twice the risk of hospitalisation). Under these parameters, a 2009-like virus would not trigger the intervention (blue line), whereas a more severe virus would do so (solid red line). The FluSurv-NET data set estimates 1, 013 hospitalisations in California during the second wave, consistent with our decision not to trigger an intervention. Moreover in this second scenario, pre-emptive school closures for 10 weeks would bring PRS substantially below the threshold (dashed red line).
Shown, for illustration, is the example of California. These plots can be interpreted as cumulative probability distributions, for the total hospitalisations projected in the fall wave. As described in the main text, we define a ‘probabilistic risk score’ (PRS) as the probability that fall wave hospitalisations will increase a given threshold, h. We assume that pre-emptive interventions would be triggered if PRS exceeds some threshold probability P, with both H and P determined by a policymaker. The figure shows an illustrative scenario where h = 1, 500 cumulative hospitalisations, and P = 0.1 (‘reference point’, shown as a black dot). Any model-based projections can be represented as a downward-sloping curve on this plot: pre-emptive interventions would be triggered if the curve intersects the vertical, dashed line at any point above the reference point. As examples, the blue curve shows model projections for a 2009-pandemic-like virus in California (i.e. corresponding to Fig.3A), a scenario that would not trigger pre-emptive interventions. The solid red curve shows an alternative scenario, of a virus that is equally infectious, but twice as severe (i.e. having twice the risk of hospitalisation given infection). Such a virus would trigger pre-emptive interventions; the dashed red curve shows the reduction in hospitalisation risk that would occur, in a scenario where school opening is postponed for 10 weeks until vaccine rollout is underway (assuming the same vaccine introduction and rollout scenario as occurred in 2009-2010, in response to the pandemic).
Discussion
In any future influenza pandemic, early and accurate information will be critical in deciding how best to respond. Here we have examined how mathematical modelling of transmission dynamics could be used to analyse surveillance data in the early stages of a pandemic, to inform decisions for pre-emptive non-pharmaceutical interventions, in advance of any second wave of infection. Notably, although our approach is based on a relatively simple, compartmental model of influenza transmission, for states where there is sufficient data, this approach shows reasonable projections for cumulative fall wave burden (Fig.3). A key benefit of such a modelling approach is that it can be readily deployed in real time: we illustrate how this model could be applied in a probabilistic way, to inform decisions for pre-emptive school closures (Fig.4).
Model calibration to an unmitigated phase allows us to capture important epidemiological properties of an outbreak, most crucially the basic reproductive number. Though such calibration does not require an entire wave to be unmitigated, any changes in contact patterns during the calibration period is likely to bring additional uncertainty into the model.
Our modelling approach performs less well in states having only sparse data for the spring wave (Figures 1 and 3). Such sparseness could be explained either by under-reporting, or by a genuinely lower level of influenza activity in these locations. However, it is notable that these states also reported systematically low numbers in the fall wave as well (Figures 3 and S2), suggesting that data reporting may be the driving factor. We focused on data for virologically confirmed hospitalisation because it was the least affected by changes in testing practices, during the pandemic [18]. Nonetheless, an important area for future work is to explore the potential for incorporating other forms of data as well, including syndromic and virological surveillance collected from the primary care level and above [14]. Combining different streams of data in this way could offer a helpful approach, to compensate for shortfalls in any individual data stream.
As an example of non-pharmaceutical interventions, we have modelled pre-emptive school closures, i.e. postponing the opening of schools. We note that our estimates for the impact of these measures are driven by modelled variations in the age-specific contact matrix, depending on whether schools are in or out of session. In turn, these variations are derived from estimated contact rates in an education setting[2]. In future, such estimates would benefit from primary evidence for the ‘real world’ impact that could arise from pre-emptive school closures. Because much of the available, primary evidence arises from closures that occurred during the course of an epidemic, it is likely to underestimate the impact of pre-emptive measures. The absence of influenza cases in 2020-2021 is indicative that the combination of non-pharmaceutical interventions employed to mitigate he spread of COVID-19 prevented major influenza outbreaks, though school closures were only a part of this effort [21]. Available evidence is equivocal for COVID-19, but the apparently milder natural history of infection in children, in comparison with influenza, may limit its generalizibility. Nonetheless, other statistical analysis, taking advantage of country-level variations in school opening dates, suggests that later school openings are indeed associated with reduced epidemic peaks [4].
As with any modelling study, our analysis has limitations to note. Our mathematical model involves several simplifications: averaging at the state level, it does not address the marked intra-state, spatial heterogeneity seen in the 2009 pandemic [8], indeed heterogeneity that is likely to be displayed by any future pandemic as well. Further work could seek to address these complexities by incorporating spatial structure. However, it would be important for any such approach to maintain a balance between complexity, and rapid deployability, during a pandemic. As noted above, our work illustrates that even a simple model is able to capture the cumulative fall wave burden in reasonable agreement with the data. Nonetheless, more complex models may be helpful in better capturing the peak timing of the fall wave: an estimate that could be equally important for decision-making, as the cumulative burden. Additionally, our estimates depend on case-to-hospitalisation multipliers, which were estimated during the course of the 2009 pandemic by combining careseeking interviews with other sources of evidence [9]. In any future pandemic, an application of this approach would likewise necessitate such evidence generation, which would need to be implemented rapidly, during the few months of the initial wave. Alternatively, establishing readiness for serological surveillance [5] would likewise provide critical information during the course of a future pandemic, to help translate hospitalisation and syndromic surveillance data to actual burden in the community.
In conclusion, the ongoing coronavirus emergency has highlighted the difficult choices faced by decision-makers, in the face of a pandemic: how to weigh the societal disruptions of sweeping interventions against the need for rapid, decisive action to mitigate the spread of a dangerous pathogen. While surveillance alone plays a critical role in informing these decisions, dynamical analysis of this data could offer additional, important insights for pre-emptive action. We have illustrated our methodology with respect to delayed school openings in order to mitigate a fall wave, though a broader application is feasible, in which real-time data informs parameter uncertainty in a simple model, and the forward projections under different scenarios are used to inform institutional closures or behavioural change. Overall, this analysis serves to illustrate how surveillance data can be analysed with rapid, simple models, in order inform projections on the need for future intervention.
Methods
The Model
We use a deterministic SIR (susceptible-infectious-removed) model of influenza transmission defined as follows:
where SV denotes vaccinated susceptibles, SV P vaccinated susceptibles with active vaccine protection, IS symptomatic infectious, IA asymptomatic infectious, and the subscript i indexes 5 age groups (0-4, 5-19, 18-49, 50-64 and 65+). The contact matrix C taken from [15]. The parameter ρ the proportion of cases that are symptomatic, r the reduction in infectiousness of asymptomatic cases, vi(t) the vaccination rate in age-group i, and xi the corresponding infection-blocking efficacy. We use the vaccination uptake and efficacy observed in 2009, which began in early October (MMWR week 39, Figure S4). We assume a mean delay of 14 days (u = 1/14) from administration of vaccine to full protection. For each FluSurv-NET location, we simulate using the corresponding age-stratified sample location populations Ni. Hospitalisations are modelled as a multiplicative factor of symptomatic incidence, using the case-hospitalisation ratios given in [18]. Vaccination rates and hospitalisation multipliers are given at national level only.
Uncertainty in disease incidence is captured via uncertainty in case-hospitalisation multipliers. These are assumed normally distributed for each age group i, with mean µi and standard deviation σi, given in table 1. For parameter set θ, our model produces simulated weekly incidence y(t), desegregate by age group. These values are divided by the corresponding hospitalisation data to yield simulated multipliers. The likelihood L(θ) is then a product of normal likelihoods N (µi, σi) over all age groups and time points.
Case-hospitalisation multipliers (USA).
We perform a Bayesian fit to the first wave only (up to 1st September, MMWR week 35) using an adaptive MCMC algorithm [11]. We allow for uncertainty in the coefficients of C and account school closures throughout this period by subtracting fixed values δ1 and δ2 from C11 and C22, determined by the education-specific contact rates given in [2]. A complete set of fitted model parameters and ranges of uniform priors is given in table S1. Sampling from the posterior density from the first wave, we project from 1st September, imposing the change in contact rates due to school openings. Together with seasonality, this generates a second wave of infection.
A delay in school openings is modelled via a delay in the change in contact rates. The probabilistic risk scores for a given state and delay in school opening are calculated by projecting the corresponding epidemic scenario for each parameter set in our posterior sample. Each simulation yields a total number of hospitalisations, which are always counted from the week of 1st September, irrespective of delay in school openings.
Data Availability
All data (FluSurNet) is freely available online.
https://www.cdc.gov/flu/weekly/influenza-hospitalization-surveillance.htm
Supplementary Material
Model parameters and boundary values of uniform priors.
Cumulative hospitalisations by state in the first (left) and second (right) waves of the 2009 influenza pandemic. The second wave is counted from week 35(inclusive), the week of 1st September.
Model calibrations and projections by state.
Marginal densities of the Bayesian fit to first-wave data for California: basic reproductive number R0 and recovery rate γ (top); R0 and amplitude of seasonality φ2 (bottom).
2009 influenza pandemic vaccine roll-out in the USA.