Uncertainty and error in SARS-CoV-2 epidemiological parameters inferred from population-level epidemic models

Dominic G. Whittaker; Alejandra D. Herrera-Reyes; Maurice Hendrix; Markus R. Owen; Leah R. Band; Gary R. Mirams; Kirsty J. Bolton; Simon P. Preston

doi:10.1101/2022.07.01.22277134

Abstract

During the SARS-CoV2 pandemic, epidemic models have been central to policy-making. Public health responses have been shaped by model-based projections and inferences, especially related to the impact of various non-pharmaceutical interventions. Accompanying this has been increased scrutiny over model performance, model assumptions, and the way that uncertainty is incorporated and presented. Here we consider a population-level model, focusing on how distributions representing host infectiousness and the infection-to-death times are modelled, and particularly on the impact of inferred epidemic characteristics if these distributions are misspecified. We introduce an SIR-type model with the infected population structured by ‘infected age’, i.e. the number of days since first being infected, a formulation that enables distributions to be incorporated that are consistent with clinical data. We show that inference based on simpler models without infected age, which implicitly misspecify these distributions, leads to substantial errors in inferred quantities relevant to policy-making, such as the reproduction number and the impact of interventions. We consider uncertainty quantification via a Bayesian approach, implementing this for both synthetic and real data focusing on UK data in the period 15 Feb–14 Jul 2020, and emphasising circumstances where it is misleading to neglect uncertainty.

1 Introduction

A simple Susceptible-Infected-Removed (‘SIR’) epidemic model, sometimes termed the ‘general epidemic model’ (Diekmann and Heesterbeek, 2000), is defined as follows. Suppose that S_i and I_i are the number of susceptible and infectious individuals, respectively, on day i; and that between day i and i + 1 for some rate constants λ and θ, λS_iI_i of the susceptibles become infectious, and θI_i of the infectious individuals become ‘removed’ (by death or recovery, so that they are no longer susceptible nor infectious). Though very simple, this model embodies important epidemiological principles and exhibits reasonable dynamics, including exponential growth in the early stages and decline once the susceptible population is suitably depleted. The model also connects to key epidemiological parameters such as the basic reproduction number, R₀, defined as expected number of cases directly generated by one case in a population where all individuals are susceptible, for this model equal to λ/θ. The SIR model remains widely used for prediction and inference for epidemics, including SARS-CoV-2 (Britton, 2020; Dehning et al., 2020).

One of this simple model’s limitations, however, is that it does not distinguish the infectious individuals by how long they have been infected, termed the infected age. Consequently, it assumes that the infectiousness of infected individuals is constant, and therefore that the time between individuals becoming infected and then either recovering or dying follows an exponential (or geometric, in discrete time) distribution. These assumptions are strongly in conflict with clinical literature (Ferretti et al., 2020; Ganyani et al., 2020; Harrison et al., 2020; Verity et al., 2020), as we discuss in the following section. The principal contributions of this paper are to introduce an epidemic model which structured by “infected age”, and to use this model as a basis to infer quantities important to policy-making. Similar models incorporating structure by infected age have been investigated by (Hart et al., 2020) and (Diekmann et al., 2021) in the the context of the “forward” problem, i.e., understanding the impact on model dynamics, in contrast to our focus here on the “inverse” problem, i.e., of understanding the impact on inference based on data: our goal is to explore the inferential consequences of ignoring the infected-age structure, and to highlight circumstances when doing so gives misleading conclusions.

Quantities of policy-making relevance and of interest to infer include: the timing and impact of behavioural changes, especially relating to non-pharmaceutical interventions (NPIs) such as ‘lockdown’ measures; the effect of NPIs on epidemiological characteristics such as the effective reproduction number, ℛ (defined later); the timing and size of the peak in the number of new daily infections; and the number of individuals that remain susceptible at the end of an epidemic wave.

In Section 2 we describe the data available on the epidemic time-course and on changes in mobility patterns relating to response to NPIs. We describe an infected-age-structured model, which is the central model in the paper, and some simpler models for comparison of the types sometimes used for inferring characteristics of an epidemic. The results in Section 3 include an application of the model to compare competing hypotheses for epidemic decline; a simulation study investigating the impact of using non-infected-age-structured models for inference when the data arise from an infected-age-structured model; and full Bayesian inference focusing on UK data for the SARS-CoV-2 outbreak in spring 2020, including for the quantities listed in the preceding paragraph. Section 4 contains a concluding discussion.

2 Data, Models and Methods

2.1 Data

2.1.1 Deaths associated with SARS-CoV-2

We focus on country-level SARS-CoV-2 mortality data for England and Wales for the period FebJuly 2020. The total population, denoted N in the following models, is 59.1 million (ONS, 2020). Various time-course data are available for this period, including the number of confirmed cases by specimen date, though such data are influenced by rapid changes to testing capacity (DHSC, 2020) and strategy (Dunn et al., 2020) that make them challenging to connect to variables in an epidemic model. For this reason we focus exclusively on data for SARS-CoV-2-associated deaths collated by date of death for England and Wales by the UK Office for National Statistics. These deaths data count registered deaths where COVID-19 was mentioned on the death certificate (ONS, 2020), with the first death occurring on 8 Mar 2020. Although death data are available beyond July 2020, we restrict ourselves to considering inferences that may be made at the conclusion of the first epidemic wave. The deaths data are plotted later in results Figure 3.

2.1.2 Mobility data

The UK government announced on 23 Mar 2020 that a lockdown would be imposed entailing severe restrictions including that that no one should leave their place of residence except for some specific reasons (UK Government, 2020a). Over the next 5 months this legislation was revised approximately monthly, sequentially expanding the permissible reasons to leave one’s place of residence, whilst introducing guidance on COVID-secure workplaces (Department for Business, Energy & Industrial Strategy and Department for Digital, Culture, Media & Sport, 2020) and mandating face coverings in some public spaces (UK Government, 2020b). Publicly available data indicate the extent to which public behaviour altered in this period. Google mobility data, for example, indicate a relative change with respect to a normal baseline, in activity in various categories across the UK (Google, 2020), and provide a proxy for changes in mixing intensity in England & Wales over this period. Figure 1A shows the mean of ‘workplace’ and ‘transit’ categories computed daily from the UK Google mobility data, which motivates later modelling of a change in population mixing intensity with a step function to account for the introduction of NPIs.

Figure 1:

(A) Google mobility data and the parameterisation for a step function, defined in (3), for α_j, which describes changes in population mixing intensity. The data points are given by the mean of workplace and transit activity levels for the UK, with Saturday and Sunday removed (the four outliers being bank holidays). The blue step is given by equation (3) and contains two parameters which control the timing and severity of the drop in viral transmission due to lockdown measures: t^∗ and α_b, respectively. Plots (B) and (C) respectively show the infectiousness profile, over a 15 day period, and the infection-to-death distribution, shown over a 50 day period; these correspond to the distributions and default parameters listed in Table 1.

2.1.3 Clinical data on individuals’ response to SARS-CoV-2 infection

The models in §2.2 involve epidemiological parameters drawn from the clinical literature that characterise individuals’ infectiousness and mortality. These are summarised in Table 1, including ‘default’ values that are used in the later simulation study, and priors that are used for the Bayesian inference.

View this table:

Table 1:

Summary of key epidemiological parameters drawn from clinical literature, as described in §2.1.3, including default values used later to generate synthetic data (§3.2), and priors used in the Bayesian inference (§3.4). The default values correspond with the plots of the infectiousness profile and infection-to-death distribution shown in Figures 1B and C. The gamma distribution priors are parameterised as Γ(scale, shape), and the normal priors as 𝒩 (mean, std dev).

For most hosts infected with SARS-CoV-2, detectable viral load in the upper respiratory tract peaks at around 5–6 days following exposure (He et al., 2020). For many hosts this approximately aligns with the the delay from exposure to experiencing symptoms (the infection-to-onset period) (Backer et al., 2020), and thus there is a significant period of infectiousness prior to symptom onset (He et al., 2020; Ashcroft et al., 2020). Some hosts remain asymptomatic throughout infection and experience similar viral load dynamics, besides somewhat faster viral clearance following peak viral load (Kissler et al., 2020). For the purposes of developing a transmission model, it is therefore important to describe a typical host infectiousness profile, β_j, subject to Σ_j β_j = 1, as depending on the time in days, j, since infection (rather than with respect to symptom onset).

The generation time interval, defined as the period from host infection to the generation of progeny infection, has a distribution that describes such an infectiousness profile (Lehtinen et al., 2020). We use the inferred gamma-distributed generation time from the Singapore cluster in Ganyani et al. (2020), with mean 5.2 days and variance 2.96 days, to define a fixed infectiousness profile β_j when inferring the impact of lockdown in §3.4. Since the models in this paper are in discrete time, we discretise the gamma distribution by taking β₁ = F (3/2) and β_j = F (j + 1/2) −F (j −1/2) for j ≥ 2, where F ( ) is the gamma cumulative distribution function. We do not have generation time interval estimates for transmission chains specific to England and Wales. However estimates for other international data sets are congruent with the estimate from Ganyani et al. (2020). Ferretti et al. (2020) estimates a mean generation time of 5.0 days and variance 1.9 from a sample of 40 infector-infectee pairs. Sun et al. (2020) estimate a similar median generation time (5.3 days, IQR:3.1-8.7) that is correlated with the delay from symptom onset to isolation. To account for potential variability due to alternate estimates for the latent period, the role of an infected’s behaviour — in particular the efficacy and timing of isolation — in influencing the generation interval, and the potential for unrepresentative sampling of infector-infectee pairs, we draw on the range and reported confidence across these estimates to construct priors on the mean and variance of a gamma distributed infectiousness profile (see Table 1).

We refer to β_j as the infectiousness profile rather than the generation time distribution, as the latter has a small correction due to any overlap in the distributions for time from infection to death and infectiousness by infected-age due to the non-zero hazard of death while infectious (see §2.2.5). Figure 1B shows the ‘default’ infectiousness profile.

To infer the epidemic dynamics based on the death data, it is key to characterise accurately the infection-to-death time distribution. We denote the probability mass function of the infection-to-death distribution by , such that is the probability that death occurs j days after infection, conditional on ultimately dying from SARS-CoV-2. The infection-to-death distribution is the convolution of infection-to-onset and onset-to-death distributions, for which separately there are data.

The infection-to-onset period is estimated from cases with well constrained exposure history (e.g. Backer et al., 2020). For consistency with our choice of infectiousness profile, we adopt the infection-to-onset period used by Ganyani et al. (2020) when inferring the generation interval distribution above, namely a gamma distribution with mean 5.2 days and standard deviation 2.8 days. For onset to death, early data from Hubei, China, corrected for epidemic growth, suggested an average time 18 days and standard deviation 8.4 days (Verity et al., 2020). A report published by the UK government provides distributions for the time from symptom onset to death for over 22,000 patients hospitalised in the UK during the first epidemic wave (up August 1st 2020), by age range and sex (Harrison et al., 2020). The aggregate distribution is described by a gamma distribution with mean 14 days (std. dev. 9.8 days), and is stable when re-weighting the age- and sex-dependent distributions to match those of reported deaths in England and Wales over the same period (ONS, 2020), giving some confidence that this distribution is representative for the population we are modelling over the period of study. Similar estimates for the death delay are reported by Sherratt et al. (2021) based on confidential UK data.

We model the infection-to-death distribution as a negative binomial distribution, chosen for an appropriate shape, computing point estimates for its parameters by matching moments to the convolution of the Ganyani et al. (2020) infection-to-onset and Harrison et al. (2020) onset-to-death distributions described above. This leads to the parameters shown in Table 1 and the distribution plotted in Figure 1C. There remains uncertainty, however, in the infection-to-death distribution owing to uncertainty in the infection-to-onset period (see Backer et al., 2020), the censoring effect of unknown symptom onset dates in the hospitalisation data (Harrison et al., 2020), and regional variability (about which we lack data) that may influence the effective average time to death profiles (e.g. Hawryluk et al., 2020). We hence characterise uncertainty in the infection-to-death distribution via priors on the mean and dispersion parameters. We choose priors, shown in Table 1, such that infection-to-death distributions with high prior probability are consistent with the distributions estimated by Harrison et al. (2020) and Verity et al. (2020) time-to-death distributions.

A further key parameter for inferring epidemic dynamics from death data is the infection fatality rate (IFR), δ, which is the risk of death for an infected host, neglecting other host covariates. Seroprevalence of antibody to SARS-CoV-2 — together with evidence regarding the preservation of measurable antibody (Huang and Garcia-Carreras, 2020) — provides an estimate of the integrated exposure history to SARS-Cov-2, and enables estimation of the δ. In the results sections we explore the impact of different assumptions about δ: in §3.1 we let δ be a free parameter; in §3.2 we fix its value to the central estimate for England from (O’Driscoll et al., 2020); and then in §3.3 and §3.4 we try to characterise knowledge about δ with a judicious prior (Table 1) that reflects uncertainty and bias that may arise for various reasons, such as non-representative serological surveys, non-uniform prevalence in different risk groups (e.g. in care homes versus the surrounding community) and waning antibody levels (O’Driscoll et al., 2020).

2.2 Epidemic models

In the following we present several simple epidemic models. The first, called SI_tD, is the central one in this paper and introduced with the goal of being as simple as possible whilst retaining structure by infected age. The subsequent SIRD_ΔD, SIURD and SE²I²U ²RD models are simple models that are not directly structured by infected age. Figure 2 shows schematics of the various models.

Figure 2:

Compartmental structure for models (A) SI_tD, (B) SIRD/SIRD_ΔD model, (C) SIURD, and (D) SE²I²U ²RD, as defined in §2.2. In (B) the D_ΔD compartment indicates deaths that will occur after an additional fixed delay ΔD. The SIRD model is the special case of the SIRD_ΔD model with ΔD = 0. Model (D) includes includes two compartments for each of the exposed (E₁, E₂), infectious (I₁, I₂), and non-infectious (U₁, U₂) states.

Figure 3:

A comparison of the maximum likelihood parameter set SI_tD model predictions for cases in which ρ, I₀, the infection fatality rate, δ, and NB ϕ are inferred and infectivity is constant (herd immunity), which is a special case of the model in which step function parameters t^∗ and α_b are also inferred (2 extra parameters; non-herd immunity). Inferred parameters are shown in the table to the right, together with emerging epidemiological quantities of interest below. The argmin_i{ℛ_i < 1} values of 54 vs. 33 translate to 9th April vs. 19th March 2020, respectively.

2.2.1 A model structured by infected age: SI_tD

We denote by I_i,j the number of individuals who on day i became infected j > 0 days ago; and the total number of infected individuals on day i is thus I_i = Σ_j I_i,j. In this model, between day i and i + 1 the number of new infections is I_i+1,1 = S_iΣ_j λ_ijI_i,j, for suitable λ_ij that in general depends on time i (to reflect changing transmission owing to changes in population mixing intensity, e.g., due to NPIs) and infected age j (to reflect non-constant infectiousness of those infected); and the number of new deaths amongst individuals with infected age j is h_jI_i,j, for suitable h_j. The general infected-age-structured model is therefore

The degrees of freedom in the model need to be controlled via some further modelling choices for the {λ_ij} and {h_j}. We will write

Here, α_i models the population mixing intensity relative to pre-epidemic social behaviour and is subject to the constraint α₁ = 1. In practice α is a composite parameter capturing contact rates, social distancing (including mask wearing) and mobility. A simple model for the impact of introducing NPIs is to enable a sharp reduction in α_i at some change-point time t^∗. Enabling a continuous t^∗ helps in Markov Chain Monte Carlo procedures described later, hence we model α_i to transition from 1 to a new baseline α_b at change-point time t^∗ via in which ⌊·⌋, ⌈·⌉ and 𝟙_(·) respectively denote the standard floor, ceiling and indicator functions. An example of {α_i} is shown in Figure 1A.

The infectiousness profile β_j, defined in §2.1.3, is a probability mass function where Σ_j β_j = 1, so in view of the constraints on {α_i} and {β_j}, the ρ > 0 is included in (2) as an intensity

parameter. Parameter N = S_i + I_i + D_i is the population size, which in this model is constant, because births and non-SARS-CoV-2 death are negligible on the time scales of interest and are thus neglected.

In the language of survival analysis, the {h_j : j ≥ 1} is the hazard rate function for the death of infecteds, such that h_j is the probability that an individual having survived j − 1 days post-infection will die on day j. In terms of the infection-to-death distribution, p_ζ, and its corresponding cumulative distribution function , where δ is the infection fatality rate.

In this model, in contrast to some common epidemic models, including the non-infected-age-structured ones described below, the I denotes individuals who have been “infected” but are not necessarily “infectious”, since individuals of infected age j contribute to infecting susceptibles if and only if β_j > 0. This removes the need for an E variable for “exposed but pre-infectious” individuals, or an R variable for “recovered” individuals.

2.2.2 SIRD and SIRD_ΔD models

A simple model and commonly used model (e.g. Britton, 2020; Lourençco et al., 2020) that does not maintain structure by infected age is as follows.

In this model (and those below), the number of new daily infecteds is I_i+1,1 = λ_iS_iI_i, where and N = S_i + I_i + R_i + D_i. There is a close parallel to (1) but with key differences: infectiousness of infected individuals is constant with respect to infected age (β_j is taken to equal 1), and the hazard of removal from being infected is also constant (h_j is taken to equal some constant θ). This model includes an R variable, because the assumption of constant infectiousness of I individuals necessitates a way other than death for an infected individual to cease being infectious. By analogy to (1), the constant hazard implies that the duration in infected state is geometrically distributed.

The SIRD_ΔD model is a generalisation of the SIRD model that replaces equation (4d) with which introduces a non-negative integer “delay-to-death” parameter, Δ_D. Introducing a fixed delay in this way is a common modelling strategy to make infection-to-recovery and infection-to-death distributions distinct (e.g. Lourencço et al., 2020) and the latter non-exponential. The basic SIRD model is the special case with delay Δ_D = 0. A generalisation of (6), in which Δ_D is not necessarily an integer, is

For example, if Δ_D = 7.2 then 80% of the deaths will have a delay of 7 days and 20% will have a delay of 8 days.

2.2.3 An SIURD model

A different strategy besides incorporating a delay is to incorporate additional model states (see e.g. Royal Society SET-C (2020)). A model variation in this spirit is to incorporate a non-infectious state U that follows infection but precedes recovery or death, i.e., in which ξ is a rate constant.

2.2.4 An SE²I²U²RD model

The same strategy can be extended by including more states, for example an exposed (infected but not yet infectious) state, and by representing states using multiple compartments. Such an approach can enable the model dynamics to mimic the delay to peak infectiousness, and the delay between infection and death, and hence indirectly models infected ages (Hurtado and Kirosingh, 2019). The following model, chosen because it closely matches the approach of Kucharski et al. (2020), uses two compartments for each of the E, I and U states. in which η is a rate constant.

2.2.5 Connection to epidemiological parameters

The foregoing models connect directly to some key epidemiological parameters. A parameter important for characterising whether the epidemic is growing or declining is the time-varying reproduction number, ℛ_i, of which there are multiple variations. We adopt for ℛ_i in this paper the instantaneous reproduction number (Fraser, 2007), which is the average number of secondary infecteds generated by a single infected assuming there are no population level changes in susceptibility or mixing behaviour during their infection. We calculate ℛ_i by performing the sum over infected age j of the number of new infecteds generated by a host at infected age j (Heffernan et al., 2005). For the SI_tD model: where G_j is the survival function of an infected individual associated with the hazard h_j of removal from the I, defined as the probability of not having been removed by day j after infection, which equals . The product β_jG_j is the proportion of the ℛ_i secondary infections that are generated at an infected age j, and therefore when normalised is the (discretised) generation time distribution.

For the SIRD, SIRD_ΔD, SIURD and SE²I²U ²RD models, we compute ℛ_i from the same definition, noting that infecteds must transit through I before reaching D (G_j = 1). Given residence in an infectious state, the number of new infecteds generated per infected (λ_iS_i) is independent of infected-age. The infectiousness profile is β_j = A_j/ Σ (A_j), in which A_j is the proportion of infecteds in an infectious state at infected-age j, which can be derived by considering the disease progression defined in equations (4), (7) or (8) as a Markov process (A). Hence β_j depends on the number of E and I states as well as the parameters governing the total time in the latent (η) and infected classes (θ). However, by construction, the average residence time in an infectious state is θ⁻¹ for each of these models. Hence

Estimates of ℛ_i can be used to infer the benefit of mitigating measures such as NPIs for reducing transmission. Of strategic interest is to infer argmin_i {ℛ_i < 1}, as this indicates the time i when NPIs were sufficient to put the epidemic into decline. Also related and important is the peak daily incidence, max_i{S_i −S_i+1}, in which for each model S_i −S_i+1 is the number of new infecteds on day i. The basic reproduction number, written ℛ₀, is the special case for infection seeded into a fully susceptible population (S_i ≈ N) and prior to any mitigation (α_i = 1).

By design, the SI_tD model explicitly incorporates the infectiousness profile, {β_j}, and the infection-to-death distribution , such that these can be chosen according to clinical data. For the other models in which these aren’t explicit, it is helpful to understand what are the implied {β_j} and . Calculation of these is in A.

2.2.6 Model initial conditions

In this paper, i indexes the number of days since 15 Feb 2020, which is day i = 0. We assume initially zero deaths, D₀ = 0, and zero recovereds, R₀ = 0 (for models including R) and S₀ = N − I₀ for parameter I₀ which is to be inferred. For the infected-age-structured model, it is necessary to specify how the I₀ initial infecteds are distributed by infected age {I_0,j}. When the epidemic is growing exponentially the distribution of infecteds by infected age converges to an equilibrium distribution (see B), thus we assume assume convergence to this equilibrium distribution in the dynamics prior to day i = 0, and for the numerical calculations in this paper we use the equilibrium distribution as the initial condition at i = 0.

2.3 Observation model

The epidemic models above are deterministic. It is common to account for variability in the observed number of daily deaths, , on day i, in mechanistic models of infectious disease transmission via a negative binomial observation model (e.g. Mathews et al., 2007; Cauchemez and Ferguson, 2008). The negative binomial model admits overdispersion, which is often present in count data on cases or deaths from an infectious disease owing to spatial and demographic heterogeneities, or other unmodelled processes (Held et al., 2019). We adopt the negative binomial model, assuming that independently for each i, where NB(μ, ϕ) denotes the negative binomial distribution with mean μ and dispersion parameter ϕ defined such that the variance equals μ + ϕμ² (Robinson and Smyth, 2008).

2.4 Inference methods

We denote by Θ^∗ the free parameters that appear in the respective dynamical models, such that D_i = D_i(Θ^∗), and by Θ = (Θ^∗, ϕ) the vector of all the parameters including in the observation model (11). The data are the daily deaths indexed by time i. We denote by P(𝒟|Θ) the likelihood function for Θ under observation model (11). When adopting a Bayesian approach and specifying a prior distribution P (Θ) on Θ, the posterior distribution for Θ is then

In the results sections below, where we compute point estimates of Θ we do so using the maximum a posteriori (MAP) estimator then the inferred dynamics, and the epidemiological parameters inferred from them (defined in §2.2.5), are based on solutions of the respective epidemic model with . The MAP estimator corresponds to the widely used maximum-likelihood estimator (MLE) if the priors are uninformative, or more generally if P (Θ) is constant on a domain that contains and zero elsewhere. Elsewhere we target the full posterior distribution (12) by sampling from it using Markov chain Monte Carlo (MCMC). We used an adaptive Metropolis-Hastings MCMC algorithm (Haario et al., 2001) which involves, after a warm-up phase, adapting the covariance matrix of a multi-variate Gaussian proposal distribution according to the covariance of the accepted samples. Convergence to the stationary distribution is hastened with an initial maximisation of the posterior density with respect to Θ using a covariance matrix adaptation evolution strategy (CMA-ES) (Hansen et al., 2003). These methods were implemented using the PINTS python package (Clerx et al., 2019).

For priors on Θ, the priors arising from the clinical data are detailed in Table 1 and §2.1.3, in addition to which we specify: ρ ∼ U (1, 10) and I₀ ∼ U (1, 5 × 10⁷) consistent with wide range of possible values for ℛ₀ and number of initial infecteds; t^∗ ∼ 𝒩 (31, σ = 3), where i = 31 corresponds to where the Google mobility data, shown in Figure 1A, first suggest a substantial reduction in mobility; and α_b ∼ U (0, 1) and ϕ ∼ U (0, 1) such that both are uniform on their possible ranges of values. For the SIR-type models, transition rates in equations (4), (7) and (8) have upper limits such that, for example, no more than the entirety of a compartment can transition out in a single time step. Consequently, we take θ ∼ U (0, 1) for the SIRD, SIRDΔD & SIURD models, ξ ∼ U (0, 1) for the SIURD model and θ, η, ξ ∼ U (0, 0.5) for the SE²I²U ²RD model. Due to the additional scaling by 1/θ in the relationship between R_i and ρ for SIR-type models (equation 10) plausible values for ρ are contracted and we use ρ ∼ U (0, 3).

3. Results

3.1 A change in mixing intensity, and not ‘herd immunity’, is necessary to explain the epidemic dynamics

A theory after decline from the initial epidemic peak was that the epidemic dynamics were affected little by NPIs and could be be explained by ‘herd immunity’ (Lourençco et al., 2020), that is, that R_i in (9) had reduced to below 1 because the number of remaining susceptibles, S_i, had sufficiently decreased to curtail the growth. The theory that depletion of susceptibles is adequate to explain the dynamics can be tested by fitting both models to the data; the ‘herd immunity’ theory corresponds to the a special case of the general model that has a step change in α_i but with the restriction of having constant α_i = 1, achieved in the model by setting α_b = 1. This supposes no effect from NPIs, and thus that the decline in the epidemic must be on account of the depletion of susceptibles. In this restricted model the parameter t^∗ is redundant, therefore the restricted model has two fewer parameters than the unrestricted model. To consider this, we use the SI_tD model, with the infectiousness profile and infection-to-death distribution parameters fixed to the values shown in Table 1, and noninformative priors on ρ, I₀, t^∗, α_b, δ, and ϕ, then fit the model to the the England and Wales deaths data described in §2.1.1. We then do likewise for the restricted model, with the extra restriction that α_b = 1 and with t^∗ removed.

The fitted models are shown in Figure 3, from which it is clear that the full model matches well to the data but the restricted model matches very poorly. The difference in the value of the maximised values of the log-posterior, log P (Θ | 𝒟), for the two models is 260, which for models differing, as here, by two degrees of freedom is overwhelming evidence that the restricted model is inadequate.

Some values of epidemiological parameters in the fitted restricted model also seem implausiable; for example, the fitted I₀ is greater than 1 million people, and the IFR is ∼ 4 times smaller than the best estimate from O’Driscoll et al. (2020) — see Figure 3 for all of the inferred parameter values.

The values of the maximised likelihoods, and visual inspection of the fits, make clear that a change in transmission dynamics over time, via α_i in the model, is necessary to explain the epidemic dynamics during the first outbreak. In other words, it was never plausible that ‘herd immunity’ was responsible for the end of the first outbreak, as confirmed by the subsequent resurgence of infections in autumn of 2020.

3.2 Inference for epidemiological parameters is unreliable if based on non infected-age structured models

To investigate the impact of model error on the values and uncertainties of inferred parameters — and in particular whether simpler epidemic models such as SIRD, SIRD_ΔD, SIURD and SE²I²U ²RD can be used to infer accurately the epidemiological parameters — we generate synthetic data from the time-structured SI_tD model (§2.1.1) and observation model (11), with the model parameters as specified in Table 2, then refit both the SI_tD and those other models to compare the inferred parameter values with the true ones.

View this table:

Table 2:

Details of the simulation study described in section §3.2, including parameter values, in addition to those in Table 1, used to create synthetic data from the SI_tD model; and MAP estimates of the model parameters and derived epidemiological parameters from fitting the various models to the synthetic data. The (–) indicate parameters not relevant to the particular model. The derived parameters are: the basic reproduction number R₀; the minimum value of ℛ, min_i{ℛ_i}; the index of the day at which ℛ_i first decreases below 1, argmin_i{ℛ_i < 1}; the maximum number of new infections on any day, max_i {S_i −S_i+1} ; the mean, β_μ, of the infectiousness profile; and the mean of the infection-to-death distribution, . The (A) and (B) tables differ in that (B) involves extra parameters being fixed when the models are fitted, as described in §3.2.

We fit the models, fixing the IFR, δ, to the default value from Table 1 and treat the other parameters as free. Hence the free parameters include those that determine the implied infectiousness profile and the infection-to-death distributions. The data-generating and fitted parameters values are summarised in Table 2A, and results are plotted in Figure 4.

Figure 4:

(A) Synthetic data from the SI_tD model generated using parameters in Tables 1A and 2, and dynamics of fitted SI_tD, SIRD, SIRD_ΔD, SIURD and SE²I²U ²RD models, corresponding to MAP estimates of parameters per Table 2A. (B) Violin plots of the posterior marginals for derived epidemiological parameters under the different models, with dashes representing (min, mean, max) and the vertical blue line showing the true values. (C) Infectiousness profiles and infection-to-death distributions for the fitted models.

Figure 4A shows that each fitted model matches broadly well the deaths data. The SIRD and SIRD_ΔD models provide similar and relatively reasonable fits to the synthetic death data, but the peak daily deaths in these model fits occurs almost one week before the true peak.

Each of the simpler models — SIRD, SIRD_ΔD, SIURD — substantially underestimate ℛ₀ compared to the true value, in each case at least by a factor of 1.6, but estimates from SEI²I²U ²RD are vastly too large and with very high posterior variance; see Fig 4B. The latter also had the largest number of free parameters, so we also considered the case with η fixed to the value 1/5.2, which is an appropriate choice for this parameter in the sense that it fixes the model’s onset-to-infection interval to match with the mean we assumed when setting the default infection-to-death distribution (see Section 2.1.3). Even with this parameter value fixed, the there is little improvement in the inferred values for ℛ₀. During the epidemic decline after introduction of NPIs, the misspecification of the model is less impactful on the inferred values for ℛ_i, but the inferred time, t^∗, at which NPIs are inferred to impact mixing intensity is highly variable between models: for example it is inferred around 14 days too late for the SIRD model, 7 days early for SIRD_ΔD and 9 days late for SIURD (Table 2). Compared to the data-generating model, the inferred peak of new infections, max_iI_i,1, and the inferred timing of this peak, argmaxiI_i,1, are also very inaccurate. For SIRD and SIRD_ΔD models the peak is approximately 36% too low and the timing is respectively 14 days late and 2 days early. For the SIURD model the inferred peak is approximately 43% too high and 9 days too late. Because here we are fixing the IFR and are fitting to death data, all model versions return approximately the same remnant susceptible pool (S_i=end/N ≈ 0.88) by design. This means that from equation (9) if mixing intensity returns to pre-epidemic levels following the first wave (α_i = 1), and all other parameters are fixed, then ℛ_i ≈ 0.88 ℛ₀. This provides an upper bound on the median _i for a subsequent unmitigated outbreak equal to 2.8 in this simulated example. Because the simple SIR-like models underestimate ℛ₀, they also substantially underestimate ℛ_i for a subsequent unmitigated outbreak, suggesting an upper bound of the median ℛ_i ≈ 1.4 (SIRD, SIRD_ΔD) or 1.1 (SIURD) which are highly misleading. In contrast, estimates of the median unmitigated ℛ_i for the SEI²I²U ²RD models are vastly inflated. In the context of influencing policy-making, the errors arising from the simple SIR-like models are unacceptably large. Figure 4c shows that the implied infectiousness profile and infection-to-death distributions for the fitted models were all quite different to the true ones.

In the preceding investigation with synthetic data, the data-generating SI_tD model was at an advantage over the other models through having its infectiousness profile and infection-to-death distribution specified correctly, and without free parameters related to these that needed estimating. To what extent did the other models perform poorly on account of their extra parameters, rather than their model misspecification? To investigate, we fixed the values of those parameters to “optimal” values in the sense of matching moments of their implied infectiousness profile and infection-to-death distribution matched as closely as possible to the true ones, and with the consequence that the number of free parameters is then the same for each model, including for the data-generating SI_tD model. The parameters are summarised in Table 2B and explained as follows.

To match the mean of the infectiousness profile to the true one (recalling that the relationship between this profile and the model parameters for the various models is explained in A), for SIRD, SIRD_ΔD and SIURD models we take the infectivity rate θ = 1/5.2 where, per Table 1, 5.2 days is the mean of the assumed infectiousness profile. For the infection-to-death distribution, matching to the mean of 18.69 days for SIRD_ΔD entails taking entails taking Δ_D = 18.69 −1 − 5.2 = 12.49 and for SIURD taking ξ = 1/12.49 respectively. For the SE²I²U ²RD model there are many choices for η and θ that yield an implied infectiousness profile (A) with appropriate mean, however in general the variance is higher than that for the clinical infectiousness profile (§2.1.3). We therefore fix η = θ, which minimises the variance in the implied infectiousness profile. We then select fixed values for θ & ξ to match the means of the clinical infectiousness and time-to-death profiles to those of the implied profiles.

The results of this second fitting, with extra fixed parameters, are shown in and the results are shown in Table 2B and Figure 5. For SE²I²U ²RD in particular the results are much improved. Figure 5B shows that the derived epidemiological parameters are much more reliably inferred, although ℛ₀ is still slightly underestimated and argmin_i {ℛ_i < 1} overestimated. The better performance is because the the inclusion of the E state in this model, with the duplicated compartments for E, I and U, enables the fitted infectiousness profile and infection-to-death distribution to match reasonably closely to the true ones, as shown in Figure 5c. For each of the other models this is not the case, there is a much larger mismatch, as shown in Figure 5c, and a consequence is that the values of inferred epidemiological parameters, particularly ℛ₀, remain highly unreliable.

Figure 5:

(A) Synthetic data from the SI_tD model (§2.2.1) generated using parameters in Tables 1 and 2; and dynamics of fitted SI_tD, SIRD, SIRD_ΔD, SIURD and SE²I²U ²RD models for the parameters indicated in Table 2B; this version has additional fixed parameters, whose values are chosen a priori such that for each model the infectiousness profile and infection-to-death distribution, shown in (C), matches as closely as possible to the data-generating ones. (B) Violin plots of the derived epidemiological parameters uncertainties for each model and the synthetic data true values (blue dotted lines).

In summary, a model such as SE²I²U ²RD could plausibly lead to reasonable inference for epidemiological parameters, provided the number of pre-infectious, infectious and post-infectious compartments is judiciously chosen such that the implied infectiousness profile and infection-to-death distribution can match suitably closely the clinical data; though if, as in this paper, inference is based on deaths data (§2.1.3) then it appears important that the parameters of such a model are estimated a priori from the clinical data.

The results in Table 2 & Figures 4 and 5 are generated from a single synthetic data set; however, the results are representative of the large number repetitions performed.

3.3 Bayesian inference leads to small posterior uncertainty when conditioning on the infectiousness profile and infection-to-death distribution

In this section, we adopt a Bayesian approach to characterise the uncertainty in the inferred parameters and predictions. We fixed the infectiousness profile and infection-to-death distribution to their default values from Table 1. The priors for the other parameters are as specified in §2.4, and results for fitting the SI_tD model are shown in Figure 6. Figure 6A shows bi- and uni-variate marginal posterior distributions for the model parameters; these suggest relatively small posterior variance, and small posterior covariance except between ρ and α_b; and I₀ and α_b. Figure 6B shows a good match between the deaths data and the model dynamics based on the maximum a-posteriori (MAP) model parameter values. Figure 6C shows corresponding inferred new infections, which peak in mid-March, owing to the inferred sharp change in α_i, then decline. Posterior distributions for derived epidemiological parameters are shown in Fig 6E. The posterior for ℛ₀ is concentrated in the region ∼3 to 3.5; min_i{R_i} is concentrated around 0.795; almost all the posterior mass for argmin_i{ℛ_i < 1} is on i = 32, which is 18th March; and for max_i{I_i,1} there is somewhat more posterior uncertainty, with values between ∼220,000–400,000 plausible, and the posterior mode at ∼280,000.

Figure 6:

Results from the Bayesian inference described in §3.3. (A) Uni- and bivariate marginal posterior distributions for the model parameters. (B) Daily death data, with superimposed lines showing modelled deaths from 1000 posterior samples (transparent) and MAP parameter values (solid). (C) Inferred daily new infections based on MAP parameter values. (D) Posterior MAP estimate of ℛ_i, with inset showing correspondence between inferred α and average of work-place and transit Google mobility data. (E) Posterior distributions for derived epidemiological parameters.

The change in α_i matches quite closely to the Google mobility data, both in the timing, t^∗, and the extent of the drop, α_b, as shown in Figure 6D. These numbers are consistent with a reduction in mixing throughout March (before the official lockdown was mandated) due to voluntary changes in mixing intensity, perhaps driven by media coverage and government announcements prior to lockdown. In other words, the results suggest the Google mobility data may accurately reflect the timing and drop in population mixing around the period a lockdown was mandated.

Our inferred values of ℛ₀ are slightly higher than estimates based on an exponential growth model fitted to the death data [2.6 (2.4–2.9), Lonergan and Chalmers (2020)], but consistent with other estimates for the UK (Royal Society SET-C, 2020). Estimates of ℛ_i after lockdown are consistent with those from exponential decay models fitted to death data (Lonergan and Chalmers, 2020), and real-time estimates based on reported cases in the period April-June 2020 [see Royal Society SET-C (2020) and references therein] or surveyed contact rates in late March 2020 (Jarvis et al., 2020).

Incorporating uncertainty in the infectiousness profile and infection-to-death distribution leads to misleading conclusions owing to model error

Here we repeat the analysis of the previous section, except this time incorporating uncertainty on the four parameters——describing the infectiousness and infection-to-death distributions. The priors on these parameters are as described in §2.1.3 and summarised in Table 1. Results analogous to Figure 6 are shown in Figure 7. These results show that posterior variances are somewhat larger but not substantially so, although MAP estimates are considerably different.

Figure 7:

Results from the Bayesian inference described in §3.4. The interpretation of (A– E) are as for Fig 6. These results are analogous to those in Fig 6, except here the analysis involved placing priors on—rather than conditioning on—the parameters of the infectiousness profile and infection-to-death distribution.

The MAP parameter gives visually a similarly good fit between model and observed data (Figure 7B), predicting daily new infections to peak in early- to mid-March, after which a sharp drop in new infections is observed (Figure 7C). The inferred mixing intensity in this case still agrees reasonably with the UK Google mobility data (Figure 7D), but is predicted to drop earlier and further than was inferred in §3.3, when the values of those four parameters were instead conditioned upon.

The posteriors for derived parameters show ℛ₀ is distributed largely between 5 and 7 and peaking at ∼ 6; min_i{ℛ_i} is concentrated around 0.72, argmini{ℛ_i < 1} is some time between 11th and 17th March, with posterior peaking on 14th March, and max_i {I_i,1} somewhere between ∼ 250,000–500,000, peaking at ∼ 350,000 (Figure 7E). These posterior distributions are somewhat in conflict with some informative priors (the influence of the priors can be seen in Supplementary Figure S1). For example, the peak infectiousness occurs ∼ 3 days later than estimated by Ganyani et al. (2020), and an ₀ of around 6 is higher than most very early estimates of ℛ₀ (Park et al., 2020). However, reproduction numbers of around 6 have been reported for other European countries (Billah et al., 2020).

A possible explanation for the discrepancy in results between this section and §3.3, and with estimates from clinical data such as Ganyani et al. (2020), is that it is a consequence of model error. To investigate this, we repeated the Bayesian inference of the present section and of §3.3 for the synthetic data shown in Figure 4. Doing so leads to posterior distributions that are consistent with the data-generating parameters, and consistent with each other, with larger variance under the approach of this section in which there is prior uncertainty on the four additional parameters. Results for these cases are in the Supplementary Material (Supplementary Figures S2 and S3). In other words, the results of the Bayesian inference are as we might expect when the data arise from the assumed model, supporting the possibility that the unexpected results for the real data are on account of model error.

If so, what are the possible sources of model error? The negative binomial observation model heavily penalises discrepancies between modelled and observed deaths when the number of daily deaths is small, especially when the over-dispersion parameter, ϕ is small. The MAP estimate of ϕ is indeed small—around 10⁻³—especially so with the extra four degrees of freedom considered in this section, which enable the deterministic part of the model to explain more of the variability in the data. A consequence is that the inference is dominated by the model having to match well the data in the period when the number of daily deaths is small but growing rapidly. This is the period when the deterministic component of the model, which assumes a homogeneously mixing population and neglects stochastic effects in the epidemic process, describes the dynamics least well. In summary: enabling the extra four degrees of freedom in this section appears to lead to an overfitted model, which leads to underestimation of the overdispersion parameter (and variance). The posterior consequently has low variance, but with inference dominated by the dynamics in a period when the model is likely to be least accurate, and the low-variance posterior is located in a region of parameter space that suggests model error is making the results unreliable.

The Supplementary Material contains results of some further exploration of this issue, including where we have fixed ϕ at two larger values, 0.01 and 0.1, (Supplementary Figures S4 and S5), which inflates posterior variance and shifts the MAP estimates to more plausible values, e.g. 4.5 for ℛ₀; and a case with the Negative Binomial observation model replaced by a Gaussian model (Supplementary Figure S6), such that the inference is not dominated by the early period with low prevalence, and for this case again the MAP estimates of parameters are more in accordance with other sources and with the results of §3.3.

4 Summary and discussion

We have explored various simple epidemic models, particularly investigating the importance of accurately characterising the infectiousness profile and the infection-to-death time distribution. These distributions are well characterised in the clinical literature and easy to incorporate into an SIR-type model structured by infected age. The results show that it is essential to incorporate them in models used for inferring the underlying epidemic dynamics from death data. Basic SIR models implicitly mis-specify the infectiousness and infection-to-death distributions and lead to highly misleading inferences about the impact of NPIs, the time and magnitude of peak infections, and the basic reproduction number.

In the particular context of the spring 2020 SARS-CoV-2 outbreak in England and Wales, we used the infected-age-structured model, SI_tD, to compare the hypothesis that the epidemic decline was on account of NPIs versus the hypothesis, which was entertained at the time, that the decline was owing to ‘herd immunity’. When using distributions for the infectious profile and infection-to-death distribution that are drawn from clinical literature, but fitting all the other model parameters, we found very strong evidence in favour of efficacious NPIs, rather than herd immunity, as the explanation for the epidemic decline (Figure 3). This conclusion was in spite of accommodating the possibility of an implausibly low IFR, δ, and high initial number infected, I₀, both of which are to the benefit of the herd immunity hypothesis. This indicates that the death data, combined with infectiousness profile and infection-to-death distributions available from clinical data, enable dismissing of the ‘herd immunity’ hypothesis, without needing to know the IFR very accurately.

The limitations of very basic SIR-type models in mis-specifying the infectiousness profile and the infection-to-death has been pointed out by Keeling and Rohani (2008); Wearing et al. (2005). Our work extends this to consider the impact of mis-specification when the model is used to infer epidemiological parameters and dynamics from data, and particularly when there is an abrupt change in population mixing intensity. Our findings show major errors in the inferred dynamics—for example, timing of changes in population mixing can be wrong by weeks— using basic SIR-type models, suggesting that untangling the impact of NPIs adopted in quick succession (e.g. Dehning et al., 2020) may be especially error-prone if using a simple SIR-like model. Models such as SE²I²U ²RD that incorporate multiple infected states (in this case preinfectious, infectious, post-infectious) and multiple sub-compartments per state can potentially perform well if the number of states, the number of sub-compartments, and the values of rate parameters are suitably chosen, but in our view this entails more complexity and no advantage compared using the SI_tD model.

Our estimates for ℛ₀ are significantly larger when we take a fully Bayesian approach and fit parameters describing the infectiousness profile and infection-to-death distribution. Much of the variation between early estimates from case data has been attributed to assumptions for the generation time distribution (Park et al., 2020), which is closely linked to the infectiousness profile. The need to quantify uncertainty in estimates of ℛ₀ for use in policy-making has been highlighted, for example by Royal Society SET-C (2020). Uncertainty analysis of a spatial agent-based model suggests that parameters that largely determine the infectiousness profile (exposure to onset delays, onset to isolation delays) remain significant sources of overall uncertainty (Edeling et al., 2021). We found that incorporating uncertainty by placing priors with relatively large variance on the four parameters characterising the infectiousness profile and infection-to-death distribution — as opposed to conditioning on them — yielded low-variance posteriors concentrated in implausible regions of parameter space, contrary to our initial expectation that the additional prior uncertainty would largely just inflate the posterior variance. We understand this to be the impact of model error, with the negative binomial observation model leading to inference being dominated by a period in which the deterministic dynamics are least reliable. A tendency of deterministic models to yield overconfident and error-prone estimates of ℛ_i from early incidence data has also been documented (King et al., 2015).

We have argued in favour of an SI_tD model structured by infected age over a basic SIR model, especially when the goal is inference from death data, but this model is itself only a coarse population-level model with many limitations. The model assumes homogeneous mixing, and neglects stochastic effects which might be appreciable when prevalence is low. To keep the model simple, we accommodated only a step change in the population mixing intensity, α_i. For the observation model for deaths, we assumed a negative binomial model and conditional independence between different days, which are strong parametric assumptions upon which the inference can be sensitive. We did not attempt to model subtleties such as how hospital fatality rates may depend on case loads within hospitals (Docherty et al., 2020). Hence there are many possible sources of model error. In the context of inference with this model, we have found that affording too much freedom through high prior variance on parameters enables overfitting of the deterministic part and amplifies the impact of model error. Erring on the side of choosing priors with high variance is therefore not necessarily a conservative strategy, at least when model error is non-negligible.

Data Availability

Open source software to reproduce the results in this paper is available at https://github.com/DGWhittaker/nottingham_covid_modelling

https://github.com/DGWhittaker/nottingham_covid_modelling

Software Availability

Open source software to reproduce the results in this paper is available at https://github.com/DGWhittaker/nottingham_covid_modelling.

Funding

This work was supported by the Wellcome Trust [grant number 212203/Z/18/Z]. DGW, ADH-R and GRM acknowledge support from the Wellcome Trust via a Senior Research Fellowship to GRM. KJB acknowledges support from a University of Nottingham Anne McLaren Fellowship. This research was funded in whole, or in part, by the Wellcome Trust [212203/Z/18/Z]. For the purpose of open access, the author has applied a CC-BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

A Infectiousness profiles and time-to-death distributions for SIRD, SIRD_ΔD and SE²I²U²RD models

For our models without a latent state defined by equations (4) & (7), the proportion of newly infected hosts remaining in an infectious state (I) at infected-age j is simply those who are not removed by the previous time-step A_j = (1 −θ)^j−1, and β_j = θ(1 −θ)^j−1. The probability mass function for death (as before conditioning on death as the outcome of the infection) can be calculated by summing over the probabilities of Markov chains that arrive in D at infected-age j. For the SIRD model:

The distribution of arrival times in D is shifted for the SIRD_ΔD model,

For the SIURD model:

For the slightly more complicated SE²1²U ²RD model described in equation (8) we construct a matrix describing the Markov transitions following infection (Diekmann et al., 2021) where T_n,k is the probability of transitioning from state n to state k. Describing a newly infected individual in E₁ by a state vector i₀ = (1, 0, 0, 0, 0, 0, 0, 0), the proportion of new infections in I₁ at infected-age j is , where T^j indicates the jth power of T. Similarly . Hence A_j = (T^j−1)_1,3+ (T^j−1)_1,4. If denotes the proportion of infections in I at time-step j^′ after arriving in I_i, then , and thus β_j = θ((T^j−1)_1,3 + (T^j−1)_1,4). The probability of death at infected-age j is .

B Pseudo-steady distribution of infecteds by infected age

During a phase of epidemic growth with S_i ≈ N and λ_ij = λ_j constant with respect to i, the distribution of Is by infected age reaches a dynamic equilibrium, and the number of infecteds grows exponentially,

Together these imply I_i+1,j = cI_i,j. Then under model (1), we get, and the solution of these equations for {I_i,j}, after eliminating c, is the equilibrium distribution.

Supplementary Figures

Figure S1:

Comparison of posterior distributions from the ‘fully’ Bayesian inference in which the priors are diffuse (left) and informative (right). In each case, the posteriors (blue) are overlaid on the priors (orange). The right panel corresponds to results shown in Figure 7.

Figure S2:

Bayesian inference of synthetic data from Figure 4 using parameters described in §3.3. Uni- and bivariate marginal posterior distributions for the model parameters are shown, with the true, data-generating values shown as black, dotted lines.

Figure S3:

Bayesian inference of synthetic data from Figure 4 using parameters described in §3.4. Uni- and bivariate marginal posterior distributions for the model parameters are shown, with the true, data-generating values shown as black, dotted lines. These results are analogous to those in Fig S2, except here the analysis involved placing priors on—rather than conditioning on—the parameters of the infectiousness profile and infection-to-death distribution.

Figure S4:

Bayesian inference of real data using the SI_tD model with uncertainty on and ζ_ϕ, with ϕ fixed to 0.01. (Upper panel) Uni- and bivariate marginal posterior distributions for the model parameters. (Lower panel) The interpretation is the same as that of panels (B-D) in Figure 6.

Figure S5:

Bayesian inference of real data using the SI_tD model with and ζ_ϕ, with ϕ fixed to 0.1. (Upper panel) Uni- and bivariate marginal posterior distributions for the model parameters. (Lower panel) The interpretation is the same as that of panels (B-D) in Figure 6.

Figure S6:

Bayesian inference of real data using the SI_tD model with uncertainty on and ζ_ϕ, with Gaussian noise σ fixed to 60. (Upper panel) Uni- and bivariate marginal posterior distributions for the model parameters. (Lower panel) The interpretation is the same as that of panels (B-D) in Figure 6.

Acknowledgements

We thank Frank Ball and Theodore Kypraios for helpful discussions.

Footnotes

↵a joint first authors

References

↵
Ashcroft, P., Huisman, J. S., Lehtinen, S., Bouman, J. A., Althaus, C. L., Regoes, R. R. and Bon-hoeffer, S. (2020) COVID-19 infectivity profile correction. arXiv preprint arxiv:2007.06602.
↵
Backer, J. A., Klinkenberg, D. and Wallinga, J. (2020) Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance, 25. URL: https://www.eurosurveillance.org/content/10.2807/1560-7917.ES.2020.25.5.2000062.
↵
Billah, M. A., Miah, M. M. and Khan, M. N. (2020) Reproductive number of coronavirus: A systematic review and meta-analysis based on global level evidence. PloS one, 15, e0242128.
OpenUrl CrossRef PubMed
↵
Britton, T. (2020) Basic estimation-prediction techniques for covid-19, and a prediction for stockholm. medRxiv. URL: https://www.medrxiv.org/content/early/2020/05/14/2020.04.15.20066050.
↵
Cauchemez, S. and Ferguson, N. M. (2008) Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface, 5, 885–897.
OpenUrl
↵
Clerx, M., Robinson, M., Lambert, B., Lei, C. L., Ghosh, S., Mirams, G. R. and Gavaghan, D. J. (2019) Probabilistic Inference on Noisy Time Series (PINTS). Journal of Open Research Software, 7, 23.
OpenUrl
↵
Dehning, J., Zierenberg, J., Spitzner, F. P., Wibral, M., Neto, J. P., Wilczek, M. and Priesemann, V. (2020) Inferring change points in the spread of covid-19 reveals the effectiveness of interventions. Science, 369. URL: https://science.sciencemag.org/content/369/6500/eabb9789.
↵
Department for Business, Energy & Industrial Strategy and Department for Digital, Culture, Media & Sport (2020) Working safely during coronavirus (COVID-19). URL: https://www.gov.uk/guidance/working-safely-during-coronavirus-covid-19. Accessed November 30, 2020.
↵
DHSC (2020) Daily tests processed and testing capacity (UK): 20 March to 22 September 2020. URL: https://www.gov.uk/government/publications/daily-tests-processed-and-testing-capacity-uk-20-march-to-22-september-2020. Accessed December 7, 2020.
↵
Diekmann, O. and Heesterbeek, J. A. P. (2000) Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation.
Wiley Series in Mathematical & Computational Biology. Wiley.
↵
Diekmann, O., Othmer, H. G., Planque, R. and Bootsma, M. C. (2021) On discrete time epidemic models in Kermack-McKendrick form. medRxiv.
↵
Docherty, A. B., Mulholland, R. H., Lone, N. I., Cheyne, C. P., De Angelis, D., Diaz-Ordaz, K., Donoghue, C., Drake, T. M., Dunning, J., Funk, S., Garcia-Fiñana, M., Girvan, M., Hardwick, H. E., Harrison, J., Ho, A., Hughes, D. M., Keogh, R. H., Kirwan, P. D., Leeming, G., Nguyen-Van-Tam, J. S., Pius, R., Russell, C. D., Spencer, R., Tom, B. D., Turtle, L., Openshaw, P. J., Baillie, J. K., Harrison, E. M. and Semple, M. G. (2020) Changes in uk hospital mortality in the first wave of covid-19: the isaric who clinical characterisation protocol prospective multicenter observational cohort study. medRxiv. URL: https://www.medrxiv.org/content/early/2020/12/22/2020.12.19.20248559.
↵
Dunn, P., Allen, L., Cameron, G. and Alderwick, H. (2020) COVID-19 policy tracker. URL: https://www.health.org.uk/news-and-comment/charts-and-infographics/covid-19-policy-tracker. Accessed December 7, 2020.
↵
Edeling, W., Arabnejad, H., Sinclair, R., Suleimenova, D., Gopalakrishnan, K., Bosak, B., Groen, D., Mahmood, I., Crommelin, D. and Coveney, P. V. (2021) The impact of uncertainty on predictions of the covidsim epidemiological code. Nature Computational Science, 1, 128–135.
OpenUrl
↵
Ferretti, L., Wymant, C., Kendall, M., Zhao, L., Nurtay, A., Abeler-Dörner, L., Parker, M., Bonsall, D. and Fraser, C. (2020) Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science, 368. URL: https://science.sciencemag.org/content/368/6491/eabb6936.
↵
Fraser, C. (2007) Estimating individual and household reproduction numbers in an emerging epidemic. PloS ONE, 2, e758.
OpenUrl CrossRef PubMed
↵
Ganyani, T., Kremer, C., Chen, D., Torneri, A., Faes, C., Wallinga, J. and Hens, N. (2020) Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance, 25. URL: https://www.eurosurveillance.org/content/10.2807/1560-7917.ES.2020.25.17.2000257.
↵
Google (2020) COVID-19 Community Mobility Reports. URL: https://www.google.com/covid19/mobility/. Accessed November 30, 2020.
↵
Haario, H., Saksman, E. and Tamminen, J. (2001) An Adaptive Metropolis Algorithm. Bernoulli, 7, 223–242. URL: https://www.jstor.org/stable/3318737. Publisher: International Statistical Institute (ISI) and Bernoulli Society for Mathematical Statistics and Probability.
OpenUrl CrossRef
↵
Hansen, N., Müller, S. D. and Koumoutsakos, P. (2003) Reducing the Time Complexity of the Derandomized Evolution Strategy with Covariance Matrix Adaptation (CMA-ES). Evolutionary Computation, 11, 1–18.
OpenUrl CrossRef PubMed Web of Science
↵
Harrison, E., Docherty, A. and Semple, C. (2020) COVID-19: time from symptom onset until death in UK hospitalised patients. URL: https://assets.publishing.service.gov.uk. Accessed November 30, 2020.
↵
Hart, W., Maini, P., Yates, C. and Thompson, R. (2020) A theoretical framework for transitioning from patient-level to population-scale epidemiological dynamics: influenza a as a case study. Journal of the Royal Society Interface, 17, 20200230.
OpenUrl
↵
Hawryluk, I., Mellan, T. A., Hoeltgebaum, H., Mishra, S., Schnekenberg, R. P., Whittaker, C., Zhu, H., Gandy, A., Donnelly, C. A., Flaxman, S. and Bhatt, S. (2020) Inference of covid-19 epidemiological distributions from brazilian hospital data. Journal of The Royal Society Interface, 17, 20200596. URL: https://royalsocietypublishing.org/doi/abs/10.1098/rsif.2020.0596.
OpenUrl
↵
He, X., Lau, E. H., Wu, P., Deng, X., Wang, J., Hao, X., Lau, Y. C., Wong, J. Y., Guan, Y., Tan, X. et al. (2020) Temporal dynamics in viral shedding and transmissibility of covid-19. Nature medicine, 26, 672–675.
OpenUrl CrossRef PubMed
↵
Heffernan, J. M., Smith, R. J. and Wahl, L. M. (2005) Perspectives on the basic reproductive ratio. Journal of the Royal Society Interface, 2, 281–293.
OpenUrl
↵
Held, L., Hens, N., Wallinga, J. and O’Neill, P. (2019) Handbook of Infectious Disease Data Analysis. CRC Press LLC, Milton. Available from: ProQuest Ebook Central, accessed 18 December 2020.
Huang, A. and Garcia-Carreras, B.ans Hitchings, M. e. a. (2020) A systematic review of antibody mediated immunity to coronaviruses: kinetics, correlates of protection, and association with severity. Nat Commun, 11, 4704. URL: https://doi.org/10.1038/s41467-020-18450-4.
OpenUrl CrossRef PubMed
↵
Hurtado, P. and Kirosingh, A. (2019) Generalizations of the ‘linear chain trick’: incorporating more flexible dwell time distributions into mean field ode models. J. Math. Biol., 79.
↵
Jarvis, C. I., Van Zandvoort, K., Gimma, A., Prem, K., Klepac, P., Rubin, G. J. and Edmunds, W. J. (2020) Quantifying the impact of physical distance measures on the transmission of covid-19 in the uk. BMC medicine, 18, 1–10.
OpenUrl
↵
Keeling, M. J. and Rohani, P. (2008) Modeling Infectious Diseases in Humans and Animals. Princeton University Press. URL: http://www.jstor.org/stable/j.ctvcm4gk0.
↵
King, A. A., Domenech de Celles, M., Magpantay, F. M. and Rohani, P. (2015) Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to ebola. Proceedings of the Royal Society B: Biological Sciences, 282, 20150347.
OpenUrl CrossRef PubMed
↵
Kissler, S. M., Fauver, J. R., Mack, C., Olesen, S. W., Tai, C., Shiue, K. Y., Kalinich, C. C., Jednak, S., Ott, I. M., Vogels, C. B., Wohlgemuth, J., Weisberger, J., DiFiori, J., Anderson, D. J., Mancell, J., Ho, D. D., Grubaugh, N. D. and Grad, Y. H. (2020) Sars-cov-2 viral dynamics in acute infections. medRxiv. URL: https://www.medrxiv.org/content/early/2020/12/01/2020.10.21.20217042.
↵
Kucharski, A. J., Russell, T. W., Diamond, C., Liu, Y., Edmunds, J., Funk, S., Eggo, R. M., Sun, F., Jit, M., Munday, J. D. et al. (2020) Early dynamics of transmission and control of covid-19: a mathematical modelling study. The lancet infectious diseases, 20, 553–558.
OpenUrl CrossRef PubMed
↵
Lehtinen, S., Ashcroft, P. and Bonhoeffer, S. (2020) On the relationship between serial interval, infectiousness profile and generation time. medRxiv. URL: https://www.medrxiv.org/content/early/2020/09/18/2020.09.18.20197210.
↵
Lonergan, M. and Chalmers, J. D. (2020) Estimates of the ongoing need for social distancing and control measures post-”lockdown” from trajectories of covid-19 cases and mortality. European Respiratory Journal, 56.
↵
Lourençco, J., Pinotti, F., Thompson, C. and Gupta, S. (2020) The impact of host resistance on cumulative mortality and the threshold of herd immunity for sars-cov-2. medRxiv. URL: https://www.medrxiv.org/content/early/2020/10/01/2020.07.15.20154294.
↵
Mathews, J. D., McCaw, C. T., McVernon, J., McBryde, E. S. and McCaw, J. M. (2007) A biological model for influenza transmission: Pandemic planning implications of asymptomatic infection and immunity. PLOS ONE, 2, 1–6. URL: https://doi.org/10.1371/journal.pone.0001220.
OpenUrl CrossRef
↵
ONS (2020) Deaths registered weekly in England and Wales, provisional: week ending 13 November 2020. URL: https://www.ons.gov.uk/. Accessed November 30, 2020.
ONS (2020) Population estimates. URL: https://www.ons.gov.uk/. Accessed November 30, 2020.
↵
O’Driscoll, M., Dos Santos, G. R., Wang, L., Cummings, D. A., Azman, A. S., Paireau, J., Fontanet, A., Cauchemez, S. and Salje, H. (2020) Age-specific mortality and immunity patterns of SARS-CoV-2. Nature, 1–9.
↵
Park, S. W., Bolker, B. M., Champredon, D., Earn, D. J., Li, M., Weitz, J. S., Grenfell, B. T. and Dushoff, J. (2020) Reconciling early-outbreak estimates of the basic reproductive number and its uncertainty: framework and applications to the novel coronavirus (SARS-CoV-2) outbreak. MedRxiv.
↵
Robinson, M. D. and Smyth, G. K. (2008) Small-sample estimation of negative binomial dispersion, with applications to SAGE data. Biostatistics, 9, 321–332. URL: https://academic.oup.com/biostatistics/article/9/2/321/353777. Publisher: Oxford Academic.
OpenUrl CrossRef PubMed Web of Science
↵
Royal Society SET-C (2020) Reproduction number (R) and growth rate (r) of the COVID-19 epidemic in the UK: methods of estimation, data sources, causes of heterogeneity, and use as a guide in policy formulation. URL: https://royalsociety.org. Members of the Royal Society SET-C and members of the SET-C sub-group.
↵
Sherratt, K., Abbott, S., Meakin, S. R., Hellewell, J., Munday, J. D., Bosse, N., working group, C. C.-., Jit, M. and Funk, S. (2021) Exploring surveillance data biases when estimating the reproduction number: with insights into subpopulation transmission of covid-19 in england. Philosophical Transactions of the Royal Society B, 376, 20200283.
OpenUrl
↵
Sun, K., Wang, W., Gao, L., Wang, Y., Luo, K., Ren, L., Zhan, Z., Chen, X., Zhao, S., Huang, Y., Sun, Q., Liu, Z., Litvinova, M., Vespignani, A., Ajelli, M., Viboud, C. and Yu, H. (2020) Transmission heterogeneities, kinetics, and controllability of SARS-CoV-2. Science. URL: https://science.sciencemag.org/content/early/2020/11/23/science.abe2424.
↵
UK Government (2020a) The Health Protection (Coronavirus, Restrictions) (England) Regulations 2020. URL: https://www.legislation.gov.uk/uksi/2020/350/regulation/6/2020-03-26. Accessed November 30, 2020.— (2020b) The Health Protection (Coronavirus, Wearing of Face Coverings in a Relevant Place) (England) Regulations 2020. URL: https://www.legislation.gov.uk/uksi/2020/791/schedule. Accessed November 30, 2020.
↵
Verity, R., Okell, L. C., Dorigatti, I., Winskill, P., Whittaker, C., Imai, N., Cuomo-Dannenburg, G., Thompson, H., Walker, P. G. T., Fu, H., Dighe, A., Griffin, J. T., Baguelin, M., Bhatia, S., Boonyasiri, A., Cori, A., Cucunubá, Z., FitzJohn, R., Gaythorpe, K., Green, W., Hamlet, A., Hinsley, W., Laydon, D., Nedjati-Gilani, G., Riley, S., van Elsland, S., Volz, E., Wang, H., Wang, Y., Xi, X., Donnelly, C. A., Ghani, A. C. and Ferguson, N. M. (2020) Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet. Infectious diseases, 20, 669–677.
OpenUrl CrossRef PubMed
↵
Wearing, H. J., Rohani, P. and Keeling, M. J. (2005) Appropriate models for the management of infectious diseases. PLoS medicine, 2, e174.
OpenUrl

View the discussion thread.

Posted July 02, 2022.

Download PDF

Data/Code

Citation Tools

Subject Area

Public and Global Health

Subject Areas

All Articles

Addiction Medicine (349)
Allergy and Immunology (668)
Allergy and Immunology (668)
Anesthesia (181)
Cardiovascular Medicine (2648)
Dentistry and Oral Medicine (316)
Dermatology (223)
Emergency Medicine (399)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
Epidemiology (12228)
Forensic Medicine (10)
Gastroenterology (759)
Genetic and Genomic Medicine (4103)
Geriatric Medicine (387)
Health Economics (680)
Health Informatics (2657)
Health Policy (1005)
Health Systems and Quality Improvement (985)
Hematology (363)
HIV/AIDS (851)
Infectious Diseases (except HIV/AIDS) (13695)
Intensive Care and Critical Care Medicine (797)
Medical Education (399)
Medical Ethics (109)
Nephrology (436)
Neurology (3882)
Nursing (209)
Nutrition (577)
Obstetrics and Gynecology (739)
Occupational and Environmental Health (695)
Oncology (2030)
Ophthalmology (585)
Orthopedics (240)
Otolaryngology (306)
Pain Medicine (250)
Palliative Medicine (75)
Pathology (473)
Pediatrics (1115)
Pharmacology and Therapeutics (466)
Primary Care Research (452)
Psychiatry and Clinical Psychology (3432)
Public and Global Health (6527)
Radiology and Imaging (1403)
Rehabilitation Medicine and Physical Therapy (814)
Respiratory Medicine (871)
Rheumatology (409)
Sexual and Reproductive Health (410)
Sports Medicine (342)
Surgery (448)
Toxicology (53)
Transplantation (185)
Urology (165)

[1] ↵
Ashcroft, P., Huisman, J. S., Lehtinen, S., Bouman, J. A., Althaus, C. L., Regoes, R. R. and Bon-hoeffer, S. (2020) COVID-19 infectivity profile correction. arXiv preprint arxiv:2007.06602.

[2] ↵
Backer, J. A., Klinkenberg, D. and Wallinga, J. (2020) Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance, 25. URL: https://www.eurosurveillance.org/content/10.2807/1560-7917.ES.2020.25.5.2000062.

[3] ↵
Billah, M. A., Miah, M. M. and Khan, M. N. (2020) Reproductive number of coronavirus: A systematic review and meta-analysis based on global level evidence. PloS one, 15, e0242128.
OpenUrl CrossRef PubMed

[4] ↵
Britton, T. (2020) Basic estimation-prediction techniques for covid-19, and a prediction for stockholm. medRxiv. URL: https://www.medrxiv.org/content/early/2020/05/14/2020.04.15.20066050.

[5] ↵
Cauchemez, S. and Ferguson, N. M. (2008) Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface, 5, 885–897.
OpenUrl

[6] ↵
Clerx, M., Robinson, M., Lambert, B., Lei, C. L., Ghosh, S., Mirams, G. R. and Gavaghan, D. J. (2019) Probabilistic Inference on Noisy Time Series (PINTS). Journal of Open Research Software, 7, 23.
OpenUrl

[7] ↵
Dehning, J., Zierenberg, J., Spitzner, F. P., Wibral, M., Neto, J. P., Wilczek, M. and Priesemann, V. (2020) Inferring change points in the spread of covid-19 reveals the effectiveness of interventions. Science, 369. URL: https://science.sciencemag.org/content/369/6500/eabb9789.

[8] ↵
Department for Business, Energy & Industrial Strategy and Department for Digital, Culture, Media & Sport (2020) Working safely during coronavirus (COVID-19). URL: https://www.gov.uk/guidance/working-safely-during-coronavirus-covid-19. Accessed November 30, 2020.

[9] ↵
DHSC (2020) Daily tests processed and testing capacity (UK): 20 March to 22 September 2020. URL: https://www.gov.uk/government/publications/daily-tests-processed-and-testing-capacity-uk-20-march-to-22-september-2020. Accessed December 7, 2020.

[10] ↵
Diekmann, O. and Heesterbeek, J. A. P. (2000) Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation.

[11] Wiley Series in Mathematical & Computational Biology. Wiley.

[12] ↵
Diekmann, O., Othmer, H. G., Planque, R. and Bootsma, M. C. (2021) On discrete time epidemic models in Kermack-McKendrick form. medRxiv.

[13] ↵
Docherty, A. B., Mulholland, R. H., Lone, N. I., Cheyne, C. P., De Angelis, D., Diaz-Ordaz, K., Donoghue, C., Drake, T. M., Dunning, J., Funk, S., Garcia-Fiñana, M., Girvan, M., Hardwick, H. E., Harrison, J., Ho, A., Hughes, D. M., Keogh, R. H., Kirwan, P. D., Leeming, G., Nguyen-Van-Tam, J. S., Pius, R., Russell, C. D., Spencer, R., Tom, B. D., Turtle, L., Openshaw, P. J., Baillie, J. K., Harrison, E. M. and Semple, M. G. (2020) Changes in uk hospital mortality in the first wave of covid-19: the isaric who clinical characterisation protocol prospective multicenter observational cohort study. medRxiv. URL: https://www.medrxiv.org/content/early/2020/12/22/2020.12.19.20248559.

[14] ↵
Dunn, P., Allen, L., Cameron, G. and Alderwick, H. (2020) COVID-19 policy tracker. URL: https://www.health.org.uk/news-and-comment/charts-and-infographics/covid-19-policy-tracker. Accessed December 7, 2020.

[15] ↵
Edeling, W., Arabnejad, H., Sinclair, R., Suleimenova, D., Gopalakrishnan, K., Bosak, B., Groen, D., Mahmood, I., Crommelin, D. and Coveney, P. V. (2021) The impact of uncertainty on predictions of the covidsim epidemiological code. Nature Computational Science, 1, 128–135.
OpenUrl

[16] ↵
Ferretti, L., Wymant, C., Kendall, M., Zhao, L., Nurtay, A., Abeler-Dörner, L., Parker, M., Bonsall, D. and Fraser, C. (2020) Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science, 368. URL: https://science.sciencemag.org/content/368/6491/eabb6936.

[17] ↵
Fraser, C. (2007) Estimating individual and household reproduction numbers in an emerging epidemic. PloS ONE, 2, e758.
OpenUrl CrossRef PubMed

[18] ↵
Ganyani, T., Kremer, C., Chen, D., Torneri, A., Faes, C., Wallinga, J. and Hens, N. (2020) Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance, 25. URL: https://www.eurosurveillance.org/content/10.2807/1560-7917.ES.2020.25.17.2000257.

[19] ↵
Google (2020) COVID-19 Community Mobility Reports. URL: https://www.google.com/covid19/mobility/. Accessed November 30, 2020.

[20] ↵
Haario, H., Saksman, E. and Tamminen, J. (2001) An Adaptive Metropolis Algorithm. Bernoulli, 7, 223–242. URL: https://www.jstor.org/stable/3318737. Publisher: International Statistical Institute (ISI) and Bernoulli Society for Mathematical Statistics and Probability.
OpenUrl CrossRef

[21] ↵
Hansen, N., Müller, S. D. and Koumoutsakos, P. (2003) Reducing the Time Complexity of the Derandomized Evolution Strategy with Covariance Matrix Adaptation (CMA-ES). Evolutionary Computation, 11, 1–18.
OpenUrl CrossRef PubMed Web of Science

[22] ↵
Harrison, E., Docherty, A. and Semple, C. (2020) COVID-19: time from symptom onset until death in UK hospitalised patients. URL: https://assets.publishing.service.gov.uk. Accessed November 30, 2020.

[23] ↵
Hart, W., Maini, P., Yates, C. and Thompson, R. (2020) A theoretical framework for transitioning from patient-level to population-scale epidemiological dynamics: influenza a as a case study. Journal of the Royal Society Interface, 17, 20200230.
OpenUrl

[24] ↵
Hawryluk, I., Mellan, T. A., Hoeltgebaum, H., Mishra, S., Schnekenberg, R. P., Whittaker, C., Zhu, H., Gandy, A., Donnelly, C. A., Flaxman, S. and Bhatt, S. (2020) Inference of covid-19 epidemiological distributions from brazilian hospital data. Journal of The Royal Society Interface, 17, 20200596. URL: https://royalsocietypublishing.org/doi/abs/10.1098/rsif.2020.0596.
OpenUrl

[25] ↵
He, X., Lau, E. H., Wu, P., Deng, X., Wang, J., Hao, X., Lau, Y. C., Wong, J. Y., Guan, Y., Tan, X. et al. (2020) Temporal dynamics in viral shedding and transmissibility of covid-19. Nature medicine, 26, 672–675.
OpenUrl CrossRef PubMed

[26] ↵
Heffernan, J. M., Smith, R. J. and Wahl, L. M. (2005) Perspectives on the basic reproductive ratio. Journal of the Royal Society Interface, 2, 281–293.
OpenUrl

[27] ↵
Held, L., Hens, N., Wallinga, J. and O’Neill, P. (2019) Handbook of Infectious Disease Data Analysis. CRC Press LLC, Milton. Available from: ProQuest Ebook Central, accessed 18 December 2020.

[28] Huang, A. and Garcia-Carreras, B.ans Hitchings, M. e. a. (2020) A systematic review of antibody mediated immunity to coronaviruses: kinetics, correlates of protection, and association with severity. Nat Commun, 11, 4704. URL: https://doi.org/10.1038/s41467-020-18450-4.
OpenUrl CrossRef PubMed

[29] ↵
Hurtado, P. and Kirosingh, A. (2019) Generalizations of the ‘linear chain trick’: incorporating more flexible dwell time distributions into mean field ode models. J. Math. Biol., 79.

[30] ↵
Jarvis, C. I., Van Zandvoort, K., Gimma, A., Prem, K., Klepac, P., Rubin, G. J. and Edmunds, W. J. (2020) Quantifying the impact of physical distance measures on the transmission of covid-19 in the uk. BMC medicine, 18, 1–10.
OpenUrl

[31] ↵
Keeling, M. J. and Rohani, P. (2008) Modeling Infectious Diseases in Humans and Animals. Princeton University Press. URL: http://www.jstor.org/stable/j.ctvcm4gk0.

[32] ↵
King, A. A., Domenech de Celles, M., Magpantay, F. M. and Rohani, P. (2015) Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to ebola. Proceedings of the Royal Society B: Biological Sciences, 282, 20150347.
OpenUrl CrossRef PubMed

[33] ↵
Kissler, S. M., Fauver, J. R., Mack, C., Olesen, S. W., Tai, C., Shiue, K. Y., Kalinich, C. C., Jednak, S., Ott, I. M., Vogels, C. B., Wohlgemuth, J., Weisberger, J., DiFiori, J., Anderson, D. J., Mancell, J., Ho, D. D., Grubaugh, N. D. and Grad, Y. H. (2020) Sars-cov-2 viral dynamics in acute infections. medRxiv. URL: https://www.medrxiv.org/content/early/2020/12/01/2020.10.21.20217042.

[34] ↵
Kucharski, A. J., Russell, T. W., Diamond, C., Liu, Y., Edmunds, J., Funk, S., Eggo, R. M., Sun, F., Jit, M., Munday, J. D. et al. (2020) Early dynamics of transmission and control of covid-19: a mathematical modelling study. The lancet infectious diseases, 20, 553–558.
OpenUrl CrossRef PubMed

[35] ↵
Lehtinen, S., Ashcroft, P. and Bonhoeffer, S. (2020) On the relationship between serial interval, infectiousness profile and generation time. medRxiv. URL: https://www.medrxiv.org/content/early/2020/09/18/2020.09.18.20197210.

[36] ↵
Lonergan, M. and Chalmers, J. D. (2020) Estimates of the ongoing need for social distancing and control measures post-”lockdown” from trajectories of covid-19 cases and mortality. European Respiratory Journal, 56.

[37] ↵
Lourençco, J., Pinotti, F., Thompson, C. and Gupta, S. (2020) The impact of host resistance on cumulative mortality and the threshold of herd immunity for sars-cov-2. medRxiv. URL: https://www.medrxiv.org/content/early/2020/10/01/2020.07.15.20154294.

[38] ↵
Mathews, J. D., McCaw, C. T., McVernon, J., McBryde, E. S. and McCaw, J. M. (2007) A biological model for influenza transmission: Pandemic planning implications of asymptomatic infection and immunity. PLOS ONE, 2, 1–6. URL: https://doi.org/10.1371/journal.pone.0001220.
OpenUrl CrossRef

[39] ↵
ONS (2020) Deaths registered weekly in England and Wales, provisional: week ending 13 November 2020. URL: https://www.ons.gov.uk/. Accessed November 30, 2020.

[40] ONS (2020) Population estimates. URL: https://www.ons.gov.uk/. Accessed November 30, 2020.

[41] ↵
O’Driscoll, M., Dos Santos, G. R., Wang, L., Cummings, D. A., Azman, A. S., Paireau, J., Fontanet, A., Cauchemez, S. and Salje, H. (2020) Age-specific mortality and immunity patterns of SARS-CoV-2. Nature, 1–9.

[42] ↵
Park, S. W., Bolker, B. M., Champredon, D., Earn, D. J., Li, M., Weitz, J. S., Grenfell, B. T. and Dushoff, J. (2020) Reconciling early-outbreak estimates of the basic reproductive number and its uncertainty: framework and applications to the novel coronavirus (SARS-CoV-2) outbreak. MedRxiv.

[43] ↵
Robinson, M. D. and Smyth, G. K. (2008) Small-sample estimation of negative binomial dispersion, with applications to SAGE data. Biostatistics, 9, 321–332. URL: https://academic.oup.com/biostatistics/article/9/2/321/353777. Publisher: Oxford Academic.
OpenUrl CrossRef PubMed Web of Science

[44] ↵
Royal Society SET-C (2020) Reproduction number (R) and growth rate (r) of the COVID-19 epidemic in the UK: methods of estimation, data sources, causes of heterogeneity, and use as a guide in policy formulation. URL: https://royalsociety.org. Members of the Royal Society SET-C and members of the SET-C sub-group.

[45] ↵
Sherratt, K., Abbott, S., Meakin, S. R., Hellewell, J., Munday, J. D., Bosse, N., working group, C. C.-., Jit, M. and Funk, S. (2021) Exploring surveillance data biases when estimating the reproduction number: with insights into subpopulation transmission of covid-19 in england. Philosophical Transactions of the Royal Society B, 376, 20200283.
OpenUrl

[46] ↵
Sun, K., Wang, W., Gao, L., Wang, Y., Luo, K., Ren, L., Zhan, Z., Chen, X., Zhao, S., Huang, Y., Sun, Q., Liu, Z., Litvinova, M., Vespignani, A., Ajelli, M., Viboud, C. and Yu, H. (2020) Transmission heterogeneities, kinetics, and controllability of SARS-CoV-2. Science. URL: https://science.sciencemag.org/content/early/2020/11/23/science.abe2424.

[47] ↵
UK Government (2020a) The Health Protection (Coronavirus, Restrictions) (England) Regulations 2020. URL: https://www.legislation.gov.uk/uksi/2020/350/regulation/6/2020-03-26. Accessed November 30, 2020.— (2020b) The Health Protection (Coronavirus, Wearing of Face Coverings in a Relevant Place) (England) Regulations 2020. URL: https://www.legislation.gov.uk/uksi/2020/791/schedule. Accessed November 30, 2020.

[48] ↵
Verity, R., Okell, L. C., Dorigatti, I., Winskill, P., Whittaker, C., Imai, N., Cuomo-Dannenburg, G., Thompson, H., Walker, P. G. T., Fu, H., Dighe, A., Griffin, J. T., Baguelin, M., Bhatia, S., Boonyasiri, A., Cori, A., Cucunubá, Z., FitzJohn, R., Gaythorpe, K., Green, W., Hamlet, A., Hinsley, W., Laydon, D., Nedjati-Gilani, G., Riley, S., van Elsland, S., Volz, E., Wang, H., Wang, Y., Xi, X., Donnelly, C. A., Ghani, A. C. and Ferguson, N. M. (2020) Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet. Infectious diseases, 20, 669–677.
OpenUrl CrossRef PubMed

[49] ↵
Wearing, H. J., Rohani, P. and Keeling, M. J. (2005) Appropriate models for the management of infectious diseases. PLoS medicine, 2, e174.
OpenUrl