Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Data Driven High Resolution Modeling and Spatial Analyses of the COVID-19 Pandemic in Germany

View ORCID ProfileLennart Schüler, View ORCID ProfileJustin M. Calabrese, View ORCID ProfileSabine Attinger
doi: https://doi.org/10.1101/2021.01.21.21250215
Lennart Schüler
1Institute of Earth and Environmental Sciences, University Potsdam, Potsdam, Germany
2Dept. of Computational Hydrosystems, UFZ – Helmholtz Centre for Environmental Research, Leipzig, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lennart Schüler
  • For correspondence: lennart.schueler{at}ufz.de
Justin M. Calabrese
3Center for Advanced Systems Understanding (CASUS), Görlitz, Germany
4Helmholtz-Zentrum Dresden Rossendorf (HZDR), Dresden, Germany
5Dept. of Ecological Modelling, UFZ – Helmholtz Centre for Environmental Research, Leipzig, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Justin M. Calabrese
Sabine Attinger
1Institute of Earth and Environmental Sciences, University Potsdam, Potsdam, Germany
2Dept. of Computational Hydrosystems, UFZ – Helmholtz Centre for Environmental Research, Leipzig, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sabine Attinger
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The SARS-CoV-2 virus has spread around the world with over 90 million infections to date, and currently many countries are fighting the second wave of infections. With neither sufficient vaccination capacity nor effective medication, non-pharmaceutical interventions (NPIs) remain the measure of choice. However, NPIs place a great burden on society, the mental health of individuals, and economics. Therefore the cost/benefit ratio must be carefully balanced and a target-oriented small-scale implementation of these NPIs could help achieve this balance. To this end, we introduce a modified SEIR-class compartment model and parametrize it locally for all 412 districts of Germany. The NPIs are modeled at district level by time varying contact rates. This high spatial resolution makes it possible to apply geostatistical methods to analyse the spatial patterns of the pandemic in Germany and to compare the results of different spatial resolutions. We find that the modified SEIR model can successfully be fitted to the COVID-19 cases in German districts, states, and also nationwide. We propose the correlation length as a further measure, besides the weekly incidence rates, to describe the current situation of the epidemic.

1 Introduction

The SARS-CoV-2 virus was first detected in China in late 2019, and then rapidly spread around the world. By March 2020, COVID-19, the disease caused by SARS-CoV-2, was officially declared a pandemic by the World Health Organization (Cucinotta et al., 2020). To date, the pandemic has resulted in devastating consequences to life, health, and national economies. The novelty of the SARS-CoV-2 virus, coupled with the comparative lack of clinical research on coronaviruses in general, has left Non-Pharmaceutical Interventions (NPIs), such as masks, lockdowns, and social distancing measures, as the main weapons in the fight against COVID-19. Indeed, NPIs have so far played an important role in modulating the dynamics of the pandemic (Ferguson et al., 2020).

In Europe and other regions, NPIs during the first wave of COVID-19 were typically implemented at the national level or at the state level in some federations. In Germany for example, the first COVID-19 case was reported on 2020-01-27 and the first NPIs were imposed on 2020-17-03, with a lockdown of most public places, including school closures. This was followed two weeks later by a ban on meeting with too many people outside of one’s own household, and the number of people simultaneously allowed in supermarkets was restricted. These measures were largely effective (Khailaie et al., 2020), and the first COVID-19 wave peaked in Germany at the beginning of April 2020. Relaxations of the nationwide NPIs began by the third week of April, and by May 2020, the first wave in Germany was effectively over. While this type of broad-scale NPI deployment strategy was successful, it was also extremely costly and brought with it many unintended consequences. For example, schools and universities across Germany were completely closed during the lockdown (Nicola et al., 2020). Additionally, the price and calendar adjusted GDP shrank by 9.7 % in the second quarter of 2020 relative to the same period in 2019 (Statistisches Bundesamt [Destatis], 2020).

Europe is currently engulfed in a second wave of COVID-19, and despite many advances since the first wave crested, definitive solutions, such as sufficient vaccination capacity, remain elusive. At the same time, the devastating economic, social, and political consequences of nationwide lockdowns have become increasingly apparent. Uncoordinated smaller scale measures failed to keep the virus in check in the fall of 2020. The result has been the reimplementation of nationwide lockdowns. On the one hand, this failure could be interpreted as evidence against the efficacy of local measures. On the other hand, it provides an opportunity to develop more comprehensive strategies for applying NPIs at different scales (e.g., local, regional, national), and for identifying the conditions which require ramping control efforts up to larger scales.

It is therefore imperative that we learn as much as possible about the scale-specific effects of strong NPIs from the first COVID-19 wave. A key limitation is that most analyses so far have focused on the national level (e.g. Khailaie et al., 2020; Barbarossa et al., 2020), and thus have not been able to resolve local trends. An example for such a local or regional trend is the city of Jena which was the first district to implement mandatory mask-wearing. This measure seems to have effectively and very early stopped the disease (Mitze et al., 2020). Another example is the largest superspreader event in Germany to date in a meat processing plant, which mainly affected only two districts (Guenther et al., 2020). Here, we leverage data from the Robert Koch Institute (RKI - Homepage, 2020), reported for each of the 412 administrative districts (i.e., counties) in Germany, to quantify local effects of NPIs from the first COVID-19 wave and the time immediately thereafter. Specifically, we fit modified SEIR-class compartment models to the RKI data at the district level, and quantify changes in the estimated contact rate for each district across time periods defined by the start and end dates of the various NPIs that were implemented. This more granular modeling of the data also facilitates analysis of the dynamics of spatial patterns of infection clusters, which can yield additional insights into how COVID-19 in Germany responded to NPIs. Finally, our framework also permits a direct, multiscale comparison to highlight how the inferences about NPI effectiveness that can be gleaned depend on the scale of analysis.

2 Methods

In Germany, the Robert Koch Institute (RKI - Homepage, 2020) is responsible for gathering and publishing data on COVID-19. Germany is divided into 401 districts, of which one is Lake Constance and has no residents. The RKI further divides the most populous district of Berlin into its 12 boroughs. For simplicity, these 412 areas for which the RKI publishes data will be called districts from now on. The German reporting obligation of all positive COVID-19 tests to the RKI and the fact that these data are published on the district level makes it possible to model the epidemic at this comparatively high spatial resolution. The population size of the districts is taken from the Federal Statistical Office of Germany (Statistisches Bundesamt [Destatis], 2020).

The COVID-19 epidemic in Germany is modeled using a compartmental epidemiological model (Kermack et al., 1927) on the district level. Within each district, the population is divided into Susceptible, Exposed, Infectious,Recovered, and Dead compartments, with the total population being the sum of the individuals in the compartments minus the COVID-19 related deaths N = S + E + I + R − D. To keep the number of parameters as low as possible, the exposed individuals and the asymptomatic cases are handled together in one compartment. The modified SEIRD model is formulated as Embedded Image Embedded Image Embedded Image Embedded Image Embedded Image It is assumed that the asymptomatic cases can recover, but not die due to COVID-19, thus equation (5) is only coupled to equation (3). A graphical visualization of the system of equations (1) - (5) is shown in Figure 1.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1:

A visual representation of Equations (1) - (5), with the different compartments shown as boxes and the transfer rates as arrows. The data gathered by the RKI are shown as dotted arrows, instead of dashed ones. The color coding of the different compartments is kept consistently throughout this manuscript.

The NPIs are modeled by a piecewise constant contact rate β(t), which is allowed to change at the dates of the NPI implementations. Without loss of generality, this assumption is reformulated to constant contact rates βj, with j = 1, 2, …, M + 1 and M being the total number of NPIs. βj is exchanged by βj+1 at the date of the j-th NPI.

Because the latent and asymptomatic cases are lumped together into one compartment, parts of the model structure and some of its parameters cannot easily be mapped to quantities which can actually be measured, like the mean time it takes for the asymptomatic cases to recover. This decision was made in order to keep the number of parameters as low as possible, but at the same time, to have a model, that is flexible enough to reproduce the course of the COVID-19 epidemic across different scales and all districts in Germany.

The assumptions made for SIR-type models break down for small populations. Because the number of cases per day is often already low on the district level without separating the cases into different age groups, we neglect the age distribution of the population to avoid further reducing the number of individuals in the respective compartments.

Using the next generation matrix approach (Diekmann et al., 2010), the reproduction number for the SEIRD-model can be calculated yielding Embedded Image The system of non-linear ordinary equations (1) - (5) is numerically solved using an explicit Runge-Kutta method of order 5(4), derived by Dormand et al., 1980 and implemented by SciPy 1.0 Contributors et al., 2020.

The M +5 unknown parameters θ = (α, β1, β2, … βj, γ, κ, µ)T in equations (1) - (5) are estimated using Bayesian inference. For the evidence, the number of laboratory-confirmed cases per day Iobs and the number of deaths related to COVID-19 per day Dobs, gathered by the RKI, are used. These data are grouped together as Xobs = (Iobs, Dobs)T. Translating Xobs to the SEIRD-model (1) - (5), the rate of positively tested cases per day is expressed as Iobs ≙ αE and the rate of COVID-19 related deaths as Dobs ≙ µI, with X = (αE, µI)T. As the objective function, the negative root-mean-square error L = −E((X − Xobs)2)1/2 is used.

The parameter inference is set up for all of the 412 districts and the sampling is repeated 200000 times for each of them. The prior distributions of the parameters are uniform P (θ) ∼ U and the sampling is done using the Metropolis-Hastings MCMC algorithm (Metropolis et al., 1953; Hastings, 1970). The first 10% of the simulations are used for classical Monte Carlo sampling for the burn-in period. From this, the best parameter set is used as the initial parameter set for the Metropolis sampler. 30 MCMC chains are used for convergence checks. SPOTPY (Houska et al., 2015) is used for the implementation of the parameter inference.

The RKI gathers and updates its data on the COVID-19 epidemic once a day. These data are downloaded and preprocessed in order to use it for the evidence in the Bayesian framework. Next, the parameter inference is executed for all districts in parallel. Finally, the analyses are done and the plots are created. All these steps are part of a fully automatised workflow on the HPC Cluster EVE (Schnicke et al., 2020) at the UFZ Leipzig.

For a comparison with the much more common approach of modeling an epidemic on a national level, the results from all fitted district level simulations are aggregated, first to the level of states within Germany, and subsequently to the national level. This yields three different spatial resolutions that can then be compared: 1) district, 2) state, and 3) national. Additionally, the same SEIRD-model (1) - (5), which was applied to the districts, is also parametrized for the national case and death rates for resolution 3) and for the 16 individual German states for resolution 2).

We performed sensitivity analyses to better understand the model behavior using the extended Fourier amplitude sensitivity test (FAST) algorithm (Saltelli et al., 1999). This method is a variance-based global sensitivity analysis taking parameter interactions into account and is implemented by SPOTPY (Houska et al., 2015).

The relatively high spatial resolution of German districts makes it possible to use geostatistical methods to identify spatial correlation structures. The (semi-)variogram is a function describing the type, range, and strength of these spatial correlations. If only few and spatially separated superspreader events take place in Germany, we expect to see a high correlation range but a low correlation strength, because all the districts with low infection numbers are highly correlated over a large area. But if a superspreader event causes a spreading of infections to neighboring districts and a map of the case numbers on a district level would be plotted, this map would look very patchy, with clusters of high case numbers next to clusters of low case numbers. This would be reflected in a variogram with shorter correlation lengths and a higher correlation strength. The semi-variogram is defined as Embedded Image with z being the quantity of interest (in this case, the number of individuals), with Embedded Image being the bins or the distances in which data points are grouped, and N (rk) being the number of values in the respective bin (Matheron, 1963; Rubin, 2003). The variograms are calculated and estimated with GSTools (Schüler et al., 2020). For the calculation of the variograms, first the reported cases are accumulated over the periods of the NPIs, corresponding to the contact rates βj. Then, for each period an empirical variogram is calculated and a variogram model is fitted to it. For all empirical variograms, the best fit was achieved with exponential models Embedded Image with σ2 being the correlation strength or simply the variance and λ being the correlation range or length.

3 Results

Visualizing the cumulative reported cases exemplarily for the period of the second NPI on 2020-04-02 to the third NPI on 2020-04-20 on a national, a state, and a district level in Figure 2 shows that reported cases are distributed very inhomogeneously. On the state level one can see that there is a gradient from south to north, but that most of the cases are only reported in relatively small areas can only be seen on the district level. These three scales open up the opportunity of comparing the epidemic over very differently sized populations. The districts have a typical population size in the order of 105, the states of 107, and the nation of 108.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2:

The number of laboratory confirmed COVID-19 cases per 100000 accumulated from the second NPI on 2020-04-02 until the third NPI on 2020-04-20 on three different spatial resolutions according to the hierarchical administrative divisions of Germany.

The aggregated and nationally calibrated approaches are compared to the German-wide positively tested cases over time (Fig. 3a). First of all, it can be seen that the calibrated national SEIRD-model (1) - (5) with the variable contact rates can be used to reproduce the epidemic in Germany. Aggregating the simulation results from the fitted district models also reproduces the case numbers on a national level, but with some interesting deviations from the fitted national model. The very fast increase of reported cases until mid of March is matched well by both approaches. The subsequent peak is underestimated by the aggregated models. At the beginning of April, they show a second peak, which does not appear in the national model. For lower infection rates, the accumulated models perform well, although they tend to show minor peaks at the NPI change points. From the final NPI on, the spreading events become more scattered with a higher variance and the aggregated models underestimate the case numbers. There is a problem with the initial conditions, because at the early stage of the epidemic, many districts did not have any reported cases or had larger periods with zero infections. Therefore, the cases have to be interpolated for non-trivial initial conditions. This causes the aggregated cases to be larger at the start of the simulation.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3:

Comparisons of parametrized model runs on a higher hierarchical level with the aggregated case numbers from the fitted district level models. The fitted national model and the summed positive cases resulting from the 412 district level models are compared to the nationwide reported cases in Figure (a) and the fitted state model of Bavaria and the summed positive cases resulting from its 96 district level models are compared to the reported cases in Bavaria in Figure (b).

Similarly and very easily within this modeling framework, the district level data can be aggregated to the next hierarchical level, namely the states. As an example, the state of Bavaria, which had the most cases of all German states during the first wave, is taken. The result is similar to the comparison of the national model. The aggregated reported cases show two peaks, whereas the state model only shows one late peak. The peaks at the dates of the NPIs are also present and the aggregated models underestimate the slow and scattered increase from August on.

Now that we have seen that the aggregated fitted simulations can reproduce the reported case numbers on higher hierarchical levels, we can analyse individual districts and see what is being averaged out, when looking at the case numbers on a higher hierarchical level. At the same time, the capabilities and limits of the modified SEIRD model (1) - (5) applied to districts are shown. The results of the parametrized simulations for three districts with qualitatively different courses of the epidemic are discussed here. The results of the model runs fitted to the Stadtkreis (SK, urban district) Jena, Landkreis (LK, rural district) Gütersloh, and SK Duisburg, respectively (Fig. 4a - 4c) are presented now.

Figure 4:
  • Download figure
  • Open in new tab
Figure 4:

The time evolution of the epidemic in three different districts. The transfer rates into the compartment Exposed Embedded Image is shown in purple, into Infectious (αE) in orange, and into Recovered (κE+γI) in green. The shaded area shows the 95% credible interval of the rates. The reported positive cases are shown as a scatter plot in orange, corresponding to Iobs ≙ αE. The vertical grey lines indicate the dates of the NPIs.

Jena (Fig. 4a) was the first district to introduce mandatory mask-wearing and at the same time, this district was very successful in quickly reducing the confirmed cases to almost zero, with only a few days over several month when single new cases were confirmed. This reduction might be a direct consequence of the mandatory face masks (Mitze et al., 2020). The drop in cases can also be seen from the fitted model results, where the peak of the newly reported cases was around the time the first NPI was implemented. After this peak, the rate quickly decreased to around zero per day at the time of the third NPI. The gradual increase of uncertainty in the contact rates from β2 to β6 is a result of the very low case numbers (Fig. 5).

Figure 5:
  • Download figure
  • Open in new tab
Figure 5:

The posterior distributions of the parameters for SK Jena. For better visualization, the parameters κ and µ are shown again on a separate y-scale. A classical box plot is show inside the violins, with the white dot indicating the optimal parameter.

Compared to Jena, Gütersloh (Fig. 4b) had a broader peak of infections at the beginning of the epidemic, but at the time of the third NPI, the rate became very low here too. This changed in mid June when a major outbreak occurred at a meat processing plant, with over 1000 infected employees (RKI - Homepage, 2020; Guenther et al., 2020). This outbreak was spread out over LK Gütersloh and LK Warendorf, where many of the employees lived.

This outbreak lasted about two weeks, but the model spreads and broadens the peak between the NPI change points before and after the event. This is an issue of the insufficient temporal resolution of the contact rates βj. A drawback of the current parameter estimation is revealed by the model results for Gütersloh. The estimation of all contact rates βj is done simultaneously and not for each NPI period successively. This problem arises before the fifth NPI, where the number of exposed and infectious individuals increases only to decrease after the NPI in order to match the data better.

Duisburg (Fig. 4c) has had a mean infection rate of Embedded Image with a standard deviation of 58% without a significant trend. Linearly fitting the data results in a slope of only Embedded Image. Although SIR-type models tend towards either an exponential increase or decrease of the rates, the modified SEIRD model (1) - (5) reproduces the linear trend in Duisburg satisfactorily. The high variance of the reported cases affects the 95% credible interval, where the spread is much higher relatively to the two other analysed districts (Fig. 4a and 4b).

A different view of the course of the epidemic can be gained by looking at the variograms of the infection rates. The variogram and its fit for a single NPI period from 2020-03-17 until 2020-03-23 of the cumulative case rates are shown in Figure 6a. The variograms for all periods can be found in the appendix (Fig. 9). The correlation lengths, derived from the variograms, increase from about λ1 = 40 km and peak at the crest of the first wave at twice the length λ2 = 81 km, when the first NPIs where implemented (Fig. 6b). From then on, the correlation lengths drop until the first NPIs are relaxed on 2020-04-20 with λ4 = 26 km, where the correlation lengths stay nearly constant until a minor peak at λ6 = 36 km is reached. Finally, a global minimum of λ7 = 5.8 km is reached with the last relaxation of the NPIs. For comparison, the district centroids have a mean distance to their neighboring district centroids of about 32 km.

Figure 6:
  • Download figure
  • Open in new tab
Figure 6:

The empirical and the exponential variograms (Eq. (8)) of the cumulative rates of the reported cases for the time period before the first NPI are shown in Figure (a). The time evolution of the correlation lengths λi of the covariance models for the cumulative cases is shown in Figure (b). The mean distance of the neighboring district centroids is indicated by the dashed grey line.

Figure 7:
  • Download figure
  • Open in new tab
Figure 7:

Histograms of the parameters of all 412 districts. The rug plot indicates each single parameter value with a small vertical tick.

Figure 8:
  • Download figure
  • Open in new tab
Figure 8:

The relative sensitivities of each parameter exemplarily for the Stadtkreis Duisburg. The larger the slice of a parameter, the more it influences the simulation results in regard to the observations, which are the positively tested case rate and the COVID-19 related death rate.

Figure 9:
  • Download figure
  • Open in new tab
Figure 9:

The empirical and the modelled exponential variograms of the cumulative rates of reported cases for every NPI period. The variance is proportional to the cases and the flattening of the exponential variograms indicates the correlation length.

4 Discussion

In this work, we present a modified SEIRD-type epidemiological model with variable contact rates tailored to the COVID-19 pandemic. This model is fit to the data from each of the 412 German districts, all 16 states, and the nation. The parametrization is done using RKI data of the daily positively tested cases and the COVID-19 related deaths. The most important tool to modulate the epidemic to date, the non-pharmaceutical interventions, are implemented using piecewise constant contact rates which only change at the dates of NPI implementations. This model is flexible enough to satisfactorily reproduce the time evolution of the epidemic on a district level over many months, although the development of the epidemic is qualitatively very different across the different districts. Some districts had a very pronounced first peak followed by a long period in which the disease was practically eradicated. Others had a more or less constant rate of positively tested cases over several months. Furthermore, the same model can reproduce the epidemic on a state and on a national level. However, only on the district level is the spatial resolution high enough to analyse spatial patterns, for which we use the geostatistical method of variogram estimation. This method does not require any additional data, which makes variogram analysis an ideal tool during the onset of new epidemics, when only limited data are available.

Monitoring and modeling the infections on this small scale level is a first step towards local, precise, and target-oriented NPIs. Doing so could increase the cost/benefit ratio and also the acceptance of NPIs. The correlation lengths of the estimated variograms might help in evaluating if local NPIs are sufficient or if state or even nationwide measurements should be taken. An example scenario where the case numbers or weekly incidence rates alone are not enough to judge the effectiveness of local-scale NPIs is the following. If the quarantining in the aftermath of a superspreader event is applied too late or not rigorously enough, it could reduce the total amount of newly reported cases, but commuters might have already spread the disease to neighboring districts. In these surrounding districts, the case numbers would only slowly increase. Thus, by only taking the total case numbers into account, one might come to the conclusion, that the superspreader event was successfully quarantined. Whereas the correlation length would increase early with the slow spreading to the neighboring districts, even though the total amount of reported cases drops after the initial quarantining. This information can also be extracted from maps, but they contain the information in complex ways and it is always easier to communicate information in single numbers (e.g. weekly incidence rates, instead of the time evolution of the reported cases, the mean instead of the complete distribution, the h-index instead of the quality and topics of a researcher).

The high spatial resolution of the district level opens up the possibility to aggregate the results to a specific level, e.g. to the states or to the national level, which can also yield unique insights into the epidemic. The aggregated district models show a second peak during the first wave on 2020-04-01 (Fig. 3a). This might actually hint at the large number of districts, where the peak infection was reached with a delay of about two weeks compared to the districts, in which the epidemic started earlier. On a national level this delay is completely averaged out and it cannot be seen in the data on a German-wide level. Later on, the aggregated district models tend to underestimate the national-level case numbers. A reason for this could be that the dynamics of the epidemic are often driven by local superspreader events, which could be isolated and quarantined effectively. These events look like outliers on a district level, but increase the averaged cases on a national level, making them easier to match on the higher level. From August on, the infections seem to become more scattered with a much higher variability than before. This is also roughly the time, when more local NPIs were implemented and a central modeling approach with fixed NPIs for all districts might become too rigid for this kind of scenario.

The correlation lengths λi obtained from the variogram estimation support the idea that districts are the appropriate level of granularity for monitoring and modeling the epidemic. The fact that exponential variograms fit the data best further supports this, as it is a relatively rough correlation type, compared to e.g. Gaussian variograms, indicating that although pronounced spatial correlations exist, immediately neighboring districts can still have very different case numbers. If λi is less than the average neighboring district distance, it indicates that NPIs should only be implemented on a local district level, according to e.g. the weekly incidence rates of the district, published by the RKI. However, λi greater than the inter-district distance and less than the average distance between neighboring states suggests that NPIs should be applied on a state level or on an intermediate level, e.g. in Regierungsbezirken (provinces) in Germany. If the clusters grow beyond state size, nationwide NPIs are likely to be appropriate again. This hierarchical control approach works in both directions, not only for applying new NPIs at targeted spatial extents, but also for lifting existing ones over different regions, as the epidemic subsides. This modeling framework also makes it very easy to make projections on different hierarchical levels, e.g. what effect would NPIs have on the weekly incidence rates, if they are applied locally at a district level or if they are applied on a state level. Combining this with an economic model could help finding a balance between the effectiveness and costs of NPIs.

The model results will likely improve, if the NPI periods are parametrized individually and successively. This would prevent the model from increasing the number of cases prior to an NPI and the actual increase, as can be seen in the results for LK Gütersloh at 2020-06-09 (Fig. 4b) or in the peaks at the NPI dates in the aggregated models (Fig. 3a, 3b). However, a multitude of approaches for such a successive parametrization exist. The approach presented in this study could be a precursor from which all constant parameters (α, γ, κ, µ) are identified. Subsequently, the contact rates βj could be parametrized successively by regarding one NPI period at a time and with priors for βj taken from the precursor run. Alternatively, the constant parameters could also be estimated for each NPI period separately. The differences in these supposedly constant parameters could be used as an indicator, to see if the compartments should be further divided into different age groups, as these parameters do vary between different age groups. But exploring these possibilities is beyond the scope of this work.

A further and likely more important improvement might be to choose an appropriate algorithm out of the wealth of published outlier detection algorithms (e.g. Hodge et al., 2004) and to apply it to the RKI time series to automatically identify superspreader events. Such an event could then be implemented into the existing modeling framework by means of an additional transfer term, which acts like a Dirac pulse type source term for the Infectious compartment, but at the same times obeys the conservation laws. This way, local NPIs can be detected automatically and applied without having to prescribe NPIs manually to all districts individually.

An alternative approach could be to derive information about super-spreader events from identifying change points in the contact rates as done by Dehning et al., 2020.

Data Availability

All data used in this work is freely available and the sources are given in the manuscript.

https://npgeo-corona-npgeo-de.hub.arcgis.com/datasets/dd4580c810204019a7b8eb3e0b329dd6_0

A Model Assessment

With the 412 districts simulated with fitted models, we can create histograms of the model parameters. Looking at the distributions of the model parameters across the districts, it is to be expected that mostly the contact rates βj should vary across districts (Fig. 7a and 7b). Except for some variations in the age structures of the populations, the other model parameters should not vary strongly. But this is only the case for the recovery rate γ, which has a pronounced peak at about γ ≈ 3.2 d−1. The other three parameters are more or less uniformly distributed, but with a negative trend for α. The extended FAST sensitivity analysis (Fig. 8) reveals that the three parameters α, κ, and µ are not uniquely identifiable, as they are not sensitive towards the calibrated data. This explains the uniform distribution of these parameters across the districts, as the parameter calibration has no way of pinpointing the parameters. From the low sensitivity one cannot deduce that the parameters are not important for the model, as the sensitivity analysis only tests the relative influence towards minimizing the objective function.

Acknowledgement

This work was partially funded by the Center of Advanced Systems Understanding (CASUS) which is financed by Germany’s Federal Ministry of Education and Research (BMBF) and by the Saxon Ministry for Science, Culture and Tourism (SMWK) with tax funds on the basis of the budget approved by the Saxon State Parliament. This work was also partially funded by the Where2Test project, which is financed by SMWK with tax funds on the basis of the budget approved by the Saxon State Parliament.

References

  1. ↵
    Barbarossa, Maria Vittoria; Fuhrmann, Jan; Meinke, Jan H.; Krieg, Stefan; Varma, Hridya Vinod; Castelletti, Noemi; Lippert, Thomas; 2020. Modeling the spread of COVID-19 in Germany: Early assessment and possible scenarios. PLoS ONE [online]. Vol. 15, no. 9, e0238559 [visited on 2020-10-28]. issn 1932-6203. Available from doi:10.1371/journal.pone.0238559.
    OpenUrlCrossRef
  2. Cucinotta, Domenico; Vanelli, Maurizio; 2020. WHO Declares COVID-19 a Pandemic. Acta Bio Medica Atenei Parmensis [online]. Vol. 91, no. 1, pp. 157–160 [visited on 2020-10-23]. issn 25316745, issn 03924203. Available from doi:10.23750/abm.v91i1.9397.
    OpenUrlCrossRefPubMed
  3. ↵
    Dehning, Jonas; Zierenberg, Johannes; Spitzner, Frank Paul; Wibral, Michael Neto; Joao Pinheiro, Wilczek, Michael; Priesemann, Vi-ola; 2020. Inferring change points in the COVID-19 spreading reveals the effectiveness of interventions [online]. 2020-04-06 [visited on 2020-09-22]. preprint. Epidemiology. Available from doi:10.1101/2020.04.02.20050922.
    OpenUrlAbstract/FREE Full Text
  4. ↵
    Diekmann, O.; Heesterbeek, J. A. P.; Roberts, M. G.; 2010. The construction of next-generation matrices for compartmental epidemic models. J. R. Soc. Interface. [Online]. Vol. 7, no. 47, pp. 873–885 [visited on 2020-07-16]. issn 1742-5689, issn 1742-5662. Available from doi:10.1098/rsif.2009.0386.
    OpenUrlCrossRefPubMedWeb of Science
  5. Dormand, J.R.; Prince, P.J.; 1980. A family of embedded Runge-Kutta formulae. Journal of Computational and Applied Mathematics [online]. Vol. 6, no. 1, pp. 19–26 [visited on 2020-07-16]. issn 03770427. Available from doi:10.1016/0771-050X(80)90013-3.
    OpenUrlCrossRef
  6. ↵
    Ferguson, N; Laydon, D; Nedjati Gilani, G. Imai, N; Ainslie, K; Baguelin, M; Bhatia, S; Boonyasiri, A; Cucunuba Perez, Zulma; Cuomo-Dannenburg G; Dighe, A; Dorigatti, I; Fu, H; Gaythorpe, K; Green, W; Hamlet, A; Hinsley, W; Okell, L; Van Elsland, S; Thompson, H; Verity, R; Volz, E; Wang, H; Wang, Y; Walker, P; Winskill, P; Whittaker, C; Don-Nelly, C; Riley, S; Ghani, A; 2020. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand [online]. 2020-03-16 [visited on 2020-09-22]. Imperial College London. Available from doi:10.25561/77482.
    OpenUrlCrossRef
  7. ↵
    Guenther, Thomas; Czech-Sioli, Manja; Indenbirken, Daniela; Robitailles, Alexis; Tenhaken, Peter; Exner, Martin; Ottinger, Matthias; Fischer, Nicole; Grundhoff, Adam; Brinkmann, Melanie; 2020. Investigation of a superspreading event preceding the largest meat processing plant-related SARS-Coronavirus 2 outbreak in Germany. SSRN Journal [online] [visited on 2020-10-29]. issn 1556-5068. Available from doi:10.2139/ssrn.3654517.
    OpenUrlCrossRef
  8. ↵
    Hastings, W K; 1970. Monte Carlo sampling methods using Markov chains and their applications. Biometrika. Vol. 57, no. 1, pp. 97–109.
    OpenUrlCrossRefWeb of Science
  9. Hodge, Victoria J; Austin, Jim; 2004. A Survey of Outlier Detection Methodologies. Artificial Intelligence Review. Vol. 22, p. 42.
    OpenUrl
  10. ↵
    Houska, Tobias; Kraft, Philipp; Chamorro-Chavez, Alejandro; Breuer, Lutz; 2015. SPOTting Model Parameters Using a Ready-Made Python Package. PLoS ONE [online]. Vol. 10, no. 12, e0145180 [visited on 2020-07-16]. issn 1932-6203. Available from doi:10.1371/journal.pone.0145180.
    OpenUrlCrossRef
  11. ↵
    Kermack, William Ogilvy; Mckendrick, Anderson G, 1927. A Contribution to the Mathematical Theory of Epidemics. Proceedings of the royal society of london. Series A. Vol. 115, no. 772, pp. 700–721.
    OpenUrlCrossRef
  12. ↵
    Khailaie, Sahamoddin; Mitra, Tanmay; Bandyopadhyay, Arnab; Schips, Marta; Mascheroni, Pietro; Vanella, Patrizio; Lange, Berit; Binder, Sebastian; Meyer-Hermann, Michael; 2020. Estimate of the development of the epidemic reproduction number Rt from Coronavirus SARS-CoV-2 case data and implications for political measures based on prognostics [online]. 2020-04-07 [visited on 2020-04-10]. preprint. Epidemiology. Available from doi:10.1101/2020.04.04.20053637.
    OpenUrlAbstract/FREE Full Text
  13. ↵
    Matheron, Georges; 1963. Principles of geostatistics. Economic Geology [online]. Vol. 58, no. 8, pp. 1246–1266 [visited on 2020-08-17]. issn 1554-0774, issn 0361-0128. Available from doi:10.2113/gsecongeo.58.8.1246.
    OpenUrlAbstract/FREE Full Text
  14. ↵
    Metropolis, Nicholas; Rosenbluth, Arianna W; Rosenbluth, Marshall N; Teller, Augusta H; Teller, Edward; 1953. Equation of State Calculations by Fast Computing Machines. The Journal of Chemical Physics. Vol. 21, no. 6, pp. 1087–1092. Available from doi:10.1063/1.1699114.
    OpenUrlCrossRefPubMedWeb of Science
  15. ↵
    Mitze, Timo; Kosfeld, Reinhold; Rode, Johannes; Wälde, Klaus; 2020. Face Masks Considerably Reduce COVID-19 Cases in Germany: A Synthetic Control Method Approach, p. 31.
  16. ↵
    Nicola, Maria; Alsafi, Zaid; Sohrabi, Catrin; Kerwan, Ahmed; Al-Jabir, Ahmed; Iosifidis, Christos; Agha, Maliha; Agha, Riaz; 2020. The socio-economic implications of the coronavirus pandemic (COVID-19): A review. International Journal of Surgery [online]. Vol. 78, pp. 185– 193 [visited on 2020-10-23]. issn 17439191. Available from doi:10.1016/j.ijsu.2020.04.018.
    OpenUrlCrossRefPubMed
  17. RKI - Homepage, 2020 [online] [visited on 2020-07-16]. Available from: https://www.rki.de/EN/Home/homepage_node.html.
  18. ↵
    Rubin, Yoram; 2003. Applied Stochastic Hydrogeology. Oxford University Press. isbn 978-0-19-803154-3.
  19. ↵
    Saltelli, A.; Tarantola, S.; Chan, K. P.-S.; 1999. A Quantitative Model-Independent Method for Global Sensitivity Analysis of Model Output. Technometrics [online]. Vol. 41, no. 1, pp. 39–56 [visited on 2020-07-17]. issn 0040-1706, issn 1537-2723. Available from doi:10.1080/00401706.1999.10485594.
    OpenUrlCrossRef
  20. ↵
    Schnicke, Thomas; Langenberg, Ben; Schramm, Guido; Krause, Christian; Strempel, Tom; 2020. EVE - High-Performance Computing Cluster. Helmholtz-Zentrum für Umweltforschung GmbH - UFZ, Permoserstr. 15, 04318 Leipzig. Available also from: https://wiki.ufz.de/eve/.
  21. Schüler, Lennart; Müller, Sebastian; 2020. GeoStat-Framework / GSTools: Volatile Violet v1.2.1 [online]. Zenodo [visited on 2020-08-17]. Available from doi:10.5281/zenodo.3751743. Language: eng.
    OpenUrlCrossRef
  22. SCIPY 1.0 CONTRIBUTORS, Virtanen, Pauli; Gommers, Ralf; Oliphant, Travis E.; Haberland, Matt; Reddy, Tyler; Cournapeau, David; Burovski, Evgeni; Peterson, Pearu; Weckesser, Warren; Bright, Jonathan; Walt, Stéfan J. van der, Brett; Matthew Wilson, Joshua; Millman, K. Jarrod; Mayorov, Nikolay; Nelson, Andrew R. J.; Jones, Eric; Kern, Robert; Larson, Eric; Carey, C J; Polat, İlhan; Feng, Yu Moore; Eric W.; Vanderplas, Jake; Laxalde, Denis; Perktold, Josef; Cimrman, Robert; Henriksen, Ian; Quin-Tero, E. A.; Harris, Charles R.; Archibald, Anne M.; Ribeiro, Ant nio H.; Pedregosa, Fabian; Mulbregt, Paul Van, 2020. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods [online]. Vol. 17, no. 3, pp. 261–272 [visited on 2020-07-16]. issn 1548-7091, issn 1548-7105. Available from doi:10.1038/s41592-019-0686-2.
    OpenUrlCrossRefPubMed
  23. Statistisches Bundesamt [Destatis] [Federal Statistical Office], 2020 [online] [visited on 2020-10-23]. Available from: https://www.destatis.de/EN/Themes/Economy/National-Accounts-Domestic-Product/Tables/gdp-bubbles.html.
  24. Statistisches Bundesamt [Destatis] [Kreisfreie Städte und Landkreise am 31.12.2019], 2020 [online]. 2020-09-02 [visited on 2020-10-08]. Available from: https://www.destatis.de/DE/Themen/Laender-Regionen/Regionales/Gemeindeverzeichnis/Administrativ/04-kreise.html.
Back to top
PreviousNext
Posted January 26, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Data Driven High Resolution Modeling and Spatial Analyses of the COVID-19 Pandemic in Germany
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Data Driven High Resolution Modeling and Spatial Analyses of the COVID-19 Pandemic in Germany
Lennart Schüler, Justin M. Calabrese, Sabine Attinger
medRxiv 2021.01.21.21250215; doi: https://doi.org/10.1101/2021.01.21.21250215
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Data Driven High Resolution Modeling and Spatial Analyses of the COVID-19 Pandemic in Germany
Lennart Schüler, Justin M. Calabrese, Sabine Attinger
medRxiv 2021.01.21.21250215; doi: https://doi.org/10.1101/2021.01.21.21250215

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)