High connectivity and human movement limits the impact of travel time on infectious disease transmission

Reju Sam John; Joel C. Miller; Renata L. Muylaert; David T. S. Hayman

doi:10.1101/2023.07.26.23293210

Abstract

The speed of spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) during the coronavirus disease 2019 (COVID-19) pandemic highlights the importance of understanding how infections are transmitted in a highly connected world. Prior to vaccination, changes in human mobility patterns were used as non-pharmaceutical interventions to eliminate or suppress viral transmission. The rapid spread of respiratory viruses, various intervention approaches, and the global dissemination of SARS-CoV-2 underscore the necessity for epidemiological models that incorporate mobility to comprehend the spread of the virus. Here, we introduce a metapopulation susceptible–exposed–infectious–recovered (SEIR) model parameterised with human movement data from 340 cities in China. Our model replicates the early case trajectory in the COVID-19 pandemic. We then use machine learning algorithms to determine which network properties best predict spread between cities and find travel time to be most important, followed by the human movement Weighted Personalised PageRank. However, we show that travel time is most influential locally, after which the high connectivity between cities reduces the impact of travel time between individual cities on transmission speed. Additionally, we demonstrate that only significantly reduced movement substantially impacts infection spread times throughout the network.

1. Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in late 2019 and spread throughout the world in 2020 [1, 2]. Computational epidemiological modeling has been an important tool to predict the emergence and propagation of the pathogen [3]. The role of travel in infection spread is well recognised and pathogens can now move faster than ever before due to modernisation and globalisation [4]. Hence, travel restrictions are a key, non-pharmaceutical tool for ceasing or slowing the transmission of pathogens between locations [5, 6]. For an effective implementation of such interventions we need a better understanding of the effects and benefits of such restrictions a priori. This calls for a metapopulation modeling approach that incorporates population exchange between different locations.

Basic metapopulation models do not identify individuals based on their home locations and are formulated as Markovian processes. However, the primary pattern of human movement is driven by commuting populations. Thus, it is important to track the travellers according to their original locations. Hence we developed a metapopulation susceptible-exposed-infected-recovered (SEIR) compartmental model for commuting individuals in a population, using population and mobility data at the city level.

Researchers have built different epidemiological models to study the spread of SARS-CoV-2 (e.g. [3, 7, 8]). Few models take into account empirical population flows, which are key for understanding transmission dynamics [3, 6, 8–12]. Such models help to accurately depict the actual dynamics of the spread, which are highly influenced by the population flow in a real-life setting [3, 12]. Previous work has applied a global metapopulation disease transmission model with some national subpopulations centered around major transportation hubs, including within China, to model the impact of travel limitations on the national and international spread of SARS-CoV-2 after its emergence in Wuhan, China [12]. These models, however, have focused on SARS-CoV-2 dynamics and spread in the face of a pandemic, but not the overall dynamics of the systems and properties of the networks that facilitate or limit spread. As a result, there is a pressing need for data-driven, quantitative studies that look at fundamental system properties to help plan for future outbreaks for infections with similar epidemiological features.

Here, we created a model for the transmission of a SARS-CoV-2-like infection, using population flow data to effectively describe the infection spread. We used population flow data from all cities in mainland China to their top 100 most connected cities to constrain the parameters in our model. We built the population flow network of China with cities as nodes and the population flow between the cities as the edge weights. We first consider models with introductions of 100 infections in Wuhan, using this as the epicentre of the SARS-CoV-2 outbreak [13], and simulate the transmission behavior and outbreak sizes of SARS-CoV-2-like infections across China, then seed infection in different localities. We investigated which network properties are most important in the spread of infection, and whether there is a relationship between commuter numbers and their travel duration, infectious period, basic reproduction number R₀ (the number of cases generated by a typical index case in a fully susceptible population), and incubation period of the infection. Then, we conducted simulations to understand the impacts of reducing human flow between cities as a non-pharmaceutical intervention to control spread between cities.

2. Methods

(a) Model structure

We developed a metapopulation SEIR model to study the transmission dynamics of an emerging COVID-19-like disease. We assigned home locations to each human host, identified as the level 2 administrative division in China. Let us imagine we have n such locations. People usually live in a particular location and travel occasionally or periodically to other locations (on a temporary basis). Here we differentiate between the residents and visitors currently located at the same location. Therefore we label S_ii(t), E_ii(t), I_ii(t), R_ii(t), and N_ii(t) as the number of susceptible, exposed, infected, recovered and total people in location i at time t who belong to location i, and S_ij(t), E_ij(t), I_ij(t), and R_ij(t) as the number of susceptible, exposed, infected, and recovered people in location i at time t who belong to location j. These visitors are represented by the dashed circles in Figure 1. The visitors interact with residents at a given location and they return to their home location at a fixed rate π_ij. If people from location j travel to location i and stay there for a week before returning to their home location j, then the return rate π_ij will be 1/7 per day. For simplicity, we assume the same return rate (π) for everyone at all locations. The total number of hosts currently in location i that live in location j will then be N_ij(t) = S_ij(t) + E_ij(t) + I_ij(t) + R_ij(t). The basic SEIR model structure, which incorporates movement and vital dynamics (birth and natural death), is shown in Figure 1.

Figure 1.

The schematic representation of the SEIR metapopulation model. In the compartment diagram, inhabitants of a particular city or location are represented by a particular color. The square boxes represent the set of individuals who are in the Susceptible, Exposed, Infectious, and Recovered class. The downward arrows represent the flow from one compartment to another within a city or population and are annotated with the corresponding flow rates. The arrow pointing to the right between the square boxes represents the flow between the corresponding compartments of another city or population and is annotated with the emigration rate. The dashed circles represent immigrants in a particular city, for example, j, who emigrated from another city, for example, i. The dashed arrow represents the return of immigrants to the home city and is annotated with the return rate. The outward and inward arrows from and to the square boxes represent death and birth, annotated with the respective death and birth numbers, assuming the same birth and death rates.

Let ϵ_ij represent the rate at which hosts whose home city is j travel to i, with ϵ_ii = 0 for all i. A person from location j who is currently at location i will be included in the N_ij population. The number of people who belong to location i remains constant over time, even though the members visit other sites. Further, to make this statement true we made another assumption: birth rate, μ = death rate, μ (so, μN_ii(t) = μS_ii(t) + μE_ii(t) + μI_ii(t) + μR_ii(t)). Finally, we assume homogeneous mixing for individuals within each city. With these assumptions, 8n² equations describe transmission among people while they are at their home site or while they are travelling; the parameters are summarised in Table 1.

Table 1: Parameters and variables
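To make the bookkeeping of residents and visitors concrete, the following is a minimal Python sketch of one possible discretisation of a model with this structure. It is not the authors' implementation: the exact form of the equations, the forward-Euler scheme, the omission of births and deaths, and every numerical value in the two-city example (including R₀ = 2.5 and the rates `sigma` and `gamma`) are assumptions made for illustration. The function name `seir_step` and the array layout `state[c][i][j]` are likewise hypothetical.

```python
def seir_step(state, eps, beta, sigma, gamma, pi, dt):
    """One forward-Euler step of an assumed commuter SEIR model.

    state[c][i][j] is the number of people in compartment c
    (0=S, 1=E, 2=I, 3=R) currently in location i whose home is j.
    eps[i][j] is the rate at which residents of j travel to i.
    Vital dynamics are omitted here for brevity (an assumption).
    """
    n = len(eps)
    S, E, I, R = state
    # Force of infection at each location: everyone present mixes homogeneously.
    lam = []
    for i in range(n):
        present = sum(S[i][k] + E[i][k] + I[i][k] + R[i][k] for k in range(n))
        infectious = sum(I[i][k] for k in range(n))
        lam.append(beta * infectious / present if present > 0 else 0.0)

    new = [[[0.0] * n for _ in range(n)] for _ in range(4)]
    for i in range(n):
        for j in range(n):
            # Epidemiological transitions happen where the person currently is.
            infect = lam[i] * S[i][j]
            onset = sigma * E[i][j]
            recover = gamma * I[i][j]
            d = [-infect, infect - onset, onset - recover, recover]
            for c in range(4):
                X = state[c]
                if i == j:
                    # Residents at home: some leave for k, visitors abroad return.
                    move = sum(pi * X[k][j] - eps[k][j] * X[j][j]
                               for k in range(n) if k != j)
                else:
                    # Visitors from j in i: arrive from home, return at rate pi.
                    move = eps[i][j] * X[j][j] - pi * X[i][j]
                new[c][i][j] = X[i][j] + dt * (d[c] + move)
    return new


def zeros(n):
    return [[0.0] * n for _ in range(n)]


# Illustrative two-city run; every number below is an assumption.
n = 2
eps = [[0.0, 0.01], [0.01, 0.0]]  # daily travel rate from home j to i
gamma = 1 / 5.0                   # assumed recovery rate per day
beta = 2.5 * gamma                # assumed R0 of 2.5
sigma = 1 / 5.0                   # assumed rate of leaving the exposed class
pi = 1 / 5.0                      # return rate of 1/5 per day
S, E, I, R = zeros(n), zeros(n), zeros(n), zeros(n)
S[0][0], S[1][1] = 1e6 - 100, 1e6
I[0][0] = 100.0                   # 100 initial infections seeded in city 0
state = [S, E, I, R]
for _ in range(600):              # 60 days with dt = 0.1
    state = seir_step(state, eps, beta, sigma, gamma, pi, dt=0.1)

total = sum(x for comp in state for row in comp for x in row)
ever_infected_home1 = sum(state[c][i][1] for c in (1, 2, 3) for i in range(n))
```

In this sketch the movement terms cancel when summed over locations, so the number of people belonging to each home city stays constant, which mirrors the assumption stated above.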

(b) Human movement data

We need to estimate the emigration rate, ϵ, for the populations of all cities in China. For this we scraped the Baidu migration site (http://qianxi.baidu.com/) [19] for the period from January 19, 2021 to January 18, 2022. The Baidu migration data set is created from a mobile phone application that tracks the movements of users. The Baidu migration website provides the 100 most popular immigration sources and emigration destinations for each prefecture-level administrative division. The immigration/emigration (η) of a city is provided as the percentage of the population that migrated from/immigrated to the corresponding city. The website also displays the inward/outward migration index (hereafter Baidu migration index, ι) of all administrative divisions in mainland China. Hence, the real inward/outward migration can be calculated from η and ι together with a scaling factor s that converts the Baidu migration index (ι) to the absolute number of travellers. However, the numerical value of this scaling factor is ambiguous. Several authors have calculated different values for each unit of Baidu’s migration index, which are summarised in Table 2. Combining evidence from the sources in Table 2, we chose a scaling factor of 50,000.

Table 2: Various scaling factors for Baidu migration index (ι) from different sources.
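As a worked illustration of the conversion described above, the sketch below assumes that the absolute flow on one route is the product of the migration index ι, the scaling factor s, and the route's share η expressed as a percentage. This product form is an assumption consistent with the definitions in the text, not a formula taken from it; the function name `absolute_flow` and the example values of ι and η are hypothetical.

```python
SCALE = 50_000  # travellers per unit of Baidu migration index (value chosen in the text)


def absolute_flow(index, share_percent, scale=SCALE):
    """Assumed conversion: travellers = index * scale * (route share in percent / 100)."""
    return index * scale * share_percent / 100.0


# Hypothetical example: a city with migration index 6.0 sends 12.5% of its
# outbound travellers to one destination.
example_flow = absolute_flow(6.0, 12.5)
```

With these hypothetical inputs the route would carry 37,500 travellers per day; a different choice of s from Table 2 would rescale every flow by the same factor.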

(c) Human flow network

We use the location-based service offered by the Baidu data server, which gathers information based on Global Positioning System (GPS) locations, locations of cell towers, IP addresses, Wi-Fi, and location data from a variety of software and apps on mobile devices. We collected the mobility data by monitoring the features of the HTTPS (Hypertext Transfer Protocol Secure) requests made to the Baidu data server. This provides us with the percentage of movement for all cities and their one hundred most connected cities. After analysing the JavaScript Object Notation (JSON) responses from the server, we generated the outflow and inflow matrices for the cities in China. We then took the yearly average of these inflow and outflow matrices and found that one matrix is approximately the transpose of the other, which corresponds to almost all travel being round trips. Therefore, we construct the parameter ϵ in the model as: ϵ = (Inflow + Outflow^T)/2. Table 3 presents a sample of the yearly average flow matrix (ϵ) between 8 example cities.

Table 3: Sample of the yearly average human population flow matrix. Eight cities are shown from 340 in total.
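The averaging step ϵ = (Inflow + Outflow^T)/2 can be sketched in a few lines of Python. The indexing convention (`inflow[i][j]` is the flow into city i from city j, and `outflow[j][i]` is the flow out of city j to city i), the function name `average_flow`, and the three-city numbers are all assumptions for illustration; they are not values from the Baidu data.

```python
def average_flow(inflow, outflow):
    """Combine the two views of the same trips: eps = (Inflow + Outflow^T) / 2."""
    n = len(inflow)
    eps = [[0.5 * (inflow[i][j] + outflow[j][i]) for j in range(n)]
           for i in range(n)]
    for i in range(n):
        eps[i][i] = 0.0  # no travel from a city to itself
    return eps


# Hypothetical three-city example in which outflow is roughly the transpose of inflow.
inflow = [[0, 120, 30],
          [100, 0, 10],
          [40, 20, 0]]
outflow = [[0, 110, 36],
           [124, 0, 22],
           [28, 14, 0]]
eps = average_flow(inflow, outflow)
```

Averaging the two matrices in this way smooths small discrepancies between the inflow and outflow records of the same route.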

After developing the infectious disease spread model for the country, simulation experiments were carried out to identify the key parameters that can affect epidemic spread in a highly connected country like China. One of the main parameters that affects epidemic spread is the number of initial infected people in a population. To study the other key parameters systematically, we kept the number of initial infected people constant at 100 throughout the experiments.

(d) Model validation

According to Huang et al., 2020 [25] and Allam 2020 [26], the earliest date of reported COVID-19 cases in Wuhan, in the Hubei province of China, was December 1, 2019, though the earliest infections and cases are not known (see [13]). However, this date is a reasonable approximation for our purposes of tracking early infection dynamics. As of February 10, 2020, approximately 71 days after the first case in Wuhan, there had been 262 confirmed cases of COVID-19 in Beijing [27]. We use these numbers and time frame to help validate the model.

(e) Predictors of spread

There are important features of networks that might govern epidemic spread. These include factors relating to the flow of people, such as the number of travellers (see above), the travel duration and distance, and specific network (graph) properties. Here, we identify key metrics, described below, and test which ones influence the spread in our metapopulation.

(i) Travel duration

To understand the effect of travel time, we retrieve and analyze road/street networks of China from the OpenStreetMap (OSM) with OSMnx [28]. A known node in the created China road network is a location of interest on the map, such as a bus stop, house, shop, or train station. The roads that connect these nodes are our edges. They have some useful metadata like distance and the maximum speed allowed on that particular road. We defined the centroid of each prefecture-level city (China administrative level 2) through performing a spatial match between the location reference codes from Baidu and a reference polygon shapefile [29] of China compiled by the United Nations Office for the Coordination of Humanitarian Affairs (OCHA) and the Regional Office for Asia and the Pacific (ROAP). Then we find a known node close to the centroid of each prefecture-level city polygon. This enables us to identify a shortest route between Beijing and all the city centers by connecting all the intermediate connected nodes. One such shortest possible route between Wuhan and Beijing is shown in Figure 2.

Figure 2.

Shortest possible route between Wuhan and Beijing centroids (red) calculated from the China road network (white)

Then we calculated the total distance and travel time between city centers and Beijing by summing the distances between consecutive nodes along each route. When we compared the driving (motorized-vehicle) travel time calculated from OSM with that from Google Maps, the OSM estimate was consistently too short, even though the distances are comparable. This is because the travel time is calculated by adding the time to traverse each edge, under the assumption that one can travel at the maximum speed limit of that road (edge). For a better understanding of the human flow (immigration and emigration) and infectious disease spread among cities in China, we created another graph of China, where each node is a city and edges between them are weighted by the actual flow of people that we calculated from ι.
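The reason free-flow travel times come out short can be seen in a small sketch. The code below is not the OSMnx workflow used in the study; it is a plain Dijkstra search over a hypothetical three-node road graph in which each edge's traversal time is its length divided by its speed limit, which is the assumption described above. The node names, lengths, and speed limits are invented for illustration.

```python
import heapq


def shortest_travel_time(edges, source, target):
    """Dijkstra over an undirected road graph.

    edges: iterable of (u, v, length_km, max_kph). The time on each edge is
    length / speed limit, i.e. free-flow driving with no congestion or stops.
    Returns the minimum travel time in hours, or infinity if unreachable.
    """
    graph = {}
    for u, v, length_km, max_kph in edges:
        hours = length_km / max_kph
        graph.setdefault(u, []).append((v, hours))
        graph.setdefault(v, []).append((u, hours))

    best = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        t, node = heapq.heappop(queue)
        if node == target:
            return t
        if t > best.get(node, float("inf")):
            continue  # stale queue entry
        for nxt, hours in graph.get(node, []):
            cand = t + hours
            if cand < best.get(nxt, float("inf")):
                best[nxt] = cand
                heapq.heappush(queue, (cand, nxt))
    return float("inf")


# Hypothetical road segments: (from, to, length in km, speed limit in km/h).
roads = [("A", "B", 100, 100),
         ("B", "C", 120, 60),
         ("A", "C", 200, 50)]
hours_a_to_c = shortest_travel_time(roads, "A", "C")
```

Because every edge is traversed at its speed limit, the result is a lower bound on realistic driving time, which is consistent with the OSM estimates being shorter than those from Google Maps.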

(ii) Network statistics

To compute the network statistics for the China travel network, we created a graph of China, where each city is represented as a node and the flow/outflow matrix between each city is used to create the edges between those nodes, which are weighted with the number of people moving between those cities. There are multiple measures of association between nodes in a graph. These measures include node-level ranking algorithms using link-based centrality metrics, including Google’s PageRank, Degree Centrality, Betweenness Centrality and others. We use eight metrics, which are described below, to explore the role of different network properties on spread in our metapopulation.

Degree Centrality is a measure of the connectivity of a city within a network [30]. It is calculated by counting the number of edges a city has and normalizing it by dividing it by the maximum degree in the graph. This measure gives us an understanding of how many cities a particular city is connected to through human mobility and, therefore, how influential it is within the network.

Eigenvector Centrality is a measure of the influence of a city within a network [30–32]. It takes into consideration the centrality of the cities pointing to a particular city and assigns a higher eigenvector centrality value to cities that are visited by people from many other central cities. In our analysis, we reversed the directional graph of the China outflow network to get a better understanding of the influence of a city based on the number of cities it is pointing to, within the eigenvector centrality analysis.

PageRank is an algorithm that computes a ranking of nodes (here cities) in a graph based on the structure of incoming links [33, 34]. It has several possible improvements over other centrality measures, such as eigenvector centrality. In the PageRank algorithm that we followed [35], every node (city) has an arbitrary amount of centrality at the outset. Hence, even an unlinked node will have a baseline centrality; that is, a city’s existence itself gives it an alpha centrality. Also in PageRank, if we have two cities with the same centrality measure, the one with fewer outflow links will transfer more value to each linked node than the other.

Weighted PageRank is a modification of the PageRank algorithm that takes into account the weight of edges in a graph, where in our analysis the edge weight is the number of people flowing out between the cities [34, 36]. This modification provides a more accurate picture of the spread of information or people in a network.

Weighted Personalized PageRank, also called “Random Walk with Restart” [37–39], is a variant of the Weighted PageRank algorithm for finding nodes in a graph that are most relevant to another node. Here Weighted Personalized PageRank is adapted for both the outflow from Wuhan and the flow to Beijing [40]. This modification provides a more accurate understanding of the gravity of a node in the graph and how fast Beijing can reach 100 infections if Wuhan has an initial 100 infections.

The HITS [41, 42] algorithm, also known as “hubs and authorities”, is an alternative method of identifying relevant and popular nodes in a network. It provides two separate measures: Authority score and Hub score. The Hub score is calculated by collecting links from the nodes linked to a particular node and assigning a score based on the number of links received and from which nodes. This measure provides a better understanding of the relevance of a city in the network.

Betweenness Centrality is a measure of how often a shortest path between all possible connected nodes passes through a particular node [30, 43–46]. This measure gives us an understanding of the role of a node as a bridge between different parts of the network, facilitating the flow of people from one part of the network to another. A city with high Betweenness Centrality is a critical component of network connectivity, and a decrease in the Betweenness Centrality of a city could significantly impact the flow of people through that city.
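Of the metrics above, Weighted Personalized PageRank is the least standard, so a minimal sketch of the idea may help. The code below is a plain power iteration written for illustration, not the library routine used in the study; the damping factor of 0.85, the treatment of nodes with no outflow, the function name `personalized_pagerank`, and the four-node flow network (including the placeholder cities "A" and "B" and all flow values) are assumptions.

```python
def personalized_pagerank(weights, restart, damping=0.85, iters=200):
    """Random walk with restart on a weighted directed graph.

    weights[u][v]: flow of people from u to v (every node must be a key).
    restart: dict mapping node -> restart probability, summing to 1.
    """
    nodes = list(weights)
    rank = {u: restart.get(u, 0.0) for u in nodes}
    out_total = {u: sum(weights[u].values()) for u in nodes}
    for _ in range(iters):
        # With probability (1 - damping) the walker jumps back to the restart set.
        nxt = {u: (1.0 - damping) * restart.get(u, 0.0) for u in nodes}
        for u in nodes:
            if out_total[u] == 0:
                # Node with no outflow: return its mass to the restart set.
                for v in nodes:
                    nxt[v] += damping * rank[u] * restart.get(v, 0.0)
                continue
            # Otherwise the walker follows edges in proportion to their flow.
            for v, w in weights[u].items():
                nxt[v] += damping * rank[u] * w / out_total[u]
        rank = nxt
    return rank


# Hypothetical flow network; restarting at Wuhan ranks cities by how readily
# a walker leaving Wuhan reaches them.
flows = {
    "Wuhan": {"A": 80, "B": 20},
    "A": {"Beijing": 50, "Wuhan": 50},
    "B": {"Beijing": 10, "Wuhan": 90},
    "Beijing": {"A": 60, "B": 40},
}
ppr = personalized_pagerank(flows, restart={"Wuhan": 1.0})
```

Setting the restart distribution to a single city is what personalises the ranking; a uniform restart distribution would recover ordinary weighted PageRank.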

(f) Variable importance

(i) Principal component analysis (PCA)

Principal component analysis (PCA) allows us to analyse and visualize multivariate data. We used PCA to allow us to see the relative contribution of the different metapopulation network properties of each node (here city) relating to infection spread and help determine which properties to use in further statistical analyses. PCA was performed using the tool referenced in [47].

(ii) Machine learning algorithms

Because of the non-linear relationships between location properties and the time it took for 100 cases to reach Beijing (our response metric; see Results), we used two machine learning approaches to understand which variables are most important in predicting spread. We used gradient boosting regression tree (BRT) and random forest (RF) analyses, both of which are ensemble approaches. These approaches have different overfitting diagnosis and accuracy properties [48]. We only used the metrics that were not highly correlated (see Results): travel time, Weighted Personalized PageRank (Beijing flow), Weighted Personalized PageRank (Wuhan outflow), Population, Degree Centrality, PageRank, and Betweenness Centrality.

(g) Movement and interventions

An important parameter that we hypothesized would alter the epidemic transmission dynamics is the return rate, π, of the commuting population. To estimate its effect, we set up an experiment where we varied the return rate from 1/1 to 1/30.

Lastly, several analyses have looked at the impact of non-pharmaceutical interventions, such as ‘lockdowns’ [5, 8, 49]. In this study, we replicate these interventions to determine the necessary reduction in human flow in a data-driven model. We accomplish this by first reducing the daily population flow from Wuhan and measuring the time it takes to reach 100 cases in Beijing. Then, we reduce the daily population flow from Wuhan and the top five locations linked to Wuhan identified by Weighted Personalized PageRank (Wuhan outflow). Another possible experiment is to reduce the daily population flow from Wuhan and the top five locations identified by the Weighted Personalized PageRank value for Beijing flow, since it is the second most important metric determining the epidemic spread to Beijing, as one can infer from the BRT and RF analyses (see Results).
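One simple way to express such an intervention in code is to scale the relevant entries of the flow matrix before running the simulation. The sketch below is an assumed formulation rather than the authors' procedure: it follows the convention that ϵ_ij is the rate of travel from home city j to city i, so that the outflow from a city is a column of the matrix. The function name `reduce_outflow` and the three-city rates are hypothetical.

```python
def reduce_outflow(eps, sources, keep_fraction):
    """Return a copy of eps with travel out of each source city scaled down.

    eps[i][j] is the travel rate from home city j to destination i, so the
    outflow of city j is column j. keep_fraction = 0.1 means a 90% reduction.
    """
    n = len(eps)
    return [[eps[i][j] * (keep_fraction if j in sources else 1.0)
             for j in range(n)]
            for i in range(n)]


# Hypothetical three-city rates; city 0 plays the role of the restricted source.
eps = [[0.0, 0.02, 0.01],
       [0.04, 0.0, 0.03],
       [0.06, 0.05, 0.0]]
restricted = reduce_outflow(eps, sources={0}, keep_fraction=0.1)
```

Passing several indices in `sources` would represent restricting the source city together with its most strongly linked neighbours. This sketch scales only trips made by residents of the restricted cities; whether visits into those cities should also be scaled is a modelling choice that the text does not specify.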

3. Results

Our simulation of the model with the parameters listed in Table 1 gives similar estimates of the epidemic size and timing of the early stages of the COVID-19 outbreak in Beijing, reaching 100 cases in Beijing on day 60 [25–27] (Figure 3a).

Figure 3.

Beijing’s simulated epidemic size. Beijing’s initial epidemic size based on our metapopulation SEIR, taking 60 days to reach 100 local cases.

By employing parameters for a virus resembling SARS-CoV-2 and utilizing real transport flow data, our SEIR metapopulation model demonstrates that the spread occurs swiftly across the population with a significant peak of cases 186 days after the introduction of the disease, with a total of 5.33 million individuals infected in Beijing with no infection control measures (Figure 3b).

We show that in such a highly connected network, travel time from a location is the most important parameter that determines how fast the infection can spread. However, we also noted that in such a highly connected graph the impact of travel time on the speed of viral spread throughout the metapopulation diminishes as travel time increases (Figure 4). Our simulations show that, once the travel time from a location to Beijing exceeds a threshold, the time for Beijing to record 100 infections approaches an asymptote of approximately 74 days regardless of the travel time (Figure 4). These results were largely insensitive to changes in R₀ (see Figure 5).

Figure 4.

Time for Beijing to record 100 infections following infection introduction in each location in the simulated metapopulation SEIR model. Simulations start with 100 initial infections at each location. Locations within a travel time of less than 5 hours from Beijing deviate from the fit, which may be attributed to the fact that people move more frequently at shorter distances, whereas we assume a constant return rate (π) of 1/5 per day.

Figure 5.

This figure depicts the same data as shown in Figure 4a, but with varying values of R₀.

Principal component analysis (PCA) reveals that six PCs explained 99.45% of the data variation, with PC1 explaining 36.7% and PC2 29.35%. The PCs broadly separated travel and distance related factors from ranking statistics and population (PC1) or travel and distance related factors and centrality metrics from weighted rankings (PC2) (Figure 6). Euclidean distance to Beijing, travel time to Beijing from Google Maps, and travel time to Beijing from OpenStreetMap have positive loadings on PC1; these three metrics are correlated and have similar relationships, whereas Degree Centrality has a negative loading on PC1. Eigenvector Centrality, Betweenness Centrality, and Hubs are another set of non-unique features, so we can select one of them, such as Betweenness Centrality, for further analysis. PageRank is a weak feature in the analysis; however, network ranking statistics such as Weighted Personalized PageRanks and population have positive loadings on PC2. The seven unique metrics identified from the PCA were then used in the algorithms presented below to create the feature importance plot (Figure 7).

Figure 6.

Factors and network metrics that putatively affect the speed of an infection’s spread to Beijing. The Principal Component Analysis shows the relationship between the metrics among the network nodes. From the results of the PCA, we identified unique features, including travel time, degree centrality, betweenness centrality, population size, PageRank, Weighted Personalized PageRank (Wuhan outflow), and Weighted Personalized PageRank (Beijing flow). These unique features were used in machine learning methods to determine their variable importance. See text for details. The numbers in the bracket represent the loading magnitude for each feature.

Figure 7.

The most important factors affecting the speed of an infection’s spread to Beijing calculated using two different machine learning algorithms, gradient boosting regression trees (green) and random forest (blue).

Both machine learning analyses showed the same rankings. Both showed that travel time was the most important variable, but after travel time Weighted Personalized PageRank for Beijing flow was the most important in determining infection spread to Beijing, with population size next, then betweenness centrality.

Altering the return rate from 1/1 to 1/30 showed that after approximately 1/12, the return rate has limited impact on the spread of infection throughout the network, as evidenced by the asymptote in the bottom panel of Figure 8a. Kernel density estimation on the set of points with a derivative of days taken to record 100 infections with respect to 1/π greater than −1.0, −0.5, and −0.1 shows changing patterns, especially at −1.0. The resulting histograms are shown in Figure 8b.

Figure 8.

Return rate variation and its effect on infection spread. (a) Variation in return rate and response from simulations for the number of days to reach 100 infections and its asymptote values according to different derivative marks. (b) Kernel density estimation for the fitted values.

In this highly connected network, travel restrictions had to be severe to limit spread. Reducing flow from Wuhan by more than 70% led to just a 10 day increase in the time it took for Beijing to reach 100 cases (from 59 to 69 days) (Table 4). It required a 90% reduction before spread slowed substantially (a 21 day delay).

Table 4: Time in days for Beijing to record 100 infections, when we implement interventions at Wuhan. We changed the outflow from Wuhan from 100% to 1% and ran the simulations to see how much time it would take for Beijing to record 100 infections for both scenarios.

4. Discussion

We developed a data-driven metapopulation SEIR model to study the transmission dynamics of COVID-19-like diseases in highly connected countries like China and how different network properties impact infection spread. We found the network representing the metapopulation to be densely connected. The spread of SARS-CoV-2-like viruses is rapid throughout such highly connected networks, so much so that an asymptote is reached where travel times stop being important predictors of spread (Figure 4). Further, in such highly connected networks, we found transmission throughout a network when seeded into a location is largely insensitive to non-pharmaceutical interventions, unless human movement was severely restricted (Table 4).

The importance of connectivity for infection spread has previously been noted, including for SARS-CoV-2 [3, 4, 12, 50–52], but here we quantify the importance of specific network properties including data-driven human movement and travel times to infer connectivity. Our analyses are likely globally relevant, as evidenced by the very rapid spread of SARS-CoV-2 leading to the COVID-19 pandemic, once infection had escaped the city of Wuhan, despite China’s strict human movement restrictions [5, 6], which largely continued until recently. Indeed, Wu and colleagues suggested a 50% reduction in inter-city mobility would have a negligible effect on epidemic dynamics [3], and our work supports and extends that, showing only severely limited flow begins to limit spread (Table 4). Chinazzi and colleagues further demonstrated that travel quarantines introduced in Wuhan on 23 January 2020 only delayed the SARS-CoV-2 epidemic progression by 3 to 5 days within China, though it had a greater effect on international spread, presumably because international spread is more managed (e.g. via flights). Few other countries or regions succeeded in getting close to eliminating SARS-CoV-2 using non-pharmaceutical methods, with New Zealand, Australia, Hong Kong, and Singapore among the few [53]. For example, New Zealand, a small, geographically isolated country, greatly reduced domestic infection introductions through both massive reductions in international travel with quarantine and domestic travel restrictions until national vaccination campaigns reached successful outcomes [54]. However, these were only likely feasible (and possibly socially acceptable) with earlier, less transmissible variants of SARS-CoV-2 which have now been largely superseded [55, 56]. 
Notably, given China’s recent change from a “zero COVID-19” policy, our model predicted that without interventions, China could have 5.4 million infected people after 233 days using early variant (wild-type) epidemiological parameters. More infectious variants that are currently circulating would be likely to cause many more infections than this.

Our machine learning models found travel time to be the most important factor in determining spread, despite its impact plateauing after a point (Figure 4). We also show that once return rates fall within the incubation period of an infection, the transmission dynamics across the network change as commuters can return before infection occurs within the locations travelled to, as shown by the bimodal distributions of the derivative of days taken to record 100 infections with respect to 1/π in Figure 8b. After travel time, Weighted Personalized PageRank was most important, with greater importance than population size or Betweenness Centrality, providing evidence that human connectivity is a very strong driver of early infection spread. The regional spread of influenza infection was found to similarly correlate more closely with rates of movement of people to and from workplaces than with geographical distance in the United States, with a similar rapid decay of commuting up to around 100 km and a long tail of rare longer range flow [57].

Together, our work suggests that multiple approaches are needed to reduce infection transmission, because limiting movement alone among highly connected populations is ineffective for highly transmissible infections. Other non-pharmaceutical methods include contact tracing and isolation, which have varying degrees of success depending on the systems and infections (e.g. [58]), and pharmaceutical methods, which mostly comprise immunisation, or a combination of these (e.g. [59]). Immunisation, however, is pathogen specific and to date universal vaccines for infections such as influenza [60, 61] and coronaviruses [62] do not exist and novel infectious agents may emerge. Therefore, our work further highlights the need for “primary prevention” of infection emergence at the areas of high risk [63], rather than a “preparedness-response” approach that aims to limit spread in human populations after the emergence of novel infections [64].

Our analyses have several limitations. We used a deterministic model because we are modelling large populations, but stochasticity can be important, particularly for infection establishment and spread among smaller populations, which might be more relevant for understanding the initial phases of infectious disease emergence. Future analyses of early introduction dynamics and of smaller communities simulating the early introduction of infection could be interesting. Similar to the use of deterministic models, we assume homogeneous mixing within populations. However, structural factors such as age can impact transmission dynamics by altering attack rates (the total number of infected individuals) and the basic reproduction number (R₀), which represents the number of cases generated by a typical index case in a fully susceptible population [65]. There are numerous advances in modeling human mobility; however, studies increasingly show that patterns are generalisable across scales (i.e., within and between cities and countries) [66–69]. We also do not allow loss of immunity or the emergence of new escape variants, which would allow reinfection and alter the transmission dynamics over time, but this is less of a concern for our analysis as we are interested in the initial stages of spread [70]. One additional limitation is that human movement data are only available for the top 100 cities connected to each city in mainland China. Nevertheless, by using the 100 most connected cities for each of the 340 cities, our Baidu data cover 85–99.9% (median 92.87%) of movement among cities. Because of that, we believe our study provides a comprehensive analysis of the mobility patterns in this hyper-connected network.

5. Conclusions

Human movement is fundamental to our way of living and to infectious disease transmission [6, 57, 71]. Our data-driven metapopulation SEIR model of the transmission dynamics of COVID-19-like diseases in China shows that SARS-CoV-2-like viruses spread rapidly through such highly connected networks, so much so that an asymptote is reached beyond which travel time stops being an important predictor of spread. Travel time was nonetheless the most important factor in determining spread, despite its impact plateauing. The next most important factor was a network metric weighted by the movement of people, modelled here through the Weighted Personalised PageRank. Together, these results provide evidence that human connectivity is a driver of infection spread, which can inform future mitigation.
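For readers unfamiliar with the metric, a generic Weighted Personalised PageRank can be sketched as a random walk that follows edges in proportion to their movement weight and restarts at the source city. The code below is a minimal illustration under stated assumptions: the four-city `flows` matrix is hypothetical, the damping factor of 0.85 is a conventional default rather than a value taken from this study, and the exact formulation used in our Methods may differ in detail.

```python
from typing import List, Sequence


def weighted_personalised_pagerank(flows: Sequence[Sequence[float]],
                                   source: int, damping: float = 0.85,
                                   tol: float = 1e-12,
                                   max_iter: int = 1000) -> List[float]:
    """Power iteration for PageRank with edge weights and restart at `source`.

    flows[i][j] is the (assumed) movement weight from city i to city j.
    """
    n = len(flows)
    out_totals = [sum(row) for row in flows]
    rank = [1.0 / n] * n
    for _ in range(max_iter):
        new_rank = [0.0] * n
        for i in range(n):
            if out_totals[i] > 0:
                # Follow outgoing edges in proportion to movement weight.
                for j in range(n):
                    new_rank[j] += damping * rank[i] * flows[i][j] / out_totals[i]
            else:
                # City with no outflow: send its mass back to the source.
                new_rank[source] += damping * rank[i]
        # Restart (personalisation) step: always jump back to the source city.
        new_rank[source] += 1.0 - damping
        if sum(abs(a - b) for a, b in zip(new_rank, rank)) < tol:
            return new_rank
        rank = new_rank
    return rank


if __name__ == "__main__":
    # Hypothetical movement weights between four cities; city 0 is the source.
    flows = [
        [0, 80, 15, 5],
        [60, 0, 30, 10],
        [20, 30, 0, 50],
        [5, 10, 40, 0],
    ]
    for city, score in enumerate(weighted_personalised_pagerank(flows, source=0)):
        print(f"city {city}: {score:.4f}")
```

Cities that receive more movement from the source, directly or through intermediaries, score higher, which is the sense in which the metric captures connectivity to the outbreak origin.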

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

https://github.com/rejusam/SEIR_metapopulation_China.git

Acknowledgements

Massey University’s subscription to New Zealand eScience Infrastructure (NeSI) enabled us to use high-performance computing facilities.

A. Appendix 1

Ethics

Ethics approval was not required for this study.

Data Accessibility

https://github.com/rejusam/SEIR_metapopulation_China.git

Authors’ Contributions

R.S.J.: conceptualization, investigation, methodology, formal analysis, writing – original draft, data curation, software, validation, visualization; J.C.M.: conceptualization, formal analysis, investigation, writing – review and editing; R.L.M.: investigation, methodology, validation, writing – review and editing; D.T.S.H.: conceptualization, funding acquisition, investigation, methodology, project administration, writing – original draft.

Competing Interests

We declare no conflicts.

Funding

R.S.J., R.L.M. and D.T.S.H. were supported by Bryce Carmine and Anne Carmine (née Percival) through the Massey University Foundation (RM22688). D.T.S.H. was also supported by the Percival Carmine Chair in Epidemiology and Public Health and by Royal Society Te Apārangi (RDF-MAU1701). J.C.M. acknowledges start-up funding provided by La Trobe University, Australia.

References

[1] N. Zhu, D. Zhang, W. Wang, X. Li, B. Yang, J. Song, X. Zhao, B. Huang, W. Shi, R. Lu, et al., “A novel coronavirus from patients with pneumonia in China, 2019”, New England Journal of Medicine (2020).
[2] L. van Dorp et al., “Emergence of genomic diversity and recurrent mutations in SARS-CoV-2”, Infection, Genetics and Evolution 83, 104351 (2020).
[3] J. T. Wu, K. Leung, and G. M. Leung, “Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study”, The Lancet 395, 689–697 (2020).
[4] A. J. Tatem, D. J. Rogers, and S. I. Hay, “Global transport networks and infectious disease spread”, Advances in Parasitology 62, 293–343 (2006).
[5] H. Gibbs, Y. Liu, C. A. Pearson, C. I. Jarvis, C. Grundy, B. J. Quilty, C. Diamond, and R. M. Eggo, “Changing travel patterns in China during the early stages of the COVID-19 pandemic”, Nature Communications 11, 1–9 (2020).
[6] S. Chang, E. Pierson, P. W. Koh, J. Gerardin, B. Redbird, D. Grusky, and J. Leskovec, “Mobility network models of COVID-19 explain inequities and inform reopening”, Nature 589, 82–87 (2021).
[7] K. Leung, J. T. Wu, D. Liu, and G. M. Leung, “First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment”, The Lancet 395, 1382–1393 (2020).
[8] J. S. Jia, X. Lu, Y. Yuan, G. Xu, J. Jia, and N. A. Christakis, “Population flow drives spatio-temporal distribution of COVID-19 in China”, Nature 582, 389–394 (2020).
[9] V. Colizza and A. Vespignani, “Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: Theory and simulations”, Journal of Theoretical Biology 251, 450–467 (2008).
[10] D. Balcan, V. Colizza, B. Gonçalves, H. Hu, J. J. Ramasco, and A. Vespignani, “Multiscale mobility networks and the spatial spreading of infectious diseases”, Proceedings of the National Academy of Sciences 106, 21484–21489 (2009).
[11] N. Oliver et al., “Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle”, Science Advances 6, eabc0764 (2020).
[12] M. Chinazzi et al., “The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak”, Science 368, 395–400 (2020).
[13] World Health Organization et al., “WHO-convened global study of origins of SARS-CoV-2: China Part” (2021).
[14] Y. Liu, A. A. Gayle, A. Wilder-Smith, and J. Rocklöv, “The reproductive number of COVID-19 is higher compared to SARS coronavirus”, Journal of Travel Medicine (2020).
[15] S. Mwalili, M. Kimathi, V. Ojiambo, D. Gathungu, and R. Mbogo, “SEIR model for COVID-19 dynamics incorporating the environment and social distancing”, BMC Research Notes 13, 352 (2020).
[16] H. Xin, Y. Li, P. Wu, Z. Li, E. H. Y. Lau, Y. Qin, L. Wang, B. J. Cowling, T. K. Tsang, and Z. Li, “Estimating the Latent Period of Coronavirus Disease 2019 (COVID-19)”, Clinical Infectious Diseases 74, 1678–1681 (2022).
[17] T. Ma et al., “The latent period of coronavirus disease 2019 with SARS-CoV-2 B.1.617.2 Delta variant of concern in the postvaccination era”, Immunity, Inflammation and Disease 10, doi:10.1002/iid3.664 (2022).
[18] M. J. Keeling and P. Rohani, Modeling infectious diseases in humans and animals (Princeton University Press, 2008).
[19] Baidu Migration-Baidu Map Smart Eye, http://qianxi.baidu.com/ (visited on 11/27/2022).
[20] X. Jiang, W. Wei, S. Wang, T. Zhang, and C. Lu, “Effects of COVID-19 on Urban Population Flow in China”, International Journal of Environmental Research and Public Health 18, 1617 (2021).
[21] H. Tian et al., “An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China”, Science 368, 638–642 (2020).
[22] S. Sanche, Y. T. Lin, C. Xu, E. Romero-Severson, N. Hengartner, and R. Ke, “High Contagiousness and Rapid Spread of Severe Acute Respiratory Syndrome Coronavirus 2”, Emerging Infectious Diseases 26, 1470–1477 (2020).
[23] Z. Yuan, Y. Xiao, Z. Dai, J. Huang, Z. Zhang, and Y. Chen, “Modelling the effects of Wuhan’s lockdown during COVID-19, China”, Bulletin of the World Health Organization 98, 484–494 (2020).
[24] P. Zhu and X. Tan, “Evaluating the effectiveness of Hong Kong’s border restriction policy in reducing COVID-19 infections”, BMC Public Health 22, doi:10.1186/s12889-022-13234-5 (2022).
[25] C. Huang et al., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China”, The Lancet 395, 497–506 (2020).
[26] Z. Allam, “The First 50 days of COVID-19: A Detailed Chronological Timeline and Extensive Review of Literature Documenting the Pandemic”, Surveying the Covid-19 Pandemic and its Implications, 1–7 (2020).
[27] S. Tian et al., “Characteristics of COVID-19 infection in Beijing”, The Journal of Infection 80, 401–406 (2020).
[28] G. Boeing, “OSMnx: New methods for acquiring, constructing, analyzing, and visualizing complex street networks”, Computers, Environment and Urban Systems 65, 126–139 (2017).
[29] United Nations Office for the Coordination of Humanitarian Affairs Regional Office for Asia and the Pacific (ROAP), China - Subnational Administrative Boundaries - Humanitarian Data Exchange, https://data.humdata.org/dataset/cod-ab-chn (visited on 11/17/2022).
[30] J. Golbeck, Analyzing the social web, first edition (Morgan Kaufmann, Waltham, MA, 2013).
[31] M. J. Zaki and W. Meira, Data mining and analysis: fundamental concepts and algorithms (Cambridge University Press, New York, NY, 2014).
[32] C. F. A. Negre, U. N. Morzan, H. P. Hendrickson, R. Pal, G. P. Lisi, J. P. Loria, I. Rivalta, J. Ho, and V. S. Batista, “Eigenvector centrality for characterization of protein allosteric pathways”, Proceedings of the National Academy of Sciences 115, doi:10.1073/pnas.1810452115 (2018).
[33] S. Brin and L. Page, “The anatomy of a large-scale hypertextual Web search engine”, Computer Networks and ISDN Systems 30, 107–117 (1998).
[34] W.-C.-B. Chin and T.-H. Wen, “Geographically Modified PageRank Algorithms: Identifying the Spatial Concentration of Human Movement in a Geospatial Network”, PLOS ONE 10, edited by I. Sendiña-Nadal, e0139509 (2015).
[35] A. N. Langville and C. D. Meyer, “A Survey of Eigenvector Methods for Web Information Retrieval”, SIAM Review 47, 135–161 (2005).
[36] W. Xing and A. Ghorbani, “Weighted PageRank algorithm”, in Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004 (2004), pages 305–314.
[37] W. Jin, J. Jung, and U. Kang, “Supervised and extended restart in random walks for ranking and link prediction in networks”, PLOS ONE 14, edited by V. Grolmusz, e0213857 (2019).
[38] H. Tong, C. Faloutsos, and J.-Y. Pan, “Fast Random Walk with Restart and Its Applications”, in Sixth International Conference on Data Mining (ICDM’06) (2006), pages 613–622.
[39] H. Tong, C. Faloutsos, and J.-Y. Pan, “Random walk with restart: fast solutions and applications”, Knowledge and Information Systems 14, 327–346 (2008).
[40] W. Xie, D. Bindel, A. Demers, and J. Gehrke, “Edge-Weighted Personalized PageRank: Breaking A Decade-Old Performance Barrier”, in Proceedings of ACM KDD 2015 (2015).
[41] J. M. Kleinberg, “Authoritative sources in a hyperlinked environment”, Journal of the ACM 46, 604–632 (1999).
[42] A. N. Langville and C. D. Meyer, “A survey of eigenvector methods for web information retrieval”, SIAM Review 47, 135–161 (2005).
[43] L. C. Freeman, “A Set of Measures of Centrality Based on Betweenness”, Sociometry 40, 35 (1977).
[44] L. C. Freeman, D. Roeder, and R. R. Mulholland, “Centrality in social networks: II. Experimental results”, Social Networks 2, 119–141 (1979).
[45] M. E. J. Newman, “Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality”, Physical Review E 64, 016132 (2001).
[46] S. Wei and L. Wang, “Examining the population flow network in China and its implications for epidemic control based on Baidu migration data”, Humanities and Social Sciences Communications 7, 145 (2020).
[47] E. Taskesen, pca: A Python Package for Principal Component Analysis, version 1.8.4 (2020).
[48] J. Elith, J. R. Leathwick, and T. Hastie, “A working guide to boosted regression trees”, Journal of Animal Ecology 77, 802–813 (2008).
[49] J. Ge, D. He, Z. Lin, H. Zhu, and Z. Zhuang, “Four-tier response system and spatial propagation of COVID-19 in China by a network model”, Mathematical Biosciences 330, 108484 (2020).
[50] S. Lai, I. I. Bogoch, N. W. Ruktanonchai, A. Watts, X. Lu, W. Yang, H. Yu, K. Khan, and A. J. Tatem, “Assessing spread risk of COVID-19 within and beyond China in early 2020”, Data Science and Management 5, 212–218 (2022).
[51] D. Balcan, B. Gonçalves, H. Hu, J. J. Ramasco, V. Colizza, and A. Vespignani, “Modeling the spatial spread of infectious diseases: The GLobal Epidemic and Mobility computational model”, Journal of Computational Science 1, 132–145 (2010).
[52] X. Ding, S. Huang, A. Leung, and R. Rabbany, “Incorporating dynamic flight network in SEIR to model mobility between populations”, Applied Network Science 6, 1–24 (2021).
[53] C. De Foo, K. A. Grépin, A. R. Cook, L. Y. Hsu, M. Bartos, S. Singh, N. Asgari, Y. Y. Teo, D. L. Heymann, and H. Legido-Quigley, “Navigating from SARS-CoV-2 elimination to endemicity in Australia, Hong Kong, New Zealand, and Singapore”, The Lancet 398, 1547–1551 (2021).
[54] M. G. Baker, N. Wilson, and A. Anglemyer, “Successful elimination of Covid-19 transmission in New Zealand”, New England Journal of Medicine 383, e56 (2020).
[55] Z. Du et al., “Reproduction Numbers of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Variants: A Systematic Review and Meta-analysis”, Clinical Infectious Diseases 75, e293–e295 (2022).
[56] Y. Liu and J. Rocklöv, “The effective reproductive number of the Omicron variant of SARS-CoV-2 is several times relative to Delta”, Journal of Travel Medicine 29, taac037, doi:10.1093/jtm/taac037 (2022).
[57] C. Viboud, O. N. Bjørnstad, D. L. Smith, L. Simonsen, M. A. Miller, and B. T. Grenfell, “Synchrony, waves, and spatial hierarchies in the spread of influenza”, Science 312, 447–451 (2006).
[58] C.-E. Juneau, A.-S. Briand, P. Collazzo, U. Siebert, and T. Pueyo, “Effective contact tracing for COVID-19: A systematic review”, Global Epidemiology 5, 100103 (2023).
[59] A. M. Henao-Restrepo et al., “Efficacy and effectiveness of an rVSV-vectored vaccine in preventing Ebola virus disease: final results from the Guinea ring vaccination, open-label, cluster-randomised trial (Ebola Ça Suffit!)”, The Lancet 389, 505–518 (2017).
[60] R. Nachbagauer, J. Feser, A. Naficy, D. I. Bernstein, J. Guptill, E. B. Walter, F. Berlanda-Scorza, D. Stadlbauer, P. C. Wilson, T. Aydillo, et al., “A chimeric hemagglutinin-based universal influenza virus vaccine approach induces broad and long-lasting immunity in a randomized, placebo-controlled phase I trial”, Nature Medicine 27, 106–114 (2021).
[61] N. Pardi, J. M. Carreño, G. O’Dell, J. Tan, C. Bajusz, H. Muramatsu, W. Rijnink, S. Strohmeier, M. Loganathan, D. Bielak, et al., “Development of a pentavalent broadly protective nucleoside-modified mRNA vaccine against influenza B viruses”, Nature Communications 13, 4677 (2022).
[62] D. M. Morens, J. K. Taubenberger, and A. S. Fauci, “Universal Coronavirus Vaccines — An Urgent Need”, New England Journal of Medicine 386, 297–299 (2022).
[63] R. L. Muylaert, D. A. Wilkinson, T. Kingston, P. D’Odorico, M. C. Rulli, N. Galli, R. S. John, P. Alviola, and D. T. S. Hayman, Using drivers and transmission pathways to identify SARS-like coronavirus spillover risk hotspots, preprint (Ecology, 2022).
[64] One Health High Level Expert Panel, Prevention of zoonotic spillover (2023), https://www.who.int/publications/m/item/prevention-of-zoonotic-spillover (accessed 2023-04-03).
[65] D. Mistry, M. Litvinova, A. Pastore y Piontti, M. Chinazzi, L. Fumanelli, M. F. Gomes, S. A. Haque, Q.-H. Liu, K. Mu, X. Xiong, et al., “Inferring high-resolution human mixing patterns for disease modeling”, Nature Communications 12, 1–12 (2021).
[66] M. Schläpfer, L. Dong, K. O’Keeffe, P. Santi, M. Szell, H. Salat, S. Anklesaria, M. Vazifeh, C. Ratti, and G. B. West, “The universal visitation law of human mobility”, Nature 593, 522–527 (2021).
[67] V. Palchykov, M. Mitrović, H.-H. Jo, J. Saramäki, and R. K. Pan, “Inferring human mobility using communication patterns”, Scientific Reports 4, 1–6 (2014).
[68] X.-Y. Yan, W.-X. Wang, Z.-Y. Gao, and Y.-C. Lai, “Universal model of individual and population mobility on diverse spatial scales”, Nature Communications 8, 1–9 (2017).
[69] F. Simini, M. C. González, A. Maritan, and A.-L. Barabási, “A universal model for mobility and migration patterns”, Nature 484, 96–100 (2012).
[70] J. S. Lavine, O. N. Bjornstad, and R. Antia, “Immunological characteristics govern the transition of COVID-19 to endemicity”, Science 371, 741–745 (2021).
[71] A. Wesolowski, N. Eagle, A. J. Tatem, D. L. Smith, A. M. Noor, R. W. Snow, and C. O. Buckee, “Quantifying the Impact of Human Mobility on Malaria”, Science 338, 267–270 (2012).

Posted July 28, 2023.

Subject Area

Epidemiology


[1] ↵
N. Zhu, D. Zhang, W. Wang, X. Li, B. Yang, J. Song, X. Zhao, B. Huang, W. Shi, R. Lu, et al., “A novel coronavirus from patients with pneumonia in China, 2019”, New England journal of medicine (2020).

[2] ↵
L. van Dorp et al., “Emergence of genomic diversity and recurrent mutations in SARS-CoV-2”, Infection, Genetics and Evolution 83, 104351 (2020).
OpenUrl

[3] ↵
J. T. Wu, K. Leung, and G. M. Leung, “Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study”, The Lancet 395, 689–697 (2020).
OpenUrl CrossRef

[4] ↵
A. J. Tatem, D. J. Rogers, and S. I. Hay, “Global transport networks and infectious disease spread”, Advances in parasitology 62, 293–343 (2006).
OpenUrl CrossRef PubMed Web of Science

[5] ↵
H. Gibbs, Y. Liu, C. A. Pearson, C. I. Jarvis, C. Grundy, B. J. Quilty, C. Diamond, and R. M. Eggo, “Changing travel patterns in China during the early stages of the COVID-19 pandemic”, Nature communications 11, 1–9 (2020).
OpenUrl

[6] ↵
S. Chang, E. Pierson, P. W. Koh, J. Gerardin, B. Redbird, D. Grusky, and J. Leskovec, “Mobility network models of COVID-19 explain inequities and inform reopening”, Nature 589, 82–87 (2021).
OpenUrl CrossRef PubMed

[7] ↵
K. Leung, J. T. Wu, D. Liu, and G. M. Leung, “First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment”, The Lancet 395, 1382–1393 (2020).
OpenUrl CrossRef

[8] ↵
J. S. Jia, X. Lu, Y. Yuan, G. Xu, J. Jia, and N. A. Christakis, “Population flow drives spatio-temporal distribution of COVID-19 in China”, Nature 582, 389–394 (2020).
OpenUrl PubMed

[9] V. Colizza and A. Vespignani, “Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: Theory and simulations”, en, Journal of Theoretical Biology 251, 450–467 (2008).
OpenUrl CrossRef PubMed Web of Science

[10] D. Balcan, V. Colizza, B. Gonçalves, H. Hu, J. J. Ramasco, and A. Vespignani, “Multiscale mobility networks and the spatial spreading of infectious diseases”, en, Proceedings of the National Academy of Sciences 106, 21484–21489 (2009).
OpenUrl Abstract/FREE Full Text

[11] N. Oliver et al., “Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle”, en, Science Advances 6, eabc0764 (2020).
OpenUrl FREE Full Text

[12] ↵
M. Chinazzi et al., “The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak”, Science 368, 395–400 (2020).
OpenUrl Abstract/FREE Full Text

[13] ↵
World Health Organization et al., “WHO-convened global study of origins of SARS-CoV-2: China Part”, (2021).

[14] Y. Liu, A. A. Gayle, A. Wilder-Smith, and J. Rocklöv, “The reproductive number of COVID-19 is higher compared to SARS coronavirus”, Journal of travel medicine (2020).

[15] S. Mwalili, M. Kimathi, V. Ojiambo, D. Gathungu, and R. Mbogo, “SEIR model for COVID-19 dynamics incorporating the environment and social distancing”, en, BMC Research Notes 13, 352 (2020).
OpenUrl

[16] H. Xin, Y. Li, P. Wu, Z. Li, E. H. Y. Lau, Y. Qin, L. Wang, B. J. Cowling, T. K. Tsang, and Z. Li, “Estimating the Latent Period of Coronavirus Disease 2019 (COVID-19)”, en, Clinical Infectious Diseases 74, 1678–1681 (2022).
OpenUrl CrossRef

[17] T. Ma et al., “The latent period of coronavirus disease 2019 with SARS-CoV-2 B.1.617.2 Delta variant of concern in the postvaccination era”, en, Immunity, Inflammation and Disease 10, doi:10.1002/iid3.664 (2022).
OpenUrl CrossRef

[18] M. J. Keeling and P. Rohani, Modeling infectious diseases in humans and animals (Princeton University Press, 2008).

[19] ↵
Baidu Migration-Baidu Map Smart Eye, http://qianxi.baidu.com/ (visited on 11/27/2022).

[20] X. Jiang, W. Wei, S. Wang, T. Zhang, and C. Lu, “Effects of COVID-19 on Urban Population Flow in China”, International Journal of Environmental Research and Public Health 18, 1617 (2021).
OpenUrl

[21] H. Tian et al., “An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China”, Science 368, 638–642 (2020).
OpenUrl Abstract/FREE Full Text

[22] S. Sanche, Y. T. Lin, C. Xu, E. Romero-Severson, N. Hengartner, and R. Ke, “High Contagiousness and Rapid Spread of Severe Acute Respiratory Syndrome Coronavirus 2”, Emerging Infectious Diseases 26, 1470–1477 (2020).
OpenUrl PubMed

[23] Z. Yuan, Y. Xiao, Z. Dai, J. Huang, Z. Zhang, and Y. Chen, “Modelling the effects of Wuhan’s lockdown during COVID-19, China”, Bulletin of the World Health Organization 98, 484–494 (2020).
OpenUrl PubMed

[24] P. Zhu and X. Tan, “Evaluating the effectiveness of Hong Kong’s border restriction policy in reducing COVID-19 infections”, BMC Public Health 22, doi:10.1186/s12889-022-13234-5 (2022).
OpenUrl CrossRef

[25] ↵
C. Huang et al., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China”, eng, Lancet (London, England) 395, 497–506 (2020).
OpenUrl

[26] ↵
Z. Allam, “The First 50 days of COVID-19: A Detailed Chronological Timeline and Extensive Review of Literature Documenting the Pandemic”, Surveying the Covid-19 Pandemic and its Implications, 1–7 (2020).

[27] ↵
S. Tian et al., “Characteristics of COVID-19 infection in Beijing”, The Journal of Infection 80, 401–406 (2020).
OpenUrl CrossRef PubMed

[28] ↵
G. Boeing, “OSMnx: New methods for acquiring, constructing, analyzing, and visualizing complex street networks”, en, Computers, Environment and Urban Systems 65, 126–139 (2017).
OpenUrl

[29] ↵
U. N. O. for the Coordination of Humanitarian Affairs Regional Office for Asia and the Pacific (ROAP), China - Subnational Administrative Boundaries - Humanitarian Data Exchange, https://data.humdata.org/dataset/cod-ab-chn (visited on 11/17/2022).

[30] ↵
J. Golbeck, Analyzing the social web, First edition (Morgan Kaufmann is an imprint of Elsevier, Waltham, MA, 2013).

[31] M. J. Zaki and W. Meira, Data mining and analysis: fundamental concepts and algorithms (Cambridge University Press, New York, NY, 2014).

[32] ↵
C. F. A. Negre, U. N. Morzan, H. P. Hendrickson, R. Pal, G. P. Lisi, J. P. Loria, I. Rivalta, J. Ho, and V. S. Batista, “Eigenvector centrality for characterization of protein allosteric pathways”, en, Proceedings of the National Academy of Sciences 115, doi:10.1073/pnas.1810452115 (2018).
OpenUrl Abstract/FREE Full Text

[33] ↵
S. Brin and L. Page, “The anatomy of a large-scale hypertextual Web search engine”, en, Computer Networks and ISDN Systems 30, 107–117 (1998).
OpenUrl CrossRef Web of Science

[34] ↵
I. Sendiña-Nadal
W.-C.-B. Chin and T.-H. Wen, “Geographically Modified PageRank Algorithms: Identifying the Spatial Concentration of Human Movement in a Geospatial Network”, en, PLOS ONE 10, edited by I. Sendiña-Nadal, e0139509 (2015).
OpenUrl

[35] I. Sendiña-Nadal

[36] ↵
A. N. Langville and C. D. Meyer, “A Survey of Eigenvector Methods for Web Information Retrieval”, SIAM Review 47, 135–161 (2005).
OpenUrl

[37] ↵
W. Xing and A. Ghorbani, “Weighted PageRank algorithm”, in Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004. (2004), pages 305–314.

[38] ↵
V. Grolmusz
W. Jin, J. Jung, and U. Kang, “Supervised and extended restart in random walks for ranking and link prediction in networks”, en, PLOS ONE 14, edited by V. Grolmusz, e0213857 (2019).
OpenUrl

[39] V. Grolmusz

[40] H. Tong, C. Faloutsos, and J.-y. Pan, “Fast Random Walk with Restart and Its Applications”, in Sixth International Conference on Data Mining (ICDM’06) (2006), pages 613–622.

[41] ↵
H. Tong, C. Faloutsos, and J.-Y. Pan, “Random walk with restart: fast solutions and applications”, en, Knowledge and Information Systems 14, 327–346 (2008).
OpenUrl

[42] ↵
W. Xie, D. Bindel, A. Demers, and J. Gehrke, “Edge-Weighted Personalized PageRank: Breaking A Decade-Old Performance Barrier”, in Proceedings of ACM KDD 2015 (2015).

[43] ↵
J. M. Kleinberg, “Authoritative sources in a hyperlinked environment”, en, Journal of the ACM 46, 604–632 (1999).
OpenUrl CrossRef Web of Science

[44] ↵
A. N. Langville and C. D. Meyer, “A survey of eigenvector methods for web information retrieval”, SIAM review 47, 135–161 (2005).
OpenUrl

[45] ↵
L. C. Freeman, “A Set of Measures of Centrality Based on Betweenness”, Sociometry 40, 35 (1977).
OpenUrl CrossRef Web of Science

[46] L. C. Freeman, D. Roeder, and R. R. Mulholland, “Centrality in social networks: ii. experimental results”, en, Social Networks 2, 119–141 (1979).
OpenUrl CrossRef

[47] M. E. J. Newman, “Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality”, en, Physical Review E 64, 016132 (2001).
OpenUrl CrossRef

[48] ↵
S. Wei and L. Wang, “Examining the population flow network in China and its implications for epidemic control based on Baidu migration data”, en, Humanities and Social Sciences Communications 7, 145 (2020).
OpenUrl

[49] ↵
E. Taskesen, pca: A Python Package for Principal Component Analysis. Version 1.8.4, 2020. 15

[50] ↵
J. Elith, J. R. Leathwick, and T. Hastie, “A working guide to boosted regression trees”, Journal of animal ecology 77, 802–813 (2008).
OpenUrl CrossRef PubMed Web of Science

[51] ↵
J. Ge, D. He, Z. Lin, H. Zhu, and Z. Zhuang, “Four-tier response system and spatial propagation of COVID-19 in China by a network model”, Mathematical Biosciences 330, 108484 (2020).
OpenUrl

[52] ↵
S. Lai, I. I. Bogoch, N. W. Ruktanonchai, A. Watts, X. Lu, W. Yang, H. Yu, K. Khan, and A. J. Tatem, “Assessing spread risk of COVID-19 within and beyond China in early 2020”, en, Data Science and Management 5, 212–218 (2022).
OpenUrl

[53] D. Balcan, B. Gonçalves, H. Hu, J. J. Ramasco, V. Colizza, and A. Vespignani, “Modeling the spatial spread of infectious diseases: The GLobal Epidemic and Mobility computational model”, Journal of Computational Science 1, 132–145 (2010).
OpenUrl

[54] ↵
X. Ding, S. Huang, A. Leung, and R. Rabbany, “Incorporating dynamic flight network in SEIR to model mobility between populations”, Applied Network Science 6, 1–24 (2021).
OpenUrl

[55] ↵
C. De Foo, K. A. Grépin, A. R. Cook, L. Y. Hsu, M. Bartos, S. Singh, N. Asgari, Y. Y. Teo, D. L. Heymann, and H. Legido-Quigley, “Navigating from SARS-CoV-2 elimination to endemicity in Australia, Hong Kong, New Zealand, and Singapore”, The Lancet 398, 1547–1551 (2021).
OpenUrl

[56] ↵
M. G. Baker, N. Wilson, and A. Anglemyer, “Successful elimination of Covid-19 transmission in New Zealand”, New England Journal of Medicine 383, e56 (2020).
OpenUrl CrossRef PubMed

[57] ↵
Z. Du et al., “Reproduction Numbers of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Variants: A Systematic Review and Meta-analysis”, Clinical Infectious Diseases 75, e293–e295 (2022).
OpenUrl CrossRef

[58] ↵
Y. Liu and J. Rocklöv, “The effective reproductive number of the Omicron variant of SARS-CoV-2 is several times relative to Delta”, Journal of Travel Medicine 29, taac037. 10.1093/jtm/taac037 (2022).
OpenUrl

[59] ↵
C. Viboud, O. N. Bjørnstad, D. L. Smith, L. Simonsen, M. A. Miller, and B. T. Grenfell, “Synchrony, waves, and spatial hierarchies in the spread of influenza”, science 312, 447–451 (2006).
OpenUrl Abstract/FREE Full Text

[60] ↵
C.-E. Juneau, A.-S. Briand, P. Collazzo, U. Siebert, and T. Pueyo, “Effective contact tracing for COVID-19: A systematic review”, Global Epidemiology 5, 100103 (2023).
OpenUrl

[61] ↵
A. M. Henao-Restrepo et al., “Efficacy and effectiveness of an rVSV-vectored vaccine in preventing Ebola virus disease: final results from the Guinea ring vaccination, open-label, cluster-randomised trial (Ebola Ça Suffit!)”, The Lancet 389, 505–518 (2017).
OpenUrl CrossRef

[62] ↵
R. Nachbagauer, J. Feser, A. Naficy, D. I. Bernstein, J. Guptill, E. B. Walter, F. Berlanda-Scorza, D. Stadlbauer, P. C. Wilson, T. Aydillo, et al., “A chimeric hemagglutinin-based universal influenza virus vaccine approach induces broad and long-lasting immunity in a randomized, placebo-controlled phase I trial”, Nature medicine 27, 106–114 (2021).
OpenUrl

[63] ↵
N. Pardi, J. M. Carreño, G. O’Dell, J. Tan, C. Bajusz, H. Muramatsu, W. Rijnink, S. Strohmeier, M. Loganathan, D. Bielak, et al., “Development of a pentavalent broadly protective nucleoside-modified mRNA vaccine against influenza B viruses”, Nature Communications 13, 4677 (2022).
OpenUrl

[64] ↵
D. M. Morens, J. K. Taubenberger, and A. S. Fauci, “Universal Coronavirus Vaccines — An Urgent Need”, New England Journal of Medicine 386, 297–299 (2022).
OpenUrl

[65] ↵
R. L. Muylaert, D. A. Wilkinson, T. Kingston, P. D’Odorico, M. C. Rulli, N. Galli, R. S. John, P. Alviola, and D. T. S. Hayman, Using drivers and transmission pathways to identify SARS-like coronavirus spillover risk hotspots, en, preprint (Ecology, 2022).

[66] ↵
One Health High Level Expert Panel, Prevention of zoonotic spillover, Accessed: 2023-04-03, (2023) https://www.who.int/publications/m/item/prevention-of-zoonotic-spillover.

[67] ↵
D. Mistry, M. Litvinova, A. Pastore y Piontti, M. Chinazzi, L. Fumanelli, M. F. Gomes, S. A. Haque, Q.-H. Liu, K. Mu, X. Xiong, et al., “Inferring high-resolution human mixing patterns for disease modeling”, Nature communications 12, 1–12 (2021).
OpenUrl

[68] ↵
M. Schläpfer, L. Dong, K. O’Keeffe, P. Santi, M. Szell, H. Salat, S. Anklesaria, M. Vazifeh, C. Ratti, and G. B. West, “The universal visitation law of human mobility”, Nature 593, 522–527 (2021).
OpenUrl

[69] V. Palchykov, M. Mitrović, H.-H. Jo, J. Saramäki, and R. K. Pan, “Inferring human mobility using communication patterns”, Scientific reports 4, 1–6 (2014).
OpenUrl

[70] X.-Y. Yan, W.-X. Wang, Z.-Y. Gao, and Y.-C. Lai, “Universal model of individual and population mobility on diverse spatial scales”, Nature communications 8, 1–9 (2017).
OpenUrl

[71] ↵
F. Simini, M. C. González, A. Maritan, and A.-L. Barabási, “A universal model for mobility and migration patterns”, Nature 484, 96–100 (2012).
OpenUrl CrossRef PubMed Web of Science

[72] ↵
J. S. Lavine, O. N. Bjornstad, and R. Antia, “Immunological characteristics govern the transition of COVID-19 to endemicity”, Science 371, 741–745 (2021).
OpenUrl Abstract/FREE Full Text

[73] ↵
A. Wesolowski, N. Eagle, A. J. Tatem, D. L. Smith, A. M. Noor, R. W. Snow, and C. O. Buckee, “Quantifying the Impact of Human Mobility on Malaria”, Science 338, 267–270 (2012).
OpenUrl Abstract/FREE Full Text
