Summary
Background Human mobility is expected to be a critical factor in the geographic diffusion of infectious diseases, and this assumption led to the implementation of social distancing policies during the early fight against the COVID-19 emergency in the United States. Yet, because of substantial data gaps in the past, what still eludes our understanding are the following questions: 1) How does mobility contribute to the spread of infection within the United States at local, regional, and national scales? 2) How do seasonality and shifts in behavior affect mobility over time? 3) At what geographic level is mobility homogeneous across the United States? Addressing these questions is critical to developing accurate transmission models, predicting the spatial propagation of disease across scales, and understanding the optimal geographical and temporal scale for the implementation of control policies.
Methods We address this problem using high-resolution human mobility data measured via mobile app usage. We compute the daily connectivity network between US counties to understand the spatial clustering and temporal stability of mobility patterns. We then integrate our mobility data into a spatially explicit transmission model to reproduce the national invasion of the first wave of SARS-CoV-2 in the US, and characterize the impact of the spatio-temporal scale of mobility data on disease predictions.
Findings Temporally, we observe that intercounty connectivity is annually stable, and was unperturbed by mobility restrictions during the early phase of the COVID-19 pandemic, despite significant changes in overall activity. Spatially, we identify 104 geographic clusters of US counties that are highly connected by mobility within the cluster and more sparsely connected to counties outside the cluster. Together, these results suggest that intercounty connectivity in the US is relatively static across time and is highly connected at the sub-state level. We find that the stability in temporal patterns allows static mobility data to effectively capture infection dynamics. On the other hand, spatial uniformity at the sub-state (cluster)-scale does not capture spatial dynamics; instead, mobility data at the county-scale is necessary to better predict spatial disease diffusion.
Interpretation Our work demonstrates that intercounty mobility was negligibly affected out-side the lockdown period of Spring 2020, explaining the broad spatial distribution of COVID-19 outbreaks in the US during the early phase of the pandemic. Such geographically dispersed outbreaks place a significant strain on national public health resources and necessitate complex metapopulation modeling approaches for predicting disease dynamics and control design. We thus inform the design of such metapopulation models to balance high disease predictability with low data requirements.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under award number R01GM123007.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
the project has been evaluated and exempted by the Institutional Review Board, Georgetown University. The authors thank the teams at Safegraph/Advan Patterns for sharing mobility data. Mobility data were openly available to the public before the initiation of the study here:https://www.safegraph.com/ public health data are openly available here: https://covid.cdc.gov/covid-data-tracker/#datatracker-home
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
The manuscript was revised to account for the heterogeneity in the reporting rates by US county. In this new version, observed incidence and time of arrivals have been corrected, taking into account such heterogeneity. In the previous version, reporting rates were assumed to be uniform across all US counties and equal to the national average. Consequently, there have been corresponding changes in the simulation results.
Data Availability
mobility data were openly available to the public before the initiation of the study here:https://www.safegraph.com/ public health data are openly available here: https://covid.cdc.gov/covid-data-tracker/#datatracker-home All data produced by data analysis and model simulations in the present work will soon be available on Github, here: https://github.com/GiuliaPullano/USA_first_wave_COVID_mobility.
https://github.com/GiuliaPullano/USA_first_wave_COVID_mobility