Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Survey data yields improved estimates of test-confirmed COVID-19 cases when rapid at-home tests were massively distributed in the United States

View ORCID ProfileMauricio Santillana, Ata A. Uslu, Tamanna Urmi, Alexi Quintana, James N. Druckman, Katherine Ognyanova, Matthew Baum, Roy H. Perlis, David Lazer
doi: https://doi.org/10.1101/2024.05.21.24307697
Mauricio Santillana
1Machine Intelligence Group for the betterment of Health and the Environment, Northeastern University, Boston, Massachusetts
2Department of Epidemiology, Harvard T.H. Chan School of Public Health, Massachusetts
3Network Science Institute, Northeastern University, Boston, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mauricio Santillana
  • For correspondence: msantill{at}g.harvard.edu
Ata A. Uslu
3Network Science Institute, Northeastern University, Boston, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tamanna Urmi
1Machine Intelligence Group for the betterment of Health and the Environment, Northeastern University, Boston, Massachusetts
3Network Science Institute, Northeastern University, Boston, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexi Quintana
3Network Science Institute, Northeastern University, Boston, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James N. Druckman
4Department of Political Science, University of Rochester, Rochester, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Katherine Ognyanova
5School of Communication and Information, Rutgers University, New Brunswick, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew Baum
6Harvard University’s John F. Kennedy School of Government and Department of Government, Cambridge, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Roy H. Perlis
7Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Lazer
3Network Science Institute, Northeastern University, Boston, Massachusetts
8Department of Political Science, Northeastern University, Boston, Massachusetts
9Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts
10Institute for Quantitative Social Science, Harvard University, Cambridge, Massachusetts
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Importance Identifying and tracking new infections during an emerging pandemic is crucial to design and deploy interventions to protect populations and mitigate its effects, yet it remains a challenging task.

Objective To characterize the ability of non-probability online surveys to longitudinally estimate the number of COVID-19 infections in the population both in the presence and absence of institutionalized testing.

Design Internet-based non-probability surveys were conducted, using the PureSpectrum survey vendor, approximately every 6 weeks between April 2020 and January 2023. They collected information on COVID-19 infections with representative state-level quotas applied to balance age, gender, race and ethnicity, and geographic distribution. Data from this survey were compared to institutional case counts collected by Johns Hopkins University and wastewater surveillance data for SARS-CoV-2 from Biobot Analytics.

Setting Population-based online non-probability survey conducted for a multi-university consortium —the Covid States Project.

Participants Residents of age 18+ across 50 US states and the District of Columbia in the US.

Main Outcomes and Measures The main outcomes are: (a) survey-weighted estimates of new monthly confirmed COVID-19 cases in the US from January 2020 to January 2023, and (b) estimates of uncounted test-confirmed cases, from February 1, 2022, to January 1, 2023. These are compared to institutionally reported COVID-19 infections and wastewater viral concentrations.

Results The survey spanned 17 waves deployed from June 2020 to January 2023, with a total of 408,515 responses from 306,799 respondents with mean age 42.8 (STD 13) years; 202,416 (66%) identified as women, and 104,383 (34%) as men. A total of 16,715 (5.4%) identified as Asian, 33,234 (10.8%) as Black, 24,938 (8.1%) as Hispanic, 219,448 (71.5%) as White, and 12,464 (4.1%) as another race. Overall, 64,946 respondents (15.9%) self-reported a test-confirmed COVID-19 infection. National survey-weighted test-confirmed COVID-19 estimates were strongly correlated with institutionally reported COVID-19 infections (Pearson correlation of r=0.96; p=1.8 e-12) from April 2020 to January 2022 (50-state correlation average of r=0.88, SD = 0.073). This was before the government-led mass distribution of at-home rapid tests. Following January 2022, correlation was diminished and no longer statistically significant (r=0.55, p=0.08; 50-state correlation average of r=0.48, SD = 0.227). In contrast, survey COVID-19 estimates correlated highly with SARS-CoV-2 viral concentrations in wastewater both before (r=0.92; p=2.2e-09) and after (r=0.89; p=2.3e-04) January 2022. Institutionally reported COVID-19 cases correlated (r = 0.79, p=1.10e-05) with wastewater viral concentrations before January 2022, but poorly (r = 0.31, p=0.35) after, suggesting both survey and wastewater estimates may have better captured test-confirmed COVID-19 infections after January 2022. Consistent correlation patterns were observed at the state-level. Based on national-level survey estimates, approximately 54 million COVID-19 cases were unaccounted for in official records between January 2022 and January 2023.

Conclusions and Relevance Non-probability survey data can be used to estimate the temporal evolution of test-confirmed infections during an emerging disease outbreak. Self-reporting tools may enable government and healthcare officials to implement accessible and affordable at-home testing for efficient infection monitoring in the future.

Trial Registration NA

Question Can non-probability survey data accurately track institutionally confirmed COVID-19 cases in the United States, and provide estimates of unaccounted infections when rapid at-home tests are popularized and institutionalized tests are discontinued?

Findings The proportion of individuals reporting a positive COVID-19 infection in a longitudinal non-probability survey closely tracked the institutionally reported proportions in the US, and nationally-aggregated wastewater SARS-CoV-2 viral concentrations, from April 2020 to February 2022. Survey estimates suggest that a high number of confirmed infections may have been unaccounted for in official records starting in February 2022, when large-scale distribution of rapid at-home tests occurred. This is further confirmed by viral concentrations in wastewater.

Meaning Non-probability online surveys can serve as an effective complementary method to monitor infections during an emerging pandemic. They provide an alternative for estimating infections in the absence of institutional testing when at-home tests are widely available. Longitudinal surveys have the potential to guide real-time decision-making in future public health crises.

Competing Interest Statement

Dr. Santillana has received institutional research funds from the Johnson and Johnson foundation, from Janssen global public health, and from Pfizer Pharmaceuticals, Inc. Dr. Perlis serves as a scientific advisor to Genomind, Vault Health, Psy Therapeutics, Circular Genomics, Swan AI Studios, Belle AI, and Mila Health.

Funding Statement

Dr. Santillana has been funded (in part) by contracts 200-2016-91779 and cooperative agreement CDC-RFA-FT-23-0069 with the Centers for Disease Control and Prevention (CDC). The findings, conclusions, and views expressed are those of the author(s) and do not necessarily represent the official position of the CDC. Dr. Santillana was also partially supported by the National Institute of General Medical Sciences of the National Institutes of Health under award number R01GM130668. Drs Ognyanova, Lazer, and Baum were supported by the National Science Foundation. Dr. Perlis was supported by National Institute of Mental Health award number RF132335

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study was determined to be exempt by the Institutional Review Board of Harvard University; all participants signed consent online prior to survey access.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • A typo in the results section

Data Availability

Data are available upon reasonable request.

https://www.covidstates.org/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted August 25, 2024.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Survey data yields improved estimates of test-confirmed COVID-19 cases when rapid at-home tests were massively distributed in the United States
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Survey data yields improved estimates of test-confirmed COVID-19 cases when rapid at-home tests were massively distributed in the United States
Mauricio Santillana, Ata A. Uslu, Tamanna Urmi, Alexi Quintana, James N. Druckman, Katherine Ognyanova, Matthew Baum, Roy H. Perlis, David Lazer
medRxiv 2024.05.21.24307697; doi: https://doi.org/10.1101/2024.05.21.24307697
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Survey data yields improved estimates of test-confirmed COVID-19 cases when rapid at-home tests were massively distributed in the United States
Mauricio Santillana, Ata A. Uslu, Tamanna Urmi, Alexi Quintana, James N. Druckman, Katherine Ognyanova, Matthew Baum, Roy H. Perlis, David Lazer
medRxiv 2024.05.21.24307697; doi: https://doi.org/10.1101/2024.05.21.24307697

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)