Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

On Machine Learning-Based Short-Term Adjustment of Epidemiological Projections of COVID-19 in US

View ORCID ProfileSarah Kefayati, Fred Roberts, Sayali Pethe, Xuan Liu, Hu Huang, Vishrawas Gopalakrishnan, Piyush Madan, Jianying Hu, Prithwish Chakraborty, Raman Srinivasan, Ajay Deshpande, Gretchen Jackson
doi: https://doi.org/10.1101/2020.09.11.20180521
Sarah Kefayati
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sarah Kefayati
  • For correspondence: Sarah.kefayati{at}ibm.com
Fred Roberts
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sayali Pethe
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xuan Liu
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hu Huang
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vishrawas Gopalakrishnan
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Piyush Madan
2IBM Research, Yorktown Heights, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jianying Hu
2IBM Research, Yorktown Heights, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Prithwish Chakraborty
2IBM Research, Yorktown Heights, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raman Srinivasan
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ajay Deshpande
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gretchen Jackson
1IBM Watson Health, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Epidemiological models have provided valuable information for the outlook of COVID-19 pandemic and relative impact of different mitigation scenarios. However, more accurate forecasts are often needed at near term for planning and staffing. We present our early results from a systemic analysis of short-term adjustment of epidemiological modeling of COVID 19 pandemic in US during March-April 2020. Our analysis includes the importance of various types of features for short term adjustment of the predictions. In addition, we explore the potential of data augmentation to address the data limitation for an emerging pandemic. Following published literature, we employ data augmentation via clustering of regions and evaluate a number of clustering strategies to identify early patterns from the data.

From our early analysis, we used CovidActNow as our underlying epidemiological model and found that the most impactful features for the one-day prediction horizon are population density, workers in commuting flow, number of deaths in the day prior to prediction date, and the autoregressive features of new COVID-19 cases from three previous dates of the prediction. Interestingly, we also found that counties clustered with New York County resulted in best preforming model with maximum of R2= 0.90 and minimum of R2=0.85 for state-based and COVID-based clustering strategy, respectively.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

No external funding was received.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The IRB exemption decision for this study was ruled by Western Institutional Review Board per below: "We determined this study is exempt from IRB review because it does not meet the definition of human subject research as defined in 45 CFR 46.102. Specifically, this project involves analysis of data from publicly available datasets and deidentified private datasets. The research activities do not involve human subjects, because the activities do not involve interaction or intervention with the subjects. Additionally, the investigator will not be able to readily ascertain the identity of any of the human subjects whose data is used in this project."

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • robertsf{at}us.ibm.com, sayali.pethe{at}ibm.com, xuanliu{at}us.ibm.com, hu.huang{at}ibm.com, vishrawas.gopalakrishnan1{at}ibm.com, piyush.madan1{at}ibm.com, jyhu{at}us.ibm.com, prithwish.chakraborty{at}ibm.com, rsrin{at}us.ibm.com, ajayd{at}us.ibm.com, gretchen.jackson{at}ibm.com

Data Availability

The data used in model building are all publicly available data except comorbidity data that is IBM proprietary.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted September 13, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
On Machine Learning-Based Short-Term Adjustment of Epidemiological Projections of COVID-19 in US
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
On Machine Learning-Based Short-Term Adjustment of Epidemiological Projections of COVID-19 in US
Sarah Kefayati, Fred Roberts, Sayali Pethe, Xuan Liu, Hu Huang, Vishrawas Gopalakrishnan, Piyush Madan, Jianying Hu, Prithwish Chakraborty, Raman Srinivasan, Ajay Deshpande, Gretchen Jackson
medRxiv 2020.09.11.20180521; doi: https://doi.org/10.1101/2020.09.11.20180521
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
On Machine Learning-Based Short-Term Adjustment of Epidemiological Projections of COVID-19 in US
Sarah Kefayati, Fred Roberts, Sayali Pethe, Xuan Liu, Hu Huang, Vishrawas Gopalakrishnan, Piyush Madan, Jianying Hu, Prithwish Chakraborty, Raman Srinivasan, Ajay Deshpande, Gretchen Jackson
medRxiv 2020.09.11.20180521; doi: https://doi.org/10.1101/2020.09.11.20180521

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)