Abstract
In 2023, cholera affected approximately 1 million people and caused more than 5000 deaths globally, predominantly in low-income and conflict settings. In recent years, the number of new cholera outbreaks has grown rapidly. Further, ongoing cholera outbreaks have been exacerbated by conflict, climate change, and poor infrastructure, resulting in prolonged crises. As a result, the demand for treatment and intervention is quickly outpacing existing resource availability. Prior to improved water and sanitation systems, cholera, a disease primarily transmitted via contaminated water sources, also routinely ravaged high-income countries. Crumbling infrastructure and climate change are now putting new locations at risk — even in high-income countries. Thus, understanding the transmission and prevention of cholera is critical.
Combating cholera requires multiple interventions, the two most common being behavioral education and water treatment. Two-dose oral cholera vaccination (OCV) is often used as a complement to these interventions. Due to limited supply, countries have recently switched to single-dose vaccines (OCV1). One challenge lies in understanding where to allocate OCV1 in a timely manner, especially in settings lacking well-resourced public health surveillance systems. As cholera occurs and propagates in such locations, timely, accurate, and openly accessible outbreak data are typically inaccessible for disease modeling and subsequent decision-making.
In this study, we demonstrated the value of open-access data to rapidly estimate cholera transmission and vaccine effectiveness. Specifically, we obtained non-machine readable (NMR) epidemic curves for recent cholera outbreaks in two countries, Haiti and Cameroon, from figures published in situation and disease outbreak news reports. We used computational digitization techniques to derive weekly counts of cholera cases, resulting in nominal differences when compared against the reported cumulative case counts (i.e., a relative error rate of 5.67% in Haiti and 0.54% in Cameroon). Given these digitized time series, we leveraged EpiEstim—an open-source modeling platform—to derive rapid estimates of time-varying disease transmission via the effective reproduction number (Rt). To compare OCV1 effectiveness in the two considered countries, we additionally used VaxEstim, a recent extension of EpiEstim that facilitates the estimation of vaccine effectiveness via the relation among three inputs: the basic reproduction number (R0), Rt, and vaccine coverage. Here, with Haiti and Cameroon as case studies, we demonstrated the first implementation of VaxEstim in low-resource settings. Importantly, we are the first to use VaxEstim with digitized data rather than traditional epidemic surveillance data.
In the initial phase of the outbreak, weekly rolling average estimates of Rt were elevated in both countries: 2.60 in Haiti [95% credible interval: 2.42-2.79] and 1.90 in Cameroon [1.14-2.95]. These values are largely consistent with previous estimates of R0 in Haiti, where average values have ranged from 1.06 to 3.72, and in Cameroon, where average values have ranged from 1.10 to 3.50. In both Haiti and Cameroon, this initial period of high transmission preceded a longer period during which Rt oscillated around the critical threshold of 1. Our results derived from VaxEstim suggest that Haiti had higher OCV1 effectiveness than Cameroon (75.32% effective [54.00-86.39%] vs. 54.88% [18.94-84.90%]). These estimates of OCV1 effectiveness are generally aligned with those derived from field studies conducted in other countries. Thus, our case study reinforces the validity of VaxEstim as an alternative to costly, time-consuming field studies of OCV1 effectiveness. Indeed, prior work in South Sudan, Bangladesh, and the Democratic
Republic of the Congo reported OCV1 effectiveness ranging from approximately 40% to 80%.
This work underscores the value of combining NMR sources of outbreak case data with computational techniques and the utility of VaxEstim for rapid, inexpensive estimation of vaccine effectiveness in data-poor outbreak settings.
Competing Interest Statement
ENH reports contracts from the Global Listening Project and Council on Foreign Relations outside of the submitted work. All other authors declare no competing interests.
Funding Statement
This research was supported in part by the National Institute of General Medical Sciences, National Institutes of Health (R35GM146974), the National Science Foundation (SES2200228 & IIS2229881), and the Eric & Wendy Schmidt Center of the MIT-Harvard Broad Institute. The funding sources had no involvement in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the paper for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Disclaimer: This work is not yet finalized and will be continuously updated. We welcome feedback and suggestions on the analyses and interpretation and any meaningful additions.
We have updated Figure 1 substantially to add other relevant events during the outbreaks, and have modified the readability of the other figures; we have emphasized the use case of this work and why it is novel throughout the text; we have added additional contextual references to the abstract; we have emphasized which elements are generated through our digitization process throughout the text (and added a new figure to the supplemental appendix); we have made edits to the estimates throughout the text by updating our code; these edits were largely all within a decimal place and do not change any interpretations; we have revised and updated the supplemental appendix found at our Github link.
Data Availability
All data in this analysis are openly available and provided via citations.