Assessing the impact of Enhanced-Case-Finding on tuberculosis incidence and transmission in The Gambia using epidemiological and phylodynamic approaches ======================================================================================================================================================== * Florian Gehre * Francis Oko * Boatema Ofori-Anyinam * Conor J. Meehan * Etthel M. Windels * Ken Joof * Tutty Faal * Francis Mendy * Tijan Jobarteh * Ensa Gitteh * Abi Janet Riley * Binta Sarr-Kuyateh * Catherine Okoi * Edward Demba * William Del-Alorse * Kodjovi Mlaga * Basil Sambou * Fatoumatta Kanuteh * Koduh Lette * Abdul Khalie Muhammad * Shadrac Agbla * Wim Mulders * Simon Donkor * David Jeffries * Anna Roca * Umberto D’Alessandro * Martin Antonio * Ifedayo Adetifa * Bouke C. de Jong ## Abstract **Background** Most tuberculosis (TB) cases in The Gambia are notified in the Greater Banjul Area (GBA). We conducted an Enhanced-Case-Finding (ECF) intervention in the GBA and determined its effect on TB incidence and ongoing TB transmission. **Methods** This was a cluster randomized trial in which randomly assigned intervention areas of grouped settlements received three rounds of an ECF strategy consisting of sensitization followed by auramine microscopy, whereas TB patients in control areas continued to be identified through passive case finding. The primary outcome was TB incidence rate. To exclude that an increase in notified cases, followed by a decrease in notified cases, would hide the future impact of the intervention, we tested for changes in transmission dynamics using both genetic clustering and phylodynamic methods. **Results** No significant difference in TB incidence rates, transmission clustering or effective reproductive number was detected between intervention and control areas. **Conclusion** Although we did not find evidence for decreased TB incidence nor TB transmission through the ECF strategy used, this approach is an examplar of how both classical epidemiology and genomic phylodynamics approaches can be integrated to better assess public health intervention outcomes. Key words * tuberculosis * phylodynamics * TB-ECF trial * InterrupTB study ## Introduction Tuberculosis (TB) remains endemic in most countries worldwide. Despite ongoing efforts to meet Sustainable Development Goals, there has been an extremely slow decline of TB incidence (1) sustained by ongoing transmission from the ‘missing millions’ i.e., the estimated three of the ten million incident TB patients worldwide that are not diagnosed nor treated. To improve TB detection, case finding efforts have been scaled up through several initiatives, such as ‘TB reach’. Nevertheless, the evidence-base for the various approaches to detect additional TB patients is scarce. Landmark studies in Zimbabwe (2) and Vietnam (3) showed an impact of case finding strategies on TB prevalence. Although TB patients become rapidly non-infectious with effective treatment (4), there is no evidence that increased case detection interrupts TB transmission. Measuring the impact of case finding interventions on TB transmission and incidence is complicated by the dynamics of case finding, which is expected to initially *increase* TB case notifications (TB case detection rate) by decreasing the diagnostic gap. If sustained, such efforts are expected to eventually *decrease* TB incidence and thus TB case notifications. It is expected that the effect of case finding would be more reliably measured as a decline in transmission, inferred from the phylogenetic relatedness of the patients’ mycobacterial isolates. In The Gambia, previous spatial analyses identified the Greater Banjul Area, a suburban region bordering the capital city of Banjul, as the area with most TB notifications (5). Therefore, this area was chosen to carry out a cluster randomized trial to determine the effect of enhanced case finding (ECF) through community sensitization. The impact of the intervention on TB transmission was modelled through the analysis of whole genome sequence data from bacterial isolates collected during the trial. ## Methods ### Study population and overall study design The cluster randomized trial on Enhanced Case Finding (ECF study, ClinicalTrials.gov Identifier: [NCT01660646](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT01660646&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom)) was conducted from June 2012-October 2015 in the Greater Banjul Area (GBA) with an estimated population size of 680,000 people (based on the available 2004 census at the time of the study design). The GBA was geographically divided into 20 clusters which included 121 settlements with population ≥ 100, which were randomized (10 each) to either the intervention (Enhanced Case Finding) or the control arm. The primary endpoint was TB incidence rate. To understand the transmission-blocking effect of the intervention on TB transmission, all mycobacterial isolates collected between 2013 and 2017 underwent whole genome sequencing to estimate changes in TB transmission between intervention and control clusters through genetic clustering and phylodynamics. ### Enhanced Case Finding study #### Design and cluster assignment An unrestricted randomization of the study settlements would have resulted in a higher risk of ‘contamination’ from intervention to control clusters, without controlling for spatial proximity or population size. Therefore, a stratified randomization was used to minimize the chance of contamination between settlements, to balance population size between the study arms and balance the number of settlements and settlement sizes in each arm and produce a design that allowed efficient implementation of ECF. Randomization was performed by many runs of an algorithm that used a clustering algorithm (k-means) to spatially cluster settlements with a population of at least 100. For each run the following operations were performed: 1) The clusters of settlements were randomly assigned to two arms (which would later be randomised to the ECF or the control arm). 2) The distance between the two closest villages in different clusters was estimated. 3) The balance of large, medium and small settlements in each arm was quantified. 4) The balance of the total numbers of settlements in each arm was quantified, including the overall population in each arm. Out of 100,000 runs, the assignment maximizing the closest distance (1.1km) between settlements from different clusters, with the best settlement number (60 vs. 61) and overall population balance (imbalance of 708 out of 660,000) was chosen. The two groups of 10 clusters (note that there was an intervention cluster with only one settlement as it was remote from the other settlements) were then randomly allocated to either ECF or control. #### Community Intervention ECF was implemented over several days. On day 1, trained field workers informed the village elder (alkalo) and community leaders of the upcoming intervention and sought their agreement. This was followed by a community meeting to inform and sensitize about TB symptoms (persistent cough, night sweats, weight loss) through a public speech and by showing an educational movie on a portable TV. The participants also had the opportunity to ask questions about the disease during a Q&A session. Before their departure, the research team left sputum collection cups for the inhabitants. These were collected on day 3, provided the participants had signed a written informed consent. Epidemiological data (ethnicity, reason for presenting at Health Centre, symptoms, risk factors) were collected at the same time as collection. The intervention was repeated 3 times (every 6 months) over a period of 18 months. Acid Fast Bacilli (AFB) microscopy positive TB patients identified in this manner were attributed to the direct effect of the ECF. The difference in TB patients presenting in any of the seven chest clinics in the GBA from intervention clusters versus from non-intervention clusters was considered as the indirect effect of the ECF, as they represented sensitized, symptomatic individuals or their indirectly sensitized contacts unable or unwilling to directly return their sputum sample just after the sensitization. All health staff in the seven chest clinics were trained in the use of the case report form to (i) confirm whether a TB patient had heard of or participated in the ECF strategy, (ii) assess risk factors, and (iii) document the patient’s village of residence. To confirm such information, the research team visited TB patients presenting during the study period at home and collected the geographical coordinates through GPS, allowing the correct allocation of each patient to either the intervention or the control arm. #### Primary TB Diagnostics All sputum cups collected in the field during ECF campaigns were brought to the MRC TB laboratory and immediately analyzed using direct fluorescent microscopy prior to decontamination. Sputum samples from TB patients recruited at the Chest Clinics were processed by Fluorescent Microscopy and MGIT culture (BACTEC MGIT 960; Becton Dickinson Microbiology Systems, Sparks, Maryland, USA). Positive MGIT cultures were confirmed as *Mycobacterium tuberculosis* (*Mtb*) by MPT64 antigen test and subcultured on Löwenstein Jensen (LJ) medium. All isolates were stored at -80°C for whole genome sequencing. Patients with AFB positive sputa were informed and referred for TB treatment at the nearest facility. ### TB incidence rate difference (IRD) One of the major read-outs of the ECF study was to understand whether an ECF strategy would reduce incidence in intervention clusters when compared to control areas. Therefore, we calculated the 60-day moving average TB incidence rate difference (IRD) between intervention and control clusters (for all clusters with more than 30 screened people). ### Genome sequencing, Bioinformatics and estimation of TB transmission Cultured TB isolates were retrieved from -80°C storage and cultured in the BACTEC MGIT 960 System (MGIT 960; Becton Dickinson Microbiology Systems, Sparks, Maryland, USA) at 37°C following manufactures’ instructions. Positive cultures were plated on blood agar to check for purity. Pure cultures were subjected to ZN microscopy to confirm the presence of AFB; AFB positive cultures were sub-cultured on LJ slants, one with pyruvate and another with glycerol. If the stored isolate did not regrow in the MGIT, leftovers of processed sputa of AFB-positive and MGIT culture negative patient samples detected within the study period were retrospectively retrieved from freezer storage at -20°C and cultured on LJ slants with pyruvate, to lessen culture bias against *Mtb* L5 and L6 (*M. africanum*) (6). Cultures were incubated at 37°C for up to 8 weeks. From loopfuls of pure *Mtb* colonies grown on LJ, samples for whole genome analysis had genomic DNA extracted using the Maxwell 16 DNA Purification Kit (Promega) (7). Sequencing was undertaken on Illumina HiSeq and Miseq platforms. TBprofiler was used (Pipeline version: 3.0.3, access date: December 2020)(8) on the resulting raw reads to assign lineages and drug resistance profiles. Mixed infections were excluded based on whether the TBprofiler strain calling algorithm detected 2 or more (sub)lineages. Data filtering, quality control and SNP calling was performed as previously reported (9). Briefly, non-*Mtb* reads were filtered out using Centrifuge v1.0.3 (10) and then SNPs were called using MTBseq (11) with default settings and the inferred ancestor genome as the reference (12). Samples with less than 95% of the reference genome covered or with a mean coverage <30x were then excluded. SNP distance matrices were created using the MTBseq join and amend functions using default settings (only sites with >95% of samples with an unambiguous nucleotide call and SNPs within 12bp of each other filtered out). Custom python scripts were then used to create loose transmission clusters (as previously defined (13)) at 5 and 1 SNP cut-offs. These two cut-offs represent best the likely divergence during the 3-year sampling period (13). Transmission clusters were assigned to either the intervention or control arm using two methods: majority counts and earliest sampling (Supplementary figure 1). For majority counts, the arm (intervention or control area) of each sample in a transmission cluster was extracted from the database and the total of each arm summed up. Transmission clusters where >50% of samples were from an intervention area were labelled as intervention transmission clusters. The same approach was applied to the control clusters. Clusters where the number of intervention and control samples were equal (i.e., 50% each) were excluded. For earliest sampling, the date of sampling of each isolate in the transmission cluster was extracted from the database and a cluster was assigned to either intervention or control arm according to the origin of the earliest sample. Clusters where two samples were taken on the same day but from different trial arms were excluded. To look for significant differences between intensity of transmission in intervention and control arms based on the majority clustered and earliest sample methods, a chi-squared test was used as implemented in R v3.6.3 (14) using a p-value cut-off of 0.001 with trial arm vs clustered/unclustered as the variables. To create the graph of clusters, the Wilson confidence interval for each method was calculated using the binom (15), plyr (16) and ggplot2 (17) packages in R. ### Phylodynamic analyses To further investigate the effect of the intervention on *Mtb* transmission, we employed a phylodynamic approach. For computational feasibility, we randomly subsampled 400 sequences belonging to *Mtb* lineage 4 (as it was the most prominent lineage in the dataset) and either the intervention or control arm. On this subset, we fit a structured birth-death model (18) with two compartments: one containing TB patients in the control arm of the study and the other containing TB patients in the intervention arm. In this birth-death model, the birth rate corresponds to the *Mtb* transmission rate, i.e., the number *Mtb* transmission events occurring from one patient per unit of time. The death rate corresponds to the rate of becoming uninfectious for *Mtb* due to recovery or death. Patients can infect people in the same trial arm as well as in the other arm, with probabilities determined by the probabilities of contact with people from the same versus the other trial arm. We assumed no further structuring of the population by village. The effective reproductive number (*Re*), corresponding to the ratio of the birth and death rate, was assumed equal for both trial arms before the start of the study (*Re,*1). After that time, a different *Re* was inferred, which was also allowed to be different for the control and intervention group: ![Formula][1] with *Re,cc*, *Re,ii*, *Re,ci*, and *Re,ic* the effective reproductive numbers within the control group, within the intervention group, from the control group to the intervention group, and from the intervention group to the control group, respectively. *Re,2* represents the base *Re* after the start of the study, *p* is the probability that a contact belongs to a different trial arm, and *f* represents the multiplicative effect of the intervention on the reproductive number. We assumed that the transmission rate was equal for both trial arms due to the random selection and allocation of clusters to study arms. We also assumed that any intervention effect on *Re* was the result of an effect on the infectious period. The reproductive numbers for patients in the control and intervention arm (*Re,c* and *Re,i*, respectively) are then as follows: ![Formula][2] TB patients are sampled with sampling proportion *s*, which was set to zero before the onset of sampling and assumed equal for both intervention arms afterwards. Upon sampling an infected patient, the patient was assumed to become uninfectious with probability *r* (19). A strict molecular clock was assumed, and a general time-reversible nucleotide substitution model with four gamma rate categories to account for site-to-site rate heterogeneity (GTR+Γ4). Phylodynamic inference was performed using the bdmm package (18) in BEAST2 (20,21). A variable SNP alignment was generated and augmented with a count of invariant A, C, G, and T nucleotides to avoid ascertainment bias (22). Three independent Markov Chain Monte Carlo chains were run for each analysis, with states sampled every 1,000 steps. Tracer (23) was used to assess convergence and confirm that the effective sample size was at least 200 for the parameters of interest. 10% of each chain was discarded as burn-in, and the remaining samples across the three chains were pooled using LogCombiner (20), resulting in at least 200,000,000 iterations in combined chains. ### Prior distributions All parameters and their corresponding prior distributions are listed in supplemental table 1. For the sampling proportion, a uniform prior was chosen, with lower bound set to zero and upper bound set equal to the ratio of the number of sequences, corrected for down sampling, and the total number of reported cases during the sampling period. ### Sensitivity analyses The robustness of the phylodynamic inference to potential sampling biases was assessed by estimating a different sampling proportion for the control and intervention group. Furthermore, we assessed the sensitivity of the results to prior assumptions by setting the clock rate prior to a Lognormal(-17,1) distribution and the prior on *f* to a Lognormal(0,2) distribution. ### Data accessibility Raw reads are stored on the ENA under accession number PRJEB53138. Custom python scripts used for SNP clustering can be found at [https://github.com/conmeehan/pathophy](https://github.com/conmeehan/pathophy). XML files for BEAST analyses can be found at [https://github.com/EtthelWindels/tb-ecf-gambia](https://github.com/EtthelWindels/tb-ecf-gambia). ## Results and Discussion ### Design set up, cluster assignment and recruitment We chose a clustered randomized trial design over a before-and-after design as this most stringent study setup allows randomization and adjustment for longitudinal external effects that influence the whole study population. We positioned our study in the GBA, with a population of ∼700.000 inhabitants where ∼77% of all TB cases reported in the Gambia occur (personal communication, Gambian National Tuberculosis and Leprosy Control Program, NTLP). A total of 121 villages in the Greater Banjul Area, with a total population of 648,835, were assigned to either 10 intervention (331,800 people) or 10 control clusters (317,035 people) (Fig. 1). During three ECF rounds, 10,219 individuals were screened for TB and provided a sputum sample. Thirty-five individuals (0.34%) were AFB+ and therefore confirmed TB patients and attributable to the direct effect of the ECF strategy (Fig. 2). A total of 3,047 TB patients were recruited through the seven Chest Clinics in the GBA, 1,314 from intervention, 1,375 from control clusters (Fig. 2), and 358 not residing in a cluster. For an epidemiological overview of the ECF study population, see supplemental table 2. ![Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/05/17/2024.05.17.24307536/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/F1) Figure 1: An overview of locations of ECF intervention and control clusters in the Greater Banjul Area ![Figure 2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/05/17/2024.05.17.24307536/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/F2) Figure 2: Flowchart of study population recruitment into intervention and control areas within the TB-ECF trial as well as in INTERRUPTB. About 43% of isolates from intervention (563/1,314) and from control (588/1,375) clusters were retrieved for sub-culture and extraction of genomic DNA. The patients whose isolate was retrieved did not differ significantly from those who were not included in this analysis, except for higher rates of positive cultures (supplemental table 2). ### Dynamics of TB incidence Although the IRD tended to decrease around the interventions, the upper limit of the 95%CI never crossed the zero-difference line. This suggest that, based on case reporting, we did not observe a significant reduction of incidence in the intervention clusters at any time of the study period (Fig. 3). ![Figure 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/05/17/2024.05.17.24307536/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/F3) Figure 3: Longitudinal development of the study incidence rate difference in relation to three rounds of ECF intervention. Displayed is the 60-day moving average TB incidence rate difference (IRD, red line) and 95%CI (light red lines) between intervention and control clusters (for all clusters with more than 30 screened people). As the health facilities where TB cases were recruited served both the intervention and control arm, selection bias due to different access was probably limited. Therefore, assuming the study design and study location were appropriate, it is unclear why the intervention did not have any effect on the study outcomes. The direct effect of the intervention was unexpectedly small, with auramine microscopy on sputum collected the next morning only detecting 35 patients (0.34% of submitted sputum samples). Auramine microscopy has a reported sensitivity around 60% for the detection of pulmonary tuberculosis in symptomatic persons, although recent studies suggest that the majority of culture positive TB patients do not report a cough (24). Another possible explanation is the lower-than-expected TB prevalence. Indeed, based on WHO prevalence estimates at the time of study design, the TB prevalence in The Gambia was assumed to be 327/100,000. However, a nationwide TB prevalence survey (GAMSTEP) reported a much lower prevalence, i.e., 128/100,000. Indeed, in the latest 2021 WHO guidelines systematic screening for TB disease among the general population is recommended in areas with an estimated TB prevalence of 500/100,000 or higher, albeit with low certainty of evidence (25). Based on these prevalence estimates, the intervention retrieved above expected numbers in the prevalence survey (32/10,219 vs the expected ∼13/∼10,000 samples), yet below the expected yield after sensitizing on TB symptoms. ### Estimating TB transmission using genomic clustering proportion and changes in Re We sequenced the whole genome of 536 and 555 isolates (post QC and removal of mixed infections) from intervention and control areas, respectively. Most sequenced strains came from L4 (67.39%) or L6 (26.43%). Only 10 MDR-TB samples were predicted in this dataset based on genotyping, suggesting a low prevalence of drug resistance in this setting. Using a 5-SNP cut-off, 150 transmission clusters were detected in the data containing a total of 568 (52%) of patients. A total of 119 clusters were detected using the more stringent 1-SNP cut-off containing 349 (32%) of patients. We determined the clustering proportion of isolates belonging to intervention in comparison to the control group (Fig 4) but did not find significant differences between the two groups, even after considering various SNP thresholds as cut-offs for clustering (Chi-Squared test p-value 0.82). ![Figure 4:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/05/17/2024.05.17.24307536/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/F4) Figure 4: Estimating TB transmission in intervention and control clusters based on whole genome sequencing of bacterial populations and genetic clustering based on varying SNP thresholds. Clusters are assigned to an arm (0intervention and control) using two methods: >50%: if over 50% of the samples in cluster come from a specific arm, the clustered is designated as being from that arm (and clusters that are 50:50 to each arm are discounted); time: a cluster is assigned to an arm based on the assignment of the sample with the earliest sampling date. ![Figure 5:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/05/17/2024.05.17.24307536/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/F5) Figure 5: Estimating the impact of the intervention on the effective reproductive number (Re) of *Mycobacterium tuberculosis*. A) Prior and posterior distribution for the multiplicative effect of enhanced case finding on the effective reproductive number (*f* parameter in the phylodynamic model). The genomic data was found to have enough signal for analysis, as shown by the posterior distribution (prior + data) being different from the prior distribution. The intervention did not result in a significant decrease in the Re compared to the control arm, as shown by the posterior distribution overlapping with 1 (= no effect). B) Posterior distributions for the Re of patients in the two study arms, again showing no significant difference. Further transmission dynamics analyses were performed using BEAST to estimate the effective reproductive number (Re) in the two arms. If the intervention worked, the Re in the intervention group should be lower than the Re in the control group, indicating a reduction in transmission. However, all of our phylodynamic models showed no significant effect of the intervention on Re (posterior mean of the relative Re = 0.58-0.72), even with different prior assumptions on sampling evenness and molecular clock rates (Table 2). This correlates with what is observed in the incidence data from the entire ECF sample, indicating that the intervention did not significantly reduce transmission nor incidence in The Gambia. View this table: [Table 1.](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/T1) Table 1. Overview of WGS data. A total of 1294 samples were sequenced with 84% coming from intervention or control villages. The vast majority of samples were from L4 or L6. View this table: [Table 2.](http://medrxiv.org/content/early/2024/05/17/2024.05.17.24307536/T2) Table 2. Effect of intervention on effective reproduction number (Re) of Mtb. Phylodynamic analyses were used to examine if there was a reduction in the Re in the intervention arm of the trial vs the control. Although the effect mean is below 1, the 95% HPD interval includes 1, indicating no significant effect. *see Supplementary table 1 and Methods for default settings and priors This phylodynamic approach serves as an example of how molecular data, especially whole genome sequencing, can be incorporated into such interventions to better evaluate reductions in incidence during the study period and predict if future transmission and incidence will also be reduced (based on projected parameters). Such genomic epidemiology approaches have been used previously to indicate when small outbreaks in low incidence settings are over (26). However, here we show the additional power of phylodynamics to enhance outcome measures in higher incidence settings as well. ### Lessons learnt and limitations As the overall transmission-blocking effect is expected to be lower in this setting than in medium prevalence settings, a longer follow-up period after the last intervention round may have shown a lower incidence and transmission in the intervention arm. In addition, the intervention may not have had the expected reach, although the research team tried to be as inclusive as possible. During the community interventions, attendees were educated on various aspects of TB (using a film and presentation), with a special focus on three symptoms (weight loss, night sweats, persistent cough), which were in line with WHO recommendations (25), even if our current understanding suggests poor sensitivity of such symptom screen (24). All community meetings were interactive, with the research team encouraging the audience to ask any question they may have. Moreover, depending on the location of the village/cluster, the campaign was held in at least one of the four main languages, namely Wolof, Mandinka, Jola and Fula, but also any other local language spoken by the field team. However, we cannot fully dismiss the possibility that the delivered messages were not disseminated wide enough within the recipient population, thus not reaching the incident TB cases we aimed to invite for testing. It is also possible that the overall awareness of TB in the Gambian population was already high, and that TB patients in the control area would have attended the health facilities for TB diagnosis even without the sensitization campaigns, with minimal incremental knowledge gained with ECF. This seems to be the case as a study carried out before and after the intervention confirmed that background knowledge on TB was already high (27). After all, the Gambia has a long-lasting, ongoing TB case-contact platform (TBCC) in which thousands of TB patients and their household contacts had been enrolled and sensitized over the years (28). Such baseline knowledge studies should be done in areas where sensitization interventions are planned to be carried out, to ensure that they will have an impact, or are indeed necessary in the first place. Another consideration is the structuring of populations which may also affect transmission dynamics beyond what can be modelled as done here. We assumed no further structuring at the village level beyond assignment to trial arm. This may not be the case as interactions may be subdivided based on habits within villages. A better understanding of the anthropological aspects of *Mtb* transmission would help improve such phylodynamic models going forward. There are some diagnostic limitations which can lead to issues with further studies in such settings. For instance, TB patients may have been missed due to the limited sensitivity of the diagnostic tests used, explaining why the research team identified only 35 TB patients among 10,291 individuals who provided a sputum sample. For instance, a molecular test such as GenXpert, which became available in The Gambia during the trial implementation, would have probably (more than) doubled the sensitivity of fluorescent microscopy, especially in (pre) symptomatic patients with incipient TB. However, it is unlikely GenXpert would have led to a significant increase in case notifications from intervention clusters. Moreover, applying GenXpert rather than Fluorescent Microscopy would have required a high throughput set-up, with equipment and cartridges at a much higher cost (3). Another diagnostic shortcoming was the use of the MTP64-based rapid test for confirmation of positive MGIT cultures. After completing the trial, it was found that the globally used MPT64-based rapid tests do not have a sufficiently high sensitivity to confirm cultures of *Mtb* L5 and L6, (previously termed *M. africanum*), two lineage endemic to West Africa and The Gambia. Therefore, it is possible patients were missed during primary diagnosis when a positive MGIT culture with (false) negative MPT64-based rapid test would have been interpreted as a non-tuberculous mycobacterium. This would not have affected the primary outcome of the ECF trial which was based on TB diagnosed by the TB diagnostic and treatment centers rather than on the outcome of primary isolation. In the future, whole genome sequencing directly conducted on primary sputum could be a solution to avoid any culture bias towards certain mycobacterial lineages. Moreover, the high workload for field teams and laboratory staff warrants evaluation of the cost-effectiveness of such intervention in conditions as seen in The Gambia. A strength of our study is that we demonstrate for the first time that a combination of whole genome sequencing of bacterial populations followed by Bayesian phylodynamic analysis can be successfully used to estimate key transmission parameters in TB ECF studies. We suggest that such phylodynamic approaches be used in conjunction with classic epidemiology approaches to assess the impact of interventions on transmission dynamics. Such molecular epidemiological approaches can add an additional outcome measure for clinical trials and studies such as this in other settings. ### Conclusions As ECF in a Gambian low TB prevalence setting did not reduce incidence nor block transmission, we recommend for any future study to carefully consider the underlying TB prevalence, and to use a more sensitive diagnostic algorithm. Molecular studies confirmed that the ECF strategy did not impact TB transmission, an effect that could have been masked when solely monitoring case notifications. ## Supporting information Supplementary Figure 1 [[supplements/307536_file02.pdf]](pending:yes) Supplementary Table 1 [[supplements/307536_file03.docx]](pending:yes) Supplementary Table 2 [[supplements/307536_file04.docx]](pending:yes) ## Data Availability Raw reads are stored on the ENA under accession number PRJEB53138. Custom python scripts used for SNP clustering can be found at https://github.com/conmeehan/pathophy. XML files for BEAST analyses can be found at https://github.com/EtthelWindels/tb-ecf-gambia. [https://github.com/conmeehan/pathophy](https://github.com/conmeehan/pathophy) [https://github.com/EtthelWindels/tb-ecf-gambia](https://github.com/EtthelWindels/tb-ecf-gambia) [https://www.ebi.ac.uk/ena/browser/view/PRJEB53138](https://www.ebi.ac.uk/ena/browser/view/PRJEB53138) ## Ethical Approval For the ECF study ethical approval was granted by the Gambia Government / MRC Joint Ethics Committee. In brief, any newly diagnosed patient initiating TB therapy at any of the Gambia Government TB diagnostic and treatment centres in the Greater Banjul Area, regardless of age, residency, HIV status, or type of TB, with permanent residence in the Greater Banjul Area was eligible for inclusion. Individuals unable to understand the implications of study participation, whether through cognitive impairment or insurmountable language barrier, were excluded from the study. For the transmission blocking study, ethical approval was provided by the Gambia Government/ MRC Joint Ethics Committee and the Institutional Review Board of the Institute of Tropical Medicine, Antwerp, Belgium. Isolates from all ECF participants (except minors, i.e., <18 years old) were included. ## Funding This work was supported by an ERC grant [INTERRUPTB; no. 311725] to BdJ and by the Global Fund for Tuberculosis, AIDS and Malaria. ## Authorship contributions FG, BCdJ, MA developed and conducted the InterrupTB study; IA, MA, AR, UDA designed and conducted the TB-ECF study; FO, KJ coordinated all field work and conducted the ECF intervention, CJM, WM, BOA, and EMW conducted the bioinformatics component of the InterrupTB; FG, BOA, TF, FM, TJ, EG, AJR, BSK, KO, ED, WDA, KM, BS, FK were responsible for all mycobacteriology activities in both TB-ECF and InterrupTB; KL, AKM, SA, SD, DJ were part of data management and statistics team in both studies. All authors critically reviewed the manuscript. ## Competing interests The authors declare no competing interests. ## Acknowledgements In memoriam, Jacob Kweku Otu was the Higher Scientific Officer, MRC TB laboratory and was very dedicated to providing high quality diagnostic services for tuberculosis patients and study participants. * Received May 17, 2024. * Revision received May 17, 2024. * Accepted May 17, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Silva S, Arinaminpathy N, Atun R, Goosby E, Reid M. Economic impact of tuberculosis mortality in 120 countries and the cost of not achieving the Sustainable Development Goals tuberculosis targets: a full-income analysis. Lancet Glob Health. 2021 Oct 1;9(10):e1372–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2214-109X(21)00299-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34487685&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 2. 2.Corbett EL, Bandason T, Duong T, Dauya E, Makamure B, Churchyard GJ, et al. Comparison of two active case-finding strategies for community-based diagnosis of symptomatic smear-positive tuberculosis and control of infectious tuberculosis in Harare, Zimbabwe (DETECTB): a cluster- randomised trial. The Lancet. 2010 Oct 9;376(9748):1244–53. 3. 3.Marks GB, Nguyen, Nhung V., Nguyen, Phuong T.B., Nguyen, Thu-Anh, Nguyen Hoa B., Tran Khoa H., et al. Community-wide Screening for Tuberculosis in a High-Prevalence Setting. N Engl J Med. 2019 Oct 3;381(14):1347–57. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa1902129&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 4. 4.Rouillon A, Perdrizet S, Parrot R. Transmission of tubercle bacilli: The effects of chemotherapy. Tubercle. 1976 Dec;57(4):275–99. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0041-3879(76)80006-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=827837&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1976CQ48800006&link_type=ISI) 5. 5.Touray K, Adetifa IM, Jallow A, Rigby J, Jeffries D, Cheung YB, et al. Spatial analysis of tuberculosis in an urban west African setting: is there evidence of clustering? Trop Med Int Health TM IH. 2010 Jun;15(6):664–72. 6. 6.Sanoussi CN, Affolabi D, Rigouts L, Anagonou S, de Jong B. Genotypic characterization directly applied to sputum improves the detection of Mycobacterium africanum West African 1, under-represented in positive cultures. Picardeau M, editor. PLoS Negl Trop Dis. 2017 Sep;11(9):e0005900–e0005900. 7. 7.Lempens P, Meehan CJ, Vandelannoote K, Fissette K, de Rijk P, Van Deun A, et al. Isoniazid resistance levels of Mycobacterium tuberculosis can largely be predicted by high-confidence resistance-conferring mutations. Sci Rep. 2018 Dec;8(1):3246–3246. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-018-21378-x&link_type=DOI) 8. 8.Phelan JE, O’Sullivan DM, Machado D, Ramos J, Oppong YEA, Campino S, et al. Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs. Genome Med. 2019 Jun 24;11(1):41. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13073-019-0650-x&link_type=DOI) 9. 9.Ngabonziza JCS, Rigouts L, Torrea G, Decroo T, Kamanzi E, Lempens P, et al. Multidrug-resistant tuberculosis control in Rwanda overcomes a successful clone that causes most disease over a quarter century. J Clin Tuberc Mycobact Dis. 2022 Jan 24;27:100299. 10. 10.Kim D, Song L, Breitwieser FP, Salzberg SL. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 2016 Dec;26(12):1721–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjEwOiIyNi8xMi8xNzIxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDUvMTcvMjAyNC4wNS4xNy4yNDMwNzUzNi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 11. 11.Kohl TA, Utpatel C, Schleusener V, De Filippo MR, Beckert P, Cirillo DM, et al. MTBseq: a comprehensive pipeline for whole genome sequence analysis of Mycobacterium tuberculosis complex isolates. PeerJ. 2018;6:e5895. 12. 12.Comas I. Genome of the inferred most recent common ancestor of the Mycobacterium tuberculosis complex [Internet]. Zenodo; 2019 [cited 2024 May 17]. Available from: [https://zenodo.org/records/3497110](https://zenodo.org/records/3497110) 13. 13.Meehan CJ, Moris P, Kohl TA, Pečerska J, Akter S, Merker M, et al. The relationship between transmission time and clustering methods in Mycobacterium tuberculosis epidemiology. EBioMedicine. 2018 Aug;37():410–6. 14. 14.R Core Team. R: A language and environment for statistical computing [Internet]. 2021. Available from: [https://www.R-project.org/](https://www.R-project.org/) 15. 15.Dorai-Raj, Sundar. binom: Binomial Confidence Intervals for Several Parameterizations [Internet]. 2022. Available from: [https://CRAN.R-project.org/package=binom](https://CRAN.R-project.org/package=binom) 16. 16.Wickham, Hadley. The Split-Apply-Combine Strategy for Data Analysis [Internet]. 2011. Available from: [https://www.jstatsoft.org/v40/i01/](https://www.jstatsoft.org/v40/i01/) 17. 17.Wickham, Hadley. ggplot2: Elegant Graphics for Data Analysis [Internet]. 2016. Available from: [https://ggplot2.tidyverse.org](https://ggplot2.tidyverse.org) 18. 18.Kühnert D, Stadler T, Vaughan TG, Drummond AJ. Phylodynamics with Migration: A Computational Framework to Quantify Population Structure from Genomic Data. Mol Biol Evol. 2016 Aug 1;33(8):2102–16. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msw064&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27189573&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 19. 19.Gavryushkina A, Welch D, Stadler T, Drummond AJ. Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration. PLOS Comput Biol. 2014 Dec 4;10(12):e1003919. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1003919&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25474353&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 20. 20.Bouckaert R, Vaughan TG, Barido-Sottani J, Duchêne S, Fourment M, Gavryushkina A, et al. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLOS Comput Biol. 2019 Apr 8;15(4):e1006650. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1006650&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30958812&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 21. 21.Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu CH, Xie D, et al. BEAST 2: A Software Platform for Bayesian Evolutionary Analysis. PLOS Comput Biol. 2014 Apr 10;10(4):e1003537. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1003537&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24722319&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 22. 22.Leaché AD, Banbury BL, Felsenstein J, de Oca A nieto-Montes, Stamatakis A. Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies. Syst Biol. 2015 Nov 1;64(6):1032– 47. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/sysbio/syv053&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26227865&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 23. 23.Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst Biol. 2018 Sep 1;67(5):901–4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/sysbio/syy032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29718447&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) 24. 24.Stuck L, Klinkenberg E, Abdelgadir Ali N, Basheir Abukaraig EA, Adusi-Poku Y, Alebachew Wagaw Z, et al. Prevalence of subclinical pulmonary tuberculosis in adults in community settings: an individual participant data meta-analysis. Lancet Infect Dis [Internet]. 2024 Mar 12 [cited 2024 May 17]; Available from: [https://www.sciencedirect.com/science/article/pii/S1473309924000112](https://www.sciencedirect.com/science/article/pii/S1473309924000112) 25. 25.World Health Organization. WHO operational handbook on tuberculosis: module 2: screening: systematic screening for tuberculosis disease [Internet]. World Health Organization; 2021 [cited 2024 May 17]. Available from: [https://www.who.int/publications-detail-redirect/9789240022614](https://www.who.int/publications-detail-redirect/9789240022614) 26. 26.Hatherell HA, Didelot X, Pollock SL, Tang P, Crisan A, Johnston JC, et al. Declaring a tuberculosis outbreak over with genomic epidemiology. Microb Genomics. 2016 May;2(5):e000060. 27. 27.Price, S. The Effect of a Community Sensitization Intervention on Knowlegde of TB in The Gambia. 2013. 28. 28.Hill PC, Ota MOC. Tuberculosis case-contact research in endemic tropical settings: design, conduct, and relevance to other infectious diseases. Lancet Infect Dis. 2010 Oct;10(10):723–32. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(10)70164-X&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20883968&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F05%2F17%2F2024.05.17.24307536.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282911200020&link_type=ISI) [1]: /embed/graphic-1.gif [2]: /embed/graphic-2.gif