Regional spread of blaNDM-1-containing Klebsiella pneumoniae Sequence Type 147 in post-acute care facilities ============================================================================================================ * Zena Lapp * Ryan Crawford * Arianna Miles-Jay * Ali Pirani * William E. Trick * Robert A. Weinstein * Mary K. Hayden * Evan S. Snitkin * Michael Y. Lin ## Abstract **Background** Carbapenem-resistant *Enterobacterales* (CRE) harboring blaKPC have been endemic in Chicago-area healthcare networks for more than a decade. During 2016-2019, a series of regional point prevalence surveys identified increasing prevalence of blaNDM-containing CRE in multiple long-term acute care hospitals (LTACHs) and ventilator-capable skilled nursing facilities (vSNFs). We performed a genomic epidemiology investigation of blaNDM-producing CRE to understand their regional emergence and spread. **Methods** We performed whole-genome sequencing on NDM+ CRE isolates from four point-prevalence surveys across 35 facilities (LTACHs, vSNFs, and acute care hospital medical intensive care units) in the Chicago area and investigated the genomic relatedness and transmission dynamics of these isolates over time. **Results** Genomic analyses revealed that the rise of NDM+ CRE was due to the clonal dissemination of an ST147 *Klebsiella pneumoniae* strain harboring blaNDM-1 on an IncF plasmid. Dated phylogenetic reconstructions indicated that ST147 was introduced into the region around 2013 and likely acquired NDM around 2015. Analyzing genomic data in the context of patient transfer networks supported initial increases in prevalence due to intra-facility transmission in certain vSNFs, with evidence of subsequent inter-facility spread to connected LTACHs and vSNFs via patient transfer. **Conclusions** We identified a regional outbreak of blaNDM-1 ST147 that began in and disseminated across Chicago area post-acute care facilities. Our findings highlight the importance of performing genomic surveillance at post-acute care facilities to identify emerging threats. Keywords * NDM * *Klebsiella pneumoniae* * ST147 * genomic epidemiology * carbapenem resistance ## Introduction Carbapenem-resistant *Enterobacterales* (CRE) represent an urgent antibiotic resistance threat due to their resistance to first-line antibiotics and transmissibility in healthcare settings [1,2]. The emergence of epidemic lineages of CRE that are resistant to nearly all antibiotics and that cause infections with high mortality rates, such as *Klebsiella pneumoniae* carbapenemase (KPC) containing *Klebsiella pneumoniae* (KPC-Kp) sequence type (ST) 258 [3], has further escalated the need for more effective strategies to interrupt CRE transmission. Most interventions to prevent the spread of CRE and other healthcare-associated antibiotic resistance threats have been implemented at the level of individual healthcare facilities. However, there is now a multitude of evidence that the frequent movement of colonized and infected patients among regional healthcare facilities necessitates regional surveillance and infection prevention strategies [4]. Long-term acute care hospitals (LTACHs) and ventilator-capable skilled nursing facilities (vSNFs) are potentially high-impact settings for implementation of regional CRE surveillance and infection prevention interventions [5,6]. Patients in these facilities have been shown to be colonized with CRE at high rates, likely due to a combination of their chronic severe illness, long lengths of stay, and high rates of prior or on-going antibiotic exposure. Modeling and epidemiologic studies have suggested that the high CRE prevalence in LTACHs in particular has a significant impact on connected healthcare facilities with which they share patients [7,8]. Currently, less is known about the regional influence of vSNFs, although the even longer lengths of patient stay and more limited resources for infection prevention indicate that they might also be important settings in regional amplification of antibiotic resistance. A bundled infection prevention intervention [9] (Chicago PROTECT [10]) was initiated in July 2017 to control CRE in Chicago-area healthcare facilities, including in vSNFs and LTACHs. Serial point prevalence surveys conducted to monitor the impact of the intervention demonstrated that KPC-Kp levels remained stable across regional facilities. However, during the intervention period, New Delhi metallo-beta-lactamase (NDM) containing CRE prevalence unexpectedly increased in a subset of surveyed vSNFs. Here, we applied whole-genome sequencing to understand the molecular and epidemiological basis for the increase in NDM prevalence in the region. ## Materials & Methods ### Study isolates and metadata Starting from October 2016 until July 2019, 24 medical ICUs in 24 short term acute care hospitals, 6 LTACHs, and 8 vSNFs in the Chicago region were invited to participate in serial one day point prevalence surveys of residents on their skilled nursing and ventilator wards. Medical ICUs were surveyed once in 2016-2017; vSNFs and LTACHs were surveyed every 6-12 months. Local staff obtained a rectal swab sample from each patient and collected de-identified patient information assessed at time of survey (age up to 90 years of age, sex, respiratory support status, length of stay, contact precautions status, facility awareness of resident CRE status). Swabs were processed at a central lab within 6 hours of collection. Overnight growth from MacConkey agar plates was screened for 5 carbapenemase gene families (KPC, NDM, VIM, IMP & OXA-48) using multiplex PCR assays (Acuitas® MDRO gene test and Acuitas® Resistome test, OpGen, Gaithersburg, MD during 2016-2017; Xpert® Carba-R, Cepheid, Sunnyvale, CA during 2018-2019). For all analyses, only the first isolate of a given ST (for *K. pneumoniae* isolates) or species (for all other species) was used for each patient. We used a Fisher’s exact test to test for the statistical significance of the difference in NDM or KPC prevalence between the first and last surveys. Fisher’s exact p-values were corrected using the Benjamini-Hochberg method. ### Whole-genome sequencing Genomic DNA was extracted from cultures derived from single sub-cultured colonies. Genomic libraries were prepared with NEBNext Ultra DNA library prep kit and sequenced at the University of Michigan Advanced Genomics Core on an Illumina NovaSeq 6000. All sequenced isolates have been deposited under BioProject PRJNA686897. ### Genomic analysis We processed whole-genome sequences [11,12] and identified in-silico multi-locus sequence types [13,14], generated and annotated assemblies [12,15–20], called single-nucleotide variants (SNVs) [21–28], identified phylogenetic clustering of facilities [8], and calculated pairwise SNV distances between isolates [29]. Reference-based whole-genome alignments of study and public ST147 isolates [30–40] were used to generate a phylogenetic tree using IQ-TREE v1.6.12 [41,42]. We inferred ancestral dates of the phylogeny with the R package BactDating v1.0.12 [43–45]. NDM-containing plasmids were identified from publicly available complete plasmids [29,29,46–48,21,49,50]. See supplemental methods for details of the genomic analysis. ### Determining patient flow between facilities We constructed a patient transfer matrix of the Chicago metropolitan region using the Centers for Medicare and Medicaid Services’ minimum data set, Medicare Provider Analysis and Review [MEDPAR] limited data set, Medicaid Analytic eXtract Data from 2010-2012. Using the patient transfer matrix, we constructed a directed weighted patient transfer network of healthcare facilities in the Chicago area using R igraph v1.2.6 [51], including all healthcare facilities in the study, and patient flow was determined as in [8]. See supplemental methods for details about calculating patient flow. ### Data analysis and visualization Data analysis and visualization was performed in R v4.0.2 [52]. Data visualization used the following packages: tidyverse v1.3.0 [53], pheatmap v1.0.12 [54], lubridate v1.7.9.2 [55], tidytree v0.3.3 [56], treeio v1.12.0 [57], ggtree v2.2.4 [58,59], ggplotify v0.0.5 [60], ggnewscale v0.4.4 [61], and cowplot v1.1.0 [62]. Code for analysis and visualization can be found here: [https://github.com/Snitkin-Lab-Umich/ndm-st147-chicago-ms](https://github.com/Snitkin-Lab-Umich/ndm-st147-chicago-ms). ### Ethical Review Bacterial isolates and de-identified clinical metadata were collected under a prior surveillance project that underwent ethical review at the CDC and was determined to be a nonresearch activity (public health surveillance). The project was also evaluated independently at each participating healthcare facility and either deemed a public health assessment or human subjects research and approved by local review boards where applicable. ## Results ### Prevalence of NDM, but not KPC, increased over time in certain vSNFs that are not closely connected by patient transfer We first detected the presence of NDM+ CRE isolates in vSNFs and LTACHs during a regional point prevalence survey conducted in 2017 (survey 13; Fig 1A). To more closely monitor the potential emergence and spread of NDM in the region we performed three additional surveys for CRE colonization in regional vSNFs and LTACHs, each 6 months apart (surveys 13-1, 13-2, and 13-3; **Table 1**). We found that, while the prevalence of KPC+ CRE remained relatively stable over time in all facilities, the prevalence of NDM+ CRE increased in three vSNFs not closely connected by patient transfer (Fisher’s exact p < 0.05 for vSNFs J, K, and L; Fig 1B**, S1, S2; Table S1**). ![Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F1) Figure 1: Prevalence of NDM is increasing over time in certain vSNFs. (A) Time window of when facilities were tested for each survey. (B) Proportion of NDM and KPC positive samples across surveys and facilities. vSNF O was not included as there was uneven sampling across surveys. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ICUs are not shown in panel B because of very low prevalence (see Table 1). View this table: [Table 1:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/T1) Table 1: Point prevalence survey results from ICUs, LTACHs, and vSNFs ### The majority of NDM+ isolates are *K. pneumoniae* ST147 and carry blaNDM-1 on an IncF plasmid To understand the molecular basis for the increase in NDM+ CRE we performed whole-genome sequencing on all CRE isolates from survey 13 and NDM+ isolates from the subsequent three follow-up surveys. We found that the presence of NDM was highly correlated with the presence of a suite of genes present on an NDM+ IncF plasmid isolated from *K. pneumoniae* [63] (**Fig S3**). Most isolates containing the IncF plasmid were blaNDM-1 *K. pneumoniae* ST147; however, one blaNDM-1 *Escherichia coli* ST354 isolate also contained the plasmid (Fig 2**, S4**). While short- read sequencing data alone is insufficient to provide structural data associating NDM with the plasmid backbone, the high degree of correlation between NDM and the IncF-associated plasmid genes in concert with the phylogenetic relationships between isolates strongly suggests that these genes are co-inherited (**Fig S3**). Interestingly, 9 of 17 (53%) isolates with an IncF plasmid in survey 13 were KPC+/NDM- (ST14, 1 ST258, and 1 novel), but this plasmid then became present in predominantly blaNDM-1 isolates, with only 0-3 isolates with the IncF plasmid being KPC+/NDM- in the subsequent surveys (0-7%) (**Fig S4**). In addition to NDM, the IncF plasmid harbored a number of antibiotic resistance genes from several different resistance classes and the qacE gene, which has been shown to provide resistance to common biocides such as chlorhexidine gluconate in *K. pneumoniae* [64] (**Fig S3**). ![Figure 2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F2) Figure 2: The majority of NDM+ isolates are Kp ST147 and carry NDM on an IncF plasmid. Number of sequenced isolates of various species and sequence types, what carbapenemase(s) they contain, and whether they have the IncF plasmid. ### Regional ST147 isolates are phylogenetically distinct from all public isolates We investigated the phylogeography of ST147 in the Chicago area to determine whether circulating ST147 could be attributed to one or multiple importation events into the region. To this end we constructed a whole-genome phylogeny that included publicly available ST147 genomes from across the globe. Examination of the phylogenetic reconstruction revealed that all the ST147 isolates from the study form a monophyletic cluster, consistent with a single regional introduction (Fig 3). We also noted that while none of the NDM+ ST147 public isolates outside of the Chicago region harbor the IncF plasmid, the majority of ST147s from Chicago contain the plasmid. Moreover, one of the ST147 isolates in the KPC+/NDM- outgroup (from survey 13-2) contains the IncF plasmid but lacks NDM (**Fig S5**), suggesting that the plasmid may have been acquired in a locally circulating ST147 strain, followed by integration of a mobile element harboring NDM. A dated phylogenetic analysis of circulating ST147 yielded an estimate of August 2015 for when the NDM+ clade of ST147 first arose in the region (95% credible interval [CI]: February 2015 - March 2016; Fig 4**, S6; Table S2**), compared to an estimate of July 2013 for when ST147 first entered the region (CI: October 2011 – October 2014; **Table S2**). ![Figure 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F3) Figure 3: Study isolates are clonally separated from all publicly available isolates outside the Chicago region. Maximum likelihood phylogeny of study and public isolates annotated by geographic region and genomic element. ![Figure 4:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F4) Figure 4: NDM+ ST147 *Klebsiella pneumoniae* was introduced into the region around 2015. Dated phylogeny generated by bactdating. Grey bar on the root is the lower and upper bounds of the confidence interval. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ### Genomic evidence indicates that intra-facility transmission is driving prevalence at high- prevalence vSNFs After determining that the increase in NDM prevalence corresponds to a clonal outbreak of blaNDM-1 ST147, we investigated the potential transmission dynamics of this clone within and between healthcare facilities. We observed a substantial clustering of isolates from certain vSNFs on the phylogeny (Fig 5A**, S7**), which suggests potential intra-facility transmission. To further investigate whether these clusters may represent within-facility transmission, we calculated pairwise SNV distances among all pairs of isolates and compared these distances for isolates from the same facility (intra-facility pairs) to isolates from different facilities (inter-facility pairs) across surveys (Fig 5B). Indeed, starting in survey 13-2 we observed a disproportionate representation of small SNV distances (≤12 SNVs; see methods for threshold selection) consistent with intra-facility transmission in vSNFs. Of note, in survey 13-3 we observed spikes in small SNV distances for both intra- and inter-facility pairs, with closely related intra-facility pairs being primarily from vSNFs and closely related inter-facility pairs being from both vSNF- LTACH and vSNF-vSNF pairs (**Fig S8**). Putting these closely related inter-facility pairs in the context of the regional patient transfer network supports a potential role of patient transfer in regional blaNDM-1 ST147 spread in survey 13-3, with vSNF-LTACH and vSNF-vSNF isolate pairs less than 12 SNVs apart being from facilities with higher patient flow than isolate pairs with 12 or more SNVs (Wilcox p = 7.7e-6 and 1.4e-9, respectively; **Fig S9**). ![Figure 5:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F5) Figure 5: Intra-facility transmission is driving prevalence at high-prevalence vSNFs. (A) Number of isolates in the largest subclade of the maximum likelihood phylogeny containing ≥90% of isolates from the given facility (see methods for more details). (B) Pairwise SNV distances of isolates from the same and different facilities across surveys. The grey diamond at a pairwise SNV distance of 12 indicates the threshold for closely related isolates (see methods for details). vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ## Discussion We performed genomic analyses of CRE isolates collected through serial point-prevalence surveys in the Chicago area to understand the molecular and epidemiologic basis for an increase in NDM+ CRE prevalence across a regional healthcare network. Our analysis supports the increase in NDM+ CRE being due to the clonal dissemination of a single blaNDM-1 ST147 strain of *K. pneumoniae* that emerged in 2015. Putting genomic analysis in the context of the regional healthcare network supports this strain first reaching high prevalence in a small number of vSNFs due to intra-facility transmission, with potential subsequent spread to other connected healthcare facilities via patient transfer. Whole-genome sequencing showed that the majority of blaNDM-1 ST147 harbored an IncF multidrug resistance plasmid. Incorporating public data into the analysis revealed that these isolates formed a monophyletic clade, suggesting a single introduction into the region, either through importation of a pre-existing NDM+ ST147 strain or acquisition of blaNDM by a locally circulating ST147 strain. Furthermore, examination of the global phylogeny indicates that, while NDM+ ST147 has evolved multiple times in different locations and sometimes resulted in clonal outbreaks, none of the global NDM+ ST147 isolates we included in our analysis harbor the IncF plasmid found in our study isolates. The rapid and widespread dissemination of this strain in the region indicates that the NDM-carrying IncF plasmid we identified here can stably associate with an ST147 strain with epidemic potential. Given the potential negative impact of epidemic NDM- carrying *K. pneumoniae*, this possibility warrants close monitoring. By combining regional surveillance with genomic analysis, we were able to discern that NDM initially spread in three vSNFs, likely via intra-facility transmission, with evidence of subsequent spread to connected healthcare facilities. There are several factors that likely contributed to the spread of this NDM+ ST147 clone. First, vSNF patients are a high-risk population for carriage of CRE as they are chronically ill, are usually admitted from ICUs or LTACHs, and are often exposed to antibiotics [65]. Second, patients in vSNFs generally have long lengths of stay — often much longer than patient stays at LTACHs [66] — meaning that they have a longer period of time to acquire a multi-drug resistant organism. Furthermore, multibed rooms are common and the facilities themselves are often under-resourced from a staffing and infection control perspective [6], both of which could facilitate intra-facility spread. Our findings paired with these observations indicate that vSNFs may be important healthcare facilities to detect emerging threats and potentially contain them before widespread dissemination. In the current study, we note that NDM+ isolates were uncommon in ICUs, and the outbreak of ST147 might not have been detectable until much later if sampling were restricted to ICUs. Of note, the Chicago PROTECT intervention, a bundled infection prevention intervention designed to reduce the prevalence of CRE, began at around the same time as NDM+ ST147 began amplifying in the region. There are several potential reasons for this association. The intervention could have created a new niche for NDM+ isolates to thrive in, either due to NDM itself, or other genes on the associated plasmid. One possibility is that the qacE gene harbored on the NDM IncF plasmid might have provided a selective advantage in the context of an intervention that included chlorhexidine gluconate bathing. It is also possible that the emergence of NDM during the intervention was coincidental, and that the intervention might have even prevented a larger NDM+ outbreak from occurring across more facilities. Regardless, this study exemplifies how genomic surveillance can be used to monitor emerging threats and understand how they develop and circulate through a region. Our study has several strengths. Active surveillance of diverse types of healthcare facilities in the region allowed us to identify and investigate a potential multi-drug resistant organism threat earlier than would have been possible if serial point-prevalence surveys across several facility types were not ongoing. Furthermore, cross-sectional patient sampling within each survey allowed us to obtain a complete snapshot of CRE prevalence at a given facility at a given point in time, and to detect the increase in NDM+ isolates over time. Finally, we were able to leverage information from whole-genome sequencing to investigate the relatedness of isolates, as well as the intra- and inter-facility transmission dynamics of NDM across the healthcare network. Our study also has several important limitations. First, we only have data from discrete points in time for each survey. This could have led to potential biases in the number of NDM+ isolates sequenced at facilities given that the patients at these facilities had different average lengths of stay, and it also precluded more nuanced examination of intra-facility transmission dynamics. Second, we lack data from short-term acute care or community settings, particularly in the last three surveys, which limited our ability to examine the relative importance of other regional reservoirs for NDM+ ST147. However, the short-term acute care data that is available did not support their role in blaNDM-1 ST147 expansion. Another limitation is that we only had access to short-read sequencing data, which limited our ability to investigate more complex plasmid dynamics. While we plan to perform long-read sequencing on a subset of these isolates in the future, we find it notable that we were able to leverage publicly available complete plasmid sequences to determine that NDM was carried on the same plasmid in the majority of isolates. In conclusion, we were able to combine whole-genome sequencing of NDM+ isolates from serial point-prevalence surveys at vSNFs, LTACHs, and ICUs to identify and investigate the dissemination of an NDM+ clone of *K. pneumoniae*. These findings highlight the importance of performing regional point-prevalence surveys in not only acute care hospital ICUs, but also in post-acute care LTACHs and vSNFs, to identify potential multi-drug resistant organism threats as early as possible. Furthermore, our study identified an emerging blaNDM-1 ST147 clone of *K. pneumoniae* that could pose a threat to healthcare facilities worldwide if this plasmid/strain combination continues to spread. ## Funding This work was supported by Centers for Disease Control and Prevention Cooperative Agreement [grant number U54 CK000481] and SHEPheRD [Task Order No. 200-2011-42037], the National Science Foundation Graduate Research Fellowship Program [grant number DGE 1256260 to Z.L.], the National Institutes of Health via the Molecular Mechanisms of Microbial Pathogenesis Training Grant [T32AI007528, A.M-J.] and the National Institutes of Health [1R01AI148259-01 to E.S.S.]. OpGen, Inc. (Gaithersburg, MD) provided Acuitas® multi-drug resistant organism gene test and Acuitas® Resistome test in kind, during 2016-2017. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the Centers for Disease Control and Prevention, National Science Foundation, or the National Institutes of Health. ## Conflicts of interest M.L. and M.H. have received research support in the form of contributed product from OpGen, LLC and from Sage Products (now part of Stryker Corporation). M.L. has also received an investigator-initiated grant from CareFusion Foundation (now part of BD). M.H. and R.W. have participated in clinical studies where participating healthcare facilities received contributed product from Sage Products Inc., Molnlycke, Clorox, or Medline. Neither M.H., R.W. nor their hospitals received product, funding, payments, or any other form of compensation. ## Data Availability All sequenced isolates will be deposited under BioProject PRJNA686897. Code for analysis and visualization, as well as supplementary data, can be found here: https://github.com/Snitkin-Lab-Umich/ndm-st147-chicago-ms. [https://www.ncbi.nlm.nih.gov/bioproject/?term=prjna686897](https://www.ncbi.nlm.nih.gov/bioproject/?term=prjna686897) [https://github.com/Snitkin-Lab-Umich/ndm-st147-chicago-ms](https://github.com/Snitkin-Lab-Umich/ndm-st147-chicago-ms) ## Authors’ contributions All authors developed methodology and reviewed and edited the manuscript. ML, ES, MH, AMJ, and ZL conceptualized the research goals and aims. ZL, RC, AP, and ES developed and implemented software and curated the data. ZL, RC, and AP performed formal analysis. ML, MH and ES provided resources. ZL and ES wrote the original draft. ZL and RC visualized the results. ML, MH, AMJ, and ES provided supervision. ML, MH, and ES managed the project. ML, MH, and ES acquired funding. ## Supplementary material ### Supplementary methods #### Public isolates All Klebsiella genomes available as of October 15, 2020 in the PATRIC database [30] were downloaded. Furthermore, we included additional whole-genome sequences of NDM+ ST147 genomes identified through a literature search [31–39]. Information about the public genomes included can be found in supplementary data file 2. Complete genomes and any ST147 genome were used for downstream analysis. For isolates where only genome assemblies were available, we used wgsim v1.9 [40] to simulate reads of length 250 from the assemblies with no base error rate, rate of mutation and fraction of indels. These simulated reads were used to perform variant calling and generate a phylogenetic tree (see below). #### Multi-locus sequence typing Multi-locus sequence types (MLSTs) were determined with MLSTyper [13] for public isolates and with ARIBA v2.14.5 for study isolates [14]. #### Assembly generation and annotation For all study isolates, the reads were assembled using assemblage v1.2 [15], which runs the following steps sequentially: down-sample reads to 150X coverage using Seqtk v1.3 [16] and Mash v2.2.2 [17], clean down-sampled reads with Trimmomatic v0.39 [12], assemble clean reads with SPAdes v3.14.1 [18], perform post-assembly correction with PILON v1.23 [19], and remove contigs smaller than 500bp. Genomes were annotated with RAST v1.3.0 [20]. The quality of genomes was then assessed by the number of coding sequences and any genomes containing fewer than 4000 or greater than 8000 were excluded from the analysis. #### Sequencing quality control and ST147 single nucleotide variant identification The quality of sequencing reads was assessed with FastQC v0.11.9 [11], and Trimmomatic v0.39 [12] was used for trimming adapter sequences and low-quality bases. Single nucleotide variants (SNVs) were identified by (i) mapping filtered reads to the ST147 Kp46596 reference genome (Genbank accession NZ_CP059312.1) using the Burrows- Wheeler short-read aligner (bwa v0.7.17 [21]), (ii) discarding polymerase chain reaction duplicates with Picard v2.24.1 [22], and (iii) calling variants with SAMtools and bcftools v1.9 [23]. Variants were filtered from raw results using VariantFiltration from GATK v4.1.9.0 [24] (QUAL >100; MQ >50; >=10 reads supporting variant; and FQ < 0.025). In addition, a custom python script was used to filter out (mask) single nucleotide variants in the whole-genome alignment that were: (i) <5 base pairs (bp) in proximity to indels that were identified by GATK HaplotypeCaller [25], (ii) in a recombinant region identified by Gubbins v2.3.4 [26], in a phage region identified by the Phaster web tool [27] or (iii) they resided in tandem repeats of length greater than 20bp as determined using the exact-tandem program in MUMmer v3.23 [28]. Genomes were removed that showed evidence of contamination as inferred by poor mapping to the reference genome or excessive numbers of uniquely filtered variants. This whole-genome masked variant alignment was used to reconstruct a maximum likelihood phylogeny with IQ- TREE v1.6.12 [41] using the general time reversible model GTR+G and ultrafast bootstrap with 1000 replicates (-bb 1000) [42]. #### Identifying NDM-containing plasmids To identify the presence of beta-lactamase containing plasmids, we searched for previously sequenced, closed plasmids which were known to harbor genes of interest. First, CD-HIT v4.7 was used to identify orthologous genes [47]. NDM or KPC containing contigs from complete genomes were identified by the presence of a gene annotated as either of these beta- lactamases in the gff file, and all genes contained on these contigs were identified. The plasmid sharing the maximum number of genes with the NDM+ isolates was identified. Presence of the plasmid was determined as sharing 80% of the shared plasmid genes. A core gene alignment was created with cognac [46]. The pairwise substitution distance matrix was calculated from the alignment, which was then used to create a neighbor joining tree using ape v5.3 [29] and midpoint rooted using phytools v0.6-99 [48]. The rows of the gene presence absence matrix were then sorted by the tip labels on the neighbor joining tree. To confirm the presence of NDM containing plasmid, which was prevalent in the ST147 isolates, we utilized the raw sequencing data to calculate coverage across the plasmid sequence. We selected the complete genome with the best match for the NDM+ plasmid by shared gene content across the plasmid sequence as the reference sequence. The sequencing reads were aligned to the plasmid sequence using bwa mem v0.7.15 [21]. The sam files were parsed to identify reads which aligned to the NDM plasmid, and coverage was then calculated for 500 bp bins across the plasmid sequence. Antibiotic resistance genes were identified by querying the CARD database [49] with BLAST [50]. #### Inferring ancestral dates We used the R package BactDating v1.0.12 [43] to infer ancestral dates on the NDM+ clade of the ST147 maximum likelihood phylogeny, as well as the phylogeny including all study ST147 isolates. A root-to-tip analysis was first performed and indicated that there is temporal signal in the data. As we do not have access to actual isolate dates, a date range for each tree tip was determined based on the year of isolation as well as the survey and facility type that the isolate came from. The maximum likelihood phylogeny and the date ranges were used to infer ancestral dates. All available molecular clock models were tested, each run for 1,000,000 MCMC iterations. The R package coda v0.19-4 [45] was used to evaluate MCMC convergence. All models had similar estimated root dates for the NDM+ clade (range: 2013.96 to 2016.24), and slightly more variation for the ST147 isolates (1996 to 2014), with varying levels of uncertainty. The best fit models were the additive relaxed clock (ARC) family of models [44], with the best fit being mixed continuous ARC (cARC) for the NDM+ clade and ARC for all ST147 study isolates. The cARC model was chosen for all subsequent analyses with the NDM+ clade, and the ARC model was chosen for all subsequent analyses with the ST147 study isolates. The effective sample size for all parameters in the best models were above 400. #### Identifying phylogenetic clustering We subset the NDM+ clade of the ST147 maximum likelihood phylogeny by survey. For each survey-specific tree, we used a method modified from [8] to identify the largest subclades that contain ≥90% of isolates from the same facility. #### Calculating pairwise SNV distances Pairwise SNV distances were calculated for the gubbins recombination-filtered ST147 variant alignment using the dist.dna function in the R package ape v5.4-1 [29], with the “N” model (count), and pairwise deletion set to true to include both core and non-core genetic variation. We chose a pairwise SNV distance threshold based on the BactDating estimated mutation rate of ∼12 SNVs per year (**Table S2**). As the surveys were conducted 6 months apart, we considered closely related isolate pairs to be those with a SNV distance of ≤12 (i.e. 6 mutations per isolate). Wilcox tests were used to calculate significance, and all p-values were Bonferroni- corrected. #### Determining paths of maximum patient flow The patient transfer network consists of healthcare facility nodes and patient transfer edges. A patient transfer includes direct transfers of no more than one day between discharge from source facility and admission to destination facility, and indirect transfers of patients with an intervening stay in the community or an intermediate facility between discharge and readmission. Normalized edge weights were calculated by dividing the number of patient transfers from source facility to destination facility in 2016 (i.e. within 365 days) by the total number of outgoing patient transfers at the source facility. We calculated patient flow as the product of the normalized edge weights for the path of maximum patient flow [8]. R igraph v1.2.6 was used for all patient flow analyses [51]. ## Supplementary Figures ![Figure S1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F6.medium.gif) [Figure S1:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F6) Figure S1: NDM counts increase over time in some vSNFs and LTACHs. vSNF = ventilator- capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ![Figure S2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F7.medium.gif) [Figure S2:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F7) Figure S2: High patient flow into and/or out of a facility does not correlate with high NDM prevalence. See methods for patient flow calculation. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ![Figure S3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F8.medium.gif) [Figure S3:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F8) Figure S3: The majority of *K. pneumoniae* ST147 isolates from this study contain the majority of genes from a multi-drug resistant IncF plasmid, including an NDM-/KPC+ outgroup. Heatmap of genes on plasmid rows are genome sequences and columns are genes. Rows are ordered by the phylogeny and columns are linear order of the genes on the complete plasmid sequence. Red indicates gene presence and blue indicates gene absence. Loci of interest are indicated above the heatmap. ![Figure S4:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F9.medium.gif) [Figure S4:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F9) Figure S4: The IncF plasmid is present in KPC+ isolates early on, but then becomes predominantly present in NDM+ isolates. Only isolates with the IncF plasmid are depicted here. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ![Figure S5:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F10.medium.gif) [Figure S5:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F10) Figure S5: the IncF plasmid is present in a KPC+ outgroup of the NDM+ ST147 clade. Maximum likelihood phylogeny of study ST147 isolates annotated by IncF plasmid presence, KPC, and NDM. ![Figure S6:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F11.medium.gif) [Figure S6:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F11) Figure S6: Root-to-tip distance of study isolates reveals strong temporal signal. ![Figure S7:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F12.medium.gif) [Figure S7:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F12) Figure S7: Isolates from a given vSNF often cluster on the phylogeny. Maximum likelihood phylogenies labeled by facility for each survey. These phylogenies were used to identify the phylogenetic clusters depicted in Figure 5A. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital, ICU = intensive care unit. ![Figure S8:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F13.medium.gif) [Figure S8:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F13) Figure S8: The majority of closely related intra-facility isolate pairs are from vSNFs, while there are closely related inter-facility vSNF-vSNF pairs and LTACH-vSNF pairs. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital. ![Figure S9:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2021/03/25/2021.03.16.21253722/F14.medium.gif) [Figure S9:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/F14) Figure S9: Increasing signal of transmission via the patient transfer network over time. Patient flow for inter-facility isolate pairs within 20 SNVs and not within 20 SNVs for different facility pairs over time. See methods for details on patient flow calculation. Bonferroni-adjusted p-values were calculated using the Wilcox test. vSNF = ventilator-capable skilled nursing facility. LTACH = long-term acute care hospital. ## Supplementary Tables View this table: [Table S1:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/T2) Table S1: Counts and prevalence of NDM and KPC in surveys 13 and 13-3. View this table: [Table S2:](http://medrxiv.org/content/early/2021/03/25/2021.03.16.21253722/T3) Table S2: Model convergence (effective sample sizes; ESSs), parameter estimates, and model fit (Deviance Information Criterion DIC) for dated trees. ## Supplementary data 1. Spreadsheet with isolate id, patient id, survey, survey year, facility, NDM/KPC, species, ST, plasmid. 2. Spreadsheet with metadata on public isolates. ## Acknowledgements We gratefully acknowledge the patients and staff of the facilities for their participation in this study. We thank officials from Chicago Department of Public Health, Cook County Department of Public Health, Illinois Department of Public Health, and Centers for Disease Control and Prevention for their direct involvement in and support of Chicago PROTECT. We also thank Louis Fogg and Vincent Young for fruitful discussion of the analyses presented here. We thank Ellen Benson, Mary Carl Froilan, Claire Heshmat, Jinal Makhija, and Mitali Shah for their role in specimen and data collection at participating healthcare facilities, and Pamela Bell and Karen Lolans for their role in laboratory analysis. ## Footnotes * For the CDC Prevention Epicenters Program * Summary: Whole-genome sequencing of carbapenem resistant *Enterobacterales* in Chicago area healthcare facilities revealed clonal dissemination of blaNDM-1 *Klebsiella pneumoniae* sequence type 147 within and between ventilator-capable skilled nursing facilities and long-term acute care hospitals. * Received March 16, 2021. * Revision received March 16, 2021. * Accepted March 25, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Tacconelli E, Carrara E, Savoldi A, et al. Discovery, research, and development of new antibiotics: the WHO priority list of antibiotic-resistant bacteria and tuberculosis. The Lancet Infectious Diseases 2018; 18:318–327. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(17)30753-191&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 2. 2.Ansari U, Lawsin A, Campbell D, et al. Molecular Characterization of Carbapenem- Resistant Enterobacteriaceae in the USA, 2011–2015. Open Forum Infectious Diseases 2017; 4:S179–S179. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ofid/ofx163.327&link_type=DOI) 3. 3.Logan LK, Weinstein RA. The Epidemiology of Carbapenem-Resistant Enterobacteriaceae: The Impact and Evolution of a Global Menace. The Journal of Infectious Diseases 2017; 215:S28–S36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/infdis/jiw282&link_type=DOI) 4. 4.Ciccolini M, Donker T, Köck R, et al. Infection prevention in a connected world: The case for a regional approach. International Journal of Medical Microbiology 2013; 303:380–387. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijmm.2013.02.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23499307&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 5. 5.Lin MY, Lyles-Banks RD, Lolans K, et al. The importance of long-term acute care hospitals in the regional epidemiology of Klebsiella pneumoniae carbapenemase-producing Enterobacteriaceae. Clin Infect Dis 2013; 57:1246–1252. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cid/cit500&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23946222&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 6. 6.Pacilli M, Kerins JL, Clegg WJ, et al. Regional Emergence of Candida auris in Chicago and Lessons Learned From Intensive Follow-up at 1 Ventilator-Capable Skilled Nursing Facility. Clinical Infectious Diseases 2020; 71:e718–e725. 7. 7.Lee BY, Bartsch SM, Lin MY, et al. How Long-Term Acute Care Hospitals Can Play an Important Role in Controlling Carbapenem-Resistant Enterobacteriaceae in a Region: A Simulation Modeling Study. American Journal of Epidemiology 2020; Available at: [https://doi.org/10.1093/aje/kwaa247](https://doi.org/10.1093/aje/kwaa247). Accessed 20 February 2021. 8. 8.Han JH, Lapp Z, Bushman F, et al. Whole-Genome Sequencing To Identify Drivers of Carbapenem-Resistant Klebsiella pneumoniae Transmission within and between Regional Long-Term Acute-Care Hospitals. Antimicrobial Agents and Chemotherapy 2019; 63. Available at: [http://aac.asm.org/content/63/11/e01622-19](http://aac.asm.org/content/63/11/e01622-19). Accessed 20 February 2021. 9. 9.Hayden MK, Lin MY, Lolans K, et al. Prevention of Colonization and Infection by Klebsiella pneumoniae Carbapenemase–Producing Enterobacteriaceae in Long-term Acute-Care Hospitals. Clinical Infectious Diseases 2015; 60:1153–1161. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cid/ciu1173&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25537877&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 10. 10.Lin M, Froilan MC, Makhija J, et al. Regional Impact of a CRE Intervention Targeting High Risk Postacute Care Facilities (Chicago PROTECT). Infection Control & Hospital Epidemiology 2020; 41:s48–s49. 11. 11.Andrews S. s-andrews/FastQC. 2021. Available at: [https://github.com/s-andrews/FastQC](https://github.com/s-andrews/FastQC). Accessed 20 February 2021. 12. 12.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 2014; 30:2114–2120. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btu170&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24695404&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000340049100004&link_type=ISI) 13. 13.Crawford R. rdcrawford/MLSTyper. 2020. Available at: [https://github.com/rdcrawford/MLSTyper](https://github.com/rdcrawford/MLSTyper). Accessed 20 February 2021. 14. 14.Hunt M, Mather AE, Sánchez-Busó L, et al. ARIBA: rapid antimicrobial resistance genotyping directly from sequencing reads. Microb Genom 2017; 3. Available at: [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5695208/](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5695208/). Accessed 20 February 2021. 15. 15.. Snitkin-Lab-Umich/assemblage. Snitkin-Lab-Umich, 2021. Available at: [https://github.com/Snitkin-Lab-Umich/assemblage](https://github.com/Snitkin-Lab-Umich/assemblage). Accessed 20 February 2021. 16. 16.. Li H. lh3/seqtk. 2021. Available at: [https://github.com/lh3/seqtk](https://github.com/lh3/seqtk). Accessed 20 February 2021. 17. 17.Ondov BD, Treangen TJ, Melsted P, et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biology 2016; 17:132. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-016-0997-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27323842&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 18. 18.Bankevich A, Nurk S, Antipov D, et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J Comput Biol 2012; 19:455–477. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/cmb.2012.0021&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22506599&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 19. 19.Walker BJ, Abeel T, Shea T, et al. Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLOS ONE 2014; 9:e112963. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0112963&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25409509&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 20. 20.Aziz RK, Bartels D, Best AA, et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Genomics 2008; 9:75. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2164-9-75&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18261238&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 21. 21.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009; 25:1754–1760. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btp324&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19451168&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000267665900006&link_type=ISI) 22. 22.broadinstitute/picard. Broad Institute, 2021. Available at: [https://github.com/broadinstitute/picard](https://github.com/broadinstitute/picard). Accessed 20 February 2021. 23. 23.Li H, Handsaker B, Wysoker A, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009; 25:2078–2079. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btp352&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19505943&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268808600014&link_type=ISI) 24. 24.McKenna A, Hanna M, Banks E, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010; 20:1297–1303. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjIwLzkvMTI5NyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzI1LzIwMjEuMDMuMTYuMjEyNTM3MjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 25. 25.Poplin R, Ruano-Rubio V, DePristo MA, et al. Scaling accurate genetic variant discovery to tens of thousands of samples. bioRxiv 2018; :201178. 26. 26.Croucher NJ, Page AJ, Connor TR, et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Research 2015; 43:e15–e15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gku1196&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25414349&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 27. 27.Arndt D, Grant JR, Marcu A, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res 2016; 44:W16–W21. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkw387&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27141966&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 28. 28.Kurtz S, Phillippy A, Delcher AL, et al. Versatile and open software for comparing large genomes. Genome Biology 2004; 5:R12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/gb-2004-5-2-r12&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14759262&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 29. 29.Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 2019; 35:526–528. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty633&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30016406&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 30. 30.Davis JJ, Wattam AR, Aziz RK, et al. The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities. Nucleic Acids Research 2020; 48:D606–D612. 31. 31.Greninger AL, Chorny I, Knowles S, Ng VL, Chaturvedi V. Draft Genome Sequences of Four NDM-1-Producing Klebsiella pneumoniae Strains from a Health Care Facility in Northern California. Genome Announc 2015; 3. 32. 32.Khong WX, Marimuthu K, Teo J, et al. Tracking inter-institutional spread of NDM and identification of a novel NDM-positive plasmid, pSg1-NDM, using next-generation sequencing approaches. J Antimicrob Chemother 2016; 71:3081–3089. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jac/dkw277&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27494913&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 33. 33.Wyres KL, Wick RR, Judd LM, et al. Distinct evolutionary dynamics of horizontal gene transfer in drug resistant and virulent clones of Klebsiella pneumoniae. PLoS Genet 2019; 15:e1008114. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1008114&link_type=DOI) 34. 34.Becker L, Kaase M, Pfeifer Y, et al. Genome-based analysis of Carbapenemase-producing Klebsiella pneumoniae isolates from German hospital patients, 2008-2014. Antimicrob Resist Infect Control 2018; 7. Available at: [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5930415/](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5930415/). Accessed 21 February 2021. 35. 35.Pérez-Vázquez M, Sola Campoy PJ, Ortega A, et al. Emergence of NDM-producing Klebsiella pneumoniae and Escherichia coli in Spain: phylogeny, resistome, virulence and plasmids encoding blaNDM-like genes as determined by WGS. Journal of Antimicrobial Chemotherapy 2019; 74:3489–3496. 36. 36.Mansour W, Grami R, Jaidane N, et al. Epidemiology and Whole-Genome Analysis of NDM-1-Producing Klebsiella pneumoniae KP3771 from Tunisia. Microbial Drug Resistance 2019; 25:644–651. 37. 37.Roach D, Waalkes A, Abanto J, et al. Whole Genome Sequencing of Peruvian Klebsiella pneumoniae Identifies Novel Plasmid Vectors Bearing Carbapenem Resistance Gene NDM-1. Open Forum Infectious Diseases 2020; 7. Available at: [https://doi.org/10.1093/ofid/ofaa266](https://doi.org/10.1093/ofid/ofaa266). Accessed 21 February 2021. 38. 38.Marí-Almirall M, Cosgaya C, Pitart C, et al. Dissemination of NDM-producing Klebsiella pneumoniae and Escherichia coli high-risk clones in Catalan healthcare institutions. Journal of Antimicrobial Chemotherapy 2021; 76:345–354. 39. 39.Falcone M, Giordano C, Barnini S, et al. Extremely drug-resistant NDM-9-producing ST147 Klebsiella pneumoniae causing infections in Italy, May 2020. Eurosurveillance 2020; 25:2001779. 40. 40.Li H. lh3/wgsim. 2021. Available at: [https://github.com/lh3/wgsim](https://github.com/lh3/wgsim). Accessed 20 February 2021. 41. 41.Minh BQ, Schmidt HA, Chernomor O, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Molecular Biology and Evolution 2020; 37:1530–1534. 42. 42.Minh BQ, Nguyen MAT, von Haeseler A. Ultrafast Approximation for Phylogenetic Bootstrap. Molecular Biology and Evolution 2013; 30:1188–1195. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/mst024&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23418397&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000318165700018&link_type=ISI) 43. 43.Didelot X, Croucher NJ, Bentley SD, Harris SR, Wilson DJ. Bayesian inference of ancestral dates on bacterial phylogenetic trees. Nucleic Acids Res 2018; 46:e134. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky783&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30184106&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 44. 44.Didelot X, Siveroni I, Volz EM. Additive Uncorrelated Relaxed Clock Models for the Dating of Genomic Epidemiology Phylogenies. Mol Biol Evol 2021; 38:307–317. 45. 45.. Plummer M, Best N, Cowles K, et al. coda: Output Analysis and Diagnostics for MCMC. 2020. Available at: [https://CRAN.R-project.org/package=coda](https://CRAN.R-project.org/package=coda). Accessed 20 February 2021. 46. 46.Crawford RD, Snitkin ES. cognac: rapid generation of concatenated gene alignments for phylogenetic inference from large, bacterial whole genome sequencing datasets. BMC Bioinformatics 2021; 22:70. 47. 47.Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 2012; 28:3150–3152. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bts565&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23060610&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000311902700023&link_type=ISI) 48. 48.Revell LJ. phytools: an R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution 2012; 3:217–223. 49. 49.Alcock BP, Raphenya AR, Lau TTY, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Research 2020; 48:D517–D525. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz935&link_type=DOI) 50. 50.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol 1990; 215:403–410. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/jmbi.1990.9999&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2231712&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1990ED16700008&link_type=ISI) 51. 51.Csardi G, Nepusz T. The Igraph Software Package for Complex Network Research. InterJournal 2005; Complex Systems:1695. 52. 52.R: The R Project for Statistical Computing. Available at: [https://www.r-project.org/](https://www.r-project.org/). Accessed 20 February 2021. 53. 53.Wickham H, Averick M, Bryan J, et al. Welcome to the Tidyverse. Journal of Open Source Software 2019; 4:1686. 54. 54.Kolde R. pheatmap: Pretty Heatmaps. 2019. Available at: [https://CRAN.R-project.org/package=pheatmap](https://CRAN.R-project.org/package=pheatmap). Accessed 20 February 2021. 55. 55.Grolemund G, Wickham H. Dates and Times Made Easy with lubridate. Journal of Statistical Software 2011; 040. Available at: [https://ideas.repec.org/a/jss/jstsof/v040i03.html](https://ideas.repec.org/a/jss/jstsof/v040i03.html). Accessed 20 February 2021. 56. 56.. Yu G, Jones B, Arendsee Z. tidytree: A Tidy Tool for Phylogenetic Tree Data Manipulation. 2020. Available at: [https://CRAN.R-project.org/package=tidytree](https://CRAN.R-project.org/package=tidytree). Accessed 20 February 2021. 57. 57.Wang L-G, Lam TT-Y, Xu S, et al. Treeio: An R Package for Phylogenetic Tree Input and Output with Richly Annotated and Associated Data. Molecular Biology and Evolution 2020; 37:599–603. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msz240&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31633786&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 58. 58.Yu G, Smith DK, Zhu H, Guan Y, Lam TT-Y. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution 2017; 8:28–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10/f9qv8x&link_type=DOI) 59. 59.Yu G, Lam TT-Y, Zhu H, Guan Y. Two Methods for Mapping and Visualizing Associated Data on Phylogeny Using Ggtree. Molecular Biology and Evolution 2018; 35:3041–3043. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msy194&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30351396&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 60. 60.Yu G. ggplotify: Convert Plot to ‘grob’ or ‘ggplot’ Object. 2020. Available at: [https://CRAN.R-project.org/package=ggplotify](https://CRAN.R-project.org/package=ggplotify). Accessed 20 February 2021. 61. 61.Campitelli E. ggnewscale: Multiple Fill and Colour Scales in ‘ggplot2’. 2021. Available at: [https://CRAN.R-project.org/package=ggnewscale](https://CRAN.R-project.org/package=ggnewscale). Accessed 20 February 2021. 62. 62.Wilke CO. cowplot: Streamlined Plot Theme and Plot Annotations for ‘ggplot2’. 2020. Available at: [https://CRAN.R-project.org/package=cowplot](https://CRAN.R-project.org/package=cowplot). Accessed 20 February 2021. 63. 63.Kwon T, Yang JW, Lee S, et al. Complete Genome Sequence of Klebsiella pneumoniae subsp. pneumoniae KP617, Coproducing OXA-232 and NDM-1 Carbapenemases, Isolated in South Korea. Genome Announc 2016; 4. Available at: [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4714118/](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4714118/). Accessed 20 February 2021. 64. 64.Abuzaid A, Hamouda A, Amyes SGB. Klebsiella pneumoniae susceptibility to biocides and its association with cepA, qacΔE and qacE efflux pump genes and antibiotic resistance. Journal of Hospital Infection 2012; 81:87–91. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jhin.2012.03.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22498639&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F25%2F2021.03.16.21253722.atom) 65. 65.Rossow J, Ostrowsky B, Adams E, et al. Factors Associated With Candida auris Colonization and Transmission in Skilled Nursing Facilities With Ventilator Units, New York, 2016–2018. Clinical Infectious Diseases 2020; Available at: [https://doi.org/10.1093/cid/ciaa1462](https://doi.org/10.1093/cid/ciaa1462). Accessed 4 March 2021. 66. 66.Fisch J, Lansing B, Wang L, et al. New Acquisition of Antibiotic-Resistant Organisms in Skilled Nursing Facilities. Journal of Clinical Microbiology 2012; 50:1698–1703. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamNtIjtzOjU6InJlc2lkIjtzOjk6IjUwLzUvMTY5OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzI1LzIwMjEuMDMuMTYuMjEyNTM3MjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9)