Immunosequencing of the T-Cell Receptor Repertoire Reveals Signatures Specific for Identification and Characterization of Early Lyme Disease ============================================================================================================================================ * Julia Greissl * Mitch Pesesky * Sudeb C. Dalai * Alison W. Rebman * Mark J. Soloski * Elizabeth J. Horn * Jennifer N. Dines * Darcy B. Gill * Rachel M. Gittelman * Thomas M. Snyder * Ryan O. Emerson * Edward Meeds * Thomas Manley * Ian M. Kaplan * Lance Baldo * Jonathan M. Carlson * Harlan S. Robins * John N. Aucott ## ABSTRACT Highly specific T-cell responses play key roles in pathogen clearance and maintaining immunologic memory. Next-generation sequencing of the T-cell receptor (TCR) repertoire is an emerging diagnostic technology that capitalizes on the specificity of T-cell responses to probe pathogen exposure. The spirochete Borrelia burgdorferi (Bb) wields an array of antigens with dynamic and complex immunogenic potential, and application of TCR immunosequencing to characterize Bb infection presents opportunities to improve detection of early Lyme disease (LD). By immunosequencing TCR repertoires in blood samples from 3 independent cohorts of patients with early LD and controls from Lyme-endemic/non-endemic regions, we identified 251 public, LD-associated TCRs. These TCRs were used to train a classifier for detection of early LD. The classifier identified LD with 99% specificity and showed 1.9-fold higher sensitivity (56% vs 30%) compared with standard two-tiered testing (STTT). TCR positivity predicted subsequent seroconversion in 37% of STTT-negative patients, suggesting that the T-cell response is detectable before the humoral response. Higher TCR scores were associated with clinical measures of disease severity, including abnormal liver function tests, disseminated rash, and number of symptoms. A subset of LD-associated TCRs mapped to Bb antigens, supporting specificity of this approach. These results suggest that TCR testing may be a highly specific and sensitive approach for identifying LD, particularly in the initial days of illness. Keywords * *Borrelia burgdorferi* * Lyme disease * T-cell receptor * immunosequencing * standard two-tiered testing (STTT) * diagnostic ## INTRODUCTION Lyme disease (LD) is the most common tick-borne illness in the United States, with more than 450,000 estimated new cases annually (1–3). In the United States, LD is initiated by infection with the spirochetal bacterium *Borrelia burgdorferi (Bb)* transmitted from infected *Ixodes* ticks (1). In the days to weeks after the initial tick bite, symptoms may include a characteristic erythema migrans (EM) rash and nonspecific flu-like symptoms while signs and symptoms of disseminated infection can affect the joints, nervous system, heart, or other areas of the skin (4). Presence of EM rash in Lyme-endemic areas is highly suspicious for LD and warrants immediate treatment without further testing. However, guidelines recommend serologic testing to support a diagnosis of LD for individuals with absent or atypical EM or suspected disseminated infection (5, 6). The most common laboratory test for LD is standard two-tiered testing (STTT), which combines an initial enzyme-linked immunosorbent assay (ELISA) with a more specific immunoblot for positive or equivocal samples to detect antibodies against *Bb* (7, 8). However, STTT has some notable limitations, including poor sensitivity (25-50%) in the very early stages of infection, when most serologic testing is conducted (9, 10). This is likely attributable to the kinetics of the humoral response, as sensitivity of STTT improves with time, potentially exceeding 99% in untreated individuals with later-stage disease (11). Up to 60% of individuals testing negative within the first days to weeks after onset of symptoms may test positive upon re- testing 30 days later (10, 12–17). Additionally, false positives and interlaboratory variability may occur with STTT due to poor specificity and weak immunoreactivity (18, 19). These limitations underscore the unmet clinical need for improved methods for diagnosing LD, especially in the early stages of infection. Infection with *Bb* elicits a T-cell response with kinetics that differ from the humoral response (20–22). Evaluation of cytokine/chemokine profiles suggests that an active T-cell response is induced during acute infection, even in the absence of seroconversion, and returns to normal levels after treatment and symptom resolution (23). Thus, interrogating the T-cell response may be a useful strategy for detecting LD during the earliest stages of illness and understanding the immune response throughout disease progression. T-cell responses rely on the capacity of unique T-cell receptors (TCRs) to recognize specific peptide antigens presented on the cell surface by human leukocyte antigen (HLA) proteins for antigen-induced clonal expansion and differentiation into effector cells. Some of these expanded T cells become part of the memory compartment, where they can reside for many years as clonal populations of cells with identical TCR rearrangements (24–26). While the diversity of TCR recombination means that most of these disease-specific TCRs are “private,” or highly unique to one individual, part of the T-cell response to a disease is “public,” with identical amino acid sequences observed across multiple individuals, particularly those with shared HLA backgrounds (27). The public, disease-associated TCRs can be identified using a case/control study design (28–31) and matched to specific antigens through multiplex identification of antigen-specific TCRs (MIRA) (28, 32). Because these public clones are antigen- and HLA-specific, enrichment of such clones serves as a signature of infection in a given HLA context (28, 29). This approach has been successful in identification of past infection by cytomegalovirus (CMV) (28) and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) (29, 30, 33). Given the well-characterized roles of CD8+ and CD4+ T cells in viral control and clearance (34), it is not surprising that strong T-cell responses are elicited by SARS-CoV-2 and mediated by similar antigens and TCRs across individuals (29–31). While CD4+ T cells are known to differentiate into effector cell types with roles in promoting cellular and humoral immunity following many bacterial infections (35), the publicity of T-cell responses to bacterial pathogens has yet to be characterized. To better understand the utility of TCR immunosequencing in the context of bacterial infection, we leveraged the previously described case-control study design approach (28–31) to characterize LD-specific TCRs and develop a classifier to aid in diagnosis of LD. We tested the performance of the classifier in case and control cohorts distinct from those used in classifier training and assessed the correlation between T-cell responses and clinical features of LD. We also mapped a subset of the identified LD-associated TCRs to specific *Bb* antigens, supporting the biologic specificity of the TCR immunosequencing approach. ## RESULTS ### Identification of LD-associated TCRs To characterize the T-cell response to LD, we immunosequenced TCRβ in samples from 3 LD cohorts and a database of controls from Lyme-endemic and non-endemic regions (Tables 1 and 2). Public TCRs associated with early LD were identified from a subset of these samples comprising a case/control training dataset of 72 patients identified from the LDB (n=54) and Boca (n=18) cohorts who presented with STTT-positive early LD prior to 2019 and control repertoires (n=2,981) from a database of healthy individuals from non-endemic regions recruited for other studies and presumed to be LD-negative (Figure 1A). A total of 251 public TCRs associated with early LD, referred to as “enhanced sequences” (ESs), were identified based on statistical enrichment in cases compared with controls. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F1) Figure 1. LD-associated TCRs distinguish cases from controls in training cohorts. (A) Development and training of the classifier used to assess *Bb* infection using a T-cell receptor assay. The classifier was trained using TCR repertoires sequenced from 72 STTT–confirmed cases (orange) and 2,981 controls (blue) (see Table 2). A total of 251 LD-associated TCR sequences (enhanced sequences) were identified. Distribution of the number of TCR rearrangements encoding enhanced sequences as a function of the (log-transformed) total number of unique TCR rearrangements identified in a repertoire. Distribution of enhanced sequences in control samples approximately follows a logistic-growth curve (solid black line; dashed red lines indicate +2, +3, and +4 standard deviations from fit), which was used to define a scoring function. (B) Distribution of the model scores is largely invariant to the number of unique rearrangements in an independent set of controls from Lyme-endemic areas (n=2,627). Red line indicates 99th percentile distribution in this cohort (score=4.2675), which was defined as the positive-call threshold. (C) Model score distribution in a holdout set of repertoires from the LDB cohort (n=15 cases; n=48 controls), collected in 2019 and immunosequenced after model training. View this table: [Table 1.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T1) Table 1. Description and criteria for cohorts used in study. View this table: [Table 2.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T2) Table 2. Cohorts used for training, setting the classification threshold, verification and determination of sensitivity and specificity of the classifier. ### Enhanced TCR sequences are highly specific for identifying early LD Clonal expansion in response to *Bb* infection should lead to enrichment of LD-associated ESs in patients with LD. A low-level presence of LD-associated ESs would also be expected among healthy individuals. The observed background rate of ESs is a function of the number of sequenced T cells and the number of unique TCRs in each sample. Empirically, the background number of ESs in our training population fit a logistic-growth curve. To leverage the number of ESs and total unique productive TCR rearrangements as a diagnostic classifier, we modeled the number of ESs as a logistic-growth function of the number of unique productive TCRs sampled from a repertoire and fit this model to the 2,981 control repertoires in the training data (Figure 1A, right graph; black line represents the model fit). The resulting model provides a normalized estimate of the degree to which the ES signature of a case sample deviates from what is typically seen in our control data, expressed in standard deviations from the mean (Figure 1A, right graph, red dashed lines). This approach carefully controls specificity by considering thousands of control repertoires. The final positive/negative call threshold was set to a specificity of 99% on an independent set of controls from Lyme-endemic areas (n=2,627, consisting of 2,507 presumed LD-negative samples from our database and 120 STTT-negative controls from Lyme-endemic areas from the LDB cohort [Figure 1B; Tables 1 and 2]). To further verify the classifier and call threshold among patients with demographics similar to the training dataset, we applied the resulting model to a holdout set of samples from the LDB cohort collected in 2019 that included 15 laboratory-confirmed–positive (by STTT and/or polymerase chain reaction [PCR]) cases of early LD and 48 laboratory-confirmed–negative (by STTT) controls from Lyme-endemic areas with no history of Lyme or tick-borne infection (Figure 1C; Tables 1 and 2). Overall, 8 of 15 (53%) early LD samples and 0 of 48 (0%) controls from Lyme-endemic samples were identified as TCR-positive by the classifier. ### Sensitivity of TCR repertoire analysis identifying early LD To further assess the generalizability of our approach in an independent cohort collected with different protocols, we evaluated performance of the TCR classifier using samples from STTT- positive and STTT-negative patients who had clinically diagnosed early LD and documented EM at the time of their enrollment in the JHU cohort (Tables 1 and 2). Consistent with results from the LDB and Boca cohorts, 118 of 211 (56%) patients diagnosed with early LD were classified as TCR-positive (Table 3), while only 32 of 2,631 (1.2%) control samples from Lyme-endemic areas were TCR-positive (0/115 from LDB, 1/45 from JHU, and 31/2,471 from a database of individuals with unknown LD status living in Lyme-endemic regions in the United States and Europe) (Figure 2A). In contrast, only 64 of 211 (30%) LD-positive JHU samples were STTT- positive, representing a 1.9-fold reduction in sensitivity relative to TCR immunosequencing. To further confirm the specificity of the TCR signature, we evaluated 21 samples from individuals with PCR-confirmed anaplasmosis (n=4) or babesiosis (n=17) from the LDB and DLS cohorts. All 21 samples were negative according to the classifier (Figure 2A). ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F2) Figure 2. Validation of the TCR classifier in the JHU cohort and other holdout endemic controls. (**A**) Model score distribution in early LD samples from JHU (blue, n=211), in addition to holdout controls from Lyme-endemic areas. JHU (orange, n=45) and LDB (green, n=115) controls from Lyme-endemic areas were LD-negative based on clinical assessment and negative STTT testing. Other controls from Lyme-endemic areas (red, n=2,471) were drawn from our database of repertoires sampled from individuals in endemic regions in the United States and Europe who were presumed negative for LD. The *Anaplasma* samples (purple, n=4) were from the LDB cohort, and the *Babesia* samples were from the LDB and DLS cohorts (brown, n=5 and n=12, respectively). (**B**) Sensitivity of STTT and the TCR classifier for individuals in the JHU cohort, stratified by symptom duration (days) at time of enrollment, along with bootstrapped 95% CI. Participants were stratified based on self-reported symptom duration into bins of ≤4 days (n=73), 5-8 days (n=75) or >8 days (n=63). The sensitivities of both TCR testing and STTT increased with longer duration of symptoms reported at the time of testing. (**C**) Model score distribution for JHU early disease samples stratified by STTT serostatus at enrollment and posttreatment follow-up. Positive (blue, n=64): STTT-positive at enrollment; converter STTT (orange, n=38): STTT-negative at enrollment and STTT-positive at posttreatment follow-up; negative (green, n=109): STTT-negative at both visits. (**D**) Longitudinal dynamics of TCR scoring by serostatus for the JHU cohort. Positive (blue, n=53): STTT-positive at enrollment; converter (orange, n=32): STTT-negative at enrollment and STTT positive at posttreatment follow-up; negative (green, n=76): STTT-negative at both visits. **(E)** For the 163 patients who had STTT performed pretreatment and at 6-month follow-up, difference in sensitivities was calculated for recent Lyme infection between STTT and TCR test. In box-and-whisker plots, boxes indicate median ± IQR, and whiskers denote 1.5 times the IQR above the high quartile and below the low quartile. Significant differences in sensitivity were evaluated by T test. View this table: [Table 3.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T3) Table 3. TCR classifier sensitivity stratified by serostatus in the JHU cohort. When examined as a function of self-reported duration of symptoms, sensitivity of T-cell and antibody tests increased with time, yet T-cell testing showed greater sensitivity at early time points (Figure 2B). If T-cell responses typically precede and facilitate B-cell responses in response to *Bb* infection, consistent with immune response to viral and bacterial pathogens (36, 37), then TCR positivity among a seronegative cohort should predict subsequent seroconversion, and a majority of STTT-positive samples should also be TCR-positive. Indeed, among the JHU case cohort, TCR testing was positive in 59 of 64 (92%) STTT-positive cases (58 of 61 [95%] STTT-positive by IgM), compared with 59 of 147 (40%) STTT-negative cases (Table 3), indicating that a detectible antibody response is highly predictive of a detectible T-cell response. Importantly, of the 59 TCR-positive/STTT-negative individuals, 22 (37%) subsequently seroconverted from STTT-negative at study enrollment to STTT-positive at the first post- treatment follow-up visit (∼3 weeks after enrollment), while only 16 of 88 (18%) individuals who were TCR-negative at baseline seroconverted over the same time period (*P*=0.01, Fisher’s exact test). Stratification of the JHU cohort demonstrated that median TCR model scores (Figure 2C) and classifier sensitivity (Figure S3 Table 3) were highest among individuals who were STTT-positive at enrollment, intermediate among those who seroconverted post treatment, and lowest among individuals who remained persistently STTT-negative, highlighting differences in the extent of LD-associated T-cell expansion in these serologically defined subpopulations. Taken together, these data indicate that detectable LD-associated T cells typically expand prior to detectible antibodies, suggesting that identification of LD may be aided by TCR immunosequencing during early phases of *Bb* infection. ### Disease-associated TCRs and seropositivity wane after treatment Previous data suggest that T-cell and humoral immune responses exhibit differing dynamics over the course of *Bb* infection (21). To probe these dynamics, we evaluated TCR repertoires in longitudinal samples from individuals enrolled in the JHU cohort. Patients were either antibiotic- naïve or had initiated 3 weeks of oral doxycycline treatment within the 72 hours prior to study enrollment, with samples collected at enrollment, immediately after treatment, and at 6 months post treatment. Immunosequencing of samples from 161 patients with samples available at all time points revealed that TCR responses waned significantly in the 6 months following treatment (Figure 2D) (30). Median model scores decreased from 6.1 to 2.5, and model sensitivity decreased from 56% (91/161) at enrollment to 32% (51/161) 6 months post treatment. Notably, the sensitivity of STTT also declined over the same time period, from 33% at enrollment to 12% at 6 months post treatment (14 of 115 patients with available STTT results) (Figure 2E). Similar to the results shown in Figure 2C, TCR model scores were higher across all time points among individuals who were STTT-positive at baseline compared with those who were STTT-negative (Figure 2D). T-cell testing was more sensitive than STTT for identification of LD post treatment (32% vs 12% at 6 months post treatment (Figure 2E) including in patients who did not undergo IgG seroconversion (not shown). ### T-cell responses correlate with clinical measures of LD severity The strong correlation observed between antibody and T-cell responses highlights the interconnectedness of the immune response in early LD and may also reflect underlying pathogen burden, disease severity, or other clinical measures that drive the immune response. We therefore explored potential associations between clinical parameters previously reported in the JHU study (38) and the strength of the T-cell response as measured by the TCR model score at diagnosis. In both univariate analyses (Figure 3A-D) and a multiple regression model (Table S1) that adjusted for sex, age, and serostatus, higher TCR scores were associated with markers of disease severity, including ≥1 elevated liver function test (aspartate aminotransferase, alanine aminotransferase, and alkaline phosphatase), disseminated rash, and the number of pretreatment LD-associated symptoms. Notably, the highest model scores were observed among STTT- positive individuals with disseminated rash and elevated liver function tests (Figure 3A, 3C, S2). ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F3) Figure 3. Clinical correlates of TCR scoring. TCR scores were stratified by (**A**) liver function test results (elevated [n=72] vs normal [n=139]), (**B**) lymphocyte counts (normal [n=150] vs low [n=61]), or (**C**) presentation of rash (multiple/disseminated [n=68] vs single [n=143]) and (**D**) plotted as a function of the number of Lyme-related symptoms (Spearman R2=0.17. *P* values, Mann-Whitney *U* test). In box-and-whisker plots, boxes indicate median ± IQR, and whiskers denote 1.5 times the IQR above the high quartile and below the low quartile. Sex, age, size of rash, and an abnormally low lymphocyte count were not associated with a difference in TCR model scores in this cohort (Table S1). ### Antigen specificity of Lyme-associated TCRs Analysis of TCR sequence similarity clustered 105 of the 251 ESs (42%) to one of 6 clusters, 5 of which were statistically associated with an HLA class-II heterodimer (Table 4 and Supplementary Materials). In a MIRA experiment that queried T cells derived from 395 healthy donors against 777 peptides from 26 *Bb* proteins, 6 ESs from *HLA-DRB3*02:02*–associated cluster 6 exactly matched TCRs that were mapped to the flagellin B (FlaB)-derived peptide MIINHNTSAINASRNNG, providing direct evidence of *Bb* specificity for these ESs. One of these TCR sequences was found in 26 individuals, all of whom expressed *HLA-DRB3*02:02* (among the 25 with available typing). A Basic Local Alignment Search Tool (BLAST) analysis of this peptide showed limited sequence similarity with non-*Borrelia* pathogens, and the associated TCRs were highly enriched in JHU cases compared to endemic controls (Fig. 4). ![Figure 4.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F4) Figure 4. Percentage of early LD samples with ESs assigned by MIRA to the indicated *Bb* antigens. FlaB (A/B), flagellin protein B antigen A/B; DbpA, decorin-binding protein A. View this table: [Table 4.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T4) Table 4. TCR clusters as defined by connected components within a 1–amino-acid change in CDR3 and the same V-gene family. Three additional non-clustered TCRs were mapped to 3 separate antigens, 2 in FlaB and 1 in Dbpa, and each of these were similarly enriched in cases compared to controls (Fig. 4). As a negative statistical control, none of the LD ESs matched any TCRs derived from 507 individuals that were previously queried against 325 SARS-CoV-2–derived peptides. ## DISCUSSION Results from this study provide proof of principle that the high-throughput TCR sequencing and machine learning approach we previously applied for identification of viral infections (eg, CMV and SARS-CoV-2 [28, 29, 30]) can be adapted for identification of *Bb* infection. We demonstrate that TCR repertoire characterization for sequence-based identification of public, disease-specific TCRs in the setting of acute bacterial infection is a powerful and generalizable approach to aid in diagnosing disease. Identification of 251 LD-associated enhanced TCR sequences served as the basis for training a classifier capable of sensitive and specific detection of LD across 3 independent cohorts of patients with laboratory-confirmed and/or clinically diagnosed early LD. The TCR classifier identified patients with early LD with 1.9-fold greater sensitivity than STTT (56% vs 30%), while maintaining a specificity of 99%. Enhanced sensitivity was most apparent in early illness (44% vs 14%, or 3.1-fold increase in sensitivity ≤4 days since symptom onset), and TCR positivity was predictive of subsequent STTT seroconversion in 37% of STTT-negative individuals, consistent with expansion of LD-specific T cells preceding detectable antibody responses. TCR positivity was associated with STTT positivity at enrollment (92%) and did not decline as rapidly as serologic responses following treatment. Longitudinal analyses showed that TCR scores decreased with time post treatment, consistent with diminishment of the T-cell response with resolution of disease, yet remained more sensitive than STTT for identification of LD post treatment, including in patients who did not undergo IgG seroconversion. This observation suggests that TCR testing may be able to identify LD even in the absence of seroconversion at convalescence, which has been observed among individuals treated early in the course of disease (39). Furthermore, higher TCR scores correlated with clinical measures of disease, including elevated liver function tests, disseminated rash, and number of disease-associated symptoms. This suggests that the magnitude of the T-cell response is associated with the degree of disease severity. Finally, we mapped a subset of Lyme-associated ESs to known *Bb* antigens, supporting the high biologic specificity of a TCR immunosequencing approach (40). Our data imply that T-cell activation precedes the humoral response, although both T-cell‒ dependent and –independent responses have been implicated in clearance of *Borrelia* infection (37, 41). Currently, the primary CDC-recommended testing strategy for LD is STTT, which probes the humoral response to *Bb* (7, 8). Given the high prevalence of testing performed in patients during the early stages of LD infection, when sensitivity of STTT is poor (9), alternative approaches to detecting *Bb* are needed. Recently, modified two-tiered testing (MTTT) algorithms have been adopted that utilize 2 sequential ELISAs, eliminate immunoblotting, and demonstrate improved sensitivity over STTT in early LD (42). Even so, almost half of individuals with PCR- confirmed *Bb* infection do not produce a detectable serologic response (43). Our results confirm the ability of the TCR assay to identify LD in a large proportion of STTT-negative individuals prior to seroconversion. These data support further studies directly comparing TCR testing to MTTT to better understand the potential of TCR testing as both an alternative and complementary diagnostic approach to any serologic testing modality. The present analysis is limited to retrospective evaluation of samples previously collected from well-defined, prospective cohorts of clinically confirmed and/or laboratory-confirmed early LD. Additional prospective clinical validation studies are needed to further characterize the advantages of TCR testing relative to serology in scenarios where the spectrum of presenting illness, symptomology, and duration may vary. For example, further studies are needed to understand the utility of TCR testing in patients with persistent symptoms in the setting of LD. Approximately 10% to 20% of patients treated for LD experience long-term symptoms lasting 6 months or more after treatment, known as post-treatment Lyme disease (PTLD) (6, 44). TCR testing may provide a means for diagnosing these patients and minimize the potential for misleading interpretation of positive results (23, 45). In addition, we found that 11 of 12 individuals who presented with EM rash within the JHU cohort, but were both STTT-negative and *Bb* PCR-negative, were also TCR-negative. These data may reflect sensitivity limitations of both immune assays but could also be attributable to difficulties in clinically discriminating between LD and other similar tick-borne illnesses, such as STARI (43), suggesting exploration of TCR testing for differential diagnosis of LD and other similar tick-borne illnesses as another important area of future investigation. Although we examined potential assay cross-reactivity against *Anaplasma* and *Babesia* (Figure 2A), evaluation of cross-reactivity with other pathogens should be performed. Nevertheless, it is noteworthy that antigen mapping indicates that a subset of the identified TCRs are highly specific for known *Bb* proteins, including FlaB, a protein known to bind cross-reactive antibodies in the immunoblotting component of STTT (46). Results of this study suggest that TCR testing can have high clinical utility as a sensitive and specific diagnostic for LD that may facilitate earlier diagnosis of LD and initiation of antibiotic treatment to prevent development of severe illness in patients lacking definitive clinical signs/symptoms. Application of the present TCR classifier as a diagnostic assay will be prospectively evaluated relative to 2-tiered testing in an ongoing clinical validation study in patients presenting with suspected LD ([NCT04422314](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04422314&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom)). ## MATERIALS AND METHODS ### Study design This study employed previously collected samples to develop and validate a model for identifying early LD with high sensitivity and specificity. TCR repertoires were sequenced from 8,585 participants, including 298 cases of LD enrolled in prospective studies and 8,287 controls obtained from a variety of sources. Allocation of samples to sets used for classifier training, setting the classification threshold, and validation of the TCR assay was prespecified; all available samples meeting the quality control (QC) and assignment criteria were included in the analysis (see Assignment of Cohorts and Tables 1 and 2). The primary endpoint of the study was evaluation of sensitivity in the JHU cohort, which was selected based on the conservative enrollment criteria for that cohort. Inclusion and exclusion criteria for all cohorts are detailed below. All procedures involving human participants were conducted in accordance with the ethical standards of the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards. For the LDB cohort, institutional review board (IRB) approval was obtained for each site through the LDB sponsor protocol (Advarra IRB) or the institution-specific IRB. For the Boca cohort, IRB approval was obtained through the Advarra IRB. Human subject protocols for the JHU cohort were approved by the IRBs of Johns Hopkins University and Stanford University. All participants provided written informed consent prior to enrollment. ### Study cohorts For a detailed summary of all cohorts, see Tables 1 and 2. #### LDB cohort LDB is a program of the Bay Area Lyme Foundation. The LDB cohort enrolled individuals from the East coast and upper Midwest regions of the United States who presented with signs or symptoms consistent with early LD. Included patients presented with EM rash greater than 5 cm in diameter or an erythematous, annular, expanding skin lesion of 5 cm or less or presented with signs or symptoms (headache, fatigue, fever, chills, or joint or muscular pain) without an EM/annular lesion, but with a suspected tick exposure or tick bite, and with no history of chronic fatigue syndrome, rheumatologic disease, or multiple sclerosis. Individuals with tick-bite reactions (eg, a non-annular erythematous macule at the site of the tick bite) without EM or expanding annular lesion were excluded, as were those who had initiated antibiotics more than 48 hours before enrollment. Healthy individuals living in the same regions with no history of LD or tick-borne infection were enrolled as controls from Lyme-endemic areas. Real-time PCR for *Bb* detection in whole blood and serologic testing for antibodies against *Bb* using the immunoassays comprising STTT (non-reflexive) were conducted for all individuals. Laboratory-confirmed early LD samples were defined as being STTT-positive, PCR-positive (sample or culture), or having 2 positive ELISAs and an EM of 5 cm or larger in diameter. Laboratory-confirmed controls from Lyme-endemic areas were defined as STTT-negative. Additional details and baseline clinical characteristics of this cohort were previously published (47). Only laboratory-confirmed samples that were STTT-positive were used in model training. Only laboratory-confirmed–positive (by STTT and/or PCR) cases and laboratory-confirmed– negative (by STTT) controls from Lyme-endemic areas were included in analyses of model performance or to establish the final call threshold. Nine cross-reactivity case samples that were positive by PCR for *Anaplasma* (n=4) or *Babesia* (n=5) were used for additional specificity testing. #### Discovery Life Sciences (DLS) cohort Blood samples from 12 individuals found to be *Babesia*-positive by PCR testing were acquired from Discovery Life Sciences for use in specificity testing. #### Boca cohort Specimens were collected from antibiotic-treatment–naïve patients recruited at clinical sites throughout New York and New Jersey who presented with acute symptomology of a tick-borne illness. Participants had blood drawn on 3 occasions: 30 days or less post-tick bite while antibiotic-treatment–naive, 6 to 8 weeks post tick bite, and 16 to 24 weeks post tick bite. Whole blood samples were aliquoted, frozen, and stored at −80°C after collection. At each visit, information was captured regarding symptoms, date of symptom onset, treatment status, treatment regimen, and lab results. Specimens were received at Boca Biolistics Reference Laboratory (Pompano Beach, FL, USA) and characterized for relevant tick-borne pathogens, including *Bb*, *Babesia microti*, *Ehrlichia chaffeensis*, and *Anaplasma phagocytophilum*. Testing was performed in-house and at ARUP Laboratories (Salt Lake City, UT, USA) on matched serum collected from donors. DiaSorin and Immunonetics Lyme antibody testing was performed at Boca Biolistics Reference Laboratory, and IgM- and IgG-specific antibody screening for *Bb*, *B. microti*, *E. chaffeensis*, and *A. phagocytophilum* was performed at ARUP Laboratories. For the present study, immunosequencing was performed on the first available sample from 18 donors who were seropositive for *Bb* by ELISA and immunoblot (either IgG or IgM), all of whom were classified as STTT-positive by 2-tiered testing criteria. #### JHU cohort The SLICE study is a longitudinal, prospective cohort study that enrolled patients 18 years of age or older with early LD who were self-referred or recruited from primary or urgent care settings from 2008 to 2020. Eligible participants were enrolled primarily at study sites in Maryland, with a small number enrolled at a satellite site in Southeastern Pennsylvania. At enrollment, participants were required to have a visible EM of at least 5 cm in diameter diagnosed by a healthcare provider. All patients had received no more than 72 hours of appropriate antibiotic treatment for early LD at enrollment. Additional details and baseline clinical characteristics have been previously published (48). Participants without a clinical or serologic history of LD were recruited from similar primary care settings or through the community using flyers and online advertising to serve as controls from Lyme-endemic areas. Controls were required to be STTT- negative at the time of enrollment and at all subsequent visits. Participants in both groups were excluded for a range of self-reported prior medical conditions paralleling those listed in the proposed case definition for post-treatment LD syndrome (49), specifically, chronic fatigue syndrome, fibromyalgia, unexplained chronic pain, sleep apnea or narcolepsy, autoimmune disease, chronic neurologic disease, liver disease, hepatitis, HIV, cancer or malignancy in the past 2 years, major psychiatric illness, or drug or alcohol abuse. Patients with early LD were treated with 3 weeks of oral doxycycline (5) and seen regularly over the course of 1 to 2 years. Samples collected before treatment, immediately after treatment, and 6 months post treatment were used for the present study, in addition to samples collected from control participants at an initial study visit. Disseminated EM rash was defined as more than 1 rash site visible on physical exam at the pretreatment study visit, and local rash was defined as a single EM rash site. Duration of illness was determined by self-report of the number of days since the first appearance of LD-specific signs or symptoms. The number of LD symptoms at enrollment was obtained through an interviewer-administered questionnaire. STTT status was determined by Quest Diagnostics at each study time point using CDC recommendations incorporating duration of illness at time of testing (7). High-resolution HLA class I/II typing for the JHU cohort (cases only) was performed by Scisco Genetics, Inc., (Seattle, WA, USA) using the ScisGo HLA v6 typing kit, as previously described (50, 51). #### Database controls A total of 7,959 repertoires sampled during previous studies were selected from our database. Inclusion was determined at the cohort level and based on the size of the cohort, geographic region (United States and LD-endemic regions of Europe), and sequencing date (2019 or later, to ensure consistent lab sequencing protocols). Samples were classified as being either from endemic regions (Germany, Italy, or upper Midwest or Northeast regions of the United States) or non-endemic regions (other regions of the United States). All individuals in these cohorts were not tested but were presumed to be LD-negative. ### Assignment of cohorts Training cases were drawn from the LDB and Boca cohorts (Tables 1 and 2). To enrich for cases with a likely immune response and maximize our ability to detect LD-associated ESs, the training set was limited to 72 STTT-positive cases (54 LDB, 18 Boca). Training controls included 2,981 repertoires from individuals from non-endemic regions of the United States and Europe previously collected as part of other studies and presumed to be LD-negative. The positive-call threshold was set based on 2,507 presumed LD-negative samples collected from endemic regions available in our database, along with 120 confirmed STTT-negative controls from Lyme-endemic areas randomly selected from the LDB cohort. Additional LDB case (n=15) and control (n=48) samples collected during the 2019 tick season were sequenced after model training and used as an initial check of model specificity and generalizability. TCR assay sensitivity was evaluated in the JHU cohort. Repertoires sampled from 211 participants at the time of enrollment passed QC thresholds established after model training described below. A subset of patients in the JHU cohort (n=161) had sequenced repertoires that passed QC from samples collected before and after treatment and 6 months post treatment. Specificity of the final model was estimated based on 3 control cohorts from Lyme-endemic areas: (1) all controls from Lyme-endemic areas from JHU (n=45); (2) 50% (selected by random sampling) of controls from Lyme-endemic areas from tick seasons prior to 2019 in the LDB cohort (n=115 passed QC); and (3) 50% (selected by random sampling) of presumed controls from Lyme-endemic areas from our database (n=2,471 passed QC). ### TCR repertoire immunosequencing Immunosequencing of TCRβ CDR3 was performed using the immunoSEQ® Assay (Adaptive Biotechnologies, Seattle, WA, USA) and analyzed as previously described (52–54). For additional details, see the Supplementary Materials and Methods. ### Identification of LD-associated ESs Public TCRβ amino acid sequences associated with early LD were identified as described previously (28). Briefly, one-tailed Fisher’s exact tests (FETs) were performed on all unique TCR sequences to compare frequencies of ESs in early Lyme samples with those in presumed- negative controls. Unique sequences were defined based on the V-gene, J-gene, and CDR3 amino acid sequence. The *P*-value threshold for including a TCR in the ES list was treated as a hyperparameter and was selected to maximize model performance as described below. The resulting set of FET-defined ESs for cohort 𝒞 are denoted *Sfet𝒞*. A recent study in CMV demonstrated that many TCRs that are not identified as significant ESs in small training datasets based solely on FET may be selected in larger training sets (54). This observation motivates a simple classification problem: prediction of whether a TCR will be identified as an ES by FET when the dataset grows to a specified size. To this end, a logistic regression model was fitted, where the training data were the set of TCRs and corresponding features observed in a previously reported cohort 𝒞*CMV* labeled for CMV serostatus (28). The dependent binary variable was defined as 1 if the TCR was observed as in *Sfet*𝒞∗ for some large CMV-labeled cohort *C*CMV∗, and 0 otherwise. For each TCR, the following features were defined: (1) average and maximum convergent recombination (CR) for cases and controls; (2) average and maximum productive frequency for cases and controls; and (3) the number of sequences in *S**fet* similar to the TCR, defined as sharing the same V-gene, having identical CDR3 length, and differing by 1 amino acid. In practice, a larger CMV-labeled cohort was unavailable. However, as more than 50% of North American and European populations are seropositive for CMV (55), we applied a pseudo- labeling procedure to construct *C*∗ . Briefly, a logistic regression classifier (as defined in *[44]*) based on *S**fet* 𝒞*CMV* was trained on the labeled CMV cohort 𝒞*CMV*, then applied to all samples in the LD training cohort. The inferred CMV status was then treated as observed and combined with 𝒞 , resulting in 𝒞∗CMV , which was used to define *S**fet*𝒞**CMV*. The resulting logistic regression classifier was able to accurately predict which TCRs observed in 𝒞*CMV* but not in *S**fet*𝒞*CMV* would end up in *S**fet*𝒞**CMV* (area under the receiver operating characteristic [ROC] curve=0.84 in cross-validation; data not shown). Features receiving the greatest weight in this model were the CR counts in cases (likely indicating substantial clonal expansion) and the number of similar sequences in *S*𝒞*CMV* (likely indicating that the TCR responds to the same antigen as another ES). The model fitted to CMV was used to infer ESs for LD. Combining these inferred ESs with ESs identified by FET resulted in the final set of LD-associated ESs, *S𝒞Lyme*. Hyperparameters in this model were chosen using cross-validation in the context of the disease classification model described below. ### Inferring early LD status based on ES counts in TCR repertoires Given a set of ESs *S*, the pair (*y**i*, *x**i*) can then be defined for each repertoire *i*, where *x**i* is the total number of unique productive DNA TCR rearrangements in the sampled repertoire, and *y**i* < *x**i* is the number of those rearrangements that encode any of the ESs in *S*. If *y**i* is treated as sampled from a random variable *Y*, the expected value of *Y* given *x* can be considered. By the way ESs are defined, the distribution of *Y*|*x* is expected to vary substantially between cases and controls. While this could be treated as a classification problem to maximize the separation between cases and controls (as in [39, 40]), *Y*|*x* was instead explicitly modeled among control samples, with classification based on standard units of deviation above and below expectation. This approach provides superior control of specificity across populations given the extremely unbalanced nature of our case/control dataset. To model this distribution, *Y* was assumed to follow a binomial distribution, with mean *f*(*x*) = *y*max *p*(*x*) and variance *σ*2(*x*) = *y*max *p*(*x*)(1 − *p*(*x*)), where *y*max is the maximum number of ESs observed in any training repertoire, and ![Formula][1] for model parameters *w* and *b*. For a given (*y**i*, *x**i*), the number of standard deviations *y**i* is from the expected mean given *x**i* is then used as the model score: ![Formula][2] The model parameters *w* and *b* were chosen by minimizing the sum of squared residuals over the set of training control samples. The observed data are moderately overdispersed with respect to the estimated variance (Figure 1A). As such, the final call threshold *t* was chosen to fix the prespecified false-positive rate of 1% on a set of 2,627 presumed LD-negative control samples as described above. For a detailed description of TCR repertoire QC criteria, see the Supplementary Materials and Methods. ### Antigen mapping and HLA assignment The MIRA assay was used for antigen mapping as described previously (32). To assign HLA subtypes, ESs were clustered by a simple TCR amino acid similarity metric. ESs were considered to belong to a cluster if they shared a V-gene family and if all members of a cluster were connected by no more than a 1-Hamming difference between CDR3 regions. HLA subtypes were inferred based on the results of one-tailed FETs performed between each ES and every HLA subtype. For a detailed description of MIRA experiments and assignment of ESs to antigens and HLA subtypes, see the Supplementary Materials and Methods. ### Statistical analysis Detailed statistical analyses associated with development of the TCR classifier and MIRA-based TCR–antigen assignment are described above. Additional statistical analyses were performed using the Python packages, scipy (version 1.5.4) and statsmodels (version 0.12.2). Significant associations between TCR score and clinical variables were assessed by the Mann-Whitney *U* test in univariate analysis and by multivariable linear regression with sex, age, and serostatus as variables. *P* values <0.05 were considered significant. The correlation between TCR score and the number of LD-related symptoms was assessed by Spearman’s rank-order correlation. For comparisons of sensitivity, error bars represent mean ± 95% CI by bootstrap sampling. For box- and-whisker plots, boxes indicate median ± interquartile ranges (IQR), and whiskers denote 1.5 times the IQR above the high quartile and below the low quartile. ### Data availability Deidentified data are available by request from AdaptiveBiotechnologies Medical Information ([https://www.adaptivebiotech.com/medical-information-request/](https://www.adaptivebiotech.com/medical-information-request/)). ## Data Availability Data requests may be submitted for consideration to Adaptive Biotechnologies Medical Information (https://www.adaptivebiotech.com/medical-information-request/). [https://www.adaptivebiotech.com/medical-information-request](https://www.adaptivebiotech.com/medical-information-request) ## SUPPLEMENTARY MATERIALS ### Materials and Methods Figure S1. Immunosequencing input DNA distributions by cohort. Figure S2. Clinical correlates of TCR scoring in STTT+ and STTT- individuals. Figure S3. Receiver operating characteristic curves using all control samples from Lyme- endemic regions from Figure 2A as negatives. Table S1. Multiple logistic regression of TCR model score on clinical features. Table S2. Counts of enhanced sequences mapped to each protein by MIRA. ## SUPPLEMENTARY MATERIALS ### MATERIALS AND METHODS #### TCR repertoire immunosequencing Immunosequencing of TCRβ CDR3 was performed using the immunoSEQ® Assay (Adaptive Biotechnologies, Seattle, WA, USA). Extracted genomic DNA was amplified in a bias- controlled multiplex PCR, followed by high-throughput sequencing. Sequences were collapsed and filtered to identify and quantitate the absolute abundance of each unique TCRβ CDR3 region for further analysis as previously described (26, 52, 53). Sequencing reactions contained a median of 7,884.0 ng of input DNA (range, 239.9–55,186.4 ng) and yielded a median of 314,948 T-cell templates per sample (range, 15–1,837,496) (Figure S1). The T-cell fraction (percentage of T cells among the estimated number of nucleated cells input) ranged from 0.3% to 90% (median, 26.0%). #### TCR repertoire QC criteria The 2 key parameters of the classifier are the number of unique productive rearrangements, *x*, and the number of unique productive rearrangements encoding an ES, *y*. For a given blood sample, the value of *x* is determined by the quantity of DNA, the fraction of cells that are T cells, and the diversity of T cells. In rare cases, *x* is too small to yield meaningful information, or significantly larger than observed in our training data, making extrapolation of *Y*|*x* problematic. Therefore, acceptance criteria were predefined for the number of unique productive rearrangements based on the observed distribution of *x* in the training data. The information contained in ESs is asymmetric: for small *x*, large *y* is considered to be evidence of LD, while small *y* may simply reflect a lack of sequenced T cells. Thus, QC criteria were treated asymmetrically. Specifically, *x*max and *x*min were defined as the upper and lower QC thresholds, which were prespecified to be equal to the 1st and 99th percentiles, respectively, of *x* observed in the training data. A sample *i* then failed QC if *x**i* > *x*max , or if *x**i* < *x*min and *ModelScore*(*y**i*, *x**i*) < *t*. #### Determination of antigen specificity of Lyme-associated TCRs To evaluate the potential specificity of antigens recognized by TCRs in the LD classifier, ESs were clustered by sequence similarity. Statistical assignment of individual TCRs to HLA subtypes resulted in a consistent HLA assignment for the cluster. To further characterize TCR– antigen specificity, MIRA was used to identify TCR epitopes, as detailed in the next section. Query peptides (777 total) were derived from 26 *Bb* proteins, and individual peptides or groups of related peptides were assigned to 1 of 426 unique MIRA pools. MIRA was then performed on T cells derived from peripheral blood mononuclear cells collected from 395 healthy individuals using a version of the assay that selects for HLA-II‒restricted CD4+ T cells. Basic Local Alignment Search Tool (BLAST) analysis of the target epitope was used to compare *Bb* FlaB to other *Borrelia* species and other pathogens. Specificity of our approach was further evaluated by comparing the Lyme-associated ESs with a set of TCRs from 507 individuals that were previously mapped to 325 SARS-CoV-2 antigen pools by MIRA (29 and Supplemental reference 1). #### Antigen-stimulation experiments (MIRA assay) ##### Panel design The multiplex identification of antigen-specific TCRs (MIRA) assay was set up, performed, and analyzed as described previously (32). Briefly, 2 panels of peptides were designed and tested in the assay. The Lyme-MIRA1 panel used antigens that are known to be presented in LD, including tiled portions of the DbpA, OspC, OspA, BBK32, BBA52, and VlsE proteins (Supplemental reference 2). The Lyme-MIRA2 panel used peptides derived from antigens presented via HLA class II upon *Bb* infection. The *Bb*-derived antigens include elongation factor Tu (WP\_002657015.1), BB\_0418 (WP\_002658797.1), p83/100 (CAA57125.1), ABC transporter (PRR58667.1), lipoprotein LA7 (WP\_002657819.1), GAPDH (AAB53930.1), chaperonin GroEL (WP\_002657108.1), flagellin (WP_002661938.1), OspA (WP_010890378.1), and p66 (WP_002656762.1). The peptide tiling strategy was used across the entirety of each antigen, yielding a series of peptides, each 17 amino acids (aa) long with a 7-aa overlap between peptides. The peptides were pooled in a combinatorial fashion as described previously (32); peptides that were overlapping or in close proximity in the viral proteome were grouped together into antigen sets. Each antigen set was then placed in a subset of 5 unique pools out of 11 total pools in the Lyme-MIRA1 panel, or 6 pools out of 12 in the Lyme-MIRA2 panel, referred to as its occupancy. ##### Naïve antigen-stimulation experiments A total of 304 experiments were run with the Lyme-MIRA1 panel (all “naïve” experiments, see below) and 174 with the Lyme-MIRA2 panel. For the “naïve” experiments, CD14+ monocytes were selected from peripheral blood mononuclear cells (PBMCs; Miltenyi Biotech, Auburn, CA, USA) and stimulated with granulocyte-macrophage colony-stimulating factor (GM-CSF) and interleukin (IL)-4 (BioLegend, San Diego, CA, USA) to drive dendritic cell (DC) differentiation *in vitro*. On day 3, GM-CSF, IL-4, interferon-γ (IFN-γ; BioLegend, San Diego, CA, USA), and lipopolysaccharide (LPS; Sigma-Aldrich, St. Louis, MO, USA or eBioscience, Inc, San Diego, CA, USA) were added to promote DC maturation. Also on day 3, naïve T cells were isolated from PBMCs (StemCell, Vancouver, BC, Canada) and incubated overnight with IL-7. On day 4, naïve T cells were combined with the differentiated CD14+ monocytes, IL-21 (BioLegend, San Diego, CA, USA), and a pool of all peptides present in the panel to be used for restimulation. Cultures were supplemented with IL-7, IL-15, and IL-2 every 2 to 3 days for an additional 12 to 14 days. Cells harvested from the expansion culture were divided into a series of replicate cultures, and each was restimulated using a distinct peptide pool from the panel under investigation. After incubation at 37°C for ∼20 hours, each culture was stained with antibodies (BioLegend, San Diego, CA, USA) for sorting by flow cytometry. Cells were then washed and suspended in phosphate-buffered saline containing 2% fetal bovine serum (FBS), 1 mM EDTA, and 4,6-diamidino-2-phenylindole (DAPI) for exclusion of non-viable cells. Cells were acquired and sorted using a FACSMelody (BD Biosciences, Franklin Lakes, NJ, USA) instrument. Sorted antigen-specific (CD4+CD137+CD145+, CD25lo) T cells were pelleted and lysed in RLT Plus buffer (Qiagen, Germantown, MD, USA) for nucleic acid isolation. ##### Assignment of ESs to antigens To assign ESs to antigens, RNA was isolated using AllPrep DNA/RNA mini and/or micro kits, according to the manufacturer’s instructions (Qiagen). RNA was then reverse transcribed to cDNA using Vilo kits (Life Technologies, Carlsbad, CA, USA), and TCRβ amplification was performed using the immunoSEQ Assay described above. After immunosequencing, the behavior of T-cell clonotypes was examined by tracking read counts across each sorted pool. True antigen-specific clones should be specifically enriched in a unique occupancy pattern corresponding to the presence of 1 of the query antigens in 5 or 6 pools in the Lyme-MIRA1 and Lyme-MIRA2 panels. Methods used to assign antigen specificity to TCR clonotypes have been reported previously (32). In addition to these methods, a non- parametric Bayesian model was developed to compute the posterior probability that a given clonotype was antigen-specific. This model uses the available read counts of TCRs to estimate a mean-variance relationship within a given experiment, as well as the probability that a clone will have zero read counts due to incomplete sampling of low-frequency clones. Together, this model considers the observed read counts of a clonotype across all pools and estimates the posterior probability of a clone responding to all valid addresses and an additional hypothesis that a clone is activated in all pools (truly activated, but not specific to any of our query antigens). To define antigen-specific clones, we identified TCR clonotypes assigned to a query antigen from this model with a posterior probability ≥0.7. TCR sequences from MIRA were compared to the ES list based on V-gene, J-gene, and CDR3 amino acid sequences. Any exact matches between the 2 lists, in which the MIRA TCR sequence was found in at least 2 separate individuals, were considered sufficient to map the ES to the MIRA antigen. #### ES clustering and HLA inference Clustering of ESs was based on TCR amino acid similarity. Specifically, 2 TCRs were assigned to the same cluster if they shared a V-gene family (and so have similar CDR1 and CDR2), had identical length, and differed by, at most, 1 amino acid in the CDR3 region. Clusters with at least 5 ESs were reported (Table 4). A sequence motif representing the CDR3 amino acid sequences assigned to each cluster was generated using WebLogo (University of California, Berkeley, CA, USA) (Supplemental references 3, 4). To assign an ES to a single HLA subtype, a one-tailed FET was performed between that ES and every HLA subtype. The ES was assigned to the HLA subtype with the lowest *P* value; if the lowest *P* value was >0.001, no assignment was made. Contingency tables counted the number of individuals with/without the ES and with/without a given HLA subtype. For HLA-DQ and HLA- DP, α/β heterodimers were treated as distinct HLA subtypes; for example, individuals with 2 α subtypes and 2 β subtypes were treated as expressing all 4 possible heterodimers. An HLA subtype was assigned to an ES cluster if most (>50%) of the cluster members with an assigned HLA subtype were assigned to the same subtype. ![Figure S1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F5.medium.gif) [Figure S1.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F5) Figure S1. Immunosequencing input DNA distributions by cohort. Boxes indicate median ± IQR, and whiskers denote 1.5 times the IQR above the high quartile and below the low quartile. ![Figure S2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F6.medium.gif) [Figure S2.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F6) Figure S2. Clinical correlates of TCR scoring in STTT-positive (A–D) and STTT-negative (E–H) individuals. TCR scores were stratified by (**A, E**) liver function test results (elevated [n=72] vs normal [n=139]), (**B, F**) lymphocyte counts (normal [n=150] vs low [n=61]), or (**C, G**) presentation of rash (multiple/disseminated [n=68] vs single [n=143]) and (**D, H**) plotted as a function of the number of Lyme-related symptoms (Spearman R2=0.17. *P* values, Mann-Whitney *U* test). Ns, not significant. In box-and-whisker plots, boxes indicate median ± IQR, and whiskers denote 1.5 times the IQR above the high quartile and below the low quartile. ![Figure S3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/07/01/2021.07.30.21261353/F7.medium.gif) [Figure S3.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/F7) Figure S3. Receiver operating characteristic curves using all control samples from Lyme- endemic regions from Figure 2A as negatives. Areas under the receiver operating characteristic curves are 0.98, 0.89, and 0.71 for the positive, converter, and negative curves, respectively. FPR, false positive-rate; TPR, true-positive rate. View this table: [Table S1.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T5) Table S1. Multiple logistic regression of TCR model score on clinical features. View this table: [Table S2.](http://medrxiv.org/content/early/2022/07/01/2021.07.30.21261353/T6) Table S2. Counts of enhanced sequences mapped to each protein by MIRA. ## Acknowledgments We are grateful to the research participants who contributed samples and data used in this study and to the physicians/health care providers who facilitated recruitment. Funding for this study was provided by Adaptive Biotechnologies. Institutional support was provided by the Steven and Alexandra Cohen Foundation (A.W.R., M.J.S., E.J.H., J.N.A.), Global Lyme Alliance (A.W.R., J.N.A.), National Institutes of Health grant P30 AR070254 (M.J.S.), and Bay Area Lyme Foundation (E.J.H.). Medical writing and editorial support were provided by Melanie Styers and Rachel Salmon of BluPrint Oncology Concepts and Kristin MacIntosh and Shahin Shafiani of Adaptive Biotechnologies. ## Footnotes * The authors declare a conflict of interest. M.P., J.N.D., D.B.G., R.M.G., T.M.S., and I.M.K. declare current or former employment with and equity ownership in Adaptive Biotechnologies. S.C.D. declares former employment with Adaptive Biotechnologies, employment with Stanford University School of Medicine, and equity ownership in Adaptive Biotechnologies. A.W.R. declares institutional support from the Steven and Alexandra Cohen Foundation and Global Lyme Alliance. M.J.S. declares institutional support from the Steven and Alexandra Cohen Foundation and National Institutes of Health grant P30 AR070254. E.J.H. declares compensation from the Lyme Disease Biobank and institutional support from Bay Area Lyme Foundation, Steven and Alexandra Cohen Foundation, and Adaptive Biotechnologies. R.O.E. and T.M. declare employment with Adaptive Biotechnologies during the time of this research. L.B. and H.S.R. declare employment with, equity ownership in, and leadership for Adaptive Biotechnologies. J.N.A. declares consulting with Tarsus Pharmaceuticals and Pfizer; participation in an advisory board with Adaptive Biotechnologies; expert testimony; membership in the Bay Area Lyme Foundation Scientific Advisory Board; Past Chair, 2018, HHS Tick-borne Disease Working Group, Office of HIV/AIDS and Infectious Disease Policy, Office of the Assistant Secretary of Health, Department of Health and Human Services; and institutional support from the Steven and Alexandra Cohen Foundation and Global Lyme Alliance. J.G., E.M., and J.M.C. declare employment with and equity ownership in Microsoft. * Order of the figures and tables and references have been updated to reflect edits made to clarify and streamline the text. Figure 1 has been updated, including an addition of a new panel, and new Figures 2E and S2 have been added. Table S1 has been updated and split into Tables 1 and 2. The previous Table S1 was deleted. Figure legends have been updated for clarity in main text and Supplement. Author affiliations have been updated. * Received July 30, 2021. * Revision received July 1, 2022. * Accepted July 1, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## REFERENCES 1. 1.Skar GL, Simonsen KA. 2022. Lyme disease, p1. In StatPearls [Internet]. StatPearls Publishing, Treasure Island, FL. 2. 2.Schwartz AM, Kugeler KJ, Nelson CA, Marx GE, Hinckley AF. 2021. Use of commercial claims data for evaluating trends in Lyme disease diagnoses, United States, 2010–2018. Emerg Infect Dis 27:499–507. 3. 3.Kugeler KJ, Schwartz AM, Delorey MJ, Mead PS, Hinckley AF. 2021. Estimating the frequency of Lyme disease diagnoses, United States, 2010–2018. Emerg Infect Dis 27:616–619. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid2702.202731&link_type=DOI) 4. 4.Steere AC, Strle F, Wormser GP, Hu LT, Branda JA, Hovius JWR, Li X, Mead PS. 2016. Lyme borreliosis. Nat Rev Dis Prim 2:16090. 5. 5.Lantos PM, Rumbaugh J, Bockenstedt LK, Falck-Ytter YT, Aguero-Rosenfeld ME, Auwaerter PG, Baldwin K, Bannuru RR, Belani KK, Bowie WR, Branda JA, Clifford DB, DiMario FJ, Halperin JJ, Krause PJ, Lavergne V, Liang MH, Cody Meissner H, Nigrovic LE, Nocton J (Jay) J, Osani MC, Pruitt AA, Rips J, Rosenfeld LE, Savoy ML, Sood SK, Steere AC, Strle F, Sundel R, Tsao J, Vaysbrot EE, Wormser GP, Zemel LS. 2021. Clinical practice guidelines by the Infectious Diseases Society of America (IDSA), American Academy of Neurology (AAN), and American College of Rheumatology (ACR): 2020 guidelines for the prevention, diagnosis, and treatment of Lyme disease. Arthritis Care Res 73:1–9. 6. 6.Wormser GP, Dattwyler RJ, Shapiro ED, Halperin JJ, Steere AC, Klempner MS, Krause PJ, Bakken JS, Strle F, Stanek G, Bockenstedt L, Fish D, Dumler JS, Nadelman RB. 2006. The clinical assessments treatment, and prevention of Lyme disease, human granulocytic anaplasmosis, and babesiosis: clinical practice guidelines by the Infectious Diseases Society of America. Clin Infect Dis 43:1089–1134. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/508667&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17029130&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000241106500001&link_type=ISI) 7. 7.Centers for Disease Control and Prevention. 1995. Recommendations for test performance and interpretation from the Second National Conference on Serologic Diagnosis of Lyme Disease. MMWR Morb Mortal Wkly Rep 44:590–591. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7623762&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 8. 8.Sigal LH. 1995. Diagnosis of Lyme disease. JAMA 274:1427–1428. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.1995.03530180021020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7474186&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 9. 9.Fix AD, Strickland GT, Grant J. 1998. Tick bites and Lyme disease in an endemic: Setting problematic use of serologic testing and prophylactic antibiotic therapy. JAMA 279:206–210. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.279.3.206&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9438740&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000071460700030&link_type=ISI) 10. 10.Marques AR. 2018. Revisiting the Lyme disease serodiagnostic algorithm: the momentum gathers. J Clin Microbiol 56:e00749–18. 11. 11.Waddell LA, Greig J, Mascarenhas M, Harding S, Lindsay R, Ogden N. 2016. The accuracy of diagnostic tests for Lyme disease in humans, a systematic review and meta-analysis of North American research. PLoS One 11:e0168613. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28002488&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 12. 12.Zweitzig D, Kopnitsky M, Zues Scientific. 2019. Validation of a modified two-tiered testing (MTTT) algorithm for the improved diagnosis of Lyme disease [White paper]. Zeus Scientific, Branchburg, NJ. 13. 13.Branda JA, Strle K, Nigrovic LE, Lantos PM, Lepore TJ, Damle NS, Ferraro MJ, Steere AC. 2017. Evaluation of modified 2-tiered serodiagnostic testing algorithms for early Lyme disease. Clin Infect Dis 64:1074–1080. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cid/cix043&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28329259&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 14. 14.Nadelman RB, Nowakowski J, Forseter G, Goldberg NS, Bittker S, Cooper D, Aguero- Rosenfeld M, Wormser GP. 1996. The clinical spectrum of early Lyme borreliosis in patients with culture-confirmed erythema migrans. Am J Med 100:502–508. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0002-9343(95)99915-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8644761&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996UP40400004&link_type=ISI) 15. 15.Nowakowski J, Schwartz I, Liveris D, Wang G, Aguero-Rosenfeld MEA, Girao G, McKenna D, Nadelman RB, Cavaliere LF, Wormser GP. 2001. Laboratory diagnostic techniques for patients with early Lyme disease associated with erythema migrans: a comparison of different techniques. Clin Infect Dis 33:2023–2027. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/324490&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11700579&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000172297500010&link_type=ISI) 16. 16.Schriefer ME. 2015. Lyme disease diagnosis: serology. Clin Lab Med 35:797–814. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cll.2015.08.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26593258&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 17. 17.Steere AC, McHugh G, Damle N, Sikand VK. 2008. Prospective study of serologic tests for Lyme disease. Clin Infect Dis 47:188–195. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/589242&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18532885&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000256905000007&link_type=ISI) 18. 18.John TM, Taege AJ. 2019. Appropriate laboratory testing in Lyme disease. Cleve Clin J Med 86:751–759. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiY2Nqb20iO3M6NToicmVzaWQiO3M6OToiODYvMTEvNzUxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDcvMDEvMjAyMS4wNy4zMC4yMTI2MTM1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 19. 19.Seriburi V, Ndukwe N, Chang Z, Cox ME, Wormser GP. 2012. High frequency of false positive IgM immunoblots for *Borrelia burgdorferi* in clinical practice. Clin Microbiol Infect 18:1236–1240. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1469-0691.2011.03749.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22369185&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 20. 20.Dressier F, Yoshinari NH, Steere AC. 1991. The T-cell proliferative assay in the diagnosis of Lyme disease. Ann Intern Med 115:533–539. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=1883122&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1991GG28300005&link_type=ISI) 21. 21.Vaz A, Glickstein L, Field JA, McHugh G, Sikand VK, Damle N, Steere AC. 2001. Cellular and humoral immune responses to *Borrelia burgdorferi* antigens in patients with culture-positive early Lyme disease. Infect Immun 69:7437–7444. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiaWFpIjtzOjU6InJlc2lkIjtzOjEwOiI2OS8xMi83NDM3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDcvMDEvMjAyMS4wNy4zMC4yMTI2MTM1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 22. 22.Bockenstedt LK, Wooten RM, Baumgarth N. 2021. Immune response to *Borrelia*: lessons from Lyme disease spirochetes. Curr Issues Mol Biol 42:145–190. 23. 23.Soloski MJ, Crowder LA, Lahey LJ, Wagner CA, Robinson WH, Aucott JN. 2014. Serum inflammatory mediators as markers of human Lyme disease activity. PLoS One 9:e93243. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0093243&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24740099&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 24. 24.Arstila TP, Casrouge A, Baron V, Even J, Kanellopoulos J, Kourilsky P. 1999. A direct estimate of the human αβ T cell receptor diversity. Science 286:958–961. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIyODYvNTQ0MS85NTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wNy8wMS8yMDIxLjA3LjMwLjIxMjYxMzUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 25. 25.Neller MA, Burrows JM, Rist MJ, Miles JJ, Burrows SR. 2013. High frequency of herpesvirus-specific clonotypes in the human T cell repertoire can remain stable over decades with minimal turnover. J Virol 87:697–700. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoianZpIjtzOjU6InJlc2lkIjtzOjg6Ijg3LzEvNjk3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDcvMDEvMjAyMS4wNy4zMC4yMTI2MTM1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 26. 26.Robins HS, Campregher P V, Srivastava SK, Wacher A, Turtle CJ, Kahsai O, Riddell SR, Warren EH, Carlson CS. 2009. Comprehensive assessment of T-cell receptor β-chain diversity in αβ T cells. Blood 114:4099–4107. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTI6ImJsb29kam91cm5hbCI7czo1OiJyZXNpZCI7czoxMToiMTE0LzE5LzQwOTkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wNy8wMS8yMDIxLjA3LjMwLjIxMjYxMzUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 27. 27.Venturi V, Price DA, Douek DC, Davenport MP. 2008. The molecular basis for public T-cell responses? Nat Rev Immunol 8:231–238. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nri2260&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18301425&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000253548600017&link_type=ISI) 28. 28.Emerson RO, DeWitt WS, Vignali M, Gravley J, Hu JK, Osborne EJ, Desmarais C, Klinger M, Carlson CS, Hansen JA, Rieder M, Robins HS. 2017. Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire. Nat Genet 49:659–665. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 29. 29.Snyder MT, Gittelman MR, Klinger M, May HD, Osborne JE, Taniguchi R, Jabran Zahid H, Kaplan MI, Dines NJ, Noakes TM, Pandya R, Chen X, Elasady S, Svejnoha E, Ebert P, Pesesky WM, De Almeida P, O’Donnell1 H, DeGottardi Q, Keitany G, Lu J, Vong A, Elyanow R, Fields P, Greissl J, Baldo1 L, Semprini S, Cerchione C, Nicolini F, Mazza M, Delmonte M Ottavia, Dobbs K, Laguna-Goya R, Carreño-Tarragona G, Barrio S, Imberti L, Sottini A, Quiros-Roldan E, Rossi C, Biondi A, Bettini RL, D’Angio M, Bonfanti P, Tompkins F. Miranda, Alba C, Dalgard C, Sambri V, Martinelli G, Goldman DJ, Heath RJ, Su CH, Notarangelo DL, Paz-Artal E, Martinez-Lopez J, Carlson M Jonathan, Robins SH. 2020. Magnitude and dynamics of the T- cell response to SARS-CoV-2 infection at both individual and population levels. medRxiv [https://doi.org/10.1101/2020.07.31.20165647](https://doi.org/10.1101/2020.07.31.20165647). 30. 30.Elyanow R, Snyder TM, Dalai SC, Gittelman RM, Boonyaratanakornkit J, Wald A, Selke S, Wener MH, Morishima C, Greninger AL, Gale Jr. M, Hsiang T-Y, Jing L, Holbrook MR, Kaplan IM, Zahid HJ, May DH, Carlson JM, Baldo L, Manley T, Robins HS, Koelle DM. 2022. T-cell receptor sequencing identifies prior SARS-CoV-2 infection and correlates with neutralizing antibodies and disease severity. JCI Insight 7:e150070. 31. 31.Dalai SC, Dines JN, Snyder TM, Gittelman RM, Eerkes T, Vaney P, Howard S, Akers K, Skewis L, Monteforte A, Witte PR, Wolf C, Nesse H, Herndon M, Qadeer J, Duffy S, Svejnoha E, Taromino C, Kaplan IM, Alsobrook J, Manley T, Baldo L. 6 May 2022. Clinical validation of a novel T-cell receptor sequencing assay for identification of recent or prior SARS-CoV-2 infection. Clin Infect Dis [https://doi.org/10.1093/cid/ciac353](https://doi.org/10.1093/cid/ciac353). 32. 32.Klinger M, Pepin F, Wilkins J, Asbury T, Wittkop T, Zheng J, Moorhead M, Faham M. 2015. Multiplex identification of antigen-specific T cell receptors using a combination of immune assays and immune receptor sequencing. PLoS One 10:e0141561. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0141561&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26509579&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 33. 33.Gittelman RM, Lavezzo E, Snyder TM, Zahid HJ, Carty CL, Elyanow R, Dalai SC, Kirsch I, Baldo L, Manuto L, Franchin E, Del Vecchio C, Pacenti M, Boldrin C, Cattai M, Saluzzo F, Padoan A, Plebani M, Simeoni F, Bordini J, Lorè NI, Lazarević D, Cirillo DM, Ghia P, Toppo S, Carlson JM, Robins HS, Crisanti A, Tonon G. 2022. Longitudinal analysis of T-cell receptor repertoires reveals shared patterns of antigen-specific response to SARS-CoV-2 infection. JCI Insight 7:e151849. 34. 34.Koszinowski UH, Reddehase MJ, Jonjic S. 1991. The role of CD4 and CD8 T cells in viral infections. Curr Opin Immunol 3:471–475. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0952-7915(91)90005-L&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=1684507&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1991GE11100005&link_type=ISI) 35. 35.Shepherd FR, McLaren JE. 2020. T cell immunity to bacterial pathogens: mechanisms of immune control and bacterial evasion. Int J Mol Sci 21:1–32. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijms21051664&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 36. 36.Marshall JS, Warrington R, Watson W, Kim HL. 2018. An introduction to immunology and immunopathology. Allergy Asthma Clin Immunol 14:49. 37. 37.Hastey CJ, Elsner RA, Barthold SW, Baumgarth N. 2012. Delays and diversions mark the development of B cell responses to Borrelia burgdorferi infection. J Immunol 188:5612. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiamltbXVub2wiO3M6NToicmVzaWQiO3M6MTE6IjE4OC8xMS81NjEyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDcvMDEvMjAyMS4wNy4zMC4yMTI2MTM1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 38. 38.Aucott JN, Rebman AW, Crowder LA, Kortte KB. 2013. Post-treatment Lyme disease syndrome symptomatology and the impact on life functioning: Is there something here? Qual Life Res 22:75–84. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11136-012-0126-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22294245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 39. 38.Aucott JN, Rebman AW, Crowder LA, Kortte KB. 2013. Post-treatment Lyme disease syndrome symptomatology and the impact on life functioning: is there something here? Qual Life Res 22:75–84. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11136-012-0126-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22294245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 40. 39.Rebman AW, Crowder LA, Kirkpatrick A, Aucott JN. 2015. Characteristics of seroconversion and implications for diagnosis of post-treatment Lyme disease syndrome: acute and convalescent serology among a prospective cohort of early Lyme disease patients. Clin Rheumatol 34:585–589. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10067-014-2706-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24924604&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 41. 40.Aguero-Rosenfeld ME, Nowakowski J, Bittker S, Cooper D, Nadelman RB, Wormser GP. 1996. Evolution of the serologic response to *Borrelia burgdorferi* in treated patients with culture- confirmed erythema migrans. J Clin Microbiol 34:1–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamNtIjtzOjU6InJlc2lkIjtzOjY6IjM0LzEvMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA3LzAxLzIwMjEuMDcuMzAuMjEyNjEzNTMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 42. 41.McKisic MD, Barthold SW. 2000. T-cell-independent responses to Borrelia burgdorferi are critical for protective immunity and resolution of Lyme disease. Infect Immun 68:5190–5197. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiaWFpIjtzOjU6InJlc2lkIjtzOjk6IjY4LzkvNTE5MCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA3LzAxLzIwMjEuMDcuMzAuMjEyNjEzNTMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 43. 42.Pegalajar-Jurado A, Schriefer ME, Welch RJ, Couturier MR, MacKenzie T, Clark RJ, Ashton L V., Delorey MJ, Molins CR. 2018. Evaluation of modified two-tiered testing algorithms for Lyme disease laboratory diagnosis using well-characterized serum samples. J Clin Microbiol 56:e01943–17. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 44. 43.Mosel M, Rebman A, Carolan H, Montenegro T, Lovari R, Schutzer S, Ecker D, Yang T, Ramadoss N, Robinson W, Soloski M, Eshoo M, Aucott J. 2020. Molecular microbiological and immune characterization of a cohort of patients diagnosed with early Lyme disease. J Clin Microbiol 59:e00615–20. 45. 44.Aucott JN, Yang T, Yoon I, Powell D, Geller SA, Rebman AW. 2022. Risk of post-treatment Lyme disease in patients with ideally-treated early Lyme disease: a prospective cohort study. Int J Infect Dis 116:230–237. 46. 45.Kalish RA, McHugh G, Granquist J, Shea B, Ruthazer R, Steere AC. 2001. Persistence of immunoglobulin M or immunoglobulin G antibody responses to *Borrelia burgdorferi* 10-20 years after active Lyme disease. Clin Infect Dis 33:780–785. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/322669&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11512082&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170994700009&link_type=ISI) 47. 46.Zajkowska JM. 2014. Antibody-based techniques for detection of Lyme disease: a challenging issue. Antib Technol J 4:33–44. 48. 47.Horn E, Dempsey G, Schotthoefer A, Prisco U, McArdle M, Gervasi S, Golightly M, De Luca C, Evans M, Pritt B, Theel E, Iyer R, Liveris D, Wang G, Goldstein D, Schwartz I. 2020. The Lyme disease biobank: Characterization of 550 patient and control samples from the East Coast and Upper Midwest of the United States. J Clin Microbiol 58:e00032–20. 49. 48.Rebman AW, Yang T, Mihm EA, Novak CB, Yoon I, Powell D, Geller SA, Aucott JN. 2021. The presenting characteristics of erythema migrans vary by age, sex, duration, and body location. Infection 49:685–692. 50. 49.Aucott J, Crowder L, Kortte K. 2013. Development of a foundation for a case definition of post-treatment Lyme disease syndrome. Int J Infect Dis 17:e443–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2013.01.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23462300&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 51. 50.Nelson WC, Pyo CW, Vogan D, Wang R, Pyon YS, Hennessey C, Smith A, Pereira S, Ishitani A, Geraghty DE. 2015. An integrated genotyping approach for HLA and other complex genetic systems. Hum Immunol 76:928–938. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.humimm.2015.05.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26027777&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 52. 51.Smith AG, Pereira S, Jaramillo A, Stoll ST, Khan FM, Berka N, Mostafa AA, Pando MJ, Usenko CY, Bettinotti MP, Pyo C-W, Nelson WC, Willis A, Askar M, Geraghty DE. 2019. Comparison of sequence-specific oligonucleotide probe vs next generation sequencing for HLA- A, B, C, DRB1, DRB3/B4/B5, DQA1, DQB1, DPA1, and DPB1 typing: toward single-pass high-resolution HLA typing in support of solid organ and hematopoietic cell transplant programs. HLA 94:296–306. 53. 52.Robins H, Desmarais C, Matthis J, Livingston R, Andriesen J, Reijonen H, Carlson C, Nepom G, Yee C, Cerosaletti K. 2012. Ultra-sensitive detection of rare T cell clones. J Immunol Methods 375:14–19. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jim.2011.09.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21945395&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000300913300003&link_type=ISI) 54. 53.Carlson CS, Emerson RO, Sherwood AM, Desmarais C, Chung MW, Parsons JM, Steen MS, LaMadrid-Herrmannsfeldt MA, Williamson DW, Livingston RJ, Wu D, Wood BL, Rieder MJ, Robins H. 2013. Using synthetic templates to design an unbiased multiplex PCR assay. Nat Commun 4:2680. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ncomms3680&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24157944&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) 55. 54.Pavlović M, Scheffer L, Motwani K, Kanduri C, Kompova R, Vazov N, Waagan K, Bernal FLM, Costa AA, Corrie B, Akbar R, Al Hajj GS, Balaban G, Brusko TM, Chernigovskaya M, Christley S, Cowell LG, Frank R, Grytten I, Gundersen S, Haff IH, Hovig E, Hsieh PH, Klambauer G, Kuijjer ML, Lund-Andersen C, Martini A, Minotto T, Pensar J, Rand K, Riccardi E, Robert PA, Rocha A, Slabodkin A, Snapkov I, Sollid LM, Titov D, Weber CR, Widrich M, Yaari G, Greiff V, Sandve GK. 2021. The immuneML ecosystem for machine learning analysis of adaptive immune receptor repertoires. Nat Mach Intell 2021 3:936–944. 56. 55.Zuhair M, Smit GSA, Wallis G, Jabbar F, Smith C, Devleesschauwer B, Griffiths P. 2019. Estimation of the worldwide seroprevalence of cytomegalovirus: A systematic review and meta- analysis. Rev Med Virol 29:e2034. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/rmv.2034&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) ## References cited in the Supplementary Materials only 1. 1.Nolan S, Vignali M, Klinger M, Dines J, Kaplan I, Svejnoha E, Craft T, Boland K, Pesesky M, Gittelman R, Snyder T, Gooley C, Semprini S, Cerchione C, Mazza M, Delmonte O, Dobbs K, Carreño-Tarragona G, Barrio S, Sambri V, Martinelli G, Goldman J, Heath J, Notarangelo L, Carlson J, Martinez-Lopez J, Robins H. 2020. A large-scale database of T-cell receptor beta (TCRβ) sequences and binding associations from natural and synthetic exposure to SARS-CoV-2. Res Sq [https://doi.org/10.21203/rs.3.rs-51964/v1](https://doi.org/10.21203/rs.3.rs-51964/v1). 2. 2.Singh P, Verma D, Backstedt BT, Kaur S, Kumar M, Smith AA, Sharma K, Yang X, Azevedo JF, Gomes-Solecki M, Buyuktanir O, Pal U. 2017. *Borrelia burgdorferi* BBI39 Paralogs, Targets of Protective Immunity, Reduce Pathogen Persistence Either in Hosts or in the Vector. J Infect Dis 215:1000. 3. 3.Crooks GE, Hon G, Chandonia J-M, Brenner SE. 2004. WebLogo: a sequence logo generator. Genome Res 14:1188–1190. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjE0LzYvMTE4OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA3LzAxLzIwMjEuMDcuMzAuMjEyNjEzNTMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 4. 4.Schneider T, Stephens R. 1990. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res 18:6097–6100. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/18.20.6097&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2172928&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F07%2F01%2F2021.07.30.21261353.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1990EF12100024&link_type=ISI) [1]: /embed/graphic-9.gif [2]: /embed/graphic-10.gif