Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

HIV Phylogenetics Reveals Overlapping Transmission Networks among Cities and Key Populations in Pakistan

View ORCID ProfileAdriano de Bernardi Schneider, View ORCID ProfileFrançois Cholette, Yann Pelcat, View ORCID ProfileAaron G. Lim, View ORCID ProfilePeter T. Vickerman, View ORCID ProfileAdiba Hassan, View ORCID ProfileLaura H. Thompson, James F. Blanchard, Faran Emmanuel, Tahira Reza, Nosheen Dar, Nadeem Ikram, View ORCID ProfileRichard Pilon, Chris Archibald, View ORCID ProfileJeffrey B. Joy, Paul Sandstrom, View ORCID ProfileJoel O. Wertheim
doi: https://doi.org/10.1101/2021.12.14.21267743
Adriano de Bernardi Schneider
1Department of Medicine, University of California San Diego, San Diego, 92103, USA
2Genomics Institute, University of California Santa Cruz, Santa Cruz, 95064, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Adriano de Bernardi Schneider
  • For correspondence: adeberna{at}ucsc.edu
François Cholette
3National HIV and Retrovirology Laboratories, National Microbiology Laboratory at the JC Wilt Infectious Disease Research Centre, Public Health Agency of Canada, Winnipeg, Canada
4Department of Medical Microbiology and Infectious Diseases, University of Manitoba, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for François Cholette
Yann Pelcat
5Public Health Risk Science Division, National Microbiology Laboratory at St-Hyacinthe, Public Health Agency of Canada, St-Hyacinthe, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aaron G. Lim
6Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Aaron G. Lim
Peter T. Vickerman
6Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Peter T. Vickerman
Adiba Hassan
1Department of Medicine, University of California San Diego, San Diego, 92103, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Adiba Hassan
Laura H. Thompson
7Institute for Global Public Health, University of Manitoba, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Laura H. Thompson
James F. Blanchard
7Institute for Global Public Health, University of Manitoba, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Faran Emmanuel
7Institute for Global Public Health, University of Manitoba, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tahira Reza
8Institute for Global Public Health Pakistan, Chak Shahzad, Islamabad, Pakistan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nosheen Dar
9Canada-Pakistan HIV/AIDS Surveillance Project, Islamabad, Pakistan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nadeem Ikram
10National AIDS Control Programme, National Institute of Health, Chak Shahzad, Islamabad, Pakistan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Richard Pilon
3National HIV and Retrovirology Laboratories, National Microbiology Laboratory at the JC Wilt Infectious Disease Research Centre, Public Health Agency of Canada, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Richard Pilon
Chris Archibald
11Surveillance and Epidemiology Division, Centre for Communicable Diseases and Infection Control, Public Health Agency of Canada, Ottawa, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeffrey B. Joy
12BC Centre for Excellence in HIV/AIDS, Vancouver, Canada
13Department of Medicine, University of British Columbia, Vancouver, Canada
14Bioinformatics Programme, University of British Columbia, Vancouver, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jeffrey B. Joy
Paul Sandstrom
3National HIV and Retrovirology Laboratories, National Microbiology Laboratory at the JC Wilt Infectious Disease Research Centre, Public Health Agency of Canada, Winnipeg, Canada
4Department of Medical Microbiology and Infectious Diseases, University of Manitoba, Winnipeg, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joel O. Wertheim
1Department of Medicine, University of California San Diego, San Diego, 92103, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joel O. Wertheim
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The first case of HIV in Pakistan was documented in 1987, with multiple subtypes and circulating recombinant forms being introduced and currently circulating in the country. Since then, there has been a shift in the country from a low prevalence/high risk to a high-risk concentrated epidemic. Pakistan’s epidemic is concentrated among key populations at greater risk of HIV infection including people who inject drugs (PWID), Hijra sex workers (HSW), female sex workers (FSW), male sex workers (MSW), and men who have sex with men (MSM). This study focused on the geographical aspect as well as on the interactions between key populations at higher risk of contracting HIV. We aimed at understanding the behavior of these key populations at a molecular level with high granularity as well as investigating the possibility of multiple HIV-1 introductions in Pakistan. In this cross-sectional biological and behavioral survey, we collected dried blood spots (DBS) for the purposes of seroprevalence estimates and molecular epidemiology from individuals in 17 cities in Pakistan representing four key populations: PWID, HSW, MSW, and FSW. A total of 1153 envelope sequences (reference positions in HXB2: 7860-8274) of HIV were sequenced using a Sanger-based sequencing approach. To identify clusters based on the introduction of the virus in Pakistan from foreign countries we added 3623 publicly available HIV envelope sequences to our dataset. Phylogeographic inference suggests at least 15 independent introductions of the virus into Pakistan, with a total of 12 clusters ranging from 3 to 675 sequences in size containing sequences from Pakistan and neighboring countries exclusively. Our phylogenetic analysis shows a significant degree of connectivity and directionality suggesting broad and overlapping networks of HIV-1 transmission among cities and key populations in Pakistan.

Introduction

The first case of HIV in Pakistan was documented in 1987 in Karachi (1). Following this first recorded case, multiple subtypes and circulating recombinant forms (CRFs) have been identified and are currently circulating in the country (2–4). Still, before the implementation of integrated biological and behavioral surveillance (IBBS) of key populations in Pakistan in 2005 (5), little was known about the prevalence and associated risk behaviors of HIV in the country (6). Back in 2006, Pakistan was on the brink of a potentially severe epidemic (7), building upon an explosive HIV epidemic among people who inject drugs (PWID) in Karachi dating back as early as 2003 (8). HIV prevalence rapidly increased from 10.8% in 2005 to 38.4% in 2016-17 (5, 9). HIV transmission in large urban centers, such as Karachi and Lahore, has been observed in key populations such as PWID and female sex worker (FSW) (8). In 2011, the reported national HIV prevalence for FSW, Hijra (transgender) sex workers (HSW), male sex workers (MSW), and PWID was 0.8%, 7.2%, 3.1% and 37.8%, respectively (9). Although no significant changes in HIV prevalence was reported during the latest round of IBBS in 2016-17 for PWID (38.4%) and HSW (7.5%), suggesting an established epidemic, the prevalence in FSW and MSW increased approximately two fold from 0.8% to 2.2% and 3.1% to 5.6%, respectively (10). In Pakistan, high risk behaviors such as habitual needle sharing among PWID and frequent condom-less sex among PWID and sex workers, has been associated with an increase in HIV infections in Pakistan (10, 11).

HIV transmission in Asia, although known for its diversity, it is known to start with high rates of infection among PWID followed by sex workers and those who interact with them (7). In Pakistan, a rural to urban exodus as well as with migration of individuals from their native city to other cities in search of work has also been known to be a cause of transmission (7). A 2006 study of migrants observed that the use of condoms when in contact with sex workers was close to 10% (7). A separate study of female sex workers in 2013 found condom usage was <40% (12). Interestingly, although condoms are accessible in Pakistan, using condoms can be challenging given the country’s social conservatism (13).

Scaling up HIV prevention for all key populations in Pakistan is of utmost importance to reduce the overall burden of the disease in the country. A substantial degree of behavioral mixing has led to HIV transmission between key populations, such as PWID and men who have sex with men (MSM), who can then pass on the infection to their spouses and children (14). Along with high transmission rates among PWID, high HIV prevalence among sex workers has been observed, although no direct link of transmission between these populations has been established in Pakistan (15). Recent increases in HIV-1 prevalence among sex workers and MSM may be indicative of overlapping transmission networks among these key populations. It has been well documented that a significant percentage of PWID (>20%) are buying sex from HSW, FSW, and/or MSW (reported condom use very low). Furthermore, a non-negligible proportion of HSW (2.4%), FSW (5.9%), and MSM (4.2%) report injecting drugs (10). Additionally, in other countries in Asia a link between PWID and individuals buying and selling sex has been identified as the center of HIV epidemics (16). Although overlapping injecting and sexual networks have been reported, it hasn’t been shown empirically that this may lead to significant HIV transmission.

In this study, we report the sequencing of 1153 HIV envelope sequences from PWID and sex workers across 17 Pakistani cities. We investigate their phylogenetic relationships and transmission dynamics between key populations within Pakistan and ascertain the minimum number of introductions of HIV-1 in the country responsible for these clusters.

Materials and Methods

Data collection

Behavioral data and biological samples were collected in 2011 from individuals representing key populations at greater risk of contracting HIV including FSW, HSW, MSW, and PWID across 17 cities in Pakistan (Table 1). The sampling strategy was based on detailed mapping exercises of key populations in Pakistan undertaken prior to surveys and biological specimen collection. The mapping methodology is described in greater detail elsewhere (11). Field teams were dispatched across Pakistan to conduct the actual surveys and biological specimen collection. After obtaining informed consent, participants were interviewed using a structured questionnaire to collect information pertaining to socio-demographics, drug use practices, social networks, sexual behavior, and utilization of services for HIV testing and care. Once the interview was completed, participants were tested for HIV through the use of dried blood spot (DBS) specimens collected by the standard finger prick method using self-retracting lancets. DBS specimen cards were air dried for a minimum of 3 hours at ambient temperature prior to storage at room temperature in sealed gasimpermeable bags containing desiccant pouches. All DBS specimen cards were screened for HIV-1 at the National HIV Reference Laboratory at the National AIDS Programme (Islamabad, Pakistan) and the Armed Institute of Pathology (AFIP; Rawalpindi, Pakistan). All HIV-1 reactive cards and approximately 10% of HIV-1 non-reactive cards (used as quality control) were shipped to the National HIV and Retrovirology Laboratory (NHRL; Winnipeg, Manitoba, Canada) for HIV-1 envelope gene sequencing using Sanger sequencing.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

Sample distribution of collected HIV positive dried blood cards in Pakistan.

A routine, in-house HIV-1 genotyping assay was used to sequence a portion of the envelope gene (HXB2 reference positions: 7860-8274). Total nucleic acid was isolated from a single DBS (approximately 75 µL of whole blood) using an automated, magnetic silica-based, NucliSENS easyMAG (bioMerieux) according to the manufacturer’s specific protocol B. Each DBS was lysed for 1 hour at room temperature with gentle agitation in 2 mL of NucliSENS lysis buffer. Nucleic acid was eluted in 50 µL of NucliSENS extraction buffer 3 and stored at −80C until further testing. The envelope gene was amplified by reverse transcriptase PCR (RT-PCR) followed by nested PCR. RT-PCR thermal cycling conditions consisted of 50C/10 min, 98C/2 min followed by 40 cycles of 98C/10 sec, 50C/10 sec, 72C/60 sec and a final extension of 72/5 min. Each reaction contained 10.5 µL of water, 25 µL of SuperScript 2X Reaction Mix, 2 µL of primers gp41F1 and gp41R1 (200 nM final concentration), 0.5 µL of SuperScript IV RT mix, and 10 µL of template. Nested PCR thermal cycling conditions consisted of 94C/2 min followed by 40 cycles of 94C/15 sec, 60C/15 sec, 68C/30 sec and a final extension of 68/5 min. Each nested PCR reaction contained 16 µL of water, 25 µL of 2X Platinum II Master Mix, 2 µL of primer gp46F2 and gp48R2 (400 nM final concentration), and 5 µL of template. Primer sequences for gp41 were the following: gp41F1:CTTAGGAGCAGCAGGAAGCACTATGGG; gp41R1:AACGACAAAGGTGAGTATCCCTGCCTAA; gp41F2:ACAATTATTGTCTGGTATAGTGCAACAGCA; gp41R2:TCCTACTATCATTATGAATATTTTTATATA. Sanger sequencing was performed using BigDye Terminator v3.1 cycle sequencing kit with nested PCR primers. Sequence analysis and contig assembly were performed using Geneious software.

A total of 1153 HIV-1 partial envelope sequences were sequenced using the Sanger-based sequencing approach described above in Winnipeg at the JC Wilt Infectious Diseases Research Centre, National HIV and Retrovirology Laboratories, Public Health Agency of Canada. The sequences were deposited on GenBank (Accession numbers: MW304006MW305158).

Ethical approval was obtained from the Hope International and Health Canada/Public Health Agency of Canada Ethical Review Board (REB-2010-0044). Informed consent was obtained verbally from study participants for the behavioral survey and biological sampling components of this study. Verbal consent was chosen due to the varying levels of literacy among study participants as well as to protect their identities. Each consent was documented by a member of the data collection team.

Maps of prevalence

Cities with prevalence data greater than 0 were visualized as heatmaps. To create the heatmaps, data points were projected using ArcMap 10.7. A Jenk’s natural break method was used to classify the raster into five categories for cartographic representation.

Molecular data and sequence analyses

Multiple sequence alignment was performed using MAFFT v7.215 (17) under default settings. The alignments were visualized in AliView and ragged edges were trimmed manually when necessary (18). We classified the HIV sequences into subtypes using a combination of COMET (19) and phylogenetic inference using IQ-TREE (20). For this phylogenetic analysis we included 170 reference envelope sequences obtained from the Los Alamos National Laboratories HIV databases (LANL https://www.hiv.lanl.gov) encompassing all available subtypes and CRFs. Results from COMET analysis were compared to the clades to which each sequence clustered with the reference sequence.

We downloaded 3623 HIV envelope sequences from the LANL HIV sequence database, encompassing all the available A1, G and CRFs 02,35 and 63 from the HIV database at the time of search, creating a large diverse dataset with sequences from around the world from the same subtypes identified as the most prevalent on our dataset to identify individual clusters of HIV introduction in Pakistan (see supplementary data). All the geographic metadata available in the public domain for these envelope sequences was compiled (see supplementary data). We separated the Pakistani sequences from the initial dataset into twelve subset clusters to study the relationships between the taxon within each individual cluster. These subsets encompassed all sequences belonging to Pakistan and neighboring Asian countries.

Phylogenetic tree search

To determine the HIV clusters we performed a maximum-likelihood tree search method using FastTree (21) under the GTRCAT model of nucleotide substitution with 100 bootstraps and the -pseudo, -spr and mlacc options for improved accuracy. Clades containing only Pakistani sequences with a minimum of 85% bootstrap value were selected as individual clusters. When available, neighboring Asian countries that were under the same bootstrap threshold were also included as part of the clusters. The final phylogenetic tree was rendered using Evolview v2 (22).

Genetic distance

To calculate the genetic distance within each individual Pakistani clade we estimated the evolutionary divergence over sequence pairs within groups as implemented in MEGA X under default settings (23, 24). The analyses were conducted using the Maximum Composite Likelihood model (25).

Phylogeography of individual clusters

BEAST v.10.4 was used to attain the molecular dating and phylogeographic diffusion of the individual clusters (26). The phylogeographic models were used to estimate both the transmission between locations and key populations. We performed the phylogeographic diffusion in discrete space analysis using BSSVS (a Bayesian Stochastic Search Variable Selection) (27). We calculated the substitution model for each cluster using ModelFinder as implemented on IQ-TREE (20), and used the Skyride coalescent tree prior (28). Results were visualized in Tracer (29) and convergence was determined by Effective Sample Sizes of at least 200 per statistic. Strict clock and uncorrelated relaxed lognormal clock (URL) were tested for each cluster. We ran the model in triplicate with a minimum Markov chain Monte Carlo (MCMC) length of 100 million for each cluster and 10% burn-in to each original MCMC. The final maximum clade credibility trees with metadata annotation were rendered using FigTree (30).

Additionally, Bayes Factor (BF) and Markov Jumps were calculated for each individual cluster. We wrote a script to extract both Bayes Factor and Markov Jumps from BEAST log files (available at https://github.com/abschneider/phylogeography-density-plots). We calculated the BF for each individual cluster for the transition between cities and key populations. Results were compiled and a table summarizing all transmissions between cities and key populations was created. We used a BF threshold of ≥ 5 to accept a transmission as significant (31).

Phylogeography visualization

Individual networks were visualized using the MCC trees on StrainHub v1.0.1 (32). The combined transmission table was used to generate a network map for the transmission between cities. The network map was created using ESRI ArcMap version 10.8. Country administrative boundaries GADM version 3.6 (https://gadm.org/) were used as a basemap.

Data availability

All supplementary data can be found at GitHub (https://github.com/abschneider/Paper_HIV_Pakistan).

The Bayes Factor script is available at GitHub (https://github.com/abschneider/phylogeography-density-plots).

Results

Subtype classification

Subtype classification analysis using COMET (19) and phylogenetic inference using IQTREE (33) indicates at least six major subtypes/CRFs (Circulating Recombinant Forms) represented on our dataset: A1, G, C, D, CRF02_AG and CRF35. The sequences that had mismatches between both COMET and the phylogenetic analysis subtyping analyses were labeled as “unknown”. When results from COMET suggested the possibility of recombinants, and these were confirmed at the phylogenetic analysis, we classified according to the phylogenetic analysis result (i.e., “A1 (check for 35_AD)” / “35_AD” = 35_AD)(see Supplementary Material). Higher prevalence subtypes were identified on the dataset as A1 (64%), G (5%), CRF02_AG (2%) and CRF35 (4%), 24% of the dataset sequences were identified as unknown.

Nucleotide divergence

The nucleotide divergence found within each individual clade was relatively low (< 0.06) with an average of 0.04, with the exception of clade IV (0.12) (Figure 2). Clade IV (N=18), the clade with highest nucleotide divergence, is the only one with sequences identified as belonging to subtype C, which presents low prevalence over the entire dataset (>1%).

Fig. 1.
  • Download figure
  • Open in new tab
Fig. 1.

Location of the study population and prevalence of HIV within the area in distinct key populations (9). Top left: Female Sex Workers (FSW); Top Right: Hijra Sex Worker; Bottom Left: Male Sex Worker (MSW); Bottom Right: People Who Inject Drugs (PWID).

Fig. 2.
  • Download figure
  • Open in new tab
Fig. 2.

Estimates of average evolutionary divergence over sequence pairs within groups. The number of base substitutions per site from averaging over all sequence pairs within each group are shown. Analyses were conducted using the Maximum Composite Likelihood model (25). This analysis involved 1307 nucleotide sequences.

Phylogenetic analysis suggests multiple introductions of HIV in Pakistan

The maximum likelihood tree has 12 clades containing Pakistani sequences and three single terminal sequences from Pakistan (Figure 3). These independent clades include LANL sequences from countries around the world which separate them and indicate multiple independent introductions of HIV into Pakistan (see Supplementary Material for complete tree). The sequences clustered into a major cluster (I) with 675 sequences, six medium size clusters with 38-226 sequences (III, V, VI, IX, XI and XII) and five minor clusters with 3-18 sequences (II, IV, VII, VIII and X)(Table 2). Cluster XII contains sequences from Iran and Afghanistan, indicating transmission events happening between individuals across the border countries. The three sequences that did not cluster with other Pakistani sequences were ignored for individual cluster/transmission network analysis.

Fig. 3.
  • Download figure
  • Open in new tab
Fig. 3.

Maximum-likelihood tree of HIV envelope genes representing independently introduced HIV strains in Pakistan. Subtree is shown, LANL sequence terminals were omitted to aid the visualization, complete tree can be seen on supplementary material.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

Cluster distribution of samples

Bayesian inference and transmission dynamics of HIV clusters

Given the lack of temporal information for most of the clusters, we selected the cluster with most variable temporal data, cluster XII (Table 2), to calculate the clock and use it as the clock prior for all other individual cluster runs.

We performed analysis on ten of the twelve clusters. Clusters II and VIII only contained a single location and a single key population, thus, they were omitted from this analysis. The results of the individual analysis can be visualized in the supplementary material.

The combined data of each individual cluster shows transmission pattern between cities show three major cities for transmission given our current data, Karachi, Multan and Lahore, with Karachi mostly acting like a sink (over 70% of inbound transmission), Multan as a hub (inbound transmission equal to outbound transmission) and Lahore as a source (over 70% of outbound transmission) for HIV transmission (Figure 5). The transmission between key populations indicates connectivity between PWID and sex workers, which was observed in multiple clusters (I, III, IX, X and XI) as well as between FSW, MSW and HSW (I,IV,V,VI,IX) with Bayes Factor ≥ 5 (Figure 4).

Fig. 4.
  • Download figure
  • Open in new tab
Fig. 4.

Summarized HIV-1 transmission network between key populations in Pakistan. Arrows represent the direction of the transmission and numbers represent the number of observed transmissions between key populations that were supported by a Bayes Factor threshold ≥ 5.

Fig. 5.
  • Download figure
  • Open in new tab
Fig. 5.

Summarized HIV-1 transmission network in Pakistan. Pie charts size represent number of connections between cities and colors represent incoming and outgoing transmissions between cities. Transmission network was generated using Bayes Factor threshold ≥ 5. Network map was created using ESRI ArcMap version 10.8. Country administrative boundaries GADM version 3.6 () were used as a basemap.

Due to each individual cluster represent only the individual spread of that particular introduction in Pakistan, we elected to combine all the results in order to visualize the dynamics of transmission among all clusters and have a picture of the overall transmission pattern of HIV in Pakistan with our dataset. The rationale behind this is that regardless of separate introductions in the country, the dynamics of HIV occurs between hosts, rather than the virus, thus, combining the clusters give a better picture of the transmission pattern happening between the key populations in this study rather than each cluster separately.

Discussion

We characterized multiple introductions of HIV-1 into Pakistan, representing numerous subtypes: A1, G, C, D, CRF02_AG, and CRF35_AD. Our subtyping results are aligned with previous research which suggests that CRF02_AG/A1 recombinants predominate the prototype CRF02_AG viruses, while subtype A1 still dominates the virus population in Pakistan (2). According to Chen et al. (2), the majority of subtype A viruses in Pakistan were the result of a single introduction. In contrast, our results indicate the presence of subtype A1 in multiple independent clusters, proving that the former conclusion was hampered by the lack of evidence and highlighting the importance of a thorough phylogenetic analysis of HIV-1 genomic data in Pakistan. The presence of at least twelve introductions into the country of multiple subtypes, observed with the current dataset, validates the importance of cross border transmission in the onset of infectious diseases epidemics (34). The larger nucleotide divergence present at cluster IV, which may be explained given the low number of sequences for the subtypes present in our dataset for that clade (B, C and D), give initial evidence to support an even higher number of introductions to the country, although further studies with focus on these subtypes should be performed to allow a better understanding of their introductions into Pakistan.

Prevalence of HIV in Pakistan is concentrated in vulnerable populations such as PWID and sex workers, whereas prevalence of HIV in the general population seems to be underreported (35). As observed in our results, HIV transmission is not occurring in silos, with PWID and sex workers being interconnected as well as multiple cities across the whole country (Figures 4 and 5). Geographically, Lahore and Karachi are seen in our analysis as the main cities in Pakistan responsible for the spread of HIV-1 among these key populations. These two are major cities, with Karachi being the economic center of Pakistan, and Lahore being considered the cultural center with a significant sex worker population. A previous study pointed Larkana, the 15th most populous city in Pakistan, as the center of the HIV epidemic among PWID, indicating that the dynamics of the disease seems to change when additional key populations are included in the analysis (36). Nevertheless, to date, no study has connected these two key populations (sex workers and PWID) at a molecular level. Previous studies restricted to epidemiological field studies have suggested considerable overlapping HIV transmission occurring between PWID and sex workers and that overlapping transmission between these key populations existed based on surveys (15).

Neighboring countries, such as Iran, have reported HIV transmission by truck drivers (37). Commercial truck drivers in routes connecting main cities in Pakistan are also known to have contact with sex workers and potentially becoming an additional key population prompting long distance geographic spread of HIV (8, 38). Most HIV prevention efforts in Pakistan have focused on PWID and for several years this was not seem as of major concern given that HIV prevalence appeared to be relatively stable among sex workers (5, 39– 41). Recently, the increase of HIV prevalence observed among sex workers could also be explained by lack of interventions in these key populations as well as an overlapping of sexual/injecting networks with PWID (high prevalence). Although Pakistan has made significant advancements towards its HIV/AIDS response, political and social challenges enable the continuous transmission of HIV between these populations (42).

By combining the observed behavior of each individual introduction, we were able to understand the overall pattern of HIV-1 transmission among the key populations of this study. Given that individuals are unaware to which HIV subtype they are at risk of contracting, individual cluster dynamics are irrelevant from the perspective of understanding transmission networks in general when not trying to trace individual contact networks. This study focused on the geographical aspect as well as on the interactions between key populations at higher risk of contracting HIV.

Conclusion

Our findings document at least 15 distinct introductions of HIV into Pakistan, with a total of 12 clusters representing onward transmission within Pakistan. The combined information from the phylogeographic analysis of the individual clusters used to estimate the transmission between key populations suggests constant transmission of HIV between PWID and sex workers. Closely related sequences from Afghanistan and Iran, suggests repeated movement of HIV across these international borders, which could be explained by drug trade and refugees cross between the countries. Nevertheless, our results uncover only the surface of the epidemic given the limitation of what could be explored with our current dataset. Further studies with fine grained data should perform to understand better local transmission networks to allow for more comprehensive public health interventions and surveillance among key populations in Pakistan.

Data Availability

All supplementary data can be found at GitHub (https://github.com/abschneider/Paper_HIV_Pakistan). The Bayes Factor script is available at GitHub (https://github.com/abschneider/phylogeography-density-plots).

Funding

This work was partially supported by the National Institutes of Health (NIH) National Institute of Allergy and Infectious Diseases (grant numbers K01AI110181 and AI135992). The HIV/AIDS surveillance project was funded by the Canadian International Development Agency (CIDA) grant number PK-30849.

Acknowledgements

We thank the surveillance team for their efforts in mapping, data collection, specimen collection, and data entry. We also thank the participants who took part in the surveys for their time and contribution.

Footnotes

  • Fix typos and update Figure 4 caption.

Bibliography

  1. 1.↵
    Rafiq M Khanani, Amtul Hafeez, SM Rab, and Suraiya Rasheed. Human immunodeficiency virus-associated disorders in pakistan. AIDS research and human retroviruses, 4(2):149–154, 1988.
    OpenUrlCrossRefPubMedWeb of Science
  2. 2.↵
    Yue Chen, Bhavna Hora, Todd DeMarco, Sharaf Ali Shah, Manzoor Ahmed, Ana M Sanchez, Chang Su, Meredith Carter, Mars Stone, Rumina Hasan, et al. Fast dissemination of new hiv-1 crf02/a1 recombinants in pakistan. PloS one, 11(12):e0167839, 2016.
    OpenUrl
  3. 3.
    Uroosa Tariq, Azra Parveen, Furqana Akhtar, Faisal Mahmood, Syed Ali, and Syed Hani Abidi. Emergence of circulating recombinant form 56_cpx in pakistan. AIDS research and human retroviruses, 34(11):1002–1004, 2018.
    OpenUrl
  4. 4.↵
    Uroosa Tariq, Faisal Mahmood, Sadaf Naeem, Syed Ali, and Syed Hani Abidi. Emergence of hiv-1 unique dg recombinant form in pakistan. AIDS research and human retroviruses, 36(3):248–250, 2020.
    OpenUrl
  5. 5.↵
    National AIDS control program integrated biological and behavioral surveillance. round 1 report. islamabad, pakistan. National AIDS Control Program, 2005.
  6. 6.↵
    Michelle Munro, Merydth Holte-McKenzie, Sadia Ahmed, Chris P Archibald, James F Blan-chard, and Laura H Thompson. Second generation hiv surveillance in pakistan: policy challenges and opportunities. Sexually transmitted infections, 89(Suppl 2):ii48–ii52, 2013.
    OpenUrlAbstract/FREE Full Text
  7. 7.↵
    A Faisel and J Cleland. Migrant men: a priority for hiv control in pakistan? Sexually transmitted infections, 82(4):307–310, 2006.
    OpenUrlAbstract/FREE Full Text
  8. 8.↵
    Asma Bokhari, Naseer M Nizamani, Denis J Jackson, Naghma E Rehan, Motiur Rahman, Rana Muzaffar, Samina Mansoor, Hasan Raza, Kanwal Qayum, Philippe Girault, et al. Hiv risk in karachi and lahore, pakistan: an emerging epidemic in injecting and commercial sex networks. International journal of STD & AIDS, 18(7):486–492, 2007.
    OpenUrl
  9. 9.↵
    Pakistan National AIDS Control Program. Hiv second generation surveillance in pakistan. National Report Round IV. Canada-Pakistan HIV/AIDS Surveillance Project, 2011.
  10. 10.↵
    Pakistan National AIDS Control Program. Hiv second generation surveillance in pakistan - integrated biological behavioral surveillance in pakistan 2016-17. National Report Round V. Canada-Pakistan HIV/AIDS Surveillance Project, 2017.
  11. 11.↵
    Faran Emmanuel, Laura H Thompson, Momina Salim, Naeem Akhtar, Tahira E Reza, Hajra Hafeez, Sajid Ahmed, and James F Blanchard. The size and distribution of key populations at greater risk of hiv in pakistan: implications for resource allocation for scaling up hiv prevention programmes. Sexually transmitted infections, 89(Suppl 2):ii11–ii17, 2013.
    OpenUrlAbstract/FREE Full Text
  12. 12.↵
    Sharmistha Mishra, Laura H Thompson, Altaf Sonia, Nosheen Khalid, Faran Emmanuel, and James F Blanchard. Sexual behaviour, structural vulnerabilities and hiv prevalence among female sex workers in Pakistan. Sexually transmitted infections, 89(Suppl 2):ii34–ii42, 2013.
    OpenUrlAbstract/FREE Full Text
  13. 13.↵
    Alefiyah Rajabali, Saeed Khan, Haider J Warraich, Mohammad R Khanani, and Syed H Ali. HIV and homosexuality in Pakistan. The Lancet infectious diseases, 8(8):511–515, 2008.
    OpenUrlCrossRefPubMedWeb of Science
  14. 14.↵
    Muhammad R Khanani, Mehreen Somani, Sadiq S Rehmani, Nazle MC Veras, Marco Salemi, and Syed H Ali. The spread of hiv in pakistan: bridging of the epidemic between populations. PLoS One, 6(7):e22449, 2011.
    OpenUrlPubMed
  15. 15.↵
    Tahira Reza, Dessalegn Y Melesse, Leigh Anne Shafer, Momina Salim, Arshad Altaf, Altaf Sonia, Gayatri C Jayaraman, Faran Emmanuel, Laura H Thompson, and James F Blanchard. Patterns and trends in Pakistan’s heterogeneous HIV epidemic. Sexually transmitted infections, 89(Suppl 2):ii4–ii10, 2013.
    OpenUrlAbstract/FREE Full Text
  16. 16.↵
    Elizabeth Pisani, Purwa K Sucahya, Octavery Kamil, Saiful Jazan, et al. Sexual behavior among injection drug users in 3 indonesian cities carries a high potential for hiv spread to noninjectors. JAIDS Journal of Acquired Immune Deficiency Syndromes, 34(4):403–406, 2003.
    OpenUrl
  17. 17.↵
    Kazutaka Katoh and Daron M Standley. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution, 30 (4):772–780, 2013.
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.↵
    Anders Larsson. Aliview: a fast and lightweight alignment viewer and editor for large datasets. Bioinformatics, 30(22):3276–3278, 2014.
    OpenUrlCrossRefPubMed
  19. 19.↵
    Daniel Struck, Glenn Lawyer, Anne-Marie Ternes, Jean-Claude Schmit, and Danielle Perez Bercoff. Comet: adaptive context-based modeling for ultrafast hiv-1 subtype identification. Nucleic acids research, 42(18):e144–e144, 2014.
    OpenUrlCrossRefPubMed
  20. 20.↵
    Lam-Tung Nguyen, Heiko A Schmidt, Arndt Von Haeseler, and Bui Quang Minh. Iq-tree: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular biology and evolution, 32(1):268–274, 2015.
    OpenUrlCrossRefPubMed
  21. 21.↵
    Morgan N Price, Paramvir S Dehal, and Adam P Arkin. Fasttree: computing large minimum evolution trees with profiles instead of a distance matrix. Molecular biology and evolution, 26(7):1641–1650, 2009.
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    Zilong He, Huangkai Zhang, Shenghan Gao, Martin J Lercher, Wei-Hua Chen, and Songnian Hu. Evolview v2: an online visualization and management tool for customized and annotated phylogenetic trees. Nucleic acids research, 44(W1):W236–W241, 2016.
    OpenUrlCrossRefPubMed
  23. 23.↵
    Sudhir Kumar, Glen Stecher, and Koichiro Tamura. Mega7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular biology and evolution, 33(7):1870–1874, 2016.
    OpenUrlCrossRefPubMed
  24. 24.↵
    Sudhir Kumar, Glen Stecher, Michael Li, Christina Knyaz, and Koichiro Tamura. Mega x: molecular evolutionary genetics analysis across computing platforms. Molecular biology and evolution, 35(6):1547–1549, 2018.
    OpenUrlCrossRefPubMed
  25. 25.↵
    Koichiro Tamura, Masatoshi Nei, and Sudhir Kumar. Prospects for inferring very large phylogenies by using the neighbor-joining method. Proceedings of the National Academy of Sciences, 101(30):11030–11035, 2004.
    OpenUrlAbstract/FREE Full Text
  26. 26.↵
    Marc A Suchard, Philippe Lemey, Guy Baele, Daniel L Ayres, Alexei J Drummond, and Andrew Rambaut. Bayesian phylogenetic and phylodynamic data integration using beast 1.10. Virus evolution, 4(1):vey016, 2018.
    OpenUrlCrossRefPubMed
  27. 27.↵
    Philippe Lemey, Andrew Rambaut, Alexei J Drummond, and Marc A Suchard. Bayesian phylogeography finds its roots. PLoS computational biology, 5(9), 2009.
  28. 28.↵
    Vladimir N Minin, Erik W Bloomquist, and Marc A Suchard. Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics. Molecular biology and evolution, 25(7):1459–1471, 2008.
    OpenUrlCrossRefPubMedWeb of Science
  29. 29.↵
    Andrew Rambaut, Alexei J Drummond, Dong Xie, Guy Baele, and Marc A Suchard. Posterior summarization in bayesian phylogenetics using tracer 1.7. Systematic biology, 67(5): 901, 2018.
    OpenUrlCrossRefPubMed
  30. 30.↵
    Andrew Rambaut. Figtree v1. 4, 2012.
  31. 31.↵
    H Jeffreys. Theory of probability (3rd edt.) oxford university press. MR0187257, 432, 1961.
  32. 32.↵
    Adriano de Bernardi Schneider, Colby T Ford, Reilly Hostager, John Williams, Michael Cioce, Ümit V Çatalyürek, Joel O Wertheim, and Daniel Janies. Strainhub: A phylogenetic tool to construct pathogen transmission networks. Bioinformatics, 36(3):945–947, 2020.
    OpenUrl
  33. 33.↵
    Lam-Tung Nguyen, Heiko A Schmidt, Arndt von Haeseler, and Bui Quang Minh. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular biology and evolution, 32(1):268–274, 2014.
    OpenUrlCrossRefPubMed
  34. 34.↵
    Jonathan E Suk, Thomas Van Cangh, Julien Beaute, Cornelius Bartels, Svetla Tsolova, Anastasia Pharris, Massimo Ciotti, and Jan C Semenza. The interconnected and crossborder nature of risks posed by infectious diseases. Global health action, 7(1):25287, 2014.
    OpenUrl
  35. 35.↵
    Ali Ahmed, Furqan Khurshid Hashmi, and Gul Majid Khan. Hiv outbreaks in Pakistan. The Lancet HIV, 6(7):e418, 2019.
    OpenUrl
  36. 36.↵
    François Cholette, Jeffrey Joy, Yann Pelcat, Laura H Thompson, Richard Pilon, John Ho, Rupert Capina, Chris Archibald, James F Blanchard, Faran Emmanuel, et al. Hiv-1 phylodynamic analysis among people who inject drugs in pakistan correlates with trends in illicit opioid trade. PloS one, 15(8):e0237560, 2020.
    OpenUrl
  37. 37.↵
    Masoomeh Maarefvand, Besat Kassaie, Hamidreza Ghiasvand, Robabeh Abolfathi Gharibdoosti, and Jagdish Khubchandani. Sexual and drug use risk behaviors of internal long distance truck drivers in iran. Iranian journal of public health, 45(5):606, 2016.
    OpenUrl
  38. 38.↵
    Sohail Agha. Potential for hiv transmission among truck drivers in pakistan. Aids, 14(15): 2404–2406, 2000.
    OpenUrlCrossRefPubMedWeb of Science
  39. 39.↵
    National AIDS control program integrated biological and behavioral surveillance. round 2 report. islamabad, pakistan. National AIDS Control Program, 2007.
  40. 40.
    National AIDS control program integrated biological and behavioral surveillance. round 3 report. islamabad, pakistan. National AIDS Control Program, 2008.
  41. 41.↵
    Sajid Ahmad, Malik Naeem Akhtar, Nasir Khan, Sher Muhammad, Salman Shahid, Abdul Jabbar Shaikh, Michelle Munro, Simon Azariah, M.S. Mohiuddin, Faran Emmanuel, Tahira Reza, Chaker Riaz Baloch, Kiran Zulfiqar, Suleman Otho, Shahzad Arian, Muhammad Imran Zohaib, Muhammad Adnan, Alam Zaib Khan Barrech, Atif Irfan, Muhammad Younis, Rashida Parveen, Nadeem Ikran Amjad Ali Abbasi, Agha Babar Hussain, Faisal Nasir, Muhammad Asad, James Blanchard, Chris Archibald, Paul Sandstrom, and Alix Adrien. HIV second generation surveillance in Pakistan - national report round IV 2011. National AIDS Control Program, pages 1–106, 2011.
  42. 42.↵
    Sonal Singh, Marco Ambrosio, Iris Semini, Oussama Tawil, Muhammad Saleem, Muhammad Imran, and Chris Beyrer. Revitalizing the hiv response in pakistan: A systematic review and policy implications. International Journal of Drug Policy, 25(1):26–33, 2014.
    OpenUrl
Back to top
PreviousNext
Posted January 17, 2022.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
HIV Phylogenetics Reveals Overlapping Transmission Networks among Cities and Key Populations in Pakistan
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
HIV Phylogenetics Reveals Overlapping Transmission Networks among Cities and Key Populations in Pakistan
Adriano de Bernardi Schneider, François Cholette, Yann Pelcat, Aaron G. Lim, Peter T. Vickerman, Adiba Hassan, Laura H. Thompson, James F. Blanchard, Faran Emmanuel, Tahira Reza, Nosheen Dar, Nadeem Ikram, Richard Pilon, Chris Archibald, Jeffrey B. Joy, Paul Sandstrom, Joel O. Wertheim
medRxiv 2021.12.14.21267743; doi: https://doi.org/10.1101/2021.12.14.21267743
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
HIV Phylogenetics Reveals Overlapping Transmission Networks among Cities and Key Populations in Pakistan
Adriano de Bernardi Schneider, François Cholette, Yann Pelcat, Aaron G. Lim, Peter T. Vickerman, Adiba Hassan, Laura H. Thompson, James F. Blanchard, Faran Emmanuel, Tahira Reza, Nosheen Dar, Nadeem Ikram, Richard Pilon, Chris Archibald, Jeffrey B. Joy, Paul Sandstrom, Joel O. Wertheim
medRxiv 2021.12.14.21267743; doi: https://doi.org/10.1101/2021.12.14.21267743

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • HIV/AIDS
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)