Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

The genetic dissection of fetal haemoglobin persistence in sickle cell disease in Nigeria

View ORCID ProfileOyesola O. Ojewunmi, View ORCID ProfileTitilope A. Adeyemo, Ajoke I. Oyetunji, Bassey Inyang, Afolashade Akinrindoye, View ORCID ProfileBaraka S. Mkumbe, Kate Gardner, Helen Rooks, View ORCID ProfileJohn Brewin, View ORCID ProfileHamel Patel, Sang-Hyuck Lee, Raymond Chung, Sara Rashkin, View ORCID ProfileGuolian Kang, Reuben Chianumba, View ORCID ProfileRaphael Sangeda, View ORCID ProfileLiberata Mwita, Hezekiah Isa, View ORCID ProfileUche-Nnebe Agumadu, View ORCID ProfileRosemary Ekong, Jamilu A. Faruk, Bello Y. Jamoh, Niyi M. Adebiyi, Ismail A. Umar, Abdulaziz Hassan, Christopher Grace, View ORCID ProfileAnuj Goel, View ORCID ProfileBaba P.D. Inusa, View ORCID ProfileMario Falchi, Siana Nkya, View ORCID ProfileJulie Makani, View ORCID ProfileHafsat R. Ahmad, View ORCID ProfileObiageli Nnodu, View ORCID ProfileJohn Strouboulis, View ORCID ProfileStephan Menzel
doi: https://doi.org/10.1101/2023.05.16.23289851
Oyesola O. Ojewunmi
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Oyesola O. Ojewunmi
  • For correspondence: oyesola.1.ojewunmi{at}kcl.ac.uk
Titilope A. Adeyemo
2Department of Haematology & Blood Transfusion, Faculty of Clinical Sciences, College of Medicine, University of Lagos, Lagos, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Titilope A. Adeyemo
Ajoke I. Oyetunji
3Sickle Cell Foundation Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bassey Inyang
4Department of Medical Biochemistry, College of Health Sciences, University of Abuja
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Afolashade Akinrindoye
3Sickle Cell Foundation Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Baraka S. Mkumbe
5Sickle Cell Program, Department of Hematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
6Department of Biochemistry, Muhimbili University of Health and Allied Science, Dar es Salaam, Tanzania
7Tanzania Human Genetics Organisation
8Department of Artificial Intelligence and Innovative Medicine, Graduate School of Medicine, Tohoku University, Sendai, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Baraka S. Mkumbe
Kate Gardner
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
9Department of Haematology, Guy’s and St. Thomas’ NHS Foundation Trust, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Helen Rooks
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John Brewin
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
10Department of Haematological Medicine, King’s College Hospital, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for John Brewin
Hamel Patel
11NIHR BioResource Centre Maudsley, NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust (SLaM) & Institute of Psychiatry, Psychology and Neuroscience (IoPPN), King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Hamel Patel
Sang-Hyuck Lee
11NIHR BioResource Centre Maudsley, NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust (SLaM) & Institute of Psychiatry, Psychology and Neuroscience (IoPPN), King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raymond Chung
11NIHR BioResource Centre Maudsley, NIHR Maudsley Biomedical Research Centre (BRC) at South London and Maudsley NHS Foundation Trust (SLaM) & Institute of Psychiatry, Psychology and Neuroscience (IoPPN), King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sara Rashkin
12St. Jude Children’s Research Hospital, Memphis, Tennessee, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Guolian Kang
12St. Jude Children’s Research Hospital, Memphis, Tennessee, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Guolian Kang
Reuben Chianumba
13Department of Haematology & Centre of Excellence for Sickle Cell Disease Research and Training, University of Abuja, Abuja, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raphael Sangeda
5Sickle Cell Program, Department of Hematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
14Department of Pharmaceutical Microbiology, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Raphael Sangeda
Liberata Mwita
14Department of Pharmaceutical Microbiology, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Liberata Mwita
Hezekiah Isa
13Department of Haematology & Centre of Excellence for Sickle Cell Disease Research and Training, University of Abuja, Abuja, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Uche-Nnebe Agumadu
13Department of Haematology & Centre of Excellence for Sickle Cell Disease Research and Training, University of Abuja, Abuja, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Uche-Nnebe Agumadu
Rosemary Ekong
15Research Department of Genetics, Evolution and Environment, University College London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rosemary Ekong
Jamilu A. Faruk
16Department of Paediatrics, Ahmadu Bello University/Ahmadu Bello University Teaching Hospital, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bello Y. Jamoh
17Department of Internal Medicine, Ahmadu Bello University/Ahmadu Bello University Teaching Hospital, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Niyi M. Adebiyi
16Department of Paediatrics, Ahmadu Bello University/Ahmadu Bello University Teaching Hospital, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ismail A. Umar
18Department of Biochemistry, Faculty of Life Sciences, Ahmadu Bello University, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Abdulaziz Hassan
19Department of Haematology and Blood Transfusion Ahmadu Bello University, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christopher Grace
20Radcliffe Department of Medicine, Division of Cardiovascular Medicine, University of Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anuj Goel
20Radcliffe Department of Medicine, Division of Cardiovascular Medicine, University of Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anuj Goel
Baba P.D. Inusa
21Evelina London Children’s Hospital, Guy’s and St. Thomas’ NHS Foundation Trust, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Baba P.D. Inusa
Mario Falchi
22Department of Twin Research & Genetic Epidemiology King’s College London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mario Falchi
Siana Nkya
6Department of Biochemistry, Muhimbili University of Health and Allied Science, Dar es Salaam, Tanzania
7Tanzania Human Genetics Organisation
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Julie Makani
5Sickle Cell Program, Department of Hematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
23Department of Haematology and Blood Transfusion, Muhimbili University of Health and Allied Science, Dar es Salaam, Tanzania
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Julie Makani
Hafsat R. Ahmad
16Department of Paediatrics, Ahmadu Bello University/Ahmadu Bello University Teaching Hospital, Zaria, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Hafsat R. Ahmad
Obiageli Nnodu
13Department of Haematology & Centre of Excellence for Sickle Cell Disease Research and Training, University of Abuja, Abuja, Nigeria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Obiageli Nnodu
John Strouboulis
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for John Strouboulis
Stephan Menzel
1School of Cancer and Pharmaceutical Sciences, King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stephan Menzel
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background The clinical severity of sickle cell disease (SCD) is strongly influenced by the level of fetal haemoglobin (HbF) persistent in each patient. Three major HbF loci (BCL11A, HBS1L-MYB, and Xmn1-HBG2) have been reported, but a considerable hidden heritability remains.

Aim Building on the power of a large and genetically diverse patient pool present in Nigeria, we conducted a genome-wide association study for HbF levels in patients from three regions of the country with a diverse ethnic make-up.

Methods We analysed genome-wide trait association in 1006 Nigerian patients with SCD (HbSS/HbSβ0), followed by a replication and meta-analysis exercise in four independent SCD cohorts (3,582 patients). To dissect association signals at the major loci, we performed stepwise conditional analysis, haplotype association analysis and included public functional annotation data (fGCTA).

Results Association signals were detected for BCL11A (lead SNP rs6706648, β=- 0.39, P=4.96 x 10-34) and HBS1L-MYB (lead SNP rs61028892, β=0.73, P=1.18 x 10-9), whereas the variant allele for Xmn1-HBG2 was found to be very rare. Genetically dissecting the two major loci, we defined trait-boosting haplotypes containing suspected or so-far unidentified causal variants. At BCL11A, one such haplotype (trait increase P < 0.0001) contains the putative functional variant rs1427407-‘T’ and a second haplotype (P < 0.0001) is tagged by the rs7565301-‘A’ allele, with no obvious candidate causal variant. At HBS1L- MYB, one haplotype (trait increase P < 0.0001) contains the likely functional rs66650371 (Δ3-bp), and a second (P < 0.0001) is tagged by the ‘C’ allele of rs6102889. Together, variants at BCL11A and HBS1L-MYB SNPs explained 24.1% of the trait variance.

We detected three novel association signals: SLC28A3 on chromosome 9 (rs115555854: β=- 0.73, P=2.52 x 10-8), TICRR on chromosome 15 (rs140496989: β=-0.43, P=3.34 x 10-8), and PIEZO2 on chromosome 18 (rs58817161: β= −0.63, P= 8.04 x 10-8). These appeared to be restricted to the Nigerian patient cohort and were not confirmed in the replication cohorts.

Conclusions Studying a diverse cohort of Nigerian patients with sickle cell disease, we genetically dissected the known fetal-haemoglobin loci BCL11A and HBS1L-MYB and detected putative new trait-associated regions.

Introduction

Sickle cell disease (SCD) is an inherited blood disorder that affects approximately 300,000 newborns globally every year. Half of these are born in Nigeria alone, where SCD is a major public health problem and contributes significantly to childhood morbidity and mortality as fifty percent of children do not live beyond their 5th birthday [1].

Polymerisation of haemoglobin S (HbS) under hypoxic conditions is central to the pathophysiologic events in SCD, giving rise to many debilitating complications including vaso-occlusive crises, stroke, and leg ulcers.

A wide variability in severity and clinical presentation is driven partially by additional genetic factors, such as the genotypic state at the site of mutation (homozygous: Hb SS; compound heterozygous: Hb SC, HbS beta thalassaemia), and by genetic disease modifiers at the beta globin gene locus or elsewhere in the genome. To discover the underlying genes and genetic variants is highly desirable; however, the immense complexity of clinical presentations and pathological pathways involved has been a major obstacle. A major breakthrough has been the discovery that a substantial persistence of fetal haemoglobin (HbF, α2γ2) in certain patients can ameliorate the disease severity and improve survival [2]. This led to intense efforts to find ways to therapeutically induce HbF expression.

Some HbF persists naturally in adults, to variable extent, forming a quantitative trait with up to 89% heritability in the general population and 60-70% in SCD patients [3, 4]. Since it has been challenging to assemble sufficiently large and homogeneous SCD study cohorts, the first HbF loci, and thus the first SCD modifier genes, outside the beta globin gene cluster, HBS1 like Translational GTPase-MYB Proto-Oncogene (HBS1L-MYB) and B-cell lymphoma/leukaemia 11A (BCL11A) were discovered through studies in an extended family [5] and in large non-anaemic population samples [6–8]. They were subsequently replicated in SCD patient cohorts [9]. Together, the three major HbF loci, the β-globin gene cluster on chromosome 11 (Xmn1-HBG2 - a gamma globin promoter polymorphism), BCL11A on chromosome 2, and HBS1L-MYB on chromosome 6, account for about 50% of HbF variability in healthy persons and 20-30% in individuals with SCD in many populations [7, 9–13]. Therefore, half of the overall trait heritability is still unexplained, probably due to a lack of statistical power to detect large numbers of rare or weak-effect variants. To overcome this, to discover novel HbF or SCD severity loci and to better dissect the known ones, statistical power can be enhanced by assembling larger patient cohorts, by combining existing studies, or by ‘genetic loading’, i.e., recruiting subjects with extreme trait expression or marked family history. Nigeria, with its large number of patients, provides unique opportunities in this respect. The enhanced haplotype resolution seen in African populations is expected to support fine-mapping and the exceptional ethnic diversity of Nigeria will allow the evaluation of genetic variants that are rare elsewhere. Here, we present a first genome-wide study of fetal haemoglobin in patients with SCD in Nigeria, searching for new loci and performing fine-mapping to identify candidate functional variants and gain further insight into HbF regulation and heritability in persons with SCD.

Materials and methods

Patients

One thousand one hundred and fifty-eight individuals with sickle cell disease (HbSS/HbSβ0) recruited from four sites in Nigeria: Lagos University Teaching Hospital, Lagos; Sickle Cell Foundation Nigeria, Lagos; University of Abuja Teaching Hospital, Abuja; and Ahmadu Bello University Teaching Hospital, Zaria (Supplementary table 1) were genotyped. Patients were excluded from this study if they were on hydroxyurea, younger than 5 years old, had blood transfusion in the preceding 3 months, or had HbSC/HbSβ+. A uniform study proforma was completed for data collection across the project sites.

Five millilitres venous blood sample was collected into an EDTA bottle and processed immediately for complete blood count and haemoglobin profiling. The buffy coat was stored at −200C for subsequent DNA extraction. Fetal haemoglobin levels were determined using high performance liquid chromatography (Bio-Rad D-10; Bio-Rad Laboratories, Hercules, CA, USA). FlexiGene DNA extraction kit (Qiagen GmbH, Hilden, Germany) was used for DNA extraction from buffy coat following the manufacturer’s protocols.

Genotyping and quality control

Genotyping was performed using the Infinium™ H3Africa Consortium Array containing ∼2.3 million markers. One thousand one hundred and fifty-eight samples were genotyped and processed in Illumina’s Genome Studio software (version 2.05) for variant calling following the COPILOT raw Illumina genotyping quality control (QC) protocols detailed in [14]. Seventy-seven samples with genotyping call rate less than 90% were excluded during the Illumina Genome Studio QC and no sample was excluded further due to sample quality as the genotyping call rate was 99.99%. Individual-level QC was carried out to exclude samples with sex discrepancies compared with X-chromosome derived sex, heterozygosity outliers (heterozygosity ± 3 SD from mean), and genetically identical individuals (identity by descent, pi-hat ∼ 1.0) (Supplementary table 2), retaining 1006 individuals for imputation and downstream analysis. Per-marker QC excluded SNPs with call rate less than 97%, minor allele frequency < 1%, SNPs that deviated from Hardy-Weinberg equilibrium (P < 10-8), leaving 1,925,391 autosomal SNPs and X-chromosome. The overall genotyping rate was 99.994%. Quality control was carried out using PLINK v1.90 (www.cog-genomics.org/plink/1.9/).

Imputation

Filtered genotyped data were used for imputation in TOPMed Imputation Server for pre-phasing with Eagle version 2.4 and imputation using Minimac4 (https://imputation.biodatacatalyst.nhlbi.nih.gov/#!pages/home). During post-imputation quality control, SNPs with INFO score < 0.4 and non-biallelic SNPs were excluded, retaining 14,816,370 SNPs for data analysis.

Replication data set and meta-analysis

To evaluate our putative novel loci and to explore the additional potential gained from assembling a larger dataset through international collaboration, we involved summary-statistical datasets from UK SCD patients studied by us (Gardner et al. 2018) and from HbF GWAS studies in SCD patients performed by researchers in Tanzania [15] and the US [16]. Details of the patients’ recruitment, genotyping/sequencing technologies and platforms, data quality control, imputation, and association analyses for the Tanzanian and African-American data sets have been fully described [15, 16].

The UK SCD GWAS dataset is described separately [17], manuscript in preparation). It involves 626 HbSS/Sβ0 patients and 179 HbSC patients of West African and African Caribbean descent, with sickle mutation genotype included as a covariate during the association analysis. The Tanzanian Muhimbili Sickle Cell Cohort GWAS dataset includes 1,187 sickle cell disease patients and the St Jude Children’s Research Hospital (Memphis, TN, US) GWAS dataset includes 584 African-American SCD patients. The UKSCD and Tanzania SCD data were analysed using GCTA software with ln [HbF%] as the trait, with correction for age and sex while the African-American data were analysed with EMMAX software, using square root–transformed HbF% data as the trait [18]. Due to the different transformations of %HbF, sample size weighted meta-analysis (Stouffer’s method in METAL software) [19] with genomic control correction was performed taken in the SCD patients in the UK, Tanzania, and African-American cohorts. To identify new association signals, a second meta-analysis was performed to include the Nigerian cohort.

Fine-mapping

The functional genome wide complex trait analysis (fGCTA) pipeline runs GCTA-COJO on scaled summary statistics based on the presence or absence of relevant annotations. The annotations used in this analysis are: K562 regulatory elements derived using chromHMM algorithm from Roadmap Epigenome[20, 21], HUDEP-2 obtained from peak calls from Chip-Atlas[22, 23] and GATA1 ChIP-seq on human peripheral derived blood-erythroblast [24]. By use of Bayesian methodology, variants that overlap with the annotation have a decreased standard error and vice versa. This pipeline was implemented in C (software is available here: https://github.com/cgrace1978) and was converted into a prior probability which is combined with the p-value of the variant to generate a posterior probability. Embedded Image The posterior is then used to generate a new standard error (while keeping the original variant betas). These scaled summary statistics used GCTA-COJO module that performs a multi-SNP conditional and joint analysis of the summary data [25, 26]. The output of GCTA-COJO is independent variants in the summary statistics, which have a joint p-value below 1×10-5.

Association analysis

To analyse data from a study population that is very diverse in terms of genetic/ethnic background, geographic origin, and age (median age of 15, range 5 - 60 years), we performed linear-mixed effects regression as implemented by GCTA (Genome-wide Complex Trait Analysis software, version 1.91.7) [25] to control for population stratification by incorporating genetic relationship matrix. This was done using the leave one chromosome out approach with age, sex, and recruitment sites as covariates. Natural logarithm of HbF was used to perform association testing. Manhattan and Q-Q plots were generated in R v.4.2.2 software using the qqman package [27]. P-value < 5.0 x 10-8 was regarded as being genome-wide significant.

Conditional analysis and locus association plots

Conditional analyses were carried out with conditional and joint analysis in GCTA (GCTA-COJO) using summary statistics [26]. Regional association plots were generated in locus zoom [28].

Haplotype association analysis

Haplotypes were constructed with Haploview 4.2 [29] using phased genotypes from TOPMed Imputation Server. Haplotype effects were estimated using ANOVA in R with adjustment for sex, age, and recruitment sites. Tukey’s post-hoc test was applied for multiple comparisons of means.

Data handling and statistical analyses were performed on the high-performance computing platform CREATE at King’s College London [30].

Results

Genome-wide association analysis in the Nigerian cohort

We conducted a genome-wide association analysis for 475 male and 531 female patients with sickle cell disease (HbSS/HbSβ0) recruited in Southwest, North-Central, and North-West Nigeria. The median HbF of the patients was 6.69 % (range: 0.8 – 32%) (Supplementary Table 1) and the population structure shows the diversity of the patients recruited for this study (Supplementary Figure 1).

We detected two of the three major HbF modifier loci (Figure 1, Table 1), BCL11A (lead SNP rs6706648, β = −0.39, P = 4.96 x 10-34) and HBS1L-MYB (lead SNP rs61028892, β = 0.73, P = 1.18 x 10-9). The third, XmnI-HBG2 (rs7482144), was not included in the association analysis with our trait measure since this critical variant at the beta-globin gene cluster was rare (MAF = 0.007), consistent with a previous finding [31]. A list of genome-wide significant SNPs reported to be associated with HbF in previous studies and this current study is shown in Supplementary Table 3.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

(a) Manhattan plot of −log10(P-values). The red line indicates the threshold for genome-wide significant (P < 5 x 10-8), the blue line for suggestive association (P < 1 x 10-6). (b) Quantile-Quantile plot of −log10(P-values) for all SNPs tested. The red line indicates the null hypothesis that observed P-values equal expected P-values. Since there is no deviation from the null hypothesis line except at the tail end, population structure is implied to be effectively controlled. The low λGC value (lambda genomic control inflation factor) of 1.004 indicates that population stratification has been adjusted for effectively by the model and relationship matrix.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1. List of genome-wide loci in Nigeria cohort and the replication cohorts

In addition, we detected two novel association signals at a genome-wide significance level: SLC28A3 on chromosome 9 (rs115555854: β = −0.73, P = 2.52 x 10-8) and TICRR on chromosome 15 (rs140496989: β = −0.43, P = 3.34 x 10-8). A third locus, PIEZO2 on chromosome 18, showed association at 5% false discovery rate (p-value < 1.37 x 10-7) (rs58817161: β = −0.63, P = 8.04 x 10-8) (Table 1).

Genetic dissection of the BCL11A and HBS1L-MYB loci

Both loci overlap erythroid enhancer elements for adjacent genes BCL11A and MYB (Figure 2). They contain multiple independent association signals, indicating the likely presence of several functionally active DNA variants interfering with enhancer function. Our fine-mapping strategy was directed at defining the haplotypes carrying HbF-boosting alleles and at identifying putative functional variants residing therein.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2. Regional association plots for BCL11A (left panel) and HBS1L-MYB loci (right panel).

The dashed line shows the threshold for genome-wide significance (p < 5 x 10-8). The black rectangular blocks show key erythroid regulatory elements, detected as DNase I hypersensitive sites for BCL11A [29] and LDB1 binding sites for HBS1L-MYB [33].

First, we chose variants tagging HbF-relevant haplotypes aided by conditional association analysis. We then surveyed our phase-aligned genotype data to identify regional haplotypes defined by these variants, recorded their presence in patients, and measured their effects. To spotlight putative functional variants, we included functional sequence annotation in our fine-mapping using functional GCTA (fGCTA). To differentiate between variants in close linkage disequilibrium in African datasets that appeared to have a similar impact on HbF values, we performed additional association analysis in a European non-anaemic (TwinsUK) cohort.

BCL11A

Stepwise regression analysis initially yielded two partially independent signals, rs6706648 (our lead SNP at BCL11A) and rs1427407 (Table 2), a variant widely reported in other cohorts [15] that has been reported to disrupt critical transcription factor sites and is considered functional [32]. The fGCTA approach identified the same variants as in the standard conditional analysis (Table 2) for the erythroblast GATA1, K562, and HUDEP-2 annotations; rs1427407 overlapped with a GATA1 site (Supplementary Table 4, Supplementary Figure 2). When we conditioned on rs1427407 alone (Supplementary Table 5), the most significant SNP obtained was not rs6706648, but rs7565301, which is less frequent but with a similar allelic effect. To define haplotypes of interest, we also considered rs7606173, a widely reported SNP that we found in close LD with our lead SNP (D′ = 1, r2 = 0.91) (Figure 2). Accordingly, the power to distinguish between these two SNPs in our patient dataset is limited. Somewhat surprisingly, the LD between them is significantly less tight in European populations (EUR: D′ = 0.99, r2 = 0.61 versus YRI: D′ = 1.0, r2 = 0.91 in 1000 Genomes Phase III data implemented in LDlink [33]. Therefore, we decided to test both for HbF association (ln[%F-cells] trait) in the TwinsUK dataset and found rs7606173 to be substantially stronger (β = −0.296, p = 2.29 x 10-79 vs. β = -− 0.243, p = 1.27 x 10-47 of rs6706648) (Supplementary Table 6).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2. Conditional analysis of chromosome 2 in the Nigerian cohort

We used SNPs rs1427407, rs7565301, and rs7606173 to define haplotypes at BCL11A, of which we observed six, four of them with an allele frequency above 1% (Figure 3). We estimated the effect of the common BCL11A haplotypes on our trait (ln [HbF%]) compared to ‘basal’ haplotype 4, which contains low-HbF alleles for all three tag SNPs (‘GGC’). Haplotype 1 (‘TGG’), containing the high-HbF allele for rs1427407, had a significant trait- increasing effect (+0.537, p < 0.0001) as had haplotype 3 (‘GAG’), which contains the high- HbF allele for rs7565301 (+0.319, p < 0.0001). Haplotype 2 (‘GGG’), which contains the high-HbF allele only for rs7606173, had a smaller effect (+0.197), which was not significant, possibly due to its low frequency of 2.6%. Haplotypes 2 and 3 did not have significantly different effects. These HbF-boosting haplotype effects were validated in the UK SCD datasets (Supplementary Table 7).

Figure 3.
  • Download figure
  • Open in new tab
Figure 3. Major haplotypes at the BCL11A locus as defined by key markers rs1427407, rs7565301, and rs7606173.

The left side of the graph shows the relationship of these tagging SNPs with other variants populating these haplotypes, i.e., associated alleles (top) and LD relationships (bottom). The right side shows haplotype effects on HbF values.

LD plot: Each diamond shows the value of Dʹ and the LD indicates white (r2 = 0), shades of grey (0 < r2 < 1), and black (r2 = 1).

HBS1L-MYB

Conditional analysis detected two separate association signals, the lead SNP, rs61028892, and rs9399137 (Table 3). fGCTA analysis identified rs61028892 as the peak SNP following the conditional analysis (Table 2). The K562 annotation identified the same secondary signal (rs9399137) with the conditional analysis. However, erythroblast GATA1 and HUDEP-2 tracks identified a different secondary signal (rs66650371) (Supplementary Table 8, Supplementary Figure 3). All these SNPs overlapped with their respective annotations. The ‘basal’ haplotype 3 (‘GT’), (Figure 4), carrying low-HbF variants at both positions, was overwhelmingly frequent (94%), as it is typical for African populations [34]. We observed two other haplotypes: haplotype 1 (‘GC’, carrying the high-HbF allele at rs9399137) with an HbF increasing effect of +0.497 (p < 0.0001) and haplotype 2 (‘CT’, carrying the high-HbF allele at rs61028892) with an effect of +0.671 (p < 0.0001). Similar haplotype effects were confirmed in the UK SCD datasets as shown in the Supplementary Table 9.

Figure 4.
  • Download figure
  • Open in new tab
Figure 4. Major haplotypes at the HBS1L-MYB locus as defined by key markers rs61028892 and rs9399137.

The left side of the graph shows the relationship of these tagging SNPs with other variants populating these haplotypes, i.e., associated alleles (top) and LD relationships (bottom). The right side shows haplotype effects on HbF values.

LD plot: Each diamond shows the value of Dʹ and the LD indicates white (r2 = 0), shades of grey (0 < r2 < 1), and black (r2 = 1).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3. Conditional analysis of chromosome 6 in the Nigerian cohort

Trait variability explained

We applied a multiple linear regression modelling to estimate the genetic effects on HbF (ln [HbF%]) using eight representative SNPs on BCL11A (rs7606173, rs1427407, rs6706648, rs6738440, rs7565301) and HBS1L-MYB (rs9399137, rs66650371 and rs61028892) and found that rs7606173, rs1427407, rs6738440, rs9399137, and rs61028892 account for 24.1% HbF variance in SCD patients.

Search for functionally active genetic variants

We detected haplotypes with significant trait-increasing effects (Figure 3) at BCL11A (haplotypes 1 and 3) and two at HBS1L-MYB (haplotypes 1 and 2, Figure 4), each expected to harbour at least one functional DNA variant causing higher HbF levels in patients. At BCL11A, the tag SNP for haplotype 1 (rs1427407) itself is likely to be functional [32]. For haplotype 3, candidates are the tag SNP rs7565301, variants in close LD with it (rs6729815: D′ = 1.0, r2 = 1.0; rs7599488: D′ = 1.0, r2 = 1.0; rs6545817: D′ = 0.98, r2 = 0.96; rs10189857: D′ = 1.0, r2 = 0.98; rs6545816: D′ = 0.98, r2 = 0.73) or other, so far undetected variants. At HBS1L-MYB, a 3-bp deletion (rs66650371) in close LD (D′ = 1.0, r2 = 1.0) with the tag SNP for haplotype 1 (rs9399137) has been suggested to be functional [35, 36]. For haplotype 2, candidates are the tag SNP rs61028892, variants in close LD with it (rs148826327: D′ = 1.0, r2 = 1.0, rs116460276: r2 = D′ = 1.0, r2 = 1.0) or an undetected variant.

Evaluation of novel association signals

A list of all genome-wide and suggestive loci detected in the Nigerian cohort is presented in Supplementary Table 10. The regional association plots for the novel genome-wide significant loci are shown in the Supplementary Figures 4-6.

For any novel locus detected near (above or below) the acknowledged threshold for genome-wide significance there is a chance of it being either a true (caused by functional DNA sequence changes) or a spurious association signal. To distinguish these scenarios, we sought to confirm (‘replicate’) our three novel signals in three further sickle cell populations of African descent, combined through meta-analysis (Table 1). We detected no significant association but the longitudinal data of 326 SCD African-American patients from St Jude Children’s Research Hospital showed a replication for PIEZO2 (p = 0.00028) (Supplementary Table 11).

To search for additional association signals, we conducted further meta-analysis, involving the genome-wide association results of all 3582 patients (Nigeria, UK, Tanzania, and African-American). We identified one genome-wide significant locus and nine suggestive genome-wide significant loci, in addition to those detected in the Nigerian cohort alone (Figure 5, Supplementary Table 12).

Figure 5.
  • Download figure
  • Open in new tab
Figure 5.

(a) Manhattan plot showing meta-analysis of 3582 SCD patients. The y-axis indicates the −log(P-value) for each variant in the meta-analysis and x-axis shows chromosomal position. (b). Quantile-Quantile plot with the genomic control inflation (λ) of 1.002.

Discussion

We conducted a genome-wide quantitative-trait association study for fetal-haemoglobin levels in 1,006 Nigerian patients with sickle cell disease recruited from three regions of the country. At the genome-wide significance level, we detected two of the three known major loci for HbF persistence [7, 9, 37], BCL11A (chromosome 2) and HBS1L-MYB (HMIP) (chromosome 6), and three new loci, SLC28A3 on chromosome 9, TICRR on chromosome 15, and PIEZO2 on chromosome 18.

HbF persistence is a widely studied genetic trait in sickle cell patients due to its important disease-ameliorating and life-prolonging effect, the broad availability of a reliable assay system (HPLC), and its reduced complexity compared to other SCD-severity related traits. HbF reactivation is a key therapeutic goal, targeted not only by the disease-modifying drug, hydroxyurea, but is also the subject of promising gene therapy approaches presently undergoing clinical trials [38, 39]. BCL11A and HBS1L-MYB are major HbF loci present in all human populations, including in SCD patients [7, 9, 37], but the third, Xmn1-HBG2, residing within the beta-globin gene cluster, is detected only in patients where so-called ‘Senegal’ and ‘Arab-Indian’ haplotypes surrounding the sickle mutation are prevalent. While it was unsurprising to encounter BCL11A and HBS1L-MYB in the current study, the genetic dissection of these loci has provided us with candidates for the underlying functionally active sequence variants for biological experiments to study locus effects and mechanisms of action.

BCL11A (B-cell lymphoma/leukaemia 11A) is a known repressor of γ-globin expression. It participates in the haemoglobin switch from γ to β-globin, but genetic variants present in intron 2 of the gene are associated with moderate HbF persistence, detected as increased F-cell numbers and overall increased HbF levels. These variants occupy the erythroid enhancer region, characterised by three DNase I-hypersensitive sites (DHSs) situated at +55 kb, +58 kb, and +62 kb from the transcription start site (TSS)[32].

We detected two haplotypes (defined by rs1427407, rs7565301, and rs7606173) with strong HbF-increasing effect at this locus. The first (haplotype TGG) carries HbF-increasing alleles at rs1427407 (‘T’) and rs7606173 (‘G’). rs1427407 is located within GATA1 and TAL1 binding motifs at the +62 kb site, suggesting that the T allele may disrupt GATA1 binding. Thus, rs1427407-‘T’ is the likely causal variant driving the effect of this haplotype. A second haplotype with a significant HbF-increasing effect (haplotype 3) is defined by the presence of the rs7565301 variant (‘A’ allele) and rs7606173 (‘G’). Haplotype 2, carrying only rs7606173 (‘G’) but not rs7565301-‘A’, did not have a significant effect. Hence, we suggest that a causative sequence variant closely linked with rs7565301 is likely driving the significant effect of haplotype 3 and possibly also a weak (diluted) effect at haplotype 2. We presently cannot exclude that rs7606173 (‘G’) or a closely linked variant cause the HbF- raising effects of haplotypes 1, 2, and 3.

Association of the BCL11A locus with HbF has been previously reported in diverse populations with different SNPs on BCL11A showing the strongest association with HbF levels in SCD patients (Supplementary Table 3) [7, 8, 15, 40]. In our dataset, these reported SNPs map to the same effect haplotypes we have identified. Thus, we suggest that the same underlying causative variants are responsible in all non-anaemic and patient populations studied so far.

The Tanzanian SCD cohort and the African-American cohort of Cooperative Study of SCD (CSSCD) showed rs1427407 as the strongest association signal with HbF, with rs6545816 and rs7606173 detected as secondary signals [15, 32]. The difference with our findings underscores the influence of population-specific allele frequencies and LD patterns on the identity of lead (sentinel) SNPs reported in specific studies.

At HBS1L-MYB, we detected two haplotypes (defined by rs61028892 and rs9399137) with significant HbF-boosting effect. Haplotype 1, tagged by rs9399137-‘C’, causes what has previously been described as the HMIP-2A sub-locus [34]. It harbours a 3-bp deletion (rs66650371), which was proposed to be a functional DNA change [35] and was found to disrupt a TAL-1 binding motif within a critical enhancer element for MYB, the gene encoding an essential transcriptional regulator of erythropoiesis [36]. Decreased promoter interaction (looping) of an allele carrying the deletion was observed in K562 cells [36], but it is not clear whether this was due to the deletion itself or closely linked variants occupying the same haplotype. HbF-increasing haplotype 2 is defined by the presence of rs61028892-‘C’, which also carries two closely linked alleles rs148826327-‘A’ (D′ = 1.0, r2 = 1.0), and rs116460276- ‘G’ (D′ = 1.0, r2 = 1.0). It is also linked to a series of variants that have previously served to define the HMIP-2B sublocus [34, 41]. rs61028892 is situated at the same MYB enhancer element (‘−84 LDB1 site’) as the 3-bp deletion rs66650371 and has previously been proposed as an additional casual variant in the HMIP region [42]. This suggests that the HMIP-2 locus might not contain sub-loci, but three functional alleles affecting enhancer element -84, two increasing HbF (rs66650371-del and rs61028892-‘C’) and one representing the baseline, low- HbF state (haplotype ‘GT’). The more precise definition of the HMIP-2 locus we have obtained with these data highlights the significantly lower high-HbF allele frequency at this site in African populations (5.6% total haplotype frequency) compared to individuals of European descent (>20%) and or to patients with substantial European admixture [7, 43].

Consistent with previous studies, HbF variability explained by BCL11A and HMIP in our study is 24.1%, which is lower than in non-anaemic populations (∼ 45%) [7, 12, 13]. Sex and age (up to 3%) are only minor contributors to the overall trait variability. Thus, after major HbF loci are taken into consideration, a substantial number of loci with smaller effects or loci that are sickle-cell specific remain to be discovered.

Lastly, the three novel HbF loci detected, SLC28A3 on chromosome 9 and TICRR on chromosome 15 with genome-wide significance, and PIEZO2 on chromosome 18 close to the acknowledged threshold were not confirmed in the British, Tanzanian, and African-American patients. Thus, whether these new loci harbour functional, HbF-affecting variants or represent spurious association signals will have to be resolved through additional genetic or biological investigation.

Conclusions

In the current study, we have refined the definition of two major fetal-haemoglobin persistence loci, BCL11A and HS1L-MYB, which will aid future functional studies that seek to identify new ways to induce HbF expression in patients with sickle cell disease and beta thalassaemia. We have detected new putative HbF loci, which are presently unconfirmed.

Finding biological pathways influencing SCD severity remains a major goal for genetic studies with patient populations. The statistical power to detect loci with small effects lies in the assembly of large patient cohorts and in combining world-wide studies.

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

Funding

This work was funded in principle by MRC grant MR/T013389/1 (to SM and JS), with additional funding from EPSRC (Newton/GCRF) grant EP/X527920/1, from LIBRA and from the King’s College Hospital Charity. Twins UK is supported by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z with additional support from the NIHR Oxford BRC. AG and CG were supported by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z with additional support from the NIHR Oxford BRC. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Author’s contributions

Conception and design: SM, JS, OO, TA, ON. Patient recruitment: TA, AIO, BI, KG, JB, HI, UA, RC, JAF, BYJ, NMA, AH, BI, JM, HA, ON. Sample preparation: OO, AA, BI, KG, HR, RE, SN; Data curation: OO, RC, HP, S-HL, SR, GK, KG, RS, LM; Data Analysis and interpretation: OO, KG, BSM, SR, GK, CG, AG; Supervision: SM, TA, HA, SN, ON, IAU; Manuscript writing: OO and SM with input from all authors. All authors read and approved the final manuscript. Coordinated the project: SM and JS.

Availability of data and materials

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Ethics approval and consent to participate

Written informed consent was obtained from the patients and when children were recruited, their parents/guardians signed the consent form while children above the age of seven gave assent. This study was approved by the Health Research Ethics Committee of Lagos University Teaching Hospital (ADM/DCST/HREC/1686), University of Abuja Teaching Hospital (FCT/UATH/HREC/PR/144), and Ahmadu Bello University ethics committee (ABUCUHSR/2021/031). This study was conducted in accordance with the Helsinki Declaration (1975, as revised in 2008).

Consent for publication

Not Applicable

Competing interests

The authors declare that they have no competing interests.

Acknowledgements

The authors thank the Sickle Cell Foundation Nigeria, Lagos, Nigeria, and Dr Annette Akinsete for providing an enabling environment and facility for patients’ recruitment. We are grateful to all the patients for their participation in this study. We acknowledge Mr. Musa Mohammed who coordinated the laboratory activities including blood sampling and haematological investigations at Ahmadu Bello University Teaching Hospital, Zaria, Nigeria. The authors are also grateful to Prof Lawal W. Umar, the Coordinator of Institute of Child Health, (ICH) Banzazzau, Zaria, Nigeria; and the Resident doctors, house officers, and all staff at the Paediatric Haematology/Oncology Unit, Ahmadu Bello University Teaching Hospital, Zaria, Nigeria.

List of Abbreviations:

BCL11A
B-cell lymphoma/leukemia 11A
CSSCD
African-American cohort of Cooperative Study of SCD
DHS
DNase I-hypersensitive site
fGCTA
functional GCTA
GCTA
Genome-wide complex trait analysis
HbF
Fetal haemoglobin
HbS
haemoglobin S
HBS1L-MYB
HBS1 like Translational GTPase-MYB Proto-Oncogene
HbSS
Homozygosity for HbS
HbSβ
Compound heterozygote state for HbS and beta thalassaemia (HbSβ0/ HbSβ+)
HMIP
HBS1L-MYB intergenic polymorphism
LD
Linkage disequilibrium
MAF
Minor Allele Frequency
SCD
sickle cell disease
TOPMed
Trans-Omics for Precision Medicine
TSS
transcription start site.

References

  1. 1.↵
    Nnodu OE, Oron AP, Sopekan A, Akaba GO, Piel FB, Chao DL: Child mortality from sickle cell disease in Nigeria: a model-estimated, population-level analysis of data from the 2018 Demographic and Health Survey. Lancet Haematol 2021, 8(10):e723–e731.
    OpenUrl
  2. 2.↵
    Platt OS, Brambilla DJ, Rosse WF, Milner PF, Castro O, Steinberg MH, Klug PP: Mortality in sickle cell disease. Life expectancy and risk factors for early death. N Engl J Med 1994, 330(23):1639–1644.
    OpenUrlCrossRefPubMedWeb of Science
  3. 3.↵
    Garner C, Tatu T, Reittie JE, Littlewood T, Darley J, Cervino S, Farrall M, Kelly P, Spector TD, Thein SL: Genetic influences on F cells and other hematologic variables: a twin heritability study. Blood 2000, 95(1):342–346.
    OpenUrlAbstract/FREE Full Text
  4. 4.↵
    Steinberg MH, Sebastiani P: Genetic modifiers of sickle cell disease. Am J Hematol 2012, 87(8):795–803.
    OpenUrlCrossRefPubMed
  5. 5.↵
    Craig JE, Rochette J, Sampietro M, Wilkie AO, Barnetson R, Hatton CS, Demenais F, Thein SL: Genetic heterogeneity in heterocellular hereditary persistence of fetal hemoglobin. Blood 1997, 90(1):428–434.
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    Thein SL, Menzel S, Peng X, Best S, Jiang J, Close J, Silver N, Gerovasilli A, Ping C, Yamaguchi M et al: Intergenic variants of HBS1L-MYB are responsible for a major quantitative trait locus on chromosome 6q23 influencing fetal hemoglobin levels in adults. Proc Natl Acad Sci U S A 2007, 104(27):11346–11351.
    OpenUrlAbstract/FREE Full Text
  7. 7.↵
    Menzel S, Garner C, Gut I, Matsuda F, Yamaguchi M, Heath S, Foglio M, Zelenika D, Boland A, Rooks H et al: A QTL influencing F cell production maps to a gene encoding a zinc-finger protein on chromosome 2p15. Nat Genet 2007, 39(10):1197–1199.
    OpenUrlCrossRefPubMedWeb of Science
  8. 8.↵
    Uda M, Galanello R, Sanna S, Lettre G, Sankaran VG, Chen W, Usala G, Busonero F, Maschio A, Albai G et al: Genome-wide association study shows BCL11A associated with persistent fetal hemoglobin and amelioration of the phenotype of beta-thalassemia. Proc Natl Acad Sci U S A 2008, 105(5):1620–1625.
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    Lettre G, Sankaran VG, Bezerra MA, Araujo AS, Uda M, Sanna S, Cao A, Schlessinger D, Costa FF, Hirschhorn JN et al: DNA polymorphisms at the BCL11A, HBS1L-MYB, and beta-globin loci associate with fetal hemoglobin levels and pain crises in sickle cell disease. Proc Natl Acad Sci U S A 2008, 105(33):11869–11874.
    OpenUrlAbstract/FREE Full Text
  10. 10.
    Labie D, Dunda-Belkhodja O, Rouabhi F, Pagnier J, Ragusa A, Nagel RL: The -158 site 5’ to the G gamma gene and G gamma expression. Blood 1985, 66(6):1463–1465.
    OpenUrlAbstract/FREE Full Text
  11. 11.
    Gilman JG, Huisman TH: DNA sequence variation associated with elevated fetal G gamma globin production. Blood 1985, 66(4):783–787.
    OpenUrlAbstract/FREE Full Text
  12. 12.↵
    Galarneau G, Palmer CD, Sankaran VG, Orkin SH, Hirschhorn JN, Lettre G: Fine-mapping at three loci known to affect fetal hemoglobin levels explains additional genetic variation. Nat Genet 2010, 42(12):1049–1051.
    OpenUrlCrossRefPubMedWeb of Science
  13. 13.↵
    Gardner K, Fulford T, Silver N, Rooks H, Angelis N, Allman M, Nkya S, Makani J, Howard J, Kesse-Adu R et al: g(HbF): a genetic model of fetal hemoglobin in sickle cell disease. Blood Adv 2018, 2(3):235–239.
    OpenUrlAbstract/FREE Full Text
  14. 14.↵
    Patel H, Lee SH, Breen G, Menzel S, Ojewunmi O, Dobson RJB: The COPILOT Raw Illumina Genotyping QC Protocol. Curr Protoc 2022, 2(4):e373.
    OpenUrl
  15. 15.↵
    Mtatiro SN, Singh T, Rooks H, Mgaya J, Mariki H, Soka D, Mmbando B, Msaki E, Kolder I, Thein SL et al: Genome wide association study of fetal hemoglobin in sickle cell anemia in Tanzania. PLoS One 2014, 9(11):e111464.
    OpenUrlCrossRefPubMed
  16. 16.↵
    Rampersaud E, Kang G, Palmer LE, Rashkin SR, Wang S, Bi W, Alberts NM, Anghelescu D, Barton M, Birch K et al: A polygenic score for acute vaso-occlusive pain in pediatric sickle cell disease. Blood Adv 2021, 5(14):2839–2851.
    OpenUrl
  17. 17.↵
    Gardner K: Manuscript in preparation 2023.
  18. 18.↵
    Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, Sabatti C, Eskin E: Variance component model to account for sample structure in genome-wide association studies. Nat Genet 2010, 42(4):348–354.
    OpenUrlCrossRefPubMedWeb of Science
  19. 19.↵
    Willer CJ, Li Y, Abecasis GR: METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 2010, 26(17):2190–2191.
    OpenUrlCrossRefPubMedWeb of Science
  20. 20.↵
    Ernst J, Kellis M: ChromHMM: automating chromatin-state discovery and characterization. Nat Methods 2012, 9(3):215–216.
    OpenUrlCrossRefPubMedWeb of Science
  21. 21.↵
    Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ et al: Integrative analysis of 111 reference human epigenomes. Nature 2015, 518(7539):317–330.
    OpenUrlCrossRefPubMed
  22. 22.↵
    Zou Z, Ohta T, Miura F, Oki S: ChIP-Atlas 2021 update: a data-mining suite for exploring epigenomic landscapes by fully integrating ChIP-seq, ATAC-seq and Bisulfite-seq data. Nucleic Acids Research 2022, 50(W1):W175–W182.
    OpenUrlCrossRef
  23. 23.↵
    Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y, Kawaji H, Nakaki R, Sese J, Meno C: ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO reports 2018, 19(12):e46255.
    OpenUrlAbstract/FREE Full Text
  24. 24.↵
    Consortium EP: An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489(7414):57–74.
    OpenUrlCrossRefPubMedWeb of Science
  25. 25.↵
    Yang J, Lee SH, Goddard ME, Visscher PM: GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 2011, 88(1):76–82.
    OpenUrlCrossRefPubMed
  26. 26.↵
    Yang J, Ferreira T, Morris AP, Medland SE, Genetic Investigation of ATC, Replication DIG, Meta-analysis C, Madden PA, Heath AC, Martin NG et al: Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet 2012, 44(4):369–375, S361-363.
    OpenUrlCrossRefPubMed
  27. 27.↵
    Turner: qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. Journal of Open Source Software 2018, 3(25):731.
    OpenUrl
  28. 28.↵
    Boughton AP, Welch RP, Flickinger M, VandeHaar P, Taliun D, Abecasis GR, Boehnke M: LocusZoom.js: interactive and embeddable visualization of genetic association study results. Bioinformatics 2021, 37(18):3017–3018.
    OpenUrlCrossRefPubMed
  29. 29.↵
    Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005, 21(2):263–265.
    OpenUrlCrossRefPubMedWeb of Science
  30. 30.↵
    King’s College London: King’s Computational Research, Engineering and Technology Environment (CREATE). 2023.
  31. 31.↵
    Adeyemo TA, Ojewunmi OO, Oyetunji IA, Rooks H, Rees DC, Akinsulie AO, Akanmu AS, Thein SL, Menzel S: A survey of genetic fetal-haemoglobin modifiers in Nigerian patients with sickle cell anaemia. Plos One 2018, 13(6).
  32. 32.↵
    Bauer DE, Kamran SC, Lessard S, Xu J, Fujiwara Y, Lin C, Shao Z, Canver MC, Smith EC, Pinello L et al: An erythroid enhancer of BCL11A subject to genetic variation determines fetal hemoglobin level. Science 2013, 342(6155):253–257.
    OpenUrlAbstract/FREE Full Text
  33. 33.↵
    Machiela MJ, Chanock SJ: LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 2015, 31(21):3555–3557.
    OpenUrlCrossRefPubMed
  34. 34.↵
    Menzel S, Rooks H, Zelenika D, Mtatiro SN, Gnanakulasekaran A, Drasar E, Cox S, Liu L, Masood M, Silver N et al: Global genetic architecture of an erythroid quantitative trait locus, HMIP-2. Ann Hum Genet 2014, 78(6):434–451.
    OpenUrlCrossRefPubMed
  35. 35.↵
    Farrell JJ, Sherva RM, Chen ZY, Luo HY, Chu BF, Ha SY, Li CK, Lee AC, Li RC, Li CK et al: A 3-bp deletion in the HBS1L-MYB intergenic region on chromosome 6q23 is associated with HbF expression. Blood 2011, 117(18):4935–4945.
    OpenUrlAbstract/FREE Full Text
  36. 36.↵
    Stadhouders R, Aktuna S, Thongjuea S, Aghajanirefah A, Pourfarzad F, van Ijcken W, Lenhard B, Rooks H, Best S, Menzel S et al: HBS1L-MYB intergenic variants modulate fetal hemoglobin via long-range MYB enhancers. J Clin Invest 2014, 124(4):1699–1710.
    OpenUrlCrossRefPubMed
  37. 37.↵
    Wonkam A, Bitoungui VJN, Vorster AA, Ramesar R, Cooper RS, Tayo B, Lettre G, Ngogang J: Association of Variants at BCL11A and HBS1L-MYB with Hemoglobin F and Hospitalization Rates among Sickle Cell Patients in Cameroon. Plos One 2014, 9(3).
  38. 38.↵
    Demirci S, Leonard A, Essawi K, Tisdale JF: CRISPR-Cas9 to induce fetal hemoglobin for the treatment of sickle cell disease. Mol Ther Methods Clin Dev 2021, 23:276–285.
    OpenUrl
  39. 39.↵
    Frangoul H, Altshuler D, Cappellini MD, Chen YS, Domm J, Eustace BK, Foell J, de la Fuente J, Grupp S, Handgretinger R et al: CRISPR-Cas9 Gene Editing for Sickle Cell Disease and beta-Thalassemia. N Engl J Med 2021, 384(3):252–260.
    OpenUrlPubMed
  40. 40.↵
    Bhatnagar P, Purvis S, Barron-Casella E, DeBaun MR, Casella JF, Arking DE, Keefer JR: Genome-wide association study identifies genetic variants influencing F-cell levels in sickle-cell patients. J Hum Genet 2011, 56(4):316–323.
    OpenUrlCrossRefPubMed
  41. 41.↵
    Mtatiro SN, Mgaya J, Singh T, Mariki H, Rooks H, Soka D, Mmbando B, Thein SL, Barrett JC, Makani J et al: Genetic association of fetal-hemoglobin levels in individuals with sickle cell disease in Tanzania maps to conserved regulatory elements within the MYB core enhancer. BMC Med Genet 2015, 16:4.
  42. 42.↵
    Canver MC, Lessard S, Pinello L, Wu Y, Ilboudo Y, Stern EN, Needleman AJ, Galacteros F, Brugnara C, Kutlar A et al: Variant-aware saturating mutagenesis using multiple Cas9 nucleases identifies regulatory elements at trait-associated loci. Nat Genet 2017, 49(4):625–634.
    OpenUrlCrossRefPubMed
  43. 43.↵
    Leonardo FC, Brugnerotto AF, Domingos IF, Fertrin KY, de Albuquerque DM, Bezerra MA, Araujo AS, Saad ST, Costa FF, Menzel S et al: Reduced rate of sickle-related complications in Brazilian patients carrying HbF-promoting alleles at the BCL11A and HMIP-2 loci. Br J Haematol 2016, 173(3):456–460.
    OpenUrl
Back to top
PreviousNext
Posted May 18, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The genetic dissection of fetal haemoglobin persistence in sickle cell disease in Nigeria
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The genetic dissection of fetal haemoglobin persistence in sickle cell disease in Nigeria
Oyesola O. Ojewunmi, Titilope A. Adeyemo, Ajoke I. Oyetunji, Bassey Inyang, Afolashade Akinrindoye, Baraka S. Mkumbe, Kate Gardner, Helen Rooks, John Brewin, Hamel Patel, Sang-Hyuck Lee, Raymond Chung, Sara Rashkin, Guolian Kang, Reuben Chianumba, Raphael Sangeda, Liberata Mwita, Hezekiah Isa, Uche-Nnebe Agumadu, Rosemary Ekong, Jamilu A. Faruk, Bello Y. Jamoh, Niyi M. Adebiyi, Ismail A. Umar, Abdulaziz Hassan, Christopher Grace, Anuj Goel, Baba P.D. Inusa, Mario Falchi, Siana Nkya, Julie Makani, Hafsat R. Ahmad, Obiageli Nnodu, John Strouboulis, Stephan Menzel
medRxiv 2023.05.16.23289851; doi: https://doi.org/10.1101/2023.05.16.23289851
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
The genetic dissection of fetal haemoglobin persistence in sickle cell disease in Nigeria
Oyesola O. Ojewunmi, Titilope A. Adeyemo, Ajoke I. Oyetunji, Bassey Inyang, Afolashade Akinrindoye, Baraka S. Mkumbe, Kate Gardner, Helen Rooks, John Brewin, Hamel Patel, Sang-Hyuck Lee, Raymond Chung, Sara Rashkin, Guolian Kang, Reuben Chianumba, Raphael Sangeda, Liberata Mwita, Hezekiah Isa, Uche-Nnebe Agumadu, Rosemary Ekong, Jamilu A. Faruk, Bello Y. Jamoh, Niyi M. Adebiyi, Ismail A. Umar, Abdulaziz Hassan, Christopher Grace, Anuj Goel, Baba P.D. Inusa, Mario Falchi, Siana Nkya, Julie Makani, Hafsat R. Ahmad, Obiageli Nnodu, John Strouboulis, Stephan Menzel
medRxiv 2023.05.16.23289851; doi: https://doi.org/10.1101/2023.05.16.23289851

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)