Regulatory Variants on the Leukocyte Immunoglobulin-Like Receptor Gene Cluster are Associated with Crohn’s Disease and Interact with Regulatory Variants for TAP2 =================================================================================================================================================================== * Kwangwoo Kim * Shin Ju Oh * Junho Lee * Ayeong Kwon * Chae-Yeon Yu * Sangsoo Kim * Chang Hwan Choi * Sang-Bum Kang * Tae Oh Kim * Dong Il Park * Chang Kyun Lee ## Abstract **Background and Aims** Crohn’s disease (CD) has a complex polygenic etiology with high heritability. We keep putting an effort to identify novel variants associated with susceptibility to CD through a genome-wide association study (GWAS) in large Korean populations. **Methods** Genome-wide variant data from 902 Korean patients with CD and 72,179 controls were used to assess the genetic associations in a meta-analysis with previous Korean GWAS results from 1,621 patients with CD and 4,419 controls. Epistatic interactions between CD-risk variants of interest were tested using a multivariate logistic regression model with an interaction term. **Results** We identified two novel genetic associations with the risk of CD near *ZBTB38* and within the leukocyte immunoglobulin-like receptor (LILR) gene cluster (*P*<5×10−8), with highly consistent effect sizes between the two independent Korean cohorts. CD-risk variants in the LILR locus are known quantitative trait loci (QTL) for multiple LILR genes, of which *LILRB2* directly interacts with various ligands including MHC class I molecules. The LILR lead variant exhibited a significant epistatic interaction with CD-associated regulatory variants for *TAP2* involved in the antigen presentation of MHC class I molecules (*P*=4.11×10−4), showing higher CD-risk effects of the *TAP2* variant in individuals carrying more risk alleles of the LILR lead variant (OR=0.941, *P*=0.686 in non-carriers; OR=1.45, *P*=2.51×10−4 in single-copy carriers; OR=2.38, *P*=2.76×10−6 in two-copy carriers). **Conclusions** This study demonstrated that genetic variants at two novel susceptibility loci and the epistatic interaction between variants in LILR and *TAP2* loci confer risk of CD. Key Words * Crohn’s disease * GWAS * genetic variants ## 1. Introduction Crohn’s disease (CD) is a complex inflammatory bowel disease with strong genetic components. The risk of developing CD is significantly higher in individuals with a family history of the disease compared to the general population.1 Twin studies estimated the heritability of CD to be 75%, showing that the disease concordance rate for CD is much higher in monozygotic twins than in dizygotic twins.2 Genome-wide association studies (GWASs) have been highly productive in identifying variant-level genetic components associated with susceptibility to CD, including *IL23R, NOD2*, and others.3 To date, the number of reported CD loci is over 240 that have opened up new avenues to understand the underlying mechanisms of disease development and to identify potentially effective drug targets for CD. However, the highly polygenic genetic architecture of CD resulted in a large number of CD-risk variants remaining hidden in the human genomes, leading to considerably large missing heritability. GWAS variants, including reported CD loci, explain a small fraction of the heritability of CD,2 suggesting that a continuous effort to identify CD-risk variants is necessary to better understand the dysregulation of the gastrointestinal immune system in patients with CD. Indeed, a Bayesian simulation approach that inferred the genetic architecture of common complex disorders suggested the presence of hundreds of common causal variants with weak to modest effect sizes as well as multiple rare variants with strong effects.4 In addition, the non-additive epistatic effects among independently inherited genetic variants are largely unknown; however, these are expected to substantially explain a proportion of the missing heritability.5 Moreover, GWASs in non-European individuals have been relatively infrequent.6 Considering the ethnicity-associated genetic heterogeneity and the limited ethnic transferability of polygenic risk scores, it is crucial to conduct more genetic studies in relatively understudied populations, including the Korean population. In recent decades, the incidence and prevalence of inflammatory bowel disease has been steadily increasing in Korea.7 In this study, we generated genome-wide variant data from a large Korean population and performed a meta-analysis with existing Korean GWAS association summary statistics to identify robust novel association signals and gene-gene interactions to expand the knowledge of the genetic etiology of CD. ## 2. Materials and Methods ### 2.1. GWAS subjects Our case-control GWAS newly analyzed the individual-level genotype data of 902 patients with CD and 72,179 healthy controls, with a subsequent meta-analysis with previous GWAS association statistics8 estimated from 1,621 patients with CD and 4,419 controls. Among the newly analyzed subjects, patients with CD were recruited from Kyung Hee University Hospital (KHUH; n=253 from a subset of a multi-omics study cohort with a [ClinicalTrials.gov](http://ClinicalTrials.gov) identifier [NCT03589183](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT03589183&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom)) and 15 tertiary hospitals (n=649 from a retrospective multicenter CD cohort from the IMPACT study9). All patients met the diagnostic criteria of CD based on clinical, serological, and radiological results as well as endoscopy and histology results.10 Patients with indeterminate colitis were excluded. All study participants were aged ≥16 years at study enrollment and provided peripheral blood samples for genomic DNA extraction. Controls were recruited through the Korean Genome Epidemiology Study (KoGES), providing genomic DNAs to the National Biobank of Korea. The study was conducted in accordance with the ethical principles of the Declaration of Helsinki and was approved by the Institutional Review Board of the participating hospitals (KHUH 2018-03-006-018, KHUH 2021-04-082, KBSMC 2016-07-029). All participants provided written informed consent for participation in the study. ### 2.2. Genetic variant data Genome-wide variant data were generated using a custom Affymetrix array named KoreanChip11 developed by the Korea National Institute of Health. The array comprised over 800,000 variants yielding high coverage of genomic variants with moderate to high imputation quality, particularly for Korean-specific, functional, and low-frequency variants.11 After a general quality control procedure for genotyping data (**Supplementary Table 1**), whole-genome imputation for 902 cases and 72,179 controls was performed based on the reference panel of the 1000 Genomes Project (1KGP) phase 3 through the Michigan Imputation Server12 using Eagle2 and Minimac4. A total of 8,950,452 unique variants with minor allele frequencies (MAFs) ≥ 0.5%, Hardy-Weinberg equilibrium (HWE; *P*-values for HWE ≥ 10−6 in controls and 10−10 in cases), and imputation accuracy values R2 ≥ 0.3 were included in all subsequent statistical analyses. HLA imputation was performed separately by the Michigan Imputation Server using Eagle2 and Minimac4 based on the multi-ethnic HLA reference panel13. ### 2.3. Statistical tests #### 2.3.1. Single-marker association A single-marker association for CD in our study population was assessed using multivariate logistic regression adjusted for the top five principal components and sex to calculate an odds ratio (OR) and its 95% confidence interval (CI) for each variant. We defined the boundaries of each susceptibility locus by merging the positions of the CD-associated variants in the 250 kb flanking regions of each variant so that the susceptibility loci were at least 500 kb apart from each other. In addition, we visually inspected the regional association plots for each locus to confirm the independence of the CD associations. #### 2.3.2. Meta-analysis with previous Korean GWAS results A previous genome-wide association summary statistics dataset for Korean CD was retrieved from Jung *et al*.8 This dataset included 5,885,366 variants from an independent Korean cohort consisting of 1,621 CD cases and 4,419 controls, after excluding 307 variants with mismatched allele codes from dbSNP v150. A GWAS meta-analysis of our newly generated GWAS results with the previous GWAS results was performed by the METAL program14 using an inverse-variance fixed-effects model after genomic control for each individual dataset, bringing the final total sample size to 79,121 (n = 2,523 CD cases and 76,598 controls). The extended major histocompatibility complex (MHC) region in a high linkage disequilibrium (LD) was assumed to span from 25 to 36 Mb on chromosome 6. To identify a credible set of the most likely causal variants associated with CD in each susceptibility locus, a Bayesian fine-mapping approach using FINEMAP v1.4 was applied to estimate the posterior probability (PP) for each variant to be causal based on the meta-analysis results and the in-sample LD from the newly analyzed GWAS samples.15 A 95% credible set has the minimum number of the best causal candidate variants whose cumulative sum of PPs is greater than 0.95. #### 2.3.3. Estimation of heritability from GWAS variants A heritability of CD was calculated from all genome-wide variants listed in our meta-analysis by using LD score regression (LDSC)16, based on the genetic architecture of the 1KGP East Asians. The latest prevalence of CD was 0.00030 in the Korean population7; therefore, this value was used to compute heritability values on the liability scale. #### 2.3.4. Epistatic interaction test Epistatic interaction tests were performed to examine whether a joint effect between two genetic variants of interest deviated significantly from a multiplicative joint effect. For this, a multivariate logistic regression model with an interaction term between the two variants was employed using the newly generated GWAS dataset, with adjustments for the top five principal components and sex. Specifically, we evaluated the epistatic interaction between a novel CD-risk lead variant of interest and potentially functional CD-risk variants in genes related to the MHC class I-mediated antigen presentation pathway. Pathway gene members (n = 28) were retrieved from the gene ontology term ‘antigen processing and presentation of peptide antigens via MHC class I’ (GO: 0002474) using the msigdbr package17. Functional variants included protein-altering variants and expression quantitative trait loci (eQTL). To collect the most relevant eQTL variants, we extracted only the lead eQTL variants for each gene in the pathway based on the previous eQTL results18 from six different immune cell types of Japanese individuals that share similar genetic characteristics with Koreans in terms of allelic frequency and LD. ## 3. Results ### 3.1. Identification of three novel susceptibility loci for CD We performed a meta-analysis of the genome-wide association results of the newly generated CD GWAS data and previously published GWAS data8, providing the disease association statistics for 9,044,589 autosomal variants in 2,523 CD cases and 76,598 controls. The genomic-control inflation factor (λGC) was 1.038 and the LDSC intercept was 0.974, indicating little evidence of systemic bias and population stratification in the study cohort. A principal component analysis also supported a highly consistent genetic background between the cases and controls (**Supplementary Figure 1**). There were 1,227 variants surpassing the genome-wide significance threshold in 28 non-MHC loci (**Figure 1**). Of these, there were two novel association signals at rs255774 within the leukocyte immunoglobulin-like receptor (LILR) gene cluster (OR = 1.32, 95%CI = 1.22 to 1.42, *P*meta = 1.71 × 10−12) and at rs6802753 near *ZBTB38* (OR = 1.28, 95%CI = 1.18 to 1.38, *P*meta = 4.45 × 10−10). The lead variant in each novel susceptibility locus showed a modest effect size, which was highly consistent between the two Korean datasets (**Table 1**). In addition, the associations of 26 reported non-HLA loci, including *TNFSF15-TNFSF8, DUSP5, GPR35*, and *NOD2*, were replicated in the Korean population (**Supplementary Table 2**). The liability-scale narrow-sense heritability *h**2* estimated from the GWAS meta-analysis variants was 19.1%, of which approximately 3.0% accounted for the two novel loci. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/03/28/2023.03.28.23287842/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2023/03/28/2023.03.28.23287842/F1) Figure 1. Manhattan plot for the genome-wide association meta-analysis of Crohn’s disease in the Korean population. Novel and known susceptibility loci are labeled in red and black, respectively. View this table: [Table 1.](http://medrxiv.org/content/early/2023/03/28/2023.03.28.23287842/T1) Table 1. Genetic associations of two novel susceptibility loci for Crohn’s disease in Korean populations ### 3.2. Functional annotation of CD-risk variants #### 3.2.1. LILR gene cluster locus We identified multiple variants associated with CD in the LILR gene cluster region (Figure 2A). A Bayesian fine-mapping analysis suggested a credible variant set in which the listed variants had a relatively high posterior probability of being causal (**Supplementary Table 3**). These variants (n = 5) with over 95% cumulative probability of being causal were all located in non-coding elements and have previously been associated with the expression or alternative splicing of multiple genes, including *LILRB2, LILRB3, LILRB4*, and *LILRA6*, in whole blood according to GTEx Analysis Release V8 (dbGaP Accession phs000424.v8.p2). ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/03/28/2023.03.28.23287842/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2023/03/28/2023.03.28.23287842/F2) Figure 2. Regional association plots of the novel susceptibility loci for Crohn’s disease. **(A)** *ZBTB38* locus **(B)** leukocyte immunoglobulin-like receptor gene cluster locus. The lead variant rs255774 was detected as a splicing QTL (sQTL)19 associated with the variation in intron splicing efficiency of *LILRB2* in whole blood (GTEx splicing phenotype number: chr19:54218413:54276264:clu21769). The intron splicing associated with the CD-risk allele *C* of rs255774 results in the exclusion of all three immunoreceptor tyrosine-based inhibitory motifs (ITIMs) in the cytoplasmic tail of LILRB2 proteins that interact directly with MHC class I molecules, even when these are present on the same cells.20 Therefore, as the ITIMs inhibit MHC class I-mediated activating signals, it is plausible that the CD-risk variant downregulates an inhibitory pathway of the MHC class I-mediated immune response. Various approaches have long demonstrated the importance of antigen presentation by MHC class I molecules in the pathogenesis of many inflammatory diseases, including CD. In particular, the genetic variants in the MHC region explained a large fraction of the known genetic predisposition to CD in multiple ancestries, and an HLA association fine-mapping study clearly pinpointed the genetic contribution of MHC class I genes, such as *HLA-B* and *HLA-C*, to the development of CD.21 #### 3.2.2. The *ZBTB38* locus The *ZBTB38* locus has been well documented owing to the pleiotropic effects of its genetic variants on various human traits, including height.22 Notably, the lead variant rs6802753 and its LD proxies (*r*2 > 0.6) with genome-wide association significance were limited to *ZBTB38* (**Figure 2**B). Due to the high LD in the locus, it is challenging to statistically narrow the list of these variants down to a small number of putative causal candidate variants. There were 39 variants in the 95% credible set; however, none of the variants had PPs of over 0.1 (**Supplementary Table 4**). Some of the variants in the 95% credible set of the *ZBTB38* locus were located with active chromatin states23, and one of the proxies (rs6440008 with *P*meta=2.31×10−9) was previously associated with the expression level of *ZBTB38* in lymphoblastoid cell lines.24 In addition, although none of proxies in a strong LD with the lead variant resided in protein-coding elements, there was a relatively weakly correlated nonsynonymous proxy, rs62282002 (*r**2*=0.61), on *ZBTB38* with a suggestive CD association (*P*meta=4.75×10−6), which was previously reported to be associated with serum total protein levels25 and idiopathic short statures26. ### 3.3. Identification of the epistatic interaction between the LILR variant rs255774 and *TAP2*-regulatory variants Several LILRs, including *LILRB2*, are known to recognize a broad range of ligands, including MHC class I molecules, and contribute to the regulation of the MHC class I-mediated immune response.27 In this study, we aimed to identify the epistatic interaction between the lead CD-risk variant rs255774 in the LILR locus and CD-risk functional variants (lead eQTL or HLA variants) in the genes involved in the MHC class I-mediated antigen presentation using the newly generated GWAS dataset. There were 28 genes listed in the GO term “antigen processing and presentation of peptide antigens via MHC class I” (GO:0002474). Among these, 11 genes within the extended MHC region showed significant CD associations in our GWAS (**Supplementary Table 5**). Of the 11 genes, nine had significant eQTLs in various immune cell types of individuals with East Asian ancestry (lead_eQTLs:gene:tissue pairs = 29; **Supplementary Table 6**), and *HLA-B* showed significant CD associations at the level of HLA classical alleles and amino acid residues (**Supplementary Table 7**). The multivariate logistic regression with an interaction term between the LILR variant rs255774 and putatively functional CD-risk variants in the MHC class I pathway revealed that the LILR variant rs255774 interacts with a lead eQTL for *TAP2* (*P*interaction = 4.11 × 10−4) among the functional variants (**Supplementary Table 6**). Specifically, two CD-risk alleles in LILR and *TAP2* synergistically conferred risk of CD (ORintereaction=1.496). For example, when the study subjects were stratified according to the number of the CD-risk alleles of the LILR variant rs255774, the effect sizes of the CD-risk allele of *TAP2*-regulatory variant rs9276627 increased in individuals carrying more CD-risk LILR alleles (**Table 2**) and vice versa (**Supplementary Table 8**). In addition, the effect allele frequencies of rs9276627 were highly consistent among the rs255774-stratified control groups in contrast to the case groups, which indicates that the identified interaction follows the assumption of independence between two variants in the general population. Indeed, TAP2 transports peptide fragments from cytosol to the endoplasmic reticulum lumen for antigen processing and presentation mediated by MHC class I molecules, and the CD-risk variant of the *TAP2* eQTL rs9276627 is associated with increased expression of *TAP2*. View this table: [Table 2.](http://medrxiv.org/content/early/2023/03/28/2023.03.28.23287842/T2) Table 2. Synergistic interaction effect of two quantitative trait loci for *TAP2* and LILR genes on susceptibility to Crohn’s disease ## Discussion There is the growing list of susceptibility loci for CD mostly through GWASs, but a substantial proportion of CD susceptibility loci remains unidentified.2 Our large-scale genome analysis, which took advantage of the Korean-specific high-coverage GWAS array, the so-called KoreanChip, as well as a meta-analysis using another ancestry-matched GWAS dataset, successfully identified and characterized two dependable CD susceptibility loci with highly consistent effect sizes in two independent case-control datasets of Korean individuals. The novel loci identified in this study further contributed to 3.0% of the SNP-based heritability in the Korean cohort. The Bayesian inference of posterior inclusion probability was used to suggest the list of the most likely causal variants in the two novel loci. This finding supports the crucial role of non-coding variants in the pathogenesis of complex polygenic disorders. In particular, we identified five variants in a credible set, including potentially regulatory variants in the LILR locus. Of these, the two most significant variants explained >0.75 of the cumulative sum of PP. Notably, the lead CD-risk variant was an sQTL for *LILRB2* and an eQTL for multiple genes, including *LILRB3, LILRB4, LILRA6*, and *RPS9*. Regarding the LILR locus, the CD-relevant effector genes were not precisely determined, given the multiple target genes of CD-associated QTLs and the limited knowledge about the specific functions of each candidate gene in the gastrointestinal immune system. Experimental approaches and genetic analyses using different ancestries are necessary to identify disease-relevant genes. Nevertheless, we suspect that *LILRB2*, interacting with MHC class I molecules, may confer risk of CD via a synergistic gene-gene interaction with *TAP2* because risk variants on two genes in the same pathway may have a greater chance of interacting with each other in a biological sense. We identified the presence of an additional CD-risk effect on individuals with both the CD-risk QTL alleles for LILR genes and *TAP2* in a dose-dependent manner. TAP2 is a member of the ATP-binding cassette transporter family and is involved in transferring cytosolic peptide fragments into the endoplasmic reticulum lumen, after which the fragments are loaded on the MHC class I molecules before being presented on the cell surface.28 The expression of *TAP2* is upregulated to facilitate the MHC class I-mediated antigen presentation in various cell types including macrophage and dendritic cells.29 In our study, the results indicate that the risk allele *C* of the CD-associated *TAP2* eQTL rs9276627 is associated with increased levels of *TAP2*. Consistently, elevated expression levels of *TAP2* was observed in patients with CD through a colon transcriptome analysis.30 Therefore, the risk alleles of both the sQTL for *LILRB2* and the eQTL for *TAP2* appear to be associated with the upregulation of the MHC class I-mediated immune response. Another novel CD association at rs6802753 on 3q23 suggests that *ZBTB38* is the sole candidate causal gene. *ZBTB38* encodes “Zinc-finger and BTB Domain Containing 38” that regulates transcription while binding to methylated DNA elements.31 GWASs have demonstrated that variants in *ZBTB38* have pleiotropic effects on several human traits, including height,32 cancer,33 and prion disease34. The CD-risk variant rs6440008 in a high LD with the lead variant rs6802753 is a known eQTL whose CD-risk allele *C* is associated with the upregulation of *ZBTB38*. Nevertheless, the exact physiological and immunological role of *ZBTB38* have been understudied. Through immunological approaches, *ZBTB38* was reported to be hypomethylated and transcriptionally upregulated in adaptive NK cells.35 Similarly, the murine homolog gene *Zbtb38* was highly expressed in a murine model of rheumatoid arthritis due to its hypomethylated promoter and interfered with an anti-inflammatory pathway by transcriptionally repressing *IL1r2* in B cells.36 In non-immune cells, the knockdown of *Zbtb38* resulted in apoptosis.37 In addition, a cancer study showed that increased expression of *ZBTB38* in human bladder cancer induced a more aggressive phenotype with enhanced migration and invasive growth through modulation of the Wnt/β-catenin signaling pathway.38 In summary, we identified two novel susceptibility loci for CD and an epistatic interaction effect between CD-risk variants on risk of disease. Given the novel putative causal genes and variants identified, this GWAS provides novel insights into the underlying pathogenesis of CD involving the dysregulation of both LILR genes in the MHC class I-mediated antigen presentation pathway and *ZBTB38*. ## Supporting information Supplementary materials [[supplements/287842_file04.pdf]](pending:yes) ## Data Availability All data produced in the present study are available upon reasonable request to the authors. ## Funding This work was supported by the National Research Foundation of Korea (NRF-2017R1A5A2014768 and NRF-2020R1A2B5B02002259). ## Conflict of Interest The authors have no conflicts of interest to declare. ## Author Contributions KK and CKL designed the study. KK, SJO, JHL, and CKL wrote the manuscript. SJO, SK, CHC, SBK, TOK, DIP, HJK, and CKL recruited/characterized patients with CD, obtained DNA samples, and/or generated GWAS data. KK, SJO, HJK, and CKL obtained genetic data from the KoGES under the permission of Centers for Disease Control of Korea. KK, JL, AK, and CYY analyzed genetic data. All authors participated in critical revision of the manuscript and approved the final version. ## Data Availability The data used and analyzed for this study are available from the corresponding author upon reasonable request. ## Acknowledgements We would like to thank the following individuals and group for their contributions to this work: the IMPACT study group for the contribution to genotyping for patients with Crohn’s disease; study coordinators in all participating hospitals for their help in recruiting subjects; professors Ho-Su Lee and Byong Duk Ye at the University of Ulsan College of Medicine for providing the GWAS summary statistics from their previous CD GWAS. ## Footnotes * * K.K. and S.J.O. are co-first authors. * Received March 28, 2023. * Revision received March 28, 2023. * Accepted March 28, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Van Limbergen J, Russell RK, Drummond HE, et al. Definition of phenotypic characteristics of childhood-onset inflammatory bowel disease. Gastroenterology 2008;135:1114–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1053/j.gastro.2008.06.081&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18725221&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000259982800018&link_type=ISI) 2. 2.Chen GB, Lee SH, Brion MJ, et al. Estimation and partitioning of (co)heritability of inflammatory bowel disease from gwas and immunochip data. Hum Mol Genet 2014;23:4710–20. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddu174&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24728037&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 3. 3.Jans D, Cleynen I. The genetics of non-monogenic ibd. Hum Genet 2023. 4. 4.Stahl EA, Wegmann D, Trynka G, et al. Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nat Genet 2012;44:483–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2232&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22446960&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 5. 5.Eichler EE, Flint J, Gibson G, et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nat Rev Genet 2010;11:446–50. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg2809&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20479774&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000277806700013&link_type=ISI) 6. 6.Loos RJF. 15 years of genome-wide association studies and no signs of slowing down. Nat Commun 2020;11:5900. 7. 7.Kwak MS, Cha JM, Lee HH, et al. Emerging trends of inflammatory bowel disease in south korea: A nationwide population-based study. J Gastroenterol Hepatol 2019;34:1018–26. 8. 8.Jung S, Ye BD, Lee HS, et al. Identification of three novel susceptibility loci for inflammatory bowel disease in koreans in an extended genome-wide association study. J Crohns Colitis 2021;15:1898–907. 9. 9.Kang EA, Jang J, Choi CH, et al. Development of a clinical and genetic prediction model for early intestinal resection in patients with crohn’s disease: Results from the impact study. J Clin Med 2021;10. 10. 10.Maaser C, Sturm A, Vavricka SR, et al. Ecco-esgar guideline for diagnostic assessment in ibd part 1: Initial diagnosis, monitoring of known ibd, detection of complications. J Crohns Colitis 2019;13:144–64. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ECCO-JCC/JJY113&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 11. 11.Moon S, Kim YJ, Han S, et al. The korea biobank array: Design and identification of coding variants associated with blood biochemical traits. Sci Rep 2019;9:1382. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-018-37832-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 12. 12.Das S, Forer L, Schonherr S, et al. Next-generation genotype imputation service and methods. Nat Genet 2016;48:1284–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3656&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27571263&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 13. 13.Luo Y, Kanai M, Choi W, et al. A high-resolution hla reference panel capturing global population diversity enables multi-ancestry fine-mapping in hiv host response. Nat Genet 2021;53:1504–16. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00935-7&link_type=DOI) 14. 14.Willer CJ, Li Y, Abecasis GR. Metal: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics 2010;26:2190–1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btq340&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20616382&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000281738900017&link_type=ISI) 15. 15.Benner C, Spencer CC, Havulinna AS, et al. Finemap: Efficient variable selection using summary data from genome-wide association studies. Bioinformatics 2016;32:1493–501. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btw018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26773131&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 16. 16.Bulik-Sullivan BK, Loh PR, Finucane HK, et al. Ld score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 2015;47:291–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 17. 17.Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005;102:15545–50. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTAyLzQzLzE1NTQ1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDMvMjgvMjAyMy4wMy4yOC4yMzI4Nzg0Mi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. 18.Ishigaki K, Kochi Y, Suzuki A, et al. Polygenic burdens on cell-specific pathways underlie the risk of rheumatoid arthritis. Nat Genet 2017;49:1120–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3885&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28553958&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 19. 19.Li YI, Knowles DA, Humphrey J, et al. Annotation-free quantification of rna splicing using leafcutter. Nat Genet 2018;50:151–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-017-0004-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29229983&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 20. 20.Kang X, Kim J, Deng M, et al. Inhibitory leukocyte immunoglobulin-like receptors: Immune checkpoint proteins and tumor sustaining factors. Cell Cycle 2016;15:25–40. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/15384101.2015.1121324&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26636629&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 21. 21.Goyette P, Boucher G, Mallon D, et al. High-density mapping of the mhc identifies a shared role for hla-drb1*01:03 in inflammatory bowel diseases and heterozygous advantage in ulcerative colitis. Nat Genet 2015;47:172–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3176&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25559196&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 22. 22.Yengo L, Vedantam S, Marouli E, et al. A saturated map of common genetic variants associated with human height. Nature 2022;610:704–12. 23. 23.Ernst J, Kellis M. Chromatin-state discovery and genome annotation with chromhmm. Nat Protoc 2017;12:2478–92. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nprot.2017.124&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29120462&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 24. 24.Lappalainen T, Sammeth M, Friedlander MR, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature 2013;501:506–11. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature12531&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24037378&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000324826300049&link_type=ISI) 25. 25.Kanai M, Akiyama M, Takahashi A, et al. Genetic analysis of quantitative traits in the japanese population links cell types to complex human diseases. Nat Genet 2018;50:390–400. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0047-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29403010&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 26. 26.Wang Y, Wang ZM, Teng YC, et al. An snp of the zbtb38 gene is associated with idiopathic short stature in the chinese han population. Clin Endocrinol (Oxf) 2013;79:402–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/cen.12145&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23302005&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 27. 27.Abdallah F, Coindre S, Gardet M, et al. Leukocyte immunoglobulin-like receptors in regulating the immune response in infectious diseases: A window of opportunity to pathogen persistence and a sound target in therapeutics. Front Immunol 2021;12:717998. 28. 28.Lapenna A, Omar I, Berger M. A novel spontaneous mutation in the tap2 gene unravels its role in macrophage survival. Immunology 2017;150:432–43. 29. 29.Jenner RG, Young RA. Insights into host responses against pathogens from transcriptional profiling. Nat Rev Microbiol 2005;3:281–94. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrmicro1126&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15806094&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000227952800010&link_type=ISI) 30. 30.Yang L, Tang S, Baker SS, et al. Difference in pathomechanism between crohn’s disease and ulcerative colitis revealed by colon transcriptome. Inflamm Bowel Dis 2019;25:722–31. 31. 31.Marchal C, Defossez PA, Miotto B. Context-dependent cpg methylation directs cell-specific binding of transcription factor zbtb38. Epigenetics 2022;17:2122–43. 32. 32.Gudbjartsson DF, Walters GB, Thorleifsson G, et al. Many sequence variants affecting diversity of adult human height. Nat Genet 2008;40:609–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.122&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18391951&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000255366700029&link_type=ISI) 33. 33.Kote-Jarai Z, Olama AA, Giles GG, et al. Seven prostate cancer susceptibility loci identified by a multi-stage genome-wide association study. Nat Genet 2011;43:785–91. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.882&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21743467&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 34. 34.Mead S, Uphill J, Beck J, et al. Genome-wide association study in multiple human prion diseases suggests genetic risk factors additional to prnp. Hum Mol Genet 2012;21:1897–906. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddr607&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22210626&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000302302400018&link_type=ISI) 35. 35.Schlums H, Cichocki F, Tesi B, et al. Cytomegalovirus infection drives adaptive epigenetic diversification of nk cells with altered signaling and effector function. Immunity 2015;42:443–56. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.immuni.2015.02.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25786176&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F03%2F28%2F2023.03.28.23287842.atom) 36. 36.Ocsko T, Toth DM, Hoffmann G, et al. Transcription factor zbtb38 downregulates the expression of anti-inflammatory il1r2 in mouse model of rheumatoid arthritis. Biochim Biophys Acta Gene Regul Mech 2018;1861:1040–7. 37. 37.Oikawa Y, Matsuda E, Nishii T, Ishida Y, Kawaichi M. Down-regulation of cibz, a novel substrate of caspase-3, induces apoptosis. J Biol Chem 2008;283:14242–7. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEyOiIyODMvMjEvMTQyNDIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wMy8yOC8yMDIzLjAzLjI4LjIzMjg3ODQyLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 38. 38.Jing J, Liu J, Wang Y, et al. The role of zbtb38 in promoting migration and invasive growth of bladder cancer cells. Oncol Rep 2019;41:1980–90.