Whole-Exome sequencing identifies a Type 2 Diabetes candidate mutation in PTPRF =============================================================================== * Erik J.M. Toonen * Michael Kwint * Alexander Hoischen * Cees J. Tack ## Abstract **Objectives** Identification of genetic factors involved in Type 2 diabetes mellitus (T2DM) has been challenging. While genome-wide association studies (GWAS) have identified over 60 loci associated with T2DM, these variants only explain a small proportion of the total heritability of the disease. Whole-exome sequencing has become a powerful approach to identify genetic variants that are not captured by GWAS. We applied exome sequencing to identify causal genetic variants in a family with a 2-generation history of T2DM characterized by substantial insulin resistance, hypertension and isolated hypertriglyceridemia. Methods: Exome sequencing was performed on genomic DNA of two affected family members. Twenty-four identified variants that were present in both family members were further tested for segregation in all other family members by Sanger sequencing. Results: Three rare missense variants, located in the genes *PTPRF, FUCA1* and *FBXO30*, were present in all affected family members but also in one unaffected family member. Protein-protein interaction analysis showed that *PTPRF* strongly interacts with several members of the insulin signaling pathway. Conclusions: Our results suggest that the variant in the *PTPRF* gene might be causal for an unusual T2DM subtype that is characterized by a severe insulin resistance and isolated hypertriglyceridemia. Keywords * Whole-exome sequencing * Insulin resistance * Genetics of Type 2 Diabetes * Hypertriglyceridemia * Rare genetic variants ## Introduction Type 2 diabetes mellitus (T2DM) is a complex multifactorial trait where both environmental and genetic factors contribute to the pathogenesis of the disease. Based on twin and segregation studies, the heritability of type 2 diabetes has been estimated to be between 36 and 83 percent [1, 2]. Large-scale genome-wide association studies (GWAS) have contributed towards understanding the inherited basis of diabetes. In the past few years, GWA studies have identified over numerous loci associated with T2DM [3-7]. However, these identified genetic variants account for less than 6 percent of the heritability estimated from family studies [8]. Because these common variants explain only a small fraction of the estimated heritability, it has been hypothesized that rare genetic variants with large effect sizes contribute to the genetic predisposition of T2DM. Traditional genotyping platforms used in GWAS are primarily designed to capture common genetic polymorphisms but offer poor coverage of rare genetic variants, hence rare disease variants with large effect sizes are easily missed [9]. Approaches other than GWAS are necessary to identify the remaining genetic variants underlying the pathogenesis of type 2 diabetes in order to improve diagnosis and treatment strategies. Recently, whole-exome sequencing has become a powerful strategy to identify low-frequency and rare variants not captured by GWAS [10]. Sequencing of all genes (exome) is an approach relying on the hypothesis that functional disease-associated variation resides in the coding regions. Whole-exome sequencing has proven to be valuable for identifying mutations responsible for monogenic diseases [11] and emerging reports show that exome sequencing can also be applied to uncover variation associated with complex human traits [12, 13]. While T2DM consists of both decreased insulin secretion and reduced insulin action (insulin resistance), most genetic defects identified so far relate to insulin secretion and genetic factors in obesity-induced insulin resistance seem to be underreported [14]. In the present study, we apply exome sequencing to identify genetic variants that may have a role in disease susceptibility in a family with a 2-generation history of type 2 diabetes, with an unusual phenotype characterized by pronounced insulin resistance and hypertriglyceridemia. ## Methods ### Family members We recruited a family with a 2-generation family history of T2DM. The index cases were under chronic diabetes care in the hospital. Due to privacy reasons, the pedigree is not published, please contact the corresponding author for further information. Case presentations The index case presented with diabetes in his late thirties with a six months period of weight loss (14 kg) and balanitis. Body weight at presentation 80.5 kg, height 1.73 (BMI 26.7 kg/m2), blood pressure 150/100. Laboratory analysis revealed HbA1c 12.5%, ALAT slightly elevated, urine ketones negative. Patient started oral combination therapy (SU and metformin), with temporary improvement, but secondary glycemic failure after approximately 2 years. Before start of insulin his laboratory parameters were: HbA1c: 9.2%, total cholesterol 5.3, Triglycerides 4.9, HDL-cholesterol 0.70, LDL 2.51 mmol/L (all fasting and without statin), fasting insulin level: 30 mU/L, C-peptide: 1.54 nmol/L, ASAT 21, ALAT 39, urine albumin 20 mg/L. Patient was started on insulin, eventually administered by pump, up to 140 U/day (body weight increased to 94 kg, BMI 31.4 kg/m2). His triglyceride levels remained elevated. See Table 1 for details. View this table: [Tabel 1.](http://medrxiv.org/content/early/2020/09/13/2020.09.11.20189399/T1) Tabel 1. Characteristics of family members A relative was diagnosed with diabetes in his sixties during routine medical examination. He was overweight (weight 87.5 kg, height 1.78, BMI 27,6 kg/m2) and had hypertension, micro-albuminuria, hypertriglyceridemia and low HDL-cholesterol. He was eventually treated with insulin, up to 160 U/day. He always had elevated triglyceride levels. The clinical phenotype of affected family members was characterized by substantial insulin resistance, hypertension and hypertriglyceridemia without hypercholesterolemia. Data from one affected relative were retrieved from another hospital. Data from an unaffected relatives were obtained during a short hospital visit, data from one unaffected relative were obtained through his GP and by telephone. In family members with diabetes, detailed information regarding onset of diabetes, complications and treatment was obtained. In unaffected family members, BMI and blood pressure was measured and fasting glucose, and HbA1c (to exclude diabetes) and lipids levels were measured. Results are depicted in Table 1. There was no information suggesting consanguineous marriage in the family. ### Non-related diabetes subjects To validate our initial results obtained from the analysis of the family members, we selected 24 non-related subjects who matched the characteristics of the affected family members (insulin > IU/kg or >100 U/day; TG > 1,5 mmol/l; BMI < 35 kg m2). Subjects were selected from the Diabetes Pearl patient cohort. This cohort is a large, well-characterized observational cohort of type 2 diabetes patients. The Diabetes Pearl cohort is a collaborative biobank initiative from the Dutch University Medical Centers ([www.parelsnoer.org](http://www.parelsnoer.org)) [15]. DNA samples were retrieved from the Biobank after approval of the ethics committee. ### Whole-Exome sequencing Exome sequencing was essentially performed as described previously [16, 17]. In brief, exome enrichment was performed using the SureSelect Human Exon 50-Mb Kit (Agilent, Santa Clara, CA, USA), covering approximately 22.000 genes, followed by SOLiD4 sequencing (Life Technologies, Carlsbad, CA, USA). Reads were mapped to the hg19 reference genome using SOLiD LifeScope software version 2.1. All nonsynonymous variants shared in both affected individuals and absent or with very low frequency (0.1%) in dbSNP, were further investigated in healthy and affected family members. ### Sanger Sequencing Nonsynonymous variants identified by exome sequencing were tested in healthy and affected family members for co-segregation by traditional Sanger sequencing. Forward and reverse PCR primers for each exon in which the variant was located were selected in an intronic sequence 50 bp away from the intron/exon boundaries. Unique primers were designed using the Primer3 software. PCR products were verified on 2% agarose gel, purified using ExoSAP-IT (Affymetrix, Santa Clara, CA, USA) and analyzed on ABI3730 genetic analyzers (Life Technologies, Carlsbad, CA, USA). Ion torrent library preparation and sequencing All 32 coding exons of the *PTPRF* gene were analyzed in 24 unrelated diabetes patients who matched the clinical characteristics of the affected family members. The screening of 24 patients for variants in the *PTPRT* gene was carried out by using an Amplicon-based deep-sequencing approach on the Ion PGM sequencer (Life Technologies, Carlsbad, CA, USA). PCR amplicons were generated per sample using standard PCR protocols. Primer sequences are available upon request. PCR products per sample were pooled, and subsequently sheared to 200-bp fragments by the use of Covaris E210 (Covaris Inc.). Library preparation was performed using the Ion Xpress™ Plus Fragment Library Kit in combination with the Ion Xpress Barcode adapters 1-96 kit. Emulsion PCR and enrichment were carried out on the Ion OneTouch System using the Ion PGM TemplateOT2 200 Kit as described by the manufacturer. Sequencing was performed using the 200bp kitv2, with Ion Sphere particles loaded to an Ion318™ chip (Life Technologies Carlsbad, CA, USA). Raw sequencing reads were mapped to the hg19 genome using the Torrent Server with the use of the Torrent suite software. FASTQ files were subsequently analyzed and visualized in SeqNext (JSI Medical systems). ### Protein-protein interaction analysis Protein-protein interactions (PPI) were identified using the STRING database (STRING v9.1; [http://string-db.org/](http://string-db.org/)) [18]. STRING builds functional protein association networks based on compiled available experimental evidence. A confidence score is assigned to each identified protein-protein association and this score will increase when an association is supported by several types of evidence. In this study, only PPI are shown that yielded a high confidence score (≥0.7). ### Ethics All family members were informed about the goal of the study and consented to participation. ## Results ### Characteristics of family members Characteristics of affected and unaffected family members are shown in Table 1. ### Whole-Exome sequencing identifies variants within the PTPRF and ASXL1 genes Exome sequencing was performed on genomic DNA of two family members with an average 46- and 81-fold coverage leading to 85.2% and 87.9% of the exome with at least 10-fold coverage, respectively. We identified 38,067 and 39,293 genetic variants per proband in the two family members, respectively. Variants were annotated by a bioinformatics pipeline as previously described [16]. In addition, the variant had to be present in at least 20-80% of all reads suggestive of heterozygous changes in a dominant model of disease. Subsequently, genetic variants were prioritized based on predicted amino acid consequences and overlap with common genetic variants (listed in dbSNP version 135 and/or present in an in-house database containing >10,000 analyzed exomes of predominantly European ancestry). All filter steps are shown in Supplementary Table 1. We identified 21 unique non-synonymous variants present in both affected family members that were present in genes with known expression in adipose tissue (Table 2, Supplementary Table 1). All 21 variants were tested for segregation in all other family members by Sanger sequencing (Table 3). Two rare missense variants were present in all affected family members but also in one unaffected family member. Variants c.3614A>G (rs200885607; p.(N11965S); located in the gene *Protein tyrosine phosphatase, receptor type F (PTPRF;* NM_002840)) and c.904C>T p.(R302C); located in the gene *ASXL transcriptional regulator 1* (*ASXL1*; NM_015338)) were also present in the non-affected family member II/4 (Table 3). The c.3614A>G variant within *PTPRF* was detected once in the in genome-wide sequence data from the 1000 Genomes Project which include the genomes of 1088 people (or 2176 chromosomes) [19]. The variant was present at very low frequency in public exome datasets, e.g. ExAc browser frequency 12/121398 alleles, i.e. allele frequency of 9.885 x 10-5; with 7/66730 alleles in Non-Finish Europeans (0.0001049 allele frequency) ([http://exac.broadinstitute.org/](http://exac.broadinstitute.org/) [August 2019]) and also with very low allele frequency in the in-house exome database (0.00045 in > 8000 exomes). The *ASXL1* variant was never observed in ExAc nor in the in-house database. *In silico* predictions for both variants can be seen in Supplementary Table 2. View this table: [Table 2.](http://medrxiv.org/content/early/2020/09/13/2020.09.11.20189399/T2) Table 2. Overview of the 21 unique nonsynonymous variants present in both affected family members. View this table: [Table 3.](http://medrxiv.org/content/early/2020/09/13/2020.09.11.20189399/T3) Table 3. Segregation of the 21 identified variants in all family members. ### PTPRF strongly interacts with proteins involved in insulin signaling To investigate/identify in which biological processes and pathways the genes *PTPRF* and *ASXL1* are involved, we used the STRING database to search for predicted and published protein-protein interactions. Analysis showed that *PTPRF* strongly interacts with several members of the insulin signaling pathway. High confidence scores were assigned to the interactions between *PTPRF* and *Insulin receptor (INSR)* and *Insulin receptor substrate* 1 and *2 (IRS1* and 2) (Table 4, Figure 1A). Indeed, *PTPRF* was demonstrated to be expressed in several insulin sensitive tissues where it interacts with the insulin receptor and dephosphorylates its tyrosine-kinase domain, thereby inhibiting its activation and downstream signaling [20]. These results point out to an important role for *PTPRF* in insulin signaling. Protein-protein interaction analysis also revealed that *ASXL1* did not interact with proteins involved in insulin signaling, glucose metabolism or other biological processes that are likely to play a role in the pathogenesis of type 2 diabetes (Table 4 and Figure 1B). View this table: [Table 4.](http://medrxiv.org/content/early/2020/09/13/2020.09.11.20189399/T4) Table 4. Predicted interaction partners for *PTPRF* and *ASXL1*. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/09/13/2020.09.11.20189399/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2020/09/13/2020.09.11.20189399/F1) Figure 1. Protein-protein interaction (PPI) networks of *PTPRF* and *ASXL1*. PPI networks of **(A)** *PTPRF* and **(B)** *ASXL1*. A confidence score is assigned to each interaction based upon direct (physical) and indirect (functional) associations. Confidence score cut-off was ≥0,7 (high confidence) ### Investigation of the PTPRF gene in an additional cohort of T2DM patients The results described above clearly pointed out to *PTPRF* as a strong candidate gene for this T2DM phenotype. To validate our results, we investigated the *PTPRF* gene for the possible presence of genetic variants in an additional cohort of 24 unrelated T2DM patients. These patients were selected based upon disease characteristics that matched the characteristics of the affected family members (insulin > 1U kg or 100 U/day; TG > 1,5 mmol/l; BMI < 35 kg m2). All coding exons of the *PTPRF* gene were sequenced in these 24 patients by the use of Ion torrent sequencing. The c.3614A>G variant was not present in these 24 patients. Also no additional rare variants were detected in this cohort. ## Discussion In the present study, we applied whole-exome sequencing to identify genetic variants possibly involved in disease susceptibility in a family with T2DM, characterized by substantial insulin resistance, hypertension and hypertriglyceridemia in the absence of hypercholesterolemia. We identified the rare variant c.3614>G, located within the *PTPRF* gene, as a novel candidate susceptibility variant. Sequencing of all coding exons of the *PTPRF* gene in a cohort of 24 non-related T2DM patients did not reveal any additional variants. Our results suggest that the c.3614A>G variant might play a role in an unusual T2DM subtype that is characterized by a severe insulin resistance and “isolated” hypertriglyceridemia. Identification of genetic variants responsible for specific forms of diabetes provide new insights regarding underlying disease aetiology. These new insights may help to improve diagnosis and to optimize treatment strategies in these specific T2DM subtypes. The c.3614A>G variant was detected in all affected family members but also in one unaffected family member. Although this family member was not diagnosed with type 2 diabetes, his triglyceride levels were elevated. All family members with diabetes had episodes with grossly elevated TG levels (>10 mmol/L), fairly independent of age, but occasionally also showed less elevated TG levels. TG level was, however, always above the normal range. TG level in the genetically affected family member without diabetes was moderately elevated in one single blood sample. Perhaps this person had a combination of relative active lifestyle and excellent beta-cell function, resulting in absence of diabetes. Additionally, T2DM is a multifactorial disease in which both genetic as environmental factors play a role. Another reason why this individual carries the c.3614A>G variant but was not diagnosed with T2DM might be reduced penetrance. As often seen for multifactorial diseases with a heritable component, the c.3614A>G variant may have reduced penetrance, meaning that some individuals fail to express the trait, even though they carry the allele. The degree of penetrance is difficult to determine and is often age related. Given that the onset of symptoms in T2DM are age related and affected by both genetic as well as environmental factors such as nutrition, it is possible that this individual will develop the disease in the near future, although he is currently older than the age at which the diabetes was diagnosed in his brothers. Although we cannot exclude the variant located in *ASXL1* as a candidate, the evidence for *PTPRF* as a candidate gene for T2DM is more convincing. Our finding that a genetic variant within *PTPRF* may be involved in the pathogenesis of type 2 diabetes is supported by earlier studies. Miscio and co-workers investigated whether common polymorphisms within the *PTPRF* gene were associated with insulin resistance. They identified a single nucleotide polymorphism (SNP) within the promoter region (−127T>A; rs3001722) in which the A allele (minor allele) was associated with lower body mass index (BMI), waist circumference and mean blood pressure in a cohort of 589 unrelated non-diabetic subjects. In addition, the A allele was associated with lower triglycerides, glucose and insulin levels during an oral glucose tolerance test (oGTT) in an independent cohort of 307 unrelated non-diabetic individuals [21]. Menzaghi and co-workers report an association between the SNP 2518A>G (rs2782641), located within intron 3, and cononary artery disease (CAD) in a T2DM patient cohort including 592 individuals [22]. To our knowledge, the c.3614A>G variant has not been reported in literature in relation to T2DM. These studies, together with our results, suggest that genetic variants within *PTPRF* are contributing to insulin resistance and T2DM in at least a subpopulation of patients. *PTPRF* is a receptor-type transmembrane protein tyrosine phosphatase (PTPase) that is expressed in several insulin target tissues [23]. It physically interacts with the insulin receptor and dephosphorylates its tyrosine-kinase domain thereby inhibiting insulin signaling [24]. Transgenic mice overexpressing *PTPRF* are insulin resistant [25] and, conversely, *PTPRF* knock-out mice are characterized by increased insulin sensitivity [26]. *PTPRF* is therefore a potential candidate gene for insulin resistance. Regarding *ASXL1*, no studies were reported showing evidence that *ASXL1* is involved in the pathogenesis of T2DM. Most studies investigating the genetic background of T2DM are focusing on identifying common variants that are associated with the disease [8, 27-29]. However, most studies have not been particularly successful in discovering new disease genes. As a consequence, the rare variant hypothesis [30, 31], which states that rare variants with large effects are the primary drivers of common diseases, has received renewed attention. Studies investigating Mendelian diseases have demonstrated that a family-based approach is a good method to detect these rare variants with relatively high impact on disease [32]. Rare variants are not present in the general population at any reasonable frequency and an advantage of a family-based study is that more observations of a rare variant can be made in large families. However, type 2 diabetes is a late-onset disease and recruiting such families is challenging. We identified this family because of the unusual phenotype of the index patient, and noted the familial existence of the combination of severe insulin resistance and “isolated” hypertriglyceridemia: hypertriglyceridemia in the absence of hypercholesterolemia. As expected, hypertriglyceridemia was associated with decreased HDL-cholesterol levels. In this study, we used the family-based approach combined with whole-exome sequencing to detect variants that may have a role in susceptibility for the disease in this family. We were able to identify a rare variant located in a strong T2DM candidate gene, which suggests that the family-based approach combined with whole-exome sequencing may be a useful strategy to investigate the genetic background of type 2 diabetes. While our findings suggest that the c.3614A>G variant in *PTPRF* may underly the phenotype of the this family, we were unable to identify a second family within our facility that resembles the disease characteristics observed in this first family, despite sharing this as a candidate gene via genematcher.com [33, 34]. In addition, no variants in *PTPRF* were identified in a patient cohort of 24 unrelated T2DM patients who phenotypically matched the characteristics of the affected family members. Our study lacks confirmation through functional studies, which should be included in future investigations. We realize that it is possible that the identified variant is a false positive result. However, we would like to stress that the variant in *PTPRF* was identified using an unbiased, non-candidate gene driven, whole exome-wide approach and that the gene is known to be involved in insulin signaling and associated with metabolic disturbances. To our opinion, this suggests that *PTPRF* is a potential candidate gene that, at least, should be added to the discussion regarding genetic variants underlying T2DM. In that light, it might be best to see this report as a case report providing researchers and clinicians the opportunity to respond and set up collaborations in order to identify additional cases/families. In summary, we applied whole-exome sequencing to identify genetic variants possibly involved in T2DM disease susceptibility in a family with highly aggregated type 2 diabetes. Our results suggest that the c.3614A>G variant in the *PTPRF* gene play a role in an unusual T2DM subtype that is characterized by a severe insulin resistance and isolated hypertriglyceridemia. These results indicate that combining the family-based approach with whole-exome sequencing may be a useful strategy to further unravel the genetic background of type 2 diabetes. The genetic variant identified within this study represents a promising candidate for stratification of T2DM patients regarding mild or severe insulin resistance. Follow-up studies are needed to confirm that the identified variant in the *PTPRF* gene is indeed responsible for severe insulin resistance and isolated hypertriglyceridemia and to further unravel the precise working mechanism responsible for this unusual T2DM phenotype. If several independent studies are able to confirm our initial results, this genetic variant may have additional value for diagnosis and to optimize treatment strategies in patients with this unusual T2DM subtype that is characterized by severe insulin resistance and isolated hypertriglyceridemia. ## Data Availability Availability of data and material Please contact author for data requests List of abbreviations : ALAT : alanine aminotransferase ASAT : Aspartaat-aminotransferase BMI : body mass index CAD : coronary artery disease GP : general practitioner GWAS : genome-wide association study HbA1c : hemoglobin A1c HDL : high-density lipoprotein LDL : low-density lipoprotein oGTT : oral glucose tolerance test PCR : polymerase chain reaction protein-protein : interaction SNP : single nucleotide polymorphism SU : Sulphonylureas T2DM : Type 2 diabetes mellitus TG : triglycerides ## Funding E.T. was supported by an EFSD/Novo Nordisk research grant provided by the European Foundation for the Study of Diabetes (EFSD) and by a grant of the Else-Kröner-Fresenius-Stiftung. ## Conflicts of interests None declared ## Declarations ### Ethics approval and consent to participate The authors state that they have obtained appropriate institutional review board approval or have followed the principles outlined in the Declaration of Helsinki for all human or animal experimental investigations. All family members were informed about the goal of the study and consented to participation. Study was approves by the The local Medical Ethical Committee, Commissie Mensgebonden Onderzoek (CMO) regio Arnhem-Nijmegen. ## Consent for publication Consent to publish was obtained from all individuals. ### Availability of data and material Please contact author for data requests ## Funding E.T. was supported by an EFSD/Novo Nordisk research grant provided by the European Foundation for the Study of Diabetes (EFSD) and by a grant of the Else-Kröner-Fresenius-Stiftung. ## Author contributions ET: designed the study, performed the experiments, analyzed and interpreted the results, wrote the manuscript. MK: performed the experiments, analyzed and interpreted the results. AH: designed the study, analyzed and interpreted the results, revised the manuscript. CT: designed the study, analyzed and interpreted the results, revised the manuscript. ## Acknowledgements The authors would like to thank Lisenka E.L.M. Vissers, Petra de Vries, Marloes Steehouwer and Christian Gilissen for their help and advice during this study. The authors are very grateful to the patients and family members who made it possible to perform this study. * Received September 11, 2020. * Revision received September 11, 2020. * Accepted September 13, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Kaprio, J., et al., Concordance for type 1 (insulin-dependent) and type 2 (non-insulin-dependent) diabetes mellitus in a population-based cohort of twins in Finland. Diabetologia, 1992. 35(11): p. 1060–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02221682&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=1473616&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1992JT73800008&link_type=ISI) 2. 2.Diabetes mellitus in twins: a cooperative study in Japan. Committee on Diabetic Twins, Japan Diabetes Society. Diabetes Res Clin Pract, 1988. 5(4): p. 271–80. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0168-8227(88)80062-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=3069409&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1988Q703500006&link_type=ISI) 3. 3.O’Rahilly, S., I. Barroso, and N.J. Wareham, Genetic factors in type 2 diabetes: the end of the beginning? Science, 2005. 307(5708): p. 370–3. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzMDcvNTcwOC8zNzAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOS8xMy8yMDIwLjA5LjExLjIwMTg5Mzk5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 4. 4.Witka, B.Z., et al., Type 2 Diabetes-Associated Genetic Polymorphisms as Potential Disease Predictors. Diabetes Metab Syndr Obes, 2019. 12: p. 2689–2706. 5. 5.Fuchsberger, C., et al., The genetic architecture of type 2 diabetes. Nature, 2016. 536(7614): p. 41–47. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature18642&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27398621&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 6. 6.Moltke, I., et al., A common Greenlandic TBC1D4 variant confers muscle insulin resistance and type 2 diabetes. Nature, 2014. 512(7513): p. 190–3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature13425&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25043022&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 7. 7.Flannick, J., et al., Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls. Nature, 2019. 570(7759): p. 71–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-019-1231-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31118516&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 8. 8.Morris, A.P., et al., Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet, 2012. 44(9): p. 981–90. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2383&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22885922&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 9. 9.Lohmueller, K.E., et al., Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes. Am J Hum Genet, 2013. 93(6): p. 1072–86. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2013.11.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24290377&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 10. 10.Holm, H., et al., A rare variant in MYH6 is associated with high risk of sick sinus syndrome. Nat Genet, 2011. 43(4): p. 316–20. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.781&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21378987&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 11. 11.Ng, S.B., et al., Exome sequencing identifies the cause of a mendelian disorder. Nat Genet, 2010. 42(1): p. 30–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.499&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19915526&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000273055100013&link_type=ISI) 12. 12.Samocha, K.E., et al., A framework for the interpretation of de novo mutation in human disease. Nat Genet, 2014. 13. 13.Bernier, R., et al., Disruptive CHD8 Mutations Define a Subtype of Autism Early in Development. Cell, 2014. 158(2): p. 263–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2014.06.017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24998929&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000340943600008&link_type=ISI) 14. 14.Grarup, N., et al., Genetic susceptibility to type 2 diabetes and obesity: from genome-wide association studies to rare variants and beyond. Diabetologia, 2014. 57(8): p. 1528–41. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00125-014-3270-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24859358&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000338997500003&link_type=ISI) 15. 15.Van’t, R.E., et al., The Diabetes Pearl: Diabetes biobanking in The Netherlands. BMC.Public Health, 2012. 12: p. 949. 16. 16.Hoischen, A., et al., De novo mutations of SETBP1 cause Schinzel-Giedion syndrome. Nat Genet, 2010. 42(6): p. 483–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.581&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20436468&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000278081500009&link_type=ISI) 17. 17.van, d. V, et al., STAT1 mutations in autosomal dominant chronic mucocutaneous candidiasis. N.Engl.J.Med., 2011. 365(1): p. 54–61. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa1100102&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21714643&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000292459400009&link_type=ISI) 18. 18.Franceschini, A., et al., STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res, 2013. 41(Database issue): p. D808–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gks1094&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23203871&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000312893300114&link_type=ISI) 19. 19.Genomes Project, C., et al., A map of human genome variation from population-scale sequencing. Nature, 2010. 467(7319): p. 1061–73. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature09534&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20981092&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283548600039&link_type=ISI) 20. 20.Hashimoto, N., et al., Insulin receptor protein-tyrosine phosphatases. Leukocyte common antigen-related phosphatase rapidly deactivates the insulin receptor kinase by preferential dephosphorylation of the receptor regulatory domain. J Biol Chem, 1992. 267(20): p. 13811–4. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEyOiIyNjcvMjAvMTM4MTEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOS8xMy8yMDIwLjA5LjExLjIwMTg5Mzk5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 21. 21.Miscio, G., et al., The allelic variant of LAR gene promoter -127 bp T-->A is associated with reduced risk of obesity and other features related to insulin resistance. J Mol Med (Berl), 2004. 82(7): p. 459–66. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15150650&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000223173600006&link_type=ISI) 22. 22.Menzaghi, C., et al., The protein tyrosine phosphatase receptor type f (PTPRF) locus is associated with coronary artery disease in type 2 diabetes. J Intern Med, 2008. 263(6): p. 653–4. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18298487&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 23. 23.Tsujikawa, K., et al., Distinct functions of the two protein tyrosine phosphatase domains of LAR (leukocyte common antigen-related) on tyrosine dephosphorylation of insulin receptor. Mol Endocrinol, 2001. 15(2): p. 271–80. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1210/me.15.2.271&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11158333&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000166762200006&link_type=ISI) 24. 24.Goldstein, B.J., et al., Regulation of the insulin signalling pathway by cellular protein-tyrosinephosphatases. Mol Cell Biochem, 1998. 182(1-2): p. 91–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1006812218502&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9609118&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000073343400011&link_type=ISI) 25. 25.Zabolotny, J.M., et al., Overexpression of the LAR (leukocyte antigen-related) protein-tyrosine phosphatase in muscle causes insulin resistance. Proc Natl Acad Sci U S A, 2001. 98(9): p. 5187–92. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czo5OiI5OC85LzUxODciO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOS8xMy8yMDIwLjA5LjExLjIwMTg5Mzk5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 26. 26.Ren, J.M., et al., Transgenic mice deficient in the LAR protein-tyrosine phosphatase exhibit profound defects in glucose homeostasis. Diabetes, 1998. 47(3): p. 493–7. [Abstract](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZGlhYmV0ZXMiO3M6NToicmVzaWQiO3M6ODoiNDcvMy80OTMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOS8xMy8yMDIwLjA5LjExLjIwMTg5Mzk5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 27. 27.Replication, D.I.G., et al., Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet, 2014. 46(3): p. 234–44. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2897&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24509480&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 28. 28.Albrechtsen, A., et al., Exome sequencing-driven discovery of coding polymorphisms associated with common metabolic phenotypes. Diabetologia, 2013. 56(2): p. 298–310. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00125-012-2756-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23160641&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000313075500010&link_type=ISI) 29. 29.Manning, A.K., et al., A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nat Genet, 2012. 44(6): p. 659–69. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2274&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22581228&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 30. 30.Bodmer, W. and C. Bonilla, Common and rare variants in multifactorial susceptibility to common diseases. Nat Genet, 2008. 40(6): p. 695–701. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.f.136&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18509313&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000256212000007&link_type=ISI) 31. 31.Pritchard, J.K. and N.J. Cox, The allelic architecture of human disease genes: common disease-common variant…or not? Hum Mol Genet, 2002. 11(20): p. 2417–23. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/11.20.2417&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12351577&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000178515900009&link_type=ISI) 32. 32.Saad, M. and E.M. Wijsman, Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes. Genet Epidemiol, 2014. 38(1): p. 1–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/gepi.21776&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24243664&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 33. 33.Sobreira, N., et al., GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum Mutat, 2015. 36(10): p. 928–30. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.22844&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26220891&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom) 34. 34.Sobreira, N., et al., New tools for Mendelian disease gene identification: PhenoDB variant analysis module; and GeneMatcher, a web-based tool for linking investigators with an interest in the same gene. Hum Mutat, 2015. 36(4): p. 425–31. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.22769&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25684268&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F13%2F2020.09.11.20189399.atom)