Genotype-first analysis in an unselected health system-based population reveals variable phenotypic severity of COL4A5 variants =============================================================================================================================== * McKenzie Zellers * Kaushal Solanki * Melissa A Kelly * Karyn M Murphy * Kyle Retterer * H. Les Kirchner * Ion Dan Bucaloiu * Bryn Moore * Tooraj Mirshahi * Alexander R Chang ## Abstract **Introduction** Our knowledge of X-linked Alport Syndrome [AS] comes mostly from selected cohorts with more severe disease. **Methods** We examined the phenotypic spectrum of X-linked AS in males and females with a genotype-based approach using data from the Geisinger MyCode DiscovEHR study, an unselected health system-based cohort with exome sequencing and electronic health records. Patients with *COL4A5* variants reported as pathogenic (P) or likely pathogenic (LP) in ClinVar, or protein-truncating variants (PTVs), were each matched with up to 5 controls without *COL4A3/4/5* variants by sociodemographics, diabetes diagnosis, and year of first outpatient encounter. AS-related phenotypes included dipstick hematuria, bilateral sensorineural hearing loss (BSHL), proteinuria, decreased eGFR, and ESKD. **Results** Out of 170,856 patients, there were 29 hemizygous males (mean age 52.0 y [SD 20.0]) and 55 heterozygous females (mean age 59.3 y [SD 18.8]) with a *COL4A5* P/LP variant, including 48 with the hypomorphic variant p.Gly624Asp. Overall, penetrance (having any AS phenotypic feature) was highest for non-p.Gly624Asp P/LP variants (males: 94%, females: 85%), intermediate for p.Gly624Asp (males: 77%, females: 69%), compared to controls (males: 32%; females: 50%). The proportion with ESKD was highest for males with P/LP variants (44%), intermediate for males with p.Gly624Asp (15%) and females with P/LP variants (10%), compared to controls (males: 3%, females 2%). Only 47% of individuals with *COL4A5* had completed albuminuria screening, and a minority were taking renin-angiotensin aldosterone system (RAAS) inhibitors. Only 38% of males and 16% of females had a known diagnosis of Alport syndrome or thin basement membrane disease. **Conclusion** In an unselected cohort, we show increased risks of AS-related phenotypes in men and women compared to matched controls, while showing a wider spectrum of severity than has been described previously and variability by genotype. Future studies are needed to determine whether early genetic diagnosis can improve outcomes in Alport Syndrome. Keywords * Alport Syndrome * Chronic Kidney Disease * Clinical Epidemiology * Kidney Failure * Genetic Renal Disease * ESKD ## Introduction Alport syndrome (AS) is the second most common genetic cause of end stage kidney disease behind polycystic kidney disease, affecting approximately one in 2320 individuals1,2. Nearly 85% of total cases of AS in the past have been due to variants in the *COL4A5* gene found on the X chromosome, with the rest of individuals with AS having variants in the *COL4A3* and *COL4A4* genes responsible for autosomal recessive and dominant inheritance patterns1. Individuals with AS can have phenotypic manifestations related to abnormalities in the basement membranes of the glomerulus, cochlea of the ear, and lens of the eye, resulting in microscopic hematuria, sensorineural hearing loss, and ocular disturbances. Each collagenous IV chain contains a collagenous domain made of Gly Xaa Yaa triplet repeats, resulting in a rigid protein structure3. Thus, glycine missense variants disrupting the intermediate collagenous domains as well as protein-truncating variants (PTVs) are likely to be deleterious. Some but not all studies suggest a worse prognosis for PTVs than missense variants4–8. One glycine missense variant, p.Gly624Asp, is adjacent to a non-collagenous region and known to have a more mild phenotype.9 Although females are more likely to carry *COL4A5* variants due to the nature of the X-linked inheritance pattern, males are more severely impacted, with reported risk of progression to ESKD reported at nearly 100% in *COL4A5* males compared to 25% in similarly affected females10,11. An important limitation to the existing literature is that participants in these studies have been selected from families known to be affected by AS, likely leading to an overestimation of penetrance and phenotypic severity. As availability of genetic testing increases and costs of testing decrease, there is an urgent need to understand the penetrance and phenotypic spectrum in the general population to inform families about the implications of a genetic diagnosis of AS. Using a genotype-first approach, we aimed to examine the phenotypic spectrum of X-linked AS due to *COL4A5* variants using an unselected health system-based population. We hypothesized that both males and females with *COL4A5* variants would have higher risk of kidney phenotypes though the penetrance and phenotypic severity of disease would be lower than previously reported in more selected cohorts. ## Methods ### Study Population There were 170,856 research participants in MyCode who had whole exome sequencing data available after quality control. These research participants are patients in the Geisinger Health System at large who consented to participate in the MyCode Community Health Initiative and were sequenced through the Geisinger-Regeneron DiscovEHR collaboration12. This research study was approved by the Geisinger Institutional Review Board. Patients with ClinVar P/LP or PTVs, as defined below, were classified as cases. We queried ClinVar [date accessed 4/24/2023] for *COL4A5* (NM_033380.3) variants previously classified as pathogenic (P) or likely pathogenic (LP) by >1 submitter and no conflicts, or conflicting variants with a ratio of ≥6 P/LP to other classifications13. Predicted protein truncating variants (PTVs; frameshift, nonsense, splice) were identified from those called as high impact variants by variant effect predictor v109. All variants underwent manual review to ensure technical confidence and clinical relevance.14 Any variants that did not meet quality thresholds were confirmed in a Clinical Laboratory Improvement Amendments (CLIA)-certified laboratory using orthogonal technology prior to inclusion. ### Phenotypic outcomes Phenotypic data sources included International Classification of Diseases (ICD) 9 and 10 codes, blood and urine laboratory data, and chart review. ICD codes were used for hypertension, diabetes, hearing loss, bilateral sensorineural hearing loss, AS, focal segmental glomerulosclerosis (FSGS), and ESKD (**Supplemental Table 1**). We used the 2021 CKD-EPI equation to calculate eGFR, and we imputed eGFR of 5 ml/min/1.73m2 for patients with kidney transplant.15 Hematuria on dipstick was defined as having trace blood or greater on at least half of all urinalyses, excluding urinalyses positive for leukocyte esterase or nitrites.5 Age at time of ESKD was determined using linkage to the United States Renal Data System,16 and if these data were not available, by chart review. Presence of albuminuria was defined as ever having albumin to creatinine ratio (ACR) ≥ 30 mg/g, and presence of proteinuria by dipstick was defined requiring at least 2 dipsticks. Any phenotypic feature of AS was defined as hematuria on urinalysis or ICD code, albuminuria, eGFR <60 ml/min/1.73m2, FSGS ICD code, ESKD ICD code, or bilateral sensorineural hearing loss. Patients with most recent eGFR data at age ≥18 years were categorized into Kidney Disease Improving Global Outcomes (KDIGO) risk categories using estimated glomerular filtration rate (eGFR) and ACR (**Supplemental Table 2**).5,17 If ACR was not available, urine dipstick protein data was used for KDIGO risk categorization with dipstick protein 1+ twice classified as ACR 30-299 mg/g, and dipstick protein 2+ or greater twice classified as ACR 300+ mg/g. Chart review was performed on all cases to further examine diagnosis of AS or TBMD, kidney biopsies, treatment with angiotensin-converting enzyme inhibitors (ACEIs) or angiotensin receptor blockers (ARBs), family history, and genetic testing. ### Statistical analysis Patients with *COL4A5* P/LP variants were matched to up to 5 controls (patients without *COL4A3, COL4A4,* and *COL4A5* variants at allele frequency<0.01), with exact matching on age (within 1 year) at time of most recent eGFR, sex, race, diabetes diagnosis, and year of first outpatient encounter. We compared characteristics between patients with *COL4A5* P/LP variants and matched controls using chi-square tests for categorical outcomes and paired t-tests for continuous outcomes overall, and by sex and variant group. We used Poisson regression models with robust variance as the primary analysis to estimate prevalence ratios to determine AS-related phenotypic outcomes, stratified by sex and genotype (P/LP variant or the known hypomorphic p.Gly624Asp variant), using each subgroups’ matched controls. 18,19 We also conducted time-to-event analyses with ESKD as the outcome using Kaplan-Meier survival curves with age as the time variable. Patients were censored at age of last eGFR value. The proportional hazards assumption was graphically assessed, and Cox proportional hazards models were adjusted for age at time of first eGFR value in sex-stratified analyses. As prior *COL4A5* studies have suggested genotype-phenotype correlations with the worst outcomes for nonsense and frameshift variants, intermediate outcomes for splice variants, and milder phenotypes for missense and in-frame deletion variants,6,20,21 we also estimated risk of ESKD using Cox models for nonsense/frameshift variants, splice variants, the hypomorphic p.Gly624Asp variant, and other missense or in-frame deletion variants. We conducted sensitivity analyses, including time-to-event competing-risks regression using the method of Fine and Gray,22 and analyses using Firth logistic regression, including only one individual from each family. We considered p values <0.05 to be significant. Analyses were conducted using STATA/MP 15.1. ## Results Out of the 170,856 patients, there were 29 hemizygous males (mean age 52.0 y [SD 20.0]) and 55 heterozygous females (mean age 59.3 y [SD 18.8]) with a *COL4A5* P/LP variant. The most common *COL4A5* P/LP variant was p.Gly624Asp (n=48), and there were 17 other unique variants, including 4 other missense or in-frame deletion variants (n=16 individuals), 7 splicing variants (n=13 individuals), 4 frameshift variants (n=4 individuals), and 2 stop-gained variants (n=3 individuals). Of the 84 individuals with *COL4A5* P/LP variants, the vast majority were matched to 5 controls (78 individuals matched to 5 controls, 2 to 3 controls, 2 to 2 controls, 2 to 1 control). Compared to matched controls, individuals with *COL4A5* P/LP variants were more likely to have urinalysis (96% vs 82%, p=0.001) and ACR (47% vs 26%, p<0.001) results available in addition to having a hypertension diagnosis (68% vs 51%, p=0.004) whereas 100% of patients in both groups had evaluation of eGFR (**Table 1**). The median (interquartile interval) year of first outpatient visit was 2003.5 (2001-2009.5) for individuals with *COL4A5* and 2004 (2001-2009) for matched controls. View this table: [Table 1.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/T1) Table 1. Characteristics of *COL4A5* P/LP X-linked Alport Syndrome patients and matched controls ### Prevalence of AS Phenotypes by Sex Overall, males with *COL4A5* P/LP variants had the highest risk of AS-related phenotypes, followed by males with the hypomorphic p.Gly624Asp variant and females with *COL4A5* P/LP variants, with lowest risk in females with p.Gly624Asp (**Table 2**, **Figure 1**). The p.Gly624Asp variant was associated with a milder phenotype than other P/LP variants for both males and females. Males with p.Gly624Asp demonstrated lower penetrance (n=13; 0% AS ICD, 15% ESKD ICD, 31% hearing loss ICD, 23% eGFR <30 ml/min/1.73m2) compared to other hemizygous *COL4A5* P/LP males (n=16; 31% AS ICD, 44% ESKD ICD, 50% hearing loss ICD, 44% eGFR <30 ml/min/1.73m2). KDIGO risk categorization demonstrated that 33% of p.Gly624Asp males (mean age 55.7 y [SD 17.1] had a high-risk category or greater as compared to 80% of other hemizygous *COL4A5* P/LP males (mean age 49.0 y [SD 22.2]) (**Table 2**, **Figure 1**). Similarly, the prevalence of high-risk KDIGO category or greater in p.Gly624Asp females (mean age 60.3 y [SD 14.8] was 24% compared to 68% of other P/LP females (mean age 59.9 y [SD 18.1]). ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/09/2024.06.04.24308453/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/F1) Figure 1. KDIGO risk categories by sex and variant type KDIGO risk categories created using eGFR and ACR. If ACR not available, then urine dipstick protein data used with dipstick protein 1+ twice classified as ACR 30-299 mg/g, and dipstick protein 2+ or greater twice classified as ACR 300+ mg/g. KDIGO risk categories defined as: 0 - no CKD or hematuria; 1 - hematuria alone (eGFR ≥60, ACR <30); 2 – Moderately increased risk (eGFR 45-59 and ACR<30 or eGFR ≥60 with ACR 30-299); 3 – High risk (eGFR 30-44 with ACR<30 or eGFR 45-59 with ACR 30-299, or eGFR ≥60 with ACR 300+); 4 – Very high risk (eGFR 15-29 with ACR<300 or eGFR 30-44 with ACR ≥30 or eGFR 45-59 with ACR ≥300); 5 – Extremely high risk (eGFR<15 or eGFR 15-29 with ACR 300+; see Supplemental Table 2). Two cases and 6 controls <18 years of age were excluded from this categorization. View this table: [Table 2.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/T2) Table 2. Phenotypic Features of *COL4A5* P/LP X-linked Alport Syndrome by Sex and Genotype All groups had significantly higher risk of AS phenotypic features compared to matched controls with prevalence ratios (PR) of 3.13 (95% CI: 2.18, 4.48) for males with P/LP variants, 2.20 (95% CI: 1.39, 3.48) for males with p.Gly624Asp, 1.75 (95% CI: 1.34, 2.29) for females with P/LP variants, and 1.35 (95% CI: 1.03, 1.76) for females with p.Gly624Asp (**Table 3**, **Figure 2**). Risks of the most common AS manifestation, dipstick hematuria, were similar for males with P/LP variants (PR 2.36, 95% CI: 1.07, 5.23), males with p.Gly624Asp (PR 3.34, 95% CI: 1.80, 6.21), females with P/LP variants (PR 2.59, 95% CI: 1.73, 3.87), and females with p.Gly624Asp (PR 1.86, 95% CI: 1.28, 2.70). Males and females with p.Gly624Asp tended to have milder phenotypic presentation than their counterparts with other P/LP variants. For example, prevalence ratios for ESKD were 35.00 (95% CI: 4.57, 268.10) for males with P/LP variants, 3.08 (95% CI: 0.56, 16.80) for males with p.Gly624Asp, 9.30 (95% CI: 0.88, 98.67) for females with P/LP variants, and 1.61 (95% CI: 0.17, 15.07) for females with p.Gly624Asp. Males with P/LP and p.Gly624Asp variants had significantly increased risk of hearing loss ICD diagnosis whereas females with P/LP variants or p.Gly624Asp did not. Sensitivity analyses restricted to one individual per family showed consistent results (**Supplemental Tables 3**). Variant-level phenotypic characteristics, stratified by sex, are shown in **Supplemental Table 4**. ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/09/2024.06.04.24308453/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/F2) Figure 2. Prevalence Ratios for Alport Syndrome-related phenotypes Associations between *COL4A5* P/LP variants and renal phenotypes examined using Poisson regression with robust variance in analyses in stratified analyses by genotype and sex. View this table: [Table 3.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/T3) Table 3. Prevalence Ratios of Alport Syndrome Phenotypic Features by Sex and Genotype ### Time-to-event Analyses for ESKD by Genotype Kaplan-Meier survival curves for ESKD-free survival followed similar patterns (**Figure 3**). Overall, there was no difference in deaths in patients with *COL4A5* variants and controls (7% vs. 7%; p=0.95) with mean age of death of *COL4A5* patients 70.8 y (range 54.3-80.7). **Table 4** shows hazard ratios (HR) for ESKD were significant for males with P/LP variants (HR 20.14, 95% CI: 5.85, 69.38; p<0.001), females with P/LP variants (HR 5.65, 95% CI: 1.00, 31.75), and of borderline significance for males with the hypomorphic p.Gly624Asp variant (HR 5.38, 95% CI: 0.97, 29.75). Females with the hypomorphic p.Gly624Asp variant were not at significantly increased risk of ESKD (HR 1.95, 95% CI: 0.22, 17.59). When examined by additional variant types, (e.g. nonsense/frameshift variants, splice, hypomorphic p.Gly624Asp, other missense or in-frame), risk was similar between variant groups though 95% intervals were very wide due to small sample sizes (**Supplemental Table 5)**. Results accounting for competing risks of death were consistent (**Supplemental Table 6)**. Mean age of ESKD onset in males was 36.9 y (SD 15.0) males with P/LP variants and 50.0 (SD 5.7) for males with Gly624Asp. Mean age of ESKD onset was 51.7 (SD 12.6) for females with no difference between those with P/LP or Gly624Asp variants. ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/09/2024.06.04.24308453/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/F3) Figure 3. Kaplan-Meier Survival Curves for ESKD-free Survival Kaplan-Meier curves comparing risk of ESKD by genotype for females and males with age as the time variable. View this table: [Table 4.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/T4) Table 4. Event Rates and Hazard Ratios for ESKD by Genotype in Males and Females ### Individuals with a *COL4A5* P/LP variant and a second *COL4A3/4/5* variant There were 15 individuals who had an additional rare (AF<0.01) variant in *COL4A3/4/5* with 4 (27%) having ESKD (**Supplemental Table 7)**. The individuals with ESKD included a male with *COL4A5* splice variant c.438+2T>G and *COL4A4* p.Pro352Leu; a male with *COL4A5* p.Leu1655Arg and *COL4A3* p.Val950Ile variant; a male with *COL4A5* p.Leu1655Arg and *COL4A3* p.Thr1489Ile, and a female with *COL4A5* c.4964T>G and *COL4A3* p.Ile1659Val. ### Diagnosis of AS AS or thin basement membrane disease was documented in only 10/16 (63%) of males with P/LP variants, 1/13 (8%) of males with p.Gly624Asp, 4/20 (20%) of females with P/LP variants, and 4/35 (11%) of females with p.Gly624Asp. A total of 17 individuals (20%) had documentation of a native kidney biopsy performed, and none had undergone a skin biopsy to diagnose AS. Of those with native kidney biopsies, 9 (53%) had a biopsy diagnosis of AS, hereditary nephritis or thin basement membrane disease; FSGS was reported in 6 (33%) biopsies (**Table 5**). One patient had 2 biopsies as a child that were inconclusive (1st biopsy: insufficient sample, thin basement membrane disease; 2nd biopsy: IgM nephropathy, FSGS) and has yet to be diagnosed with AS. Only 2 (12%) had specific mention of staining for the alpha5 chain of type IV collagen. In terms of genetic diagnosis, only 5 (6%) received genetic testing and 2 (2%) additional patients reported having a family member who had genetic testing confirming diagnosis. View this table: [Table 5.](http://medrxiv.org/content/early/2024/08/09/2024.06.04.24308453/T5) Table 5. Characteristics of *COL4A5* P/LP X-linked Alport Syndrome with native kidney biopsies ### Receipt of Recommended Care for AS Only 40 (47%) of *COL4A5* patients had ever had ACR testing completed though 82 (96%) had urinalysis data. Overall, only 27 (32%) of *COL4A5* patients were receiving RAAS inhibitors. Among the 37 individuals with ACR ≥30 mg/g or 1+ protein twice on dipstick (and without ESKD), only 18 (49%) were receiving RAAS inhibitors. ## Discussion In this study using an unselected cohort, we provide more representative quantification of risks associated with X-linked AS, by sex and genotype. Risks of ESKD was highest among males with P/LP variants (44%), intermediate for males with p.Gly624Asp (15%) and females with P/LP variants (10%), and not significantly increased for females with p.Gly624Asp (3%). Overall, we show a wider spectrum of disease severity in both men and women with *COL4A5* variants than described in earlier studies that focused on families known to be afflicted by the disease. For example, prior studies suggested males with X-linked AS having a 100% chance of progression to ESKD10. In our cohort, 80% of *COL4A5 P/LP* males and 33% of *COL4A5* p.Gly624Asp males were categorized as having KDIGO high-risk category or greater, and there were 2 men above the age of 70 years who had eGFR ≥60 ml/min/1.73m2. This finding of more variable phenotypic severity has important implications in terms of providing more accurate information for families. The broader phenotypic spectrum using a genotype-first approach in an unselected cohort is consistent with other similar genotype-based studies for conditions ranging from autosomal dominant polycystic kidney disease to monogenic diabetes.5,23,24 We also uncovered gaps in care related to AS. Despite a median follow up time of approximately 14 years, allowing ample opportunity for various forms of testing, less than half of individuals had albuminuria screening completed and very few individuals with *COL4A5* had a diagnosis of AS. With our genotype-based approach, we are also able to study genotype-phenotype correlations in a less biased fashion than prior studies. Prior studies have suggested that nonsense/frameshift variants confer the worst prognosis, followed by splice variants, and then the mildest phenotypes for missense and in-frame deletion variants.6,20 In our study, we did not observe this genotype-phenotype correlation with similar observed risk for missense variants and PTVs. However, numbers were limited for these analyses, and it is possible that lumping variants into various groups may not reflect differential severity of individual variants within each group. Beyond p.Gly624Asp, which was associated with better outcomes for both males and females, we were unable to make firm conclusions about individual variants. Verifying genotype-phenotype correlations will require larger, additional cohort studies with unselected populations. Our study and others have shown that p.Gly624Asp, thought to have originated in Central and Eastern Europe due to a founder effect between the 12th and 13th century, confers a milder form of disease and clinical course which may be explained by the location of this variant within the type IV collagen gene25,26. The p.Gly624Asp variant is found adjacent to a non-collagenous interruption within the collagenous domain. It has been hypothesized that since p.Gly624Asp occurs adjacent to such an interruption, it likely increases the flexibility of an already flexible region, explaining why it results in a milder form of disease, as compared to variants inducing flexibility in the more rigid parts of the collagenous domain26. However, despite being associated with a milder presentation, males with p.Gly624Asp variants still had a 5-fold increased risk of developing eGFR<30 ml/min/1.73m2, and 15% had ESKD. Females are often considered by clinicians as simply being “carriers” of X-linked AS, and we found documentation of this in our chart review of this cohort. Although the manifestations of disease are on average less severe as compared to males, significant kidney risks are still present. An earlier study of families with X-linked AS by Jais et al. had reported a 30% risk of ESKD in females with *COL4A5* by the age of 60, with higher risk in those with concurrent proteinuria and hearing loss11. In our study, females with P/LP variants tended to be at increased risk of ESKD with 10% developing ESKD by the age of 65y, likely due to the unselected nature of our cohort. Regardless, the diagnosis of AS in women has important implications for affected women as well as for reproductive decision making and family planning. Prior research by Gross et al. has demonstrated a role for RAAS inhibitors in delaying onset of renal failure and improving life expectancy in AS27. Their study largely focused on AS in males with sibling pairs showing a significantly delayed onset of renal failure in the younger sibling thought to be due to earlier diagnosis and initiation of medical therapy. This delayed onset of renal failure with RAAS blockade was similarly seen in a study examining females with AS28. Even in males and females with hypomorphic Gly624Asp variants, RAAS inhibitors have been shown to be associated with reduced risk of end stage renal failure in an observational study by Boeckhaus et al29. Despite these studies, only 49% of patients were on RAAS inhibitors even when albuminuria was detected on laboratory studies, emphasizing the importance of early genetic testing and screening to identify such patients. This finding raises the question of whether or not diagnosing these individuals with AS would result in better risk awareness leading to improved adherence to medical therapy, and potentially better patient outcomes. The major strength of our study is the utilization of an unselected health population and exome sequencing data to inform our diagnosis of patients with X-linked AS, allowing for an in-depth exploration of the phenotypic spectrum of disease. Additionally, due to the availability of extensive longitudinal EHR data, we were able to examine a wide breadth of different phenotypic outcomes. However, EHR data was limited for traits such as ophthalmologic and audiometry data. Another limitation of our study is that all patients with *COL4A5* in our cohort were of European ancestry. In conclusion, we were able to show in a large, unselected cohort, the wide spectrum of phenotypic presentation of males and females with *COL4A5* variants. X-linked AS was underdiagnosed in both males and females with opportunities to improve monitoring of disease progression and better use of guideline-recommended RAAS inhibitors. Further studies are needed to determine whether early genetic diagnosis can improve outcomes in AS. ## Supporting information Supplemental Materials [[supplements/308453_file02.pdf]](pending:yes) ## Disclosures A.C. has served as a consultant for Novartis, Reata, and Amgen, and has received research funding from Novartis, Novo Nordisk, and Bayer. ## Data availability The data supporting the findings of this study are available within the article and its Supplementary Data files. Additional information for reproducing the results described in the article is available upon reasonable request and subject to a data use agreement. ## Author contributions Study design (ARC, HLK), data collection (MZ, KS, KR, BM, IDB, TM), analysis (ARC), interpretation of results (MZ, IDB, TM, ARC), drafted manuscript (MZ, ARC), revised critically for important intellectual content (all authors). All authors have approved the version to be published. ## Conflicts of interest No potential conflicts of interest relevant to this article were reported. ## Acknowledgements We are grateful to the many patients who contributed to the MyCode Community Health Initiative by providing genomic and electronic health information. We thank the Regeneron Genetics Center for providing funding for patient enrollment and exome sequencing for the DiscovEHR study, and we would like to acknowledge the Geisinger-Regeneron DiscovEHR Collaboration for the genotypic and phenotypic data. This work was supported by NIGMS grant GM111913 to T.M. We thank the USRDS for their support. The data reported here have been supplied by the United States Renal Data System (USRDS). The interpretation and reporting of these data are the responsibility of the author(s) and in no way should be seen as an official policy or interpretation of the U.S. government. ## Footnotes * Some additional validation of variants was done, and thus, the numbers are updated. HRs by genotype are shown for P/LP variants and Gly624Asp in the main manuscript now with the smaller subgroups moved to the supplement. * Received June 4, 2024. * Revision received August 8, 2024. * Accepted August 9, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Gibson J, Fieldhouse R, Chan MMY, et al. Prevalence Estimates of Predicted Pathogenic COL4A3-COL4A5 Variants in a Population Sequencing Database and Their Implications for Alport Syndrome. J Am Soc Nephrol. 2021;32(9):2273–2290. doi:10.1681/ASN.2020071065 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6OToiMzIvOS8yMjczIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDgvMDkvMjAyNC4wNi4wNC4yNDMwODQ1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 2. 2.Kashtan CE, Ding J, Garosi G, et al. Alport syndrome: a unified classification of genetic disorders of collagen IV α345: a position paper of the Alport Syndrome Classification Working Group. Kidney Int. 2018;93(5):1045–1051. doi:10.1016/J.KINT.2017.12.018 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.kint.2017.12.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29551517&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 3. 3.Hudson BG. The molecular basis of goodpasture and alport syndromes: Beacons for the discovery of the collagen IV family. Journal of the American Society of Nephrology. 2004;15(10):2514–2527. doi:10.1097/01.ASN.0000141462.00630.76 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6MTA6IjE1LzEwLzI1MTQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wOC8wOS8yMDI0LjA2LjA0LjI0MzA4NDUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 4. 4.Savige J, Ariani F, Mari F, et al. Expert consensus guidelines for the genetic diagnosis of Alport syndrome. Pediatr Nephrol. 2019;34(7):1175–1189. doi:10.1007/S00467-018-3985-4 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00467-018-3985-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29987460&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 5. 5.Solanki K V., Hu Y, Moore BS, et al. The Phenotypic Spectrum of COL4A3 Heterozygotes. Kidney Int Rep. 2023;8(10):2088–2099. doi:10.1016/j.ekir.2023.07.010 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ekir.2023.07.010&link_type=DOI) 6. 6.Gibson JT, Huang M, Shenelli Croos Dabrera M, et al. Genotype–phenotype correlations for COL4A3–COL4A5 variants resulting in Gly substitutions in Alport syndrome. Scientific Reports 2022 12:1. 2022;12(1):1–13. doi:10.1038/s41598-022-06525-9 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-022-06525-9&link_type=DOI) 7. 7.Gibson JT, de Gooyer M, Huang M, Savige J. A Systematic Review of Pathogenic COL4A5 Variants and Proteinuria in Women and Girls With X-linked Alport Syndrome. Kidney Int Rep. 2022;7(11):2454–2461. doi:10.1016/J.EKIR.2022.08.021 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.EKIR.2022.08.021&link_type=DOI) 8. 8.Kim JH, Lim SH, Song JY, et al. Genotype-phenotype correlation of X-linked Alport syndrome observed in both genders: a multicenter study in South Korea. Sci Rep. 2023;13(1). doi:10.1038/S41598-023-34053-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/S41598-023-34053-7&link_type=DOI) 9. 9.Żurowska AM, Bielska O, Daca-Roszak P, et al. Mild X-linked Alport syndrome due to the COL4A5 G624D variant originating in the Middle Ages is predominant in Central/East Europe and causes kidney failure in midlife. Kidney Int. 2021;99(6):1451–1458. doi:10.1016/J.KINT.2020.10.040 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.kint.2020.10.040&link_type=DOI) 10. 10.Kashtan CE, Ding J, Garosi G, et al. Alport syndrome: a unified classification of genetic disorders of collagen IV α345: a position paper of the Alport Syndrome Classification Working Group. Kidney Int. 2018;93(5):1045–1051. doi:10.1016/j.kint.2017.12.018 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.kint.2017.12.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29551517&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 11. 11.Jais JP, Knebelmann B, Giatras I, et al. X-linked Alport syndrome: Natural history and genotype-phenotype correlations in girls and women belonging to 195 families: A “European Community Alport Syndrome Concerted Action” study. Journal of the American Society of Nephrology. 2003;14(10):2603–2610. doi:10.1097/01.ASN.0000090034.71205.74 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6MTA6IjE0LzEwLzI2MDMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wOC8wOS8yMDI0LjA2LjA0LjI0MzA4NDUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 12. 12.Carey DJ, Fetterolf SN, Davis FD, et al. The Geisinger MyCode community health initiative: an electronic health record–linked biobank for precision medicine research. Genetics in Medicine. 2016;18(9). doi:10.1038/gim.2015.187 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/gim.2015.187&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26866580&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 13. 13.Landrum MJ, Lee JM, Benson M, et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44(D1). doi:10.1093/nar/gkv1222 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkv1222&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26582918&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 14. 14.Kelly MA, Leader JB, Wain KE, et al. Leveraging population-based exome screening to impact clinical care: The evolution of variant assessment in the Geisinger MyCode research project. Am J Med Genet C Semin Med Genet. 2021;187(1):83–94. doi:10.1002/AJMG.C.31887 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ajmg.c.31887&link_type=DOI) 15. 15.Inker LA, Eneanya ND, Coresh J, et al. New Creatinine-and Cystatin C-Based Equations to Estimate GFR without Race. N Engl J Med. 2021;385(19):1737–1749. doi:10.1056/NEJMOA2102953 [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.1056/NEJMOA2102953/SUPPL_FILE/NEJMOA2102953_DISCLOSURES.PDF&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 16. 16.U.S. Renal Data System. 2022 USRDS annual data report: Epidemiology of kidney disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, 2022. 17. 17.Stevens PE, Ahmed SB, Carrero JJ, et al. KDIGO 2024 Clinical Practice Guideline for the Evaluation and Management of Chronic Kidney Disease. Kidney Int. 2024;105(4):S117–S314. doi:10.1016/J.KINT.2023.10.018 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.KINT.2023.10.018&link_type=DOI) 18. 18.Talbot D, Mésidor M, Chiu Y, Simard M, Sirois C, Denis Talbot C. An Alternative Perspective on the Robust Poisson Method for Estimating Risk or Prevalence Ratios Running head: Alternative Perspective on Robust Poisson Method. doi:10.48550/arXiv.2112.00547 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.48550/arXiv.2112.00547&link_type=DOI) 19. 19.Chen W, Shi J, Qian L, Azen SP. Comparison of robustness to outliers between robust poisson models and log-binomial models when estimating relative risks for common binary outcomes: A simulation study. BMC Med Res Methodol. 2014;14(1). doi:10.1186/1471-2288-14-82 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2288-14-82&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24965498&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 20. 20.Kim JH, Lim SH, Song JY, et al. Genotype–phenotype correlation of X-linked Alport syndrome observed in both genders: a multicenter study in South Korea. Scientific Reports 2023 13:1. 2023;13(1):1–10. doi:10.1038/s41598-023-34053-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-023-34053-7&link_type=DOI) 21. 21.JAIS JP, KNEBELMANN B, GIATRAS I, et al. X-linked Alport Syndrome. Journal of the American Society of Nephrology. 2000;11(4):649–657. doi:10.1681/ASN.V114649 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6ODoiMTEvNC82NDkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wOC8wOS8yMDI0LjA2LjA0LjI0MzA4NDUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 22. 22.Fine JP, Gray RJ. A Proportional Hazards Model for the Subdistribution of a Competing Risk. J Am Stat Assoc. 1999;94(446):496. doi:10.2307/2670170 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2307/2670170&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000081058500019&link_type=ISI) 23. 23.Chang AR, Moore BS, Luo JZ, et al. Exome Sequencing of a Clinical Population for Autosomal Dominant Polycystic Kidney Disease. JAMA. 2022;328(24):2412–2421. doi:10.1001/JAMA.2022.22847 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/JAMA.2022.22847&link_type=DOI) 24. 24.Mirshahi UL, Colclough K, Wright CF, et al. Reduced penetrance of MODY-associated HNF1A/HNF4A variants but not GCK variants in clinically unselected cohorts. Am J Hum Genet. 2022;109(11):2018–2028. doi:10.1016/J.AJHG.2022.09.014 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2022.09.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36257325&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) 25. 25.Gibson JT, Huang M, Shenelli Croos Dabrera M, et al. Genotype–phenotype correlations for COL4A3–COL4A5 variants resulting in Gly substitutions in Alport syndrome. Sci Rep. 2022;12(1). doi:10.1038/s41598-022-06525-9 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-022-06525-9&link_type=DOI) 26. 26.Żurowska AM, Bielska O, Daca-Roszak P, et al. Mild X-linked Alport syndrome due to the COL4A5 G624D variant originating in the Middle Ages is predominant in Central/East Europe and causes kidney failure in midlife. Kidney Int. 2021;99(6):1451–1458. doi:10.1016/j.kint.2020.10.040 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.kint.2020.10.040&link_type=DOI) 27. 27.Gross O, Licht C, Anders HJ, et al. Early angiotensin-converting enzyme inhibition in Alport syndrome delays renal failure and improves life expectancy. Kidney Int. 2012;81(5):494–501. doi:10.1038/ki.2011.407 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ki.2011.407&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22166847&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000300377000011&link_type=ISI) 28. 28.Temme J, Peters F, Lange K, et al. Incidence of renal failure and nephroprotection by RAAS inhibition in heterozygous carriers of X-chromosomal and autosomal recessive Alport mutations. Kidney Int. 2012;81(8):779–783. doi:10.1038/ki.2011.452 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ki.2011.452&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22237748&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F09%2F2024.06.04.24308453.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000302269300010&link_type=ISI) 29. 29.Boeckhaus J, Hoefele J, Riedhammer KM, et al. Lifelong effect of therapy in young patients with the COL4A5 Alport missense variant p.(Gly624Asp): a prospective cohort study. Nephrology Dialysis Transplantation. 2022;37(12):2496–2504. doi:10.1093/ndt/gfac006 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ndt/gfac006&link_type=DOI)