A common 1.6 Mb Y-chromosomal inversion predisposes to subsequent deletions and severe spermatogenic failure in humans ====================================================================================================================== * Pille Hallast * Laura Kibena * Margus Punab * Elena Arciero * Siiri Rootsi * Marina Grigorova * Rodrigo Flores * Mark A. Jobling * Olev Poolamets * Kristjan Pomm * Paul Korrovits * Kristiina Rull * Yali Xue * Chris Tyler-Smith * Maris Laan ## Abstract Male infertility is a prevalent condition, concerning 5-10% of men. So far, only some recurrent genetic factors have been described as confident contributors to spermatogenic failure. Here, we report the first re-sequencing study of the Y-chromosomal *Azoospermia Factor c* (*AZFc*) region combined with gene dosage and Y-haplogroup determination. In analysing 2,324 Estonian men, we uncovered a novel structural variant as a high-penetrant risk factor to male infertility. The Y lineage R1a1-M458, reported at >20% frequency in several European populations, carries a fixed ∼1.6 Mb long *r2/r3* inversion destabilizing the *AZFc* region and predisposing to recurrent microdeletions. Such complex rearrangements were significantly enriched among severe oligozoospermia cases. The carrier *vs* non-carrier risk to spermatogenic failure was increased 8.6-fold (*p* = 6.0 × 10−4). The finding contributes to improved molecular diagnostics and clinical management of infertility. Carrier identification in young age will facilitate timely counselling and reproductive decision-making. Keywords * idiopathic male infertility * Y-chromosomal *AZFc* region * re-sequencing * *gr/gr* deletion * complex structural rearrangements * Y haplogroup R1a1-M458 * *DAZ, BPY2, CDY1* dosage and variants ## Introduction The diagnosis of male factor infertility due to abnormal semen parameters concerns ∼10% of men (*1, 2*). In today’s andrology workup ∼60% of patients with spermatogenic failure remain idiopathic (*3*). Among the known causes, the most widely considered genetic factors are karyotype abnormalities (up to 17% of patients) and recurrent *de novo* microdeletions of the Y-chromosomal *AZFa* (∼0.8 Mb), *AZFb* (∼6.2 Mb) and *AZFc* (∼3.5 Mb) regions (2–10%)(*3-5*). For more than 15 years, testing for *AZF* deletions has been strongly recommended in the diagnostic workup for infertility patients with sperm concentration <5 x 106/ml (*6, 7*). Most deletion carriers represent patients with either azoospermia (no sperm) or cryptozoospermia (>0–1 million sperm/ejaculate) (*3, 8, 9*). The most prevalent deletion type is *AZFc* (∼80%), followed by the loss of *AZFa* (0.5–4%), *AZFb* (1–5%) and *AZFbc* (1–3%) regions (**Figure 1A**). Excess of recurrent *AZFc* deletions is promoted by the region’s complex genomic structure comprised of long direct and inverted amplicons of nearly identical DNA segments that lead to aberrant meiotic rearrangements in gametogenesis (*10, 11*) (**Figure 1B**). The *AZFc* full deletions remove all the multi-copy *DAZ (deleted in azoospermia 1), BPY2 (basic charge Y-linked 2*) and *CDY1* (*chromodomain Y-linked 1*) genes that are expressed in a testis-enriched manner and considered important in spermatogenesis (**Figure 1C**). ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F1) Figure 1. Y-chromosomal *AZFc* region and its partial deletions in the study group. **(A)** Schematic representation of the human Y chromosome with the *AZFa, AZFb* and *AZFc* regions shown as black bars. **(B)** Magnified structure of the *AZFc* region with approximate locations of multi-copy protein-coding genes, STS (sY) markers for the detection of *AZFc* partial deletions and the span of typical *gr/gr* and *b2/b3* deletions (*10*). P1-P3 (gray triangles) denote palindromic genomic segments consisting of two ‘arms’ representing highly similar inverted DNA repeats (>99.7% sequence identity) that flank a relatively short distinct ‘spacer’ sequence. Of note, the occurrence of the *b2/b3* deletion requires a preceding inversion in the *AZFc* region and therefore its presentation on the reference sequence includes also the retained segment (gray dashed line). Full details about alternative *gr/gr* and *b2/b3* deletion types are presented in **Figure S1.** **(C)** Dosage of multi-copy genes on human Y chromosomes with or without *AZFc* deletions. **(D)** Prevalence of the *gr/gr* and *b2/b3* deletions detected in the subgroups of this study. Fisher’s exact test was used to test the statistical significance in the deletion frequencies between the groups. PAR, pseudoautosomal region; MSY, male-specific region of the Y chromosome; cen, centromere; *AZF*, azoospermia factor region. The palindromic structure of the *AZFc* region also facilitates partial deletions that are rather frequently detected in the general population (*12-15*). The most prevalent partial deletion types, named after the involved amplicons as *g(reen)-r(ed)/g(reen)-r(ed*) (lost segment ∼1.6 Mb) and *b(lue)2/b(lue)3* (∼1.8 Mb) reduce the copy-number of *DAZ, BPY2* and *CDY1* genes by roughly 50% (**Figure 1B-C, Figure S1**). The published data on the contribution of *gr/gr* and *b2/b3* deletions to spermatogenic failure are inconsistent. In European populations, the carrier status of the *gr/gr* deletion increases a risk to low sperm counts ∼1.8-fold (*14, 16, 17*). Its more variable effect on spermatogenesis has been shown in Middle Eastern and Asian populations, where the *gr/gr* deletion is completely fixed in some Y lineages, e.g. haplogroups D2 and Q1a that are common in Japan and some parts of China (*18, 19*). In contrast, the *b2/b3* deletion appears to be a risk factor for spermatogenic impairment in several East Asian and African, but not in European or South Asian populations (*20, 21*). Notably, the *b2/b3* deletion is completely fixed in Y haplogroup N3 that has a high frequency (up to 90% in some populations) in Finno-Ugric, Baltic and some Turkic-speaking people living in Northern Eurasia (*14, 15, 22*). Thus, it is unlikely that the carriership of a *gr/gr* or *b2/b3* deletion *per se* has an effect on male fertility potential. It has been proposed that this broad phenotypic variability may be explained by the diversity of *gr/gr* and *b2/b3* deletion subtypes (*23*). Y chromosomes carrying partial *AZFc* deletions may differ for the content, dosage or genetic variability of the retained genes, the overall genetic composition reflected by phylogenetic haplogroups or the presence of additional structural variants. Only limited studies have analyzed the subtypes of *gr/gr* or *b2/b3* deletions and no straightforward conclusions have been reached for their link to spermatogenic failure (*17, 24, 25*). The current study represents the largest in-depth investigation of *AZFc* partial deletions in men recruited by a single European clinical center. We analysed 1,190 Estonian idiopathic patients with male factor infertility in comparison to 1,134 reference men from the same population, including 810 subjects with sperm parameter data available. Y chromosomes carrying *gr/gr* or *b2/b3* deletions were investigated for additional genomic rearrangements, Y-chromosomal haplogroups, dosage and sequence variation of the retained *DAZ, BPY2* and *CDY* genes. The study aimed to determine the role and contribution of *gr/gr* and *b2/b3* deletion subtypes in spermatogenic failure and to explore their potential in the clinical perspective. ## Results ### Enrichment of *gr/gr* deletions in Estonian idiopathic infertile men with reduced sperm counts The study analyzed 1,190 Estonian men with idiopathic infertility (sperm counts 0 – 39 x106/ejaculate) and a reference group comprised of 1,134 Estonian men with proven fatherhood (n=635) or representing healthy young men (n=499) (**Table 1, Table S1**). For all 2,324 study subjects, complete *AZFa, AZFb* and *AZFc* deletions were excluded. View this table: [Table 1.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/T1) Table 1. Characteristics of the patients with male factor infertility and reference groups used for comparison. The partial *AZFc* deletions identified were *gr/gr* (n=46), *b2/b3* (n=756) and *b1/b3* (n=1, reference case) (**Table 2**). A statistically significant excess of *gr/gr* deletions was detected in idiopathic male infertility patients (2.7%; n=32/1,190) compared to reference cases (1.2%; n=14/1,134) (Fisher’s exact test, *p*=0.016; OR = 2.2 [95% CI 1.2 - 4.2]) (**Figure 1D, Table S2**). The highest frequency of *gr/gr* deletion carriers (6.8%, n=6/88) was detected in cryptozoospermia cases (sperm count >0 – 1 x 106/ejaculate). However, in the reference group andrological parameters of men with or without the *gr/gr* deletion did not differ (**Table S3**). All 10 reference men with the *gr/gr* deletion and available andrological data were normozoospermic (220.3 [74.2-559.0] x 106 sperm/ ejaculate). Also their other andrological parameters were within the normal range, overlapping with those of the subjects without a *gr/gr* deletion. View this table: [Table 2.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/T2) Table 2. Summary of the identified Y-chromosomal *AZF* deletion subtypes. The patient and the reference groups exhibited similar prevalence of *b2/b3* deletions (388/1190, 32.6% *vs*. 367/1134, 32.4%; Fisher’s exact test, *p*=0.8). No apparent clinically meaningful genetic effects on andrological parameters were observed in either of the study groups (**Table S3-S4**). ### Significant overrepresentation of Y lineage R1a1-M458 in *gr/gr* deletion carriers The Y-chromosomal haplogroups determined in 31 patients and 13 reference men carrying a *gr/gr* deletion represented 20 different lineages (patients, 17; reference men, 10; **Figure 2A, Table S5a**). Combining the phylogenetic context with the data on exact missing *DAZ* and *CDY1* gene copies (*see below*) revealed that the *gr/gr* deletion events in 44 analyzed cases must have independently occurred at least 26 times. About two-thirds of these Y chromosomes belonged to haplogroup R1, whereas the rest represented A1b, G, I and J lineages. Notably, there was a highly significant overrepresentation of Y chromosomes belonging to lineage R1a1-M458 in the *gr/gr* deletion carriers compared to the known Estonian population frequency (22.7% *vs* 5.1%; Fisher’s exact test, *p*=5.3 x 10−4, OR = 5.5 [95% CI 2.2-13.7]; **Figure 2B, Table S5b)** (*26*). ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F2) Figure 2. Phylogenetic relationships and gene copies in study subjects with partial *AZFc* deletions. **(A)** Y-chromosomal lineages indicated with typed terminal markers (left), deleted (white)/retained (black) *DAZ* and *CDY1* gene copies (middle) and secondary rearrangements in the *AZFc* region (right) of idiopathic male factor infertility (n=31) and reference cases (n=13) carrying the *gr/gr* deletion. The human Y-chromosomal reference sequence has four *DAZ* and two *CDY1* copies; the retained gene copies on each Y chromosome with a *gr/gr* deletion are shown as filled boxes. Chromosomes carrying atypical *gr/gr* subtypes with the loss of either the *DAZ1/DAZ3* or *DAZ2/DAZ4* gene pair due to complex genomic rearrangement combining the previous *r2/r3* inversion with a subsequent *gr/gr* deletion, are highlighted with a dashed gray square. **(B)** Enrichment of the Y-chromosomal lineage R1a1-M458 and its sub-lineages in study subjects carrying the *gr/gr* deletion in comparison to the Estonian general population (data from (*26*)). Fisher’s exact test was used to test the statistical significance between the groups. **(C)** Y-chromosomal lineages indicated with typed terminal markers (left) and the copy number of the *DAZ, BPY2* and *CDY1* gene copies (right) determined for 382 idiopathic male factor infertility cases carrying the *b2/b3* deletion. The light gray box denotes *DAZ, BPY2* and *CDY1* gene dosage consistent with full *b2/b4* duplication(s). The legend for the deletion subtype is shown in the bottom right corner. Further information on the distribution of Y-chromosomal lineages in the carriers of *AZFc* partial deletions are provided in **Tables S5-S6** and the *AZFc* rearrangement types are detailed in **Figure S1** and **Tables S7-S11**. n, number; n.a., not available; Ref, reference cases Nearly all (99.4%) Estonian cases with the *b2/b3* deletion belonged to the Y haplogroup N3, in which this event is fixed (*13, 15*). The most commonly detected sub-lineage was N3a3a-L550 (∼51% of 436 typed chromosomes) and in total 15 different haplogroups that had diverged after the *b2/b3* deletion event in the common ancestor of N3 were present in Estonian men (**Figure 2C, Table S6**). *b2/b3* Y chromosomes representing non-N3 lineages were detected in two patients and three reference men. Lineage typing was possible for three of them, who carried either K-M9 (one patient) or R1a1a1b1a1a1c-CTS11962.1 (one patient and one reference case). ### Increased prevalence of *b2/b3* deletion followed by *b2/b4* duplication in infertile men The expected retained copy number of *DAZ, BPY2* and *CDY1* genes consistent with the typical *gr/gr* deletion was found in 37/44 (∼84%) cases (**Figure 2A, Table 2, Table S7)**. Three patients and three reference men carried a secondary *b2/b4* duplication adding one or more amplicons of [two *DAZ* – two *BPY2* – one *CDY1*] genes with no apparent effect on infertility status (Fisher’s exact test, *p*=0.34). Notably, four of six samples with secondary *b2/b4* duplication events were identified in haplogroup I. Similarly, 78.5% of patients and 84.3% reference men with the *b2/b3* deletion presented gene dosage consistent with the typical deletion (**Figure 2C, Table 2, Table S7-S8**). Indicative of recurrent secondary events, one or more *b2/b4* duplications of [two *DAZ* – one *BPY2* – one *CDY1*] genes were identified in 13 haplogroups, including non-N3 lineages K-M9 and R1a1a1b1a1a1c-CTS11962.1. Although secondary *b2/b4* duplications were detected with significantly higher prevalence in patients compared to the reference men (n=78/382, 20.4% vs n=35/249, 14.1%; Fisher’s exact test, *p*=0.026, OR = 1.57 [95% CI 1.02 - 2.42]), no consistent effect of increased gene copy number on andrological parameters was observed (**Tables S3-S4**). Reference men with *b2/b4* duplication compared to subjects with no *AZFc* rearrangements showed a trend for lower FSH (median 2.3 [5-95% range 1.4-7.5] *vs* 3.2 [1.3-7.1] IU/L; *P*<0.05) and LH (3.1 [1.7-5.0] *vs* 3.8 (1.7-7.2) IU/L; *P*<0.05). Additionally, in eight subjects with *AZFc* partial deletions, further atypical Y-chromosomal genomic rearrangements were detected, but also with no clear evidence for a phenotypic effect (**Table 2, Table S7**). The data gathered from this analysis thus suggest that the dosage of *DAZ, BPY2* and *CDY1* genes does not play a major role in modulating the pathogenic effect of the *gr/gr* and *b2/b3* deletions. ### No specific *DAZ* or *CDY1* gene copy is lost in men with spermatogenic failure The major *b2/b3* deletion subtype in both patients (99.7%) and reference cases (98.1%) was the loss of *DAZ3*-*DAZ4-CDY1a* genes, whereas the most frequent *gr/gr* deletion subtypes were the loss of *DAZ1-DAZ2-CDY1a* (41.9%, 18/43 cases) and *DAZ1-DAZ2-CDY1b* (25.6%, 11/43 cases) combinations (**Figure 2A, Table S9-S10**). The observed prevalence of the major *gr/gr* subtypes was concordant with the published data on other European populations (42.5% and 25.5%, respectively; (*17*)). As these *gr/gr* deletion subtypes are prevalent in the reference group (total 11 of 13, 84.6%), their major role in spermatogenic impairment can be ruled out. As a novel insight, a subset of these Y chromosomes showed lineage-specific loss of some exon 7 subtypes of the retained *DAZ4* gene (**Figure S2; Table S10**). All five exons 7Y were missing in the Y chromosomes with the *DAZ1-DAZ2-CDY1a* deletion that had occurred in sub-lineages of the R1a1a1b1a2 haplogroup (9/31 patients, 3/13 reference men). The exon 7F was lost in haplogroup I1 and its sub-lineages (5/31, 2/13) carrying the *DAZ1-DAZ2-CDY1b* deletion. There was no evidence that loss of *DAZ4* exons 7Y or 7F has any phenotypic consequences. Most likely, this observation reflects gene conversion events from *DAZ3* to *DAZ4* as the former lacks both, exons 7Y and 7F. Taken together, our findings indicate that neither the loss of the *DAZ1-DAZ2* nor the *DAZ3-DAZ4* gene pair, combined with either a *CDY1a* or *CDY1b* gene, directly causes spermatogenic failure. Interestingly, no Y chromosomes were observed with fewer than two retained *DAZ* genes. ### Y lineage R1a1-M458 carries a fixed *r2/r3* inversion predisposing to recurrent deletions Novel atypical *gr/gr* and *b2/b3* deletion subtypes with the loss of an unusual *DAZ* gene pair were identified (**Figure 2A, Table 2, Table S9-S11**). Eight patients and two reference cases with a *gr/gr* deletion were missing *DAZ2-DAZ4* genes. Loss of *DAZ1-DAZ3* genes followed by a subsequent *b2/b4* duplication event was identified in one infertile and one reference case with either *gr/gr* or *b2/b3* deletion, respectively. All but one subject with this atypical pair of lost *DAZ* genes belonged to the Y haplogroup R1a1-M458 and its sub-lineages, significantly enriched in *gr/gr* deletion carriers (**Figure 2B, Table S5b**). The most parsimonious explanation to explain the simultaneous deletion of either *DAZ1-DAZ3* or *DAZ2-DAZ4* genes is a preceding ∼1.6 Mb long inversion between the *r(ed)2* and *r(ed)3* amplicons (**Figure 3A**). This new inverted structure might be more susceptible to recurrent deletions as it has altered the internal palindromic structure of *AZFc* region. In *r2/r3* inversion chromosomes, the largest palindrome P1 is almost completely lost and the size of the palindrome P2 is greatly expanded by positioning the homologous *g1/g2* segments in an inverted orientation. The *r2/r3* inversion is consequently expected to destabilize the *AZFc* region as several long DNA amplicons with highly homologous DNA sequence are positioned in the same sequence orientation (*b2, b3* and *b4; g2* and *g3; y1* and *y3*). Therefore, they are prone to non-allelic homologous recombination mediating recurrent deletions and duplications. Since these atypical deletion subtypes were identified only in a specific Y-chromosomal haplogroup, the detected *r2/r3* inversion must have occurred only once in the common ancestor of R1a1-M458 sub-lineages. One patient with the loss of *DAZ2-DAZ4* carried haplogroup R1a1a1-M417, an ancestral lineage to R1a1-M458 (**Figure 2A**). However, lineage R1a1a1-M417 is not fixed for this inversion since its other sub-lineage, R1a1a1b1a2, does not carry it and any subsequent inversion restoring the exact original *AZFc* structure is not credible. The more parsimonious explanation is that the inversion occurred in a sub-lineage of R1a1a1-M417 that has to be yet determined. ![Figure 3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F3) Figure 3. Complex structural variants at the Y-chromosomal lineage R1a1-M458 and their effect on andrological parameters. **(A)** Schematic presentation of the Y chromosome with the *r2/r3* inversion compared to the reference sequence. The *r2/r3* inversion structure nearly destroys the large palindrome P1 and, consequently, destabilizes the *AZFc* region since several long DNA amplicons with highly similar DNA sequence (*b2, b3* and *b4*; *g2* and *g3*; *y1* and *y3*) are positioned in the same sequence orientation. This structure promotes non-allelic homologous recombination mediating recurrent deletion and duplication events. The approximate regions removed by the identified *gr/gr* and *b2/b3* deletions arising on the *r2/r3* inverted Y chromosome are shown as dashed lines. **(B)** Distribution of andrological parameters in the idiopathic male factor infertility cases (total sperm counts 0 – 39 x 106) subgrouped based on the structure of the *AZFc* region. The median (5–95% range) of each parameter is shown for cases carrying the *r2/r3* inversion plus secondary deletions compared to infertile men without any *AZFc* deletion. The Pairwise Wilcoxon Rank sum test was applied to estimate the statistical difference between groups. Threshold values (shown in blue) for sperm parameters corresponding to severe spermatogenic failure are based on international guidelines (*27*). For reproductive hormones, reference values of the laboratory service provider are shown. The empirical threshold for the total testis volume was based on routinely applied clinical criteria at the AC-TUH. For full details see **Table S3 and Table S11.** **(C)** The majority of idiopathic infertility cases carrying the *r2/r3* inversion plus secondary *AZFc* partial deletions (total n=10) exhibit severe oligoasthenoteratozoospemia (OAT) defined as extremely reduced sperm counts (<5 x 106/ml) and concentration (<10 x 106/ejaculate) combined with low fraction of sperms with normal morphology (<4% normal forms) and motility (<32% progressive motile spermatozoa). Reference values for andrological parameters have been applied as referred in sub-figure B. As total testis volume is mostly within the expected range, their infertility is not caused by intrinsic congenital testicular damage but rather due to severe spermatogenic failure *per se*. Del, deletion, inv, inversion, dupl, duplication, n, number; sec, secondary, mill, million, ej., ejaculate Based on the Y-chromosomal phylogenetic data, one additional patient was identified as an obligate carrier of the *r2/r3* inversion as his Y chromosome represents the lineage R1a1a1b1a1a1c-CTS11962.1 that was also identified in two cases with the *r2/r3* inversion. This patient exhibited signs of unusual deletion and duplication events in the *AZFc* region as he carried six *DAZ*, four *BPY2* and three copies of the *CDY1* gene (**Figure 2B, Table 2, Table S8**). Among the analyzed 2,324 men, 13 cases with the complex *AZFc* rearrangement combining *r2/r3* inversion with a subsequent deletion, represented 0.6% (**Table 3**). Considering the reported population prevalence of R1a1-M458 lineage in Estonians (5.1%; (*26*)), the estimated number of subjects representing this Y lineage in the study group was ∼119. Thus, approximately one in ten chromosomes with the *r2/r3* inversion had undergone a subsequent deletion event (13/119, 11%). View this table: [Table 3.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/T3) Table 3. Enrichment of the *AZFc r2/r3* inversion followed by a partial *AZFc* deletion in men with severe spermatogenic failure ### *r2/r3* inversion promotes recurrent deletions that lead to severe oligoasthenoteratozoospermia Idiopathic infertility cases carrying the *r2/r3* inversion with a subsequent deletion in the *AZFc* region (n=10) exhibited extremely low sperm counts compared to subjects without any *AZFc* deletions (median 2.0 *vs* 12.5 x 106/ejaculate; Wilcoxon test, *p*=0.011) (**Figure 3B, Figure S3, Table S3)**. Nine of 10 men showed severe spermatogenic failure (total sperm counts <10 x 106/ejaculate), either azoo- (n=1), crypto- (n=3) or severe oligozoospermia (n=5) (**Table S10-S11**). They also showed consistently the poorest sperm concentration (median 1.0 x 106/ml) and progressive motility (13%), as well as the lowest semen volume (2.3 ml) compared to the rest of analyzed infertile men. The data suggests that extreme oligoasthenoteratozoospermia (OAT) observed in these subjects was due to the severely affected process of spermatogenesis, whereas their testicular volume and hormonal profile were within the typical range of male factor infertility cases (**Figure 3C**). When all the men with andrological data (n=2,000) were stratified based on sperm counts, there was a highly significant enrichment of the *r2/r3* inversion with a subsequent deletion in men with severe spermatogenic failure (sperm counts 0 – 10 x 106) compared to the rest (1.7% *vs* 0.2%, Fisher’s exact test, *p*=6.0 x 10−4, OR=8.6 [95% CI 2.3 – 31.8]; **Table 3**). The estimated number of phenotyped subjects representing the Y haplogroup R1a1-M458 with the fixed *r2/r3* inversion was 102 (based on population prevalence 5.1%; (*26*)). Among carriers of this Y lineage, 33.7% of men with sperm counts 0–10 x 106 (9/27), but only 4.0% with sperm counts >10 x 106 (3/72) had undergone a subsequent *AZFc* partial deletion (Fisher’s exact test, *p*= 3.0 x 10−4, OR = 12.0 [95% CI 2.9 - 48.9]). Only three reference cases carried a Y chromosome with the *r2/r3* inversion with a subsequent *AZFc* partial deletion. At the time of phenotyping, all three subjects were younger (aged 18, 21 and 23 years) than the variant carriers in the idiopathic infertility group (median 32.4, range 26-51 years) (**Table S11**). The only reference subject with this complex *AZFc* rearrangement, but unaffected sperm analysis was the youngest (18 years). Notably, another reference man (23 years) with andrological data would actually be classified, based on WHO guidelines (*27*), as an oligozoospermia case (sperm concentration 12 x 106/ml *vs* threshold 15 x 106/ml). Also, his total sperm counts (39.4 x 106/ejaculate) represented a borderline value. ### Sequence diversity of the retained *DAZ, BPY2* and *CDY* genes is extremely low and has no detectable effect on sperm parameters The re-sequenced retained *DAZ1-4, BPY2* and *CDY1-2* genes were characterized by extremely low nucleotide variability in all Y-chromosomal lineages and deletion subtypes (**Table S12)**. For 476 samples (*gr/g*r, n=40; *b2/b3*, n= 436) re-sequenced for the >94 kb region, a total of 42 variants were identified with median 0.8 variants/kb and maximum two variants per individual. Most of them were previously undescribed (41), singletons (31) and/or non-coding SNVs/short indels (36) (**Table S13)**. The *CDY2a-CDY2b* genes harbored only one variable site, whereas *DAZ1-DAZ2* carried 24 or 26 SNVs/indels. Most variants appeared paralogous as both the reference and alternative alleles were identified. Among the four detected missense variants, CDY1b p.T419N was fixed in all three *CDY1b* copies present on the Y chromosome with the *b2/b3* deletion plus *b2/b4* duplications that represented an oligozoospermia case. However, the effect of this conservative substitution is unclear. There was thus no evidence that the sequence variation in *DAZ, BPY2* and *CDY* genes has any effect on infertility related parameters in the subjects examined. ## Discussion We conducted a comprehensive investigation of partial deletion subtypes of the Y-chromosomal *AZFc* region in 2,324 Estonian men, approximately half with idiopathic spermatogenic impairment (n=1,190) in comparison to the reference group (n=1,134). Importantly, 2,000 men had undergone full and uniformly conducted andrological workup at a single clinical center, facilitating fine-scale genotype-phenotype analysis. Previously, no study had undertaken re-sequencing of the retained *DAZ, BPY2* and *CDY* genes along with the assessment of the Y haplogroup, dosage and retained/deleted genes in the *gr/gr* or *b2/b3*-deleted chromosomes in both infertile men and controls. Concordant with reports from other European populations, the *gr/gr*, but not the *b2/b3* deletion, is a risk factor for spermatogenic impairment in Estonian men with >2-fold increased susceptibility to infertility (**Figure 1D**). However, the gathered data on the large group of reference men in the current study demonstrated the existence of Y chromosomes carrying a *gr/gr* deletion without any documented effect on andrological parameters (**Table S3**). As a novel finding, the study uncovered complex *AZFc* rearrangements within a specific Y haplogroup, R1a1-M458 and its sub-lineages, causing severe spermatogenic failure in the majority of carriers (**Figure 3, Table 3**). This Y lineage has undergone a ∼1.6 Mb *r2/r3* inversion in the *AZFc* region that has disrupted the structure of the palindromes P1 and P2, promoting subsequent recurrent deletions and consequently, severely impaired the process of spermatogenesis. Consistent with key early observations (*14, 17, 23*), this study supports the recurrent nature and high subtype diversity of the *AZFc* partial losses that are currently considered jointly under the umbrella term ‘*gr/gr* deletions’. The 44 detected *gr/gr* deletions in our study sample were estimated to have originated independently at least 26 times across the Y phylogenetic tree and include seven different combinations of *DAZ* and *CDY1* gene losses. Apparently, there is a substantial undescribed heterogeneity in the spread and structure of *gr/gr* deletions that in turn results in phenotypic variability of the genetic effects. Unexpectedly, one in four Estonian *gr/gr* deletion carriers belonged to the Y-chromosomal haplogroup R1a1-M458 (and its sub-lineages) (**Figure 2A-B**; 22.7% vs 5.1% reported as the Estonian population frequency (*26*)). Notably, a previous study has reported a significant enrichment of the haplogroup R1a (ancestral lineage to the R1a1-M458) among *gr/gr*-deleted chromosomes in the Polish population (*14*), which has a high prevalence, 25%, of R1a1-M458 (*26*) (**Table S14**). All the Estonian *gr/gr* cases and also additional *b2/b3* deletion chromosomes representing this Y lineage carried unusual retained *DAZ* gene pairs (*DAZ1-DAZ3* or *DAZ2-DAZ4*) in combination with either *CDY1a* or *CDY1b* gene copy (**Figure 2**). These complex *AZFc* rearrangements were best explained by a preceding (and apparently fixed in R1a1-M458) ∼1.6 Mb inversion between the homologous *r2* and *r3* amplicons, followed by recurrent secondary partial *AZFc* deletions (**Figure 3A**). The latter are facilitated by large ampliconic segments positioned in the same orientation. Inversions in the *AZFc* region are not uncommon, but none of the previously described inversions is expected to substantially disrupt the core palindromic structure of the *AZFc* region (**Figure S1**) (*13, 23*). In contrast, the *r2/r3* inversion disrupts the structure of palindrome P1 and expands the size of the P2 palindrome more than two-fold (**Figure 3A**). The critical role of intact P1-P2 palindromes in the *AZFc* structure is supported by the observation that no Y chromosomes have been described with a single *DAZ* gene copy, whereas the inverted *DAZ* gene pairs form the ‘heart’ of both P1 and P2. As a likely scenario, this complex *AZFc* rearrangement may predispose to spermatogenic impairment through substantial destabilization of the intra-chromosomal structure affecting meiotic recombination and chromosomal segregation. Among 12 Estonian subjects carrying the *AZFc r2/r3* inversion followed by partial deletions and with available data for sperm counts, nine cases exhibited severe spermatogenic failure, two had moderate oligozoospermia and only one case (aged 18 years) was normozoospermic (**Figure 3B-C**). This represented ∼8-9-fold enrichment of this complex rearrangement among men with severely reduced sperm counts (0 – 10 x 106; **Table 3**). This genetic effect was observed specifically on the effectiveness of spermatogenesis, whereas the measurements of bitesticular volume and reproductive hormone levels did not stand out among the rest of analyzed infertile men. To our knowledge, no other Y-lineage specific risk variants for spermatogenic impairment have been reported so far. Previously, the *DAZ2-DAZ4* deletion had been shown as a high-risk factor for male infertility in the Tunisian population, but the Y haplogroups of those subjects was not investigated (*24*). The survival of such a high-risk lineage in the population seems at first sight surprising, but may be accounted for by its possible age-specific effects on spermatogenesis, which may be exacerbated by the recent general decline in sperm count (*28*). In the past, this lineage may not have been disadvantageous. This study outcome has notable clinical implications for the improvement of molecular diagnostics and reducing the proportion of idiopathic male factor infertility cases. In Northern and Central Europe, the prevalence of R1a1-M458 haplogroup carrying the *r2/r3* inversion ranges from ∼1% in the Netherlands and Denmark to ∼2-5% in Austria, Hungary, Germany, Baltics and most Balkan countries, whereas it is widespread in Slavic populations and carried by 12-26% of men (**Figure 4, Table S14**)(*26*). In these populations, recurrent secondary *AZFc* partial deletions on Y chromosomes representing the R1a1-M458 haplogroup (and its sub-lineages) may potentially explain from 0.3% up to ∼9% of cases presenting severe spermatogenic impairment (sperm counts <10 million per ejaculate). Further studies in other populations and large samples of patients and normozoospermic controls are required to fully establish the value of extending the current recommended testing of Y-chromosomal deletions by including the analysis of this novel Y-lineage-specific pathogenic *AZFc* rearrangement (a detailed protocol is provided in **Supplemental Subjects and Methods**). ![Figure 4.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F4) Figure 4. The prevalence of the Y-chromosomal haplogroup R1a1-M458 carrying a fixed *r2/r3* inversion. Population frequencies of the haplogroup were derived from (*26*). The proportion of subjects carrying this Y lineage (and its sub-lineages) and who had undergone a subsequent secondary *AZFc* deletion among idiopathic cases with severe spermatogenic failure (sperm counts 0–10 x 106/ejaculate) was estimated for each population based on data from the current study in the Estonian population. For full details see **Table S14**. The evidence from the literature has shown that the increased prevalence of either *gr/gr* or *b2/b3* deletions in infertility cases appears to be population-dependent (*16, 20*). It can be speculated that also in other populations some specific Y lineages may carry *AZFc* structural variants that in combination with partial deletions (or other rearrangements) predispose to chromosomal instability in the complex process of spermatogenesis involving multiple well-coordinated cell divisions. So far, the largest conducted study on the Y-chromosomal phylogeny of *gr/gr* deletion carriers included 152 infertile subjects representing seven countries with different population genetic structures (*17*). However, the number of cases per population was low and the study included only 17 fertile men. Also, the study did not include fine-scale analysis of Y sub-lineages and the retained gene content. Long-range re-sequencing of the whole *AZFc* region in large numbers of men would be the preferred approach to uncover its structural complexity. Additional pathogenic *AZFc* rearrangements may also exist among Estonian infertile men. When omitting the cases carrying a *gr/gr* deletion at the *r2/r3* inversion background, a non-significant enrichment of the remaining *gr/gr* deletion chromosomes can be observed in patients compared to reference men (1.9 *vs* 1.1%, *p* < 0.1; **Table 2**). In addition to the main finding, our deep re-sequencing dataset revealed that neither the dosage, sequence variation nor exact copy of the retained *DAZ, BPY2* and *CDY1* gene showed any detectable effect on spermatogenic parameters. All chromosomes with *AZFc* partial deletions exhibit extremely low overall sequence variation of the retained *DAZ, BPY2* and *CDY* genes. This observation is consistent with previous reports showing low levels of genetic diversity of the human Y chromosome (*29*) and suggesting that novel variants may be rapidly removed by active gene conversion among Y-chromosomal duplicate genes or selective constraint (*30-32*). Among the re-sequenced 382 chromosomes with *b2/b3* deletions, no pathogenic mutations were detected in the single retained *BPY2* and *CDY1* gene copies. At the same time, the high rates of large structural rearrangements and copy number variation in the Y chromosome are well established, contrasting with low levels of sequence variation (*19, 33*). One in five or six Estonian Y chromosomes with *gr/gr* and *b2/b3* deletions had undergone secondary rearrangements with no apparent effect on tested andrological parameters and fertility potential (**Table 2**). In the literature, the data about the effects of secondary duplications after an initial *AZFc* partial deletion on sperm parameters are inconclusive. Some studies have suggested increased pathogenicity (*34-38*), whereas others have reported neutral or even positive effects on spermatogenesis (*17, 39-41*). However, further copy number reductions in this genomic region appear to be very rare - none of the 44 *gr/gr* or 631 *b2/b3* deletion carriers were identified with further reductions beyond what is expected from the initial deletion. In summary, we have undertaken a comprehensive study of the carriers of *AZFc* partial *gr/gr* and *b2/b3* deletions, and uncovered high levels of structural variation in the *AZFc* locus, but low sequence diversity of the coding genes within the region. As a major finding, we discovered a large inversion specific to the Y lineage R1a1-M458 that represents a hotspot for subsequent *AZFc* partial deletions. Men carrying Y chromosomes with this complex rearrangement have >10-fold increased risk of severe spermatogenic failure, but the consequences of this risk could be potentially alleviated by early identification of the variant carriers and facilitating the storage of their sperm samples. Our study results thus have the potential to improve clinical diagnostics and management of idiopathic impaired spermatogenesis in a significant fraction of men originating from Northern and Central European populations. ## Subjects and Methods ### Ethics statement The study was approved by the Ethics Review Committee on Human Research of the University of Tartu, Estonia (permissions 146/18, 152/4, 221/T-6, 221/M-5, 272/M-13, 267M-13, 286M-18, 288M-13), and sequencing/genotyping was approved at the Wellcome Sanger Institute under WTSI HMDMC 17/105. Written informed consent for evaluation and use of their clinical data for scientific purposes was obtained from each person prior to recruitment. All procedures and methods have been carried out in compliance with the guidelines of the Declaration of Helsinki. ### Study subjects Patients with idiopathic spermatogenic impairment (n=1,190) were recruited at the Andrology Centre at Tartu University Hospital (AC-TUH) in 2003-2015 (PI: M. Punab). Included cases showed reduced sperm counts (<39 x 106/ejaculate) in at least two consecutive semen analyses (*27*). Recruitment and sampling, semen analyses and hormone assays and definition of idiopathic cases have previously been described in detail (*3*). Men with known causes of male infertility detected during routine diagnostic workup were excluded, e.g. cryptorchidism, testicular cancer, orchitis/epididymitis, mumps orchitis, testis trauma, karyotype abnormalities and complete Y-chromosomal microdeletions. The final idiopathic infertility group included 104 azoospermia (no sperm), 88 cryptozoospermia (sperm counts >0–1 x 106/ejaculate), 319 severe oligozoospermia (1–10 x 106) and 679 moderate oligozoospermia (10–38 x 106) cases (**Table 1, Table S1**). The reference sample of Estonian men (n=1,134) comprised healthy young men (n=499) and subjects with proven fatherhood (n=635) (**Table 1, Table S1**). The cohort of ‘Estonian young men’ (n=499) was recruited at the AC-TUH in 2003-2004 (PI: M. Punab), representing a healthy male group with median age 18.6 (17.2-22.9) years at the time of recruitment (*42*). The subgroup of ‘Partners of pregnant women’ (n=324) includes male partners of pregnant women, recruited in 2010-2014 at the Tartu University Hospital and the West Tallinn Central Hospital (*3*). All participants in these subgroups were offered complete andrological workup at the AC-TUH; thus, 810 men (of 823) underwent sperm analysis. Further clinical phenotyping details are provided in **Supplemental Data**. The subgroup of ‘REPROMETA proven fathers’ (n=311) was recruited in 2006-2011 at the Women’s Clinic at Tartu University Hospital during the REPROMETA study (PI: M. Laan), originally designed to collect mother-father-placenta trios at delivery to investigate genetics of pregnancy complications (*43, 44*). In this study, the REPROMETA fathers represented reference men with proven fertility. Only self-reported age and BMI data were available for this subgroup. ### Genotyping Y-chromosomal microdeletions All study subjects (n=2,324) were typed for complete *AZFa* (loss of markers sY84 and sY86), *AZFb* (sY127 and sY134), *AZFc* (sY254 and sY255), and partial *AZFc* deletions *gr/gr* (sY1291), *b2/b3* (sY1191) and *b1/b3* (sY1161, sY1191 and sY1291) following established multiplex PCR protocols and PCR primers (*7, 45*) (**Table S15**). ### Re-sequencing of retained *DAZ, BPY2* and *CDY* genes using Illumina MiSeq Re-sequencing of the exonic regions of the retained *AZFc* genes (according to Ensembl release 84) in 476 cases with either *gr/gr* or *b2/b3* deletions targeted in total 94,188 bp per subject. *CDY, BPY2* and *DAZ* genes were amplified using eight, ten or 26 PCR primer pairs, respectively (**Table S16-S17**). The presence of all amplicons was confirmed using gel electrophoresis. Amplicons were pooled in equimolar concentrations, barcoded per sample, and sequenced (250 bp reads, paired-end) on Illumina MiSeq with at least 40 × coverage. BWA (v0.7.15) (*46*) was implemented to map the sequencing reads to a modified human genome reference (GRChg38), where *CDY1a, CDY2a, BPY2a* and either *DAZ3-DAZ4* (*gr/gr* carriers) or *DAZ1-DAZ2* (*b2/b3* carriers) remained unchanged, but the sequences of other *CDY, BPY2* and *DAZ* gene copies were replaced with ‘Ns’. SNVs and indels were identified using GATK HaplotypeCaller (v3.7) with a minimum base quality 20 and outputting all sites (*47*). Y-chromosomal phylogenetic markers were called using bcftools (v1.8) with minimum base quality 20, mapping quality 20 and defining ploidy as 1. Re-sequencing included 31 patients and 9 reference men with *AZFc gr/gr* deletions. Six men carrying *gr/gr* deletions were not analyzed due to DNA limitations (2 cases) or unavailable andrological data (4 cases). The analysis of *b2/b3* deletion carriers included 382 patients (haplogroup N3: n=380; non-N3, n=2) and 54 ‘Partners of pregnant women’ (N3: n=53; non-N3, n=1). ### Y-chromosomal haplogroup typing Y lineages of the *gr/gr* samples were defined using 14 markers included in the re-sequencing, plus 34 additional markers determined by Sanger sequencing or restriction fragment length polymorphism (RFLP) analysis (**Table S17-S18**). The *b2/b3*-deletion carriers were typed for Y marker N3-M46 (Tat) (*48*). The sub-lineages of the re-sequenced haplogroup N3 samples were defined in more detail using 16 phylogenetic markers from the Illumina MiSeq dataset, following established nomenclature (*22, 49*). For the other haplogroups, nomenclature according to the International Society of Genetic Genealogy (ISOGG, version 14.14) was followed. ### Determination of *DAZ, BPY2* and *CDY* gene dosage and gene types The Bio-Rad QX 200 Droplet Digital PCR system was used to quantify the copy numbers of the retained *DAZ, BPY2* and *CDY* genes for 44 *gr/gr* deletion carriers (31 cases, 13 reference men with sperm analysis data) and 631 *b2/b3* carriers (382 cases/249 reference men). PCR primers and probes for gene copy dosage detection are provided in **Table S19** and methodological details in **Supplemental Data**. The re-sequencing data of the *DAZ* genes covered nine paralogous sequence variants (PSVs) that were used to determine the retained gene copies in the *gr/gr* and *b2/b3* deletion carriers (**Table S20**). For the validation of the *DAZ* gene copy mapping approach, at least five *gr/gr* carriers were additionally typed for published SNV combinations differentiating the *DAZ* gene copies (*23, 50*). The retained *CDY1* gene was identified according to Machev et al. (*23*). ### Genetic association testing with andrological parameters Statistical testing for the associations between *AZFc gr/gr* or *b2/b3* deletions and andrological parameters was conducted using RStudio (version 1.2.1335) and data were visualised using ggplot2 (version 3.2.1) (*51*). Differences in continuous clinical variables between groups were compared using the non-parametric Pairwise Wilcoxon Rank Sum Test. Genetic association with the carrier status of *b2/b3* deletion and its subtypes was also tested using linear regression analyses (**Supplemental Data**). ## Supporting information Supplementary Tables [[supplements/245928_file02.xlsx]](pending:yes) ## Data Availability Illumina MiSeq re-sequencing data are available through the European Genome-phenome Archive (EGA, [https://www.ebi.ac.uk/](https://www.ebi.ac.uk/)) under the accession number: EGAS00001002157 [https://www.ebi.ac.uk/ega/home](https://www.ebi.ac.uk/ega/home) ## Supplemental Data Supplemental Data include Supplemental Subjects and Methods, four figures and 20 tables. ## Author contributions P.H. conceived and M.P., Y.X., C.T-S. and M.L. supervised the study. M.P. lead, O.P. and P.K. contributed critically to the patient recruitment, phenotyping and clinical data documentation. P.H., L.K. and E.A. performed experiments, P.H., L.K. and M.L. performed analyses. M.G., S.R. and R.F. provided data and assisted with analyses. M.P., K.R., K.M. and M.L. provided DNA samples. P.H., C.T-S., Y.X., M.L. and M.A.J. provided resources. P.H. and M.L. drafted the manuscript with contributions from all other authors. All authors contributed to the final interpretation of data and critical reading of the manuscript. ## Declaration of Interests The authors declare no competing interests. ## Data Availability Illumina MiSeq re-sequencing data are available through the European Genome-phenome Archive (EGA, [https://www.ebi.ac.uk/](https://www.ebi.ac.uk/)) under the accession number: EGAS00001002157 ## Web resources International Society of Genetic Genealogy (ISOGG) [https://isogg.org](https://isogg.org) RStudio: [http://www.rstudio.com/](http://www.rstudio.com/) Primer3plus: [https://primer3plus.com](https://primer3plus.com) Ensembl: [https://www.ensembl.org/](https://www.ensembl.org/) Variant Effect Predictor tool (VEP): [https://www.ensembl.org/Tools/VEP](https://www.ensembl.org/Tools/VEP) ## Supplemental Subjects and Methods ### Details on the clinical assessment of the study subjects All men, who had turned to Andrology Centre, Tartu University Hospital (AC-TUH) due to idiopathic infertility (n=1,190), as well as the participants of the ‘Estonian young men’ cohort (n=499) and the subgroup ‘Partners of pregnant women’ (n=324) were offered complete routine andrological workup. The subjects were examined by specialist andrologists at the AC-TUH, who had received respective training in clinical assessment and standardized andrological workup, locally and in collaboration with other European Andrology Academy (EAA) accredited centers. Also, anthropometric parameters were documented during clinical examination. Details are described in (*1*). Physical examination for the assessment of genital pathology and testicular size (orchidometer; made of birch wood, Pharmacia & Upjohn, Denmark) was performed with the patients in standing position. The total testes volume is the sum of right and left testicles. The position of the testicles in the scrotum, pathologies of the genital ducts (epididymis and ductus deference) and the penis, urethra, presence and if applicable grade of varicocele were registered for each subject. For 2,000 study subjects sperm analysis was performed, whereas 13 reference cases did not agree with this procedure. Semen samples were obtained by patient masturbation and semen analysis was performed in accordance with the World Health Organization (WHO) recommendations. In brief, after ejaculation, the semen was incubated at 37°C for 30–40 min for liquefaction. Semen volume was estimated by weighing the collection tube with the semen sample and subsequently subtracting the predetermined weight of the empty tube assuming 1 g = 1 mL. For assessment of the spermatozoa concentration, the samples were diluted in a solution of 0.6 mol,L NaHCO3 and 0.4% (v,v) formaldehyde in distilled water. The spermatozoa concentration was assessed using the improved Neubauer haemocytometers. Genomic DNA was extracted from EDTA-blood. After blood draw in the morning, serum and plasma fractions were separated immediately for hormone measurements (FSH, LH, testosterone). All laboratory analyses and routine genetic testing (karyotyping, Y-chromosomal microdeletions) were performed at the United Laboratories of Tartu University Hospital according to the established clinical laboratory guidelines. Detailed methodology and reference values for hormonal levels are available by the service provider: [https://www.kliinikum.ee/yhendlabor/analueueside-taehestikuline-register](https://www.kliinikum.ee/yhendlabor/analueueside-taehestikuline-register). ### Analysis of variant effects from the Illumina MiSeq dataset Variant effect prediction was performed using the Variant Effect Predictor tool (VEP, [https://www.ensembl.org/Tools/VEP](https://www.ensembl.org/Tools/VEP), Ensembl release 99) (*2*). The Combined Annotation Dependent Depletion (CADD) score ≥ 20, i.e., including variants among the top 1% of deleterious variants in the human genome, was considered indicative of potential functional importance of identified SNVs in the coding regions (*3*). ### *DAZ, BPY2* and *CDY* copy number detection using Droplet Digital PCR The PCR primers and probes were designed using Primer3plus (version 2.4.2), PCR reactions were performed according to the recommendations in the Droplet Digital PCR Application Guide (Bio-Rad, U.S.) (**Table S19**) and described in (*4*). A XQ200 Droplet Reader was used to measure the fluorescence of each droplet and QuantaSoft software (v1.6.6.0320; Bio-Rad) to cluster droplets into distinct fluorescent groups. The copy number of each gene was determined by calculating the ratio of target (unknown - *DAZ, BPY2* or *CDY*) and reference (single-copy *SRY* gene) concentration. ddPCR reactions for each gene were performed once for every sample. For samples carrying the *gr/gr* deletion, if the copy number obtained differed from the expected (two copies of *DAZ* and *BPY2*, three copies of *CDY*), then the ddPCR reaction was repeated. For *b2/b3* carriers typing was repeated for all samples not carrying the two most typical copy numbers (2-1-3 or 4-2-4 copies of *DAZ, BPY2* and *CDY* genes, respectively). Additionally, a total of 5% of random samples were replicated. If the copy number estimates between replicates differed by 0.8 or more, then a third replicate was performed, and the final copy number was calculated as average of the two closest replicates. ### Genetic association analysis using linear regression Genetic association with the carrier status of *b2/b3* deletion and its subtypes was also tested using linear regression analyses adjusted for age. For sperm parameters abstinence time and for total testosterone levels BMI estimates were additionally used as cofactors. Natural log transformation was used to achieve an approximately normal distribution of values. In all cases (except total sperm counts), the applied transformation resulted in a close-to-normal distribution of values. For the linear regression analyses, statistical significance threshold after correction for multiple testing was estimated *P*<1.0 x 10−3 (6 tests x 8 independent parameters). ### Protocol for the detection of *AZFc* rearrangement at the Y lineage R1a1a1b1a1a-M458: *r2/r3* inversion followed by partial deletions #### Step 1. Multiplex PCR for typing the *AZFc* partial deletions (modified from (*5, 6*)) The most commonly used method to detect *AZFc* partial deletions is the plus/minus STSs-based PCR assay that detects the presence of five markers around *AZFc* region (Figure 1B, Table S2). The multiplex PCR reactions contained the final concentrations of 1X PCR buffer B1 (Solis Biodyne, Estonia), 2.5 mM MgCl2, 2.5 mM dNTP, 2 µM PCR primers for STS markers sY1291 and sY1201, 3 µM PCR primers for STS markers sY1191, sY1206 and sY1161 (**Table S15**), 1U FIREPol® DNA polymerase (Solis Biodyne) and 10 ng of template genomic DNA per reaction. The following PCR conditions were used: for 5 min at 95°C, followed by 32 cycles of 30 sec at 95°C, 30 sec at 63°C and 1 min at 72°C, final extension of 10 min at 72°C and a 4°C hold. The presence/absence of PCR products in a reaction were checked on 2% agarose gel. Lack of amplification of STS marker sY1291 (but presence of all others) was used to determine the *gr/gr* deletion, and lack of sY1191 the *b2/b3* deletion. #### Step 2. Analysis of the allelic state of Y lineage R1a1a1b1a1a-specific phylogenetic marker M458 The phylogenetic marker M458 (rs375323198, A > G polymorphism, GRCh38 genomic coordinate: chrY: 22220317) indicating the carrier status of *r2/r3* inverted Y chromosome was amplified using the following conditions: PCR reactions contained the final concentrations of 1X PCR buffer B1 (Solis Biodyne), 2.5 mM MgCl2, 2.5 mM dNTP, 10 µM forward and reverse PCR primers for M458 (see **Table S18** for primer sequences), 1U FIREPol® DNA polymerase (Solis Biodyne OÜ) and 10 ng of template genomic DNA per reaction. The following PCR conditions were used: for 5 min at 95°C, followed by 32 cycles of 30 sec at 95°C, 30 sec at 52°C and 1 min at 72°C, final extension of 10 min at 72°C and a 4°C hold. The presence of the M458 marker in derived state (instead of the ancestral allele ‘A’ presence of allele ‘G’ at position 147) was determined using Sanger sequencing. #### Step 3. Analysis of retained *DAZ* gene copies by typing gene-specific PSVs (*optional*) The retained *DAZ* gene copies can be identified by typing gene copy-specific paralogous sequence variants (PSVs). In the current study, the retained *DAZ* copies were identified by determining the allelic states of 9 PSVs from the Illumina MiSeq re-sequencing data (see **Table S20** for full details). Additionally, a number of published protocols are available, for full details see (*7-9*). ![Figure S1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F5.medium.gif) [Figure S1.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F5) Figure S1. The human Y chromosome and the *Azoospermia Factor (AZF*) regions. **(A.)** The human Y chromosome drawn to approximate scale with regions of *AZFa, AZFb* and *AZFc* deletions shown below as black bars. PAR - pseudoautosomal region, MSY - male-specific region of the Y chromosome, cen - centromere, *AZF* - azoospermia factor region. **(B.)** Structure of the *AZFc* region on human Y-chromosomal reference sequence with approximate locations of protein-coding genes and STS (sY) markers used for detection of partial *AZFc* deletions (*10*). Alternative involved regions in *gr/gr* deletions arising on (**C.)** the human Y-chromosomal reference sequence and (**D.)** the Y chromosome with a preceding *b3/b4* inversion. The approximate region removed by each deletion is shown as a dashed line. **(E.)** Structure of the *AZFc* region undergone the *gr/gr* deletion and the proposed model of homologous recombination leading to the subsequent ‘*b2/b4*’ duplication. The light blue box denotes the recombination targets. The duplication is presumably the result of recombination between sister chromatids. **(F.)** Reported models for *b2/b3* deletions arising on the Y chromosomes with preceding *b2/b3* or *g1/g3* inversions. The approximate region removed by each deletion is shown as a dashed line. **(G.)** Structure of the *AZFc* region undergone the *b2/b3* deletion and the proposed model of homologous recombination leading to the subsequent ‘*b2/b4*’ duplication. The light blue box denotes the recombination targets. The duplication is presumably the result of recombination between sister chromatids. ![Figure S2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F6.medium.gif) [Figure S2.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F6) Figure S2. Location of the identified exonic variants in the *DAZ* genes. **(A.)** Schematic representation of the human Y-chromosomal reference sequence containing the *AZFc* region, with approximate locations of protein-coding genes and their direction of transcription shown as black triangles. Direct and inverted repeats with highly similar DNA sequences are denoted as coloured arrows (b - blue, t - teal, g - green, r - red, y - yellow as originally described by (*10*); arrow direction indicates the orientation of the gene from transcription start site. **(B.)** The structure of human *DAZ* genes, modified from (*7*) to represent the human Y-chromosomal reference sequence (GRCh38). Blue dashed arrows show the direction of transcription. Exons with high DNA sequence similarity within and between *DAZ* genes are denoted with the same fill colour. The number of highly similar exon 7 copies (in yellow) varies between genes and copies marked with the same letter denote identical sequences. Numbers below the gene structure denote exonic variants: 1 - DAZ1 p.H173Y or DAZ2 p.H173Y 2 - DAZ1 p.Q262E or DAZ2 p.Q262E 3 - DAZ1 p.Y243C or DAZ2 p.Y219C 4 - potential splicing variant in *DAZ3* or *DAZ4*. Black squares in *DAZ4* denote the deleted exons 7f and 7y in Y lineages I1 and R1a1a1b1a2, respectively. ![Figure S3.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F7.medium.gif) [Figure S3.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F7) Figure S3. Distribution of seminal parameters in idiopathic male factor infertility cases with spermatogenic impairment and reference subjects. Samples are grouped by genotype with the number of samples (n) shown in brackets. del - deletion, sec - secondary, inv - inversion, dupl - duplication, mill - million, ej. - ejaculate. Note different scaling of the Y-axis for the two study groups in panels B and C. ![Figure S4.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/12/09/2020.12.08.20245928/F8.medium.gif) [Figure S4.](http://medrxiv.org/content/early/2020/12/09/2020.12.08.20245928/F8) Figure S4. Distribution of hormonal and testicular parameters in idiopathic male factor infertility cases with spermatogenic impairment and reference individuals. Samples are grouped by genotype with the number of samples (n) shown in brackets. del - deletion, sec - secondary, inv - inversion, dupl - duplication. ## Acknowledgements We thank all the patients for making this study possible. The clinical team at the Andrology Centre, Tartu University Hospital is thanked for the professional phenotyping and assistance in patient recruitment over many years. Mart Adler and Eve Laasik are specifically acknowledged for the management of the Androgenetics Biobank, and all M.L. team members are thanked for their contributions to the DNA extractions. Our work was supported by the Wellcome Trust (098051). P.H. and L.K. were supported by Estonian Research Council Grant PUT1036 and M.P. and O.P. by PUT181. L.K., M.G., K.R. and M.L. were supported by Estonian Research Council Project IUT34-12, R.F. by IUT24-1 and S.R. by IUT24-1 and MOBTT53. Establishment of the cohort of Estonian infertility subjects was also supported by the EU through the European Regional Development Fund, project HAPPY PREGNANCY, no. 3.2.0701.12-004 (M.L., M.P., K.R.). * Received December 8, 2020. * Revision received December 8, 2020. * Accepted December 9, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1. A. Jungwirth, A. Giwercman, H. Tournaye, T. Diemer, Z. Kopa, G. Dohle, C. Krausz, European Association of Urology Working Group on Male, European Association of Urology guidelines on Male Infertility: the 2012 update. Eur Urol 62, 324–332 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.eururo.2012.04.048&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22591628&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 2. 2. J. Datta, M. J. Palmer, C. Tanton, L. J. Gibson, K. G. Jones, W. Macdowall, A. Glasier, P. Sonnenberg, N. Field, C. H. Mercer, A. M. Johnson, K. Wellings, Prevalence of infertility and help seeking among 15 000 women and men. Hum Reprod 31, 2108–2118 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/humrep/dew123&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27365525&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 3. 3. M. Punab, O. Poolamets, P. Paju, V. Vihljajev, K. Pomm, R. Ladva, P. Korrovits, M. Laan, Causes of male infertility: a 9-year prospective monocentre study on 1737 patients with reduced total sperm counts. Hum Reprod 32, 18–31 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/humrep/dew284&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27864361&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 4. 4. I. A. Olesen, A. M. Andersson, L. Aksglaede, N. E. Skakkebaek, E. Rajpert-de Meyts, N. Joergensen, A. Juul, Clinical, genetic, biochemical, and testicular biopsy findings among 1,213 men evaluated for infertility. Fertil Steril 107, 74–82 e77 (2017). 5. 5. F. Tuttelmann, F. Werny, T. G. Cooper, S. Kliesch, M. Simoni, E. Nieschlag, Clinical experience with azoospermia: aetiology and chances for spermatozoa detection upon biopsy. Int J Androl 34, 291–298 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2605.2010.01087.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20609027&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 6. 6.ASRM, Diagnostic evaluation of the infertile male: a committee opinion. Fertil Steril 103, e18–25 (2015). 7. 7. C. Krausz, L. Hoefsloot, M. Simoni, F. Tuttelmann, A. European Academy of, N. European Molecular Genetics Quality, EAA/EMQN best practice guidelines for molecular diagnosis of Y-chromosomal microdeletions: state-of-the-art 2013. Andrology 2, 5–19 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.2047-2927.2013.00173.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24357628&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 8. 8. T. P. Kohn, J. R. Kohn, R. C. Owen, R. M. Coward, The Prevalence of Y- chromosome Microdeletions in Oligozoospermic Men: A Systematic Review and Meta-analysis of European and North American Studies. Eur Urol 76, 626–636 (2019). 9. 9. P. J. Stahl, P. Masson, A. Mielnik, M. B. Marean, P. N. Schlegel, D. A. Paduch, A decade of experience emphasizes that testing for Y microdeletions is essential in American men with azoospermia and severe oligozoospermia. Fertil Steril 94, 1753–1756 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.fertnstert.2009.09.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19896650&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 10. 10. T. Kuroda-Kawaguchi, H. Skaletsky, L. G. Brown, P. J. Minx, H. S. Cordum, R. H. Waterston, R. K. Wilson, S. Silber, R. Oates, S. Rozen, D. C. Page, The AZFc region of the Y chromosome features massive palindromes and uniform recurrent deletions in infertile men. Nat Genet 29, 279–286 (2001). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng757&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11687796&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000171911000013&link_type=ISI) 11. 11. H. Skaletsky, T. Kuroda-Kawaguchi, P. J. Minx, H. S. Cordum, L. Hillier, L. G. Brown, S. Repping, T. Pyntikova, J. Ali, T. Bieri, A. Chinwalla, A. Delehaunty, K. Delehaunty, H. Du, G. Fewell, L. Fulton, R. Fulton, T. Graves, S. F. Hou, P. Latrielle, S. Leonard, E. Mardis, R. Maupin, J. McPherson, T. Miner, W. Nash, C. Nguyen, P. Ozersky, K. Pepin, S. Rock, T. Rohlfing, K. Scott, B. Schultz, C. Strong, A. Tin-Wollam, S. P. Yang, R. H. Waterston, R. K. Wilson, S. Rozen, D. C. Page, The malespecific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature 423, 825–U822 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature01722&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12815422&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183585300035&link_type=ISI) 12. 12. S. Repping, H. Skaletsky, L. Brown, S. K. van Daalen, C. M. Korver, T. Pyntikova, T. Kuroda-Kawaguchi, J. W. de Vries, R. D. Oates, S. Silber, F. van der Veen, D. C. Page, S. Rozen, Polymorphism for a 1.6-Mb deletion of the human Y chromosome persists through balance between recurrent mutation and haploid selection. Nat Genet 35, 247–251 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng1250&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14528305&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000186273300020&link_type=ISI) 13. 13. S. Repping, S. K. van Daalen, C. M. Korver, L. G. Brown, J. D. Marszalek, J. Gianotten, R. D. Oates, S. Silber, F. van der Veen, D. C. Page, S. Rozen, A family of human Y chromosomes has dispersed throughout northern Eurasia despite a 1.8-Mb deletion in the azoospermia factor c region. Genomics 83, 1046–1052 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ygeno.2003.12.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15177557&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000221662900009&link_type=ISI) 14. 14. S. G. Rozen, J. D. Marszalek, K. Irenze, H. Skaletsky, L. G. Brown, R. D. Oates, S. J. Silber, K. Ardlie, D. C. Page,AZFc deletions and spermatogenic failure: a population-based survey of 20,000 Y chromosomes. Am J Hum Genet 91, 890–896 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2012.09.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23103232&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 15. 15. S. Fernandes, S. Paracchini, L. H. Meyer, G. Floridia, C. Tyler-Smith, P. H. Vogt, A large AZFc deletion removes DAZ3/DAZ4 and nearby genes from men in Y haplogroup N. Am J Hum Genet 74, 180–187 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/381132&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14639527&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000187723500018&link_type=ISI) 16. 16. S. K. Bansal, D. Jaiswal, N. Gupta, K. Singh, R. Dada, S. N. Sankhwar, G. Gupta, S. Rajender, Gr/gr deletions on Y-chromosome correlate with male infertility: an original study, meta-analyses, and trial sequential analyses. Sci Rep 6, 19798 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/srep19798&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26876364&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 17. 17. C. Krausz, C. Giachini, Y. Xue, M. K. O’Bryan, J. Gromoll, E. Rajpert-de Meyts, R. Oliva, I. Aknin-Seifer, E. Erdei, N. Jorgensen, M. Simoni, J. L. Ballesca, R. Levy, G. Balercia, P. Piomboni, E. Nieschlag, G. Forti, R. McLachlan, C. Tyler-Smith, Phenotypic variation within European carriers of the Y-chromosomal gr/gr deletion is independent of Y-chromosomal background. J Med Genet 46, 21–31 (2009). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjc6IjQ2LzEvMjEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8xMi8wOS8yMDIwLjEyLjA4LjIwMjQ1OTI4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 18. 18. C. M. de Carvalho, L. W. Zuccherato, M. Fujisawa, T. Shirakawa, A. K. Ribeiro-dos-Santos, S. E. Santos, S. D. Pena, F. R. Santos, Study of AZFc partial deletion gr/gr in fertile and infertile Japanese males. J Hum Genet 51, 794–799 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10038-006-0024-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16900294&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000240469700008&link_type=ISI) 19. 19. L. S. Teitz, T. Pyntikova, H. Skaletsky, D. C. Page, Selection Has Countered High Mutability to Preserve the Ancestral Copy Number of Y Chromosome Amplicons in Diverse Human Lineages. Am J Hum Genet 103, 261–275 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2018.07.007&link_type=DOI) 20. 20. S. K. Bansal, G. Gupta, S. Rajender, Y chromosome b2/b3 deletions and male infertility: A comprehensive meta-analysis, trial sequential analysis and systematic review. Mutat Res Rev Mutat Res 768, 78–90 (2016). 21. 21. S. Colaco, D. Modi, Genetics of the human Y chromosome and its association with male infertility. Reprod Biol Endocrinol 16, 14 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12958-018-0330-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29454353&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 22. 22. A. M. Ilumae, M. Reidla, M. Chukhryaeva, M. Jarve, H. Post, M. Karmin, L. Saag, A. Agdzhoyan, A. Kushniarevich, S. Litvinov, N. Ekomasova, K. Tambets, E. Metspalu, R. Khusainova, B. Yunusbayev, E. K. Khusnutdinova, L. P. Osipova, S. Fedorova, O. Utevska, S. Koshel, E. Balanovska, D. M. Behar, O. Balanovsky, T. Kivisild, P. A. Underhill, R. Villems, S. Rootsi, Human Y Chromosome Haplogroup N: A Nontrivial Time-Resolved Phylogeography that Cuts across Language Families. Am J Hum Genet 99, 163–173 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2016.05.025&link_type=DOI) 23. 23. N. Machev, N. Saut, G. Longepied, P. Terriou, A. Navarro, N. Levy, M. Guichaoua, C. Metzler-Guillemain, P. Collignon, A. M. Frances, J. Belougne, E. Clemente, J. Chiaroni, C. Chevillard, C. Durand, A. Ducourneau, N. Pech, K. McElreavey, M. G. Mattei, M. J. Mitchell, Sequence family variant loss from the AZFc interval of the human Y chromosome, but not gene copy loss, is strongly associated with male infertility. J Med Genet 41, 814–825 (2004). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjk6IjQxLzExLzgxNCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzEyLzA5LzIwMjAuMTIuMDguMjAyNDU5MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 24. 24. M. Ghorbel, S. Baklouti-Gargouri, R. Keskes, N. Chakroun, A. Sellami, F. Fakhfakh, L. Ammar-Keskes, gr/gr-DAZ2-DAZ4-CDY1b deletion is a high-risk factor for male infertility in Tunisian population. Gene 592, 29–35 (2016). 25. 25. C. Krausz, E. Casamonti, Spermatogenic failure and the Y chromosome. Hum Genet 136, 637–655 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-017-1793-8&link_type=DOI) 26. 26. P. A. Underhill, G. D. Poznik, S. Rootsi, M. Jarve, A. A. Lin, J. Wang, B. Passarelli, J. Kanbar, N. M. Myres, R. J. King, J. Di Cristofaro, H. Sahakyan, D. M. Behar, A. Kushniarevich, J. Sarac, T. Saric, P. Rudan, A. K. Pathak, G. Chaubey, V. Grugni, O. Semino, L. Yepiskoposyan, A. Bahmanimehr, S. Farjadian, O. Balanovsky, E. K. Khusnutdinova, R. J. Herrera, J. Chiaroni, C. D. Bustamante, S. R. Quake, T. Kivisild, R. Villems, The phylogenetic and geographic structure of Y-chromosome haplogroup R1a. Eur J Hum Genet 23, 124–131 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2014.50&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24667786&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 27. 27.World Health Organization, WHO Laboratory Manual for the Examination and Processing of Human Semen, 5th edn. (World Health Organization, Geneva, 2010). 28. 28. A. M. Andersson, N. Jorgensen, K. M. Main, J. Toppari, E. Rajpert-De Meyts, H. Leffers, A. Juul, T. K. Jensen, N. E. Skakkebaek, Adverse trends in male reproductive health: we may have reached a crucial ‘tipping point’. Int J Androl 31, 74–80 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2605.2007.00853.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18194282&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000253710200002&link_type=ISI) 29. 29. M. A. Jobling, C. Tyler-Smith, Human Y-chromosome variation in the genome-sequencing era. Nat Rev Genet 18, 485–497 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg.2017.36&link_type=DOI) 30. 30. P. Hallast, P. Balaresque, G. R. Bowden, S. Ballereau, M. A. Jobling, Recombination dynamics of a human Y-chromosomal palindrome: rapid GC-biased gene conversion, multi-kilobase conversion tracts, and rare inversions. PLoS Genet 9, e1003666 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1003666&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23935520&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 31. 31. S. Rozen, H. Skaletsky, J. D. Marszalek, P. J. Minx, H. S. Cordum, R. H. Waterston, R. K. Wilson, D. C. Page, Abundant gene conversion between arms of palindromes in human and ape Y chromosomes. Nature 423, 873–876 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature01723&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12815433&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183585300047&link_type=ISI) 32. 32. B. Trombetta, F. Cruciani, Y chromosome palindromes and gene conversion. Hum Genet 136, 605–619 (2017). 33. 33. W. Shi, S. Louzada, M. Grigorova, A. Massaia, E. Arciero, L. Kibena, X. J. Ge, Y. Chen, Q. Ayub, O. Poolamets, C. Tyler-Smith, M. Punab, M. Laan, F. Yang, P. Hallast, Y. Xue, Evolutionary and functional analysis of RBMY1 gene copy number variation on the human Y chromosome. Hum Mol Genet 28, 2785–2798 (2019). 34. 34. C. Lu, F. Zhang, H. Yang, M. Xu, G. Du, W. Wu, Y. An, Y. Qin, G. Ji, X. Han, A. Gu, Y. Xia, L. Song, S. Wang, L. Jin, X. Wang, Additional genomic duplications in AZFc underlie the b2/b3 deletion-associated risk of spermatogenic impairment in Han Chinese population. Hum Mol Genet 20, 4411–4421 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddr369&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21852246&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000296103800010&link_type=ISI) 35. 35. Y. Yang, M. Ma, L. Li, D. Su, P. Chen, Y. Ma, Y. Liu, D. Tao, L. Lin, S. Zhang, Differential effect of specific gr/gr deletion subtypes on spermatogenesis in the Chinese Han population. Int J Androl 33, 745–754 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2605.2009.01015.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20039973&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000281555900009&link_type=ISI) 36. 36. Y. W. Lin, L. C. L. Hsu, P. L. Kuo, W. J. Huang, H. S. Chiang, S. D. Yeh, T. Y. Hsu’, Y. H. Yu, K. N. Hsiao, R. M. Cantor, P. H. Yen, Partial duplication at AZFc on the Y chromosome is a risk factor for impaired spermatogenesis in Han Chinese in Taiwan. Hum Mutat 28, 486–494 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.20473&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17285591&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000245788700009&link_type=ISI) 37. 37. J. J. Ye, L. Ma, L. J. Yang, J. H. Wang, Y. L. Wang, H. Guo, N. Gong, W. H. Nie, S. H. Zhao, Partial AZFc duplications not deletions are associated with male infertility in the Yi population of Yunnan Province, China. J Zhejiang Univ Sci B 14, 807–815 (2013). 38. 38. B. Yang, Y. Y. Ma, Y. Q. Liu, L. Li, D. Yang, W. L. Tu, Y. Shen, Q. Dong, Y. Yang, Common AZFc structure may possess the optimal spermatogenesis efficiency relative to the rearranged structures mediated by non-allele homologous recombination. Sci Rep 5, 10551 (2015). 39. 39. C. Giachini, I. Laface, E. Guarducci, G. Balercia, G. Forti, C. Krausz, Partial AZFc deletions and duplications: clinical correlates in the Italian population. Hum Genet 124, 399–410 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-008-0561-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18807255&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000260767100008&link_type=ISI) 40. 40. D. Lo Giacco, C. Chianese, J. Sanchez-Curbelo, L. Bassas, P. Ruiz, O. Rajmil, J. Sarquella, A. Vives, E. Ruiz-Castane, R. Oliva, E. Ars, C. Krausz, Clinical relevance of Y-linked CNV screening in male infertility: new insights based on the 8-year experience of a diagnostic genetic laboratory. Eur J Hum Genet 22, 754–761 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2013.253&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24193344&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 41. 41. M. J. Noordam, G. H. Westerveld, S. E. Hovingh, S. K. van Daalen, C. M. Korver, F. van der Veen, A. M. van Pelt, S. Repping, Gene copy number reduction in the azoospermia factor c (AZFc) region and its effect on total motile sperm count. Hum Mol Genet 20, 2457–2463 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddr119&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21429917&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000290849200015&link_type=ISI) 42. 42. M. Grigorova, M. Punab, K. Ausmees, M. Laan, FSHB promoter polymorphism within evolutionary conserved element is associated with serum FSH level in men. Hum Reprod 23, 2160–2166 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/humrep/den216&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18567894&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000259144600028&link_type=ISI) 43. 43. T. Kikas, R. Inno, K. Ratnik, K. Rull, M. Laan, C-allele of rs4769613 near FLT1 represents a high-confidence placental risk factor in preeclampsia. Hypertension, (in press). 44. 44. D. Pilvar, M. Reiman, A. Pilvar, M. Laan, Parent-of-origin-specific allelic expression in the human placenta is limited to established imprinted loci and it is stably maintained across pregnancy. Clin Epigenetics 11, 94 (2019). 45. 45. Y. W. Lin, C. L. Hsu, P. H. Yen, A two-step protocol for the detection of rearrangements at the AZFc region on the human Y chromosome. Molecular Human Reproduction 12, 347–351 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molehr/gal038&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16608905&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000237931200008&link_type=ISI) 46. 46. H. Li, R. Durbin, Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btp324&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19451168&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000267665900006&link_type=ISI) 47. 47. A. McKenna, M. Hanna, E. Banks, A. Sivachenko, K. Cibulskis, A. Kernytsky, K. Garimella, D. Altshuler, S. Gabriel, M. Daly, M. A. DePristo, The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–1303 (2010). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjIwLzkvMTI5NyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzEyLzA5LzIwMjAuMTIuMDguMjAyNDU5MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 48. 48. T. Zerjal, B. Dashnyam, A. Pandya, M. Kayser, L. Roewer, F. R. Santos, W. Schiefenhovel, N. Fretwell, M. A. Jobling, S. Harihara, K. Shimizu, D. Semjidmaa Sajantila, P. Salo, M. H. Crawford, E. K. Ginter, O. V. Evgrafov, C. Tyler-Smith, Genetic relationships of Asians and Northern Europeans, revealed by Y-chromosomal DNA analysis. Am J Hum Genet 60, 1174–1183 (1997). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9150165&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997WX76000018&link_type=ISI) 49. 49. M. Karmin, L. Saag, M. Vicente, M. A. Wilson Sayres, M. Jarve, U. G. Talas, S. Rootsi, A. M. Ilumae, R. Magi, M. Mitt, L. Pagani, T. Puurand, Z. Faltyskova, F. Clemente, A. Cardona, E. Metspalu, H. Sahakyan, B. Yunusbayev, G. Hudjashov, M. DeGiorgio, E. L. Loogvali, C. Eichstaedt, M. Eelmets, G. Chaubey, K. Tambets, S. Litvinov, M. Mormina, Y. Xue, Q. Ayub, G. Zoraqi, T. S. Korneliussen, F. Akhatova, J. Lachance, S. Tishkoff, K. Momynaliev, F. X. Ricaut, P. Kusuma, H. Razafindrazaka, D. Pierron, M. P. Cox, G. N. Sultana, R. Willerslev, C. Muller, M. Westaway, D. Lambert, V. Skaro, L. Kovacevic, S. Turdikulova, D. Dalimova, R. Khusainova, N. Trofimova, V. Akhmetova, I. Khidiyatova, D. V. Lichman, J. Isakova, E. Pocheshkhova, Z. Sabitov, N. A. Barashkov, P. Nymadawa, E. Mihailov, J. W. Seng, I. Evseeva, A. B. Migliano, S. Abdullah, G. Andriadze, D. Primorac, L. Atramentova, O. Utevska, L. Yepiskoposyan, D. Marjanovic, A. Kushniarevich, D. M. Behar, C. Gilissen, L. Vissers, J. A. Veltman, E. Balanovska, M. Derenko, B. Malyarchuk, A. Metspalu, S. Fedorova, A. Eriksson, A. Manica, F. L. Mendez, T. M. Karafet, K. R. Veeramah, N. Bradman, M. F. Hammer, L. P. Osipova, O. Balanovsky, E. K. Khusnutdinova, K. Johnsen, M. Remm, M. G. Thomas, C. Tyler-Smith, P. A. Underhill, E. Willerslev, R. Nielsen, M. Metspalu, R. Villems, T. Kivisild, A recent bottleneck of Y chromosome diversity coincides with a global change in culture. Genome Res 25, 459–466 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjg6IjI1LzQvNDU5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMTIvMDkvMjAyMC4xMi4wOC4yMDI0NTkyOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 50. 50. S. Fernandes, K. Huellen, J. Goncalves, H. Dukal, J. Zeisler, E. Rajpert De Meyts, N. E. Skakkebaek, B. Habermann, W. Krause, M. Sousa, A. Barros, P. H. Vogt, High frequency of DAZ1/DAZ2 gene deletions in patients with severe oligozoospermia. Mol Hum Reprod 8, 286–298 (2002). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molehr/8.3.286&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11870237&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000174361000011&link_type=ISI) 51. 51. H. Wickham, ggplot2: Elegant Graphics for Data Analysis (Use R!, Springer-Verlag New York, New York, 2009). ## Supplemental references 1. 1. M. Punab, O. Poolamets, P. Paju, V. Vihljajev, K. Pomm, R. Ladva, P. Korrovits, M. Laan, Causes of male infertility: a 9-year prospective monocentre study on 1737 patients with reduced total sperm counts. Hum Reprod 32, 18–31 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/humrep/dew284&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27864361&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 2. 2. W. McLaren, L. Gil, S. E. Hunt, H. S. Riat, G. R. Ritchie, A. Thormann, P. Flicek, F. Cunningham, The Ensembl Variant Effect Predictor. Genome Biol 17, 122 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-016-0974-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27268795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 3. 3. P. Rentzsch, D. Witten, G. M. Cooper, J. Shendure, M. Kircher, CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res 47, D886–D894 (2019). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 4. 4. W. Shi, S. Louzada, M. Grigorova, A. Massaia, E. Arciero, L. Kibena, X. J. Ge, Y. Chen, Q. Ayub, O. Poolamets, C. Tyler-Smith, M. Punab, M. Laan, F. Yang, P. Hallast, Y. Xue, Evolutionary and functional analysis of RBMY1 gene copy number variation on the human Y chromosome. Hum Mol Genet 28, 2785–2798 (2019). 5. 5. Y. W. Lin, C. L. Hsu, P. H. Yen, A two-step protocol for the detection of rearrangements at the AZFc region on the human Y chromosome. Molecular Human Reproduction 12, 347–351 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molehr/gal038&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16608905&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000237931200008&link_type=ISI) 6. 6. C. Krausz, L. Hoefsloot, M. Simoni, F. Tuttelmann, A. European Academy of, N. European Molecular Genetics Quality, EAA/EMQN best practice guidelines for molecular diagnosis of Y-chromosomal microdeletions: state-of-the-art 2013. Andrology 2, 5–19 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.2047-2927.2013.00173.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24357628&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) 7. 7. S. Fernandes, K. Huellen, J. Goncalves, H. Dukal, J. Zeisler, E. Rajpert De Meyts, N. E. Skakkebaek, B. Habermann, W. Krause, M. Sousa, A. Barros, P. H. Vogt, High frequency of DAZ1/DAZ2 gene deletions in patients with severe oligozoospermia. Mol Hum Reprod 8, 286–298 (2002). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molehr/8.3.286&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11870237&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000174361000011&link_type=ISI) 8. 8. N. Machev, N. Saut, G. Longepied, P. Terriou, A. Navarro, N. Levy, M. Guichaoua, C. Metzler-Guillemain, P. Collignon, A. M. Frances, J. Belougne, E. Clemente, J. Chiaroni, C. Chevillard, C. Durand, A. Ducourneau, N. Pech, K. McElreavey, M. G. Mattei, M. J. Mitchell, Sequence family variant loss from the AZFc interval of the human Y chromosome, but not gene copy loss, is strongly associated with male infertility. J Med Genet 41, 814–825 (2004). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjk6IjQxLzExLzgxNCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzEyLzA5LzIwMjAuMTIuMDguMjAyNDU5MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 9. 9. T. Vaszko, J. Papp, C. Krausz, E. Casamonti, L. Geczi, E. Olah, Discrimination of Deletion and Duplication Subtypes of the Deleted in Azoospermia Gene Family in the Context of Frequent Interloci Gene Conversion. PLoS One 11, e0163936 (2016). 10. 10. T. Kuroda-Kawaguchi, H. Skaletsky, L. G. Brown, P. J. Minx, H. S. Cordum, R. H. Waterston, R. K. Wilson, S. Silber, R. Oates, S. Rozen, D. C. Page, The AZFc region of the Y chromosome features massive palindromes and uniform recurrent deletions in infertile men. Nat Genet 29, 279–286 (2001). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng757&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11687796&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.08.20245928.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000171911000013&link_type=ISI)