Autism spectrum disorder trios from consanguineous populations are enriched for rare biallelic variants, identifying 32 new candidate genes =========================================================================================================================================== * Ricardo Harripaul * Ansa Rabia * Nasim Vasli * Anna Mikhailov * Ashlyn Rodrigues * Stephen F. Pastore * Tahir Muhammad * Thulasi Thiruvallur Madanagopal * Aisha Hashmi * Clinton Tran * Cassandra Stan * Katherine Aw * Maleeha Azam * Saqib Mahmood * Abolfazl Heidari * Raheel Qamar * Leon French * Shreejoy Tripathy * Zehra Agha * Muhammad Iqbal * Majid Ghadami * Susan L. Santangelo * Bita Bozorgmehr * Laila Al Ayadhi * Roksana Sasanfar * Shazia Maqbool * James A. Knowles * Muhammad Ayub * John B Vincent ## Abstract Autism spectrum disorder (ASD) is a neurodevelopmental disorder that affects about 1 in 55 children worldwide, imposing enormous economic and socioemotional burden on families and communities. Genetic studies of ASD have identified *de novo* copy number variants (CNVs) and point mutations that contribute significantly to the genetic architecture of ASD, but the majority of these studies were conducted in outbred populations, which are not ideal for detecting autosomal recessive (AR) inheritance. However, several studies of ASD in consanguineous populations point towards AR as an under-appreciated source of ASD variants. Here, we used trio whole exome sequencing to look for rare variants for ASD in 115 proband- mother-father trios from populations with high rates of consanguinity, namely Pakistan, Iran, and Saudi Arabia. We report 87 candidate sequence variants, with 57% biallelic, 21% autosomal dominant/*de novo*, and the rest X-linked. 52% of the variants were loss of function (LoF) or putative LoF (splice site, stop loss), and 47% nonsynonymous. Our analysis indicates an enrichment of biallelic variants. These include variants in genes previously reported for AR ASD and/or intellectual disability (ID) (*AGA*, *ASL*, *ASPA*, *BTN3A2, CC2D1A*, *DEAF1*, *HTRA2*, *KIF16B*, *LINS1*, *MADD*, *MED25*, *MTHFR*, *RSRC1*, *TECPR2*, *VPS13B*, *ZNF335*), and 32 previously unreported candidates, including 15 LoF or splice variants, in genes such as *DAGLA*, *EFCAB8*, *ENPP6*, *FAXDC2*, *ILDR2*, *PKD1L1*, *SCN10A*, and *SLC36A1*. We also identified several candidate biallelic exonic loss CNVs. The enrichment shown here for biallelic variants confirms that the genetic architecture for ASD among consanguineous populations is different to non-consanguineous populations. However, as has been shown in many recessive disorders, the genes reported here may also be relevant to outbred populations. ## Introduction Autism spectrum disorder (ASD) is characterized by deficits in social communication, also repetitive and restricted behaviours and interests. Apart from a small percentage of individuals who recover (1), ASD is a life-long condition and only about 20% have good outcome as reported in a recent meta-analysis of longitudinal studies (2). ASD is quite heterogeneous in terms of level of intellectual functioning, co-morbid psychiatric and behavioural problems, and these factors play an important role in the long term outcome of the diagnosis (3). In a systematic review its prevalence was estimated to be 0.76% internationally (4). In another systematic review of international data, the median estimate was 62/10,000 (5). In this review, the prevalence of ASD showed little variation by geographic region, ethnic, cultural and socioeconomic factors (5). The reported prevalence of ASD has increased with time. In the 1960s it was estimated to affect as few as 1 in 10000 individuals (5), prevalence studies from the 1980s suggested that as many as 72 in 10000 individuals had ASD, rising to 1% in the 2000s (5, 6) More recent studies report prevalence rates of more than 2%. (7–9). In the United States, the prevalence across eleven sites in 2012 was 14.5 per 1,000 (one in 69) children aged 8 years (10) and in 2016 it increased to 18.5 per 1,000 (one in 54) children (11). This increase at least partly is because of changes in diagnostic criteria, reporting practices, increased awareness (12–15). For example, a Swedish study showed that although ASD diagnoses became more common over time among individuals born between 1992 and 2002, the underlying level of autistic traits did not (9). Studies that examined the differences between individuals with and without ASD on symptom and cognitive measures indicated a decrease in the differences between the two groups (15, 16). However, in a recent study from Denmark only a third of ASD diagnoses could be explained because of broadened diagnostic criteria (13). The search for alternative explanations for the increase in the prevalence includes the possibility of a bigger contribution of the environmental factors that might have become more common with time and have caused a genuine increase in the numbers of affected individuals (17). Many of the reported environmental exposures associated with ASD, such as air pollution exposure, paternal age, and maternal psychotropic medication use during pregnancy tend to occur during the prenatal period (18, 19). This has been further examined in a longitudinal twin study where there was an increase in the ASD traits while the contribution of environmental factors had not changed (20). Describing the genetic architecture of ASD is the first step towards understanding the pathogenesis of the disorder. There has been significant progress to date, and in genetic studies of ASD it has been shown that *de novo* copy number variants (CNVs) and point mutations play a prominent role, however, the majority of these studies have been conducted in outbred populations (21) A systematic review of published literature from South Asian countries reported a prevalence of ASD in the range of 0.09-1.07 % (22). To date, there has been no community study of prevalence from Pakistan. Studies from child psychiatry clinics reported rates of 2.4% and 3.2%, 4.5%, 5% (23–26). Clinical services for ASD are available but limited to big cities. There are special schools in every big city for children with ASD (27). The literature on clinical presentation is limited. One study with a small sample size shows symptoms consistent with studies elsewhere in the world (28). Several projects have highlighted recessive inheritance as an important component of the genetic architecture of ASD. A large study of consanguineous versus non-consanguineous families in India concluded that consanguinity increases risk for ASD with an odds ratio of 3.22 (29). Morrow and colleagues used SNP microarrays to map homozygous loci in 104 small ASD families from the Middle East, Turkey and Pakistan, finding homozygous deletions implicating *SLC9A9*, *PCDH10*, *CNTN3* and others (30). Research in outbred populations also support AR inheritance as an important piece of the genetic puzzle for ASD. For example, previous work has estimated that loss-of-function (LoF) recessive mutations contribute 3% of ASD genetics in two US-based case-control cohorts (31). These findings are not limited to population isolates or ethnic subgroups (32, 33). Consanguineous marriages lead to a marked increase in the frequency of rare recessive disorders (34, 35). Many genetic variants will only be pathogenic in recessive form; this includes variants for ID, and almost certainly for ASD too. Identification of recessive genes in outbred populations is problematic, as the analysis pipelines for WES/WGS data are non-optimal for discovering compound heterozygous mutations. Populations with a high proportion of consanguinity have been important for describing autosomal recessive genes in ID in Pakistan (36, 37), Iran (38, 39), Syria (40), and Saudi Arabia (41), yet, to date, relatively few reports of AR genes underlying ASD in consanguineous populations have been published (30,42,43). Despite the higher prevalence of recessive inheritance in consanguineous populations, data from studies of developmental disorder (DD) suggest that *de novo* variants are also prominent, albeit in a lower proportion e.g. in a UK study of 6040 families from the Deciphering Developmental Disorders (DDD) study, individuals of Pakistani ancestry had 29.8% *de novo* compared with 49.9% in the European ancestry UK population (44). In an Iranian ID study, the *de novo* rate was 27.86% (45). Here, we report on a study of genetic and genomic variants in a cohort of 115 ASD trios from three countries with a high frequency of consanguineous marriages (Pakistan, Iran, and Saudi Arabia). Of the candidate sequence variants we report (not including CNVs), the proportion that are *de novo* is ∼24%, similar to the rates reported among Pakistani ancestry DD individuals in the UK and in ID individuals in Iran. We report 57% of the candidate sequence variants identified in the study are AR. This is somewhat higher than the 31% AR reported for Pakistani ancestry subjects in the UK DDD study (3.6% for those of European ancestry). Of the 28 sequence variants identified in genes previously linked to neurodevelopmental disorders, 25% were *de novo*, and 55% AR. This compares well with rates in a study of Iranian ID families (28% de novo, 69% AR) (45). ## Materials and Methods ### Trio Family Ascertainment Institutional Research Ethics Board approval was received for this study through the Centre for Addiction and Mental Health (CAMH) and other institutional recruiting sites. Twenty-seven trios were ascertained by Dr. Sasanfar by the Children’s Health and Evaluation project (CHEP) sponsored by the Special Education Organization (SEO), as described previously (46, 47). Children in the cohort were between ages 4 and 11, and received a clinical diagnosis of ASD by a child psychiatrist. All parents from Iranian cohorts filled out the Social Communication Questionnaire (SCQ) (48), using the *Diagnostic and Statistical Manual of Mental Disorders* (fourth edition-text revision (DSM-IV-TR) (49). The diagnosis of ASD was confirmed in all subjects using ADI-R (50) and ADOS (51) by a certified physician. Another 13 trios were ascertained through clinical services in Iran: Dr. Bita Bozorgmehr clinically assessed 10 trios and Dr. Abolfazl Heidari assessed three trios. Fifty-four Pakistani trios were recruited from the University of Health Sciences and Children Hospital, Lahore, Pakistan. Pakistani ASD trio probands were assessed using the Childhood Autism Rating Scale (CARS)(52). The clinical assessment has been performed by Drs. Ansa Rabia Saqib Mahmood, and Shazia Maqbool. A further eight Pakistani trios were recruited through COMSATS University, Islamabad, by Drs Raheel Qamar and Zehra Agha, with ascertainment through the National Rural Support Program and the Ghazi Barotha Taraqiata iDara. Diagnosis of ASD was performed using combined DSM-5 (53) and ICD-10 criteria (54), after assessment using CARS (52). The 13 Saudi Arabian trios were provided by Dr. Laila Al Ayadhi through the Autism Treatment Centre, King Saud University, Riyadh, Saudi Arabia, and included subjects aged 2-12 years old who had a diagnosis of ASD confirmed using the ADI-R (50), Autism Diagnostic Observation Schedule (ADOS) (51) and 3DI (Developmental, dimensional diagnostic interview) scales (55). Disease severity was also measured using the Childhood Autism Rating Scale (CARS)(52). Saudi subjects were excluded if they had dysmorphic features, or diagnosis of fragile X or other serious neurological (e.g., seizures), psychiatric (e.g., bipolar disorder) or known medical conditions. Overall, we have sequenced 115 trios: 62 trios from Pakistan, 40 trios from Iran, and 13 trios from Saudi Arabia. In total, we have sequenced 345 samples from the different institutions, with collaborator contributions listed in Table 1. View this table: [Table 1:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T1) Table 1: Cohort Description. ### Whole Exome Sequencing, Alignment and Variant Calling Whole Exome Sequencing (WES) was performed using the Thruplex DNA-Seq (Rubicon Genomics) Library Preparation Kit with the Agilent SureSelect V5 Exome Capture kit. Samples were sheared using the Covaris ME220 Focused-Sonicator to ensure that the DNA was sheared to 200 bp. Samples were then analyzed using the Agilent 2100 Bioanalyzer System for fragment length distribution and quantification. Library preparation and adapter ligation was performed using the Thruplex DNA-seq kit. All trios, representing the proband, mother and father, were sequenced on the Illumina HiSeq 2500 or the Illumina NovaSeq sequencing system. After sequencing, FastQC was used to assess the quality of sequences and determine the adapter contamination and read quality. Unligated adapters and low quality sequences were removed using FastX Toolkit (Updated Aug 4th 2017: [https://github.com/agordon/fastx_toolkit](https://github.com/agordon/fastx_toolkit)) (56) and Trimmomatic (v 0.33: Updated Mar. 10th, 2015: [https://github.com/timflutre/trimmomatic](https://github.com/timflutre/trimmomatic)) (57) . Cleaned fastq files were then aligned to the hg19 reference genome using BWA 0.7.17 (58) and variants were called using the GATK best practices for variant calling and quality control (59). Briefly, after alignment, duplicate reads were removed using Samtools version 1.4 (60) and the GATK v 3.8 (61) was used for base recalibration, indel realignment and then variant calling using the Global Haplotyper (61). This created raw Variant Call Format (VCF) files. The variants called per sample were then checked for their transition-transversion ratio which was approximately 2.5. ### Annotation and Variant Prioritization VCF files were annotated using ANNOVAR (62), integrating allele frequencies, from gnomAD (63), The Greater Middle Eastern Variome Server (64) and ExAC (65), as well as functional pathogenicity scores from Polyphen2 (66), SIFT (67) and MutationTaster (68), MutationAssessor (69), FATHMM (70), and CADD (71). Variants were assessed, firstly, if they passed the quality metrics as set by the Convolutional Neural Network scoring algorithm by GATK (61). Briefly, variants were compared against known variants from dbSNP and a convolutional neural network was trained to filter out poor variant calls based on several metrics (61) and minor allele frequency (MAF) in gnomAD ([https://gnomad.broadinstitute.org/](https://gnomad.broadinstitute.org/)) of equal to or lower than 1 x 10-3 for autosomal recessive, or 1 x 10-5 for autosomal dominant/*de novo*, with a preference for variants that had no homozygotes in the gnomAD non-Neuro Cohort. For autosomal recessive variants, as the MAF used was on the conservative side, reanalysis was also performed using less stringent values (1 x 10-3, 1 x 10-2), to see if any variants in known ASD/ID genes were missed under the stringent analysis. Variants were then split into either homozygote or heterozygote categories and then prioritized by functional impact of mutation type. For example, LoF variants would carry more weight than missense variants leading to assessing stopgain, frameshift, splicing, and missense variants, in that order. The LoF mutations were given highest priority as it was assumed that LoF variants will disrupt protein function more severely and, depending on the position within the transcript, cause nonsense mediated mRNA decay (NMD). The variants were then assessed for their functional pathogenicity score, where variants that met at least two functional prediction scores between Polyphen2 (66), and MutationTaster (68) and CADD (71) were further investigated. Special attention was given to splicing with additional functional annotation scores being used for splicing events with the addition of FATHMM (70) and dbscSNV (72). Due to the large number of missense variants for autosomal dominant/*de novo* inheritance, stricter criteria were used where a variant would only be considered if it was predicted as pathogenic by all software used including Polyphen2 (66), MutationTaster (68), MutationAssessor (69), FATHMM (70), CADD(71) and M-CAP(73). The overall prioritization method is outlined in Figure 1. Variants were also analyzed to determine if they are in genes associated with any previous studies reporting association with ASD or ID. Variants in genes in pathways such as neuron growth and guidance were also considered as a higher priority. ![Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2022/01/26/2021.12.24.21268340/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/F1) Figure 1: Variant prioritization methods where variants that pass GATK best practices would be filtered for MAF less than or equal to 10-4 and then categorized into recessive or homozygous variants. Variants are then prioritized based on assessment of potential damage to the protein, and then filtered based on scoring algorithms to predict pathogenicity. Finally, variants were Sanger-sequenced and segregation checked with both parents for validation. ### Splice site annotation Standard annotation software has been noted to be suboptimal for splicing variants and cryptic splice sites (74). To improve detection of splicing variants, particularly in neurodevelopmental disorders, SpliceAI was used to splicing variants or cryptic splice sites (75). All VCF files were run through the standard SpliceAI algorithm as per documentation and filtered for variants with a SpliceAI score greater than 0.5. Once variants were extracted, they were annotated with ANNOVAR (76) to add additional meta data for biological interpretation. Variants were then binned into heterozygous, X-linked and homozygous and filtered by gnoMAD (77) allele frequency (MAF < 0.01). Variants that survived filtering were then investigated for appropriate inheritance patterns (i.e. *de novo* for heterozygous variants and both parents contributing one allele for homozygous variants). ### Copy Number Variant (CNV) analysis Microarrays were run using DNA from 103 ASD probands from the 115 trios (where sufficient DNA was available), including 19 using Affymetrix CytoScanHD, and 84 using Illumina CoreExome. CNV analysis for the CytoScanHD arrays was performed using ChAS and PennCNV; for CoreExome using Illumina Genome Suite/CNVpartition and PennCNV. Evidence of homozygous loss CNVs was cross-references with WES data, using the Integrated Genome Viewer (IGV: [https://software.broadinstitute.org/software/igv/](https://software.broadinstitute.org/software/igv/); (78), and CNVs corroborated in this manner were then checked by PCR. Also, for WES data we performed CNV analysis using CLAMMS (79), XHMM (80) and CoNIFER (81). The analysis of exome sequencing data for CNVs has drawbacks due to the capture technology used. A majority of these methods employ a read based exome calling strategy after normalizing for the depth of sequencing across different regions. CLAMMS was used because it offered the ability to create subsets of the samples and construct more specific references through principal component analysis (PCA) of sequencing metrics. All softwares used were run according to standard procedures described on their respective documentation. Filtering was done for quality metrics outlined in the documentation. Once CNVs were called they were also flited against the gnomAD structural variation call set (82) and the Database of Genomic Variants (DGV: [http://dgv.tcag.ca/dgv/app/home](http://dgv.tcag.ca/dgv/app/home)). CNVs that overlapped more than 50% with gnomAD or DGV SVs were filtered out. Once the final call set after filtration was created, CNVs were then binned into homozygous, heterozygous or X-linked variants and further validated with qPCR to check whether they followed expected inheritance patterns for their respective zygosity. For instance, homozygous CNVs in the proband would have to have both parents that were heterozygous, and heterozygous proband variants would have to be *de novo*. X-linked variants were also further validated in males, as they lack a second copy of the X chromosome and are thus more likely to be damaging. CNV validations were also performed for family members, to confirm the inheritance pattern of candidate variants. ### Cross-referencing with other datasets In order to evaluate the strength of candidacy of the variants/genes identified here or to find supporting evidence, we attempted to cross-reference with other available datasets. We attempted where possible to identify rare variants in the genes we report here (Tables 2-4) that were either: 1. *De novo*, 2. Heterozygous LoF but status either de novo or unknown; 3. Biallelic (i.e. homozygous); 4. Putative biallelic (possibly compound heterozygous, but phasing unknown); 5. X-linked in males, maternally inherited. Data are shown in Supplementary Tables S1 to S3. View this table: [Table 2:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T2) Table 2: Biallelic/homozygous variants: Genes previously reported as pathogenic for ASD/ID or other disorder are indicated, with annotation using (in order of priority): OMIM: [https://omim.org/](https://omim.org/); SFARI ASD genes [https://gene.sfari.org/](https://gene.sfari.org/); DDD gene2phenotype genes: [https://www.ebi.ac.uk/gene2phenotype/](https://www.ebi.ac.uk/gene2phenotype/); Geisinger developmental brain disorder gene database: [https://dbd.geisingeradmi.org/](https://dbd.geisingeradmi.org/); NPdenovo: [http://www.wzgenomics.cn/NPdenovo/](http://www.wzgenomics.cn/NPdenovo/); Gene4denovo: [http://www.genemed.tech/gene4denovo/search](http://www.genemed.tech/gene4denovo/search). Genes previously unreported for ASD or ID are in bold type. Control population frequencies show are from gnomAD (all, and for South Asian sub-population). All variants were validated by IGV (Integrated Genomics Viewer; Broad Institute) and by Sanger sequencing (except for IAU80/BTN3A2, for which unique PCR primers could not be designed). Validations are shown in Supplementary Materials. View this table: [Table 3:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T3) Table 3: *De novo* variants. Genes previously reported as pathogenic for ASD/ID or other disorder are indicated, with annotation using (in order of priority): OMIM: [https://omim.org/](https://omim.org/); SFARI ASD genes [https://gene.sfari.org/](https://gene.sfari.org/); DDD gene2phenotype genes: [https://www.ebi.ac.uk/gene2phenotype/](https://www.ebi.ac.uk/gene2phenotype/); Geisinger developmental brain disorder gene database: [https://dbd.geisingeradmi.org/](https://dbd.geisingeradmi.org/); NP*denovo*: [http://www.wzgenomics.cn/NPdenovo/](http://www.wzgenomics.cn/NPdenovo/); Gene4denovo: [http://www.genemed.tech/gene4denovo/search](http://www.genemed.tech/gene4denovo/search). Genes previously unreported for ASD or ID are in bold type. Control population frequencies show are from gnomAD (all, and for South Asian sub-population). All variants were validated by IGV (Integrated Genomics Viewer; Broad Institute) and by Sanger sequencing (except for SMPA39/CLEC4E, for which DNA was exhausted). Validations are shown in Supplementary Materials. View this table: [Table 4:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T4) Table 4: X-linked variants. Genes previously reported as pathogenic for ASD/ID or other disorder are indicated, with annotation using (in order of priority): OMIM: [https://omim.org/](https://omim.org/); SFARI ASD genes [https://gene.sfari.org/](https://gene.sfari.org/); DDD gene2phenotype genes: [https://www.ebi.ac.uk/gene2phenotype/](https://www.ebi.ac.uk/gene2phenotype/); Geisinger developmental brain disorder gene database: [https://dbd.geisingeradmi.org/](https://dbd.geisingeradmi.org/); NPdenovo: [http://www.wzgenomics.cn/NPdenovo/](http://www.wzgenomics.cn/NPdenovo/); Gene4denovo: [http://www.genemed.tech/gene4denovo/search](http://www.genemed.tech/gene4denovo/search). Genes previously unreported for ASD or ID are in bold type. Control population frequencies show are from gnomAD (all, and for South Asian sub-population). All variants were validated by IGV (Integrated Genomics Viewer; Broad Institute) and by Sanger sequencing. Validations are shown in Supplementary Materials. **MSSNG:** More than 10,000 individuals from the Autism Genetic Research Exchange (AGRE) repository and other cohorts have been whole-genome sequenced, through Autism Speaks, and data made available through the MSSNG database ([https://research.mss.ng](https://research.mss.ng); (83). MSSNG data was accessed in April 2020. **SFARI: SSC biallelic:** The Simons Simplex Collection (SSC) includes 2600 simplex autism or related developmental disorder families. Whole exome sequence data is available for ∼2,500 of these families, biallelic variants for our gene list were searched using the GPF browser ([https://gpf.sfari.org/gpf19/datasets/SSC/browser](https://gpf.sfari.org/gpf19/datasets/SSC/browser)). Putative compound heterozygous variants criteria for minor allele frequency (MAF) in SSC exome, gnomAD exome and genome frequencies, of <0.001, or, for missense variants, MCP scores >1 and CADD scores >18. **SVIP biallelic:** the autism Simons Variation in Individuals Project (SVIP) dataset was searched for putative biallelic variants through the GPF browser ([https://gpf.sfari.org/gpf19/datasets/SVIP/browser](https://gpf.sfari.org/gpf19/datasets/SVIP/browser)). Putative compound heterozygous variants checked for criteria including minor allele frequency (MAF) in SSC exome, gnomAD exome and genome frequencies, of <0.001, or, for missense variants, MCP scores >1 and CADD scores >18. **Deciphering Developmental Disorders (DDD) study:** includes sequence data for ∼14,000 children; this dataset was searched for variants in genes overlapping with our study through the known developmental genes list ([https://decipher.sanger.ac.uk/ddd#ddgenes](https://decipher.sanger.ac.uk/ddd#ddgenes)), as well as the list of research variants ([https://decipher.sanger.ac.uk/ddd#research-variants](https://decipher.sanger.ac.uk/ddd#research-variants)), which are 2,723 variants of unknown significance from 4,293 children. For *de novo* variants, all are constitutive, unless specifically noted as mosaic here. CNV variants are not included here. **Autism Sequencing Consortium:** genes with *de novo* variants, either in the control set or case set, from the Autism Sequencing Consortium (ASC) exome analysis browser ([https://asc.broadinstitute.org/](https://asc.broadinstitute.org/)). This dataset lists numbers of de novo protein truncating variants, as well as de novo missense variants with MCP scores either 1-2 or ≥2. The ASC dataset includes 6,430 probands, and includes the SSC data. **DeRubeis:** genes with *de novo* variants reported in the DeRubeis et al, 2014 study, with either LoF or missense variants (84). As subject IDs were not reported, overlap with other studies recorded here is not possible. **AutismKB:** additional studies where variants have been reported for the genes are taken from the Autism Knowledge Base ([http://db.cbi.pku.edu.cn/autismkb\_v2/quick_search.php](http://db.cbi.pku.edu.cn/autismkb_v2/quick_search.php)), (85). ### Neuroanatomical Enrichment Analysis The Allen Adult Human Brain Atlas (86), Brainspan (87), and a single-cell atlas of the mouse nervous system(88) were used to test for neuroanatomically specific expression. For each gene, expression was standardized across regions or cell-types. For these compartments, each gene was then ranked from most specific to most depleted. The area under the receiver operating statistic (AUC) was used to quantify specific expression for the genes of interest within a region or cell cluster. The Mann-Whitney U test was used to test statistical significance, and the Benjamini–Hochberg procedure (89)was used to correct for multiple tests. ## Results Our study used whole exome sequence data for a total of 115 proband/mother/father trios where the proband had a diagnosis of ASD. These included 62 from Pakistan, 40 from Iran, and 13 from Saudi Arabia. Of the probands, 91 were male, 24 were female (Table 1). ### Whole exome sequencing Of the 115 trios, 18 were found to have an autosomal biallelic mutation in a gene previously reported for ASD, ID or other neurodevelopmental disorder (see Table 2). Of these, eight were loss-of-function (LoF), or putative LoF (e.g. canonical splice site) mutations. *De novo* autosomal or X-linked mutations in known or previously reported ASD, ID, or other neurodevelopmental disorder were identified in a further seven trios (see Tables 3 and 4). A known *de novo* Rett syndrome-associated mutation (NM_004992: c.763C>T; p.Arg255*) was identified in one trio. ### Biallelic mutations We identified mutations in several genes that have been previously reported to be associated with autosomal recessive non-syndromic ID, such as *TECPR2* (41), *MTHFR* (40), *MADD* (41), *BTN3A2* (41), *LINS1* (39), *CC2D1A* (MIM 608443; MRT3; (90)), *RSRC1* (91, 92) and *ZNF335* (38), and a number known for syndromic or metabolic forms of autosomal recessive ID, including *VPS13B* (Cohen syndrome), *AGA* (aspartylglucosaminuria), *ASL* (arginosuccinic aciduria), *HTRA2* (3-methylglutaconic aciduria) *ASPA* (aspartoacylase deficiency/Canavan disease), and *MED25* (Basel-Vanagaite-Smirin-Yosef syndrome). *CC2D1A*, *VPS13B*, and *DEAF1* have also previously been associated with ASD or autistic features (93–95). Biallelic mutations in *DEAF1* are known to cause neurodevelopmental disorder with hypotonia, impaired expressive language, with or without seizures (MIM 617171). LoF mutations were reported for new candidate genes *HRNR*, *ILDR2*, *SCN10A*, *DDIT4L*, *SLC36A1*, *FAXDC2*, *GIMAP8*, *DAGLA*, *B4GALNT1*, *VPS16*, and *EFCAB8*, and potentially LoF (such as canonical splice site mutations) in *DENND1B*, *LRRC34*, *PKD1L1*, *CNPY4*, and *PPP1R36*. *VPS16* has previously been reported for autosomal dominant dystonia (MIM 608559), *B4GALNT1* for AR spastic paraplegia (MIM 609195), and *PKD1L1* for visceral heterotaxy 8 (MIM 617205). Heterozygous missense and LoF mutations in *SHH* are known to be involved in autosomal dominant holoprosencephaly 3 (HPE3; MIM 142945), and so it is somewhat surprising to find a homozygous missense variant associated with a milder phenotype with no obvious HPE3-related dysmorphic features. The Asn69Ile variant in *SHH* substitutes an asparagine for isoleucine at a residue that is conserved across vertebrates, is predicted to be damaging, and is found in heterozygous form in only 6 out of 125,748 gnomAD exomes. We report a homozygous missense variant in *SGSM3*. Previously *de novo* variants in *SGSM3* have been reported for ASD (96), and SFARI lists this an ASD gene with a score of 3 ([https://gene.sfari.org/](https://gene.sfari.org/)). ### *De novo* mutations *De novo* dominant mutations have been shown to be a major causal factor in ASD etiology. In this set of 87 identified WES variants from 115 trios, 21 variants were identified as *de novo* variants (18 autosomal and 3 X-linked). Among the *de novo* variants identified, variants in three of the genes have been identified in previous studies (*MECP2*, *SCN2A*, *MYT1L*, *ZNF292*). 12 were LoF variants in the genes *MECP2*, *TMIGD3*, *ECM1*, *SLAMF7*, *MYT1L*, *SCN2A*, *NCL*, *FAM53C*, *ZNF292* (2 trios), *ATP2B1*, and *RETN* (Tables 3 and 4). Four were putative LoF (splice site or stop loss) variants, including *ADGRF2*, *DGKZ*, *CLEC4E*, and *PPIL2*, and the remainder were nonsynonymous. There were also several other genes that were listed as *de novo* variants that have not been associated with ASD (or ID) previously. These variants have met the criteria for filtering as described in the Materials and Method, and may represent novel putative variants for ASD. ### ZNF292 (Mirzaa et al, 2020(46)) Our study identified a *de novo* LoF mutation in *ZNF292* (NM_015021:c.6159_6160del; p.Glu2054Lysfs*14) in two trios (IAU-65 (Iranian) and Autism-10 (Pakistani). These two trios and 26 additional families (from multiple studies and cohorts) with mutations in *ZNF292* were reported recently (97). Out of 28 families reported in the study, the mutations were *de novo* in all but one, in which the mutation was inherited in dominant fashion. The clinical features associated with *ZNF292* mutations spanned a range of severities and diagnoses, including ASD and ID. ### X-linked A number of mutations were identified in well-established ASD or ID genes, for instance *MECP2* (Rett syndrome: MIM 312750), *AIFM1* (Cowchock syndrome; MIM 310490), *MID2* (XLID101: MIM 300928). For other trios there are mutations in genes that have been implicated in other recent studies of ASD and/or ID. For instance, we report a hemizygous nonsynonymous mutation in *CLTRN*- hemizygous mutations in *CLTRN* have recently been reported in several males with neutral aminoaciduria accompanied with autistic features, anxiety, depression, compulsions and motor tics (98)- a clinical picture reminiscent of the autosomal recessive Hartnup disorder (MIM 234500). For other trios there are variants in genes linked with other nervous system disorders, such as *DRP2* in Charcot-Marie-Tooth disease (99). ### Splice site variants Eight of the biallelic autosomal mutations (16%), four of the *de novo* autosomal variants (22%), and two of the X-linked recessive variants (11%) were at canonical splice positions. In addition to detection of variants at canonical splice donor and splice acceptor sites, use of SpliceAI suggests that, for the biallelic nonsynonymous variant in the known ASD/ID gene *ASL* in trio SA7, an alternative explanation could be the generation of an alternative splice donor by the cytosine to thymine transition at this site. However, other splice algorithms such as [https://www.fruitfly.org/seq_tools/splice.html](https://www.fruitfly.org/seq_tools/splice.html) do not support this, and molecular experimental procedures may be needed to corroborate or refute this prediction. ### Overlap with studies in non-consanguineous populations Although not focused on consanguineous populations, WGS data from the MSSNG study, as well as WES studies such as DDD and SFARI, shows possible support for several of the candidate AR genes, with possible compound heterozygous damaging rare variants in *DNAH8* (in three MSSNG individuals), *CLCA4*, *ANO10*, *SCN10A*, and *WDR90* (see Supplementary Table S1). There was also evidence of disease-causing biallelic variants in known neurodevelopmental genes *MTHFR* (one homozygous), *AGA* (one homozygous), *DEAF1* (one homozygous), *ZNF335* (one homozygous, one putative compound heterozygous), and *VPS13B* (six homozygous, nine putative compound heterozygous). There was also support for new candidate dominant/*de novo* autosomal genes, including *ECM1*, *SLAMF7*, *NCL*, *DGKZ*, *ATP2B1* (5 individuals), *CBFA2T3*, *RETN*, and *PPIL2* (see Supplementary Table S2). There was strong support for X-linked recessive (in males, maternally inherited or *de novo*) candidate gene *CNGA2* (16 individuals), *GUCY2F* (13 individuals), *SAGE1* (eight individuals), as well as support for *EGFL6* (six individuals), *NRK* (six individuals), *PIR* (five individuals), *MSN* (five individuals), *ACTRT1* (five individuals), *TRPC5* (four individuals), and *ZNF185* (two individuals). ### Copy Number Variants A number of autosomal biallelic loss CNVs were identified using microarray data, which were then validated through observation of WES reads using IGV, and through molecular methods, using PCR and/or Sanger sequencing. Of these implicated genes, *DHRS4* and *KLK15* both have supportive evidence of neurobehavioral phenotype in mouse models (Supplementary Table S5). *DNAH7* shows strong transcription in brain tissues, and both *SHPK* and *WDR73* have high protein levels in cerebellum. Of the biallelic CNVs identified, *WDR73* has previously been associated with the autosomal recessive Galloway-Mowat syndrome 1 (MIM 251300), which includes microcephaly, delayed psychomotor development and cerebellar atrophy. A biallelic *WDR73* mutation has also been reported for ID (37). In other ASD or ID genome or exome datasets, a homozygous stop-gain and two homozygous missense mutations in *SIRPB1* were found in affected individuals in the MSSNG data (1-0157-003: Chr20: 1585484C>A; NM\_001135844.2:c.655G>T:p.Glu219*; AU3915301: Chr20:1584614T>G; NM_001135844.2:c.926A>C:p.Tyr309Ser; 3-0410-000: Chr20:1585517T>A; NM_001135844.2:c.622A>T:p.Ile208Phe). None of these variants are present in the gnomAD v3.1.2 control database. For *CYP2A7*, the MSSNG data affected individuals included a homozygous frameshift insertion (1-1093-003: Chr19: 41383105dupA; NM_000764.2: c.1151dupT: p.Leu385Profs*3) and two with homozygous missense mutations (both AU4302301 and AU008505: Chr19: 41382543C>T; NM_000764.2:c.1192G>A: p.Val398Met), neither variant present in gnomAD. While there is evidence of brain expression for *SIRPB1*, expression of *CYP2A7* appears to be restricted to liver and testes (Supplementary Table S9). However, the *WRD73*, *CYP2A7*, and *KLK15* CNVs were all identified in a proband with a biallelic splice mutation in the Canavan gene *ASPA*, the *SHPK* CNV is in an individual with a biallelic LoF mutation in known ID gene *DEAF1*, and the *SIRPB1* CNV is in an individual with a biallelic non-frameshift deletion in known gene *B4GALNT1*, and are thus less likely to be the main causative variants in these individuals (see Supplementary Table S4). A biallelic CNV loss implicates the gene *SEMG1*, however expression of this gene is restricted to the seminal vesicle (see Supplementary Table S9), and thus unlikely to be related to the ASD phenotype. There are also three large, multi-genic *de novo* CNV losses among the trios, which may also be pathogenic (Table 6). View this table: [Table 5:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T5) Table 5: CNV summary: validated biallelic loss CNVs called by CNVpartition or ChAS. Biallelic losses called from microarray data by PennCNV and from WES data by CLAMMS (79)) did not validate using IGV, and hence are not included. Comparison with control population CNVs used the Database of Genomic Variants (DGV) Gold, accessed through the Decipher genomic browser ([www.deciphergenomics.org](http://www.deciphergenomics.org)), also through gnomAD. Coordinates given using hg19. View this table: [Table 6:](http://medrxiv.org/content/early/2022/01/26/2021.12.24.21268340/T6) Table 6: CNV summary: likely pathogenic, high confidence CNVs. Coordinates given using hg19. Comparison with control population CNVs used the Database of Genomic Variants (DGV) Gold, accessed through the Decipher genomic browser ([www.deciphergenomics.org](http://www.deciphergenomics.org)). ### Neuroanatomical Enrichment Analysis Testing for regional and cell-type specific expression did not indicate clear anatomical targets with higher expression of the candidate genes. Enrichment in the cerebellum and cerebral cortex was observed in the human developmental atlas, but this was not supported by either the mouse or adult human atlases. We found no consistent neuroanatomical expression pattern for the identified genes, suggesting heterogeneity of neural circuits disrupted. ## Discussion Many recent studies involving NGS in ASD have involved large cohorts, focusing predominantly on dominant/*de novo* inheritance. This is largely driven by the fact that autosomal recessive variants in novel genes are more difficult to identify in the outbred population, and informatics pipelines are not sufficiently tuned to identify compound heterozygote variants. Compound heterozygous variants would need to have variants on opposite alleles, and in order to identify them would require informatics pipelines that focus on multiple damaging heterozygous variants in the same gene in a proband, and comparing with genotypes from the maternal and paternal WES or WGS data. This requires extra care when analyzing variants. The use of genomewide association studies (GWAS) to identify variants is also inadequate for finding recessive variants because the sample size required would be significantly larger than current Psychiatric Genomics Consortium studies to have sufficient power to find biallelic variants. Also, currently most ASD genetics research emanates from North American and European countries, where larger families, which are optimal for recessive variant mapping are not as readily available. Using whole exome trio analysis in consanguineous families, we have enriched for recessive variants to assess the role these variants play in ASD in populations where endogamy is common. This study of 115 trios has identified WES variants for 28 trios in genes previously associated with ASD or other neurodevelopmental disorders, resulting in a diagnostic yield of 24%. In our ASD cohort, 18 recessive, five *de novo*, and five X-linked known genes. With a further three trios with large, multi-genic, loss CNVs validated experimentally as *de novo* and thus likely pathogenic (Table 6), this yield increases to 27%. There were many other variants identified that met filtering criteria which could potentially represent novel ASD genes or targets. Cumulatively, 18 candidate autosomal *de novo* variants were identified in the 115 trios and 50 candidate autosomal recessive variants identified in this cohort. Within this cohort we identified variants in five genes that are associated with known metabolic syndromes (4.3% of the diagnosis cohort): *AGA*, *ASL*, *HTRA2*, *ASPA,* and *MTHFR*. These genes may represent better clinical management opportunities for patients and potentially better therapies. We identified a biallelic nonsynonymous mutation in calcyphosine-like protein gene, *CAPSL*. Biallelic mutations have previously been reported for ID in both of the calcyphosine–related genes, *CAPS* (36) and *CAPS2* (41). Other than being calcium-binding EF-hand domain proteins, remarkably little is currently known about the role of calcyphosine. *EFCAB8*, in which a bialleic nonsense mutation was identified in trio SMPA9, is also a calcium-binding EF-hand domain protein. Given the possible involvement of all three calcyphosine related genes in neurodevelopmental disorders, further investigation into this biological pathway is clearly warranted. A number of the ID genes identified here among ASD trios have also previously been reported for ASD or ASD-like features, e.g. *CC2D1A* (94, 100), *VPS13B* (43), *DEAF1* (93), *ZNF335* (101), *ZNF292* (97,102,103), *MYT1L* (103, 104), *SCN2A* (96,105,106), and *PLXNA3* (107, 108). Of the candidate genes identified here, MSSNG, SFARI, and other datasets provide putative support (although it is not possible to determine whether two reported mutations, i.e. putative compound heterozygotes, in the same gene are on the same alleles or not, unless occurring within the same sequence read) for the biallelic genes *DNAH8*, *SCN10A*, *CLCA4*, *ANO10*, *WDR90* (Supplementary Table S1), and for *de novo*/dominant genes *NCL*, *SLAMF7*, *ADGRF2*, *DGKZ*, *ATP2B1*, *CBFA2T3*, *RETN*, and *PPIL2* (Supplementary Table S2). There are two genes identified here for which multiple trios from our cohort with mutations were identified: *ZNF292* and *SCN2A*. *De novo* mutations in *ZNF292* in were found in two families, including two frameshift deletions (Autism-10 and IAU65). These two trios, one from Pakistan, the other from Iran, and the frameshift mutation they share, were reported previously in Mirzaa et al, 2020 (97). *ZNF292* encodes a zinc finger protein, a transcription factor that binds to the promoter of the growth hormone gene, *GH* (109). Mutations in the known ASD gene *SCN2A* (105) were identified in two trios-one *de novo* and the other paternally inherited. In addition, a biallelic LoF mutation was identified in another voltage-gated sodium channel member (*SCN10A*), in which heterozygous gain of function missense mutations have previously linked to familial episodic pain syndrome 2 (FEPS2; MIM 615551)(110). For some of the autosomal recessive candidate genes, there is additional support from animal models. *Rasal2* (MGI:2443881) and *Vps1*6 (MGI:2136772) knockout mice have behavioral/neurological and nervous system phenotypes ([http://www.informatics.jax.org/](http://www.informatics.jax.org/); Supplementary Table S5). Note that VPS16 is a vesicle mediated trafficking protein similar to *VPS13B*. Homozygous knockout of *Ephb1* results in impaired contextual and cued conditioning, as well as abnormal freezing behaviour, and homozygous knockout of *Slc36a1* results in embryonic growth retardation, decreased freezing behaviour, and preweaning lethality ([www.mousephenotypes.org](http://www.mousephenotypes.org); Supplementary Table S5). Biallelic knockout of *Dagla* results in decreased brain size, hypoactivity, abnormal behaviour, and decreased thigmotaxis (Supplementary Table S5). In general, there is a high rate of neurodevelopmental phenotypes in knockout mice available for the genes with biallelic variants (∼70%), and in biallelic or hemizygous knockouts for the X-linked genes (∼58%). However, it is a remarkably contrasting story for the *de novo*/dominant variants, for which there is little or no support by way of neurodevelopmental or behavioural phenotypes in heterozygous mouse models (0%). Moreover, where there are biallelic knockout mouse models for genes from the *de novo*/dominant set, the phenotypes described are mainly unrelated to CNS and behaviour (<10%; Supplementary Table S5). This would suggest that the autosomal biallelic (and X-linked) mutations among our cohort are much more likely to be etiopathologically relevant to ASD. There are a number of trios for which there are two or more candidate variants (Supplementary Materials Table S4). For some of these, there may be a variant that clearly delivers a more plausible etiopathological explanation. For instance, trio SMPA3 carries a biallelic nonsynonymous change in the gene *MTHFR*, mutations in which have been reported to cause recessive homocystinuria and a known cause of neurodevelopmental and psychiatric disorders. For the same trio, there is a biallelic nonsynonymous change in *RASAL2*, and a maternally inherited hemizygous LoF mutation in *MAGEB2*, as well as a biallelic loss CNV encompassing *PCDHA9* and *10*. The likelihood of disease due to mutation in *MTHFR* thus diminishes the candidacy of alternative *RASAL2*, *MAGEB2* and *PCDHA9/10*. Similarly, for trio IAH2, a biallelic LoF in *TECPR2* – a gene which has been reported for autosomal recessive spastic paraplegia 49 (SPG49; MIM 615031) as well as for autosomal recessive ID (41) and hereditary sensory neuropathy with ID (111) – also lessens the case for the PCDHA9/10 biallelic loss CNV. For trio IABB2, the biallelic LoF mutation in *DEAF1* – a gene associated with an AR neurodevelopmental disorder with hypotonia, impaired expressive language, with or without seizures (NEDHELS; MIM 617171) – seems a highly probable candidate, and thus diminishes the possible involvement of biallelic loss CNVs at *UBE2U* and *SHPK*. The initial report for *DEAF1* was, in fact, in a consanguineous Omani family with autism, ID and epilepsy (112). A biallelic splice mutation in the Canavan disease (a severe neurodevelopmental disorder; MIM 271900) gene, *ASPA*, is the likely cause in trio IABB14, making candidate CNVs/genes *WDR73*, *KLK15*, and *CYP2A7* more likely to be benign. For other trios with two or more variants reported, the decision process was not as clear-cut or unequivocal. Seven *de novo* autosomal mutations reported for trio RQPA20 were all validated by Sanger sequencing, however, none of these candidate genes are particularly convincing in terms of the biology, support from other human disease studies, or support from mouse models. However, expression of the genes *DGKZ* and *CBFA2T3* are associated with numerous electrophysiological phenotypes (113)(Supplementary Table S6), and show high transcription and protein expression in brain regions, which could warrant further investigation into the effects of the variants identified. Since our initial methodology used relatively stringent MAF cut off criteria for biallelic variants (<0.001), we attempted to justify this level by reiterating the analysis but with lower thresholds (<0.01). Using the more relaxed criteria resulted in just two additional candidate variants, in the genes *CLCA4* (gnomAD South Asian MAF=1.72E-3), and *RASAL2* (gnomAD South Asian MAF=6E-3), but none in known neurodevelopmental genes. Our methodology was particularly stringent in regard to *de novo* nonsynonymous variants, owing to the large number of likely spurious calls, and the inclusion of only variants predicted as damaging by all algorithms used may have excluded some *bona fide* mutations. Other variants, particularly intronic or intergenic, would also likely be missed by our approach. Also, a proportion of exons, and particularly in GC-rich regions, are either missed or have low coverage in WES. Follow up with whole genome sequencing could be considered as a possible next step. However, parsing and assessing rare non-coding variants is particularly challenging. It was recently estimated in the DDD cohort that recessive variants make up 3.6% of diagnoses, whereas *de novo* variants would contribute 48.6% of variants (44). The same study also compared homozygous variants in a narrowly defined group of Pakistani Ancestry in the British Isles (PABI), which is most like our cohort (44). The PABI cohort had 356 (333 undiagnosed) probands, 110 of whom (30.9%) had biallelic coding candidate variants, approximately half of which are in known DD genes and half in novel candidates (44). For comparison, our study reports very similar yields for recessive variants, with 16% of probands with biallelic variants in known NDD genes (18/115), or 34% including those with biallelic variants in novel candidate genes (39/115). We report 24 *de novo* variants (18 autosomal, three X-linked, three CNVs) in 15 of 115 individuals (13%), which is somewhat lower than those reported for the PABI cohort (29.8%). This difference is most likely due to differences in the methodologies or stringency used for calling *de novo* variants in the respective studies. Overall, the findings demonstrate the importance of autosomal recessive mutations in ASD in countries, or in cohorts, with high rates of consanguinity, as well as identifying genes that should be examined for autozygous *and* allozygous mutations in outbred populations, just as would be done for more established autosomal recessive gene disorders. ## Supporting information Supplementary materials [[supplements/268340_file02.pdf]](pending:yes) Supplementary tables [[supplements/268340_file03.xlsx]](pending:yes) ## Data Availability All data produced in the present study are available upon reasonable request to the authors ## Conflict of Interest Statement The authors declare no conflicts of interest in relation to the current study. ## Acknowledgements The authors thank the families for their participation in this study. This work was supported by grants from the Canadian Institutes of Health Research to JBV (#MOP-102758 and #PJT-156402), also a National Alliance for Research on Schizophrenia and Depression (NARSAD) Young Investigator Award to NV. RH was supported by a Peterborough K.M. Hunter Charitable Foundation Graduate Scholarship. AR and AH were supported by International Research Fellowship Program scholarships from the Pakistani Higher Education Commission. CS was supported by a Summer University Research Program award from the Institute of Medical Science, University of Toronto. Computations were performed on the CAMH Specialized Computing Cluster. The SCC is funded by The Canada Foundation for Innovation, Research Hospital Fund. The authors wish to acknowledge the resources of MSSNG (research.mss.ng), Autism Speaks and The Centre for Applied Genomics at The Hospital for Sick Children, Toronto, Canada. We also thank the participating families for their time and contributions to this database, as well as the generosity of the donors who supported this program. DDD: The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003], a parallel funding partnership between Wellcome and the Department of Health, and the Wellcome Sanger Institute [grant number WT098051]. The views expressed in this publication are those of the author(s) and not necessarily those of Wellcome or the Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC). The research team acknowledges the support of the National Institute for Health Research, through the Comprehensive Clinical Research Network. ## Footnotes * § Equal contribution * Received December 24, 2021. * Revision received January 25, 2022. * Accepted January 26, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Helt M, Kelley E, Kinsbourne M, Pandey J, Boorstein H, Herbert M, et al. Can children with autism recover? If so, how? Neuropsychol Rev. 2008 Dec;18(4):339–66. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11065-008-9075-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19009353&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000262858300007&link_type=ISI) 2. 2.Mason D, Capp SJ, Stewart GR, Kempton MJ, Glaser K, Howlin P, et al. A Meta-analysis of Outcome Studies of Autistic Adults: Quantifying Effect Size, Quality, and Meta-regression. J Autism Dev Disord. 2021 Sep;51(9):3165–79. 3. 3.Lai M-C, Kassee C, Besney R, Bonato S, Hull L, Mandy W, et al. Prevalence of co-occurring mental health diagnoses in the autism population: a systematic review and meta-analysis. Lancet Psychiatry. 2019 Oct;6(10):819–29. 4. 4.Baxter AJ, Brugha TS, Erskine HE, Scheurer RW, Vos T, Scott JG. The epidemiology and global burden of autism spectrum disorders. Psychol Med. 2015 Feb;45(3):601–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S003329171400172X&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25108395&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 5. 5.Elsabbagh M, Divan G, Koh Y-J, Kim YS, Kauchali S, Marcín C, et al. Global Prevalence of Autism and Other Pervasive Developmental Disorders. Autism Res. 2012 Jun;5(3):160–79. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/aur.239&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22495912&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000305072800003&link_type=ISI) 6. 6.Baird G, Simonoff E, Pickles A, Chandler S, Loucas T, Meldrum D, et al. Prevalence of disorders of the autism spectrum in a population cohort of children in South Thames: the Special Needs and Autism Project (SNAP). Lancet Lond Engl. 2006 Jul 15;368(9531):210–5. 7. 7.Developmental Disabilities Monitoring Network Surveillance Year 2010 Principal Investigators, Centers for Disease Control and Prevention (CDC). Prevalence of autism spectrum disorder among children aged 8 years - autism and developmental disabilities monitoring network, 11 sites, United States, 2010. Morb Mortal Wkly Rep Surveill Summ Wash DC 2002. 2014 Mar 28;63(2):1–21. 8. 8.Kim YS, Leventhal BL, Koh Y-J, Fombonne E, Laska E, Lim E-C, et al. Prevalence of autism spectrum disorders in a total population sample. Am J Psychiatry. 2011 Sep;168(9):904– 12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.2011.10101532&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21558103&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000294484100010&link_type=ISI) 9. 9.Lundström S, Reichenberg A, Anckarsäter H, Lichtenstein P, Gillberg C. Autism phenotype versus registered diagnosis in Swedish children: prevalence trends over 10 years in general population samples. BMJ. 2015 Apr 28;350:h1961. 10. 10.Christensen DL, Braun KVN, Baio J, Bilder D, Charles J, Constantino JN, et al. Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2012. Morb Mortal Wkly Rep Surveill Summ Wash DC 2002. 2018 Nov 16;65(13):1–23. 11. 11.Maenner MJ. Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years — Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2016. MMWR Surveill Summ [Internet]. 2020 [cited 2021 Jun 5];69. Available from: [https://www.cdc.gov/mmwr/volumes/69/ss/ss6904a1.htm](https://www.cdc.gov/mmwr/volumes/69/ss/ss6904a1.htm) 12. 12.Fombonne E. Editorial: The rising prevalence of autism. J Child Psychol Psychiatry. 2018;59(7):717–20. 13. 13.Hansen SN, Schendel DE, Parner ET. Explaining the increase in the prevalence of autism spectrum disorders: the proportion attributable to changes in reporting practices. JAMA Pediatr. 2015 Jan;169(1):56–62. 14. 14.Lord C, Brugha T, Charman T, Cusack J, Dumas G, Frazier T, et al. Autism spectrum disorder. Nat Rev Dis Primer. 2020 Jan 16;6(1):5. 15. 15.Rødgaard E-M, Jensen K, Vergnes J-N, Soulières I, Mottron L. Temporal Changes in Effect Sizes of Studies Comparing Individuals With and Without Autism: A Meta-analysis. JAMA Psychiatry. 2019 Nov 1;76(11):1124–32. 16. 16.Arvidsson O, Gillberg C, Lichtenstein P, Lundström S. Secular changes in the symptom level of clinically diagnosed autism. J Child Psychol Psychiatry. 2018 Jul;59(7):744–51. 17. 17.Hertz-Picciotto I, Schmidt RJ, Krakowiak P. Understanding environmental contributions to autism: Causal concepts and the state of science. Autism Res Off J Int Soc Autism Res. 2018 Apr;11(4):554–86. 18. 18.Bölte S, Girdler S, Marschik PB. The contribution of environmental exposure to the etiology of autism spectrum disorder. Cell Mol Life Sci CMLS. 2019 Apr;76(7):1275–97. 19. 19.Modabbernia A, Velthorst E, Reichenberg A. Environmental risk factors for autism: an evidence-based review of systematic reviews and meta-analyses. Mol Autism. 2017;8:13. 20. 20.Taylor MJ, Rosenqvist MA, Larsson H, Gillberg C, D’Onofrio BM, Lichtenstein P, et al. Etiology of Autism Spectrum Disorders and Autistic Traits Over Time. JAMA Psychiatry. 2020 Sep 1;77(9):936–43. 21. 21.Ramaswami G, Geschwind DH. Genetics of autism spectrum disorder. Handb Clin Neurol. 2018;147:321–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/B978-0-444-63233-3.00021-X&link_type=DOI) 22. 22.Hossain MD, Ahmed HU, Jalal Uddin MM, Chowdhury WA, Iqbal MS, Kabir RI, et al. Autism Spectrum disorders (ASD) in South Asia: a systematic review. BMC Psychiatry. 2017 Aug 1;17(1):281. 23. 23.Imran N, Bhatti MR, Anwar A, Najmi F, Haider II. Children’s Mental Health: Pattern of referral, distribution of disorders and service use in child psychiatry outpatient setting. Pak J Med Sci. 2012;28(1):22–6. 24. 24.Syed EU, Hussein SA, Yousafzai AW. Developing Services with Limited Resources: Establishing a CAMHS in Pakistan. Child Adolesc Ment Health. 2007;12(3):121–4. 25. 25.Tareen A, Mirza I, Minhas A, Minhas F, Rahman A. Developing a child and adolescent mental health service in a low-income country: a global partnership model. Psychiatr Bull. 2009 May;33(5):181–3. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToicGJyY3BzeWNoIjtzOjU6InJlc2lkIjtzOjg6IjMzLzUvMTgxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDEvMjYvMjAyMS4xMi4yNC4yMTI2ODM0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 26. 26.Sarwat A, Ali SMI, Ejaz MS. Mental health comorbidity in children: a hospital based study in child psychiatry clinic. Pak J Med Sci. 2009;25(6):982–5. 27. 27.Minhas A, Vajaratkar V, Divan G, Hamdani SU, Leadbitter K, Taylor C, et al. Parents’ perspectives on care of children with autistic spectrum disorder in South Asia – Views from Pakistan and India. Int Rev Psychiatry. 2015 May 4;27(3):247–56. 28. 28.Rauf NK, Anis-ul-Haq null, Aslam N, Anjum U. Characteristic symptoms and adaptive behaviors of children with autism. J Coll Physicians Surg--Pak JCPSP. 2014 Sep;24(9):658–62. 29. 29.Mamidala MP, Kalikiri MK, Praveen Kumar PTV, Rajesh N, Vallamkonda OR, Rajesh V. Consanguinity in India and its association with autism spectrum disorder. Autism Res Off J Int Soc Autism Res. 2015 Apr;8(2):224–8. 30. 30.Morrow EM, Yoo S-Y, Flavell SW, Kim T-K, Lin Y, Hill RS, et al. Identifying autism loci and genes by tracing recent shared ancestry. Science. 2008 Jul 11;321(5886):218–23. 31. 31.Lim ET, Raychaudhuri S, Sanders SJ, Stevens C, Sabo A, MacArthur DG, et al. Rare complete knockouts in humans: population distribution and significant role in autism spectrum disorders. Neuron. 2013 Jan 23;77(2):235–42. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2012.12.029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23352160&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000314199400006&link_type=ISI) 32. 32.Howrigan DP, Simonson MA, Davies G, Harris SE, Tenesa A, Starr JM, et al. Genome-wide autozygosity is associated with lower general cognitive ability. Mol Psychiatry. 2016;21(6):837–43. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2015.120&link_type=DOI) 33. 33.Kaiser VB, Svinti V, Prendergast JG, Chau Y-Y, Campbell A, Patarcic I, et al. Homozygous loss-of-function variants in European cosmopolitan and isolate populations. Hum Mol Genet. 2015 Oct 1;24(19):5464–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddv272&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26173456&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 34. 34.Bundey S, Alam H. A five-year prospective study of the health of children in different ethnic groups, with particular reference to the effect of inbreeding. Eur J Hum Genet EJHG. 1993;1(3):206–19. 35. 35.Modell B, Darr A. Science and society: genetic counselling and customary consanguineous marriage. Nat Rev Genet. 2002 Mar;3(3):225–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg754&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11972160&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000174159100019&link_type=ISI) 36. 36.Harripaul R, Vasli N, Mikhailov A, Rafiq MA, Mittal K, Windpassinger C, et al. Mapping autosomal recessive intellectual disability: combined microarray and exome sequencing identifies 26 novel candidate genes in 192 consanguineous families. Mol Psychiatry. 2018;23(4):973–84. 37. 37.Riazuddin S, Hussain M, Razzaq A, Iqbal Z, Shahzad M, Polla DL, et al. Exome sequencing of Pakistani consanguineous families identifies 30 novel candidate genes for recessive intellectual disability. Mol Psychiatry. 2017 Nov;22(11):1604–14. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2016.109&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 38. 38.Hu H, Kahrizi K, Musante L, Fattahi Z, Herwig R, Hosseini M, et al. Genetics of intellectual disability in consanguineous families. Mol Psychiatry. 2018 Jan 4; 39. 39.Najmabadi H, Hu H, Garshasbi M, Zemojtel T, Abedini SS, Chen W, et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature. 2011 Sep 21;478(7367):57–63. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature10423&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21937992&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000295575400034&link_type=ISI) 40. 40.Reuter MS, Tawamie H, Buchert R, Hosny Gebril O, Froukh T, Thiel C, et al. Diagnostic Yield and Novel Candidate Genes by Exome Sequencing in 152 Consanguineous Families With Neurodevelopmental Disorders. JAMA Psychiatry. 2017 Mar 1;74(3):293–9. 41. 41.Anazi S, Maddirevula S, Salpietro V, Asi YT, Alsahli S, Alhashem A, et al. Expanding the genetic heterogeneity of intellectual disability. Hum Genet. 2017 Nov;136(11–12):1419– 29. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-017-1843-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28940097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 42. 42.Al-Mubarak B, Abouelhoda M, Omar A, AlDhalaan H, Aldosari M, Nester M, et al. Whole exome sequencing reveals inherited and de novo variants in autism spectrum disorder: a trio study from Saudi families. Sci Rep. 2017 Jul 18;7(1):5679. 43. 43.Yu TW, Chahrour MH, Coulter ME, Jiralerspong S, Okamura-Ikeda K, Ataman B, et al. Using whole-exome sequencing to identify inherited causes of autism. Neuron. 2013 Jan 23;77(2):259–73. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2012.11.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23352163&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000314199400009&link_type=ISI) 44. 44.Martin HC, Jones WD, McIntyre R, Sanchez-Andrade G, Sanderson M, Stephenson JD, et al. Quantifying the contribution of recessive coding variation to developmental disorders. Science. 2018 07;362(6419):1161–4. 45. 45.Kahrizi K, Hu H, Hosseini M, Kalscheuer VM, Fattahi Z, Beheshtian M, et al. Effect of inbreeding on intellectual disability revisited by trio sequencing. Clin Genet. 2019;95(1):151–9. 46. 46.Mirzaa GM, Chong JX, Piton A, Popp B, Foss K, Guo H, et al. De novo and inherited variants in ZNF292 underlie a neurodevelopmental disorder with features of autism spectrum disorder. Genet Med. 2020 Mar;22(3):538–46. 47. 47.Sasanfar R, Haddad SA, Tolouei A, Ghadami M, Yu D, Santangelo SL. Paternal age increases the risk for autism in an Iranian population sample. Mol Autism. 2010 Feb 22;1(1):2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/2040-2392-1-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20678245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 48. 48.Rutter, Michael, Bailey A, Lord C, Cianchetti C, Fancello GS. SCQ: Social Communication Questionnaire: Manuale. OS: Giunti; 2007. 49. 49.American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, Text Revision. 4th edition. Washington, DC: American Psychiatric Association; 2000. 943 p. 50. 50.Lord C, Rutter M, Le Couteur A. Autism Diagnostic Interview-Revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J Autism Dev Disord. 1994 Oct;24(5):659–85. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02172145&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7814313&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1994PM02100009&link_type=ISI) 51. 51.Lord C, Rutter M, Goode S, Heemsbergen J, Jordan H, Mawhood L, et al. Autism diagnostic observation schedule: a standardized observation of communicative and social behavior. J Autism Dev Disord. 1989 Jun;19(2):185–212. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02211841&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2745388&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1989AC89500001&link_type=ISI) 52. 52.Schopler E, Reichler RJ, DeVellis RF, Daly K. Toward objective classification of childhood autism: Childhood Autism Rating Scale (CARS). J Autism Dev Disord. 1980 Mar;10(1):91– 103. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02408436&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=6927682&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1980JT61700009&link_type=ISI) 53. 53.American Psychiatric Association, American Psychiatric Association, DSM-5 Task Force. Diagnostic and statistical manual of mental disorders: DSM-5. [Internet]. 2013 [cited 2016 Dec 14]. Available from: [http://dsm.psychiatryonline.org/book.aspx?bookid=556](http://dsm.psychiatryonline.org/book.aspx?bookid=556) 54. 54.World Health Organization. The ICD-10 Classification of Mental and Behavioural Disorders [Internet]. World Health Organization; 2010. Available from: [https://www.who.int/classifications/icd/en/bluebook.pdf](https://www.who.int/classifications/icd/en/bluebook.pdf) 55. 55.Skuse D, Warrington R, Bishop D, Chowdhury U, Lau J, Mandy W, et al. The developmental, dimensional and diagnostic interview (3di): a novel computerized assessment for autism spectrum disorders. J Am Acad Child Adolesc Psychiatry. 2004 May;43(5):548–58. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00004583-200405000-00008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15100561&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000221014800008&link_type=ISI) 56. 56.FASTX-Toolkit [Internet]. [cited 2021 Dec 18]. Available from: [http://hannonlab.cshl.edu/fastx\_toolkit/](http://hannonlab.cshl.edu/fastx_toolkit/) 57. 57.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinforma Oxf Engl. 2014 Aug 1;30(15):2114–20. 58. 58.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv13033997 Q-Bio [Internet]. 2013 May 26 [cited 2021 Dec 18]; Available from: [http://arxiv.org/abs/1303.3997](http://arxiv.org/abs/1303.3997) 59. 59.DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011 May;43(5):491–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.806&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21478889&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289972600023&link_type=ISI) 60. 60.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinforma Oxf Engl. 2009 Aug 15;25(16):2078–9. 61. 61.Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinforma. 2013;43:11.10.1–11.10.33. 62. 62.Yang H, Wang K. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR. Nat Protoc. 2015 Oct;10(10):1556–66. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nprot.2015.105&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26379229&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 63. 63.Wang Q, Pierce-Hoffman E, Cummings BB, Alföldi J, Francioli LC, Gauthier LD, et al. Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes. Nat Commun. 2020 May 27;11(1):2539. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-019-12438-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32461613&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 64. 64.Scott EM, Halees A, Itan Y, Spencer EG, He Y, Azab MA, et al. Characterization of Greater Middle Eastern genetic variation for enhanced disease gene discovery. Nat Genet. 2016;48(9):1071–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3592&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 65. 65.Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016 18;536(7616):285–91. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature19057&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27535533&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000381804900026&link_type=ISI) 66. 66.Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet. 2013 Jan;Chapter 7:Unit7.20. 67. 67.Ng PC, Henikoff S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003 Jul 1;31(13):3812–4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkg509&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12824425&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183832900117&link_type=ISI) 68. 68.Schwarz JM, Rödelsperger C, Schuelke M, Seelow D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods. 2010 Aug;7(8):575–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth0810-575&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20676075&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000280500000014&link_type=ISI) 69. 69.Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011 Sep 1;39(17):e118. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkr407&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21727090&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000295184800007&link_type=ISI) 70. 70.Shihab HA, Rogers MF, Gough J, Mort M, Cooper DN, Day INM, et al. An integrative approach to predicting the functional effects of non-coding and coding sequence variation. Bioinforma Oxf Engl. 2015 May 15;31(10):1536–43. 71. 71.Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019 Jan 8;47(D1):D886–94. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky1016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 72. 72.Jian X, Boerwinkle E, Liu X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res. 2014 Dec 16;42(22):13534–44. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gku1206&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25416802&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 73. 73.Jagadeesh KA, Wenger AM, Berger MJ, Guturu H, Stenson PD, Cooper DN, et al. M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity. Nat Genet. 2016 Dec;48(12):1581–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3703&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27776117&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 74. 74.Song L, Sabunciyan S, Florea L. CLASS2: accurate and efficient splice variant annotation from RNA-seq reads. Nucleic Acids Res. 2016 Jun 2;44(10):e98. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkw158&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26975657&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 75. 75.Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF, Darbandi SF, Knowles D, Li YI, et al. Predicting Splicing from Primary Sequence with Deep Learning. Cell. 2019 Jan 24;176(3):535–548.e24. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 76. 76.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010 Sep;38(16):e164. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkq603&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20601685&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 77. 77.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020 May;581(7809):434–43. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2308-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32461654&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 78. 78.Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011 Jan;29(1):24–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nbt.1754&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21221095&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286048900013&link_type=ISI) 79. 79.Packer JS, Maxwell EK, O’Dushlaine C, Lopez AE, Dewey FE, Chernomorsky R, et al. CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data. Bioinformatics. 2016 Jan 1;32(1):133–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btv547&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26382196&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 80. 80.Fromer M, Purcell SM. Using XHMM Software to Detect Copy Number Variation in Whole-Exome Sequencing Data. Curr Protoc Hum Genet. 2014 Apr 24;81:7.23.1–21. 81. 81.Krumm N, Sudmant PH, Ko A, O’Roak BJ, Malig M, Coe BP, et al. Copy number variation detection and genotyping from exome sequence data. Genome Res. 2012 Aug;22(8):1525–32. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjIyLzgvMTUyNSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAxLzI2LzIwMjEuMTIuMjQuMjEyNjgzNDAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 82. 82.Collins RL, Brand H, Karczewski KJ, Zhao X, Alföldi J, Francioli LC, et al. A structural variation reference for medical and population genetics. Nature. 2020 May;581(7809):444–51. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2287-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32461652&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 83. 83.Yuen RKC, Merico D, Cao H, Pellecchia G, Alipanahi B, Thiruvahindrapuram B, et al. Genome-wide characteristics ofde novomutations in autism. NPJ Genomic Med. 2016 Aug 3;1:160271–1602710. 84. 84.De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Cicek AE, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014 Nov 13;515(7526):209–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature13772&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25363760&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000344631400038&link_type=ISI) 85. 85.Yang C, Li J, Wu Q, Yang X, Huang AY, Zhang J, et al. AutismKB 2.0: a knowledgebase for the genetic evidence of autism spectrum disorder. Database J Biol Databases Curation. 2018 Jan 1;2018. 86. 86.Hawrylycz MJ, Lein ES, Guillozet-Bongaarts AL, Shen EH, Ng L, Miller JA, et al. An anatomically comprehensive atlas of the adult human brain transcriptome. Nature. 2012 Sep 20;489(7416):391–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature11405&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22996553&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000308860900037&link_type=ISI) 87. 87.Li M, Santpere G, Imamura Kawasawa Y, Evgrafov OV, Gulden FO, Pochareddy S, et al. Integrative functional genomic analysis of human brain development and neuropsychiatric risks. Science. 2018 14;362(6420). 88. 88.Zeisel A, Hochgerner H, Lönnerberg P, Johnsson A, Memic F, van der Zwan J, et al. Molecular Architecture of the Mouse Nervous System. Cell. 2018 Aug 9;174(4):999–1014.e22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2018.06.021&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30096314&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 89. 89.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc Ser B Methodol. 1995 Jan;57(1):289–300. 90. 90.Basel-Vanagaite L, Attia R, Yahav M, Ferland RJ, Anteki L, Walsh CA, et al. The CC2D1A, a member of a new gene family with C2 domains, is involved in autosomal recessive non-syndromic mental retardation. J Med Genet. 2006 Mar;43(3):203–10. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjg6IjQzLzMvMjAzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDEvMjYvMjAyMS4xMi4yNC4yMTI2ODM0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 91. 91.Perez Y, Menascu S, Cohen I, Kadir R, Basha O, Shorer Z, et al. RSRC1 mutation affects intellect and behaviour through aberrant splicing and transcription, downregulating IGFBP3. Brain J Neurol. 2018 Apr 1;141(4):961–70. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/brain/awy045&link_type=DOI) 92. 92.Maddirevula S, AlZahrani F, Anazi S, Almureikhi M, Ben-Omran T, Abdel-Salam GMH, et al. GWAS signals revisited using human knockouts. Genet Med Off J Am Coll Med Genet. 2018 Jan;20(1):64–8. 93. 93. Nabais Sá MJ, Jensik PJ, McGee SR, Parker MJ, Lahiri N, McNeil EP, et al. De novo and biallelic DEAF1 variants cause a phenotypic spectrum. Genet Med Off J Am Coll Med Genet. 2019 Sep;21(9):2059–69. 94. 94.Manzini MC, Xiong L, Shaheen R, Tambunan DE, Di Costanzo S, Mitisalis V, et al. CC2D1A regulates human intellectual and social function as well as NF-κB signaling homeostasis. Cell Rep. 2014 Aug 7;8(3):647–55. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.celrep.2014.06.039&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25066123&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 95. 95.Douzgou S, Petersen MB. Clinical variability of genetic isolates of Cohen syndrome. Clin Genet. 2011 Jun;79(6):501–6. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21418059&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 96. 96.Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, Willsey AJ, et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature. 2012 Apr 4;485(7397):237–41. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature10945&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22495306&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000303799800041&link_type=ISI) 97. 97.Mirzaa GM, Chong JX, Piton A, Popp B, Foss K, Guo H, et al. De novo and inherited variants in ZNF292 underlie a neurodevelopmental disorder with features of autism spectrum disorder. Genet Med Off J Am Coll Med Genet. 2019 Nov 14; 98. 98.Pillai NR, Yubero D, Shayota BJ, Oyarzábal A, Ghosh R, Sun Q, et al. Loss of CLTRN function produces a neuropsychiatric disorder and a biochemical phenotype that mimics Hartnup disease. Am J Med Genet A. 2019 Dec;179(12):2459–68. 99. 99.Berciano J, García A, Gallardo E, Peeters K, Pelayo-Negro AL, Álvarez-Paradelo S, et al. Intermediate Charcot-Marie-Tooth disease: an electrophysiological reappraisal and systematic review. J Neurol. 2017 Aug;264(8):1655–77. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 100.100.Sener EF, Onal MG, Dal F, Nalbantoglu U, Ozkul Y, Canatan H, et al. Novel alterations of CC2D1A as a candidate gene in a Turkish sample of patients with autism spectrum disorder. Int J Neurosci. 2020 Dec 28;1–8. 101.101.Wu J, Yu P, Jin X, Xu X, Li J, Li Z, et al. Genomic landscapes of Chinese sporadic autism spectrum disorders revealed by whole-genome sequencing. J Genet Genomics Yi Chuan Xue Bao. 2018 Oct 20;45(10):527–38. 102.102.Guo H, Wang T, Wu H, Long M, Coe BP, Li H, et al. Inherited and multiple de novo mutations in autism/developmental delay risk genes suggest a multifactorial model. Mol Autism. 2018;9:64. 103.103.Wang T, Guo H, Xiong B, Stessman HAF, Wu H, Coe BP, et al. De novo genic mutations among a Chinese autism spectrum disorder cohort. Nat Commun. 2016 Nov 8;7:13316. 104.104.Coursimault J, Guerrot A-M, Morrow MM, Schramm C, Zamora FM, Shanmugham A, et al. MYT1L-associated neurodevelopmental disorder: description of 40 new cases and literature review of clinical and molecular aspects. Hum Genet. 2021 Nov 8; 105.105.Weiss LA, Escayg A, Kearney JA, Trudeau M, MacDonald BT, Mori M, et al. Sodium channels SCN1A, SCN2A and SCN3A in familial autism. Mol Psychiatry. 2003 Feb;8(2):186–94. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/sj.mp.4001241&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12610651&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000181195900012&link_type=ISI) 106.106.Jiang Y, Yuen RKC, Jin X, Wang M, Chen N, Wu X, et al. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. Am J Hum Genet. 2013 Aug 8;93(2):249–63. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2013.06.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23849776&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 107.107.Steele JL, Morrow MM, Sarnat HB, Alkhunaizi E, Brandt T, Chitayat DA, et al. Semaphorin-Plexin Signaling: From Axonal Guidance to a New X-Linked Intellectual Disability Syndrome. Pediatr Neurol. 2021 Oct 18;126:65–73. 108.108.Wang T, Zhang Y, Liu L, Wang Y, Chen H, Fan T, et al. Targeted sequencing and integrative analysis of 3,195 Chinese patients with neurodevelopmental disorders prioritized 26 novel candidate genes. J Genet Genomics Yi Chuan Xue Bao. 2021 Apr 20;48(4):312–23. 109.109.Lipkin SM, Näär AM, Kalla KA, Sack RA, Rosenfeld MG. Identification of a novel zinc finger protein binding a conserved element critical for Pit-1-dependent growth hormone gene expression. Genes Dev. 1993 Sep;7(9):1674–87. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXNkZXYiO3M6NToicmVzaWQiO3M6ODoiNy85LzE2NzQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wMS8yNi8yMDIxLjEyLjI0LjIxMjY4MzQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 110.110.Faber CG, Lauria G, Merkies ISJ, Cheng X, Han C, Ahn H-S, et al. Gain-of-function Nav1.8 mutations in painful neuropathy. Proc Natl Acad Sci U S A. 2012 Nov 20;109(47):19444–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTA5LzQ3LzE5NDQ0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDEvMjYvMjAyMS4xMi4yNC4yMTI2ODM0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 111.111.Neuser S, Brechmann B, Heimer G, Brösse I, Schubert S, O’Grady L, et al. Clinical, neuroimaging, and molecular spectrum of TECPR2-associated hereditary sensory and autonomic neuropathy with intellectual disability. Hum Mutat. 2021 Jun;42(6):762–76. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 112.112.Rajab A, Schuelke M, Gill E, Zwirner A, Seifert F, Morales Gonzalez S, et al. Recessive DEAF1 mutation associates with autism, intellectual disability, basal ganglia dysfunction and epilepsy. J Med Genet. 2015 Sep;52(9):607–11. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjg6IjUyLzkvNjA3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDEvMjYvMjAyMS4xMi4yNC4yMTI2ODM0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 113.113.Bomkamp C, Tripathy SJ, Bengtsson Gonzales C, Hjerling-Leffler J, Craig AM, Pavlidis P. Transcriptomic correlates of electrophysiological and morphological diversity within and across excitatory and inhibitory neuron classes. PLoS Comput Biol. 2019 Jun;15(6):e1007113. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1007113&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31211786&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 114.114.Anazi S, Maddirevula S, Salpietro V, Asi YT, Alsahli S, Alhashem A, et al. Expanding the genetic heterogeneity of intellectual disability. Hum Genet. 2017 Nov;136(11–12):1419– 29. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-017-1843-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28940097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom) 115.115.Kaiser VB, Svinti V, Prendergast JG, Chau Y-Y, Campbell A, Patarcic I, et al. Homozygous loss-of-function variants in European cosmopolitan and isolate populations. Hum Mol Genet. 2015 Oct 1;24(19):5464–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddv272&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26173456&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F01%2F26%2F2021.12.24.21268340.atom)