Literature DB >> 36199823

Integrative Functional Genomic Analysis in Multiplex Autism Families from Kazakhstan.

Anastassiya Perfilyeva1, Kira Bespalova1,2, Yuliya Perfilyeva3,4, Liliya Skvortsova1, Lyazzat Musralina1, Gulnur Zhunussova1, Elmira Khussainova1, Ulzhan Iskakova5, Bakhytzhan Bekmanov1, Leyla Djansugurova1.   

Abstract

The study of extended pedigrees containing autism spectrum disorder- (ASD-) related broader autism phenotypes (BAP) offers a promising approach to the search for ASD candidate variants. Here, a total of 650,000 genetic markers were tested in four Kazakhstani multiplex families with ASD and BAP to obtain data on de novo mutations (DNMs), common, and rare inherited variants that may contribute to the genetic risk for developing autistic traits. The variants were analyzed in the context of gene networks and pathways. Several previously well-described enriched pathways were identified, including ion channel activity, regulation of synaptic function, and membrane depolarization. Perhaps these pathways are crucial not only for the development of ASD but also for ВАР. The results also point to several additional biological pathways (circadian entrainment, NCAM and BTN family interactions, and interaction between L1 and Ankyrins) and hub genes (CFTR, NOD2, PPP2R2B, and TTR). The obtained results suggest that further exploration of PPI networks combining ASD and BAP risk genes can be used to identify novel or overlooked ASD molecular mechanisms.
Copyright © 2022 Anastassiya Perfilyeva et al.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36199823      PMCID: PMC9529466          DOI: 10.1155/2022/1509994

Source DB:  PubMed          Journal:  Dis Markers        ISSN: 0278-0240            Impact factor:   3.464


1. Introduction

ASD is a spectrum of psychological characteristics that describe a wide range of abnormal behavior and difficulties in social cooperation and communication, as well as severely restricted interests and frequently repetitive behaviors. Relevance of the ASD problem arises from the high incidence of this pathology all over the world, including Kazakhstan. According to official data, in 2021, there were 4,887 children with ASD in Kazakhstan, but experts believe that this indicator is ten times higher. According to the statistics of WHO and CDC, there are at least 30,000 children with ASD in Kazakhstan (https://inbusiness.kz/ru/last/v-kazahstane-30-tysyach-detej-stradayut-autizmom). The etiology of this pathology is extremely difficult and is probably determined by a combination of genetic susceptibility and environmental factors. Determining the specific contribution of these factors to ASD is difficult due to the lack of population-based, longitudinal evidence necessary to establish conclusive links between exposure, genotypic responses, and phenotypic consequences [1]. Some studies steered the debate toward the greater importance of environmental factors rather than a genetic predisposition to ASD [2, 3]. Other studies showed little support for general environmental influences [4, 5]. Most recent studies suggest that environmental exposures may be a catalyst for deleterious DNMs leading to ASD [1], whereas genetic factors are considered the predominant causes of ASD [6, 7]. A strong contribution of heritable factors in the etiology of ASD is supported by twin studies and studies of first-degree relatives. Indeed, the risk of a child being diagnosed with ASD is increased at least 25-fold in a family where a brother or sister has already been diagnosed with autism [8]. Independent twin studies show concordance rates of 60-92% in monozygotic twins versus 0-10% in dizygotic twins [9, 10]. If only one child in a family has ASD, the other twin may have delayed speech, reading, and spelling difficulties [11]. This study by Folstein et al. of siblings and parents of affected children with mild cognitive and behavioral impairments led to the concept of the BAP [11]. Apparently, ASD families with multiple occurrences and relatives with BAP have a higher genetic loading for ASD [12], making them a good model for studies when environmental factors are excluded or have minimal influence. Such families are not uncommon in ASD, and several studies of such pedigrees have been published [13-16]. The prevalence of BAP in ASD families is also not low. A large-scale study by Sasson et al. estimates that the prevalence rate of BAP among parents of children with ASD ranges from 14 to 23% [17]. A meta-analysis of twin studies found that there is no disruption between ASD and BAP in genetic modeling, suggesting that ASD as a disorder can be conceptualized as the extreme of BAP symptoms/behaviors [18]. If this is the case, the inclusion of individuals with BAP in a study of multiplex families should increase the power of the study to determine the genetic structure of ASD [16, 19]. A complex understanding of the genetic structure of ASD requires unbiased knowledge of the number of risk loci, their penetrance, and allele frequencies [19]. The collected data to date provided conclusive evidence for three categories of genetic structure, including common SNPs (MAF > 1%), inherited rare variants (MAF < 1%), and DNMs that have been identified in the proband and are not found in the genome of the biological parents [20]. Genetic models suggest that at least 50% of the variance in ASD may be due to common inherited variants [21], which act in aggregate while having little effect individually. Despite evidence for a significant role of common variants in ASD risk, rare genetic variations may be associated with higher individual risk [22]. Maintenance of genetic susceptibility to ASD despite reduced transmission of risk variants may be due to DNMs [23, 24]. The relative contribution of spontaneous DNMs to the ASD etiology is estimated between 5 and 15% [23]. In several cases of syndromic ASD, a single DNM appears to be sufficient to cause the onset of ASD symptoms [25], suggesting that this DNM disrupts key loss-of-function intolerant genes. Despite a considerable genetic heterogeneity underlying ASD, there is compelling evidence that a large number of risk genes can be integrated into a much smaller number of protein-protein interaction (PPI) networks [26]. Previous studies have shown that ASD genes functionally converge in synapse development, axon alignment, neuron motility, synaptic transmission, chromatin remodeling, transcription and translation regulation, ion transport, and cell adhesion [27-32]. As far as we know, these studies were mainly focused on investigating genes affected in children with ASD, but not in relatives with subclinical phenotypes of BAP. The foregoing suggests that inclusion of ASD-related genes from first-degree relatives with BAP in the PPI network may help to better understand the development of autistic traits in the family. Will the main pathways of development of autistic traits change from those shown so far in this case? If ASD is simply the extreme end of the distribution of autistic traits that make up BAP, there will not be a large shift in the main trajectory. However, will other less studied convergent signaling mechanisms or protein interactions contributing to ASD pathology be identified? The previously discovered BAP genes [33-37] lead to the assumption that BAP gene loci generally correlate with ASD loci. However, several loci were found to be significant only for BAP [13], suggesting that the absence of the BAP putative risk gene in the PPI networks may be a missing link to understanding the initial biological mechanisms of ASD. Therefore, here, we focused on a set of four extended pedigrees with ASD and BAP. The aim of the study was to identify putative candidate genes and to investigate functional relationships between these genes using PPI network analysis. This is the first genetic study of Kazakhstani families with ASD.

2. Materials and Methods

2.1. Sampling

Families for this study were selected using a database of 400 Kazakhstani families with ASD children. The database was created within the framework of the previously implemented project 0118РК00503 in 2018-2021. We applied the following inclusion criteria for families: two or more children with ASD AND BAP among first-degree relatives AND Kazakhstani ancestry. Exclusion criteria were a simplex family OR/AND fragile X syndrome. A total of 13 families (3%, 95% CI: 1.7-5.5%) met the inclusion and exclusion criteria. Three families were out of the country at the time of the study, two families were single parents, and four families declined to participate in the study for one reason or another. Thus, four families took part in the study. Samples of saliva were collected from all children with ASD as well as from their parents and neurotypical siblings using a collection kit (Zeesan) provided by TellmeGen. Collection was conducted after obtaining informed consent from at least one of the parents. The study was approved by the Ethics Committee of the Institute of Human and Animal Physiology, Almaty, Kazakhstan. The children recruited in this study were diagnosed with ASD by psychiatrist. The Child Autism Rating Scale (CARS) was used to assess the severity of ASD [38]. The Broad Autism Phenotype Questionnaire (BAPQ) was used to assess BAP traits [39, 40].

2.2. Data Generation

DNA isolation from the collected biomaterial and data generation were performed using the Infinium Global Screening Array (GSA) v3.0 run on the Illumina iScan Platform at TellmeGen CA (Valencia, Spain). A total of 650,000 genetic markers were analyzed using 10,000 probes (99.99% reliability). A triplicate analysis was performed.

2.3. Data Analysis

Family trees were generated using the GenoPro2020 software (https://genopro.com/2020/). TellmeGen CA applied standardized quality control measures to filter out low-quality data (a call rate lower than 0.99) from the SNP list and compiled all obtained results into csv files, which were sent to our laboratory for further analysis. Genetic variants were aligned to the GRCh37 human reference genome and annotated in accordance with the nomenclature of the HGVS (Human Genome Variation Society) [41]. Gene-based annotation was performed using the RStudio software (http://rstudio.com/products/rstudio/) with gene definitions from the database of dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/), ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/), SFARI (Simons Foundation Autism Research Initiative, http://gene.sfari.org/), and GWAS catalog (Genome-Wide Association Studies, https://http://www.ebi.ac.uk/gwas/home). The MAF (minor allele frequency) was estimated using the databases ALFA (allele frequency aggregator, http://nih.gov), 1000G (1000 Genomes, http://www.1000genomes.org/), gnomAD (Genome Aggregation, https://gnomad.broadinstitute.org/), andTOPMed (trans-Omics for Precision Medicine, https://www.nhlbi.nih.gov/science/trans-omics-precision-medicine-topmed-program). The GSA includes ∼640,000 single nucleotide polymorphisms (SNPs) and ∼10,000 indels (insertion/deletion). SNPs that are missing from a fraction of individuals in the cohort were filtered out. SNPs with a MAF > 1% associated with ASD according to the GWAS catalog (p < 0.00001) were included in the list of common variants. SNPs with a MAF ≤ 0.01% associated with ASD according to the ClinVar database and inherited by a child with ASD from a parent with BAP were included in the list of rare inherited variants. DNMs were identified according to the scenario: both parents carry a homozygous reference allele and the child is heterozygous, i.e., carries one copy each of the alleles REF and ALT. The variants were classified as pathogenic, probably pathogenic, of unclear significance (VUS), benign, or probably benign according to the ACMG (American College of Medical Genetics and Genomics) guidelines [12]. Pathogenic mutations included stop codon variants (frameshift and nonsense mutations), variants with uncorrected splicing, and variants with previously established pathogenic effects according to ClinVar database. In silico tools such as SIFT (Sorting Intolerant From Tolerant, http://sift-dna.org) and Polymorphism Phenotyping-2 (PolyPhen-2, http://genetics.bwh.harvard.edu/pph2/) were used to predict deleterious effects of missense variants on protein structure and function. We filtered out variants that were most likely nonpathogenic (benign and likely benign) or with MAF < 1% in order to identify clinically relevant rare DNMs.

2.4. Data Visualization and Functional Interpretation

To characterize the relationships between the ASD/BAP candidate genes in each family, we projected them into the PPI network. The InnateDB (Knowledge Resource for Innate Immunity Interactions and Pathways, https://www.innatedb.com/) was used to retrieve predicted interactions for the identified candidate genes [42, 43]. The OmicsNet 2.0 software (https://www.omicsnet.ca) was used to construct the PPI network. This is a novel web-based tool for creation and visualization of complex biological networks. The software supports ten molecular interaction databases for protein-protein, miRNA-target, TF-target, and enzyme-metabolite interactions and provides multiple methods for network customization using a powerful WebGL technology to enable native 3D display of complex biological networks in modern web browsers [44]. The WalkTrap algorithm in OmicsNet 2.0 was applied to further partition of the PPI into modules. The algorithm assumes that a random walker tends to be trapped in dense parts of a network corresponding to modules. Functional annotation and enrichment analysis of genes were performed according to the GO (Gene Ontology, http://geneontology.org/) [45], KEGG (Kyoto Encyclopedia of Genes and Genomes, https://www.genome.jp/kegg), and REACTOME (http://www.reactome.org) databases using the g:Profiler (https://biit.cs.ut.ee/gprofiler/). This is an open web server for characterizing and manipulating gene lists. It is updated every three months following the quarterly releases of the Ensembl databases [46]. The g:Profiler Bonferroni correction was used, and only pathways with an adjusted p value (padj) < 0.05 were considered significantly enriched.

3. Results

3.1. Characteristics of Subjects

The study included four multiplex families (Figure 1). The mean age (± standard deviation) of the ASD children was 9.1 ± 4.2 years. The ratio of male to female children with ASD was 7 : 1. The mean ages of parents and neurotypical siblings were 39.3 ± 3.5 and 14.5 ± 7.8 years, respectively.
Figure 1

Family trees. Filled symbols indicate children with ASD, and half-filled symbols indicate BAP carriers. Information on year of birth, nationality, and CARS or BAPQ score is presented for each individual.

Family 1 has two boys with moderate ASD and one neurotypical girl. Family 2 has two sons with severe and moderate ASD and one neurotypical daughter. Family 3 has two sons with moderate autism. In Family 4, the mother has two children from different marriages. The son has severe ASD, and the daughter has moderate autism. The BAPQ data indicated that the fathers from Families 1, 2, and 3 and the mothers from Families 2 and 4 have autistic traits with high scores across the domains of ASD. The fathers from Families 1 and 3 and the mother from Family 2 have high aloofness subscale scores, while the father from Family 2 has pragmatic language deficits. The mother from Family 4 has either BAP or ASD and shows rigid personality and pragmatic language deficits. All family members are Kazakh except the father and his daughter from Family 4. They are Russian.

3.2. Identification and Annotation of ASD and BAP Associated Variants/Genes

A total of 650,000 genetic markers were analyzed to generate data on DNMs, common, and rare variants that may contribute to autistic traits. A total of 72 common variants associated with ASD were identified, including three regulatory region variants (4.2%), three prime UTR variants (4.2%), 15 intergenic variants (20.8%), 48 intron variants (66.6%), one missense variant (1.4%), one noncoding transcript exon variant (1.4%), and one VUS variant (1.4%). A total of 29 (40%) of 72 common variants overlapped in four pedigrees (Table 1). Further analysis demonstrated 50 rare inherited variants, including 40 missense (80%), three splice donors (6%), five synonymous (10%), and two stop-gain variants (4%). Two rare variants (4%) occurred in all four pedigrees (Table 2).
Table 1

Characteristics of dbSNPs, associated with ASD according to the GWAS database.

No.RegionMapped_geneSNPContextRisk allele frequencyFamily no
1TRIM33rs6537825missense_variant A = 0.085217 (31257/366794, ALFA)A = 0.092924 (24596/264690, TOPMED)A = 0.115222 (28970/251428, GnomAD_exome)2; 3; 4
2AMPD1rs926938intergenic_variant A = 0.474113 (125493/264690, TOPMED)A = 0.499930 (114386/228804, ALFA)A = 0.480234 (67204/139940, GnomAD)1; 2; 3; 4
312q23.1Y_RNA-NArs6538761intron_variant A = 0.349973 (99039/282990, ALFA)A = 0.339321 (89815/264690, TOPMED)A = 0.332810 (46600/140020, GnomAD)1; 2; 3; 4
4rs4307059intron_variant C = 0.354951/79164 (ALFA)1; 2; 3; 4
58q24.13SMILRrs12543592intron_variant G = 0.450958 (67558/149810, ALFA)1; 2; 3; 4
617p13.3SGSM2rs2447097intron_variant T = 0.421255 (111502/264690, TOPMED)T = 0.457423 (72131/157690, ALFA)T = 0.422399 (59146/140024, GnomAD)1; 2; 3; 4
71p21.1NA-CDK4P1rs11184553intergenic_variant A = 0.487925 (129149/264690, TOPMED)A = 0.43423 (13263/30544, ALFA)2; 3; 4
85q21.2LOC105379109rs325485intron_variant A = 0.336235 (88998/264690, TOPMED)A = 0.383385 (96695/252214, ALFA)A = 0.343482 (48025/139818, GnomAD)1
96p22.1GPR89P-RSL24D1P1rs17693963intergenic_variant C = 0.07605 (6941/91270, ALFA)4
1010q25.1SORCS3rs1021363intron_variant A = 0.253202 (67020/264690, TOPMED)A = 0.261206 (36609/140154, GnomAD)A = 0.30279 (6616/21850, ALFA)1; 2; 3; 4
1114q32.33RNU7-160P, BAG5rs10149470regulatory_region_variant A = 0.453024 (119911/264690, TOPMED)A = 0.450706 (63125/140058, GnomAD)A = 0.47706 (22524/47214, ALFA)1; 2; 3; 4
121p31.1NEGR1rs1620977intron_variant A = 0.201863 (53431/264690, TOPMED)A = 0.267664 (59447/222096, ALFA)A = 0.204083 (28452/139414, GnomAD)1; 3; 4
133p14.2CADPSrs1452075intron_variant C = 0.275542 (85602/310668, ALFA)C = 0.263913 (69855/264690, TOPMED)C = 0.264183 (37011/140096, GnomAD)1; 2; 3
145q33.2GALNT10rs34509057intron_variant A = 0.165276 (43747/264690, TOPMED)A = 0.178673 (25035/140116, GnomAD)A = 0.20461 (5869/28684, ALFA)1
1511q24.2NRGNrs55661361intron_variant A = 0.434229 (114936/264690, TOPMED)A = 0.436872 (61128/139922, GnomAD)A = 0.29541 (7146/24190, ALFA)1; 2; 3; 4
162p16.1ACTG1P22rs11682175intron_variant C = 0.378579 (100206/264690, TOPMED)C = 0.381349 (53401/140032, GnomAD)C = 0.44219 (37947/85816, ALFA)1; 2; 3
17Xp22.12LOC105373146rs1378559intron_variant C = 0.102214 (27055/264690, TOPMED)C = 0.103850 (10650/102552, GnomAD)C = 0.13327 (4653/34914, ALFA)2; 4
187q21.11PCLOrs2522831intron_variant C = 0.420114 (111200/264690, TOPMED)C = 0.451828 (70140/155236, ALFA)C = 0.412847 (57811/140030, GnomAD)1; 2; 3; 4
1918q21.2DCCrs10164055intron_variant C = 0.292176 (77336/264690, TOPMED)C = 0.316869 (44375/140042, GnomAD)1
2018q21.2TCF4rs12967143intron_variant G = 0.475005 (125729/264690, TOPMED)G = 0.453865 (63532/139980, GnomAD)1; 2; 3; 4
2120q13.12SLC12A5rs12624433intron_variant A = 0.196184 (51928/264690, TOPMED)A = 0.202170 (28328/140120, GnomAD)A = 0.18751 (6001/32004, ALFA)1; 2; 4
2218q21.2NA-LINC01929rs11874716intergenic_variant G = 0.397461 (105204/264690, TOPMED)G = 0.33788 (10187/30150, ALFA)2; 4
23LINC02551rs10791097intron_variant T = 0.396830 (105037/264690, TOPMED)T = 0.404173 (56615/140076, GnomAD)T = 0.42591 (10204/23958, ALFA)1; 2; 3; 4
243p14.2FHITrs1353545intron_variant C = 0.377823 (100006/264690, TOPMED)C = 0.359634 (50292/139842, GnomAD)C = 0.35375 (7099/20068, ALFA)1; 2; 3
253p22.2HSPD1P6-LINC02033rs75968099intergenic_variant T = 0.250519 (66310/264690, TOPMED)T = 0.259647 (36321/139886, GnomAD)T = 0.31905 (11244/35242, ALFA)1; 3
2611q24.1GRAMD1Brs77502336intron_variant C = 0.271272 (71803/264690, TOPMED)C = 0.270710 (37940/140150, GnomAD)C = 0.28936 (5315/18368, ALFA)2; 3; 4
275q31.2ETF1rs3849046intron_variant C = 0.492134 (130263/264690, TOPMED)C = 0.49090 (37400/76186, ALFA)1; 2; 3; 4
2814q24.2RGS6rs2332700intron_variant C = 0.237391 (62835/264690, TOPMED)C = 0.240436 (33662/140004, GnomAD)C = 0.25146 (4750/18890, ALFA)1; 2; 3; 4
2912p13.33CACNA1Crs1024582intron_variant A = 0.235419 (62313/264690, TOPMED)A = 0.300859 (31805/105714, ALFA)1; 2; 4
304q33CLCN3rs10520163intron_variant C = 0.499018 (175400/351490, ALFA)C = 0.442945 (117243/264690, TOPMED)C = 0.440639 (61678/139974, GnomAD)1; 2; 3; 4
317q33DGKIrs37350253_prime_UTR_variant C = 0.288232 (76292/264690, TOPMED)C = 0.286664 (40102/139892, GnomAD)C = 0.33618 (19781/58840, ALFA)1; 2; 3; 4
325q12.1SMIM15-AS1rs171748intron_variant A = 0.389289 (103041/264690, TOPMED)A = 0.395943 (55421/139972, GnomAD)1; 2; 3; 4
3317p13.3SRRrs4523957intron_variant G = 0.460962 (122012/264690, TOPMED)G = 0.456330 (63899/140028, GnomAD)G = 0.40695 (13754/33798, ALFA)3; 4
346p22.2BTN2A1rs1977199intron_variant G = 0.335797 (88882/264690, TOPMED)G = 0.332093 (46319/139476, GnomAD)1; 2; 3; 4
35TSNARE1rs4129585intron_variant A = 0.312399 (82689/264690, TOPMED)A = 0.415568 (85636/206070, ALFA)A = 0.327265 (45819/140006, GnomAD)1; 2; 3; 4
3614q32.33COA8rs12887734intron_variant T = 0.248290 (65720/264690, TOPMED)T = 0.258449 (35666/138000, GnomAD)T = 0.27298 (5156/18888, ALFA)1; 2; 3; 4
375q33.2GRIA1rs12522290regulatory_region_variant G = 0.115290 (30516/264690, TOPMED)G = 0.121303 (17003/140170, GnomAD)G = 0.14219 (2686/18890, ALFA)4
386p21.1ZNF318rs73416724non_coding_transcript_exon_variant G = 0.108221 (28645/264690, TOPMED)G = 0.106380 (14899/140054, GnomAD)G = 0.11175 (2111/18890, ALFA)1
39CACNB2rs7893279intron_variant G = 0.102354 (27092/264690, TOPMED)G = 0.102536 (14367/140116, GnomAD)G = 0.11104 (8492/76480, ALFA)1; 3; 4
406p21.33MSH5, MSH5-SAPCD1rs707939intron_variant A = 0.359700 (113481/315488, ALFA)A = 0.258582 (68444/264690, TOPMED)A = 0.343005 (85522/249332, GnomAD_exome)1; 2; 3; 4
418q24.3TSNARE1rs67756423intron_variant C = 0.215191 (56959/264690, TOPMED)C = 0.207610 (29092/140128, GnomAD)2; 4
42CACNA1Irs5995756intron_variant T = 0.401938 (106389/264690, TOPMED)T = 0.443669 (63426/142958, ALFA)T = 0.383588 (53667/139908, GnomAD)1; 2; 3; 4
436p22.1MIR3143-RPL10P2rs911186intergenic_variant G = 0.254724 (67423/264690, TOPMED)G = 0.224222 (37589/167642, ALFA)G = 0.247801 (34709/140068, GnomAD)3; 4
4415q25.1CHRNA3rs8042374intron_variant G = 0.316495 (83773/264690, TOPMED)G = 0.250356 (58410/233308, ALFA)G = 0.286205 (39794/139040, GnomAD)1; 2; 3; 4
453p26.3CNTN4rs17194490intron_variant T = 0.14142 (11348/80244, ALFA)3; 4
466p22.2BTN3A2rs132185913_prime_UTR_variant C = 0.302939 (80185/264690, TOPMED)C = 0.299397 (41927/140038, GnomAD)C = 0.30182 (24657/81694, ALFA)3; 4
476p21.33MUCL3, SFTA2rs3132581intron_variant A = 0.089059 (23573/264690, TOPMED)A = 0.124702 (20499/164384, ALFA)A = 0.094994 (13286/139862, GnomAD)4
482q37.1SNORCrs2675968intron_variant T = 0.301515 (79808/264690, TOPMED)T = 0.301837 (42253/139986, GnomAD)T = 0.30176 (9576/31734, ALFA)1; 2
4910q22.3ZMIZ1rs703970intron_variant A = 0.380653 (100755/264690, TOPMED)A = 0.408319 (94164/230614, ALFA)A = 0.379565 (53168/140076, GnomAD2; 3; 4
502q32.1ZNF804Ars7597593intron_variant T = 0.475753 (125927/264690, TOPMED)T = 0.403458 (85471/211846, ALFA)T = 0.470696 (65791/139774, GnomAD)3; 4
517q33NA-ZP3P2rs10250997intergenic_variant A = 0.144509 (38250/264690, TOPMED)A = 0.114385 (23197/202798, ALFA)A = 0.147202 (20600/139944, GnomAD)2; 4
523p22.2HSPD1P6, LINC02033rs9834970Regulatory region variant C = 0.455287 (120510/264690, TOPMED)C = 0.498254 (88177/176972, ALFA)C = 0.443239 (62057/140008, GnomAD)1; 2; 3
532p16.3FOXN2-PPP1R21-DTrs7565792intergenic_variant T = 0.433488 (114740/264690, TOPMED)T = 0.403518 (63605/157626, ALFA)1; 2; 4
5410p14LOC105376387rs6602217intron_variant C = 0.084034 (24182/287766, ALFA)C = 0.158298 (41900/264690, TOPMED)C = 0.156390 (21899/140028, GnomAD)3; 4
5516p13.2LOC101927026rs12325410intron_variant G = 0.156282 (45041/288204, ALFA)G = 0.172092 (45551/264690, TOPMED)G = 0.163921 (22966/140104, GnomAD)1; 2; 4
5616p13.2GRIN2Ars8058295intron_variant A = 0.320919 (84944/264690, TOPMED)A = 0.325310 (45557/140042, GnomAD)A = 0.28528 (24029/84230, ALFA)3; 4
57rs6694545 A = 0.268749 (73956/275186, ALFA)A = 0.433889 (114846/264690, TOPMED)A = 0.417634 (58432/139912, GnomAD)1; 3; 4
588p12RPL10AP3-NArs2609653intergenic_variant C = 0.054090 (14317/264690, TOPMED)C = 0.051313 (12505/243702, ALFA)C = 0.059383 (8325/140192, GnomAD)2; 4
5918q21.2RNA5SP459-TCF4rs9951150intergenic_variant A = 0.482444 (127698/264690, TOPMED)G = 0.456371 (105084/230260, ALFA)A = 0.486253 (68018/139882, GnomAD)1; 2; 3
601p31.1NA-ADGRL4rs10873998intergenic_variant T = 0.466702 (112367/240768, ALFA)1; 2; 3; 4
6115q25.3NTRK3rs1104918intron_variant G = 0.239850 (63486/264690, TOPMED)G = 0.19576 (6837/34926, ALFA)1; 4
622q11.2IL1R2, IL1R1rs2310173intron_variant T = 0.455849 (131145/287694, ALFA)T = 0.358208 (94814/264690, TOPMED)2; 3; 4
6313q33.3rs12871532intergenic_variant C = 0.460323 (121843/264690, TOPMED)C = 0.454100 (63544/139934, GnomAD)1; 2; 3; 4
645q32PPP2R2Brs609412intron_variant G = 0.260245 (91536/351730, ALFA)G = 0.376361 (99619/264690, TOPMED)G = 0.355398 (49770/140040, GnomAD)1; 2; 3; 4
654q31.3RNA5SP169rs360932intron_variant G = 0.443636 (117426/264690, TOPMED)G = 0.437177 (61189/139964, GnomAD)G = 0.38767 (20945/54028, ALFA)1; 2; 3; 4
662q31.3rs13418455intergenic_variant T = 0.342787 (93958/274100, ALFA)T = 0.283577 (75060/264690, TOPMED)T = 0.293173 (41009/139880, GnomAD)2; 3; 4
672q37.3HDAC4rs3791556intron_variant A = 0.163726 (49335/301326, ALFA)A = 0.171291 (45339/264690, TOPMED)A = 0.162051 (22715/140172, GnomAD)1; 2; 3; 4
681p31.1rs4650608intergenic_variant C = 0.280812 (74328/264690, TOPMED)C = 0.312248 (56703/181596, ALFA)C = 0.293815 (41147/140044, GnomAD)4
697p22.3MRM2rs77990063_prime_UTR_variant T = 0.353602 (101853/288044, ALFA)T = 0.344482 (91181/264690, TOPMED)T = 0.333479 (46647/139880, GnomAD)1; 2; 3
708p12PPP1R3B, RPL10AP3rs6990255intron_variant T = 0.048947 (15545/317586, ALFA)T = 0.100899 (26707/264690, TOPMED)T = 0.103487 (14494/140056, GnomAD)2; 4
719p24.3SMARCA2-RN7SL592Prs4741652intergenic_variant C = 0.218916 (57945/264690, TOPMED)C = 0.266060 (50818/191002, ALFA)1; 2; 3; 4
7210q21.2ANK3rs10994359intron_variant C = 0.086131 (22798/264690, TOPMED)C = 0.077921 (10922/140168, GnomAD)1; 2; 3; 4
Table 2

Characteristics of rare inherited variants.

No.NameSIFT/PolyPhenClinical significance (last reviewed)dbSNP IDGMAFTypeFamily No.
1NM_000055.2 (BCHE):c.293A>G (p.Asp98Gly)Deleterious (0.01)/possibly damaging (0.511)Pathogenic/likely pathogenic (last reviewed: May 29, 2020)rs17998070.00599 (C)Missense variant1
2NM_019000.4 (RETREG1):c.503C>G (p.Ser168Ter)-/-Pathogenic (last reviewed: Jun 3, 2020)rs1378527390.00020 (A)Stop gained1; 2; 3; 4
3NM_004646.3 (NPHS1):c.2869G>A (p.Val957Met)Deleterious (0.02)/probably damaging (0.995)Likely pathogenic (last reviewed: May 25, 2018)rs1148491390.00280 (G)Missense variant2
4NM_017882.3 (CLN6):c.307C>T (p.Arg103Trp)Deleterious (0)/probably damaging (0.999)Conflicting interpretations of pathogenicity (last reviewed: Sep 21, 2020)rs2010954120.00020 (A)Missense variant4
5NM_000256.3 (MYBPC3):c.649A>G (p.Ser217Gly)Deleterious (0.01)/possibly damaging (0.541)Conflicting interpretations of pathogenicity (last reviewed: Sep 20, 2021)rs1387538700.00180 (C)Missense variant4
6NM_000369.5 (TSHR):c.1349G>A (p.Arg450His)Deleterious (0)/probably damaging (0.999)Conflicting interpretations of pathogenicity (last reviewed: Sep 1, 2021)rs1892618580.00020 (A)Missense variant4
7NM_003122.4 (SPINK1):c.194+2T>C-/-Conflicting interpretations of pathogenicity (last reviewed: Oct 7, 2020)rs1489543870.00080 (G)Splice donor variant3
8NM_000036.3 (AMPD1):c.860A>T (p.Lys287Ile)Deleterious (0)/probably damaging (1)Conflicting interpretations of pathogenicity (last reviewed: Oct 31, 2018)rs345261990.01098 (A)Missense variant1; 4
9NM_020631.5 (PLEKHG5):c.2485G>T (p.Asp829Tyr)Deleterious (0)/possibly damaging (0.876)Conflicting interpretations of pathogenicity (last reviewed: Oct 29, 2020)rs2001625210.00020 (A)Missense variant1
10NM_000540.3 (RYR1):c.9713A>G (p.Glu3238Gly)Deleterious (0.01)/possibly damaging (0.503)Conflicting interpretations of pathogenicity (last reviewed: Nov 3, 2020)rs2009506730.00020 (G)Missense variant4
11NM_001849.4 (COL6A2):c.2795C>TTolerated (0.23)/benign (0.041)Conflicting interpretations of pathogenicity (last reviewed: Nov 27, 2020)rs1177258250.00160 (T)Missense variant4
12NM_000335.5 (SCN5A):c.4821C>T (p.Leu1607=)-/-Conflicting interpretations of pathogenicity (last reviewed: Nov 25, 2020)rs454370990.00100 (A)Synonymous variant3; 4
13NM_001267550.2(TTN):c.102595A>G (p.Ile34199Val)-/benign (0.151)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs563472480.00599 (C)Missense variant3
14NM_001267550.2(TTN):c.101891G>A (p.Arg33964His)-/benign (0.018)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs556695530.00599 (T)Missense variant3
15NM_001267550.2(TTN):c.95297C>T (p.Ser31766Phe)-/probably damaging (0.998)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs1914848940.00599 (A)Missense variant3
16NM_001267550.2(TTN):c.90536G>A (p.Arg30179His)-/probably damaging (0.998)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs1495673780.00619 (T)Missense variant3
17NM_001267550.2(TTN):c.82560C>A (p.Asn27520Lys)-/probably damaging (0.997)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs562648400.00579 (T)Missense variant3
18NM_001267550.2(TTN):c.49919G>C (p.Ser16640Thr)-/probably damaging (0.991)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs556630500.00599 (G)Missense variant3
19NM_001267550.2(TTN):c.47545C>A (p.Pro15849Thr)-/benign (0.15)Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs1461814770.00599 (T)Missense variant3
20NM_001267550.2(TTN):c.1137A>G (p.Arg379=)-/-Conflicting interpretations of pathogenicity (last reviewed: Nov 21, 2020)rs559725470.00599 (C)Synonymous variant3
21NM_172364.5(CACNA2D4):c.2120G>A (p.Arg707His)Tolerated (0.4)/possibly damaging (0.795)Conflicting interpretations of pathogenicity (last reviewed: Nov 18, 2020)rs760649260.00719 (T)Missense variant3
22NM_018100.4(EFHC1):c.685T>C (p.Phe229Leu)Tolerated (0.12)/benign (0.039)Conflicting interpretations of pathogenicity (last reviewed: May 5, 2021)rs1378527760.00180 (C)Missense variant3
23NM_003126.4(SPTA1):c.6421C>T (p.Arg2141Trp)Deleterious (0.01)/probably damaging (1)Conflicting interpretations of pathogenicity (last reviewed: May 30, 2019)rs412735190.00160 (A)Missense variant4
24NM_001271208.2(NEB):c.571G>C (p.Glu191Gln)Deleterious (0.09)/probably damaging (0.999)Conflicting interpretations of pathogenicity (last reviewed: May 18, 2021)rs356869680.00719 (G)Missense variant3
25NM_000784.4(CYP27A1):c.1151C>T (p.Pro384Leu)Deleterious (0)/probably damaging (0.991)Conflicting interpretations of pathogenicity (last reviewed: May 18, 2021)rs412726870.00859 (T)Missense variant3
26NM_000237.3(LPL):c.953A>G (p.Asn318Ser)Tolerated (0.24)/benign (0.137)Conflicting interpretations of pathogenicity (last reviewed: May 18, 2021)rs2680.00519 (G)Missense variant1
27NM_000243.2(MEFV):c.1772T>C (p.Ile591Thr)Tolerated (0.55)/benign (0)Conflicting interpretations of pathogenicity (last reviewed: May 18, 2021)rs114660450.00439 (G)Missense variant1
28NM_001370466.1(NOD2):c.332G>A (p.Arg111Gln)Deleterious (0)/benign (0.116)Conflicting interpretations of pathogenicity (last reviewed: May 14, 2018)rs1048954560.00020 (A)Missense variant4
29NM_014588.5(VSX1):c.479G>A (p.Gly160Asp)Tolerated (0.3)/benign (0.156)Conflicting interpretations of pathogenicity (last reviewed: Mar 6, 2018)rs743154330.00260 (A)Missense variant4
30NM_020812.4(DOCK6):c.4862T>C (p.Val1621Ala)Deleterious (0)/benign (0.161)Conflicting interpretations of pathogenicity (last reviewed: Mar 19, 2021)rs2017388180.00120 (G)Missense variant2
31NM_000136.3(FANCC):c.77C>T (p.Ser26Phe)Deleterious (0)/probably damaging (0.948)Conflicting interpretations of pathogenicity (last reviewed: Jun 1, 2021)rs18003610.00260 (A)Missense variant2
32NM_006432.4(NPC2):c.441+1G>A-/-Conflicting interpretations of pathogenicity (last reviewed: Jun 1, 2021)rs1401300280.00100 (T)Splice donor variant3; 4
33NM_000492.4(CFTR):c.1584G>A (p.Glu528=)-/-Conflicting interpretations of pathogenicity (last reviewed: Jul 22, 2021)rs18000950.01058 (A)Synonymous variant4
34NM_001267550.2(TTN):c.14698G>A (p.Ala4900Thr)-/probably damaging (0.926)Conflicting interpretations of pathogenicity (last reviewed: Jul 1, 2021)rs726489230.00180 (T)Missense variant2
35NM_001079802.2(FKTN):c.1297A>G (p.Thr433Ala)Tolerated (0.52)/benign (0.005)Conflicting interpretations of pathogenicity (last reviewed: Jul 1, 2021)rs1419184320.00100 (G)Missense variant4
36NM_022168.4(IFIH1):c.1879G>T (p.Glu627Ter)-/-Conflicting interpretations of pathogenicity (last reviewed: Jan 1, 2021)rs357446050.00140 (A)Stop gained1
37NM_001267550.2(TTN):c.48727C>T (p.Pro16243Ser)-/benign (0.068)Conflicting interpretations of pathogenicity (last reviewed: Feb 4, 2021)rs726772420.00160 (A)Missense variant2
38NM_000548.5(TSC2):c.1939G>A (p.Asp647Asn)Deleterious (0.01)/probably damaging (0.995)Conflicting interpretations of pathogenicity (last reviewed: Dec 7, 2020)rs455093920.00040 (A)Missense variant2
39NM_000371.4(TTR):c.417G>A (p.Thr139=)-/-Conflicting interpretations of pathogenicity (last reviewed: Dec 7, 2020)rs22763820.00359 (A)Synonymous variant1
40NM_001267550.2(TTN):c.65775C>T (p.Ser21925=)-/-Conflicting interpretations of pathogenicity (last reviewed: Dec 6, 2020)rs726468670.00160 (A)Synonymous variant2
41NM_000527.5(LDLR):c.148G>T (p.Ala50Ser)Tolerated (0.33)/benign (0.169)Conflicting interpretations of pathogenicity (last reviewed: Dec 6, 2020)rs1378539600.00060 (A)Missense variant1
42NM_012472.6(DNAAF11):c.1391C>T (p.Pro464Leu)Deleterious (0)/probably damaging [1]Conflicting interpretations of pathogenicity (last reviewed: Dec 31, 2019)rs1391314850.00080 (A)Missense variant1
43NM_005634.2(SOX3):c.157G>C (p.Val53Leu)Tolerated (0.11)/benign (0.001)Conflicting interpretations of pathogenicity (last reviewed: Dec 31, 2019)rs2003611280.00265 (G)Missense variant4
44NM_000065.4(C6):c.2381+2T>C-/-Conflicting interpretations of pathogenicity (last reviewed: Dec 22, 2017)rs762029090.00120 (G)Splice donor variant1
45NM_001037.5(SCN1B):c.28G>A (p.Gly10Ser)Tolerated (0.4)/possibly damaging (0.862)Conflicting interpretations of pathogenicity (last reviewed: Dec 2, 2020)rs725520270.00319 (A)Missense variant2
46NM_006214.4(PHYH):c.734G>A (p.Arg245Gln)Deleterious (0.01)/benign (0.393)Conflicting interpretations of pathogenicity (last reviewed: Aug 6, 2020)rs626199190.00539 (T)Missense variant4
47NM_201596.3(CACNB2):c.1816C>T (p.Arg606Trp)Deleterious (0.03)/probably damaging (0.928)Conflicting interpretations of pathogenicity (last reviewed: Aug 6, 2020)rs617339680.00439 (G)Missense variant4
48NM_000275.3(OCA2):c.1441G>A (p.Ala481Thr)Tolerated (0.09)/possibly damaging (0.621)Conflicting interpretations of pathogenicity (last reviewed: Apr 20, 2021)rs746533300.00799 (T)Missense variant1;3
49NM_025216.3(WNT10A):c.511C>T (p.Arg171Cys)Deleterious (0)/probably damaging (0.93)Conflicting interpretations of pathogenicity (last reviewed: Apr 13, 2021)rs1169985550.00319 (T)Missense variant1; 2;3; 4
50NM_001110792.2(MECP2):c.638C>T (p.Ala213Val)Tolerated (0.07)/benign (0)Benign/likely benign (last reviewed: Dec 31, 2019)rs617483810.00477 (A)Missense variant4
DNMs were found only in children with ASD but not in neurotypical siblings. In total, 12 heterozygous DNMs were identified in three families, including nine missense variants, two nonsense mutations, and one splice variant (Table 3). No DNMs were detected in Family 3. We found no identical mutations in ASD siblings.
Table 3

DNMs in children with ASD.

ChildFamily No.NameClinical significance (last reviewed) ClinVarTypedbSNP IDGMAF
AU 531NM_000162.5(GCK):c.1018A>G (p.Ser340Gly)Likely pathogenic (last reviewed: Aug 18, 2011)Missense variantrs193922255NA
AU 541NM_138413.4(HOGA1):c.769T>G (p.Cys257Gly)Pathogenic/likely pathogenic (last reviewed: Sep 21, 2020)Missense variantrs2676067641000 genomes project 0.00020
AU 2092NM_000090.3(COL3A1):c.637G>A (p.Gly213Ser)PathogenicMissense variantrs587779557NA
AU 2092NM_003060.4(SLC22A5):c.287G>C (p.Gly96Ala)Conflicting interpretations of pathogenicity (last reviewed: Dec 6, 2020)Missense variantrs377767450gnomAD 0.00013
AU 2092NM_172107.4(KCNQ2):c.2245G>T (p.Glu749Ter)Pathogenic (last reviewed: Jan 21, 2020)Stop gainedrs796052658ALFA (0/10680) 0.00000
AU 2092NM_000071.3(CBS):c.457G>A (p.Gly153Arg)Pathogenic/likely pathogenic (last reviewed: Jul 24, 2019)Missense variantrs745704046gnomAD, exomes 0.00004
AU 2102NM_022455.4(NSD1):c.5146+1G>APathogenic/likely pathogenic (last reviewed: Jul 30, 2019)Splice donor variantrs587784139NA
AU 2102NM_006031.6(PCNT):c.5767C>T (p.Arg1923Ter)Pathogenic/likely pathogenic (last reviewed: Feb 12, 2020)Stop gainedrs119479062gnomAD, exomes 0.00001
AU 2164NM_000090.3(COL3A1):c.2022G>T (p.Lys674Asn)PathogenicMissense variantrs587779643NA
AU 2164NM_172056.2(KCNH2):c.2390C>A (p.Ala797Asp)Likely pathogenic (last reviewed: Jun 12, 2013)Missense variantrs794728389NA
AU 2164NM_152722.5(HEPACAM):c.740C>T (p.Thr247Ile)Uncertain significance (last reviewed: Jan 13, 2018)Missense variantrs1456197841000 genomes project 0.00020
AU 2174NM_025216.3(WNT10A):c.1087A>C (p.Asn363His)Conflicting interpretations of pathogenicity (last reviewed: Dec 3, 2020)Missense variantrs34972707NHLBI ESP exome variant 0.00028

3.3. PPI Network and Functional Enrichment Analysis

We prioritized candidate genes 57, 60, 58, and 73 in Families 1, 2, 3, and 4, respectively. The PPI networks for these genes were constructed for each pedigree. As a result, four networks with the following properties were obtained: 614 nodes, 672 edges, and 35 seeds for Family 1, 746 nodes, 870 edges, and 36 seeds for Family 2, 669 nodes, 743 edges, and 39 seeds for Family 3, and 923 nodes, 1092 edges, and 50 seeds for Family 4. After partitioning into modules, these networks were divided into 14 significant modules for Families 1 and 2, 11 modules for Family 3, and 20 modules for Family 4 (Figure 2). The number of connections of a node or the degree of centrality (DC) showed that ten genes, namely HDAC4, CFTR, MECP2, NOD2, PPP2R2B, TCF4, TRIM33, TSC2, TTN, and TTR, play a nodal role in the generated networks and form the largest modules (Table 4). The highest-ranking node in all networks was HDAC4 (DC = 202), except in Family 4, where CFTR played a greater role (DC = 222). In Families 2 and 3, another high-ranking node was TTN (DC = 104). TCF4, PPP2R2B, and HDAC4 were common hub genes for all four networks.
Figure 2

PPI networks for four families visualized by OmicsNet 2.0. The size of the modules is proportional to their degree.

Table 4

The list of hub genes with the degree values (degree ≥ 50).

Family No.IDGeneDC
19759HDAC4202
6925TCF454
5521PPP2R2B51
7276TTR50

29759HDAC4202
7273TTN104
51592TRIM3373
7249TSC262
6925TCF454
5521PPP2R2B51

39759HDAC4202
7273TTN104
51592TRIM3373
6925TCF454
5521PPP2R2B51

41080CFTR222
9759HDAC4202
51592TRIM3373
4204MECP260
6925TCF454
5521PPP2R2B51
64127NOD250
We then assumed that the set of identified genes for each pedigree work together and can be integrated into a single module. We defined them as disease modules and performed the enrichment analysis. A total of 92 enriched terms for Family 1 (18 GO MF, 44 GO BP, 26 GO CC, and 4 REAC), 19 enriched terms for Family 2 (18 GO MF, 44 GO BP, 26 GO CC, and 4 REAC), 37 enriched terms for Family 3 (9 GO MF, 9 GO BP, and 19 GO CC), and 155 enriched terms for Family 4 (29 GO MF, 72 GO BP, 46 GO CC, 7 REAC, and 1 KEGG) were identified. The results of the top 15 terms in each GO category and all results in REACTOME and KEGG are presented in Table 5.
Table 5

Top 15 terms in each GO category and results for REACTOME and KEGG.

Family No.SourceTerm_nameTerm_idAdjusted p
1GO:MFVoltage-gated calcium channel activity involved in AV node cell action potentialGO:00860563,329E-04
GO:MFVoltage-gated calcium channel activityGO:00052458,797E-04
GO:MFVoltage-gated calcium channel activity involved in cardiac muscle cell action potentialGO:00860071,109E-03
GO:MFGated channel activityGO:00228361,270E-03
GO:MFIon channel activityGO:00052163,921E-03
GO:MFVoltage-gated ion channel activityGO:00052444,350E-03
GO:MFVoltage-gated channel activityGO:00228324,434E-03
GO:MFHigh-voltage-gated calcium channel activityGO:00083314,982E-03
GO:MFChannel activityGO:00152676,566E-03
GO:MFPassive transmembrane transporter activityGO:00228036,631E-03
GO:MFCalcium channel activityGO:00052621,450E-02
GO:MF4-Hydroxy-2-oxoglutarate aldolase activityGO:00087001,696E-02
GO:MFInorganic cation transmembrane transporter activityGO:00228901,868E-02
GO:MFCalcium ion transmembrane transporter activityGO:00150852,112E-02
GO:MFCation channel activityGO:00052612,351E-02
GO:BPMembrane depolarization during action potentialGO:00860105,793E-06
GO:BPMembrane depolarizationGO:00518991,478E-04
GO:BPCalcium ion importGO:00705092,539E-04
GO:BPMembrane depolarization during cardiac muscle cell action potentialGO:00860123,450E-04
GO:BPRegulation of ion transportGO:00432695,961E-04
GO:BPRegulation of biological qualityGO:00650081,127E-03
GO:BPAnterograde transsynaptic signalingGO:00989161,414E-03
GO:BPChemical synaptic transmissionGO:00072681,414E-03
GO:BPCation transportGO:00068121,458E-03
GO:BPAction potentialGO:00015081,506E-03
GO:BPTranssynaptic signalingGO:00995371,579E-03
GO:BPMembrane depolarization during atrial cardiac muscle cell action potentialGO:00989121,645E-03
GO:BPRegulation of neurotransmitter levelsGO:00015051,650E − 03
GO:BPRegulation of membrane potentialGO:00423911,682E − 03
GO:BPSynaptic signalingGO:00995362,185E − 03
GO:CCSynaptic membraneGO:00970605,243E − 05
GO:CCPostsynaptic membraneGO:00452116,704E − 05
GO:CCPostsynapseGO:00987943,189E − 04
GO:CCSynapseGO:00452023,561E − 04
GO:CCVoltage-gated calcium channel complexGO:00058914,325E − 04
GO:CCT-tubuleGO:00303158,299E − 04
GO:CCNeuron projectionGO:00430051,019E − 03
GO:CCCalcium channel complexGO:00347041,957E − 03
GO:CCPostsynaptic densityGO:00140692,909E − 03
GO:CCAsymmetric synapseGO:00322793,279E − 03
GO:CCGlutamatergic synapseGO:00989783,547E − 03
GO:CCIntegral component of plasma membraneGO:00058873,884E − 03
GO:CCL-type voltage-gated calcium channel complexGO:19904544,532E − 03
GO:CCPostsynaptic specializationGO:00995724,632E − 03
GO:CCAxonGO:00304245,015E − 03
REACNCAM1 interactionsREAC:R-HSA-4190372,470E − 03
REACNCAM signaling for neurite out-growthREAC:R-HSA-3751656,919E − 03
REACPhase 2-plateau phaseREAC:R-HSA-55768932,470E − 02
REACDefective GCK causes maturity-onset diabetes of the young 2 (MODY2)REAC:R-HSA-56190733,866E − 02

2GO:MFInterleukin-1 receptor activityGO:00049081,217E − 02
GO:MFInterleukin-1 bindingGO:00199662,083E − 02
GO:MFCation transmembrane transporter activityGO:00083242,968E − 02
GO:MFGated channel activityGO:00228363,603E − 02
GO:BPIon transmembrane transportGO:00342201,466E − 02
GO:BPCation transportGO:00068122,046E − 02
GO:BPRegulation of reactive oxygen species biosynthetic processGO:19034262,146E − 02
GO:BPRegulation of transportGO:00510494,096E − 02
GO:BPReactive oxygen species biosynthetic processGO:19034094,859E − 02
GO:CCNode of RanvierGO:00332689,149E − 04
GO:CCAxon initial segmentGO:00431944,511E − 03
GO:CCSynapseGO:00452021,322E − 02
GO:CCPostsynapseGO:00987942,761E − 02
GO:CCGlutamatergic synapseGO:00989782,787E − 02
GO:CCPostsynaptic densityGO:00140693,305E − 02
GO:CCAsymmetric synapseGO:00322793,655E − 02
GO:CCSynaptic membraneGO:00970604,836E − 02
GO:CCPostsynaptic specializationGO:00995724,886E − 02
REACInteraction between L1 and AnkyrinsREAC:R-HSA-4450954,993E − 02

3GO:MFVoltage-gated ion channel activityGO:00052442,709E − 03
GO:MFVoltage-gated channel activityGO:00228322,786E − 03
GO:MFGated channel activityGO:00228365,046E − 03
GO:MFIon channel activityGO:00052162,213E − 02
GO:MFInterleukin-1 receptor activityGO:00049083,967E − 02
GO:MFChannel activityGO:00152674,309E − 02
GO:MFCation transmembrane transporter activityGO:00083244,359E − 02
GO:MFPassive transmembrane transporter activityGO:00228034,364E − 02
GO:MFCation channel activityGO:00052614,642E − 02
GO:BPMembrane depolarization during action potentialGO:00860102,140E − 03
GO:BPChemical synaptic transmissionGO:00072686,930E − 03
GO:BPAnterograde transsynaptic signalingGO:00989166,930E − 03
GO:BPTranssynaptic signalingGO:00995377,659E − 03
GO:BPSynaptic signalingGO:00995361,026E − 02
GO:BPMembrane depolarization during cardiac muscle cell action potentialGO:00860122,544E − 02
GO:BPMembrane depolarization during atrial cardiac muscle cell action potentialGO:00989122,640E − 02
GO:BPSignal release from synapseGO:00996434,238E − 02
GO:BPNeurotransmitter secretionGO:00072694,238E − 02
GO:CCPostsynapseGO:00987941,950E − 03
GO:CCGlutamatergic synapseGO:00989782,275E − 03
GO:CCZ discGO:00300182,747E − 03
GO:CCI bandGO:00316744,378E − 03
GO:CCSynaptic membraneGO:00970604,834E − 03
GO:CCSynapseGO:00452025,661E − 03
GO:CCPostsynaptic membraneGO:00452118,485E − 03
GO:CCIon channel complexGO:00347021,235E − 02
GO:CCPostsynaptic densityGO:00140692,124E − 02
GO:CCSomatodendritic compartmentGO:00364772,218E − 02
GO:CCAsymmetric synapseGO:00322792,351E − 02
GO:CCPresynapseGO:00987932,828E − 02
GO:CCSarcomereGO:00300172,897E − 02
GO:CCVoltage-gated calcium channel complexGO:00058913,140E − 02
GO:CCPostsynaptic specializationGO:00995723,151E − 02

4GO:MFGated channel activityGO:00228361,641E − 07
GO:MFIon channel activityGO:00052161,256E − 06
GO:MFChannel activityGO:00152673,180E − 06
GO:MFPassive transmembrane transporter activityGO:00228033,237E − 06
GO:MFVoltage-gated ion channel activityGO:00052442,328E − 05
GO:MFVoltage-gated channel activityGO:00228322,418E − 05
GO:MFCation channel activityGO:00052619,049E − 05
GO:MFInorganic molecular entity transmembrane transporter activityGO:00153181,140E − 04
GO:MFInorganic cation transmembrane transporter activityGO:00228902,734E − 04
GO:MFIon transmembrane transporter activityGO:00150754,094E − 04
GO:MFVoltage-gated calcium channel activity involved in AV node cell action potentialGO:00860564,671E − 04
GO:MFLigand-gated channel activityGO:00228345,175E − 04
GO:MFLigand-gated ion channel activityGO:00152765,175E − 04
GO:MFVoltage-gated cation channel activityGO:00228435,725E − 04
GO:MFCation transmembrane transporter activityGO:00083246,190E − 04
GO:BPMembrane depolarization during action potentialGO:00860102,786E − 08
GO:BPRegulation of membrane potentialGO:00423913,523E − 08
GO:BPMembrane depolarizationGO:00518991,673E − 06
GO:BPAction potentialGO:00015083,104E − 05
GO:BPMembrane depolarization during atrial cardiac muscle cell action potentialGO:00989123,486E − 05
GO:BPRegulation of biological qualityGO:00650085,156E − 05
GO:BPMulticellular organismal signalingGO:00356376,981E − 05
GO:BPBehaviorGO:00076107,368E − 05
GO:BPIon transmembrane transportGO:00342201,226E − 04
GO:BPCardiac muscle cell action potentialGO:00860012,034E − 04
GO:BPRegulation of ion transportGO:00432692,416E − 04
GO:BPMembrane depolarization during AV node cell action potentialGO:00860453,473E − 04
GO:BPRegulation of ion transmembrane transportGO:00347654,035E − 04
GO:BPCation homeostasisGO:00550804,197E − 04
GO:BPMembrane depolarization during cardiac muscle cell action potentialGO:00860124,806E − 04
GO:CCIon channel complexGO:00347025,650E − 07
GO:CCTransmembrane transporter complexGO:19024954,620E − 06
GO:CCTransporter complexGO:19903516,231E − 06
GO:CCPostsynapseGO:00987946,902E − 06
GO:CCSynaptic membraneGO:00970601,047E − 05
GO:CCPostsynaptic membraneGO:00452111,501E − 05
GO:CCCation channel complexGO:00347031,982E − 05
GO:CCNeuron projectionGO:00430055,502E − 05
GO:CCIntrinsic component of plasma membraneGO:00312261,206E − 04
GO:CCSynapseGO:00452022,776E − 04
GO:CCSarcolemmaGO:00423833,037E − 04
GO:CCCell surfaceGO:00099865,783E − 04
GO:CCVoltage-gated calcium channel complexGO:00058916,149E − 04
GO:CCPostsynaptic densityGO:00140698,046E − 04
GO:CCPlasma membrane regionGO:00985909,173E − 04
KEGGCircadian entrainmentKEGG:047133,488E − 02
REACNCAM1 interactionsREAC:R-HSA-4190373,277E − 06
REACNCAM signaling for neurite out-growthREAC:R-HSA-3751651,888E − 05
REACButyrophilin (BTN) family interactionsREAC:R-HSA-88516801,215E − 02
REACLong-term potentiationREAC:R-HSA-96202441,438E − 02
REACAxon guidanceREAC:R-HSA-4224752,470E − 02
REACNervous system developmentREAC:R-HSA-96751083,267E − 02
REACPhase 2-plateau phaseREAC:R-HSA-55768933,338E − 02
Families 1, 3, and 4 showed very similar patterns of the GO MF pathways. The enriched molecular function in these families was ion channel activity (GO: 0086056, GO: 0005245, GO: 0086007, GO: 0022836, GO: 0005216, GO: 0005244, GO: 0022832, and GO: 0015267). In Family 2, interleukin-1 receptor activity (IL-1) (GO: 0004908) and binding (GO: 0019966), cation transmembrane transporter (GO: 0008324), and gated channel activity (GO: 0022836) were enriched. In the BP category, candidate genes were mainly enriched in biological processes, such as membrane depolarization (GO: 0086010, GO: 0051899, GO: 0086012, GO: 0098912, and GO: 0086045) and regulation of synaptic functions (anterograde transsynaptic signaling GO: 0098916, chemical synaptic transmission GO: 0007268, transsynaptic signaling GO: 0099537, and synaptic signaling GO: 0099536). In addition, the processes of reactive oxygen biosynthesis (GO: 1903409, GO: 1903426) and ion transport (GO: 0034220, GO: 0006812) were enriched in Family 2. In the CC category, candidate genes were enriched in synapses (GO: 0097060, GO: 0045211, GO: 0098794, GO: 0045202, and GO: 0098978) and ion channel complexes (GO: 0005891, GO: 0034704, GO: 1990454, GO: 0034702, and GO: 0034703). In Family 2, candidate genes were also found in the node of Ranvier (GO: 0033268) and in the initial segment of the axon (GO: 0043194). A significant KEGG pathway associated with circadian control (KEGG: 04713) was found in Family 4. The most significant REACTOME pathways included NCAM1 interactions (REAC: R-HSA-419037) and signaling for neurite out-growth (REAC: R-HSA-375165), and phase 2-plateau phase (REAC: R-HSA-5576893) in Families 1 and 4, and interaction between L1 and Ankyrins (REAC: R-HSA-445095) in Family 3.

4. Discussion

Recent studies suggest that in models of the genetic architecture of ASD, common and rare variants interact additively to form susceptibility [47-49]. Common variants likely play a major role in population-level susceptibility, whereas rare mutations contribute substantially to individual susceptibility [21]. Following this hybrid model, we used polygenic risk scores to analyze four extended pedigrees of Kazakhstani ancestry and prioritized ASD risk genes with common and rare inherited and DNM variants. The combination of ASD and BAP was used to improve the performance of risk gene identification. We then performed integrative analysis by constructing PPI networks. We were particularly interested in the nodal elements of the obtained PPI networks. We hypothesized that any perturbation at these important nodes could trigger abnormal conditions such as diseases [50, 51]. According to the obtained results, ten genes clearly formed potentially important nodes in the PPI networks. Six of these genes, namely HDAC4, MECP2, TCF4, TRIM33, TTN, and TSC2, belong to the SFARI category 1-2 (high-confidence and strong candidate genes) and are widely associated with the neuropathological mechanisms of ASD [31, 52–70]. The CFTR, NOD2, PPP2R2B, and TTR genes were not found in the SFARI databases, and data on the role of these genes in ASD are very sparse [71, 72]. However, although the exact mechanism is not clear, there is some evidence of a link between these genes and ASD. The CFTR gene controls secretion and absorption of ions and water in epithelial tissues [73]. Immunohistochemical staining with a mouse monoclonal antibody directed against the C-terminal amino acid sequence of human CFTR revealed diffuse neuronal expression of CFTR in ten human control fetuses at 13 to 40 weeks of gestation [74]. This study showed that CFTR has an early and widespread distribution during development. In addition, a case of autism associated with a genetic variant of CFTR and early exposure to herpes simplex virus (HSV) has been described [71]. The NOD2 gene belongs to the intracellular NOD-like receptor family and plays an important role in the immune response to intracellular bacterial lipopolysaccharides (LPS) [75]. The central role in maintaining the balance between the gut microbiota and the host immune response to control inflammation [76] makes NOD2 one of the most important susceptibility genes for inflammatory bowel diseases [77-82]. At the same time, a number of studies confirm that autistic children are at higher risk for this disorder [83-88]. Moreover, there is evidence of an association between maternal inflammatory bowel disease and ASD in children [89, 90]. The PPP2R2B gene encodes a neuron-specific B regulatory subunit of protein phosphatase 2 (PP2A), which regulates synaptic plasticity [91]. Some studies suggested that DNMs in the PPP2R2B gene may partially contribute to the genetic landscape of intellectual disability [92], but we found only one study linking this gene to ASD [72]. However, this gene may be a strong ASD candidate given a recent study, which highlights a role of another subunit of PP2A (PPP2R5D) in dendrites and synapses using neuron-specific protein network of ASD risk genes [31]. Another strong candidate may be the TTR gene, which is involved in the transport of thyroid [93] and retinol [94]. The involvement of TTR in novel functions, such as neuroprotection, is part of the very recent and constantly evolving knowledge [95]. In addition, TTR has been shown to interact with the GABAA receptor subunit and regulate its expression and function [96]. GABA receptors play an important role in brain development and synchronization of neural network activity. Since these receptors are located on synaptic and extrasynaptic membranes, a deficiency of GABA receptors leads to a lack of neurotransmission and is associated with ASD [97, 98]. Considering that disease genes tend to cluster and cooccur at central sites in the network [48], the above-mentioned genes may represent a priority list for further validation studies. Another rationale for constructing a PPI network with ASD and BAP risk genes was to identify convergent signaling pathways. Despite the multiplicity of ASD risk genes in each pedigree, our results suggest overlapping functions involving a limited number of biological pathways. Thus, most of the ASD networks is localized in specific cellular compartments such as axons, ion channel complex, and synapses, whereas most biological processes involve ion channel activity, regulation of synaptic function, and membrane depolarization. These findings confirm the results of previous studies that described synaptic functions and ion channel activity in the development of ASD [31, 99–102] and allow us to hypothesize that the main course of development of autistic traits from BAP to ASD does not change. However, we also identified several novel or poorly characterized signaling pathways, such as circadian entrainment, neural cell adhesion molecule 1 (NCAM1) interaction, butyrophilin family (BTN2 and BTN3) interaction, and the interaction between L1 and ankyrins. The first of these pathways may be of particular interest given the growing evidence for circadian disruption in ASD patients [103-105]. The genes that form NCAM1 interactions gene set are involved in neuronal development and synaptic plasticity [106], and perhaps this pathway is not so unexpected for ASD. Apparently, NCAM1 can be considered a general vulnerability factor for neurological and psychiatric disorders [107]. The role of BTN2 and BTN3 and related proteins in the neurodevelopmental disorders is much less studied [108, 109]. BTNs are regulators of immune responses and exert both stimulatory and inhibitory effects on immune cells [110-112]. The BTN enriched gene set correlates a previous finding of a dysregulated immune system in ASD [113-120]. Ankyrin B (AnkB) is an adaptor and scaffold for motor proteins and various ion channels that is expressed ubiquitously in the organism, including the brain [121]. L1 interaction with AnkB mediates branching and synaptogenesis of cortical inhibitory neurons. AnkB mutations and polymorphisms are associated with ASD [23, 69, 122, 123], but the detailed mechanisms underlying the neurological symptoms associated with AnkB are unknown. Interestingly, both the NCAM1 interaction pathway and the interaction between L1 and ankyrins were prioritized in a study of the role of rare variants in biological processes and molecular pathways leading to the pathogenesis of Alzheimer's disease [124], indicating the prospects for their further investigation in the context of neurological disorders. The final important finding of our study is the identification of DNMs in affected children. Detailed information on these DNMs can be found in Table 6. Some of these DNMs have been previously described in ASD and/or other neurodevelopmental disorders [125-130], and others are indirectly associated with ASD. In this context, the p.Ala797Asp mutation in the potassium channel gene KCNH2 was of particular interest. This DNM results in a nonconservative amino acid exchange of a nonpolar alanine residue for a negatively charged aspartic acid residue at a conservative position (https://www.ncbi.nlm.nih.gov/clinvar/variation/200440/). An in silico analysis revealed that this mutation affects the protein structure or functions (https://www.ncbi.nlm.nih.gov/clinvar/variation/200440/). The data on the clinical significance of this variant are lacking. This study appears to be the first report on this DNM in an affected individual.
Table 6

Detailed information about DNMs.

DNMGene functionExpressionSupporting evidence, ClinVar
c.1018A>GGlucokinase, phosphorylates glucose to produce glucose-6-phosphate (the first step in most glucose metabolic pathways)Pancreas and liverThe variant was changed to likely pathogenic upon submission. Other variants in the GCK gene that alter enzyme activity have been associated with various types of diabetes and hyperinsulinemic hypoglycemia [139]. There is a report showing that neonatal hypoglycemia increases the risk of ASD threefold in children born at term [140].
c.769T>G4-Hydroxy-2-oxoglutarate aldolase 1, catalyzes the final step in the metabolic pathway of hydroxyproline, releasing glyoxylate and pyruvateKidney, liver, heart, fat, and brainThe variant results in a nonconservative amino acid change in the encoded protein sequence. Three of five in silico tools predicted a deleterious effect of the variant on protein function. The variant was found at a frequency of 6.4E − 05 in 249558 control chromosomes, most notably at a frequency of 0.00082 within the East Asian subpopulation in the gnomAD database. This frequency is not higher than the maximum expected for a pathogenic variant in HOGA1 causing primary hyperoxaluria, type III, (0.0015) and does not allow conclusions to be drawn about the significance of the variant. c.769T > G has been reported in the literature in homozygous and compound heterozygous states in several individuals with primary hyperoxaluria, type III [141144]. These data indicate that the variant is very likely to be associated with disease. At least one publication reports experimental evidence evaluating an impact on protein function and demonstrated that the variant resulted in no measurable activity [145]. Two clinical diagnostic laboratories have submitted clinical-significance assessments for this variant to ClinVar after 2014 and classified the variant as pathogenic/likely pathogenic. There are reports that hyperoxaluria may be involved in the pathogenesis of ASD in children [146, 147].
c.637G>Ac.2022G>TCollagen type III alpha 1 chain, encoding the pro-alpha1 chains of the collagen type III, is found in extensible connective tissues such as skin, lung, uterus, intestine, and the vascular system, often in association with type I collagenGall bladder, placenta, and 12 other tissuesThe variants are associated with Ehlers-Danlos syndrome, type 4 [148, 149]. Ehlers-Danlos syndrome type 4 shares several similar neurophenotypes with ASD, such as mood disorders, proprioceptive impairments, sensory hyper/hyposensitivities, eating disorders, and suicidality [150].
c.287G > CSolute carrier family 22 member 5, transporter for organic cations and a sodium-dependent transporter with high affinity for carnitinePotassium voltage-gated channel subfamily Q member 2, integral membrane proteins of potassium channelKidney, small intestine and 23 other tissuesBrain, adrenal, and testisThe variant results in a nonconservative amino acid change in the major facilitator superfamily domain (IPR020846) of the encoded protein sequence. Five of five in silico tools predicted a deleterious effect of the variant on protein function. The variant allele was found at a frequency of 0.00041 in 195700 control chromosomes. This frequency is not significantly higher than would be expected for a pathogenic variant in SLC22A5 causing systemic primary carnitine deficiency (0.00041 vs. 0.0046), which does not allow to conclude the significance of the variant. c.287G > C has been reported in the literature in several individuals with suspected systemic primary carnitine deficiency [151, 152]. These data do not allow any conclusion about variant significance. At least one publication reports experimental evidence indicating that the variant reduced carnitine transport activity to less than 20% of wild-type in vitro [152]. Three clinical diagnostic laboratories have submitted clinical-significance assessments for this variant to ClinVar after 2014 without evidence for independent evaluation. These laboratories cited the variant with conflicting assessments: one laboratory classified the variant likely pathogenic, one laboratory classified the variant as likely benign, and a third laboratory classified the variant as uncertain significance. Based on the evidence outlined above, until additional information becomes available, the variant was classified as VUS-possibly pathogenic. An association between primary carnitine deficiency and ASD has been reported [153, 154]. It is hypothesized that carnitine deficiency in the brain causes nonsyndromal autism with extreme male tendency [155].The variant is predicted to result in loss of normal protein function due to protein truncation as the last 124 amino acids of the protein are lost.
c.457G>ACystathionine beta-synthase, catalyzes the conversion of homocysteine to cystathionine, the first step of the transsulfuration pathwayLiver, brain and 6 other tissuesThe variant involves the modification of a conserved nucleotide located within the pyridoxal phosphate-dependent enzyme domain (InterPro). 5/5 in silico tools predict a deleterious outcome for this variant. This variant was found in 1/156632 control chromosomes at a frequency of 0.0000064, which does not exceed the estimated maximum expected allele frequency of a pathogenic CBS variant (0.0030414). In addition, functional studies in yeast suggest that the variant may affect protein function [156]. The variant has been reported in a Saudi Arabian family, in which two affected patients with homocystinuria were homozygous for the variant inherited from unaffected heterozygous parents [157]. There are reports that children with classical homocystinuria may have isolated ASD due to cystathionine-β-synthase deficiency [158, 159].
c.5146+1G>ANuclear receptor binding SET domain protein 1, enhances androgen receptor transactivationTestis, thyroid and 25 other tissuesThe variant affects a donor splice site in intron 14 of the NSD1 gene. It is expected to disrupt RNA splicing and likely results in an absent or disrupted protein product. Donor and acceptor splice site variants generally result in loss of protein function [160], and loss-of-function variants in NSD1 are known to be pathogenic and the major cause of Sotos syndrome [161163]. One report found several rare variations of the NSD1 gene in individuals with ASD, although the variants were not considered pathogenic [164].
c.5767C>TPericentrin, interacts with the microtubule nucleation component gamma-tubulin and is probably important for the normal functioning of centrosomes, the cytoskeleton, and cell cycle progressionTestis, bone marrow and 24 other tissuesThe variant has been classified as pathogenic according to ACMG in the context of microcephalic osteodysplastic primordial dwarfism type II. The variant produces a premature translational stop signal (p.Arg1923∗) in the PCNT gene. It is expected to result in absent or impaired protein product. Loss-of-function variants in PCNT are known to be pathogenic [165, 166].
c.2390C>APotassium voltage-gated channel subfamily H member 2, encodes a component of a voltage-activated potassium channel found in cardiac muscle, neurons, and microgliaBone marrow, testis and 14 other tissuesThe variant results in a nonconservative amino acid substitution of a nonpolar alanine residue with a negatively charged aspartic acid residue at a position that is conserved across species. In silico analysis predicts Ala797Asp is probably damaging to the protein structure/function. Mutations in nearby residues (Glu788Asp, Glu788Lys, Arg791Trp, Gly800Glu, Gly800Trp) have been reported in association with Long QT syndrome (LQTS), further supporting the functional importance of this region of the protein. Furthermore, the Ala797Asp variant was not observed in approximately 6,500 individuals of European and African American ancestry in the NHLBI exome sequencing project, indicating it is not a common benign variant in these populations. In summary, while Ala797Asp is a good candidate for a disease-causing mutation, with the clinical and molecular information available at this time we cannot unequivocally determine the clinical significance of this variant.
c.740C>THepatic and glial cell adhesion molecule, acts as a homodimer and is involved in cell motility and cell-matrix interactionsBrain, fat and liverThe variant was classified as a variant of unknown significance for megalencephalic leukoencephalopathy with subcortical cysts. Other rare mutations in the HEPACAM gene have been found to cause either macrocephaly and mental retardation with or without autism or benign familial macrocephaly [129].
c.1087A>CWnt family member 10A, a member of the WNT gene family, involved in oncogenesis and several developmental processes, including cell fate regulation and cell patterning during embryogenesisSkin, placenta and 16 other tissuesThe variant was not observed in significant frequency in approximately 5300 individuals of European and African American ancestry in the NHLBI exome sequencing project, suggesting that it is not a common benign variant in these populations. The variant is a semiconservative amino acid substitution that may affect secondary protein structure because these residues differ in some properties. This substitution occurs at a position that is conserved across species, and in silico analysis predicts that this variant is likely to affect protein structure/function.
Taken together, the DNMs that we found only in children with ASD cannot explain the heritable nature of ASD in the studied families. However, because their greatest number was found in children with severe autism (child AU209 with the most severe ASD has four DNMs), we can assume that dbSNP and rare inherited variants represent a common genomic burden. Their combinations converge in common biological processes and likely contribute to the increased threshold of susceptibility to ASD, while the severity of ASD is determined by DNMs. Similarly, it has been previously reported that patients carrying DNMs in two or more candidate genes exhibit more severe phenotypes of ASD [131]. At the same time, the results showed that the genetic heterogeneity of ASD is so great that different DNMs could be identified even in siblings.

4.1. Limitations

We understand that this study has many limitations given the latest genomic technologies, bioinformatics methods, and the large-scale studies [132-134]. However, paradoxically, the large amount of data generated by these studies has raised new challenges and questions, and many more studies and approaches are needed to unravel the complex mechanisms of ASD. In our brief study, we attempted to use a novel approach by constructing PPI networks based on putative causative genes for ASD and BAP. For our study, we chose extended pedigrees, which provided a good opportunity to examine inherited genetic risk factors. We integrated three major genetic components of ASD, and we believe that the genes identified in this study are considered penetrant enough to cause ASD-related traits and should be prioritized for further validation. However, the number of variants that microarrays can contain is limited. GSA tends to focus on relatively common variants, so the study has a bias in its design. It is possible that other undetected or uncharacterized variants not included in this study play a critical role. Risk alleles may be at the level of rare inherited copy number variants (CNVs) [135-138]; therefore, examination of CNVs within these families will be a subject of further study. In addition, we performed our analysis with samples that came mainly from families of Kazakh descent. For this reason, our results cannot be generalized to other populations without further investigation. Future approaches should ideally use whole-genome sequencing in extended pedigrees of not only Kazakh ancestry in conjunction with comprehensive clinical validation of detected deleterious variants.

5. Conclusion

This study is an attempt to describe the genetic trajectory of autistic trait development in four extended pedigrees of Kazakhstani ancestry. Construction of networks based on putative causative genes for ASD and BAP revealed no differences in major functional pathways compared with those shown in previous studies for ASD only. Nevertheless, our study uncovered several nodal genes and signaling pathways that have not previously been associated with ASD but for whose relevance there are strong biological arguments. The obtained results highlight the importance of including subclinical phenotypes in the search for inherited causes of ASD and provide insights into previously unknown convergent disease pathways. The study is also interesting regarding new DNMs that may contribute to the pathogenesis of ASD.
  155 in total

1.  Whole exome sequencing in extended families with autism spectrum disorder implicates four candidate genes.

Authors:  Nicola H Chapman; Alejandro Q Nato; Raphael Bernier; Katy Ankenman; Harkirat Sohi; Jeff Munson; Ashok Patowary; Marilyn Archer; Elizabeth M Blue; Sara Jane Webb; Hilary Coon; Wendy H Raskind; Zoran Brkanac; Ellen M Wijsman
Journal:  Hum Genet       Date:  2015-07-24       Impact factor: 4.132

2.  Mutations in DHDPSL are responsible for primary hyperoxaluria type III.

Authors:  Ruth Belostotsky; Eric Seboun; Gregory H Idelson; Dawn S Milliner; Rachel Becker-Cohen; Choni Rinat; Carla G Monico; Sofia Feinstein; Efrat Ben-Shalom; Daniella Magen; Irith Weissman; Celine Charon; Yaacov Frishberg
Journal:  Am J Hum Genet       Date:  2010-09-10       Impact factor: 11.025

Review 3.  Copy-number variation in the pathogenesis of autism spectrum disorder.

Authors:  Emiko Shishido; Branko Aleksic; Norio Ozaki
Journal:  Psychiatry Clin Neurosci       Date:  2013-12-29       Impact factor: 5.188

4.  Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations.

Authors:  Brian J O'Roak; Laura Vives; Santhosh Girirajan; Emre Karakoc; Niklas Krumm; Bradley P Coe; Roie Levy; Arthur Ko; Choli Lee; Joshua D Smith; Emily H Turner; Ian B Stanaway; Benjamin Vernot; Maika Malig; Carl Baker; Beau Reilly; Joshua M Akey; Elhanan Borenstein; Mark J Rieder; Deborah A Nickerson; Raphael Bernier; Jay Shendure; Evan E Eichler
Journal:  Nature       Date:  2012-04-04       Impact factor: 49.962

5.  High resolution analysis of rare copy number variants in patients with autism spectrum disorder from Taiwan.

Authors:  Chia-Hsiang Chen; Hsin-I Chen; Wei-Hsien Chien; Ling-Hui Li; Yu-Yu Wu; Yen-Nan Chiu; Wen-Che Tsai; Susan Shur-Fen Gau
Journal:  Sci Rep       Date:  2017-09-20       Impact factor: 4.379

Review 6.  The Relationship between Autism and Ehlers-Danlos Syndromes/Hypermobility Spectrum Disorders.

Authors:  Emily L Casanova; Carolina Baeza-Velasco; Caroline B Buchanan; Manuel F Casanova
Journal:  J Pers Med       Date:  2020-12-01

Review 7.  Biological Timing and Neurodevelopmental Disorders: A Role for Circadian Dysfunction in Autism Spectrum Disorders.

Authors:  Ethan Lorsung; Ramanujam Karthikeyan; Ruifeng Cao
Journal:  Front Neurosci       Date:  2021-03-12       Impact factor: 4.677

8.  GWAS Meta-Analysis Reveals Shared Genes and Biological Pathways between Major Depressive Disorder and Insomnia.

Authors:  Yi-Sian Lin; Chia-Chun Wang; Cho-Yi Chen
Journal:  Genes (Basel)       Date:  2021-09-26       Impact factor: 4.096

9.  Butyrophilin Btn2a2 inhibits TCR activation and phosphatidylinositol 3-kinase/Akt pathway signaling and induces Foxp3 expression in T lymphocytes.

Authors:  Johannes U Ammann; Anne Cooke; John Trowsdale
Journal:  J Immunol       Date:  2013-04-15       Impact factor: 5.422

10.  Bacterial sensor Nod2 prevents inflammation of the small intestine by restricting the expansion of the commensal Bacteroides vulgatus.

Authors:  Deepshika Ramanan; Mei San Tang; Rowann Bowcutt; P'ng Loke; Ken Cadwell
Journal:  Immunity       Date:  2014-07-31       Impact factor: 31.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.