Literature DB >> 32450902

Exome sequencing study revealed novel susceptibility loci in subarachnoid hemorrhage (SAH).

Xiwa Hao1, Jiangxia Pang1, Ruiming Li1, Lin Lv1, Guorong Liu1, Yuechun Li1, Guojuan Cheng1, Jingfen Zhang2.   

Abstract

AIM: To expand our current understanding of the genetic basis of subarachnoid hemorrhage (SAH), and reveal the susceptibility genes in SAH risk.
METHODS: We conducted whole-exome sequencing (WES) in a cohort of 196 individuals, including 94 SAH patients and 94 controls, as well as 8 samples that belong to two pedigrees. Systematically examination for rare variations (through direct genotyping) and common variations (through genotyping and imputation) for SAHs were performed in this study.
RESULTS: A total of 16,029 single-nucleotide polymorphisms (SNPs) and 108,999 short indels were detected in all samples, and among them, 30 SNPs distributed on 17 genes presented a strong association signal with SAH. Two novel pathogenic gene variants were identified as associated risk loci, including mutation in TPO and PALD1. The statistical analysis for rare, damaging variations in SAHs identified several susceptibility genes which were involved in degradation of the extracellular matrix and transcription factor signal pathways. And 25 putative pathogenic genes for SAH were also identified basic on functional interaction network analysis with the published SAH-associated genes. Additionally, pedigree analysis revealed autosomal dominant inheritance of pathogenic genes.
CONCLUSION: Systematical analysis revealed a key role for rare variations in SAH risk and discovered SNPs in new complex loci. Our study expanded the list of candidate genes associated with SAH risk, and will facilitate the investigation of disease-related mechanisms and potential clinical therapies.

Entities:  

Keywords:  Genome-wide association analysis (GWAS); Pedigree analysis; Rare variations; Subarachnoid hemorrhage (SAH); Whole-exome sequencing (WES)

Mesh:

Year:  2020        PMID: 32450902      PMCID: PMC7249693          DOI: 10.1186/s13041-020-00620-6

Source DB:  PubMed          Journal:  Mol Brain        ISSN: 1756-6606            Impact factor:   4.041


Introduction

Subarachnoid hemorrhage (SAH), the rarest but most fatal type of stroke, has shown an annual incidence of 8–10/100,000 persons (2007), 30-day case fatality of 35–45% in western countries [17, 20]. In China, annual incidence (per 100,000 persons) of SAH was 6.2, which is slightly lower than in western countries [40]. The majority of patients with SAH usually suffered from ruptured intracranial aneurysm (IA). SAH risk was considered to be related to smoking, hypertension, and poor socioeconomic status [2, 14]. Moreover, studies based on molecular mechanisms have shown that genetic factors also play an important role in the formation, growth and rupture of IA [6, 11, 24, 32, 38]. Therefore, IA is identified as a complex disease that influenced by various genes and environmental factors. Conducting early detection and intervention by identifying risk factors may facilitate to avoid the formation and rupture of IA, and is crucial for the reduced incidence of SAH [12]. However, comprehensive knowledge of pathogenic and ruptured mechanisms of IA has not yet been defined. Some studies have focused on the pathogenic mechanisms of IA in China, and several susceptibility genes have been identified. Due to the limitation of technology and small sample size, it is still necessary to further study the genetic factors of IA in China. Although genome-wide association analysis (GWAS) studies have found some novel gene loci related to IAs, they can only explain part of the genetic risk. Most of the GWAS studies have focused on both unruptured and ruptured IAs, thus, the gene loci highly related to ruptured IAs have not been completely detected. Moreover, rare, damaging variants also play an important role in complex diseases. With advances in sequencing technology, genetic analysis is gradually extending to rare variants, which often have more obvious functional consequences of harmful phenotypes [13, 29]. In this study, we set out to systematically examine rare variation (through direct genotyping) and common variation (through genotyping and imputation) for SAHs by whole-exome sequencing (WES) in a cohort of 196 samples. Mendelian inheritance analysis for the SAH pedigrees was also performed to identified the susceptibility genes. Our study constitutes a detailed simultaneous assessment of causal variations in a large sample of SAHs, offer an opportunity to better understand both the biological and genetic architecture of this type of complex disease.

Materials and methods

Study cohorts

We prospectively collected 196 samples, including 8 samples that belong to two pedigrees from Central Hospital of Baotou. The cohorts included 94 SAH cases(with ruptured intracranial aneurysm confirmed by Digital Subtraction Angiography, DSA and computed tomography, CT) and 94 controls(for each case, 1 control without SAH will be sort for interview. Controls will be matched on the basis of the following criteria: gender (sex) 10-year age strata (ie 10–19, 20–29, etc) sector of suburb of residence in Baotou (North, East, etc). Controls will be chosen from the spouse, relative or friend of patients without SAH who are currently in the same hospital as the case.). This study was approved by the Human Research Ethics Committee of Central Hospital of Baotou, and all participants provided written informed consent. Comprehensive clinical information was provided in Table S1, including height, weight, BMI, gender and age etc.

Whole-exome sequencing

Genomic DNA was isolated from peripheral whole blood samples of participants by using Genomic DNA Extraction Kit (Invitrogen, South San Francisco, CA, USA). The Qubit 3.0 fluorometer and gel electrophoresis were used to evaluate DNA quantity and integrity, respectively. The sequencing paired-end libraries were constructed for each sample and captured using SureSelect Human All Exon V6 kit (Agilent Technologies, Santa Clara, CA, USA) following the manufacturer’s instructions. All libraries were sequenced on BGI-SEQ 500 platform at BGI to obtain a desired depth of ~100X. The sequencing depths of each sample are listed in Table S2.

Whole-exome sequencing (WES) data processing and variant calling

To get high quality data, Trimmomatic [5] was used to filter out low-quality reads which contained adaptors, high base error rate (> 50%), and highly unknown base proportion (> 10%) from the raw sequencing data. The cleaned reads were aligned to human reference genome (UCSC hg19) by the Burrows-Wheeler Aligner-MEM (v.0.7.15) [26] with default parameters. All the aligned reads were further processed using Picard tools (v2.5.0) and Genome Analysis Toolkit (GATK, v3.7) [28] with default parameters, which included deduplication, base quality recalibration, and multiple-sequence realignment prior to mutation detection. Variant calling was performed for all the samples by using the Haplotype Caller algorithm in GATK with the parameters “-stand_call_conf 30 -stand_emit_conf 10 -minPruning 3”. Each variant was filtered using GATK hard filters with the parameters “QD<2.0 || FS>60 || MQ<40 || MQRankSum<-12.5 || ReadPosRankSum<-8.0” for SNPs and “QD < 2.0 || FS > 200 || ReadPosRankSum < -20” for Indels to reduce the false positive rate. We then called genotypes jointly across all samples at the remaining sites, followed by genotype refinement using the BEAGLE imputation software (v5.0) [7]. The variants were subsequently annotated by multiple databases using the ANNOVAR tool [37].

Sample quality control

The standard quality screening conducted independently in each sample included SNP and sample call rates (> 90%), Hardy–Weinberg equilibrium, Mendelian errors, gender inconsistencies and checks for population stratification. To obtain a high-quality set of samples, the outlier samples discovered using principal-component analysis in GCTA [39] were removed from further analysis.

Association testing

The single marker association analyses with SAH were performed using an additive genetic model implemented in SNPTEST (http://www.stats.ox.ac.uk/~marchini/software/gwas/snptest.html) for the common SNPs (MAF > 10%). Age, sex, BMI, smoking, drinking, body fat, and diabetes were used as covariates in the analysis.

Rare SNP filtering

We used different allele frequency threshold in several public population databases: 1000G (http://browser.1000genomes.org/index.html), ExAC, ESP etc., to filter out common variants. Then, only variants with frequency less than the thresholds in all these databases were considered as the rare SNPs of SAHs.

Functional impact prediction

Each variant category has to be assessed with a specific set of tools to predict their functional impact. Here, we assumed that synonymous variants have no functional impact, and all the stop gain and stop loss variants were considered as the deleterious mutations. The functional predictions of missense variants were performed by seven computational methods (SIFT (Ng, 2001 #4097), Polyphen2 [1], MutationTaster [33], CADD [27], REVEL [21], M-CAP [22], LRT [10]). The pathogenicity of missense mutations was assumed if predicted pathogenic by at least five out of the computational methods. The dpsi score were employed to determine the pathogenicity of splicing mutations.

Gene-based burden analysis

Gene-base test were performed for the rare, damaging variants. For each gene, we computed the burden of rare, damaging variants in SAH cases and controls, respectively. Fisher’s exact test was applied to determine the significantly associated genes in SAHs. Those genes with a P-value of less than 0.05 were identified as susceptibility genes in SAHs. SKAT-O [25] was also applied for burden test, which allowing for variants with opposite directions of effect to reside in the same gene.

Inheritance analysis in pedigrees

The SNPs were called from the 2 pedigrees, and were further filtered as the filtering criterion of rare SNPs. Then, all the SNPs were subjected to functional impact prediction. Mendelian inheritance analysis was performed for the diseasing causing SNPs with 4 inheritance patterns, including (1) dominant inheritance pattern; (2) recessive inheritance pattern; (3) semi-dominant inheritance pattern; (4) compound heterozygote inheritance pattern.

The network analysis

The SAH-associated genes were collected from the published studies. The STRING database and associated search tools [35] were used for identifying interacting partners of a list of SAH-associated genes. We employed the identified interacting partners as the candidate pathogenic genes in SAHs.

Results

Cohorts description and whole-exome sequencing

In this study, we performed ~100x whole-exome sequencing (WES) for 94 SAH cases, 94 controls, and 2 pedigrees. Comprehensive description of the height, weight, sex, age and the other clinical variables of the cohort are provided in Table S1. In brief, the SAH group included 55 hypertension, 10 diabetic, 11 hyperlipidemia, 46 smoker/former-smoker, and 22 drinker/former-drinker. The control group included 37 hypertension, 9 diabetic, 9 hyperlipidemia, 42 smoker/former-smoker, and 23 drinker/former-drinker. The statistics of the WES data was provided in Table S2 and S3, including effective bases, SNPs numbers, Indel numbers and Ti/Tv rate etc. for each sample. We totally discovered 716,029 single-nucleotide polymorphisms (SNPs) and 108,999 short indels in all the samples. We then applied Genome Analysis Toolkit (GATK) VQSR for SNVs to distinguish true sites of genetic variation from sequencing artifacts. Then, 549,553 SNPs were remained, including 148,967 exonic SNPs, for the further analysis (See Method section, Table S4). Following sample quality control, the whole-exome sequences of 93 patients with SAH and 92 controls were jointly analyzed (See Method section).

Imputation into GWAS

For imputation purposes, we conducted a genome-wide single-variant analysis of the common SNPs (minor allele frequency, MAF > 0.1) comparing the 93 SAH cases and 92 controls. The associations with SAH risk were tested using logistic regression adjusted for sex, BMI, smoking, drinking, body fat, and diabetes as covariates. The genomic inflation factor (λ = 1.006) showed no evidence of inflated test statistics. There were 30 SNPs distributed on 17 genes presented a strong association signal with SAH (Table 1, Fig. 1). In these SNPs, three of them were in exon, and one in UTR3, and the other in intron. We obtained two loci reached genome-wide significance, within the introns of two genes TPO and PALD1, respectively (Fig. 2), implies a putative functional role in the pathogenesis of SAHs.
Table 1

SNPs with the strongest association with SAH from the GWAS results

ChrPosRefAltCases MAFControls MAFORP valueFunctionGene
1072,300,743GC0.0870.2743.976.94E-07intronicPALD1
21,437,410CT0.2450.4842.901.16E-06intronicTPO
231,189,236AG0.2120.4252.752.37E-06intronicGALNT14
231,189,304CT0.2120.4252.752.37E-06intronicGALNT14
231,189,345TG0.2120.4252.752.37E-06intronicGALNT14
231,189,401GA0.2120.4252.752.37E-06intronicGALNT14
231,189,439AG0.2120.4252.752.37E-06intronicGALNT14
1072,306,967TC0.1740.3662.743.33E-06intronicPALD1
1072,306,978AC0.1740.3662.743.33E-06intronicPALD1
21,437,163CA0.1680.3762.986.23E-06intronicTPO
1950,189,818CG0.0980.0160.151.09E-05intronicPRMT1
203,846,843TC0.3480.1720.391.34E-05UTR3MAVS
1072,289,778TC0.1090.2693.011.82E-05exonicPALD1
1950,195,455AG0.0920.0160.162.38E-05intronicCPT1C
41.85E+ 08AG0.1470.0320.192.65E-05intronicTRAPPC11
728,449,965CT0.0330.1404.822.73E-05intronicCREB5
21,442,417TC0.1630.3552.822.80E-05intronicTPO
21,442,476CT0.1630.3552.822.80E-05intronicTPO
558,334,645GA0.0540.1994.323.43E-05intronicPDE4D
1072,307,101CT0.6030.3982.303.58E-05exonicPALD1
1435,062,166TC0.1140.2632.784.50E-05intronicSNX6
71.51E+ 08GC0.3150.5162.324.78E-05intronicNUB1
1814,796,080AG0.2340.3822.025.39E-05intronicANKRD30B
21,426,621AG0.3320.5322.296.31E-05intronicTPO
91.02E+ 08TC0.1960.0650.286.60E-05intronicGALNT12
1523,049,369AG0.3370.1770.428.36E-05intronicNIPA1
575,427,935AG0.2720.4522.219.09E-05exonicSV2C
1072,288,900GA0.3970.5912.200.000103intronicPALD1
1050,683,438CT0.2770.1180.350.000104intronicERCC6
670,970,299TC0.2230.0910.350.000106intronicCOL9A1
Fig. 1

Quantile–quantile (Q–Q) plots of the meta-analyses of genome-wide association studies (GWAS) results for SAH

Fig. 2

Manhattan plot depicting the GWAS results for SAH. Each dot represents a single-nucleotide polymorphism (SNP), with the chromosomal position on the x axis and the P-value on the y axis

SNPs with the strongest association with SAH from the GWAS results Quantile–quantile (Q–Q) plots of the meta-analyses of genome-wide association studies (GWAS) results for SAH Manhattan plot depicting the GWAS results for SAH. Each dot represents a single-nucleotide polymorphism (SNP), with the chromosomal position on the x axis and the P-value on the y axis TPO encodes a membrane-bound glycoprotein that plays a major role in thyroid gland function. Mutations in this gene are associated with several disorders of thyroid hormonogenesis, i.e., congenital hypothyroidism, and congenital goiter [31, 34]. As depicted in Fig. 2, another SNP within Phosphatase Domain Containing Paladin 1 (PALD1) also showed a significant signal. PALD1 is thought to be involved in the formation of vascular endothelium [36].

The role of rare variations in SAH risk

It is plausible that analysis of rare variants could explain additional disease risk or trait variability. We next investigated the rare variants across the cohorts by applying the frequency filtering (see Method section). The variants were defined as rare if their frequency in various databases were less than the corresponding threshold (Table S5). Each rare variant was assessed with a specific set of tools to predict their functional impact (see Method section). The 30,651 potential damaging rare variants remained for further analysis. We employed two gene-based methods to identify the susceptibility genes in SAHs. Rare variants burden testing was performed between SAH cases and control samples by Fisher’s exact test, and 38 susceptibility genes, such as gene OBSCN, TJP1, ADGRV1, and FBN3 etc., were obtained (Table 2). However, when variants with opposite directions of effect in the same gene, the testing power will be reduced. We then employed another analysis with SKAT-O [25] to identify the signals, which both allowed for variants with opposite directions of effect to reside in the same gene. The SKAT-O identified 37 signals (Table 3), which were highly overlap with the results from the burden test (92.3%, Fig. 3), which suggested that these genes could be directly involved in ALS risk.
Table 2

Candidate genes in SAH identified by burden test of rare variants

GeneNumber of Cases with mutationsNumber of Controls with mutationsNumber of Cases without mutationsNumber of Controls without mutationsP valueOR
OBSCN261168820.0052.834
ABCG87087930.007Inf
PIGG7087930.007Inf
GOLGA26088930.015Inf
MTMR46088930.015Inf
MYH16088930.015Inf
OTOGL6088930.015Inf
TCF36088930.015Inf
TJP110284910.0175.374
KMT2C8186920.0188.482
ADGRV116678870.0212.958
FBN39285910.0304.782
ABCG55089930.030Inf
BCL95089930.030Inf
C1orf945089930.030Inf
CEBPZ5089930.030Inf
COG35089930.030Inf
CTC15089930.030Inf
FDXR5089930.030Inf
GRIK35089930.030Inf
INCENP5089930.030Inf
IQGAP35089930.030Inf
KNDC15089930.030Inf
LETMD15089930.030Inf
METTL225089930.030Inf
NCOA65089930.030Inf
PDZD75089930.030Inf
PIF15089930.030Inf
PLXNA45089930.030Inf
RAPGEFL15089930.030Inf
RGS145089930.030Inf
SCN7A5089930.030Inf
THEG5089930.030Inf
VEPH15089930.030Inf
ZFP905089930.030Inf
LENG87187920.0337.339
NIPBL7187920.0337.339
TECPR27187920.0337.339
Table 3

Candidate genes in SAH identified by SKAT-O analysis

GeneP valueNumber of Marker AllNumber of Marker TestMACmMethod binMAP
OBSCN0.00537364037ER.A−1.000
ABCG80.0108777ER0.003
PIGG0.0109777ER0.003
GOLGA20.0216666ER0.007
MTMR40.0217776ER0.007
MYH10.0216666ER0.007
TCF30.0216666ER0.007
KMT2C0.0269999ER0.001
FBN30.03913121211ER0.000
ABCG50.0445555ER0.014
BCL90.0445555ER0.014
C1orf940.0445555ER0.014
CCDC102A0.0447555ER0.014
CEBPZ0.0445555ER0.014
COG30.0444455ER0.014
CTC10.0445555ER0.014
FDXR0.0445555ER0.014
GRIK30.0445555ER0.014
INCENP0.0443355ER0.014
IQGAP30.0446555ER0.014
KNDC10.0445555ER0.014
LETMD10.0444455ER0.014
METTL220.0444455ER0.014
NCOA60.0445555ER0.014
OTOGL0.0445455ER0.014
PDZD70.0443355ER0.014
PIF10.0446555ER0.014
PLXNA40.0444455ER0.014
RAPGEFL10.0443355ER0.014
RGS140.0444455ER0.014
SCN7A0.0445555ER0.014
THEG0.0445555ER0.014
VEPH10.0445555ER0.014
ZFP900.0444455ER0.014
LENG80.0508888ER0.002
NIPBL0.0508888ER0.002
TECPR20.0509888ER0.002
Fig. 3

The number of overlapped genes between burden test and SKAT-O analysis

Candidate genes in SAH identified by burden test of rare variants Candidate genes in SAH identified by SKAT-O analysis The number of overlapped genes between burden test and SKAT-O analysis The overlapped genes were further subjected to functional enrichment analysis. These genes were overrepresented in some pathways related to cellular organization, i.e., adherens junction, and degradation of the extracellular matrix; and transcription factor signal, i.e., TGF-beta signal pathway (Table 4).
Table 4

The enriched pathways for the overlapped genes between burden test and SKAT-O analysis

PathwaysGenes
Adherens_junctionTJP1 PTPRM
AngiogenesisFGFR2 UNC5B NOTCH3 FGFR4 FLT4
Apelin_signaling_pathwayNOTCH3 ADCY8
Cell_cycle_Role_of_SCF_complex_in_cell_cycle_regulationFZR1 MAPK8
Degradation_of_the_extracellular_matrixFBN3 ADAMTS18 NCAM1 NTN4
LAMB2 LAMB1 CAPN1 COL20A1 FBLN2
COL14A1 ITGA2 NCAN P4HA2
Development_TGF-beta_receptor_signalingFZR1
Elastic_fibre_formationFBN3 FBLN2
Endochondral_OssificationRUNX2
ERK_signalingMYH1 TCF3 FBN3 ECM2 FGFR2 PRKCQ RPS6KA1 FLT4
TCF19 LAMB2 LAMB1 CAPN1 COL20A1 RASGRP1
NOTCH3 ADCY8 ARHGEF16 CDH12 CDH19 COL14A1 FGFR4
IL12RB1 ITGA2 MAPK8 NCAN NTRK3 PLCD4 ARHGEF2
HTLV-I_infectionTCF3 POLE ADCY8 CRTC3 HLA-DPA1 MAPK8
Integrin_PathwayMYH1 FBN3 ECM2 PRKCQ LAMB2 LAMB1 CAPN1
COL20A1 ADCY8 CD36 COL14A1 ITGA2 MAPK8 NCAN
PAK_PathwayMYH1 TCF3 TJP1 FGFR2 NOX4 PRKCQ PTPRH
TCF19 NOTCH3 FGFR4 FLT4 IL12RB1
MAPK8 NTRK3 PLCD4 PTPN3 PTPRM GPLD1
Sertoli-Sertoli_Cell_Junction_DynamicsMYH1 TJP1 SAFB ITGA2 MAPK8 RAB17 RAB34 ARHGEF2
SMAD_Signaling_NetworkPSMB8 PSMD5
Smooth_Muscle_ContractionNULL
TGF-beta_receptor_signaling_activates_SMADsMTMR4
TGF-beta_receptor_signalingZFYVE16
TGF-beta_signaling_pathway_KEGGID4 INHBA SMAD6 ZFYVE16
TGF-beta_Signaling_PathwaysMAPK8 RUNX2
The enriched pathways for the overlapped genes between burden test and SKAT-O analysis

Pedigree analysis

We performed Mendelian inheritance analysis for two SAH pedigrees with probable inheritance patterns, including (1) dominant inheritance pattern; (2) recessive inheritance pattern; (3) semi-dominant inheritance pattern; (4) compound heterozygote inheritance pattern. Pathogenicity of missense mutations was assumed if predicted pathogenic by at least five out of seven computational methods (SIFT, PolyPhen2, LRT, MutatationTaster, M-CAP, CADD, and REVEL). The potential disease causing variants were only performed in dominant inheritance pattern, and there were 35 and 15 SNPs identified in these two pedigrees, respectively (Table 5 and 6). Twelve and seven candidate genes were identified in pedigree 1 and 2, respectively (Table 7). The gene COL1A2, a pathogenic gene in pedigree 2, was also reported to be associated with SAH phenotype [15].
Table 5

The potential disease causing SNPs in dominant inheritance pattern for pedigree 1

ChrPosFunctionGeneSIFTPp2LRTMTM-CAPCADDREVEL
12.18E+ 08exonicGPATCH20.001.00DD0.2334.00.65
12.24E+ 08exonicCCDC1850.041.00.D0.0219.60.35
12.25E+ 08exonicDNAH140.010.56UD0.0423.00.09
254,609,069intergenicC2orf73 SPTBN10.00..N0.000.20.03
255,795,456exonicPPP4R3B.0.39DD0.0323.60.72
21.79E+ 08exonicTTN0.230.02.N0.0514.10.12
21.8E+ 08exonicTTN0.160.80.D0.0319.00.44
22.2E+ 08exonicCFAP650.000.98ND0.0134.00.19
22.2E+ 08exonicSTK11IP..NA.39.0.
22.23E+ 08exonicPAX30.020.22DD0.0721.40.48
349,169,107exonicLAMB20.030.15ND0.0222.20.11
31.83E+ 08intronicATP11B0.03..N0.015.80.02
31.94E+ 08exonicATP13A30.320.15NN0.036.80.25
4871,443exonicGAK0.060.42DD0.0726.30.53
46,873,370exonicKIAA02320.040.20DD0.0125.10.23
474,276,089exonicALB0.031.00NN0.0522.40.02
772,397,374exonicPOM1210.070.51NN0.0123.20.03
787,179,859exonicABCB10.180.01DD0.0514.00.26
71.17E+ 08exonicCTTNBP20.010.99DD0.0427.40.36
71.29E+ 08exonicIRF50.210.00NN0.0315.10.25
71.3E+ 08exonicCPA40.000.89DD0.3025.10.31
91.31E+ 08exonicODF20.000.99DD0.0228.30.31
126,458,130exonicSCNN1A0.15..D0.1013.70.09
1495,562,384exonicDICER10.180.00NN0.050.50.02
1689,865,550intronicFANCA0.00..N0.013.8.
177,231,013exonicNEURL40.000.01DD0.0422.70.26
177,483,148exonicCD680.000.19ND0.0222.50.19
177,691,426exonicDNAH20.080.43ND0.0122.50.06
1773,564,902exonicLLGL20.010.71DD0.0427.40.53
1774,085,300exonicEXOC70.220.02DD0.0017.80.08
2055,777,539exonicBMP70.010.74DD0.0829.00.22
2146,929,308exonicCOL18A10.140.16NN0.1010.50.12
2230,074,259exonicNF20.590.02DD0.0818.40.43
2250,721,594exonicPLXNB20.510.00NN0.0514.30.27
2250,945,311exonicLMF20.000.99DD0.6728.00.41
Table 6

The potential disease causing SNPs in dominant inheritance pattern for pedigree 2

ChrPosFunctionGeneSIFTPp2LRTMTM-CAPCADDREVEL
11.5E+ 08exonicHIST2H2AC0.000.10ND0.0248.00.15
11.52E+ 08exonicRPTN0.010.02.N0.00316.30.01
11.57E+ 08exonicIQGAP30.001.00DD0.15435.00.84
11.62E+ 08exonicDUSP120.101.00DD0.00923.40.16
656,471,328intronicDST0.010.17N.0.03312.00.10
720,782,555exonicABCB50.000.99DD0.03927.80.70
729,132,261exonicCPVL0.040.99ND0.14226.60.47
794,057,039exonicCOL1A20.100.98DN0.08226.20.44
71.01E+ 08exonicMUC170.020.61.N0.0035.60.04
71.51E+ 08exonicCHPF20.030.89DD0.04923.30.32
890,936,937exonicOSGIN20.380.03DD0.0111.30.07
173,030,476exonicOR1G10.050.01.N013.60.03
174,619,845exonicARRB20.040.90DD0.03432.00.15
176,683,525exonicFBXO390.140.09DD0.02419.80.17
177,733,695exonicDNAH20.010.89NN0.00423.80.11
Table 7

The candidate genes in two pedigrees

Candidate genes in pedigree 1Candidate genes in pedigree 2
GPATCH2IQGAP3
CFAP65DUSP12
PAX3ABCB5
GAKCPVL
KIAA0232COL1A2
CTTNBP2CHPF2
CPA4ARRB2
ODF2
NEURL4
LLGL2
BMP7
LMF2
The potential disease causing SNPs in dominant inheritance pattern for pedigree 1 The potential disease causing SNPs in dominant inheritance pattern for pedigree 2 The candidate genes in two pedigrees

Putative pathogenic genes for SAHs

Protein–protein interactions were known as mediating many cellular functions, including cell cycle progression, signal transduction, and metabolic pathways. The genes that interacted with the known SAH genes may influence the SAH phenotypes by participating in the same network/pathway. Basic on previous studies, we collected 28 SAH associated genes (Table S6), and these genes were further assessed the direct and indirect associations with other genes by STRING [35]. In total, we identified 47 putative interacted genes with the SAH (Table S7). To look deep into the pathogenic genes associated with SAH, we selected the overlapped genes among the results from the burden test, SKAT-O analysis and putative interacted genes (Table 8). Finally, we identified 25 putative pathogenic genes for SAH.
Table 8

The overlapped genes among the results from burden test, SKAT-O analysis and PPI analysis

Candidate pathogenic genes
MYH1CD36
TJP1COL14A1
FGFR2FGFR4
NCAM1FLT4
NOX4FZR1
RPS6KA1ID4
LAMB2IL12RB1
LAMB1ITGA2
COL20A1MAPK8
FBLN2NCAN
POLEP4HA2
NOTCH3RUNX2
ADCY8
The overlapped genes among the results from burden test, SKAT-O analysis and PPI analysis Among these genes, FBLN2 has been identified a member of fibulin family, and is responsible for maintenance of the adult vessel wall after injury [8]. BMP7 was reported to play an important role in facilitating recovery after stroke in rat [9]. ITGA2 is responsible for adhesion of platelets and other cells to collagens and organizations of extracellular matrix. Previous study demonstrated ITGA2-deficient mice overexpressed transforming the growth factor TGFβ [16, 19], which was known to be highly associated with aortic aneurysm and IA. Moreover, both ITGA2 and TTN were involved in hemostasis [3, 30]. Notably, Notch signaling plays a pivotal role during vascular development [4, 18]. Mutations in NOTCH3 have been identified as the underlying cause of cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL), the most common inherited stroke and dementia syndrome in the group of degenerative small vessel diseases [23]. Our findings demonstrated the therapeutic potential of modifying these signaling in SAHs.

Discussion

Subarachnoid hemorrhage (SAH) is the rarest but most fatal type of stroke, identification of genetic variants that confer susceptibility to SAH is clinically important to prevent it [20]. In the present study, we performed WES for SAH cases and controls, to identify causal variations that associated with SAH risk in China, which enabled us to systemically evaluate protein-altering variants and candidate functional genes. Across GWAS with a total of 188 samples, we found a genome-wide significant association of SNPs in TPO and PALD1 with SAH risk. These two genes are involved in disorders of thyroid hormonogenesis and formation of vascular endothelium, respectively. Previous studies of IA have identified SERPINA3 (rs4934) as associated risk loci in the Finnish population, and CSPG2 (rs251124) and HSPG2 (rs3767137) loci as susceptibility sites in the Dutch population. However, in our cohorts, there was no significantly associated signal in these genes, which may due to the different genetic background among the populations. We then investigated the role of low-frequency variants of intermediate effect in SAH risk through rare SNPs analysis. The pathogenic genes with rare, damaging SNPs were enriched in some pathways related to cellular organization, i.e., degradation of the extracellular matrix; and transcription factor signal, i.e., TGF-beta signaling pathway. TGF-beta signaling plays a vital role in vasculogenesis and maintenance of blood vessel, and is involved in aortic aneurysm and IA. These results highlight the functional importance of rare variations in SAH risk. The two pedigree samples were mainly used to performed Mendelian inheritance analysis, and it revealed autosomal dominant inheritance of pathogenic genes. In the same time, some potential disease causing variants were also found, such as the gene COL1A2 which was reported to be associated with SAH phenotype [15]. Combing the results from the network analysis of known SAH-associated genes, we obtained a list of candidate susceptibility genes. Among these genes, several were demonstrated to be associated with maintenance of blood vessel, including FBLN2, ITGA2, BMP7, and NOTCH3. NOTCH3 is known to be associated with the most common inherited stroke, CADASIL. These potential targets needed to be further validated in experiment models both in vivo and in vitro, which may facilitate to develop clinical strategies for early detection and intervention. In conclusion, we have identified a key role for rare variations in SAH and discovered SNPs in new complex loci. However, there are still some limitations to our current study due to the small sample size and availability of family genetic data. In future, the identified candidate genes, i.e., TPO, PALD1 and ITGA2, will be necessary to validate in independent study populations or a larger sample size for Chinese population. Determination of genotypes for SNPs in these genes will guide the development of therapeutic strategies for SAH. Additional file 1: Table S1. Sample background information. Table S2. Basic information of sequencing. Table S3. reads mapping statistics. Table S4. Variation distribution statistics. Table S5. Variation filtering threshold. Table S6. Known genes in SAH. Table S7. Interacted genes with known genes in SAH
  40 in total

1.  Study of 18 functional hemostatic polymorphisms in mucocutaneous bleeding disorders.

Authors:  Ana I Antón; Rocio González-Conejero; Vanessa Roldán; Teresa Quiroga; Beatriz Sánchez-Vega; Javier Corral; Vicente Vicente; Diego Mezzano
Journal:  Ann Hematol       Date:  2010-06-09       Impact factor: 3.673

2.  Notch3 mutations in CADASIL, a hereditary adult-onset condition causing stroke and dementia.

Authors:  A Joutel; C Corpechot; A Ducros; K Vahedi; H Chabriat; P Mouton; S Alamowitch; V Domenga; M Cécillion; E Marechal; J Maciazek; C Vayssiere; C Cruaud; E A Cabanis; M M Ruchoux; J Weissenbach; J F Bach; M G Bousser; E Tournier-Lasserve
Journal:  Nature       Date:  1996-10-24       Impact factor: 49.962

3.  Health outcomes 1 year after subarachnoid hemorrhage: An international population-based study. The Australian Cooperative Research on Subarachnoid Hemorrhage Study Group.

Authors:  M L Hackett; C S Anderson
Journal:  Neurology       Date:  2000-09-12       Impact factor: 9.910

4.  Neuroregenerative effects of BMP7 after stroke in rats.

Authors:  Jenny Chou; Brandon K Harvey; Chen-Fu Chang; Hui Shen; Marisela Morales; Yun Wang
Journal:  J Neurol Sci       Date:  2005-10-19       Impact factor: 3.181

Review 5.  Incidence of subarachnoid haemorrhage: a systematic review with emphasis on region, age, gender and time trends.

Authors:  N K de Rooij; F H H Linn; J A van der Plas; A Algra; G J E Rinkel
Journal:  J Neurol Neurosurg Psychiatry       Date:  2007-04-30       Impact factor: 10.154

6.  Five novel inactivating mutations in the thyroid peroxidase gene responsible for congenital goiter and iodide organification defect.

Authors:  Carina M Rivolta; Sebastián A Esperante; Laura Gruñeiro-Papendieck; Ana Chiesa; Christian M Moya; Sabina Domené; Viviana Varela; Héctor M Targovnik
Journal:  Hum Mutat       Date:  2003-09       Impact factor: 4.878

7.  A study of inbreeding and kinship in intracranial aneurysms in the Saguenay Lac-Saint-Jean region (Quebec, Canada).

Authors:  M De Braekeleer; L Pérusse; L Cantin; J M Bouchard; J Mathieu
Journal:  Ann Hum Genet       Date:  1996-03       Impact factor: 1.670

8.  Subarachnoid hemorrhage and family history. A population-based case-control study.

Authors:  P S Wang; W T Longstreth; T D Koepsell
Journal:  Arch Neurol       Date:  1995-02

9.  STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.

Authors:  Damian Szklarczyk; Annika L Gable; David Lyon; Alexander Junge; Stefan Wyder; Jaime Huerta-Cepas; Milan Simonovic; Nadezhda T Doncheva; John H Morris; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.