
Exome-Wide Meta-Analysis Identifies Rare 3'-UTR Variant in ERCC1/CD3EAP Associated with Symptoms of Sleep Apnea.

Ashley van der Spek1, Annemarie I Luik2, Desana Kocevska3, Chunyu Liu4,5,6, Rutger W W Brouwer7, Jeroen G J van Rooij8,9,10, Mirjam C G N van den Hout7, Robert Kraaij1,8,9, Albert Hofman1,11, André G Uitterlinden1,8,9, Wilfred F J van IJcken7, Daniel J Gottlieb12,13,14, Henning Tiemeier1,15, Cornelia M van Duijn1, Najaf Amin1.   

Abstract

Obstructive sleep apnea (OSA) is a common sleep breathing disorder associated with an increased risk of cardiovascular and cerebrovascular diseases and mortality. Although OSA is fairly heritable (~40%), only a few studies have examined its genetics. In the present study, we aimed to identify genetic variants associated with symptoms of sleep apnea by performing a whole-exome sequence meta-analysis in 1,475 individuals of European descent. We identified 17 rare genetic variants with at least suggestive evidence of significance. Replication in an independent dataset confirmed the association of a rare genetic variant (rs2229918; minor allele frequency = 0.3%) with symptoms of sleep apnea (meta-analysis p-value = 6.98 × 10−9, meta-analysis β = 0.99). Rs2229918 overlaps with the 3' untranslated regions of the ERCC1 and CD3EAP genes on chromosome 19q13. Both genes are expressed in tissues in the neck area, such as the tongue, muscles, cartilage and the trachea. Further, CD3EAP is localized in the nucleus and mitochondria and is involved in the tumor necrosis factor-alpha/nuclear factor kappa B signaling pathway. Our results and the biological functions of the CD3EAP/ERCC1 genes suggest that the 19q13 locus is interesting for further OSA research.


Keywords:  CD3EAP; ERCC1; exome; genetics; sequence analysis; sleep; sleep apnea syndromes

Year:  2017        PMID: 29093733      PMCID: PMC5651235          DOI: 10.3389/fgene.2017.00151

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.772


Introduction

Sleep is a complex and essential biological process that has been conserved across diverse animal species throughout evolution (Rechtschaffen, 1998). Although normal healthy sleep highly varies within and between adults (Van Dongen et al., 2005; Knutson et al., 2007; Mezick et al., 2009), it has to consist of adequate duration, good quality, proper timing and regularity, and the absence of sleep disturbances or disorders (Consensus Conference Panel et al., 2015). Several large epidemiological studies have shown that short or disturbed sleep is associated with various cognitive (Pilcher and Huffcutt, 1996; Yaffe et al., 2014), psychiatric (Lovato and Gradisar, 2014; Peters van Neijenhof et al., 2016; Cosgrave et al., in press) and health consequences e.g., diabetes mellitus (Gottlieb et al., 2005; Yaggi et al., 2006), activation of pro-inflammatory pathways (Patel et al., 2009), and cardiovascular diseases (Hoevenaar-Blom et al., 2011). One of the most common causes of short and disturbed sleep is sleep apnea. Sleep apnea is a highly prevalent (Peppard et al., 2013) sleep breathing disorder, with obstructive sleep apnea (OSA) as the most common type (Mehra et al., 2007). OSA affects up to 38% of the general adult population (Senaratna et al., 2016) and untreated OSA has been associated with severe health problems (Young et al., 2002a) such as hypertension (Peppard et al., 2000; Pedrosa et al., 2011), cardiovascular disease (Shamsuzzaman et al., 2003; Marin et al., 2005; Gottlieb et al., 2010), stroke (Yaggi et al., 2005), type 2 diabetes (Shaw et al., 2008; Aurora and Punjabi, 2013; Kendzerska et al., 2014), impaired cognitive function (Kim et al., 1997; Yaffe et al., 2011), depression (Peppard et al., 2006), and increased mortality (Marshall et al., 2008; Young et al., 2008; Punjabi et al., 2009). The main characteristic of OSA is the partial or complete obstruction of the upper airways during sleep, causing oxyhemoglobin desaturations and arousals from sleep. 
This leads to sleep fragmentation and decreased periods of slow wave and REM sleep (McNicholas, 2008; American Academy of Sleep Medicine, 2014). Consequently, the two most common signs and symptoms of OSA are snoring and excessive daytime sleepiness (Gottlieb et al., 1999) where the latter can result in personal and occupational problems, and an increased risk of traffic and work-related accidents (Young et al., 2002a; McNicholas, 2008; American Academy of Sleep Medicine, 2014). OSA is a complex trait influenced by both environment and genetics (Redline et al., 1995; Redline and Tishler, 2000) with obesity, age, and sex as most important risk factors (Redline et al., 1994; Bixler et al., 2001; Young et al., 2002a,b, 2004; Peppard et al., 2013). About 40% of the variance in apneic activity can be explained by genetic factors (Redline et al., 1995). At least half of the genetic contribution to sleep apnea acts through mechanisms independent of obesity (Patel et al., 2008). Previous genetic studies have focused on several candidate genes for breathing disorders, where the most studied genes are the angiotensin-converting enzyme gene (ACE) (Lin et al., 2004; Bostrom et al., 2007; Patel et al., 2007); apolipoprotein, allele E4 (APOE ϵ4) (Kadotani et al., 2001; Gottlieb et al., 2004); serotonin receptors and transporters genes (5-HT2A, 5-HT2C, 5-HTT) (Sakai et al., 2005; Ylmaz et al., 2005; Bayazit et al., 2006; Larkin et al., 2010; Qin et al., 2014); adrenergic receptors (ADRB2/3) (Mills et al., 1995; Grote et al., 2000); and tumor necrosis factor (TNF) (Riha et al., 2005; Popko et al., 2008; Bhushan et al., 2009). However, the results of these studies have been inconsistent or have yet to be confirmed (Sleiman and Hakonarson, 2011). Using linkage analysis, a method to identify the chromosomal location of the disease influencing genes, two regions on chromosome 2p16 and 19q13 were found to be suggestively linked with OSA independent of obesity (Palmer et al., 2003). 
Genome wide association studies (GWASs) could provide more information on common variants involved in the pathogenesis of OSA. Until now only a few GWASs have been reported for OSA. Loci in GPR83 and C6ORF183/CCDC162P were found to be significantly associated with OSA (Cade et al., 2016), and a locus in the neuregulin-1 (NRG1) gene was suggestively implicated (Baik et al., 2015). Two other studies used customized or targeted genotyping arrays and identified loci in PPARGC1B (Kripke et al., 2015), PTGER3 (Patel et al., 2012), PLEK (Patel et al., 2012), and LPAR1 (Patel et al., 2012) to be associated with OSA. However, most of these findings were not replicated. Consequently, the genetic architecture of OSA remains largely unexplored. In the present study we aimed to identify genetic variants associated with symptoms of sleep apnea, assessed using the Pittsburgh Sleep Quality Index (PSQI). We performed a GWAS using whole-exome sequence (WES) data of 1,475 individuals from two Dutch studies. Subsequently, we replicated our findings in an independent sample.

Materials and methods

Study populations

Discovery cohorts

The discovery sample consists of participants from two cohorts including the Erasmus Rucphen Family (ERF) study and the Rotterdam Study (RS) from The Netherlands. ERF is a family-based study that includes inhabitants of a genetically isolated community in the Southwest of the Netherlands, ascertained as part of the Genetic Research in Isolated Population program. The ERF cohort includes ~3,000 living descendants of 22 founder couples, who had at least six children baptized in the community church. Individuals who were 18 years or older were invited to participate in the study. Data was collected between 2002 and 2005 (Pardo et al., 2005). The study was approved by the Medical Ethics Committee of the Erasmus Medical Center (EMC), Rotterdam, The Netherlands. All participants provided written informed consent and all investigations were carried out in accordance with the Declaration of Helsinki. RS is a prospective cohort study ongoing since 1990, which aims to investigate determinants of disease occurrence and progression in the elderly (Hofman et al., 2015). Initially, the RS included 7,983 individuals of 55 years of age or over, living in the well-defined Ommoord district in Rotterdam, The Netherlands. All participants were examined at baseline by an at home interview and an extensive set of examinations in the research facility in Ommoord. The RS was approved by the Medical Ethics Committee of the EMC and by the Ministry of Health, Welfare and Sport of the Netherlands. All participants provided written informed consent to participate in the study. All investigations were carried out in accordance with the Declaration of Helsinki. Study participants from ERF and RS were assessed for sleep phenotypes using a self-administered questionnaire including questions from the PSQI (Buysse et al., 1989). The PSQI has been specifically designed to measure sleep quality and sleep disturbances over a 1-month time interval. 
Symptoms of sleep apnea were assessed by asking the participants “How often did you or your partner notice long pauses between breaths while asleep?” Answers were provided on a categorical scale ranging from 1 to 4 (1. not during the past month; 2. less than once per week; 3. once or twice per week; 4. more than twice per week). Symptoms of sleep apnea were assessed in 1,366 ERF participants and 2,660 RS participants, where for the latter data of the fourth visit was used as it had the largest participation.

Replication cohort

The replication sample included participants from the offspring cohort of the population-based prospective Framingham Heart Study (FHS) (Dawber et al., 1951). The offspring cohort was recruited between 1971 and 1975, including 5,124 offspring of the original FHS cohort and their spouses (Kannel et al., 1979). The study was approved by the Institutional Review Board for Human Research of the Boston University Medical Center, Boston, MA, USA. Each participant provided written informed consent. FHS has collected sleep data using the Sleep Heart Health Study sleep habits questionnaire (Quan et al., 1997). A symptoms of sleep apnea score was constructed by combining the following questions: “A. Are there times when you stop breathing during your sleep?” with answers “yes” (A1), “no” (A2), “I don't know” and “B. If yes to question A: How often do you have times when you stop breathing during your sleep?”. Answers to question B were provided on a categorical scale ranging from 1 to 5 (1. Rarely, less than one night per week; 2. Sometimes, one or two nights per week; 3. Frequently, three to five nights per week; 4. Always or almost always, six or seven nights per week; 5. I don't know). Individuals who answered “I don't know” were excluded, since this option is not available in the PSQI. The constructed symptoms of sleep apnea score had answers ranging from 1 to 4, matching the PSQI: 1. not during the past month (A2); 2. less than once per week (A1 and B1); 3. once or twice per week (A1 and B2); 4. more than twice per week (A1 and B3 or A1 and B4).
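The harmonization of the FHS answers onto the four-level PSQI scale can be sketched as a small lookup. This is only an illustration of the mapping described above; the function name and the string labels for question A ("yes", "no", "dont_know") are hypothetical and not taken from the FHS data dictionary.

```python
from typing import Optional


def fhs_to_psqi(answer_a: str, answer_b: Optional[int] = None) -> Optional[int]:
    """Map FHS sleep-habits answers onto the 1-4 PSQI-style score.

    answer_a: answer to question A; the labels "yes", "no", "dont_know"
              are hypothetical encodings chosen for this sketch.
    answer_b: frequency code 1-5 for question B, used only when A is "yes".
    Returns the harmonized score, or None if the participant is excluded.
    """
    if answer_a == "no":
        return 1  # A2: not during the past month
    if answer_a != "yes":
        return None  # "I don't know" on question A: excluded
    # A1 combined with B1..B4; B3 and B4 both collapse into category 4.
    b_to_score = {1: 2, 2: 3, 3: 4, 4: 4}
    return b_to_score.get(answer_b)  # B5 ("I don't know") or missing -> None
```

Collapsing B3 and B4 into one category is what keeps the FHS score on the same 1 to 4 range as the PSQI item used in ERF and RS.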

Sequencing and quality control

In ERF, genomic DNA was extracted from peripheral venous blood utilizing the salting out method (Miller et al., 1988). Exomes of 1,336 ERF participants were sequenced at the Erasmus Center for Biomics of the Cell Biology department of the EMC, The Netherlands, using the Agilent V4 capture kit on an Illumina HiSeq2000 sequencing machine with the TruSeq Version 3 protocol (Amin et al., 2016b). The sequence reads were aligned to the human genome build 19 (hg19) using Burrows Wheeler Aligner (BWA) (Li and Durbin, 2009) and the NARWHAL pipeline (Brouwer et al., 2012). Aligned reads were further processed using IndelRealigner, MarkDuplicates and TableRecalibration tools from the Genome Analysis Toolkit (GATK) (Mckenna et al., 2010), and Picard (http://broadinstitute.github.io/picard/). Genetic variants were called using the GATK UnifiedGenotyper tool. Individuals with low concordance to the genotyping array or with a low call rate were removed, as were low quality variants (Phred quality score <30, call rate <90%) and variants out of Hardy-Weinberg equilibrium (HWE) (p < 10−6). The final dataset for ERF included 528,617 single nucleotide variants (SNVs) in 1,308 individuals (Amin et al., 2016b) of whom 654 individuals also had phenotype data on symptoms of sleep apnea available. Exomes of 2,628 individuals from the RS population were sequenced at the Human Genotyping facility of the Internal Medicine department at the EMC, the Netherlands, to an average depth of 54x using the Nimblegen SeqCap EZ V2 capture kit on an Illumina HiSeq2000 sequencer using the TruSeq Version 3 protocol (Amin et al., 2016b). The sequenced reads were aligned to hg19 using BWA (Li and Durbin, 2009). Subsequently, the aligned reads were processed further using Picard's MarkDuplicates, SAMtools (Li et al., 2009), and GATK (Mckenna et al., 2010). Genetic variants were called using the HaplotypeCaller from GATK. 
Samples with low concordance to genotyping array (<95%), low transition to transversion ratio (<2.3) and high heterozygote to homozygote ratio (>2.0) were removed and additionally SNVs with a low call rate (<90%) and out of HWE (p < 10−8) were also removed from the data. The final dataset included 600,806 SNVs in 2,356 individuals (Amin et al., 2016a) of whom 821 individuals also had phenotype data on symptoms of sleep apnea available. For both ERF and RS, file handling and formatting was done using VCFtools (Danecek et al., 2011) and PLINK (Purcell et al., 2007) (http://pngu.mgh.harvard.edu/purcell/plink/). Annotation of the variants was performed using SeattleSeq Annotation 138 (http://snp.gs.washington.edu/SeattleSeqAnnotation138/). In FHS exomes of 1,271 participants were sequenced using Illumina HiSeq2000 and 2500 platforms. DNA samples were constructed into Illumina paired-end pre-capture libraries according to the manufacturer's protocol. For exome capture, two, four or six pre-capture libraries were pooled together and hybridized to the HGSC VCRome 2.1 design (Bainbridge et al., 2011) (42 Mb, NimbleGen). After sequencing the HGSC Mercury analysis pipeline (https://www.hgsc.bcm.edu/content/mercury) and Illumina CASAVA software were used to perform sequencing analysis and to de-multiplex the pooled samples. Sequenced reads were aligned to Genome Reference Consortium Human Build 37 (GRCh37) using BWA (Li and Durbin, 2009) producing BAM files (Li et al., 2009). The aligned reads were recalibrated using GATK (Depristo et al., 2011) together with BAM sorting, duplicate read marking, and realignment near insertions or deletions. SNVs, insertions and deletions were called using Atlas2 (Challis et al., 2012). SNVs were excluded with low SNV posterior probability (<0.95), low variant read count (<3), variant read ratio <0.25 or >0.75, strand-bias of more than 99% variant reads in a single strand direction, or total coverage <10. 
Reference calls with <10 × coverage were also set to missing. Variants were excluded outside exon capture regions (VCRome 2.1), multi-allelic sites, monomorphic sites, missing rate >20%, mappability score <0.8, mean depth of coverage >500, or not fulfilling HWE (p < 5 × 10−6). Samples were excluded with missingness >20%, less than 6 SD from mean depth, more than 6 SD for singleton count, or outside of 6 SD for heterozygous to homozygous ratio or transition to transversion ratio. Variants were annotated using ANNOVAR (Wang et al., 2010) and dbNSFP v2.0 (https://sites.google.com/site/jpopgen/dbNSFP) according to the GRCh37 reference genome and National Center for Biotechnology Information RefSeq. The final dataset included 1,749,755 SNVs in 1,271 individuals of whom 472 individuals also had phenotype data on symptoms of sleep apnea available.
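As a rough illustration of the FHS call-level SNV exclusion criteria listed above (posterior probability, variant read count, variant read ratio, strand bias, total coverage), the thresholds can be applied to a single call as follows. The `SnvCall` record and its field names are hypothetical; the sketch applies the stated cut-offs literally and does not reproduce the Atlas2 or Mercury pipeline.

```python
from dataclasses import dataclass


@dataclass
class SnvCall:
    """Hypothetical per-call summary; field names are assumptions."""
    posterior: float            # SNV posterior probability
    variant_reads: int          # reads supporting the variant allele
    total_reads: int            # total coverage at the site
    forward_variant_reads: int  # variant reads on the forward strand


def passes_fhs_snv_filters(call: SnvCall) -> bool:
    """Return True if the call survives the thresholds quoted in the text."""
    if call.total_reads < 10:       # total coverage <10
        return False
    if call.variant_reads < 3:      # low variant read count
        return False
    if call.posterior < 0.95:       # low SNV posterior probability
        return False
    ratio = call.variant_reads / call.total_reads
    if ratio < 0.25 or ratio > 0.75:  # variant read ratio out of range
        return False
    forward = call.forward_variant_reads / call.variant_reads
    # Strand bias: more than 99% of variant reads in a single direction.
    if max(forward, 1.0 - forward) > 0.99:
        return False
    return True
```

Checking coverage and variant read count first means the two divisions that follow can never hit a zero denominator.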

Statistical analyses

Descriptive analysis was performed using IBM SPSS Statistics version 21 (IBM Corp. Released 2012. IBM SPSS Statistics for Windows, Version 21.0. Armonk, NY: IBM Corp.). Study specific exome analyses and meta-analysis of the individual study data were performed using the seqMeta v1.5 library of the R software (http://cran.r-project.org/web/packages/seqMeta/). Single variant association analysis was performed by assuming an additive effect. In ERF and FHS a linear mixed effects model was used, adjusting for familial relationships by including the kinship matrix. To account for population stratification in the RS, we tested the association of ten principal components with the phenotype. None of them was significantly associated with symptoms of sleep apnea, so we did not include them in the analysis. The regression analysis was performed using the four categories of the symptoms of sleep apnea score as a continuous trait, adjusting for the three main risk factors for OSA: age, sex and body mass index (BMI) (kg/m2). Meta-analysis was performed using a fixed effects model. Variants that were present in both discovery cohorts (ERF and RS, 115,526 variants) were tested for association, giving a Bonferroni corrected p-value threshold of 4.3 × 10−7. All variants that showed significant or suggestive (p < 1.0 × 10−6) association signals in the discovery samples were tested for replication in FHS. Bonferroni correction was also applied to correct for multiple testing in the replication stage.
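The discovery threshold follows directly from dividing a family-wise error rate by the number of tested variants. The short sketch below assumes a family-wise alpha of 0.05, which the text does not state but which reproduces the reported 4.3 × 10−7. The replication-stage denominator of seven (the number of variants reported as testable in FHS) is likewise an assumption, since the text does not give the replication threshold explicitly.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Discovery stage: 115,526 variants shared by ERF and RS (alpha assumed 0.05).
discovery = bonferroni_threshold(0.05, 115_526)
print(f"{discovery:.2e}")  # 4.33e-07

# Replication stage: assuming correction over the 7 variants testable in FHS.
replication = bonferroni_threshold(0.05, 7)
print(f"{replication:.2e}")  # 7.14e-03
```

Under these assumptions, the replication p-value reported for rs2229918 (1.84 × 10−3) falls below the replication threshold.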

Results

Descriptive statistics of the study populations are presented in Table 1. The mean age in RS was 75 years (mean BMI = 27.4 kg/m2), whereas the mean age was 46 years in ERF (mean BMI = 26.7 kg/m2) and 59 years in FHS (mean BMI = 27.5 kg/m2). The prevalence of symptoms of sleep apnea was higher in the ERF population, where 16.8% of the participants reported having experienced apneas during the last month, compared to 11.6% and 6.6% of the RS and FHS participants, respectively (Table 2).
Table 1

Descriptive statistics of the study populations.

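| Characteristic | ERF | RS | FHS |
| --- | --- | --- | --- |
| N | 654 | 821 | 472 |
| Age (years), mean ± SD | 46.4 ± 13.4 | 75.0 ± 6.1 | 59.2 ± 9.4 |
| Male (%) | 42.5 | 46.8 | 48.5 |
| BMI (kg/m2), mean ± SD | 26.7 ± 4.4 | 27.4 ± 4.0 | 27.5 ± 4.7 |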

ERF, Erasmus Rucphen Family study; RS, Rotterdam Study; FHS, Framingham Heart Study; N, number of participants; BMI, body mass index.

Table 2

Answers to the sleep apnea question for the discovery and replication populations.

Question: How often did you or your partner notice long pauses between breaths while asleep (a so-called sleep apnea)?

| Answer | ERF, N (%) | RS, N (%) | FHS, N (%) |
| --- | --- | --- | --- |
| Not during the last month | 544 (83.2) | 726 (88.4) | 441 (93.4) |
| Less than once a week | 48 (7.3) | 44 (5.4) | 16 (3.4) |
| Once or twice a week | 32 (4.9) | 32 (3.9) | 6 (1.3) |
| More than twice a week | 30 (4.6) | 19 (2.3) | 9 (1.9) |
| Total | 654 | 821 | 472 |

ERF, Erasmus Rucphen Family study; RS, Rotterdam Study; FHS, Framingham Heart Study.

The exome-wide association results and the distribution of the test statistic (λ = 1.02) are illustrated in Figures 1, 2 respectively. Significant associations of symptoms of sleep apnea were observed with six rare variants [minor allele frequency (MAF) <1%] (located in ACE, AIFM3, LIPJ, MUC2, AP2A2, SH3BP1) (Table 3). Suggestive associations of symptoms of sleep apnea were observed with 11 rare variants (located in KANK2, LCN6, TRAF3, PLEK, HIF1A, SLC45A3, ERCC1/CD3EAP, MRGPRE, GRAMD4, TYW5, CST5) (Table 3). Of all 17 variants, only seven were polymorphic in the replication sample and could be tested for association (Table 4). Of the six significantly associated variants, two could be tested for association with symptoms of sleep apnea in the FHS (located in MUC2 and SH3BP1).
Figure 1

Manhattan plot of the meta-analysis of symptoms of sleep apnea. This plot shows −log10 transformed p-values (y-axis) for all SNPs present in the meta-analysis according to their position on each chromosome (x-axis). The red dashed line represents the Bonferroni corrected p-value threshold for significance (p < 4.3 × 10−7) and the blue dashed line indicates the threshold for suggestive associations (p < 1.0 × 10−6).

Figure 2

Quantile-Quantile plot of the meta-analysis of symptoms of sleep apnea. The QQ-plot shows the observed p-values plotted on the y-axis against the expected values of the test statistics on the x-axis (chi-square distribution). The red line shows the distribution under the null hypothesis.
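The inflation factor λ reported alongside the QQ-plot is conventionally computed as the median observed 1-df chi-square statistic divided by the median expected under the null. The sketch below uses that common definition and only the Python standard library; it is not taken from the authors' code, and the function name is illustrative.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def genomic_inflation(p_values):
    """Estimate lambda from two-sided p-values, each in (0, 1]."""
    nd = NormalDist()
    # A two-sided p-value corresponds to z = inv_cdf(p / 2); z**2 is the
    # matching 1-df chi-square statistic.
    chi2 = [nd.inv_cdf(p / 2.0) ** 2 for p in p_values]
    return median(chi2) / CHI2_1DF_MEDIAN


# Uniformly spread p-values (the null case) give lambda close to 1.
null_like = [(i + 0.5) / 1000 for i in range(1000)]
print(round(genomic_inflation(null_like), 2))
```

A value near 1, such as the reported λ = 1.02, indicates little systematic inflation of the test statistics.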

Table 3

Meta-analysis association results, filtered on p < 1.0 × 10−6.

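Column groups: ERF (N = 654), RS (N = 821), Meta (meta-analysis of ERF and RS, N = 1,475).

| Marker name | Gene | Chr | Position | Minor/major | CADD* | Function GVS* | PolyPhen2* | GERP score* | ERF MAF | ERF Beta | ERF SE | ERF p-value | RS MAF | RS Beta | RS SE | RS p-value | Meta MAF | Meta Beta | Meta SE | Meta p-value |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| rs137910205 | ACE | 17 | 61,561,775 | A/G | 0.50 | Synonymous | – | −9.94 | 0.001 | 2.46 | 0.73 | 7.37 × 10−04 | 0.001 | 2.63 | 0.60 | 1.15 × 10−05 | 0.001 | 2.56 | 0.46 | 3.15 × 10−08 |
| rs178276 | AIFM3 | 22 | 21,331,950 | C/G | 6.73 | Intron | – | −3.74 | 0.002 | −0.29 | 0.52 | 5.75 × 10−01 | 0.004 | 1.53 | 0.25 | 5.03 × 10−10 | 0.003 | 1.19 | 0.22 | 7.68 × 10−08 |
| rs77091298 | LIPJ | 10 | 90,356,568 | G/T | 15.40 | Missense | 0.60 | 4.12 | 0.002 | 1.89 | 0.52 | 2.74 × 10−04 | 0.001 | 1.68 | 0.42 | 7.61 × 10−05 | 0.001 | 1.76 | 0.33 | 8.04 × 10−08 |
| rs9735156 | MUC2 | 11 | 1,093,641 | C/T | 1.95 | Synonymous | – | −2.97 | 0.001 | 2.46 | 0.73 | 7.78 × 10−04 | 0.001 | 1.77 | 0.42 | 2.93 × 10−05 | 0.001 | 1.94 | 0.37 | 1.16 × 10−07 |
| 11:977099 | AP2A2 | 11 | 977,099 | G/A | 15.33 | Missense | 0.30 | 2.96 | 0.001 | 2.25 | 0.73 | 2.14 × 10−03 | 0.001 | 2.59 | 0.60 | 1.61 × 10−05 | 0.001 | 2.45 | 0.46 | 1.27 × 10−07 |
| rs149928566 | SH3BP1 | 22 | 38,039,746 | T/C | 13.28 | Missense | 0.72 | −2.34 | 0.002 | 0.03 | 0.53 | 9.52 × 10−01 | 0.005 | 1.11 | 0.20 | 3.14 × 10−08 | 0.004 | 0.97 | 0.19 | 2.04 × 10−07 |
| rs117057052 | KANK2 | 19 | 11,277,278 | C/T | 3.31 | Missense | 0.37 | 3.11 | 0.001 | 2.39 | 0.73 | 1.09 × 10−03 | 0.001 | 1.67 | 0.42 | 8.60 × 10−05 | 0.001 | 1.85 | 0.37 | 4.78 × 10−07 |
| 9:139642861 | LCN6 | 9 | 139,642,861 | C/T | 2.75 | Non-coding exon | – | −2.03 | 0.005 | 1.06 | 0.30 | 4.81 × 10−04 | 0.001 | 2.54 | 0.60 | 2.47 × 10−05 | 0.002 | 1.36 | 0.27 | 5.31 × 10−07 |
| rs148461790 | TRAF3 | 14 | 103,369,593 | A/G | 15.27 | Missense near splice | 0.43 | 4.53 | 0.001 | 1.66 | 0.73 | 2.28 × 10−02 | 0.001 | 2.73 | 0.60 | 5.23 × 10−06 | 0.001 | 2.30 | 0.46 | 6.83 × 10−07 |
| rs34515106 | PLEK | 2 | 68,607,978 | C/A | 14.50 | Missense | 0.80 | 5.80 | 0.002 | 0.94 | 0.52 | 7.27 × 10−02 | 0.001 | 2.10 | 0.42 | 7.80 × 10−07 | 0.001 | 1.64 | 0.33 | 6.87 × 10−07 |
| rs149348765 | HIF1A | 14 | 62,204,819 | T/G | 14.84 | Missense | 0.59 | 5.41 | 0.008 | 0.89 | 0.24 | 1.46 × 10−04 | 0.001 | 1.41 | 0.42 | 8.62 × 10−04 | 0.004 | 1.02 | 0.21 | 7.89 × 10−07 |
| rs139592793 | SLC45A3 | 1 | 205,632,166 | T/C | 6.52 | Synonymous | – | 1.32 | 0.002 | 1.22 | 0.43 | 4.24 × 10−03 | 0.001 | 2.67 | 0.60 | 8.42 × 10−06 | 0.001 | 1.71 | 0.35 | 8.80 × 10−07 |
| rs2229918 | ERCC1, CD3EAP | 19 | 45,912,924 | G/C | 7.63 | 3-prime-UTR | – | 0.57 | 0.003 | 0.49 | 0.37 | 1.80 × 10−01 | 0.003 | 1.37 | 0.27 | 3.39 × 10−07 | 0.003 | 1.07 | 0.22 | 8.98 × 10−07 |
| rs191846883 | MRGPRE | 11 | 3,249,162 | A/G | 5.89 | Synonymous | – | 3.55 | 0.002 | 1.61 | 0.52 | 1.85 × 10−03 | 0.004 | 0.85 | 0.21 | 5.87 × 10−05 | 0.003 | 0.96 | 0.20 | 9.61 × 10−07 |
| 22:47058906 | GRAMD4 | 22 | 47,058,906 | T/C | 1.44 | Intron | – | −5.20 | 0.002 | 1.33 | 0.52 | 1.05 × 10−02 | 0.001 | 2.71 | 0.60 | 6.16 × 10−06 | 0.001 | 1.92 | 0.39 | 9.78 × 10−07 |
| 2:200803697 | TYW5 | 2 | 200,803,697 | A/G | 38.00 | Stop-gained | – | 4.45 | 0.001 | 1.71 | 0.73 | 1.94 × 10−02 | 0.001 | 2.64 | 0.60 | 1.02 × 10−05 | 0.001 | 2.27 | 0.46 | 9.83 × 10−07 |
| rs142729279 | CST5 | 20 | 23,858,232 | A/G | 1.78 | Synonymous | – | 0.46 | 0.001 | 1.71 | 0.73 | 1.94 × 10−02 | 0.001 | 2.64 | 0.60 | 1.02 × 10−05 | 0.001 | 2.27 | 0.46 | 9.89 × 10−07 |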

Chr, Chromosome; CADD, Combined Annotation Dependent Depletion; GERP, Genomic Evolutionary Rate Profiling; ERF, Erasmus Rucphen Family study; RS, Rotterdam Study; MAF, Minor Allele Frequency, SE, Standard Error, –, unknown;

*SeattleSeq Annotation Database 138.

All effects are reported for the minor allele.

Table 4

Replication results, filtered on p < 1.0 × 10−6.

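Column groups: Discovery (meta-analysis of ERF and RS, N = 1475), FHS (replication, N = 472), Combined (meta-analysis of discovery and replication, N = 1947).

| Marker name | Gene | Discovery MAF | Discovery Beta | Discovery SE | Discovery p-value | FHS MAF | FHS Beta | FHS SE | FHS p-value | Combined MAF | Combined Beta | Combined SE | Combined p-value |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| rs137910205 | ACE | 0.001 | 2.56 | 0.46 | 3.15 × 10−08 | – | – | – | – | – | – | – | – |
| rs77091298 | LIPJ | 0.001 | 1.76 | 0.33 | 8.04 × 10−08 | – | – | – | – | – | – | – | – |
| rs9735156 | MUC2 | 0.001 | 1.94 | 0.37 | 1.16 × 10−07 | 0.002 | −0.14 | 0.34 | 0.69 | 0.001 | 0.83 | 0.25 | 9.09 × 10−04 |
| 11:977099 | AP2A2 | 0.001 | 2.45 | 0.46 | 1.27 × 10−07 | – | – | – | – | – | – | – | – |
| rs149928566 | SH3BP1 | 0.004 | 0.97 | 0.19 | 2.04 × 10−07 | 0.010 | −0.16 | 0.16 | 0.33 | 0.005 | 0.33 | 0.12 | 7.69 × 10−03 |
| rs117057052 | KANK2 | 0.001 | 1.85 | 0.37 | 4.78 × 10−07 | 0.005 | −0.12 | 0.22 | 0.59 | 0.002 | 0.39 | 0.19 | 3.66 × 10−02 |
| 9:139642861 | LCN6 | 0.002 | 1.36 | 0.27 | 5.31 × 10−07 | – | – | – | – | – | – | – | – |
| rs148461790 | TRAF3 | 0.001 | 2.30 | 0.46 | 6.83 × 10−07 | – | – | – | – | – | – | – | – |
| rs34515106 | PLEK | 0.001 | 1.64 | 0.33 | 6.87 × 10−07 | – | – | – | – | – | – | – | – |
| rs149348765 | HIF1A | 0.004 | 1.02 | 0.21 | 7.89 × 10−07 | 0.002 | −0.15 | 0.34 | 0.66 | 0.004 | 0.71 | 0.18 | 6.09 × 10−05 |
| rs139592793 | SLC45A3 | 0.001 | 1.71 | 0.35 | 8.80 × 10−07 | 0.002 | −0.27 | 0.34 | 0.43 | 0.002 | 0.70 | 0.24 | 3.91 × 10−03 |
| rs2229918 | ERCC1, CD3EAP | 0.003 | 1.07 | 0.22 | 8.98 × 10−07 | 0.003 | 0.87 | 0.28 | 1.84 × 10−03 | 0.003 | 0.99 | 0.17 | 6.98 × 10−09 |
| rs191846883 | MRGPRE | 0.003 | 0.96 | 0.20 | 9.61 × 10−07 | – | – | – | – | – | – | – | – |
| 22:47058906 | GRAMD4 | 0.001 | 1.92 | 0.39 | 9.78 × 10−07 | – | – | – | – | – | – | – | – |
| 2:200803697 | TYW5 | 0.001 | 2.27 | 0.46 | 9.83 × 10−07 | – | – | – | – | – | – | – | – |
| rs142729279 | CST5 | 0.001 | 2.27 | 0.46 | 9.89 × 10−07 | 0.002 | −0.02 | 0.34 | 0.95 | 0.001 | 0.78 | 0.28 | 4.35 × 10−03 |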

MAF, Minor Allele Frequency; SE, Standard Error; –, Not available; FHS, Framingham Heart Study.

All effects are reported for the minor allele.

A significant association of symptoms of sleep apnea with rs2229918, located on chromosome 19q13 in the overlapping 3′-untranslated region (UTR) of the ERCC1 and CD3EAP genes (Figure 3), was observed in the replication sample (p = 1.84 × 10−3). Moreover, both the frequency (MAF in FHS = 0.3%) and the effect size of the minor allele (G; β in FHS = 0.87) were consistent with those of the discovery cohorts (MAF = 0.3%, β = 1.07), suggesting that each copy of the minor allele (G) can result in a shift to a higher category in self-reported apnea symptoms (PSQI). Meta-analysing the discovery and replication cohorts strengthened the association of rs2229918 with symptoms of sleep apnea (p = 6.98 × 10−9, β = 0.99).
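To see how pooling the discovery and replication estimates tightens the result, a textbook inverse-variance fixed-effects combination can be applied to the rounded betas and standard errors from Tables 3 and 4. This is an approximation for illustration only: seqMeta combines study-level score statistics rather than rounded betas, so the output is expected to be close to, not identical to, the published values. The function name is hypothetical.

```python
import math


def inverse_variance_meta(estimates):
    """Fixed-effects pooling of (beta, se) pairs by inverse-variance weights.

    Returns the pooled beta, its standard error, and a two-sided p-value
    from the normal approximation.
    """
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    # erfc gives the two-sided tail directly and stays accurate for large |z|.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, p


# rs2229918: discovery meta-analysis (1.07, 0.22) and FHS (0.87, 0.28).
beta, se, p = inverse_variance_meta([(1.07, 0.22), (0.87, 0.28)])
print(f"beta={beta:.2f}, se={se:.2f}, p={p:.1e}")
```

With these rounded inputs the pooled beta and standard error round to the reported 0.99 and 0.17, and the p-value is of the same order of magnitude as the reported 6.98 × 10−9. Applying the same function to the ERF and RS estimates (0.49, 0.37) and (1.37, 0.27) gives values close to the reported discovery estimates.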
Figure 3

Regional association plot for rs2229918. Rs2229918 is located in purple. The dots show the variants tested in this region on chromosome 19. The −log10 transformed p-values are plotted on the y-axis and the genes and positions of the variants (Mb) in this region are depicted on the x-axis.


Discussion

This study aimed at identifying genetic variants associated with symptoms of sleep apnea by performing a meta-analysis of WES data. We identified a rare genetic variant (MAF = 0.3%), rs2229918, located in the shared 3′-UTR region of the ERCC1 and CD3EAP genes with a large effect on symptoms of sleep apnea. We show significant replication of rs2229918 in an independent sample. The CD3EAP gene is located in antisense orientation to ERCC1 where the 3′-UTRs of both genes overlap. This type of 3′-UTR overlap is conserved in mice and yeast suggesting an important biological function (OMIM #107325). 3′-UTRs can be highly enriched for regulatory elements such as binding sites for regulatory proteins and microRNAs and therefore are most likely involved in post-transcriptional regulation (Xie et al., 2005). ERCC1 encodes Excision Repair Cross-Complementation Group 1, a protein functioning in the nucleotide excision repair pathway and needed for the repair of DNA lesions but also involved in recombinational DNA repair and the repair of inter-strand crosslinks (Stelzer et al., 2016). Mutations in ERCC1 have been linked to cerebro-oculo-facio-skeletal syndrome 4, a severe autosomal recessive disorder characterized by growth retardation, dysmorphic facial features, arthrogryposis, and neurologic abnormalities (OMIM #610758). CD3EAP is a component of RNA polymerase I which synthesizes ribosomal RNA precursors and is involved in poly(A) RNA binding and DNA-directed RNA polymerase activity (Stelzer et al., 2016). CD3EAP is localized in the nucleus and mitochondria and has two isoforms, isoform 1 is involved in UBTF-activated (Upstream Binding Transcription Factor, RNA Polymerase 1) transcription, while isoform 2 is a component of preformed T-cell receptor complex. 
CD3EAP is involved in multiple pathways including rRNA expression and RNA Polymerase 1 transcription related pathways; RNA polymerase I promoter escape and transcription; gene expression; and the TNF-alpha/NF-kB signaling pathway (Stelzer et al., 2016). Previous genetic studies have associated NF-kB-dependent genes, especially TNF-α, with OSA (Riha et al., 2005; Ryan et al., 2006; Popko et al., 2008; Bhushan et al., 2009). Moreover, NF-kB is thought to play a key role in mediation of the inflammatory and cardiovascular consequences of OSA (Ryan et al., 2005; Garvey et al., 2009). GeneNetwork (Fehrmann et al., 2015) (http://129.125.135.180:8080/GeneNetwork/) shows that both ERCC1 and CD3EAP are expressed in tissues that may be related to obstruction of the upper airway or diseases of tissues/organs associated with OSA, such as muscle cells, cartilage, trachea, salivary glands, heart and heart ventricles, glucagon secreting cells, the neck and the tongue. This further supports ERCC1 and CD3EAP as interesting candidate genes for symptoms of sleep apnea. Rs2229918 is located on chromosome 19q13, a previously identified region with suggestive evidence for linkage to OSA in European-Americans, independently of BMI (Palmer et al., 2003). Although APOE, a known candidate gene for OSA, is also located in this region, it did not show association with OSA in the present study. A previous study fine-mapped the APOE region and concluded that APOE does not explain the linkage signal, suggesting that APOE is not the causative locus (Larkin et al., 2006). When the linkage analysis performed by Palmer et al. (2003) was repeated with additional family members and families, the chromosome 19 region was not confirmed. However, this could be due to genetic or disease heterogeneity (Larkin et al., 2008). 
Additionally, there were six rare variants (MAF < 0.4%) that surpassed the Bonferroni corrected p-value threshold, of which three (located in ACE, LIPJ and AP2A2) were monomorphic in the FHS and could not be tested for replication. Our top finding, rs137910205, a synonymous variant, is located in the ACE (angiotensin converting enzyme) gene, one of the most studied genes for OSA. Previous studies found an association between the ACE insertion/deletion polymorphism and an increased risk of hypertension in OSA patients (Lin et al., 2004; Bostrom et al., 2007), although results are conflicting (Patel et al., 2007). Further, plasma activity of ACE has been found to be increased in untreated OSA patients (Barcelo et al., 2001). Both carriers of rs137910205 (1 in each cohort) reported the highest score for symptoms of sleep apnea, i.e. these individuals have experienced pauses in breathing at least twice per week. The second variant is the missense variant, rs77091298, located in the LIPJ (Lipase Family Member J) gene. GeneNetwork showed that LIPJ is expressed in the nasopharynx, neck, and muscle cells, all highly relevant tissues in the pathogenesis of OSA (Fehrmann et al., 2015). The third variant that could not be tested for replication, 11:977099, has not been identified before. The variant is located in the AP2A2 gene (Adaptor Related Protein Complex 2 Alpha 2 Subunit), which is related to lipid binding (Stelzer et al., 2016). However, we caution against the interpretation of statistics when the number of carriers of the genetic variants is less than five. Larger sample sizes are needed to further investigate the possible association of these rare genetic variants with OSA. This study has some limitations regarding the study design. We have used questionnaire data for the assessment of symptoms of sleep apnea, which could introduce bias (Fedson et al., 2012). 
Although reports of breathing pauses more than twice per week are highly predictive of polysomnographic sleep apnea, self- or partner-reported breathing pauses have low sensitivity (Young et al., 2002b). Individuals with sleep apnea who experience predominantly hypopneas (shallow breathing) rather than apneas may be less likely to be identified with questionnaire data, as these individuals and their partners may be less likely to recognize these events. Another limitation of using questionnaire data is that discrimination between OSA, central sleep apnea and mixed sleep apnea is not possible. However, the prevalence of central sleep apnea is generally much lower than that of OSA, particularly in general population samples (Donovan and Kapur, 2016). Another limitation is that our findings might not be generalizable to other populations, as all studies used in this analysis comprise predominantly European or European American populations. Previous studies have shown a difference in prevalence of sleep apnea between populations, where young African Americans may be at increased risk for sleep apnea (Redline et al., 1997) and had a higher apnea-hypopnea index relative to European Americans with OSA/hypopnea syndrome (Pranathiageswaran et al., 2013). The frequency of the rs2229918 minor allele (G), based on the 1000 Genomes data, also differs across populations (https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/). Lastly, sleep apnea is a complex and heterogeneous disease influenced by many risk factors such as obesity, age, gender (Redline et al., 1994; Bixler et al., 2001; Young et al., 2002b,a, 2004; Peppard et al., 2013), craniofacial and upper airway abnormalities (Mayer et al., 1996; White, 2005), race (Redline et al., 1997; Li et al., 2000), alcohol intake (Young et al., 2002a), smoking (Wetter et al., 1994), and reduced nasal patency due to congestion and respiratory allergies (Young et al., 1997).
Despite this phenotypic complexity, we have identified and replicated a rare variant associated with symptoms of sleep apnea. However, we have used only one replication sample, and additional studies should further investigate the association of rs2229918 with sleep apnea using objective measurements.
To conclude, this first meta-analysis of symptoms of sleep apnea using WES data identified a rare genetic variant, rs2229918 (MAF 0.3%), located in the 3′-UTR of ERCC1 and CD3EAP, associated with symptoms of sleep apnea. Both genes are interesting candidate genes for (symptoms of) sleep apnea based on their function and expression in tissues relevant for the pathogenesis of the disease. However, the involvement of rs2229918 in OSA pathology should be further examined in larger datasets with more objective measurements.
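The caution about variants with fewer than five carriers, and the call for larger datasets, can be made concrete with a small calculation. The sketch below is illustrative only and is not part of the study's analysis. It assumes Hardy-Weinberg equilibrium (not stated in the text) and pairs the reported MAF of 0.3% with the discovery sample of 1,475 individuals purely as example inputs. The function names `expected_carriers` and `min_sample_size` are hypothetical.

```python
import math


def carrier_probability(maf: float) -> float:
    """Probability that an individual carries at least one minor allele.

    Assumes Hardy-Weinberg equilibrium (an assumption made for this sketch).
    """
    if not 0.0 < maf <= 1.0:
        raise ValueError("maf must be in the interval (0, 1]")
    # A non-carrier has two major alleles: probability (1 - maf)^2.
    return 1.0 - (1.0 - maf) ** 2


def expected_carriers(n_individuals: int, maf: float) -> float:
    """Expected number of carriers in a sample of n_individuals."""
    return n_individuals * carrier_probability(maf)


def min_sample_size(maf: float, min_carriers: float) -> int:
    """Smallest sample size whose expected carrier count reaches min_carriers."""
    return math.ceil(min_carriers / carrier_probability(maf))


# Example inputs taken from the text: MAF 0.3%, 1,475 individuals.
print(round(expected_carriers(1475, 0.003), 1))  # 8.8
# Sample size needed to expect at least five carriers at this MAF.
print(min_sample_size(0.003, 5))  # 835
```

These are expected values under the stated assumption, not observed carrier counts from the cohorts. They show how quickly the number of informative individuals shrinks for rarer variants, which is why associations resting on one or two carriers should be interpreted with care.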

Author contributions

AvdS, CvD, and NA contributed to the conceptualization and design of this work; AvdS and CL were involved in the analysis of the data; AvdS, AL, DK, DG, HT, CvD, and NA were involved in interpretation of the results; AvdS and NA were involved in writing and revising the manuscript; AL, DK, RB, JvR, MvdH, RK, AH, AU, WvI, HT, and CvD were involved in data collection/preparation; AL, DK, CL, RB, JvR, MvdH, RK, AH, AU, WvI, DG, HT, and CvD contributed to the interpretation of the data, read and approved the final manuscript.

Conflict of interest statement

NA reports grants from Netherlands Brain Foundation, outside the submitted work. DG reports grants from NIH, during the conduct of the study; personal fees from VIVUS, Inc., outside the submitted work. RK reports grants from Netherlands Genomics Initiative (NGI), grants from Biobanking and Biomolecular Research Infrastructure Netherlands (BBMRI-NL), during the conduct of the study. AL reports grants and non-financial support from Big Health Ltd., outside the submitted work. HT reports grants from Netherlands Organization for Health Research and Development, during the conduct of the study. The other authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  100 in total

1.  A candidate gene study of obstructive sleep apnea in European Americans and African Americans.

Authors:  Emma K Larkin; Sanjay R Patel; Robert J Goodloe; Yali Li; Xiaofeng Zhu; Courtney Gray-McGuire; Mark D Adams; Susan Redline
Journal:  Am J Respir Crit Care Med       Date:  2010-06-10       Impact factor: 21.405

Review 2.  Frequently used sleep questionnaires in epidemiological and genetic research for obstructive sleep apnea: a review.

Authors:  Annette C Fedson; Allan I Pack; Thorarinn Gislason
Journal:  Sleep Med Rev       Date:  2012-03-17       Impact factor: 11.609

3.  Association of the -1438G/A polymorphism of the 5-HT2A receptor gene with obstructive sleep apnea syndrome.

Authors:  Yildirim A Bayazit; Metin Yilmaz; Tansu Ciftci; Emin Erdal; Oguz Kokturk; Tuba Gokdogan; Yusuf K Kemaloglu; Erdogan Inal
Journal:  ORL J Otorhinolaryngol Relat Spec       Date:  2006-01-30       Impact factor: 1.538

Review 4.  Connections between sleep and cognition in older adults.

Authors:  Kristine Yaffe; Cherie M Falvey; Tina Hoang
Journal:  Lancet Neurol       Date:  2014-10       Impact factor: 44.182

5.  Obstructive sleep apnea: the most common secondary cause of hypertension associated with resistant hypertension.

Authors:  Rodrigo P Pedrosa; Luciano F Drager; Carolina C Gonzaga; Marcio G Sousa; Lílian K G de Paula; Aline C S Amaro; Celso Amodeo; Luiz A Bortolotto; Eduardo M Krieger; T Douglas Bradley; Geraldo Lorenzi-Filho
Journal:  Hypertension       Date:  2011-10-03       Impact factor: 10.190

6.  Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals.

Authors:  Xiaohui Xie; Jun Lu; E J Kulbokas; Todd R Golub; Vamsi Mootha; Kerstin Lindblad-Toh; Eric S Lander; Manolis Kellis
Journal:  Nature       Date:  2005-02-27       Impact factor: 49.962

7.  Selective activation of inflammatory pathways by intermittent hypoxia in obstructive sleep apnea syndrome.

Authors:  Silke Ryan; Cormac T Taylor; Walter T McNicholas
Journal:  Circulation       Date:  2005-10-25       Impact factor: 29.690

8.  Sleep-disordered breathing and neuropsychological deficits. A population-based study.

Authors:  H C Kim; T Young; C G Matthews; S M Weber; A R Woodward; M Palta
Journal:  Am J Respir Crit Care Med       Date:  1997-12       Impact factor: 21.405

9.  Racial differences in sleep-disordered breathing in African-Americans and Caucasians.

Authors:  S Redline; P V Tishler; M G Hans; T D Tosteson; K P Strohl; K Spry
Journal:  Am J Respir Crit Care Med       Date:  1997-01       Impact factor: 21.405

Review 10.  The association of 5-HT2A, 5-HTT, and LEPR polymorphisms with obstructive sleep apnea syndrome: a systematic review and meta-analysis.

Authors:  Baodong Qin; Zhen Sun; Yan Liang; Zaixing Yang; Renqian Zhong
Journal:  PLoS One       Date:  2014-04-22       Impact factor: 3.240
