Literature DB >> 28898263

Genome-wide analysis of parent-of-origin interaction effects with environmental exposure (PoOxE): An application to European and Asian cleft palate trios.

Øystein A Haaland1, Astanand Jugessur1,2,3, Miriam Gjerdevik1,2, Julia Romanowska1, Min Shi4, Terri H Beaty5, Mary L Marazita6, Jeffrey C Murray7, Allen J Wilcox8, Rolv T Lie1,9, Håkon K Gjessing1,3.   

Abstract

Cleft palate only is a common birth defect with high heritability. Only a small fraction of this heritability is explained by the genetic variants identified so far, underscoring the need to investigate other disease mechanisms, such as gene-environment (GxE) interactions and parent-of-origin (PoO) effects. Furthermore, PoO effects may vary across exposure levels (PoOxE effects). Such variation is the focus of this study. We upgraded the R-package Haplin to enable direct tests of PoOxE effects at the genome-wide level. From a previous GWAS, we had genotypes for 550 case-parent trios, of mainly European and Asian ancestry, and data on three maternal exposures (smoking, alcohol, and vitamins). Data were analyzed for Europeans and Asians separately, and also for all ethnicities combined. To account for multiple testing, a false discovery rate method was used, where q-values were generated from the p-values. In the Europeans-only analyses, interactions with maternal smoking yielded the lowest q-values. Two SNPs in the 'Interactor of little elongation complex ELL subunit 1' (ICE1) gene had a q-value of 0.14, and five of the 20 most significant SNPs were in the 'N-acetylated alpha-linked acidic dipeptidase-like 2' (NAALADL2) gene. No evidence of PoOxE effects was found in the other analyses. The connections to ICE1 and NAALADL2 are novel and warrant further investigation. More generally, the new methodology presented here is easily applicable to other traits and exposures in which a family-based study design has been implemented.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28898263      PMCID: PMC5595425          DOI: 10.1371/journal.pone.0184358

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

With a prevalence of 0.5 per 1000 live births, cleft palate only (CPO) is a common birth defect in humans [1, 2]. It is broadly categorized according to whether it occurs as an isolated defect or together with additional congenital anomalies. In this paper, we focus on isolated CPO. The particularly high heritability and recurrence risk of orofacial clefts [3-8] have spurred long-standing efforts to identify genetic variants controlling risk to these common birth defects. However, as with most other complex traits, the genetic variants identified thus far explain only a small fraction of the total heritability and familial recurrence, underscoring the need to examine etiologic mechanisms beyond simple child effects alone. One alternative is to investigate the effect of a risk-allele or haplotype based on whether it is inherited from the mother or the father (i.e., parent-of-origin (PoO) effects). A difference in effect by parent of origin could occur, for example, with genes that are subject to genomic imprinting [9], which occurs when the allele from one parent is silenced but the allele from the other parent is expressed. This possibility is especially relevant for perinatal disorders because the mother defines the prenatal environment of the fetus. Another popular approach is to explore the role of environmental factors, either independently or in combination with specific genetic variants (GxE effects). Although animal models have long demonstrated that environmental factors are important in clefting (reviewed in [10, 11]), the evidence from human studies is less conclusive. Among a wide array of environmental factors, maternal periconceptional smoking has been consistently associated with increased risk of clefting [12-14]. Since most environmental factors are modifiable, identifying GxE effects may help to target genetically susceptible subgroups of the population. A third, yet unexplored approach is to study PoO effects in interaction with environmental exposures (PoOxE); i.e., whether PoO effects vary according to the exposure status of the fetus. With the notable exception of Wang et al. (2011) [15], who assessed differential imprinting across environmental exposures in childhood asthma, the literature on PoOxE effect estimation is sparse. To address this gap, we have developed a comprehensive and user-friendly methodology that is not restricted by assumptions pertaining to imprinting. The theoretical foundation for these new methods has been presented by Skare et al. (2012) [14] and Gjerdevik et al. (2017) [16], and the methods themselves are available in the R-package Haplin [17]. The mathematics behind the PoOxE analyses is outlined in Materials and methods. This study is based on the case-parent trio study design, which is applicable to a wide range of etiologic scenarios pertinent to perinatal disorders [18]. We had GWAS data as well as information on periconceptional exposures from the mother (cigarette smoking, alcohol intake, vitamin use) and ethnicity (European, Asian, other) for the largest collection of CPO trios to date [19]. Our aim is to identify PoOxE effects in this data set.

Results

We conducted three sets of analyses: pooled analyses including all participants; analyses restricted to Europeans only; and analyses restricted to Asians only. The remaining ethnic groups in our data set were too small to justify separate analyses (Table 1). Given the phenotypic consistency in clefting across ethnicities, it is reasonable to assume that a proportion of the causal variants for clefting is shared across all ethnicities. Accordingly, we present the results of the pooled analyses first, followed by the Europeans-only and Asians-only analyses. The combination of three environmental exposures and the above subgroup analyses yielded a large amount of results. For simplicity, we chose to focus on the top 20 SNPs (sorted by observed p-value) from each analysis. Details about these SNPs, including relative risk ratios (RRRs), are provided in Table 2 and Fig 1, Table 3 and Fig 2, and Table 4 and Fig 3. The corresponding Manhattan plots are provided as supplementary online material (S1 to S3 Figs). Table 5 contains the full names of all the genes mentioned in Tables 2 to 4.
Table 1

Population distribution according to ethnicity and trio completeness.

EthnicityComplete triosIncomplete triosTotal
All46684550
European21554269
Asian23122253
Other*20828

Columns show the number of families.

*Separate analyses were not conducted for this group.

Table 2

Top 20 SNPs sorted according to p-value in the pooled analysis.

SNP aP-valueQ-valueRRR (95% CI)Gene symbol bShared c
SMOKINGrs11160994.3e-060.87.3 (3.1–17)NC
rs29641375.5e-060.86.9 (3.0–16)~ICE1Europe
rs29644475.6e-060.86.8 (3.0–16)~ICE1Europe
rs13486921.26e-0516.3 (2.8–15)NC
rs174017971.6e-0510.17 (0.08–0.38)~ICA1/ GLCCI1Europe
rs2478201.72e-0510.15 (0.06–0.36)ATP2C2
rs67644221.94e-05110.5 (3.6–31)NAALADL2Europe
rs42434122.26e-05110.2 (3.5–30)NAALADL2Europe
rs46958082.45e-0510.18 (0.08–0.40)NC
rs109368613.12e-0517.3 (2.9–19)NAALADL2Europe
rs48848143.91e-0515.4 (2.4–12)NC
rs117067604.17e-0519.5 (3.2–28)NAALADL2Europe
rs48689534.68e-0515.4 (2.4–12)NC
rs102289067.24e-0510.18 (0.08–0.42)STEAP1B
rs18897067.59e-0510.16 (0.06–0.39)NRG3
rs28616247.77e-0515.1 (2.3–12)NC
rs67816598.85e-0516.5 (2.5–17)NAALADL2Europe
rs10356319.63e-0510.20 (0.09–0.45)~WIF1
rs130783070.000114610.10 (0.03–0.32)CNTN4
rs79973690.000118114.9 (2.2–11)NC
ALCOHOLrs60929341.3e-060.56.7 (3.1–15)NCEurope
rs25878884.1e-060.70.19 (0.10–0.39)GNAO1
rs129015365e-060.75.3 (2.6–11)NC
rs169916451.02e-050.80.18 (0.08–0.38)PSMF1
rs18845111.07e-050.80.18 (0.08–0.38)NC
rs13961761.16e-050.80.18 (0.08–0.38)NC
rs115956561.26e-050.80.11 (0.04–0.30)GRID1Europe
rs126130262.13e-0514.2 (2.2–8.1)HAAO
rs102003712.32e-0510.21 (0.11–0.44)NC
rs124170423.22e-0510.19 (0.09–0.42)GALNT18
rs25602943.41e-0510.21 (0.10–0.44)NC
rs79924984.15e-0510.23 (0.12–0.47)NC
rs14519914.68e-0510.17 (0.07–0.40)NC
rs134181134.73e-0510.19 (0.09–0.42)NC
rs47945565.47e-0510.13 (0.05–0.35)NC
rs49103205.87e-0510.20 (0.09–0.43)GALNT18
rs49057415.9e-0510.25 (0.12–0.49)NC
rs104644196.26e-0510.22 (0.11–0.46)DPP6
rs98620036.5e-0516.1 (2.5–15)FHIT
rs47569306.84e-0514.3 (2.1–8.7)SAAL1
VITAMINrs28306341.1e-060.54.8 (2.6–9.0)NC
rs100870704.2e-060.70.15 (0.07–0.34)NC
rs100876435e-060.70.16 (0.07–0.35)NC
rs72450397.3e-060.70.26 (0.14–0.46)NCAsia
rs116593407.8e-060.70.25 (0.13–0.46)NC
rs29089079.3e-060.70.26 (0.14–0.47)NCAsia
rs130990911.19e-050.719 (5.2–72)NC
rs98744701.48e-050.70.25 (0.14–0.47)LSAMP
rs117872351.69e-050.70.17 (0.08–0.39)NC
rs19183671.94e-050.70.27 (0.15–0.49)NCEurope
rs1634742.14e-050.70.26 (0.14–0.48)ZNF659
rs60249562.2e-050.70.27 (0.15–0.50)NC
rs80870792.63e-050.70.27 (0.15–0.50)NCAsia
rs346467502.72e-050.70.28 (0.15–0.51)NC
rs48311292.93e-050.70.26 (0.14–0.49)LSAMP
rs99471982.94e-050.73.7 (2.0–6.9)NC
rs10267913.1e-050.73.5 (2.0–6.4)~IL22
rs109600723.44e-050.70.25 (0.13–0.48)NC
rs32140023.6e-050.73.8 (2.0–7.0)NC
rs65934453.9e-050.70.23 (0.12–0.46)NC

a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes)

b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene

c Shared: Also among the top 20 SNPs in either the Asians-only or the Europeans-only analyses.

Fig 1

Pooled analyses of all ethnicities combined.

From left to right: smoking, alcohol intake, and vitamin use.

Table 3

Top 20 SNPs sorted according to p-value in the Europeans-only analysis.

SNPaP-valueQ-valueRRR (95% CI)Gene symbol bShared c
SMOKINGrs29644476e-070.140.09 (0.04–0.23)~ICE1Pooled
rs29641377e-070.140.09 (0.04–0.23)~ICE1Pooled
rs42434124.1e-060.417 (5.0–56)NAALADL2Pooled
rs67644224.1e-060.417 (5.0–56)NAALADL2Pooled
rs67710267.2e-060.411 (3.8–31)NC
rs109368617.6e-060.412 (4.0–36)NAALADL2Pooled
rs126784999e-060.40.08 (0.03–0.24)OXR1
rs125488869.1e-060.40.08 (0.03–0.24)OXR1
rs96617281.07e-050.48.1 (3.2–20)NC
rs117067601.07e-050.415 (4.5–49)NAALADL2Pooled
rs174017971.32e-050.50.13 (0.05–0.32)~ICA1/GLCCI1Pooled
rs75459401.34e-050.50.11 (0.04–0.30)MORN1
rs64542372.57e-050.80.11 (0.04–0.30)FAM46A
rs107776472.63e-050.80.15 (0.06–0.36)NC
rs93442082.75e-050.80.11 (0.04–0.30)NC
rs126208963.05e-050.80.09 (0.03–0.27)NC
rs173674093.59e-050.97.7 (2.9–20)ZHX2
rs67816594.15e-050.99.9 (3.3–29)NAALADL2Pooled
rs94493574.29e-050.90.12 (0.04–0.33)NC
rs93442104.44e-050.90.12 (0.04–0.33)NC
ALCOHOLrs60929347.5e-060.88.0 (3.2–20)NCPooled
rs7382612.2e-050.85.6 (2.5–12)BPIFC
rs104644192.56e-050.80.15 (0.06–0.36)DPP6Pooled
rs15632312.9e-050.80.09 (0.03–0.28)NC
rs130161273.37e-050.80.18 (0.08–0.41)NC
rs7601503.5e-050.810 (3.4–31)PCP4
rs22719863.51e-050.824 (5.2–105)NOS1
rs96585703.51e-050.823 (5.2–105)NOS1
rs115956563.68e-050.80.10 (0.03–0.29)GRID1Pooled
rs3291384.3e-050.80.18 (0.08–0.41)~CLDN18/DZIP1L
rs104980664.52e-050.86.7 (2.7–17)NC
rs64695484.53e-050.80.09 (0.03–0.29)NC
rs5148984.62e-050.86.7 (2.7–18)NC
rs100355804.63e-050.86.4 (2.6–16)FAM134B
rs76055684.77e-050.86.7 (2.7–17)NC
rs110764524.88e-050.80.18 (0.08–0.41)NC
rs68483134.97e-050.80.18 (0.08–0.42)PPARGC1A
rs71414165.1e-050.812 (3.5–38)NC
rs49380945.71e-050.80.16 (0.06–0.39)NC
rs171025055.77e-050.80.06 (0.02–0.24)NC
VITAMINrs14003161e-060.410 (4.0–26)DLG2
rs8810294.1e-060.812 (4.2–36)GPC1
rs109339731.1e-050.87.5 (3.1–19)GUCA1C
rs12906201.25e-050.80.14 (0.06–0.34)CYP4F3
rs7599981.45e-050.80.14 (0.06–0.34)CYP4F3
rs21444101.52e-050.80.12 (0.04–0.31)TBC1D22A
rs22752561.52e-050.89.7 (3.5–27)BNC2
rs19183671.67e-050.80.13 (0.05–0.33)NCPooled
rs70335122.17e-050.89.1 (3.3–25)BNC2
rs108981662.34e-050.80.12 (0.04–0.32)DLG2
rs112337742.34e-050.80.12 (0.04–0.32)DLG2
rs170513782.62e-050.810 (3.4–30)ANXA5
rs109326192.75e-050.86.6 (2.7–16)NC
rs7220972.94e-050.86.8 (2.8–17)NINJ2
rs9211713.3e-050.86.7 (2.7–17)GUCA1C
rs7518733.3e-050.87.6 (2.9–20)SYNJ2
rs7390123.4e-050.80.13 (0.05–0.34)TBC1D22A
rs26830454.54e-050.960.15 (0.06–0.38)CYP4F3
rs47996464.67e-050.9611 (3.4–34)NC
rs18384544.93e-050.967.7 (2.9–20)NC

a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes)

b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene

c Shared: Also among the top 20 SNPs in either the Asians-only or the pooled analyses.

Fig 2

Analyses of the European sample.

From left to right: smoking, alcohol intake, and vitamin use.

Table 4

Top 20 SNPs sorted according to p-value in the Asians-only analysis.

SNP aPQRRR (95% CI)Gene symbol bShared c
rs125190785.8e-06112 (4.0–33)NC
rs13454057.5e-06112 (4.1–36)NC
rs118596291.02e-05111 (3.9–34)NC
rs20525091.03e-0510.06 (0.02–0.21)TENM2
rs74992152.99e-05110 (3.5–31)NC
rs29089153.01e-0510.11 (0.04–0.32)NC
rs29089073.68e-0510.11 (0.04–0.32)NCPooled
rs29643563.97e-0510.11 (0.04–0.32)NC
rs19966444.99e-0510.10 (0.03–0.30)NC
rs80870795.66e-0510.10 (0.03–0.31)NCPooled
rs66830707.91e-0510.11 (0.04–0.33)NC
rs79746469.1e-0510.11 (0.03–0.33)NC
rs131195499.29e-0517.9 (2.8–22)~HELT
rs129604899.89e-0510.11 (0.04–0.33)NC
rs23034470.000104210.12 (0.04–0.35)TPD52
rs20988980.000104210.12 (0.04–0.35)TPD52
rs13371610.000107310.12 (0.04–0.35)NC
rs12529510.0001210.13 (0.04–0.36)~MIS18BP1
rs22894870.0001416126 (4.8–141)PLIN1
rs72450390.000156510.12 (0.04–0.36)NCPooled

a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes)

b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene

c Shared: Also among the top 20 SNPs in either the pooled or Europeans-only analyses.

Fig 3

Analyses of vitamin use in the Asian sample.

Table 5

Full gene names*.

SmokingAlcohol useVitamin use
ATP2C2: ATPase secretory pathway Ca2+ transporting 2BPIFC: BPI fold containing family CANXA5: Annexin A5
CNTN4: Contactin 4CLDN18: Claudin 18BNC2: Basonuclin 2
FAM46A: Family with sequence similarity 46 member ADPP6: Dipeptidyl peptidase like 6CYP4F3: Cytochrome P450 family 4 subfamily F member 3
GLCCI1: Glucocorticoid induced 1DZIP1L: DA2 introducing proteinDLG2: Discs large MAGUK scaffold protein 2
ICA1: Islet cell autoantigen 1FAM134B: Family with sequence similarity 134 member BGPC1: Glypican 1
ICE1: Interactor of little elongation complex ELL subunit 1FHIT: Fragile histidine triadGUCA1C: Guanylate cyclase activator 1C
MORN1: MORN repeat containing 1GALNT18: Polypeptide N-acetylgalactosaminyltransferase 18HELT: Helt bHLH transcription factor
NAALADL2: N-acetylated alpha-linked acidic dipeptidase like 2GNAO1: G protein subunit alpha o1IL22: interleukin 22
NRG3: Neuregulin 3GRID1: Glutamate ionotropic receptor delta type subunit 1LSAMP: Limbic system associated membrane protein
OXR1: Oxidation resistance 1HAAO: 3-hydroxyanthranilate 3,4-dioxygenaseMIS18BP1: MIS18 binding protein 1
STEAP1B: STEAP family member 1BNOS1: Nitric oxide synthase 1NINJ2: Ninjurin 2
WIF1: WNT inhibitory factor 1PCP4: Purkinje cell protein 4PLIN1: Perilipin 1
ZHX2: Zinc fingers and homeoboxes 2PSMF1: Proteasome inhibitor subunit 1SYNJ2: Synaptojanin 2
PPARGC1A: PPARG coactivator 1 alphaTBC1D22A: TBC1 domain family member 22A
SAAL1: Serum amyloid A like 1TENM2: Teneurin transmembrane protein 2
TPD52: Tumor protein D52
ZNF659: Zinc finger protein 659

*The full gene names were retrieved from the NCBI Entrez Gene Database (https://www.ncbi.nlm.nih.gov/gene).

Gene IDs used in Tables 2 to 4

Pooled analyses of all ethnicities combined.

From left to right: smoking, alcohol intake, and vitamin use.

Analyses of the European sample.

From left to right: smoking, alcohol intake, and vitamin use. Columns show the number of families. *Separate analyses were not conducted for this group. a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes) b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene c Shared: Also among the top 20 SNPs in either the Asians-only or the Europeans-only analyses. a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes) b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene c Shared: Also among the top 20 SNPs in either the Asians-only or the pooled analyses. a SNP location according to the 1000 Genomes browser (Phase 3; https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes) b NC: Not close to any known gene (at least within a 30 kb-distance). Pseudogenes and non-coding RNA (ncRNA) are excluded. ~: located within 30 kb of a gene c Shared: Also among the top 20 SNPs in either the pooled or Europeans-only analyses. *The full gene names were retrieved from the NCBI Entrez Gene Database (https://www.ncbi.nlm.nih.gov/gene). Gene IDs used in Tables 2 to 4 To adjust for multiple testing, we used a false discovery rate method where q-values are calculated from observed p-values [20]. We used a q-value of 0.1 to assess statistical significance, which means that at least 90% of the significant SNPs are expected to be true positives. Across all analyses, several SNPs had q-values ranging from 0.1 to 0.5 (Tables 2 to 4). This corresponds to a false discovery rate between 10% and 50%, implying that many of these SNPs are potentially associated with PoOxE effects. Fig 1 shows QQ-plots for the pooled analyses, comprising all ethnicities. All of the most significant SNPs are within the 95% confidence band at the upper right corner of the distribution. The lowest q-values were 0.8 for rs1116099 for maternal smoking, 0.5 for rs6092934 for maternal alcohol intake, and 0.5 for rs2830634 for maternal vitamin use (Table 2). QQ-plots for the Europeans-only analyses are shown in Fig 2. The plot for smoking is particularly notable because all the top 12 SNPs had lower p-values than expected, even though most of them were located within the 95% confidence band. Specific p-values and q-values for these SNPs are provided in Table 3. All of these q-values were below 0.5 for the top 12 SNPs, but markedly higher for the remaining SNPs. Among these 12 SNPs, both rs2964447 and rs2964137 had a q-value of 0.14 (RRR = 0.09, 95% CI: 0.04–0.23). For alcohol intake and vitamin use, the top SNPs were rs6092934 (q = 0.8, RRR = 8.0, 95% CI: 3.2–19.8) and rs1400316 (q = 0.4, RRR = 10.1, 95% CI: 4.0–25.6), respectively. The Asians-only analyses were uninformative due to the low number of trios in which the mother had smoked or consumed alcohol (Table 6). Consequently, tests for interaction had less power than the other analyses. For vitamin use, the QQ-plot did not deviate appreciably from the expected pattern (Fig 3). Table 4 shows the p-values and q-values for the top 20 SNPs. All the SNPs in the Asians-only analyses had q-values equal to one.
Table 6

Characteristics of maternal exposures according to ethnicity.

Ethnic groupMaternal exposureNoYesMissing
All* (n = 550)Smoking463861
Vitamin26522461
Alcohol4231225
European (n = 269)Smoking195740
Vitamin8815526
Alcohol1601081
Asian (n = 253)Smoking24580
Vitamin1705132
Alcohol24193

*Includes ethnicities that are not European or Asian

*Includes ethnicities that are not European or Asian Several of the top 20 SNPs were the same across the three main analyses (pooled, Europeans-only, and Asians-only). The pooled and Europeans-only analyses had eight of the top SNPs in common for PoOxSmoke, three for PoOxAlcohol, and one for PoOxVitamin (Table 2). Similarly, the pooled and Asians-only analyses had three of the top SNPs in common for PoOxVitamin (Table 2). As several of the top 20 SNPs were located in the gene for ‘N-acetylated alpha-linked acidic dipeptidase-like 2’ (NAALADL2), we generated a regional association plot for rs4243412, which was the SNP in NAALADL2 with the lowest p-value in the Europeans-only analysis (Fig 4). We created a similar plot for rs2964137, which was the SNP with the lowest p-value in the pooled analysis (Fig 5). This SNP is located near the ‘Interactor of little elongation complex ELL subunit 1’ (ICE1) gene, and was also found among the top 20 SNPs in the Europeans-only analysis (Table 2).
Fig 4

Regional association plot for rs4243412 in NAALADL2.

The lead SNP is shown in blue, with its associated p-value.

Fig 5

Regional association plot for rs2964137 near ICE1.

The lead SNP is shown in blue, with its associated p-value.

Regional association plot for rs4243412 in NAALADL2.

The lead SNP is shown in blue, with its associated p-value.

Regional association plot for rs2964137 near ICE1.

The lead SNP is shown in blue, with its associated p-value. Because PoO effects and maternal effects may be mutually confounded [21], we performed sensitivity analyses on the above-mentioned top 20 SNPs, and adjusted for potential maternal effects in each stratum of exposure. In these analyses, the RRRs were similar to those in Tables 2 to 4, and the Bonferroni corrected p-values for the interaction between maternal and environmental effects were all equal to 1.

Discussion

Our study used data from the largest collection of CPO trios to date [19] to investigate the hitherto untested possibility of interactions between PoO effects and maternal environmental exposures that have previously shown associations with clefts. We introduce new methodology that not only tests for PoOxE effects but also quantifies them as ratios of relative risks. All analyses were implemented in the R-package Haplin, which accommodates a wide range of etiologic scenarios suitable for family-based study designs. An example code for PoOxE analysis is provided in S1 Appendix.

Pooled analyses

For PoOxSmoke, all p-values were higher in the pooled analyses than in the Europeans-only analyses, suggesting a dilution of effects after pooling data. This reduction of the effect estimate in the pooled analyses may reflect heterogeneity of effect among the subgroups. The opposite was true for PoOxAlcohol, which may indicate a more consistent effect of alcohol across ethnicities. Regarding maternal smoking, multiple SNPs in NAALADL2 indicated the presence of PoOxSmoke effects. No genes or SNPs stood out in the PoOxVitamin analysis.

Europeans-only analyses

We found suggestive evidence of a PoOxSmoke effect for rs2964137 and rs2964447. Although neither SNP is located within any known gene, both lie near ICE1 and are only 2–15 kb from three copy-number variant regions (CNVRs). As in the pooled PoOxSmoke analysis, several top SNPs are located in NAALADL2. Previous analyses of data from genome rearrangements, GWAS, and gene-expression have linked this gene to various disorders, including mild retardation [22] and cancer [23]. We have not been able to find a connection between clefting and ‘Glucocorticoid induced transcript 1’ (GLCCI1), ‘Islet cell autoantigen 1’ (ICA1), or ‘Zinc finger and homeobox 2’ (ZHX2). Regarding PoOxAlcohol effects, ‘Nitric oxide synthase 1’ (NOS1) and ‘Dipeptidyl-peptidase 6’ (DPP6) were among the most interesting genes. NOS1 acts as a physiological modulator of skeletal muscle function and DPP6 is involved in embryonic craniofacial development [24, 25]. Another member of the nitric oxide synthase family, NOS3, is involved in the folate pathway and has previously been linked to a higher risk of isolated CL/P in a non-Hispanic white population [26]. Furthermore, analysis of biopsies of soft palate muscle tissues from children with isolated clefts showed that NOS1 immunoreactivity in the muscle fibers was strongly influenced by the cleft itself [27]. In the PoOxVitamin analysis, three SNPs were located in the ‘Discs, large homolog 2’ (DLG2) gene on chromosome 11q14.1. One of these SNPs in DLG2, rs1400316, had the lowest q-value (0.4). Little has been reported about its role in clefting. Three other genes, ‘Guanylate cyclase activator 1C’ (GUCA1C), ‘TBC1 domain family, member 22A’ (TBC1D22A) and ‘Cytochrome P450, family 4, subfamily F, member 3’ (CYP4F3), each contain two of the top 20 SNPs from this analysis. Based on the literature, however, GUCA1C and TBC1D22A do not appear to have any connections to clefting. In contrast, CYP4F3 belongs to the cytochrome P450 gene family, which is known to be involved in the biotransformation of endobiotics and xenobiotics [28], and may be relevant for clefting. Still, the q-values for SNPs in CYP4F3 were 0.8 or higher.

Asians-only analyses

Compared with European women, Asian women generally consume little alcohol and tobacco [29, 30], which would be expected to be even less among those who are pregnant or planning to be pregnant. This was also observed in our data (Table 6). Even though a lack of observations was not a problem for the PoOxVitamin analyses, all the q-values were equal to one and there were no convincing associations overall for this ethnic group. Regarding ethnic specificity and generalizability, none of the top SNPs in the Asians-only analyses were among the top SNPs in the Europeans-only analyses (Tables 3 and 4), which suggests ethnic-specific effects. Still, the lack of markers in common was somewhat unexpected, as GxE effects have previously been reported across the two ethnicities in the same sample population studied here [31]. However, that study used a different approach; the pooled sample was analyzed first and the top SNPs were verified to see whether the results were consistent across ethnicities. Additionally, the authors did not consider PoOxE.

Methodological considerations

The case-parent trio study design coupled with a large data set provided an excellent opportunity to explore PoOxE effects. The study design protects against false positives due to population substructure, because it aims at detecting asymmetries in allele transmission from parents to the affected child (proband), as opposed to considering only differences in allele frequencies at a population level. Still, if populations of different ethnicities react differently to a given exposure, such that there is a PoOxE effect in one population but not in the other, this effect may be muted or even go undetected in the combined population. It is therefore judicious to stratify analysis by ethnicity. PoO effects may be seen when a gene associated with a given phenotype is also subjected to genomic imprinting [32, 33]. Through DNA methylation, the expression of a particular gene can be upregulated or downregulated depending on its parental origin [9, 34]. It is thus reasonable to assume that maternal environmental exposures capable of influencing methylation patterns might also influence the phenotype differently for maternally and paternally inherited alleles. Hence, it is conceivable that looking specifically for PoOxE effects rather than standard PoO or GxE effects alone might increase the chance of finding gene effects that are indicative of, for instance, genomic imprinting. While PoOxE searches combine PoO searches with ordinary GxE searches in a natural way, there is a price to pay in the form of added complexity. Nevertheless, the total PoOxE effect at a locus with two alleles and a dichotomous environmental exposure can be measured as a single ratio of relative risks (RRR). We have where RRmat(S) and RRpat(S) are as explained in Materials and Methods, and RRR is the ratio of PoO effects in the two strata. If RRR > 1, the interpretation is that the PoO effect RRmat(1)/RRpat(1) in stratum 1 is larger than the corresponding RRmat(0)/RRpat(0) in stratum 0. Note that this may come about in different ways. For example, consider an allele that increases the risk only when inherited from exposed mothers, so that RRmat(1) > 1. Because the other RRs are equal to 1, RRR would be larger than 1. Similarly, if the allele is protective when inherited from unexposed mothers but has no effect in other situations, RRmat(0) < 1, and again RRR > 1. One might also observe more complex patterns, such as an increased risk when the allele is inherited from the mother, where this effect is larger among the exposed than the unexposed; that is, and we would again have RRR > 1. The actual direction of the effect may depend on which allele and exposure group are chosen as reference, which is a general problem when assessing GxE in case-only designs. While ordinary PoO analyses consider the ratio RRmat/RRpat for both strata combined, and ordinary GxE analyses consider RR(1)/RR(0) without accounting for parental origin, the full PoOxE RRR involves comparing four quantities—the effects of maternally and paternally derived alleles computed in both strata separately. Thus, a certain loss of power would be expected relative to the standard tests for PoO and GxE effects. This is indeed what we observe in the power simulations (Fig 6, right panel). We therefore decided not to include maternal genomic effects in the full GWAS analysis, since this is likely to further reduce power to detect PoO effects [21]. Instead, we performed sensitivity analyses to remove any positive confounding from maternal effects for the 20 most promising SNPs in each set of analyses (shown in Tables 2 to 4). It is not particularly likely that any of the genes involved in the sensitivity analyses would operate through maternal effects. Complex, but less likely scenarios where maternal effects cancel out PoO effects may be missed by this approach, however.
Fig 6

Simulation-based power curves.

Left panel: Power versus relative risk ratio (RRR) for different sample sizes, minor allele frequency (MAF) = 0.2, significance level = 0.05. Middle panel: Power versus RRR for different MAFs, total sample size = 500. Right panel: Power versus RR or RRR, as applicable to each effect type, MAF = 0.2, total sample size = 500. Note that the black curve with full squares is identical in all panels (based on a total of 500 trios, MAF = 0.2, and PoOxE). In the PoOxE analysis, we have varied the RR of the maternal allele with exposure status.

Simulation-based power curves.

Left panel: Power versus relative risk ratio (RRR) for different sample sizes, minor allele frequency (MAF) = 0.2, significance level = 0.05. Middle panel: Power versus RRR for different MAFs, total sample size = 500. Right panel: Power versus RR or RRR, as applicable to each effect type, MAF = 0.2, total sample size = 500. Note that the black curve with full squares is identical in all panels (based on a total of 500 trios, MAF = 0.2, and PoOxE). In the PoOxE analysis, we have varied the RR of the maternal allele with exposure status. As shown in Fig 6, PoOxE analyses will generally have lower power, given similar effect sizes, compared with PoO and GxE analyses. However, because PoOxE effects are measured as ratios of RRRs (see Eq (1)), it is hypothetically possible that PoOxE effects are larger than PoO effects or GxE effects, in particular in the presence of ‘qualitative interactions’, where effects are in opposite directions across strata. This is illustrated in S1 Appendix, and may partly explain some of the large effects in Tables 2 to 4. Under such scenarios, some of the lost power may be regained. Nevertheless, none of the q-values were lower than 0.14, which suggests that low power may have been an issue in this study. Still, several SNPs had q-values below 0.5, meaning that we expect fewer than half of them to be false positives. SNPs presented in Tables 2 to 4 should be interpreted as candidates to be further investigated in other studies. The next steps would be to replicate these candidates in other data sets/populations, followed by targeted functional analyses to help elucidate the importance of these SNPs in the interplay between environmental factors and risk of CPO. To summarize, this study presents new methodology, implemented in the R-package Haplin, to investigate PoOxE effects in the context of family trios or duos. Our analyses pointed to several SNPs with PoOxSmoke effects in the European sample. We were unable to assess the generalizability of this finding across ethnicities, because few of the Asian mothers smoked cigarettes or consumed alcohol. We did not find any evidence for PoOxAlcohol effects in the European sample, and there were no PoOxVitamin effects in either ethnicity. Still, these analyses highlight the versatility of Haplin in studying complex disease models.

Materials and methods

Study participants

The majority of the participants belonged to one of two major ethnicities (European or Asian). Table 1 outlines the population distribution by ethnicity and trio completeness, and Table 6 summarizes characteristics of the maternal exposures by ethnicity.

Quality control

Genotypes for 569 244 SNPs were available for the current analyses. The PLINK software [35] was used for quality control, with the following criteria applied for excluding SNPs: (i) >5% missing genotype for a given SNP, (ii) minor allele frequency (MAF) <5%, (iii) Hardy-Weinberg equilibrium (HWE) p-value <0.001 for parental alleles, (iv) >10% Mendelian error rate, and finally (v) linkage disequilibrium (LD) of r2 = 1 with other SNPs (to exclude SNPs with redundant information due to complete LD). Overall, genotypes for 550 families with isolated CPO were available for the current analyses. Criteria for excluding individuals were: (vi) >10% missing genotype within an individual, and (vii) >5% Mendelian errors within a family. Table 7 provides the total number of individuals after the above pruning. Because none of the families had Mendelian error rates >5%, they were all retained in the analyses. The total number of SNPs remaining after quality control is shown in Table 8, along with the different criteria used for pruning.
Table 7

Individuals before and after pruning.

Total individuals1659
Genotype call rate <10%84
Missing phenotype217
*Remaining parents783
*Remaining probands575
 -with two parents475
 -with one missing parent84
 -with two missing parents16

*Remaining individuals refer to those without missing phenotype or genotype call rate <10%.

Table 8

SNPs before and after pruning.

Total569244
Failed HWE-test80566
Failed missingness test2034
Failed SNP frequency test57442
Mendelian errors detected1129
r2 = 1 with flanking SNPs6170
*Remaining SNPs424401

*Remaining SNPs refer to those without deviations from HWE (p<0.001), more than 5% missed calls, minor allele frequencies <5%, or Mendelian errors >1%.

Genotyping rate in remaining individuals was above 0.998.

Note that a SNP may be excluded for failing more than one test criterion. Hence, the numbers do not necessarily add up.

*Remaining individuals refer to those without missing phenotype or genotype call rate <10%. *Remaining SNPs refer to those without deviations from HWE (p<0.001), more than 5% missed calls, minor allele frequencies <5%, or Mendelian errors >1%. Genotyping rate in remaining individuals was above 0.998. Note that a SNP may be excluded for failing more than one test criterion. Hence, the numbers do not necessarily add up.

Statistical analysis

All analyses were conducted using the statistical software package Haplin, http://people.uib.no/gjessing/genetics/software/haplin. Haplin is particularly tailored to the analysis of offspring-parent trios and duos, but is also applicable to case-control data [17]. It is implemented as a package in the statistical programming language R [36]. We applied the function haplinSlide to analyze all SNPs sequentially. For each SNP, a log-linear maximum likelihood model is applied to the trio genotype frequencies, allowing different risk of disease (penetrance) depending on the parent of origin of the allele. The effect of each SNP was assumed to be multiplicative in allele dose, with the most common (major) allele used as reference. Missing alleles were imputed using the EM-algorithm; standard errors and p-values were corrected for this imputation [17]. The following section outlines how the PoOxE effects are computed in Haplin. First, a PoO analysis is performed for each stratum of an exposure, where S = 0 represents the unexposed and S = 1 the exposed. The PoO analysis in stratum S computes two relative risks for a maternally inherited allele, and for a paternally inherited allele, where a0 is the reference allele, a1 is the alternative allele, and “a” denotes any one of the two alleles. The PoO relative risk ratio (RRRPoO) then compares the two separate relative risks, so that RRRPoO = 1 means a1 increases (or decreases) the risk by the same amount regardless of whether the allele is maternally or paternally inherited. Next, Haplin compares the RRRPoO for all strata. In the case of two strata, S = 0 represents the unexposed and S = 1 the exposed, and Haplin tests whether RRRPoO(0) = RRRPoO(1). The test is performed as a Wald test by exploiting the fact that the estimated log(RRRPoO(0)) and log(RRRPoO(1)) are independent and asymptotically normally distributed, as outlined in Skare et al. (2012) [14] and Gjerdevik et al. (2017) [16]. P-values from the PoOxE analyses were displayed in a QQ-plot, with expected p-values plotted against the observed. Under the null hypothesis of no PoOxE effect, all SNPs should lie along the diagonal line representing a uniform distribution, whereas significant SNPs are expected to appear markedly above the diagonal line and outside the confidence bands. To visualize the strength of the association signal and regional information flanking the most significant SNPs, we used a modified version of the R-script for regional plots available at http://www.broadinstitute.org/files/shared/diabetes/scandinavs/assocplot.R. The plot also displays the degree of LD between top SNPs and neighboring SNPs, recombination patterns, and positional information about genes in the region [37]. To assess the a priori power to detect PoOxE effects with our model, we performed power simulations based on 1000 replications and a significance level of 0.05 (Fig 6). The black line shows the power for a PoOxE analysis based on 500 case-parent trios (consistent with the sample size in this study), a MAF of 0.20, and equally-sized exposed and unexposed groups. The left panel of Fig 6 depicts different sample sizes and the middle panel depicts different MAFs. The right panel shows the power for different etiologic scenarios (child, PoO, GxE, and PoOxE). The child effect is the direct risk associated with the allele when it is carried by the child, regardless of parental origin or environmental exposures. The PoO effect is the risk associated with maternally-inherited alleles relative to paternally-inherited alleles. The GxE effect is the ratio of RRs in the two exposure groups. Finally, the PoOxE effect is the maternal to paternal risk ratio for the exposed divided by the same ratio for the unexposed.

Ethics approvals

This specific study did not need approval from an ethics committee because ethics approvals for the consortium were obtained from the respective ethics committees at each institution contributing data to the consortium. Details have been provided in our original publication [19].

Manhattan plots for the different exposures in the analyses of the pooled sample.

SNPs with p-values below 10−5 are in blue. (TIFF) Click here for additional data file.

Manhattan plots for the different exposures in the analyses of the European sample.

SNPs with p-values below 10−5 are in blue. (TIFF) Click here for additional data file.

Manhattan plots for the different exposures in the analyses of the Asian sample.

(TIFF) Click here for additional data file.

Example code for PoOxE analysis.

(DOCX) Click here for additional data file.
  33 in total

1.  Prevalence of diabetes among men and women in China.

Authors:  Wenying Yang; Juming Lu; Jianping Weng; Weiping Jia; Linong Ji; Jianzhong Xiao; Zhongyan Shan; Jie Liu; Haoming Tian; Qiuhe Ji; Dalong Zhu; Jiapu Ge; Lixiang Lin; Li Chen; Xiaohui Guo; Zhigang Zhao; Qiang Li; Zhiguang Zhou; Guangliang Shan; Jiang He
Journal:  N Engl J Med       Date:  2010-03-25       Impact factor: 91.245

2.  Maternal effects as the cause of parent-of-origin effects that mimic genomic imprinting.

Authors:  Reinmar Hager; James M Cheverud; Jason B Wolf
Journal:  Genetics       Date:  2008-02-01       Impact factor: 4.562

Review 3.  DNA methylation: roles in mammalian development.

Authors:  Zachary D Smith; Alexander Meissner
Journal:  Nat Rev Genet       Date:  2013-02-12       Impact factor: 53.242

4.  Evidence for gene-environment interaction in a genome wide study of nonsyndromic cleft palate.

Authors:  Terri H Beaty; Ingo Ruczinski; Jeffrey C Murray; Mary L Marazita; Ronald G Munger; Jacqueline B Hetmanski; Tanda Murray; Richard J Redett; M Daniele Fallin; Kung Yee Liang; Tao Wu; Poorav J Patel; Sheng-Chih Jin; Tian Xiao Zhang; Holger Schwender; Yah Huei Wu-Chou; Philip K Chen; Samuel S Chong; Felicia Cheah; Vincent Yeow; Xiaoqian Ye; Hong Wang; Shangzhi Huang; Ethylin W Jabs; Bing Shi; Allen J Wilcox; Rolv T Lie; Sun Ha Jee; Kaare Christensen; Kimberley F Doheny; Elizabeth W Pugh; Hua Ling; Alan F Scott
Journal:  Genet Epidemiol       Date:  2011-05-26       Impact factor: 2.135

5.  Expression of Dpp6 in mouse embryonic craniofacial development.

Authors:  Juan Du; Zhipeng Fan; Xin Ma; Yuanrong Gao; Yan Wu; Shuhong Liu; Yan Shen; Ming Fan; Songlin Wang
Journal:  Acta Histochem       Date:  2010-09-03       Impact factor: 2.479

Review 6.  Human cytochrome P450 4F3: structure, functions, and prospects.

Authors:  Laurent Corcos; Danièle Lucas; Catherine Le Jossic-Corcos; Yvonne Dréano; Brigitte Simon; Emmanuelle Plée-Gautier; Yolande Amet; Jean-Pierre Salaün
Journal:  Drug Metabol Drug Interact       Date:  2012-04-19

7.  Familial recurrence-pattern analysis of nonsyndromic isolated cleft palate--a Danish Registry study.

Authors:  K Christensen; L E Mitchell
Journal:  Am J Hum Genet       Date:  1996-01       Impact factor: 11.025

8.  Risk of oral clefts in twins.

Authors:  Dorthe Grosen; Camilla Bille; Inge Petersen; Axel Skytthe; Jacob von Bornemann Hjelmborg; Jacob Krabbe Pedersen; Jeffrey Clark Murray; Kaare Christensen
Journal:  Epidemiology       Date:  2011-05       Impact factor: 4.822

9.  Application of a novel hybrid study design to explore gene-environment interactions in orofacial clefts.

Authors:  Oivind Skare; Astanand Jugessur; Rolv Terje Lie; Allen James Wilcox; Jeffrey Clark Murray; Astrid Lunde; Truc Trung Nguyen; Håkon Kristian Gjessing
Journal:  Ann Hum Genet       Date:  2012-05       Impact factor: 1.670

10.  Epigenetics and Orofacial Clefts: A Brief Introduction.

Authors:  Gemma C Sharp; Evie Stergiakouli; Jonathan Sandy; Caroline Relton
Journal:  Cleft Palate Craniofac J       Date:  2018-02-26
View more
  8 in total

1.  Parent-of-origin-environment interactions in case-parent triads with or without independent controls.

Authors:  Miriam Gjerdevik; Øystein A Haaland; Julia Romanowska; Rolv T Lie; Astanand Jugessur; Håkon K Gjessing
Journal:  Ann Hum Genet       Date:  2017-11-02       Impact factor: 1.670

2.  A Genome-Wide Search for Gene-Environment Effects in Isolated Cleft Lip with or without Cleft Palate Triads Points to an Interaction between Maternal Periconceptional Vitamin Use and Variants in ESRRG.

Authors:  Øystein A Haaland; Rolv T Lie; Julia Romanowska; Miriam Gjerdevik; Håkon K Gjessing; Astanand Jugessur
Journal:  Front Genet       Date:  2018-02-26       Impact factor: 4.599

3.  Analysis of Parent-of-Origin Effects on the X Chromosome in Asian and European Orofacial Cleft Triads Identifies Associations with DMD, FGF13, EGFL6, and Additional Loci at Xp22.2.

Authors:  Øivind Skare; Rolv T Lie; Øystein A Haaland; Miriam Gjerdevik; Julia Romanowska; Håkon K Gjessing; Astanand Jugessur
Journal:  Front Genet       Date:  2018-02-22       Impact factor: 4.599

4.  Reciprocal F1 Hybrids of Two Inbred Mouse Strains Reveal Parent-of-Origin and Perinatal Diet Effects on Behavior and Expression.

Authors:  Daniel Oreper; Sarah A Schoenrock; Rachel McMullan; Robin Ervin; Joseph Farrington; Darla R Miller; Fernando Pardo-Manuel de Villena; William Valdar; Lisa M Tarantino
Journal:  G3 (Bethesda)       Date:  2018-11-06       Impact factor: 3.154

5.  A genome-wide scan of cleft lip triads identifies parent-of-origin interaction effects between ANK3 and maternal smoking, and between ARHGEF10 and alcohol consumption.

Authors:  Øystein Ariansen Haaland; Julia Romanowska; Miriam Gjerdevik; Rolv Terje Lie; Håkon Kristian Gjessing; Astanand Jugessur
Journal:  F1000Res       Date:  2019-06-24

6.  Genome-wide Interaction Study Implicates VGLL2 and Alcohol Exposure and PRL and Smoking in Orofacial Cleft Risk.

Authors:  Jenna C Carlson; John R Shaffer; Fred Deleyiannis; Jacqueline T Hecht; George L Wehby; Kaare Christensen; Eleanor Feingold; Seth M Weinberg; Mary L Marazita; Elizabeth J Leslie
Journal:  Front Cell Dev Biol       Date:  2022-02-10

7.  Gene-methylation interactions: discovering region-wise DNA methylation levels that modify SNP-associated disease risk.

Authors:  Julia Romanowska; Øystein A Haaland; Astanand Jugessur; Miriam Gjerdevik; Zongli Xu; Jack Taylor; Allen J Wilcox; Inge Jonassen; Rolv T Lie; Håkon K Gjessing
Journal:  Clin Epigenetics       Date:  2020-07-16       Impact factor: 6.551

Review 8.  Non-syndromic Cleft Palate: An Overview on Human Genetic and Environmental Risk Factors.

Authors:  Marcella Martinelli; Annalisa Palmieri; Francesco Carinci; Luca Scapoli
Journal:  Front Cell Dev Biol       Date:  2020-10-20
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.