Literature DB >> 31817908

Non-Syndromic Cleft Lip with or without Cleft Palate: Genome-Wide Association Study in Europeans Identifies a Suggestive Risk Locus at 16p12.1 and Supports SH3PXD2A as a Clefting Susceptibility Gene.

Iris Alm van Rooij1, Kerstin U Ludwig2, Julia Welzenbach2, Nina Ishorst2, Michelle Thonissen3, Tessel E Galesloot1, Edwin Ongkosuwito3, Stefaan J Bergé4, Khalid Aldhorae5, Augusto Rojas-Martinez6, Lambertus Alm Kiemeney1,7, Joris Robert Vermeesch8, Han Brunner9,10, Nel Roeleveld1, Koen Devriendt11, Titiaan Dormaar12,13, Greet Hens14, Michael Knapp15, Carine Carels8,9,16, Elisabeth Mangold2.   

Abstract

Non-syndromic cleft lip with or without cleft palate (nsCL/P) ranks among the most common human congenital malformations, and has a multifactorial background in which both exogenous and genetic risk factors act in concert. The present report describes a genome-wide association study (GWAS) involving a total of 285 nsCL/P patients and 1212 controls from the Netherlands and Belgium. Twenty of the 40 previously reported nsC/LP susceptibility loci were replicated, which underlined the validity of this sample. SNV-based analysis of the data identified an as yet unreported suggestive locus at chromosome 16p12.1 (p-value of the lead SNV: 4.17 × 10-7). This association was replicated in two of three patient/control replication series (Central European and Yemeni). Gene analysis of the GWAS data prioritized SH3PXD2A at chromosome 10q24.33 as a candidate gene for nsCL/P. To date, support for this gene as a cleft gene has been restricted to data from zebrafish and a knockout mouse model. The present GWAS was the first to implicate SH3PXD2A in non-syndromic cleft formation in humans. In summary, although performed in a relatively small sample, the present GWAS generated novel insights into nsCL/P etiology.

Entities:  

Keywords:  cleft lip with or without cleft palate; congenital malformation; genome-wide association study; orofacial cleft

Mesh:

Substances:

Year:  2019        PMID: 31817908      PMCID: PMC6947597          DOI: 10.3390/genes10121023

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

Orofacial clefting represents the second most common congenital malformation in humans, after various forms of heart defects all combined [1]. Despite advances in surgical correction, the disorder has lifelong implications for the health and social integration of those affected. Improved understanding of cleft etiology may facilitate development of new preventative measures and therapeutic approaches, and may improve genetic counseling for families at risk. Clefting can occur either as part of a complex malformation syndrome or as an isolated anomaly, and several cleft subphenotypes have been defined according to the affected anatomical structures. The most frequent subphenotype is non-syndromic cleft lip with or without cleft palate (nsCL/P). In European populations, the estimated prevalence of nsCL/P is around 1:1000 [1]. The etiology of nsCL/P is multifactorial, whereby genetic risk factors, environmental exposures, and potential gene–environment interactions all contribute to disease susceptibility. The estimated contribution of all combined genetic factors to nsCL/P is 90% [2]. Since 1989, diverse genetic approaches have been used to identify genes and pathways underlying nsCL/P, including linkage and candidate gene studies. However, prior to the commencement of the genomics era around a decade ago, extensive research efforts had identified only two common genetic factors that could be considered true nsCL/P-associated risk factors: (1) the regulatory region of the Interferon Regulatory Factor 6 (IRF6), which was identified in a candidate gene association study; and (2) the Forkhead Box E1 (FOXE1) risk locus, which was identified in a meta-analysis of linkage data [3,4]. In the genomics era, new DNA sequencing techniques have enabled whole-exome sequencing, which has led to the identification of potential nsCL/P susceptibility variants in Cadherin 1 (CDH1) and a small number of other genes [5,6,7]. Variants detected to date via exome sequencing have been dominant, heterozygous, and can be important for the respective family as carriers of such variants can be at high risk. However, these are rare findings, and currently explain only a small fraction of patients. Many key genetic findings in nsCL/P have resulted from another powerful tool of the genomics era, i.e., the genome-wide association study (GWAS) and follow-up analysis approach. To date, this approach has identified a total of 38 common risk loci for nsCL/P [8,9,10,11,12,13,14,15,16,17,18,19,20]. At the time of writing, a total 40 common nsCL/P risk loci are known. However, these account for only a modest proportion of the genetic variance of nsCL/P, e.g., up to 30% of the narrow-sense heritability in the European population [19]. Thus, the existence of further common risk loci must be assumed. Given the tremendous success of the nsCL/P GWAS and follow-up study approach over the past decade, further GWAS appear to be warranted. The present report describes a GWAS in a medium-sized nsCL/P case/control sample of European ethnicity recruited in the Netherlands and Belgium. In addition to the SNV-wise evaluation of data, gene-based and pathway analyses were performed. In these analyses, genetic marker data were aggregated to the level of whole genes or biological processing to test the joint association of all markers in the gene with the phenotype of interest, thereby increasing statistical power.

2. Materials and Methods

2.1. Ethics Statement

The study was approved by the Regional Committee on Research Involving Human Subjects Arnhem-Nijmegen, and the Review Board for Clinical Studies of the University Hospital KU Leuven. All study procedures were performed in accordance with the principles of the Declaration of Helsinki. The NBS protocol was approved by the Regional Committee on Research Involving Human Subjects Arnhem-Nijmegen. Written informed consent was obtained from all patients/participants, or their parents/legal guardians in the case of legal minors.

2.2. GWAS Patients

Patients were part of the AGORA project (Aetiological Research into Genetic and Occupational/Environmental Risk Factors for Anomalies in Children) [21]. The AGORA project commenced in 2005 and has established a large data and biobank of DNA samples and clinical and questionnaire data from: (i) children with congenital malformations or childhood cancer, (ii) their respective parents, and (iii) controls. Patients are recruited at two sites: (1) the Radboud University Medical Center (Radboudumc) Nijmegen, The Netherlands; and (2) University Hospital KU, Leuven, Belgium. Only patients with nsCL/P were eligible for inclusion in the present study. Patients with other cleft phenotypes, such as an isolated cleft palate, and patients with a possible specific malformation syndrome, intellectual disability, or other anomalies were excluded from the present study. In the Nijmegen initiative, collection of data and biomaterials from patients with clefting commenced in 2007. Intraoperative blood or saliva samples were collected, and parents were asked to complete a questionnaire and to donate blood or saliva samples. The questionnaire addresses demographics and family history, among other factors. At the beginning of 2012, three “biomaterial donation days” were arranged to collect blood or saliva from clefting patients who had been treated at Radboudumc prior to 2007. At these sessions, blood or saliva samples and questionnaire data were also collected from parents. In the Leuven initiative, collection of data and biomaterials from patients with clefting commenced in 2010. Blood samples and clinical and questionnaire data (e.g., demographics and family history) were collected from pediatric patients and their parents. At both Leuven and Nijmegen clinical sites, all pediatric clefting patients underwent a clinical examination by an orthodontist, a maxillofacial surgeon, a plastic surgeon, and a clinical geneticist from the Cleft Lip and Palate team. To determine phenotype classification in the present cohort, data were retrieved from the respective medical charts. The present analyses were performed using DNA samples from nsCL/P patients who were treated and followed up at: (1) Radboudumc (n = 219); or (2) University Hospital of KU Leuven (n = 66). The initial patient sample therefore comprised 285 nsCL/P patients. All patients were of self-reported European ancestry. Ancestral background in the Dutch and Belgian samples was assessed by identifying the origins of the grandparents and the parents, respectively.

2.3. GWAS Controls

Population-based controls (n = 1212) were drawn from the Nijmegen Biomedical Study (NBS), a population-based survey conducted by the Department for Health Evidence and the Department of Laboratory Medicine of the Radboud University Medical Center. This cohort was established to generate a universal reference population for the investigation of genetic variation and lifestyle and environmental exposures for a variety of traits and diseases in case/control studies [22]. A total of 22,451 age- and sex-stratified, randomly selected inhabitants of the municipality of Nijmegen received an invitation to fill out a postal questionnaire on lifestyle and medical history, and to donate an 8.5 mL blood sample in a serum separator tube and a 10 mL EDTA blood sample. The overall response to the questionnaire was 42% (n = 9350), and 69% (n = 6468) of the respondents donated blood samples.

2.4. GWAS Genotyping and Quality Control (QC)

Within the AGORA project, standard methods are used to extract DNA from blood collected in EDTA tubes or saliva specimens collected in ORAgene containers (DNA Genotek Inc., Ottawa, Canada). In the present study, genotyping of the 285 nsCL/P patients from AGORA was performed using the Illumina OmniExpressExome Array (Illumina, San Diego, CA, USA). Within the NBS project, DNA was genotyped using Illumina chips. For the present analyses, Illumina’s OmniExpress Array (Illumina, San Diego, CA, USA) genotype data for the 1212 NBS participants were available. A total of 718,286 SNVs were present in both patients and controls. Exclusion criteria for patient and control data were: (i) any discrepancy between documented and genotyped sex; (ii) a call rate of <99%; or (iii) evidence from multidimensional scaling (MDS) of ethnic outlier status (Figure S1). Un-relatedness between individuals was evaluated using the program KING [23]. Markers were excluded from the analysis if the minor allele frequency (MAF) was <1% or the call rate was <95% in either patients or controls. In addition, markers were excluded if there was deviation from Hardy–Weinberg equilibrium at p < 10−4 in controls or p < 10−6 in patients.

2.5. Imputation of GWAS Data

The combined post-QC dataset for patients and controls was subjected to imputation using the June 2014 release of the 1000 Genomes Project and the program IMPUTE2 [24]. SNVs with an INFO score ≥ 0.4 and a MAF > 1% were then tested for association using SNPTEST [25]. For the logistic regression analysis, the first five components obtained from MDS were used as covariates.

2.6. Genome-Wide Association Analysis

Four patients were excluded due to gender incompatibility (discrepancy between documented and genotyped sex). Two patients and one control sample were excluded due to a call rate < 99%. Five patients and 24 controls were excluded due to relatedness. Fifteen patients were excluded due to ethnic outlier status. Following these QC procedures, 259 patients and 1187 controls remained for further analysis. In total, 8,785,346 imputed SNVs with an INFO score ≥ 0.4 and a MAF > 1% were finally tested. For each SNV passing QC, a logistic regression model was considered, with the additively coded SNV as the predictor variable and the first five MDS coordinates as covariates. The p-value of the likelihood ratio test of no association was then calculated.

2.7. Replication Analysis for Two Interesting SNVs

Two SNVs were of particular interest, as they had small p-values and were located in regions not reported to be nsCL/P risk loci. For these two interesting candidate SNVs (rs73145631 at chromosome 12, and rs56383345 at chromosome 16), replication was performed by genotyping these SNVs in three case/control nsCL/P samples: (i) 223 nsCL/P patients and 978 controls of Central European descent (sample presented in Mangold et al., 2010); (ii) 156 nsCL/P patients and 337 controls from the Chiapas, Mexico (sample presented in Rojas-Martinez et al. 2013); and (iii) 231 nsCL/P patients and 422 controls from Yemen (sample presented in Böhmer et al., 2014) [10,26,27]. A MassARRAY genotyping assay was designed using the Assay Design Suite Software v 1.0 Software (AGENA Bioscience, San Diego, USA). The genotyping assay contained these two SNVs of interest and 26 SNVs from other projects. SNVs were genotyped using the MassARRAY system and end-point PCR, followed by matrix-assisted laser desorption/ionization time-of-flight (MALDI-ToF) mass spectrometry (AGENA Bioscience, San Diego, USA). Data analysis was performed using the AGENA Spectrodesigner Software package. Individual genotypes were assigned using the AGENA Typer Analysis software. Primers were synthesized at Metabion, Germany (individual primer sequences available upon request). Intra- and interplate duplicates were included for quality control purposes. No genotype inconsistencies were observed. Association statistics were calculated by applying the Armitage-trend test separately for each sample cohort. For each SNV, relative risks of the three replication cohorts were combined using fixed-effect meta-analysis.

2.8. Prioritization of Candidate Genes

To prioritize candidate genes from the present GWAS dataset, SNV data with an INFO score > 0.6 were uploaded into FUMAGWAS (https://fuma.ctglab.nl/) [28]. Gene and gene-set analyses implemented in FUMAGWAS were based on GWAS summary statistics. In the present study, these analyses were performed with MAGMA, a tool that can be used for gene and gene-set analysis from GWAS data [29]. Input SNVs were mapped to 18,644 protein coding genes. Genome wide significance was defined as a p-value of 0.05/18,644 = 2.682 × 10−6.

2.9. Expression Analysis using SysFACE

Evaluation of expression of a given candidate gene and its neighboring genes was performed using the bioinformatics tool SysFACE (systems tool for craniofacial expression-based gene discovery; https://bioinformatics.udel.edu/research/sysface). For each gene, expression data for various time-points of murine embryonic development in organs specific for the phenotype under study (maxilla, frontonasal, palate) were identified from microarray-based genome-level gene expression profiles across various mouse embryonic orofacial tissues.

3. Results

3.1. SNV-Based Analysis

A total of 615,168 autosomal markers passed QC for both samples. The genomic inflation factor lambda was 1.044. The Q–Q plot is shown in Figure S2. In the imputed GWAS data, 228 SNVs at a total of 25 different loci yielded p-values < 10−5 and an INFO score > 0.8 (Table 1, Table S1, Figure S3).
Table 1

Twenty-five nsCL/P risk loci achieving genome-wide significant or suggestive evidence for association in imputed Dutch/Belgian genome-wide association study (GWAS).

LeadSNV_ID 1ChromosomePos (hg19)Alleles 2Case Frequency AControl Frequency AAll ORp-Value 3
rs36068947119003293G/GC0.5920.7021.621.06 × 10-6
rs11247713128230494A/G0.1650.2611.782.54 × 10-6
rs12404189181851687T/C0.9510.8990.464.48 × 10-6
rs1457946471213880572A/G0.9150.9642.463.17 × 10-6
rs12663811236681990A/G0.680.7811.681.49 × 10-6
rs1342938922564600G/A0.7130.8071.686.34 × 10-6
rs14319032168468315T/C0.1130.1921.868.76 × 10-6
rs112762347513103319G/A0.9520.9863.565.80 × 10-6
rs48680995170982499T/C0.1560.0890.534.29 × 10-6
rs14110917472926872G/GA0.9240.8560.493.95 × 10-6
rs987525 8129946154C/A0.610.7481.898.73 × 10-11
rs153546210102973872A/G0.6230.5180.655.69 × 10-6
rs491805210105555131G/A0.3030.2030.582.34 × 10-6
rs1777030710115259535G/C0.9840.9420.262.59 × 10-6
rs1482486231223265077CA/C0.9070.9612.521.38 × 10-6
rs79800901267951884C/A0.9080.9582.282.35 × 10-6
rs73145631 12101109530G/A0.9850.9360.231.99 × 10-8
rs5681451112125789014C/T0.6080.7051.559.01 × 10-6
rs1844671329622636G/T0.4280.3160.621.53 × 10-6
rs105207881596126414T/C0.9730.9240.346.19 × 10-6
rs563833451626344915G/C0.9220.8450.464.17 × 10-7
rs116409521678093932G/T0.6930.5870.636.22 × 10-6
rs72155551729564603G/A0.2310.3311.643.28 × 10-6
rs612967041775721588G/A0.9320.9722.558.22 × 10-6
rs735124491918123050G/C0.8880.9462.197.00 × 10-6

1 = genome-wide significant SNVs, 2 major allele first, risk allele underlined, 3 only markers with a p-value < 10−5 are presented. Pos = position, OR = odds ratio. The 25 loci also included an interesting locus at chromosome 16p12.1. This locus gave a suggestive p-value (5 × 10−8 < p < 10−5), and its lead SNV was rs56383345, with a p-value of 4.17 × 10−7 (Figure 1).

Genome-wide significance (p < 5 × 10−8) was reached by 63 SNVs at two loci. One of the genome-wide significant SNVs was rs73145631 at 12q23.1, a locus that has not been reported as an nsCL/P risk locus previously. It was subjected to a replication step in three independent case/control samples (Table 2). In none of the three replication samples, nor after combining the replication data in a meta-analysis was the association for this SNV replicated. The other 62 genome-wide significant SNVs were located at an already well-known nsCL/P susceptibility locus at chromosome 8q24.21. This locus was initially identified in the Central European case/control sample that served as a replication sample for the present study [8]. It was later replicated in many other samples—among others, the Mexican and the Yemeni replication samples that were used for replication in the present study [26,27]. This locus was therefore not included in the replication step of the present study.
Table 2

Replication of two interesting SNVs in three replication samples of different biogeographical backgrounds.

SNV-IDp-Values after Genotyping of SNV in Respective Replication Sample
SNV-IDChr.Pos. (hg19)Bonn 1Mexico 2Yemen 3All 4OR (95% CI)
rs7314563112q23.11011095300.544n.a.*0.7140.4881.18 (0.74–1.90)
rs5638334516p12.126344915 0.027 0.577 ** 0.0099 0.00167 1.53 (1.17–1.98)

Bold = nominal significant result, italics = risk allele in this subsample not identical with risk allele in Dutch/Belgian discovery sample, Chr. = chromosome, pos. = position, OR = odds ratio, CI = confidence interval, n.a. = not applicable. * MAF below 1% in controls, therefore excluded (notable: rs17447439: 0.9% in controls vs. 1.7% in cases), ** 1.7% in cases and 1.2% in controls. 1 nsCL/P case control sample of 223 nsCL/P patients and 978 controls of Central European descent (no overlap with Bonn GWAS). 2 156 nsCL/P patients and 337 controls from the Chiapas, Mexico. 3 231 nsCL/P patients and 422 controls from Yemen. 4combined analysis of Bonn, Mexico, and Yemen.

This locus had not been reported as a recognized nsCL/P risk locus in previous studies. Therefore, rs56383345 was also subjected to replication in the three independent patient/control samples (Table 2). A nominally significant p-value was obtained for rs56383345 in both the European and the Yemeni sample (0.027 and 0.0099 respectively). The p-value obtained after combination of all three replication samples by meta-analysis was also nominally significant (p = 0.00167). However, no association signal was obtained in the Mexican sample only (p = 0.577). Rs56383345 is located in a 930 kb non-coding region. Evaluation of the 16p12.1 region with the bioinformatics tool SysFACE did not implicate any flanking genes as cleft candidate genes.

3.2. Replication of Previously Reported nsCL/P Susceptibility Loci

Of the 40 previously reported nsCL/P susceptibility loci, 20 showed at least a nominally significant p-value (p < 0.05) in the present dataset (Table 3). For three of these loci, the lead SNV from the literature and the lead SNV in the present dataset were identical. At 15 of the 20 loci, a SNV in substantial linkage disequilibrium (LD) (r > 0.6) with the lead SNV from the literature achieved a lower p-value. For two of the 20 loci, the lead SNVs were not in LD with the lead SNV from the literature.
Table 3

P-values for 40 literature nsCL/P risk loci in imputed Dutch/Belgian GWAS.

Literature Risk Locus 1Literature Lead SNV at Respective LocusBetter Dutch/Belgian GWAS lead SNV at Respective Locus
LocusOriginal StudySNVPos. (hg19)p-Value 2ORSNV 3Pos (hg19)p-Value 2,3OR
1p36 Ludwig et al. 2012 [12]rs74207118979874 2.11 × 10-5 1.54 rs36068947 19003293 1.06272 × 10-6 1.62
1p22Beaty et al. 2010 [11]rs560426945534380.2994460.90rs952499945584250.06162170.83
1q32.1 Rahimov et al. 2008 [3]rs642961209989270 0.00055997 0.69----
2p25.1Yu et al. 2017 [17]rs28798299724420.8988771.00----
2p24.2 Leslie et al. 2016 [16]rs755216733928 4.84 × 10-5 1.52rs6212269316734878 3.10168 × 10-5 1.56
2p21PKDCC Ludwig et al. 2017 [19]rs674096042181679 0.00745325 1.28 rs17029056 42158304 0.00011259 1.52
2p21THADALudwig et al. 2012 [12]rs7590268435401250.131981.20rs6544652436262120.08819381.23
3p11.1 Ludwig et al. 2012 [12]rs763242789534377 0.0342897 0.81rs379257289456555 0.0101378 0.77
3q12.1 Beaty et al. 2013 [13]rs79346499626028 8.21 × 10-5 0.70rs983213499836722 4.67043 × 10-5 0.66
3q28 Leslie et al. 2017 [20]rs76479869189553372 0.00247675 1.77rs17447439189549423 5.67543 × 10-5 2.26
3q29 Mostowska et al. 2018 [18]rs3382171970269270.09860850.85rs34099552196799735 0.0228988 1.27
4p16.2Yu et al. 2017 [17]rs190798948189250.7517761.05rs1093789348104910.6497520.94
4q28.1Yu et al. 2017 [17]rs9088221249062570.6575990.89rs768373041248681110.5328670.86
5p12Yu et al. 2017 [17]rs10462065440688460.2583441.18rs139738798441834190.1093771.26
8p11.23Yu et al. 2017 [17]rs13317382695140.7744450.97rs75168396380144290.30381.11
8q21 Ludwig et al. 2012 [12]rs1254331888868340 0.00118317 0.73----
8q22.1Yu et al. 2017 [17]rs957448955413020.5600230.94rs4442106956094880.06269760.83
8q24 Birnbaum et al. 2009 [8]rs987525129946154 8.73 × 10-11 1.89----
9q22.2 Yu et al. 2017 [17]rs1090890292224825 0.0178356 1.31rs203197092204172 0.00252225 1.41
9q22.32Yu et al. 2017 [17]rs10512248982597030.09387440.84rs28591501982786440.06964480.82
9q21.33 Moreno et al. 2009 [4]rs3758249100614140 0.00519618 1.32rs7033765100591705 0.00127545 1.38
10q25 Mangold et al. 2010 [10]rs7078160118827560 0.00019291 1.60rs5788208118836076 0.000106433 1.63
12q13.13Yu et al. 2017 [17]rs3741442533467500.1054650.31-4-4-4-4
12q13.2 Yu et al. 2017 [17]rs70570456435412 0.0418832 1.22rs77310756369506 0.0210978 1.26
12q21.1Yu et al. 2017 [17]rs2304269720802720.4833160.86rs11178895720894110.3218040.84
13q31.1 Ludwig et al. 2012 [12]rs800164180692811 0.00268945 1.34rs1184164680679302 0.00135074 1.37
14q22.1Ludwig et al. 2017 [19]rs4901118518561090.3676071.10rs60454187518565660.2797050.9
14q22.1Yu et al. 2017 [17]rs7148069518396450.7318531.03----
14q32.13Yu et al. 2017 [17]rs1243572953794990.3434131.12rs1243561953698860.2011461.16
15q13 Ludwig et al. 2016 [15]rs1258763330504230.05214381.19rs1332931033052553 0.0499003 0.83
15q22.2Ludwig et al. 2012 [12]rs1873147633126320.4698811.08rs12902152633139680.1676011.19
15q24Ludwig et al. 2017 [19]rs28689146750055750.4423031.08----
16p13.3 Sun et al. 2015 [14]rs804936739804450.1590991.13rs110767923968567 0.00316807 1.33
17q13.1Beaty et al. 2010 [11]rs989144689354160.1863881.14----
17q21.32 Yu et al. 2017 [17]rs183810545008935 0.00231174 0.73rs19790744982081 0.000282387 1.45
17q22Mangold et al. 2010 [10]rs227727547769550.1800571.14----
17q23.2 Leslie et al. 2016 [16]rs158836661076428 0.00901131 0.75rs7284314561052949 0.0057287 0.72
19p13.3Ludwig et al. 2017 [19]rs374610120508230.9292810.95----
19q12Leslie et al. 2016 [16]rs73039426335209610.269591.20----
20q12 Beaty et al. 2010 [11]rs1304124739269074 0.0101081 0.77rs3475352239278391 0.00037345 0.69

pos. = position, OR = odds ratio, - = no SNV with smaller p-value at this locus compared to literature lead SNV. 1 bold if locus reached a nominal significant p-value in Dutch/Belgian GWAS. 2 p-value in boldface if nominal significant. 3normal letters if in LD with literature lead SNV (r2 > 0.6; r2 taken from LD Link), italics if not in LD with literature lead SNV and p < 10−4. 4 MAF of lead SNV in Dutch/Belgian GWAS below 0.01, did not pass QC.

3.3. Gene-Based Evaluation and Gene-Set Analysis

For the gene-based evaluation, genome-wide significance was set by FUMAGWAS at p < 2.672 × 10−6 and was achieved by two different genes: (i) SH3 And PX Domains 2A (SH3PXD2A) at chromosome 10q24.33 (p = 1.82 × 10−8); and (ii) anoctamin 4 (ANO4) at chromosome 12q23.1 (p = 1.46×10−7) (Figure 2, Figure 3, Figure S4, Table S2). To date, neither of these genes has been proposed as a candidate gene for clefting in humans. For ANO4, the SysFACE evaluation revealed no expression at any relevant embryonic time-points in tissues of relevance to lip and palate development in the mouse model (Figure S5). However, Growth Arrest Specific 2 Like 3 (GAS2L3), which is located upstream of ANO4, is expressed in the murine maxilla at E11.5 to E12.5, and Insulin-Like Growth Factor 1 (IGF1), which is located downstream, is expressed in the murine maxilla at E11.5 to E12.5 and also in the murine palate at E14.5.
Figure 2

Regional association plot for the candidate gene SH3PXD2A.

Figure 3

An unreported suggestive locus at chromosome 12q23.1 (a) Regional association plot showing a genome-wide significant marker upstream of the anoctamin 4 (ANO4) gene. It is of note that in our dataset, there were no SNVs in LD with the lead SNV rs73145631. (b) Regional association plot for the ANO4 gene.

Genes with a suggestive p-value (2.672 × 10−6 < p < 10−4) in the gene analysis included the gene Paired Box 7 (PAX7) at chromosome 1p36.13. PAX7 is a well-known cleft candidate gene (Table 4). The PAX7 locus has shown genome-wide association in two previous studies, and rare, potentially pathogenic variants located in or near PAX7 have been reported in patients with orofacial clefting [30,31,32,33].
Table 4

Gene analysis with MAGMA as implemented in FUMAGWASGene 1.

Chromosomep-Value 1
SH3PXD2A 10 2.3729 × 10−8
ANO4 12 1.4588 × 10−7
CMSS1 32.9078 × 10−6
FILIP1L 37.1274 × 10−6
CLEC3A 161.0871 × 10−5
PAX72 11.9157 × 10−5
CCDC140 24.4909 × 10−5
ESR1 67.1554 × 10−5
IFITM3 118.5273 × 10−5
PANX1 118.7342 × 10−5
BLMH 179.5824 × 10−5

1bold = genome wide significant genes/p-values. 2 previously known nsCL/P susceptibility gene.

The gene-set analysis implemented in FUMAGWAS revealed twelve gene ontology (GO) gene sets with an adjusted p-value < 0.05 (Table S2). The two gene sets with the lowest p-values were GO_TISSUE_MORPHOGENESIS (adjusted p = 0.0259) and GO_KERATINOCYTE_PROLIFERATION (adjusted p = 0.0259).

4. Discussion

The present report describes a GWAS performed in nsCL/P patients from the Netherlands and Belgium and unaffected controls from the Netherlands. The sample comprised 259 patients and 1187 controls, and thus represents a medium-sized nsCL/P cohort. Nonetheless, the replication of 20 of the 40 previously reported nsCL/P susceptibility loci—four of which were originally identified in Asian samples—demonstrated the power of the sample. Notably, at the well-established nsCL/P susceptibility locus at chromosome 8q24.21, the analyses identified 62 genome-wide significant SNVs. Among others, the results of the gene-set analysis prioritized GO_TISSUE_MORPHOGENESIS and GO_KERATINOCYTE_PROLIFERATION. This was consistent with the hypothesis that nsCL/P arises from the disturbed proliferation, adhesion, and apoptosis of cells in the facial prominences during embryogenesis, and demonstrated the validity of the present Dutch/Belgian sample [33]. The SNV-based analysis identified a novel locus at chromosome 16p12.1 yielding suggestive evidence of association (lead SNV rs56383345 with a p = 4.17 × 10−7). This association was replicated in two of the three replication series (Central European and Yemeni). The association was not replicated in the Mexican patient/control series. However, the MAF for this SNV in the Mexican series was <2%, so the sample would not be expected to have much power to detect a true association. The SNV rs56383345 maps to a 930 kb non-coding region at 16p12.1, and is not located in any currently known regulatory element [34]. The nearest flanking genes are heparan sulfate-glucosamine 3-sulfotransferase 4 (HS3ST4) upstream and C16orf82 downstream. Neither of these genes, nor any other genes near rs56383345, has any reported role in cleft development. Furthermore, no orofacial clefting has been reported in those patients from the DECIPHER database [35] who have a copy number variant encompassing the new suggestive susceptibility locus or either one of the flanking genes. Twenty of the 40 recognized nsCL/P susceptibility loci were replicated with at least a nominally significant p-value (p < 0.05) in the present study. At 3 of these 20 replicated loci, the lead SNV reported in the literature and the lead SNV in the present dataset were identical, and at 15 loci a SNV in LD (r2 > 0.6) with the lead SNV from the literature achieved a smaller p-value. Notably, at two of the 20 replicated loci, namely 1p36 and 3q29, additional peaks with lead SNVs not in LD with the lead SNV from the original literature were identified. This suggested that the original sample and the Dutch/Belgian sample have differing haplotype structures. In addition to the “typical” single-SNV analyses, the present study involved a gene analysis, which generated several interesting findings. In a gene analysis, genetic marker data are aggregated to the level of whole genes to test the joint associations of all markers in the gene with the phenotype [36]. Previous authors have suggested that this approach represents a potentially more powerful alternative to single-SNV analyses. The gene analysis approach has the advantage of considerably reducing the required number of tests, and renders possible detection of effects consisting of multiple weaker associations, which would otherwise be overlooked. The present gene analysis prioritized SH3PXD2A at chromosome 10q24.33 as a candidate gene for nsCL/P. SH3PXD2A is a protein-coding gene with 15 exons and a size of 267 kb. The SH3PXD2A gene product is necessary for the formation and function of podosomes, which are structures located on the cellular surface that establish close contact with the extracellular matrix, and are involved in cell migration and matrix degradation [37]. Observations in zebrafish and knockout mice suggest that SH3PXD2A is also a potential risk gene for orofacial clefting [38,39]. In mammals, the primary and secondary palate forms from cranial-neural-crest-cell-derived mesenchymal protuberances, and any alteration in the growth, proliferation, movement, adhesion, or death of cells composing the palatal structures can affect palatal architecture and lead to orofacial clefting. SH3PXD2A encodes TKS5, a scaffold protein shown to be fundamental to zebrafish neural crest cell migration in vivo [38]. However, even stronger support for that SH3PXD2A may be a cleft candidate gene has been generated by Cejudo-Martin et al., who showed that disruption of the mouse Sh3pxd2a gene was associated with complete cleft of the secondary palate in 50–90% of mutant mice [39]. Of note, the fact, that the mouse model in Cejudo-Martin et al. has a cleft of the secondary palate but not a cleft lip, does not speak against this gene being also involved in nsCL/P formation in humans. The present gene analysis provides the first strong support for the involvement of SH3PXD2A in non-syndromic cleft formation in humans. Gene evaluation of the present GWAS data also prioritized the gene anoctamin 4 (ANO4) at chromosomal band 12q23.1. Interestingly one SNV, rs73145631, located 79 kb upstream of the transcription start site of this gene, achieved genome-wide significance in this dataset. The SNV association signal should be considered independent of the MAGMA gene analysis, which only took into account signals located between the transcription start and stop sites of the gene. The protein product of the candidate gene, ANO4, is a transmembrane protein from the anoctamin family. This protein family plays a key role in diverse physiological functions, including ion transport, phospholipid scrambling, and the regulation of other ion channels. While the ANO1 and ANO2 proteins have been functionally characterized, the roles of other family members, such as ANO4, remain poorly understood and controversial [40]. There is currently no convincing support for ANO4 as a cleft susceptibility gene in the literature or publically available databases on gene expression, such as SysFace. However, it is possible that rather than being attributable to the ANO4 gene itself, this association signal was generated by a regulatory element located intronically in ANO4, which influences the activity of another nearby gene. Notably, two genes in the vicinity of the ANO4 signal are expressed at relevant embryonic time-points in tissues of relevance to lip and palate development in the mouse model: (i) Growth Arrest Specific 2 Like 3 (GAS2L3), located upstream from ANO4; and (ii) Insulin Like Growth Factor 1 (IGF1), located downstream [34]. In the present GWAS, the genome-wide significant association signal for rs73145631 in the ANO4 upstream region could be interpreted as additional independent support for the ANO4 locus, despite being based on the same data. However, for several reasons, these results must be interpreted with caution. First, no supportive association signal for rs73145631 was found in any of the three replication cohorts. Second, interpretation of the GWAS association signal for rs73145631 was hampered by the fact that no SNVs were in LD with this SNV in this dataset. Even though it was not replicated in any of the three replication samples, the finding may be a true association specific to the present Dutch/Belgian sample, or may represent a false positive signal from a single marker. After Asians, Europeans represent the second most common ethnicity in published nsCL/P association studies. The largest ethnically homogenous GWAS sample to date included 2033 patients of Chinese ancestry [17]. The largest European SNV data set explored to date combined 1158 nsCL/P patients from two large studies in a genome-wide meta-analysis [20]. Compared to such studies, the present sample size was relatively small, but this sample was valid and allowed the generation of interesting results warranting further investigation. The present sample is also of value in terms of future meta-analyses of GWAS data. These meta-analyses will increase statistical power for locus discovery, and facilitate the elucidation of the genetic background of orofacial clefting.

5. Conclusions

In summary, the present GWAS generated novel insights into the etiology of nonsyndromic orofacial clefting. The gene-based analysis provided strong support for SH3PXD2A as a candidate gene identified in animal models, and the SNV analysis identified two novel suggestive risk loci. The present results demonstrate how even medium-sized, clinically well characterized GWAS samples can improve knowledge of the genetic basis of nsCL/P.
  40 in total

1.  A genome-wide association study identifies a locus for nonsyndromic cleft lip with or without cleft palate on 8q24.

Authors:  Struan F A Grant; Kai Wang; Haitao Zhang; Wendy Glaberson; Kiran Annaiah; Cecilia E Kim; Jonathan P Bradfield; Joseph T Glessner; Kelly A Thomas; Maria Garris; Edward C Frackelton; F George Otieno; Rosetta M Chiavacci; Hyun-Duck Nah; Richard E Kirschner; Hakon Hakonarson
Journal:  J Pediatr       Date:  2009-08-04       Impact factor: 4.406

2.  Genome-wide association study identifies two susceptibility loci for nonsyndromic cleft lip with or without cleft palate.

Authors:  Elisabeth Mangold; Kerstin U Ludwig; Stefanie Birnbaum; Carlotta Baluardo; Melissa Ferrian; Stefan Herms; Heiko Reutter; Nilma Almeida de Assis; Taofik Al Chawa; Manuel Mattheisen; Michael Steffens; Sandra Barth; Nadine Kluck; Anna Paul; Jessica Becker; Carola Lauster; Gül Schmidt; Bert Braumann; Martin Scheer; Rudolf H Reich; Alexander Hemprich; Simone Pötzsch; Bettina Blaumeiser; Susanne Moebus; Michael Krawczak; Stefan Schreiber; Thomas Meitinger; Hans-Erich Wichmann; Regine P Steegers-Theunissen; Franz-Josef Kramer; Sven Cichon; Peter Propping; Thomas F Wienker; Michael Knapp; Michele Rubini; Peter A Mossey; Per Hoffmann; Markus M Nöthen
Journal:  Nat Genet       Date:  2009-12-20       Impact factor: 38.330

3.  AGORA, a data- and biobank for birth defects and childhood cancer.

Authors:  Iris A L M van Rooij; Loes F M van der Zanden; Ernie M H F Bongers; Kirsten Y Renkema; Charlotte H W Wijers; Michelle Thonissen; Elisabeth M J Dokter; Carlo L M Marcelis; Ivo de Blaauw; Marc H W A Wijnen; Peter M Hoogerbrugge; Jos P M Bokkerink; Michiel F Schreuder; Linda Koster-Kamphuis; Elisabeth A M Cornelissen; Livia Kapusta; Arno F J van Heijst; Kian D Liem; Robert P E de Gier; Anne Marie Kuijpers-Jagtman; Ronald J C Admiraal; Stefaan J Bergé; Jan Jaap van der Biezen; An Verdonck; Vincent Vander Poorten; Greet Hens; Jasmien Roosenboom; Marc R Lilien; Tom P de Jong; Paul Broens; Rene Wijnen; Alice Brooks; Barbara Franke; Han G Brunner; Carine E L Carels; Nine V A M Knoers; Wout F J Feitz; Nel Roeleveld
Journal:  Birth Defects Res A Clin Mol Teratol       Date:  2016-05-06

4.  Genetic risk factors for nonsyndromic cleft lip with or without cleft palate in a Mesoamerican population: Evidence for IRF6 and variants at 8q24 and 10q25.

Authors:  Augusto Rojas-Martinez; Heiko Reutter; Oscar Chacon-Camacho; Rafael B R Leon-Cachon; Sergio G Munoz-Jimenez; Stefanie Nowak; Jessica Becker; Ruth Herberz; Kerstin U Ludwig; Mario Paredes-Zenteno; Abelardo Arizpe-Cantú; Susanne Raeder; Stefan Herms; Rocio Ortiz-Lopez; Michael Knapp; Per Hoffmann; Markus M Nöthen; Elisabeth Mangold
Journal:  Birth Defects Res A Clin Mol Teratol       Date:  2010-07

5.  DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources.

Authors:  Helen V Firth; Shola M Richards; A Paul Bevan; Stephen Clayton; Manuel Corpas; Diana Rajan; Steven Van Vooren; Yves Moreau; Roger M Pettett; Nigel P Carter
Journal:  Am J Hum Genet       Date:  2009-04-02       Impact factor: 11.025

6.  Confirming genes influencing risk to cleft lip with/without cleft palate in a case-parent trio study.

Authors:  T H Beaty; M A Taub; A F Scott; J C Murray; M L Marazita; H Schwender; M M Parker; J B Hetmanski; P Balakrishnan; M A Mansilla; E Mangold; K U Ludwig; M M Noethen; M Rubini; N Elcioglu; I Ruczinski
Journal:  Hum Genet       Date:  2013-03-20       Impact factor: 4.132

7.  Fast and accurate genotype imputation in genome-wide association studies through pre-phasing.

Authors:  Bryan Howie; Christian Fuchsberger; Matthew Stephens; Jonathan Marchini; Gonçalo R Abecasis
Journal:  Nat Genet       Date:  2012-07-22       Impact factor: 38.330

8.  Meta-analysis Reveals Genome-Wide Significance at 15q13 for Nonsyndromic Clefting of Both the Lip and the Palate, and Functional Analyses Implicate GREM1 As a Plausible Causative Gene.

Authors:  Kerstin U Ludwig; Syeda Tasnim Ahmed; Anne C Böhmer; Nasim Bahram Sangani; Sheryil Varghese; Johanna Klamt; Hannah Schuenke; Pinar Gültepe; Andrea Hofmann; Michele Rubini; Khalid Ahmed Aldhorae; Regine P Steegers-Theunissen; Augusto Rojas-Martinez; Rudolf Reiter; Guntram Borck; Michael Knapp; Mitsushiro Nakatomi; Daniel Graf; Elisabeth Mangold; Heiko Peters
Journal:  PLoS Genet       Date:  2016-03-11       Impact factor: 5.917

9.  Genome-wide analyses of non-syndromic cleft lip with palate identify 14 novel loci and genetic heterogeneity.

Authors:  Yanqin Yu; Xianbo Zuo; Miao He; Jinping Gao; Yuchuan Fu; Chuanqi Qin; Liuyan Meng; Wenjun Wang; Yaling Song; Yong Cheng; Fusheng Zhou; Gang Chen; Xiaodong Zheng; Xinhuan Wang; Bo Liang; Zhengwei Zhu; Xiazhou Fu; Yujun Sheng; Jiebing Hao; Zhongyin Liu; Hansong Yan; Elisabeth Mangold; Ingo Ruczinski; Jianjun Liu; Mary L Marazita; Kerstin U Ludwig; Terri H Beaty; Xuejun Zhang; Liangdan Sun; Zhuan Bian
Journal:  Nat Commun       Date:  2017-02-24       Impact factor: 14.919

10.  A multi-ethnic genome-wide association study identifies novel loci for non-syndromic cleft lip with or without cleft palate on 2p24.2, 17q23 and 19q13.

Authors:  Elizabeth J Leslie; Jenna C Carlson; John R Shaffer; Eleanor Feingold; George Wehby; Cecelia A Laurie; Deepti Jain; Cathy C Laurie; Kimberly F Doheny; Toby McHenry; Judith Resick; Carla Sanchez; Jennifer Jacobs; Beth Emanuele; Alexandre R Vieira; Katherine Neiswanger; Andrew C Lidral; Luz Consuelo Valencia-Ramirez; Ana Maria Lopez-Palacio; Dora Rivera Valencia; Mauricio Arcos-Burgos; Andrew E Czeizel; L Leigh Field; Carmencita D Padilla; Eva Maria C Cutiongco-de la Paz; Frederic Deleyiannis; Kaare Christensen; Ronald G Munger; Rolv T Lie; Allen Wilcox; Paul A Romitti; Eduardo E Castilla; Juan C Mereb; Fernando A Poletta; Iêda M Orioli; Flavia M Carvalho; Jacqueline T Hecht; Susan H Blanton; Carmen J Buxó; Azeez Butali; Peter A Mossey; Wasiu L Adeyemo; Olutayo James; Ramat O Braimah; Babatunde S Aregbesola; Mekonen A Eshete; Fikre Abate; Mine Koruyucu; Figen Seymen; Lian Ma; Javier Enríquez de Salamanca; Seth M Weinberg; Lina Moreno; Jeffrey C Murray; Mary L Marazita
Journal:  Hum Mol Genet       Date:  2016-03-30       Impact factor: 5.121

View more
  10 in total

Review 1.  Genetics and signaling mechanisms of orofacial clefts.

Authors:  Kurt Reynolds; Shuwen Zhang; Bo Sun; Michael A Garland; Yu Ji; Chengji J Zhou
Journal:  Birth Defects Res       Date:  2020-07-15       Impact factor: 2.344

2.  [Exploring the association between de novo mutations and non-syndromic cleft lip with or without palate based on whole exome sequencing of case-parent trios].

Authors:  X Chen; S Y Wang; E C Xue; X H Wang; H X Peng; M Fan; M Y Wang; Y Q Wu; X Y Qin; J Li; T Wu; H P Zhu; J Li; Z B Zhou; D F Chen; Y H Hu
Journal:  Beijing Da Xue Xue Bao Yi Xue Ban       Date:  2022-06-18

3.  Whole-genome sequencing reveals de-novo mutations associated with nonsyndromic cleft lip/palate.

Authors:  Waheed Awotoye; Peter A Mossey; Jacqueline B Hetmanski; Lord J J Gowans; Mekonen A Eshete; Wasiu L Adeyemo; Azeez Alade; Erliang Zeng; Olawale Adamson; Thirona Naicker; Deepti Anand; Chinyere Adeleke; Tamara Busch; Mary Li; Aline Petrin; Babatunde S Aregbesola; Ramat O Braimah; Fadekemi O Oginni; Ayodeji O Oladele; Abimbola Oladayo; Sami Kayali; Joy Olotu; Mohaned Hassan; John Pape; Peter Donkor; Fareed K N Arthur; Solomon Obiri-Yeboah; Daniel K Sabbah; Pius Agbenorku; Gyikua Plange-Rhule; Alexander Acheampong Oti; Rose A Gogal; Terri H Beaty; Margaret Taub; Mary L Marazita; Michael J Schnieders; Salil A Lachke; Adebowale A Adeyemo; Jeffrey C Murray; Azeez Butali
Journal:  Sci Rep       Date:  2022-07-11       Impact factor: 4.996

4.  Genome-wide association study of multiethnic nonsyndromic orofacial cleft families identifies novel loci specific to family and phenotypic subtypes.

Authors:  Nandita Mukhopadhyay; Eleanor Feingold; Lina Moreno-Uribe; George Wehby; Luz Consuelo Valencia-Ramirez; Claudia P Restrepo Muñeton; Carmencita Padilla; Frederic Deleyiannis; Kaare Christensen; Fernando A Poletta; Ieda M Orioli; Jacqueline T Hecht; Carmen J Buxó; Azeez Butali; Wasiu L Adeyemo; Alexandre R Vieira; John R Shaffer; Jeffrey C Murray; Seth M Weinberg; Elizabeth J Leslie; Mary L Marazita
Journal:  Genet Epidemiol       Date:  2022-02-22       Impact factor: 2.344

5.  Polygenic risk impacts PDGFRA mutation penetrance in non-syndromic cleft lip and palate.

Authors:  Yao Yu; Rolando Alvarado; Lauren E Petty; Ryan J Bohlender; Douglas M Shaw; Jennifer E Below; Nada Bejar; Oscar E Ruiz; Bhavna Tandon; George T Eisenhoffer; Daniel L Kiss; Chad D Huff; Ariadne Letra; Jacqueline T Hecht
Journal:  Hum Mol Genet       Date:  2022-07-21       Impact factor: 5.121

6.  Co-occurrence of orofacial clefts and clubfoot phenotypes in a sub-Saharan African cohort: Whole-exome sequencing implicates multiple syndromes and genes.

Authors:  Lord J J Gowans; Noura Al Dhaheri; Mary Li; Tamara Busch; Solomon Obiri-Yeboah; Alexander A Oti; Daniel K Sabbah; Fareed K N Arthur; Waheed O Awotoye; Azeez A Alade; Peter Twumasi; Pius Agbenorku; Gyikua Plange-Rhule; Thirona Naicker; Peter Donkor; Jeffrey C Murray; Nara L M Sobreira; Azeez Butali
Journal:  Mol Genet Genomic Med       Date:  2021-03-14       Impact factor: 2.183

7.  Genome-Wide Association Study of Non-syndromic Orofacial Clefts in a Multiethnic Sample of Families and Controls Identifies Novel Regions.

Authors:  Nandita Mukhopadhyay; Eleanor Feingold; Lina Moreno-Uribe; George Wehby; Luz Consuelo Valencia-Ramirez; Claudia P Restrepo Muñeton; Carmencita Padilla; Frederic Deleyiannis; Kaare Christensen; Fernando A Poletta; Ieda M Orioli; Jacqueline T Hecht; Carmen J Buxó; Azeez Butali; Wasiu L Adeyemo; Alexandre R Vieira; John R Shaffer; Jeffrey C Murray; Seth M Weinberg; Elizabeth J Leslie; Mary L Marazita
Journal:  Front Cell Dev Biol       Date:  2021-04-09

8.  Genetic architecture of orbital telorism.

Authors:  Maria J Knol; Mikolaj A Pawlak; Sander Lamballais; Natalie Terzikhan; Edith Hofer; Ziyi Xiong; Caroline C W Klaver; Lukas Pirpamer; Meike W Vernooij; M Arfan Ikram; Reinhold Schmidt; Manfred Kayser; Tavia E Evans; Hieab H H Adams
Journal:  Hum Mol Genet       Date:  2022-05-04       Impact factor: 5.121

9.  Genome-wide association study in patients with posterior urethral valves.

Authors:  Loes F M van der Zanden; Carlo Maj; Oleg Borisov; Iris A L M van Rooij; Josine S L T Quaedackers; Martijn Steffens; Luca Schierbaum; Sophia Schneider; Lea Waffenschmidt; Lambertus A L M Kiemeney; Liesbeth L L de Wall; Stefanie Heilmann; Aybike Hofmann; Jan Gehlen; Johannes Schumacher; Maria Szczepanska; Katarzyna Taranta-Janusz; Pawel Kroll; Grazyna Krzemien; Agnieszka Szmigielska; Michiel F Schreuder; Stefanie Weber; Marcin Zaniew; Nel Roeleveld; Heiko Reutter; Wout F J Feitz; Alina C Hilger
Journal:  Front Pediatr       Date:  2022-09-27       Impact factor: 3.569

10.  Integrative approaches generate insights into the architecture of non-syndromic cleft lip with or without cleft palate.

Authors:  Julia Welzenbach; Nigel L Hammond; Miloš Nikolić; Frederic Thieme; Nina Ishorst; Elizabeth J Leslie; Seth M Weinberg; Terri H Beaty; Mary L Marazita; Elisabeth Mangold; Michael Knapp; Justin Cotney; Alvaro Rada-Iglesias; Michael J Dixon; Kerstin U Ludwig
Journal:  HGG Adv       Date:  2021-06-08
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.