Literature DB >> 31063268

Accumulation of rare coding variants in genes implicated in risk of human cleft lip with or without cleft palate.

Nicholas J Marini1, Kripa Asrani1, Wei Yang2, Jasper Rine1, Gary M Shaw2.   

Abstract

Cleft lip with/without cleft palate (CLP) is a common craniofacial malformation with complex etiologies, reflecting both genetic and environmental factors. Most of the suspected genetic risk for CLP has yet to be identified. To further classify risk loci and estimate the contribution of rare variants, we sequenced the exons in 49 candidate genes in 323 CLP cases and 211 nonmalformed controls. Our findings indicated that rare, protein-altering variants displayed markedly higher burdens in CLP cases at relevant loci. First, putative loss-of-function mutations (nonsense, frameshift) were significantly enriched among cases: 13 of 323 cases (~4%) harbored such alleles within these 49 genes, versus one such change in controls (p = 0.01). Second, in gene-level analyses, the burden of rare alleles showed greater case-association for several genes previously implicated in cleft risk. For example, BHMT displayed a 10-fold increase in protein-altering variants in CLP cases (p = .03), including multiple case occurrences of a rare frameshift mutation (K400 fs). Other loci with greater rare, coding allele burdens in cases were in signaling pathways relevant to craniofacial development (WNT9B, BMP4, BMPR1B) as well as the methionine cycle (MTRR). We conclude that rare coding variants may confer risk for isolated CLP.
© 2019 The Authors. American Journal of Medical Genetics Part A published by Wiley Periodicals, Inc.

Entities:  

Keywords:  cleft lip; folic acid; genetic association; rare variants

Mesh:

Substances:

Year:  2019        PMID: 31063268      PMCID: PMC6557678          DOI: 10.1002/ajmg.a.61183

Source DB:  PubMed          Journal:  Am J Med Genet A        ISSN: 1552-4825            Impact factor:   2.802


INTRODUCTION

Although their causes are largely unknown, orofacial clefts are suspected of being etiologically heterogeneous with both genetic and nongenetic risk factors (Dixon, Marazita, Beaty, & Murray, 2011). Clinical observations have long indicated that orofacial clefts should be classified into at least two distinct phenotypic groups: cleft lip with or without cleft palate (CLP) and cleft palate alone (CPO) (Fogh‐Anderson, 1967). Collectively, these are among the most common birth defects with a worldwide prevalence of ~1 in 700 live births (WHO, 2001). Genetic associations for CLP have been observed for several genes, including IRF6, FGFR2, FOXE1, MSX1, NECTIN1, BMP4, TBX22, and TGFa (Dixon et al., 2011) from either linkage analysis or candidate gene studies. In addition, genome‐wide association studies (GWAS) have identified approximately 15 additional risk loci (Beaty et al., 2010; Birnbaum et al., 2009; Grant et al., 2009; Leslie et al., 2016; Ludwig et al., 2012; Mangold et al., 2010) with some of these replicating in subsequent studies (e.g., – Jia et al., 2015). For some of these loci, such as IRF6 and NOG, follow‐up sequencing efforts have identified compelling functional variants as putative risk alleles (Leslie et al., 2015; Rahimov et al., 2008). Interestingly, there is apparently little overlap between the most significantly‐associated GWAS regions and genes previously implicated in CLP risk. This is probably indicative of the complex genetic etiologies underlying the CLP phenotype and the difficulties detecting associated loci at genome‐wide significance. Thus, there are likely to be numerous additional genetic risk factors yet to be identified. Indeed, the combined efforts of all these genetic approaches have been extremely fruitful, yet still explain only a fraction of the population burden of these human birth defects. Several nongenetic factors also appear to contribute to cleft phenotypes (Mossey, Little, Munger, Dixon, & Shaw, 2009). Of particular relevance to the current study is that maternal use of multivitamin supplements containing folic acid in early pregnancy has been associated with decreased risk of CLP (Shaw, Lammer, Wasserman, O'Malley, & Tolarova, 1995; Wilcox et al., 2007). Although some reports do not observe a reduction in risk associated with maternal folate supplementation (see Little et al., 2008), a recent meta‐analysis indicated a significant reduction in risk of CLP with maternal folic acid use (Jahanbin, Shadkam, Miri, Shirazi, & Abtahi, 2018). Because of this folic acid link, folate/onecarbon pathway genes have been a logical place to look for risk variants (Blanton et al., 2011; Boyles et al., 2009; Marini et al., 2016). However, many of the known common pathway SNPs have been evaluated, yet the results are still somewhat unconvincing. The exception to this seems to be the fairly consistent association of the BHMT/BHMT2/DMGDH locus on Chromosome 5 with CLP observed in a number of studies (Marini et al., 2016). Interestingly, while the locus often shows association, the local SNPs driving the association can vary between studies. Thus, the nature of the risk at this locus is still unclear. In this study we hypothesized that some genetic risk for CLP lies in rare variants and this has contributed to the lack of consistent findings and attendant difficulty in unraveling risk alleles. Thus, we sequenced the exons in 30 genes involved in folate metabolism in a 534‐member case–control population to determine if rare variant allele burden (in individual genes, subsets of genes, or the entire pathway) correlated with phenotype. We also sequenced the coding region of an additional 19 candidate genes (not related to folate metabolism) that are involved in lip/palate development or previously associated with CLP. Our results showed that, while there were no folate pathway‐wide trends in rare variant allele burden, two genes involved in methionine synthesis (BHMT and MTRR) showed convincing case‐associated trends for protein‐altering variants. Likewise, genes involved in BMP/WNT signaling (BMP4, BMPR1B, WNT9B), which have been previously implicated in orofacial cleft development (Juriloff, Harris, Mager, & Gagnier, 2014; Suzuki et al., 2009; Zhang et al., 2002), displayed higher allele burdens in cases. CLP cases also presented a greater number of loss‐of‐function alleles (frameshifts, truncations) in this candidate gene set.

METHODS

Study population

This case–control study included data on deliveries that had estimated due dates from 1995 to 2003. The study included live‐born infants with isolated CLP (N = 681), isolated CPO (N = 157), or without any structural malformation (controls; N = 706). Because the goal of the study was to determine whether rare coding variants in candidate genes underlie risk for CLP, variant discovery by gene sequencing was performed on an ethnically‐mixed subset of this population comprising 323 CLP cases and 211 controls. A second group (358 CLP cases, 157 CPO cases, 485 controls) was used to further validate the CLP case‐association of a frameshift mutation in BHMT. See Table 1 for race‐ethnic breakdown of the study population.
Table 1

Populations used in study

A. Discovery population
Race/ethnicityCLP cases (collective percentage)Controls (collective percentage)
Asian35 (10.8)25 (11.8)
Black13 (4.0)8 (3.8)
White, Hispanic190 (58.8)126 (59.7)
White, non‐Hispanic79 (24.5)48 (22.7)
Other6 (1.9)4 (1.9)
TOTAL 323 211
Populations used in study Case information was abstracted from hospital reports and medical records following established procedures by the California Birth Defects Monitoring Program (Croen, Shaw, Jensvold, & Harris, 1991). Each medical record was further reviewed by a medical geneticist (Edward J. Lammer, MD). Infants with trisomies were ineligible. Nonmalformed control infants were selected randomly to represent the birth population from which the cases were derived in selected counties and birth periods. This study, including the collection and use of archived newborn bloodspots, was approved by the California State Committee for the Protection of Human Subjects as well as Institutional Review Boards at Stanford University and the University of California, Berkeley.

Genomic DNA isolation

Genomic DNA (gDNA) was available from newborn screening dried blood spots obtained from linkage efforts made by the California Birth Defects Monitoring Program. gDNA was extracted from dried blood spots and whole‐genome amplified using QIAmp DNA Mini kits and Replica‐G Midi amplification kits (Qiagen), respectively, as described previously (Marini et al., 2011).

Illumina paired‐end sequencing of enriched target exons

3–5 μg of whole‐genome amplified DNA was used as starting material for Illumina library construction and target exon capture with Sure Select (Agilent) probes according to the Agilent SureSelectXT protocol (https://www.agilent.com/cs/library/usermanuals/public/G7530-90000.pdf). Briefly, DNA was fragmented with a Covaris sonicator, followed by end‐repair and A‐tailing, then ligation of sequencing adaptors and PCR amplification. Amplified libraries were then used to perform target exon capture by hybridization against a custom‐made capture library of 120 bp biotinylated oligonucleotide probes covering the 555 coding exons in 49 candidate genes of interest (Table S4). Probes (Agilent SureSelect) were designed via the SureDesign web tool using the genomic coordinates of target exons (genome build GRCh37/hg19). To ensure capture and coverage of the entire coding region and splice sites, probes were designed with an additional 50 bp “pad” that covered intronic regions adjacent to coding exons or, in the case of the first or last coding exon, covered the 5′‐ or 3′‐ untranslated regions. Following hybridization and streptavidin capture of enriched exons, captured libraries were barcoded with 6 bp tags with an additional round of PCR, quantified, and 12–16 sample libraries were pooled for 100 bp paired‐end sequencing on the Illumina HiSeq (v3 chemistry).

Read mapping and variant filtering

Raw sequence files (FASTQ) were imported into CLC Genomics Workbench (v.6; Qiagen) and mapped onto the human genome (GRCh37/hg19). Only full‐length, paired, uniquely‐mapping reads with <= 3 mismatches were used for variant calling. Variants with depth of coverage <30, or with genotype quality scores <25, were excluded. In addition, alleles not observed on both strands (with a minimum frequency ≥ 0.1) or with allelic imbalances (heterozygote genotype calls with one allele observed with frequency < 0.25 or homozygote alleles observed with frequency < 0.85) were also excluded. After compilation of the dataset, positions not covered in at least 95% of the cases or controls were then removed.

Variant calling accuracy

The accuracy of the variant calls in the filtered dataset was evaluated in two ways: (a) the concordance of genotype calls by Illumina sequencing and by TaqMan allelic discrimination assays (described in Marini et al., 2016) was determined for 8,537 variant calls (mostly common alleles) in 476 samples and (b) confirmation by Sanger sequencing for the subset of rare variants in genes with striking case‐related trends (~15% of all protein‐altering variants identified in the study). Verification sequencing was performed on original bloodspot gDNA preparations, prior to whole‐genome amplification, using amplicon‐specific M13‐capped PCR primers designed by VariantSEQr methodology (ThermoFisher) and deposited into the NCBI probe database (https://www.ncbi.nlm.nih.gov/probe/docs/projvariantseqr/). Each sample for verification had a corresponding negative control for comparison.

Analyses

To estimate the contribution of rare (MAF < 1%) coding variants (for which there may only be a single occurrence) to CLP risk, a simple allele burden test was applied on a gene‐by‐gene basis to highlight genes with prominent case–control skews. Alleles were summed by gene (or groups of genes) and case–control associations were tested by Fisher's exact test based on genotype distributions (i.e., carrier frequency). Although, this cannot be used as a strict test for association, statistical differences in case or control representation are readily observed with this metric. For common polymorphisms, variants were treated as categorical variables, that is, homozygous wild‐type as referent versus heterozygous or homozygous variant. Odds ratios and 95% confidence intervals were used to estimate risks.

RESULTS

Dataset definition, variant calling, and quality control

We took a candidate gene approach, with a focus on folate pathway genes, to investigate the potential contribution of rare coding variants to CLP risk by determining whether putative loss‐of‐function alleles or rare allele burdens in certain genes (or pathways) correlated with phenotype. Thus we sequenced the coding exons and exon/intron boundaries in 30 genes related to folate metabolism and 19 additional genes involved in craniofacial development in a case–control study (Table 1). The full list of genes queried, including exon annotations and coverage statistics are in Table S1. 532 coding exons out of the 555 targeted for enrichment in these 49 genes (96%) passed quality control for adequate coverage and sequence quality. Mean coverage for each of these 532 exons was similar between the case group and the control group, indicating each group was sequenced with equal efficiency (Figure S1). Variants were called in exons and in intron regions within 10 bp from exon/intron junctions as described in Methods. To evaluate the overall quality and accuracy of the variant calls, we performed two types of independent assessments on different subsets of variants. In the first, we established the concordance between the genotypes called from Illumina sequencing with those obtained from TaqMan allelic discrimination assays for many multiply‐occurring loci (i.e., not singletons) using TaqMan assays previously described (Marini et al., 2016). 91 variant positions in 24 genes were compared in 476 samples. Out of 8,537 variant calls, we observed 99.7% concordance between the two methods (Table S2). In a second quality‐control check, we evaluated all putative loss‐of‐function alleles (frameshifts and truncations), and all rare and singleton missense variant calls from genes whose alleles showed higher case representation, by exon‐specific Sanger resequencing (see Section 2). Out of 95 tested variants, all were confirmed except one frameshift in a low complexity region, which was subsequently removed (Table S3). Based on these analyses, we are confident in the sequencing accuracy of the data, necessary for rare variant genotyping. The final analytical dataset contained data for 773 variant positions: 285 synonymous, 380 protein‐altering, and 108 noncoding variants. One hundred fifty‐five (54%) of the synonymous alleles were singletons, whereas 243 (64%) of the protein‐altering alleles were singletons. The complete list of annotated variants, allele frequencies, and case–control distributions are in Table S4.

Common allele associations

Although our focus was on the distribution of rare coding alleles, sequence data also provided information on associations of more common variants. We observed a small number of nominally significant associations that are in agreement with previous studies. For example, association of nonsynonymous SNPs in DMGDH (S279P, homozygous OR = 0.5 [0.3–0.9], p = .022) and ALDH1L1 (I812V, heterozygous OR = 2.7 [1.3–5.5], p = .008) have been reported previously (Boyles et al., 2009; Marini et al., 2016). By far, the strongest statistically significant signal of any variant with high enough frequency to calculate was the nonsynonymous SNP V274I in IRF6 (heterozygous OR = 0.5 (0.3–0.7), p = 4E−04; homozygous OR = 0.5 (0.2–0.9), p = .046), a well‐studied polymorphism in orofacial cleft studies with numerous reported associations (Beaty, Marazita, & Leslie, 2016). A second IRF6 synonymous SNP (S153S) was also significantly associated in homozygotes (OR = 0.6 [0.4–0.9], p = .02). Finally, there were two linked synonymous SNPs in the VANGL2 gene (G445G and P467P; heterozygous OR = 1.8 [1.2–2.7], p = .004), which has not previously been associated with CLP. Odds ratios were calculated only from genotype frequencies in the entire, ethnically diverse population, and not in ethnically stratified sub‐populations, because this study was not sufficiently powered for such analyses. Our focus was on rare variants which are often impervious to meaningful ethnic stratification analyses owing to the infrequency of occurrence. However, we have ensured that the race‐ethnic breakdown was approximately the same in case and control groups (Table 1).

Association of putative loss‐of‐function alleles

The candidate gene set interrogated in this study is a mix of both folate pathway genes and genes implicated in cleft occurrence because of their roles in craniofacial development. Thus, this set was enriched for potential cleft risk genes over what might be considered in a broader sampling of the exome. Therefore, it is noteworthy that we observed a significant enrichment of putative loss‐of‐function alleles (frameshifts, truncations) in CLP cases: 13 cases harbored one such allele from this gene set versus only 1 control (Fisher's exact p = .01; Table 2). All alleles in Table 2 were seen only once in this population except for the deletion/frameshift at Lysine 400 in BHMT (K400fs), which was seen in four cases of differing ethnicity. It is further noteworthy that several of the genes represented in Table 2 (BHMT, DMGDH, MTRR, IRF6) have displayed significant CLP associations in multiple studies (see Section 4). It should be emphasized that functional impact of these alleles is only inferred, though they represent good candidates for mutations of consequence for cleft risk.
Table 2

Putative loss‐of‐function alleles

GeneVariantTypea Case appearanceb Control appearanceb
BHMT K400fsDeletion/frameshiftHispanic
Hispanic
Non‐Hispanic, White
Non‐Hispanic, White
DMGDH W495termb SNV/stop‐gainHispanic
GGH L190fsInsertion/frameshiftNon‐Hispanic, White
IRF6 Q112termb SNV/stop‐gainNon‐Hispanic, White
MTRR R386termb SNV/stop‐gainNon‐Hispanic, White
SARDH R457termb SNV/stop‐gainHispanic
SARDH R514termb SNV/stop‐gainHispanic
SP8 S261termb SNV/stop‐gainHispanic
SHMT1 R99termb SNV/stop‐gainHispanic
SHMT1 R207termb SNV/stop‐gainAsian
SHMT1 Y457termb SNV/stop‐gainAsian
TOTAL 13 1

Abbreviation: SNV, single nucleotide variation.

Ethnicity of newborn.

Putative loss‐of‐function alleles Abbreviation: SNV, single nucleotide variation. Ethnicity of newborn.

Folate pathway variation

Despite numerous epidemiological studies suggesting that maternal folic acid supplementation may modify risk for CLP, common SNP association studies with folate/onecarbon pathway variants have not been definitive in linking pathway loci to phenotype. Suggestive evidence exists for several loci from this group (e.g., Blanton et al., 2011; Boyles et al., 2009; Marini et al., 2016), though reproducibility across populations and small effect sizes remain problematic. The role of rare variants in folate pathway genes has not been adequately queried and may account for genetic risk yet to be identified. Indeed, the folate/onecarbon pathway may be particularly susceptible to the aggregate mutation burden throughout the pathway, since there are many co‐dependent and interacting components. Thus, we hypothesized that the aggregate burden of rare variants in the folate pathway (or related genes in pathway sub‐compartments) may confer risk for CLP and help explain the folate link. We have found such pathway analyses useful in understanding the role of folate pathway variation, common and rare, with respect to neural tube defect risk (Marini et al., 2011). By allele summing of rare variants (defined here as MAF < 1%), the group of folate/onecarbon pathway genes (N = 30) did not show an increased aggregate burden of either synonymous mutations (case aggregate allele frequency = 0.27, control aggregate allele frequency = 0.28; p = .57, Mann–Whitney test) or intron mutations within 10 bp of the exon boundary (case aggregate allele frequency = 0.12, control aggregate allele frequency = 0.10; p = .34, Mann–Whitney test) that might distinguish cases from controls. There was as slight increase in aggregated protein‐altering allele frequency for cases (case aggregate allele frequency = 0.38, control aggregate allele frequency = 0.31); though the distributions were not significantly different (p = .08). Upon sub‐dividing pathway genes into compartments with metabolically related genes (e.g., – purine synthesis, see (Marini et al., 2011)), it became obvious that the strongest case‐associated signals for rare, protein‐altering variants were restricted to two genes involved in methionine synthesis: BHMT (betaine‐homocysteine methyl transferase) and MTRR (methionine synthase reductase), both of which are necessary for the remethylation of homocysteine to methionine (Figure 1, Table 3). In addition to the four case occurrences of the K400fs allele in BHMT, nonsynonymous variants were found in an additional 7 cases and only a single control (Fisher's exact p = .03). Similarly, protein‐altering variants for MTRR were found in 13 cases versus 2 controls (p = .06).
Figure 1

The occurrences of all rare coding region variants (MAF <1.0%) were summed on a gene‐by‐gene basis to yield the total allele burden for synonymous (left panel) and protein‐altering (middle panel) variants. Protein‐altering variants encompass frameshifts, truncations, and missense alleles. Allele sum frequency is the carrier frequency (% heterozygote carriers) for each gene. p values were calculated (right panel) based on 2 × 2 Fisher's exact test of the summed carrier frequency for synonymous and protein‐altering changes and expressed as −log p [Color figure can be viewed at wileyonlinelibrary.com]

Table 3

Genes with strongest case representations of rare, protein‐altering variants

GeneCase appearancesa Control appearancesa
Variant#Variant#
BHMT D105N1V122M1
E136K1
P197S3
G199S2
K400fs4
BMP4 D56E1
R139H1
R162Q1
R223H1
R226Q1
R269Q1
BMPR1B P23S1
V67D1
V140I2
R149W1
Q153R1
M301V1
R371Q1
R413S1
MTRR H36Y1
N218K1
V265G1
T286M3
I290T1
K352N1
R386a 1
R418Q1
T490A2T490A1
V674A1V607I1
WNT9B A42V1
Q47R2
R94Q2R94Q1
N99S1
A126T3
K184N1
R222H1
S244L2

All variants were seen as heterozygotes; only MTRR T490A and WNT9B R94Q were seen in both cases and controls.

The occurrences of all rare coding region variants (MAF <1.0%) were summed on a gene‐by‐gene basis to yield the total allele burden for synonymous (left panel) and protein‐altering (middle panel) variants. Protein‐altering variants encompass frameshifts, truncations, and missense alleles. Allele sum frequency is the carrier frequency (% heterozygote carriers) for each gene. p values were calculated (right panel) based on 2 × 2 Fisher's exact test of the summed carrier frequency for synonymous and protein‐altering changes and expressed as −log p [Color figure can be viewed at wileyonlinelibrary.com] Genes with strongest case representations of rare, protein‐altering variants All variants were seen as heterozygotes; only MTRR T490A and WNT9B R94Q were seen in both cases and controls. Using a minor allele frequency cutoff of 5%, rather than 1%, yielded similar folate pathway‐wide trends: there were no significant differences in the aggregate burden of noncoding variants (p = .38) or synonymous variants (p = .81), but the aggregate burden of protein‐altering variants was significantly higher in cases (case aggregate allele frequency = 0.63, control aggregate allele frequency = 0.50; p = .02). No additional folate pathway genes, besides BHMT and MTRR, displayed significant case–control differences.

Individual gene analysis

For each of the 49 genes in this study, rare synonymous or protein‐altering variants were collapsed by summing and genotype frequencies for this allele burden metric were subject to Fisher's exact testing (Figure 1). Consistent with the absence of a signal seen from synonymous variation in the folate pathway, PTCH1 was the only gene with a statistically significant altered case–control distribution of rare synonymous variants: case carrier frequency (0.20) was approximately twofold higher than that in controls (0.12; p = .03). In contrast, there were several genes with higher carrier frequency of protein‐altering variants in cases with more striking case–control differentials (Figure 1 and Table 3). In addition to the BHMT and MTRR allele distributions discussed above, there were 13 case appearances (all heterozygotes) of missense alleles in the WNT9B gene, versus only one missense occurrence in controls (p = .01). One missense change, A126T, was found exclusively in three cases. Likewise, in the BMPR1B gene, which encodes a receptor for bone morphogenetic protein (BMP) secreted ligands, there were nine occurrences of missense alleles in cases without a single observation in controls (p = .01). Also noteworthy is the BMP4 gene, which encodes a specific, secreted bone morphogenetic ligand involved in embryonic head development. We observed six different missense alleles of BMP4 in case heterozygotes versus zero protein‐altering variants in controls (p = .09). These last three genes were included based on their roles in craniofacial development (and, in some cases, isolated cleft risk (Juriloff et al., 2014; Suzuki et al., 2009; Yang et al., 2017)). Thus, it was interesting to see striking accumulation of rare variation in CLP cases.

Replication of the case association of BHMT K400fs in a second population

As described above, we observed multiple case occurrences of a frameshift (delA) at lysine 400 in BHMT. This lesion occurs only 7 amino acids from the C‐terminus and is inferred (by nucleotide sequence) to replace these last 7 residues (KQKFKSQ) with 13 different amino acids (NKNSNHSSLDRSYF). Thus, the functional impact of this mutation is difficult to confidently discern. We took the same set of Exon 8‐specific resequencing primers used for variant verification (see Section 2) to genotype‐by‐sequence this variant in a second, case–control population (validation population in Table 1). We observed this variant as a heterozygote three additional times in the validation population: two additional CLP cases (one Hispanic, one non‐Hispanic, White) and one control (non‐Hispanic, White). We did not observe this variant in the cleft palate only (CPO) subset of this population (N = 157).

DISCUSSION

In this study we took a candidate gene approach to investigate the potential contribution of rare coding variants to CLP risk, with the reasoning that at least some of the genetic risk not yet identified in common allele studies may reside in rare variation. The contribution of rare, coding mutations in CLP risk has been previously suggested in multiple sequencing studies (Al Chawa et al., 2014; Leslie & Murray, 2013; Suzuki et al., 2009), though this has only recently begun to be systematically addressed (Aylward et al., 2016; Leslie et al., 2017). We focused on 49 candidate genes with an emphasis on nearly complete coverage of folate metabolism (N = 30 genes), since rare folate pathway variants have not been explored for CLP risk, particularly when considered in aggregate across the pathway or within related genesets. We queried 19 additional genes unrelated to folate metabolism, that have been implicated in disease risk by vertebrate models of clefting (e.g., WNT9B) or craniofacial development, or by human genetic association studies (e.g., BMP4, IRF6). From a catalog of the full spectrum of allelic variation within the coding regions of this target‐enriched gene set, we found two lines of evidence for the contribution of rare variants to CLP risk: (a) cases displayed an overabundance of putative loss‐of‐function alleles at these loci and (b) several mechanistically relevant genes showed similar case over‐representation for protein‐altering variants, in general.

Folate pathway variation: Methionine cycle genes

We sequenced the majority of folate/onecarbon pathway genes, which enabled us to interpret rare variation in the context of pathway function. This approach was useful in identifying folate‐related risk signatures for neural tube defects (Marini et al., 2011). While we did not see any pathway‐wide trends in allele burdens or distributions that significantly differentiated CLP cases from controls, we did observe two genes directly involved in methionine biosynthesis with a considerable over‐abundance of protein altering variants in cases: BHMT and MTRR (Table 3). Betaine‐homocysteine methyltransferase (BHMT) and Methionine synthase reductase (MTRR) are both involved in the methylation of homocysteine to yield methionine. They operate in different pathways; however, and while BHMT utilizes methyl groups generated from choline catabolism, the reaction involving MTRR utilizes 5‐methyl‐tetrahydrofolate. The CLP‐association of rare variants at BHMT is intriguing for two reasons: First, this locus, which is part of the metabolically‐related BHMT/BHMT2/DMGDH gene cluster on chromosome 5, is one of the most consistently associated loci in onecarbon pathway gene‐CLP association studies (reviewed in Marini et al., 2016). However, while the locus is often associated, the regional SNPs that drive the association can be different, as can the directionality of risk (i.e., whether odds ratios are < or >1). We postulated that these phenomena may reflect the contribution of underlying rare variants at this locus (which may occur on the haplotype of the major or minor allele of a nearby common variant) as has been previously suggested for clefts (Leslie & Murray, 2013) and as has been seen for regions highlighted by GWAS studies (Leslie et al., 2015). The distribution of rare variants at BHMT is also intriguing because of the appearance of a frameshift mutation (K400fs) that was observed in multiple CLP cases (discussed below). The increased mutation burden in these two genes brings attention to homocysteine clearance as relevant for modifying CLP risk, which has been previously suggested (Kumari, Ali, Sukla, Singh, & Raman, 2013; Wong et al., 1999). Furthermore, interference of methionine synthesis suggests perturbations in cellular methylation reactions (via S‐adenosylmethionine levels), which may manifest in DNA methylation changes associated with CLP, as has been observed (Alvizi et al., 2017; Sharp et al., 2016).

Putative loss‐of‐function alleles

Within this target gene set, CLP cases displayed a significantly greater frequency of indel frameshift and truncation alleles (p = .01; Table 2). One of the most intriguing alleles is the K400 fs deletion allele of BHMT, which was observed in four cases (both Hispanic and non‐Hispanic, White) but not in controls in the discovery population. A second, validation population revealed two additional CLP case occurrences and one control occurrence. Although the functional impact of this allele is yet to be determined, its observance in multiple cases is noteworthy particularly because global population frequencies of this variant are estimated at 2.2E−04 in the Genome Aggregation Database (www.gnomad.broadinstitute.org), substantially lower than our overall case frequency (MAF = 0.004 in all cases queried). The low number of occurrences for this allele does not allow an effective statistical test in populations of this size, nevertheless, multiple case observations in two separate populations is higher than expected based on the reported global MAF. It should be noted that this variant is described as low‐confidence in the gnomAD database. However, the signal was strong in our data and all instances clearly replicated in verification Sanger sequencing (see Figure S2, for example). Thus, we are confident of these variant calls. In addition to BHMT, we also observed case‐associated truncations in several other genes relevant to methyl donor production (MTRR R386*, DMGDH W495*, and SARDH R457*, R514*). Based on protein lengths (MTRR 698aa, DMGDH 866aa, SARDH 918aa), these changes are likely to be consequential. BHMT, DMGDH (dimethyl‐glycine dehydrogenase), and SARDH (sarcosine dehydrogenase) are sequential enzymatic steps in the catabolism of choline to glycine, which generates methyl donors for cellular methylation reactions as noted above. Interestingly, increased intake of choline in the maternal diet has been associated with decreased CLP risk (Shaw, Carmichael, Laurent, & Rasmussen, 2006). Outside of folate/onecarbon pathway genes, two case truncations were observed in genes previously implicated in orofacial cleft risk: IRF6 Q112* and SP8 S261*. Significant associations of IRF6 with CLP have been reported in multiple populations (Beaty et al., 2016) and may account for up to 12% of the genetic component of CLP in some populations (Zucchero et al., 2004). SP8 is a transcription factor in which mutations display severe craniofacial malformations including cleft palate (Kasberg, Brunskill, & Steven Potter, 2013).

Genes unrelated to folate metabolism

Three additional genes involved in craniofacial development also displayed striking case‐overrepresentation of missense variants: BMP4, BMPR1B, and WNT9B (Table 3). WNT9B and BMP4 are secreted signaling proteins regulating critical interacting signaling pathways in craniofacial development (Alexander, Piloto, Le Pabic, & Schilling, 2014). BMPR1B encodes a receptor for BMP4 and other bone morphogens. Significantly, all three genes have been implicated in orofacial cleft risk. BMP4 is one of the most promising candidate genes for CLP, with evidence shown in animal experiments (e.g., Liu et al., 2005) and association studies (e.g., Chen et al., 2014). Furthermore, a gene sequencing study (Suzuki et al., 2009) demonstrated that rare BMP4 coding mutations were enriched in CLP cases at similar frequencies to those shown here. Only the R162Q case‐specific nonsynonymous mutation was identified in both studies. Although rare coding variants in WNT9B or BMPR1B have yet to be reported as contributing to CLP risk, these genes have been implicated by other means. WNT9B is at a locus identified in a genome‐wide association study (Yu et al., 2017) and a murine model demonstrated that transcriptional repression of Wnt9b via epigenetic mechanisms leads to the development of cleft lip in mouse embryos (Juriloff et al., 2014). BMPR1B is at a locus associated with CLP by linkage analysis (Schultz et al., 2004) and has been identified as a genetic cause of Pierre Robin syndrome, in which cleft palate is a prominent component (Yang et al., 2017).

Rare versus private mutations

Of all the variants described in Tables 2 and 3, most have been seen before in the population, as measured by their annotation in the Genome Aggregation Database (www.gnomad.broadinstitute.org). Novel variants in this study are restricted to two missense variants in BMPR1B (P23S and V67D), and three putative loss‐of‐function truncations (DMGDH W495*, IRF6 Q112*, SP8 S261*). The frameshift allele in GGH (insT at L190) was previously observed once in over 250,000 alleles. All others have multiple occurrences, though at significantly lower frequencies in the general population than seen here in CLP cases. However, most of our observations are singletons (with a notable exception being the K400fs allele of BHMT), precluding true case frequency estimates. If rare alleles such as these contribute to CLP risk, it is likely that they will be uncovered only by sequencing/discovery approaches rather than array‐based genotyping, even for large array sizes. Furthermore, the degree to which risk loci identified by traditional association studies will reveal loci with underlying rare variant burdens (as we have suggested for the BHMT/BHMT2/DMGDH gene cluster) is still unknown. For example, Leslie et al. (2015) failed to find convincing evidence of CLP‐related rare allele burdens at GWAS loci in 1500 trios. It may be that the underlying genetic heterogeneity for CLP risk is sufficiently complex that many such burdened loci do not display genome‐wide significance in traditional association studies. Indeed, nearly all variants discussed here were heterozygous and none co‐occurred with any others. Thus, if they are involved in CLP risk, they must be part of a more complex etiology.

Strengths/limitations

Strengths of this study lie in the case–control design, which allows for a broader sampling of rare alleles than family‐based studies, and the assurance of accuracy in the sequencing data, which is essential for rare variant discovery. Although CLP risk loci identified in this study are consistent with previous studies and with biological mechanisms, the study is limited by both sample size and the ability to draw reliable associations from rare variants. Thus, statistical measures used here are not a true measure of association but of highlighting genes with differential case burdens. Therefore, replication of these findings in additional, larger populations is needed to afford significance of the potential associations observed here. Another limitation is that the functional impact of variants is unknown. We have assumed that the striking mutation skews are indicative of a contribution to CLP etiology. In support of this assumption, we used two computational algorithms to predict the functional impact of all missense mutations identified from the five genes in Table 3: CADD (Rentzsch, Witten, Cooper, Shendure, & Kircher, 2019) and PolyPhen2 (Adzhubei et al., 2010). Both tools estimated that mutations occurring only in cases were significantly enriched for deleterious changes when compared against variants in the same genes seen in controls (Figure S3). Nevertheless, given the potential fallibility of predictive algorithms, we await empirical determination of functional impact to enable more robust mechanistic conclusions with respect to phenotype. Despite these limitations, these results are consistent with several reports implicating rare coding variants in CLP risk. The clustering of risk genes reinforces focus on onecarbon metabolism and the BMP/WNT signaling pathways.

CONFLICT OF INTEREST

The authors declare no conflict of interest. Figure S1 Target exon coverage in cases vs. controls. Click here for additional data file. Figure S2 Sanger sequencing trace verification of BHMT K400 fs allele Click here for additional data file. Figure S3 Functional predictions for case‐only protein‐altering variants Click here for additional data file. Table S1 Gene/exon annotations; Sequencing coverage/failures. Click here for additional data file. Table S2 Genotype concordance between exon sequencing and TaqMan allelic discrimination assay. Click here for additional data file. Table S3 Rare variant verification by resequencing gDNA prior to whole genome amplification. Click here for additional data file. Table S4 Annotations and genotype distributions for 773 variants scored in this study. Click here for additional data file.
  43 in total

1.  A genome-wide association study of cleft lip with and without cleft palate identifies risk variants near MAFB and ABCA4.

Authors:  Terri H Beaty; Jeffrey C Murray; Mary L Marazita; Ronald G Munger; Ingo Ruczinski; Jacqueline B Hetmanski; Kung Yee Liang; Tao Wu; Tanda Murray; M Daniele Fallin; Richard A Redett; Gerald Raymond; Holger Schwender; Sheng-Chih Jin; Margaret E Cooper; Martine Dunnwald; Maria A Mansilla; Elizabeth Leslie; Stephen Bullard; Andrew C Lidral; Lina M Moreno; Renato Menezes; Alexandre R Vieira; Aline Petrin; Allen J Wilcox; Rolv T Lie; Ethylin W Jabs; Yah Huei Wu-Chou; Philip K Chen; Hong Wang; Xiaoqian Ye; Shangzhi Huang; Vincent Yeow; Samuel S Chong; Sun Ha Jee; Bing Shi; Kaare Christensen; Mads Melbye; Kimberly F Doheny; Elizabeth W Pugh; Hua Ling; Eduardo E Castilla; Andrew E Czeizel; Lian Ma; L Leigh Field; Lawrence Brody; Faith Pangilinan; James L Mills; Anne M Molloy; Peadar N Kirke; John M Scott; James M Scott; Mauricio Arcos-Burgos; Alan F Scott
Journal:  Nat Genet       Date:  2010-05-02       Impact factor: 38.330

2.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

3.  Key susceptibility locus for nonsyndromic cleft lip with or without cleft palate on chromosome 8q24.

Authors:  Stefanie Birnbaum; Kerstin U Ludwig; Heiko Reutter; Stefan Herms; Michael Steffens; Michele Rubini; Carlotta Baluardo; Melissa Ferrian; Nilma Almeida de Assis; Margrieta A Alblas; Sandra Barth; Jan Freudenberg; Carola Lauster; Gül Schmidt; Martin Scheer; Bert Braumann; Stefaan J Bergé; Rudolf H Reich; Franziska Schiefke; Alexander Hemprich; Simone Pötzsch; Regine P Steegers-Theunissen; Bernd Pötzsch; Susanne Moebus; Bernhard Horsthemke; Franz-Josef Kramer; Thomas F Wienker; Peter A Mossey; Peter Propping; Sven Cichon; Per Hoffmann; Michael Knapp; Markus M Nöthen; Elisabeth Mangold
Journal:  Nat Genet       Date:  2009-03-08       Impact factor: 38.330

Review 4.  Maternal Folic Acid Supplementation and the Risk of Oral Clefts in Offspring.

Authors:  Arezoo Jahanbin; Elaheh Shadkam; Hamid Heidarian Miri; Alireza Sarraf Shirazi; Mostafa Abtahi
Journal:  J Craniofac Surg       Date:  2018-09       Impact factor: 1.046

5.  Targeted scan of fifteen regions for nonsyndromic cleft lip and palate in Filipino families.

Authors:  R E Schultz; M E Cooper; S Daack-Hirsch; M Shi; B Nepomucena; K A Graf; E K O'Brien; S E O'Brien; M L Marazita; J C Murray
Journal:  Am J Med Genet A       Date:  2004-02-15       Impact factor: 2.802

6.  Genome-wide meta-analyses of nonsyndromic cleft lip with or without cleft palate identify six new risk loci.

Authors:  Kerstin U Ludwig; Elisabeth Mangold; Stefan Herms; Stefanie Nowak; Heiko Reutter; Anna Paul; Jessica Becker; Ruth Herberz; Taofik AlChawa; Entessar Nasser; Anne C Böhmer; Manuel Mattheisen; Margrieta A Alblas; Sandra Barth; Nadine Kluck; Carola Lauster; Bert Braumann; Rudolf H Reich; Alexander Hemprich; Simone Pötzsch; Bettina Blaumeiser; Nikolaos Daratsianos; Thomas Kreusch; Jeffrey C Murray; Mary L Marazita; Ingo Ruczinski; Alan F Scott; Terri H Beaty; Franz-Josef Kramer; Thomas F Wienker; Regine P Steegers-Theunissen; Michele Rubini; Peter A Mossey; Per Hoffmann; Christoph Lange; Sven Cichon; Peter Propping; Michael Knapp; Markus M Nöthen
Journal:  Nat Genet       Date:  2012-08-05       Impact factor: 38.330

7.  Risks of orofacial clefts in children born to women using multivitamins containing folic acid periconceptionally.

Authors:  G M Shaw; E J Lammer; C R Wasserman; C D O'Malley; M M Tolarova
Journal:  Lancet       Date:  1995-08-12       Impact factor: 79.321

8.  Genome-wide analyses of non-syndromic cleft lip with palate identify 14 novel loci and genetic heterogeneity.

Authors:  Yanqin Yu; Xianbo Zuo; Miao He; Jinping Gao; Yuchuan Fu; Chuanqi Qin; Liuyan Meng; Wenjun Wang; Yaling Song; Yong Cheng; Fusheng Zhou; Gang Chen; Xiaodong Zheng; Xinhuan Wang; Bo Liang; Zhengwei Zhu; Xiazhou Fu; Yujun Sheng; Jiebing Hao; Zhongyin Liu; Hansong Yan; Elisabeth Mangold; Ingo Ruczinski; Jianjun Liu; Mary L Marazita; Kerstin U Ludwig; Terri H Beaty; Xuejun Zhang; Liangdan Sun; Zhuan Bian
Journal:  Nat Commun       Date:  2017-02-24       Impact factor: 14.919

9.  Rescue of cleft palate in Msx1-deficient mice by transgenic Bmp4 reveals a network of BMP and Shh signaling in the regulation of mammalian palatogenesis.

Authors:  Zunyi Zhang; Yiqiang Song; Xiang Zhao; Xiaoyun Zhang; Cesar Fermin; YiPing Chen
Journal:  Development       Date:  2002-09       Impact factor: 6.868

10.  A multi-ethnic genome-wide association study identifies novel loci for non-syndromic cleft lip with or without cleft palate on 2p24.2, 17q23 and 19q13.

Authors:  Elizabeth J Leslie; Jenna C Carlson; John R Shaffer; Eleanor Feingold; George Wehby; Cecelia A Laurie; Deepti Jain; Cathy C Laurie; Kimberly F Doheny; Toby McHenry; Judith Resick; Carla Sanchez; Jennifer Jacobs; Beth Emanuele; Alexandre R Vieira; Katherine Neiswanger; Andrew C Lidral; Luz Consuelo Valencia-Ramirez; Ana Maria Lopez-Palacio; Dora Rivera Valencia; Mauricio Arcos-Burgos; Andrew E Czeizel; L Leigh Field; Carmencita D Padilla; Eva Maria C Cutiongco-de la Paz; Frederic Deleyiannis; Kaare Christensen; Ronald G Munger; Rolv T Lie; Allen Wilcox; Paul A Romitti; Eduardo E Castilla; Juan C Mereb; Fernando A Poletta; Iêda M Orioli; Flavia M Carvalho; Jacqueline T Hecht; Susan H Blanton; Carmen J Buxó; Azeez Butali; Peter A Mossey; Wasiu L Adeyemo; Olutayo James; Ramat O Braimah; Babatunde S Aregbesola; Mekonen A Eshete; Fikre Abate; Mine Koruyucu; Figen Seymen; Lian Ma; Javier Enríquez de Salamanca; Seth M Weinberg; Lina Moreno; Jeffrey C Murray; Mary L Marazita
Journal:  Hum Mol Genet       Date:  2016-03-30       Impact factor: 5.121

View more
  4 in total

1.  Identification of novel susceptibility genes for non-syndromic cleft lip with or without cleft palate using NGS-based multigene panel testing.

Authors:  Justyna Dąbrowska; Barbara Biedziak; Anna Szponar-Żurowska; Margareta Budner; Paweł P Jagodziński; Rafał Płoski; Adrianna Mostowska
Journal:  Mol Genet Genomics       Date:  2022-07-01       Impact factor: 2.980

2.  Genetic variants in S-adenosyl-methionine synthesis pathway and nonsyndromic cleft lip with or without cleft palate in Chile.

Authors:  Carlos Salamanca; Patricio González-Hormazábal; Andrea S Recabarren; Pamela A Recabarren; Roberto Pantoja; Noemi Leiva; Rosa Pardo; José Suazo
Journal:  Pediatr Res       Date:  2020-06-03       Impact factor: 3.756

3.  Large-scale sequencing of flatfish genomes provides insights into the polyphyletic origin of their specialized body plan.

Authors:  Zhenming Lü; Li Gong; Yandong Ren; Yongjiu Chen; Zhongkai Wang; Liqin Liu; Haorong Li; Xianqing Chen; Zhenzhu Li; Hairong Luo; Hui Jiang; Yan Zeng; Yifan Wang; Kun Wang; Chen Zhang; Haifeng Jiang; Wenting Wan; Yanli Qin; Jianshe Zhang; Liang Zhu; Wei Shi; Shunping He; Bingyu Mao; Wen Wang; Xiaoyu Kong; Yongxin Li
Journal:  Nat Genet       Date:  2021-04-19       Impact factor: 38.330

4.  Integrative approaches generate insights into the architecture of non-syndromic cleft lip with or without cleft palate.

Authors:  Julia Welzenbach; Nigel L Hammond; Miloš Nikolić; Frederic Thieme; Nina Ishorst; Elizabeth J Leslie; Seth M Weinberg; Terri H Beaty; Mary L Marazita; Elisabeth Mangold; Michael Knapp; Justin Cotney; Alvaro Rada-Iglesias; Michael J Dixon; Kerstin U Ludwig
Journal:  HGG Adv       Date:  2021-06-08
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.