| Literature DB >> 33898419 |
Nandita Mukhopadhyay1, Eleanor Feingold1,2,3, Lina Moreno-Uribe4, George Wehby5, Luz Consuelo Valencia-Ramirez6, Claudia P Restrepo Muñeton6, Carmencita Padilla7, Frederic Deleyiannis8, Kaare Christensen9, Fernando A Poletta10, Ieda M Orioli11,12, Jacqueline T Hecht13, Carmen J Buxó14, Azeez Butali15, Wasiu L Adeyemo16, Alexandre R Vieira1, John R Shaffer1,3, Jeffrey C Murray17, Seth M Weinberg1,3, Elizabeth J Leslie18, Mary L Marazita1,3,19.
Abstract
Orofacial clefts (OFCs) are among the most prevalent craniofacial birth defects worldwide and create a significant public health burden. The majority of OFCs are non-syndromic and vary in prevalence by ethnicity. Africans have the lowest prevalence of OFCs (~ 1/2,500), Asians have the highest prevalence (~1/500), Europeans and Latin Americans lie somewhere in the middle (~1/800 and 1/900, respectively). Thus, ethnicity appears to be a major determinant of the risk of developing OFC. The Pittsburgh Orofacial Clefts Multiethnic study was designed to explore this ethnic variance, comprising a large number of families and individuals (~12,000 individuals) from multiple populations worldwide: US and Europe, Asians, mixed Native American/Caucasians, and Africans. In this current study, we analyzed 2,915 OFC cases, 6,044 unaffected individuals related to the OFC cases, and 2,685 controls with no personal or family history of OFC. Participants were grouped by their ancestry into African, Asian, European, and Central and South American subsets, and genome-wide association run on the combined sample as well as the four ancestry-based groups. We observed 22 associations to cleft lip with or without cleft palate at 18 distinct loci with p-values < 1e-06, including 10 with genome-wide significance (<5e-08), in the combined sample and within ancestry groups. Three loci - 2p12 (rs62164740, p = 6.27e-07), 10q22.2 (rs150952246, p = 3.14e-07), and 10q24.32 (rs118107597, p = 8.21e-07) are novel. Nine were in or near known OFC loci - PAX7, IRF6, FAM49A, DCAF4L2, 8q24.21, NTN1, WNT3-WNT9B, TANC2, and RHPN2. The majority of the associations were observed only in the combined sample, European, and Central and South American groups. We investigated whether the observed differences in association strength were (a) purely due to sample sizes, (b) due to systematic allele frequency difference at the population level, or (c) due to the fact certain OFC-causing variants confer different amounts of risk depending on ancestral origin, by comparing effect sizes to observed allele frequencies of the effect allele in our ancestry-based groups. While some of the associations differ due to systematic differences in allele frequencies between groups, others show variation in effect size despite similar frequencies across ancestry groups.Entities:
Keywords: GWAS; family study; genetic factors in OFC; genetic heterogeneity; multiethnic
Year: 2021 PMID: 33898419 PMCID: PMC8062975 DOI: 10.3389/fcell.2021.621482
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
Study sample characteristics.
| Total pedigrees | 227 | 484 | 1,271 | 1,433 | 3,422 |
| CL/P pedigrees | 152 | 320 | 507 | 944 | 1,925 |
| Median CL/P pedigree size | 1 | 1 | 1 | 1 | 3 |
| Maximum CL/P pedigree size | 9 | 34 | 35 | 27 | 35 |
| Unaffected pedigrees | 75 | 164 | 764 | 489 | 1,498 |
| Median unaffected pedigree size | 1 | 1 | 1 | 3 | 1 |
| Maximum unaffected pedigree size | 5 | 4 | 15 | 21 | 21 |
| Total individuals | 428 | 1,691 | 3,273 | 4,038 | 9,447 |
| Total unaffected individuals | 274 | 1,246 | 2,704 | 2,988 | 7,226 |
| Total affected individuals | 154 | 445 | 569 | 1,050 | 2,221 |
| Median number of affected members per CL/P pedigree | 1 | 1 | 1 | 1 | 1 |
| Maximum number of affected per CL/P pedigree | 2 | 5 | 5 | 4 | 5 |
Figure 1Circular Manhattan plots of ALL, CSA, EUR, AFR and ASIA. Red lines are drawn at the 1e-06 significance level. Loci significant at the 1e-06 significance level are indicated by the gray dotted lines and numbered 1–18. Chromosomes are depicted by the black bands in the outermost circle.
Associations with p-values < 1e-06.
| 1 | 1p36.13 | rs9439714 (C) | 18,976,489 | 3.27E-08 | 0.15 ± 0.04 | ALL | |
| 2 | 1q32.2 | rs17015217 (A) | 209,966,629 | 1.46E-09 | −0.19 ± 0.05 | ALL, CSA | |
| 3 | 2p24.3-24.2 | rs6745357 (G) | 16,713,395 | 3.62E-10 | 0.25 ± 0.03 | ALL | |
| 5 | 5p13.2-13.1 | rs181764204 (T) | 38,205,024 | 2.63E-07 | 0.54 ± 0.10 | ALL | Huang et al., |
| 6 | 5q35.2-35.1 | rs17075892 (T) | 172,827,353 | 6.95E-08 | −0.25 ± 0.04 | ALL | Butali et al., |
| 7 | 8q21.3 | rs12543318 (C) | 88,868,340 | 7.45E-11 | 0.24 ± 0.04 | ALL, EUR | |
| 8 | 8q22.2 | rs185266751 (G) | 99,712,644 | 2.49E-07 | 0.50 ± 0.09 | ALL | Huang et al., |
| 9 | 8q24.21 | rs72728755 (A) | 129,990,382 | 2.80E-29 | 0.48 ± 0.05 | ALL, CSA, EUR | 8q24 |
| 10 | 9q33.1 | rs9408874 (T) | 118,279,489 | 2.59E-07 | 0.26 ± 0.05 | EUR | Huang et al., |
| 13 | 12q13.2 | rs34260065 (C) | 55,540,318 | 3.87E-07 | 0.12 ± 0.04 | CSA | Huang et al., |
| 14 | 17p13.1 | rs16957821 (G) | 8,948,104 | 9.12E-10 | 0.25 ± 0.04 | ALL | |
| 15 | 17q21.31-21.32 | rs7216951 (T) | 44,972,810 | 1.42E-07 | −0.20 ± 0.04 | ALL | |
| 16 | 17q23.2-23.3 | rs1588366 (G) | 61,076,428 | 1.09E-08 | −0.24 ± 0.05 | EUR | |
| 17 | 17q25.3 | rs1975866 (C) | 77,267,377 | 3.13E-08 | 0.39 ± 0.09 | AFR | Huang et al., |
| 18 | 19q13.11 | rs2003950 (A) | 33,546,283 | 1.21E-07 | 0.21 ± 0.04 | ALL |
Novel loci highlighted in bold;
Association p-values from the EMMAX program;
Effect size (Beta coefficient) and standard error of effect size from the GENESIS program.
Novel loci, closest gene, and contribution to craniofacial development.
| 2p12 | ALL, CSA | Evidence of methylation during craniofacial development | ||
| 8 | 8q22.2 | ALL, CSA | Transcription start site, upstream promoter and enhancer region | |
| 10 | 9q33.1 | EUR | Craniofacial super-enhancer region | |
| 10q22.2 | ALL, CSA | Transcription start site and promoter region | ||
| 10q24.32 | ALL, CSA | Promoter, transcription enhancer; | ||
| 13 | 12q13.2 | CSA | Craniofacial super-enhancer region; | |
| 17 | 17q25.3 | AFR | Craniofacial super-enhancer region; |
Peak numbers in bold indicate novel loci, other loci showed modest association (p < 1e-04) in GWAS of Chinese (Huang et al., .
Comparison of effect allele frequency and effect size of lead SNPs within each locus.
| 1 | 1p36.13 | 2 | Not significant | Low in ASIA | Smaller in ASIA at several SNPs | |
| 2 | 1q32.2 | 2 | Not significant | Low in EUR, AFR | Larger in EUR than CSA, ASIA | |
| 3 | 2p24.3-24.2 | 2 | Not significant | Effect allele common in all groups | Similar effect size estimates at most SNPs | |
| 4 | 2p12 | 1 | Not significant | Low in ASIA | Larger in ASIA but not statistically different | |
| 5 | 5p13.2-13.1 | 1+3 | Low in EUR, similar in others | Larger in EUR at SNPs with significant | ||
| 6 | 5q35.2-35.1 | 1 | Not significant | Similar across groups | More negative in AFR, but not statistically different | |
| 7 | 8q21.3 | 2+3 | Highly significant at a few SNPs, | Similar across groups | Smaller in ASIA at SNPs with | |
| 8 | 8q22.2 | 2 | Not significant | Common (15–50%) in CSA, a few variants rare in others (<2%). | Similar across groups | |
| 9 | 8q24.21 | 1 | Not significant | Low in ASIA (2–5%) | Similar across groups | |
| 10 | 9q33.1 | 3 | Significant | Similar across groups | Significantly larger in EUR | |
| 11 | 10q22.2 | 2 | Not significant except a few SNPs with | Rare in AFR (<2%) | Larger in AFR at SNPs at SNPs with | |
| 12 | 10q24.32 | 1 | Not significant | Low in all groups (<2–15%) | Similar across groups | |
| 13 | 12q13.2 | 3 | Common in all groups (15–45%) | Significantly larger in CSA, larger but not significantly so in AFR | ||
| 14 | 17p13.1 | 1 | Not significant except one, | Common in all groups | Significantly larger in AFR only for SNP with significant | |
| 15 | 17q21.31-21.32 | 1 | Not significant | Common in all groups (15–45%) | Similar across groups | |
| 16 | 17q23.2-23.3 | 3 | Significant, | Low in ASIA (5–15%) | Significantly larger in EUR | |
| 17 | 17q25.3 | 3 | Significant, | Common in all groups (>15%) | Significantly larger in AFR | |
| 18 | 19q13.11 | 2 | Not significant | Common in all groups | Similar effect sizes |
Figure 2Association peak in 17p13.1 (NTN1) consistent with category 1. SNPs are ordered by base-pair position in lower panels; Q-statistic p-value noted under each SNP in forest plot of effect size; Wright's FST value noted under each SNP name in heatmap of effect allele frequencies; lead variant and group with most significant p-value shown in red in heatmap.
Figure 4Association peak in 17q25.3 in the AFR group consistent with category 3. SNPs are ordered by base-pair position; Q-statistic p-value noted under each SNP in forest plot; lead variant and sample with highest p-value shown in red in heatmap, and FST values noted under each SNP.
Figure 3Association peak in 1q32.2 (IRF6) consistent with category 2. SNPs are ordered by base-pair position; Q-statistic p-value noted under each SNP in forest plot; lead variant and sample with highest p-value shown in red in heatmap and FST values noted under each SNP. The top 10 associated variants fall below our MAF filter in the EUR and ASIA groups.