| Literature DB >> 28087736 |
Kerstin U Ludwig1,2, Anne C Böhmer1,2, John Bowes3, Miloš Nikolic4,5, Nina Ishorst1,2, Niki Wyatt6, Nigel L Hammond6, Lina Gölz7, Frederic Thieme1,2, Sandra Barth1,2, Hannah Schuenke1,2, Johanna Klamt1,2, Malte Spielmann8,9,10, Khalid Aldhorae11, Augusto Rojas-Martinez12, Markus M Nöthen1,2, Alvaro Rada-Iglesias4,5, Michael J Dixon6, Michael Knapp13, Elisabeth Mangold1.
Abstract
Nonsyndromic cleft lip with or without cleft palate (nsCL/P) is among the most common human birth defects with multifactorial etiology. Here, we present results from a genome-wide imputation study of nsCL/P in which, after adding replication cohort data, four novel risk loci for nsCL/P are identified (at chromosomal regions 2p21, 14q22, 15q24 and 19p13). On a systematic level, we show that the association signals within this high-density dataset are enriched in functionally-relevant genomic regions that are active in both human neural crest cells (hNCC) and mouse embryonic craniofacial tissue. This enrichment is also detectable in hNCC regions primed for later activity. Using GCTA analyses, we suggest that 30% of the estimated variance in risk for nsCL/P in the European population can be attributed to common variants, with 25.5% contributed to by the 24 risk loci known to date. For each of these, we identify credible SNPs using a Bayesian refinement approach, with two loci harbouring only one probable causal variant. Finally, we demonstrate that there is no polygenic component of nsCL/P detectable that is shared with nonsyndromic cleft palate only (nsCPO). Our data suggest that, while common variants are strongly contributing to risk for nsCL/P, they do not seem to be involved in nsCPO which might be more often caused by rare deleterious variants. Our study generates novel insights into both nsCL/P and nsCPO etiology and provides a systematic framework for research into craniofacial development and malformation.Entities:
Mesh:
Year: 2017 PMID: 28087736 PMCID: PMC5409059 DOI: 10.1093/hmg/ddx012
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Association results for novel nsCL/P risk loci detected by imputation and replication
| Locus | Lead SNP | Chromosomal position | Alleles | GWAS_imputed | Combined analysis | |||
|---|---|---|---|---|---|---|---|---|
| (a) Loci identified with genome-wide significance by imputation and replication | ||||||||
| 2p21 | rs6740960 | 42181679 | 3.29E-09 | 1.68E-09 | 3.32E-04 | |||
| 14q22 | rs4901118 | 51856109 | 7.51E-07 | 5.44E-09 | 2.56E-02 | 2.00E-07 | ||
| 19p13 | rs3746101 | 2050823 | 2.30E-07 | 4.52E-04 | 3.17E-02 | 8.32E-05 | ||
| (b) Loci identified with genome-wide significance by combining imputed Bonn GWAS dataset and ( | ||||||||
| 15q24 | rs28689146 | 75005575 | 6.54E-03 (3.6E-02) | 2.35E-02 | 5.4E-08 | n.a. | ||
aDefined as lowest P-value in genome-wide imputation.
bRisk allele in bold.
cP-value for Bonn replication cohort only.
dNumber in brackets indicates P-value for rs28689146 in Bonn GWAS case-control cohort. This cohort was used for in silico combination with replication cohort of Leslie as there were overlapping individuals between Leslie et al. (11) and Beaty et al. (2).
eP-value from Leslie et al. 2016 (11).
fNo summary statistics for rs28689146 in Leslie et al. (11) for the European dataset.
Figure 1Regional association plots of four novel risk loci. Using genome-wide imputed data, in vitro and in silico replications, four novel risk loci for nsCL/P were identified. Plotted SNPs include both genotyped and imputed variants, respectively. (A) Two regions on chromosomes 2p21 and 14q22 reached genome-wide significance in the imputation analysis of metaall and further decreased in P-values when replication data from a mixed ethnicity case-control cohort were added. (B) Analysis of metaEuro and the European replication cohort revealed genome-wide significance for a region on chromosome 19p13. (C) In silico replication of previously suggestive findings at chromosome 15q24 (11) using the metaall data revealed genome-wide significance for rs28689146. A second variant (rs11072496, bottom panel) fails to reach this threshold but improves capture of the haplotype structure. Plots were generated using LocusZoom (26).
Analysis of credible SNPs at 24 nsCL/P risk regions
| Lead SNP | Closest gene | Chr | N SNPs | Length of region (bp) | SNP region 95% posterior probability | SNP region 99% posterior probability | References | Genotyped top SNP included | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Start | End | Interval (bp) | Start | End | Interval (bp) | |||||||||
| rs742071 | PAX7 | 1 | 317 | 110514 | 21 | 18939509 | 18992553 | 53044 | 33 | 18925512 | 18992583 | 67071 | Ludwig | Yes |
| rs560426 | ABCA4 | 1 | 124 | 41257 | 6 | 94539386 | 94551450 | 12064 | 9 | 94539386 | 94570218 | 30832 | Beaty | No |
| rs642961 | IRF6_DIEXF | 1 | 1373 | 584708 | 18 | 209950760 | 210022893 | 72133 | 25 | 209950760 | 210022893 | 72133 | Rahimov | Yes |
| rs4441471 | AC104623.2 | 2 | 536 | 210141 | 68 | 16706771 | 16735432 | 28661 | 77 | 16706771 | 16735432 | 28661 | Leslie | – |
| rs6740960 | C2orf91 | 2 | 758 | 174921 | 1 | 42181679 | 42181679 | 0 | 2 | 42181679 | 42198217 | 16538 | – | |
| rs7590268 | THADA | 2 | 1036 | 406228 | 9 | 43501585 | 43670144 | 168559 | 31 | 43453086 | 43839244 | 386158 | Ludwig | Yes |
| rs7632427 | EPHA3 | 3 | 1902 | 843457 | 158 | 89273019 | 89640350 | 367331 | 238 | 89273019 | 89751353 | 478334 | Ludwig | Yes |
| rs1384062 | FILIP1L_CMSS1 | 3 | 1222 | 722547 | 75 | 99436424 | 100088929 | 652505 | 175 | 99428998 | 100097616 | 668618 | Beaty | Yes |
| rs12543318 | DCAF4L2 | 8 | 1446 | 546749 | 2 | 88868340 | 88887382 | 19042 | 2 | 88868340 | 88887382 | 19042 | Ludwig | Yes |
| rs987525 | LINC00976 | 8 | 953 | 318710 | 3 | 129964873 | 129990382 | 25509 | 4 | 129958384 | 129990382 | 31998 | Birnbaum | No |
| rs3758249 | FOXE1 | 9 | 716 | 290739 | 88 | 100596439 | 100667871 | 71432 | 123 | 100585506 | 100670272 | 84766 | Moreno | Yes |
| rs4752028 | KIAA1598 | 10 | 466 | 365794 | 23 | 118786985 | 118890006 | 103021 | 28 | 118786985 | 118890006 | 103021 | Mangold | Yes |
| rs8001641 | LINC01080_SPRY2 | 13 | 498 | 167219 | 6 | 80697449 | 80701485 | 4036 | 11 | 80691352 | 80701485 | 10133 | Ludwig | No |
| rs4901118 | RP11-255G12.2 | 14 | 476 | 134026 | 2 | 51856109 | 51856566 | 457 | 2 | 51856109 | 51856566 | 457 | – | |
| rs1258763 | GREM1_FMN1 | 15 | 482 | 99614 | 13 | 33043284 | 33054523 | 11239 | 19 | 33043284 | 33054523 | 11239 | Ludwig | Yes |
| rs1873147 | TPM1 | 15 | 385 | 101368 | 13 | 63311425 | 63314519 | 3094 | 14 | 63311425 | 63314519 | 3094 | Ludwig | Yes |
| rs57490152 | CSK | 15 | 1721 | 818259 | 287 | 74686398 | 75231234 | 544836 | 748 | 74632243 | 75446488 | 814245 | – | |
| rs8049367 | CREBBP_ADCY9 | 16 | 217 | 43706 | 22 | 3968115 | 3982627 | 14512 | 33 | 3963187 | 3997415 | 34228 | Sun | Yes |
| rs1880646 | NTN1 | 17 | 245 | 77824 | 1 | 8919415 | 8919415 | 0 | 1 | 8919415 | 8919415 | 0 | Beaty | No |
| rs227731 | NOG | 17 | 215 | 87850 | 4 | 54773238 | 54777585 | 4347 | 5 | 54770864 | 54777585 | 6721 | Mangold | Yes |
| rs8071332 | TANC2 | 17 | 1079 | 618627 | 46 | 61043405 | 61389844 | 346439 | 73 | 61043405 | 61405359 | 361954 | Leslie | – |
| rs3746101 | MKNK2 | 19 | 208 | 74094 | 4 | 2048281 | 2051261 | 2980 | 10 | 2038819 | 2051261 | 12442 | – | |
| rs8113265 | SLC7A9 | 19 | 1410 | 365251 | 185 | 33132098 | 33495231 | 363133 | 691 | 33132098 | 33497141 | 365043 | Leslie | – |
| rs13041247 | MAFB | 20 | 390 | 146327 | 41 | 39238736 | 39286786 | 48050 | 62 | 39238736 | 39286786 | 48050 | Beaty | Yes |
N—number of SNPs in the SNP region interval.
aThis column refers to whether the originally presented lead SNP from the respective GWAS (i.e. genotyped SNP), is among the credible SNP set. In studies marked by ‘–’, imputation had already been performed.
bIdentified with lowest P-value in metaEuro dataset.
Co-localization analyses of nsCL/P imputation analyses results in autosomal regions of functionally annotated datasets
| Study name | Annotation | Number regions | Number SNPs | Enrichment metaEuro | Enrichment metaall | References | ||
|---|---|---|---|---|---|---|---|---|
| Neural crest cell line (NCC) | Active enhancer | 4293 | 6886 | 0.0473 | Rada-Iglesias | |||
| Mouse craniofacial tissue E11.5 (MCT_P) | Active promoter | 4250 | 15 297 | 0.002 | 0.03 | 0.041 | Attanasio | |
| Mouse craniofacial tissue E11.5 (MCT_E) | Active enhancer | 7068 | 24 743 | 0.004 | Attanasio | |||
| Neural crest cell line (Active and primed CRMs) | Active and primed enhancers | 22 792 | 180 810 | Data from Rada-Iglesias | ||||
| Neural crest cell line (Active CRMs) | Active enhancers | 16 177 | 141 231 | Data from Rada-Iglesias | ||||
| Mouse limb tissue E11.5 (CON1) | Active enhancer | 2030 | 3713 | 0.10 | 0.18 | 0.30 | 0.35 | Visel |
| IRF6 binding sites in keratinocytes (CON2) | IRF6 binding sites | 3893 | 1219 | 0.6 | 0.26 | 0.005 | 0.19 | Botti |
CON—control datasets, CRM—cis-regulatory module.
aP-values one-sided; italics if significant after correction for multiple testing (P < 0.00156).
Figure 2Polygenic risk score analysis in nsCL/P and nsCPO. The amount of variance explained in the target samples based on the polygenic score derived from the Bonn case-control GWAS nsCL/P discovery sample: (A) for the nsCL/P European trio sample; (B) for the nsCPO European trio sample. The amount of variance explained, denoted as Nagelkerke’s pseudo R2, is shown on the y-axis. Different association test P-value thresholds (PT) in the discovery dataset are represented by different colours. Results are shown for both pruned and unpruned analyses, respectively.