| Literature DB >> 26853712 |
Yong-Biao Zhang1,2, Jintian Hu2, Jiao Zhang2,3, Xu Zhou2, Xin Li4, Chaohao Gu1, Tun Liu2, Yangchun Xie2, Jiqiang Liu5, Mingliang Gu1, Panpan Wang1, Tingting Wu1, Jin Qian2, Yue Wang2, Xiaoqun Dong6, Jun Yu1, Qingguo Zhang2.
Abstract
Craniofacial microsomia (CFM) is a rare congenital anomaly that involves immature derivatives from the first and second pharyngeal arches. The genetic pathogenesis of CFM is still unclear. Here we interrogate 0.9 million genetic variants in 939 CFM cases and 2,012 controls from China. After genotyping of an additional 443 cases and 1,669 controls, we identify 8 significantly associated loci with the most significant SNP rs13089920 (logistic regression P=2.15 × 10(-120)) and 5 suggestive loci. The above 13 associated loci, harboured by candidates of ROBO1, GATA3, GBX2, FGF3, NRP2, EDNRB, SHROOM3, SEMA7A, PLCD3, KLF12 and EPAS1, are found to be enriched for genes involved in neural crest cell (NCC) development and vasculogenesis. We then perform whole-genome sequencing on 21 samples from the case cohort, and identify several novel loss-of-function mutations within the associated loci. Our results provide new insights into genetic background of craniofacial microsomia.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26853712 PMCID: PMC4748111 DOI: 10.1038/ncomms10605
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Manhattan plots of the P values calculated from the genome-wide association study at the discovery stage.
(a) Data were collected from 939 cases with craniofacial microsomia and 2,012 controls on 792,342 SNPs that had passed the quality control. The −log10(logistic regression P value) of each SNP is shown as a function of genomic position on the autosomes (hg19). Genome-wide significance (solid red line; P≤6.3 × 10−8) and suggestive significance (solid blue line; P≤1 × 10−5) are denoted. (b–h) Regional plots shows the association of craniofacial microsomia risk with all significant loci, continuous genomic regions surrounding the lead SNPs, including 2p21 (b), 2q37.2 (c), 3p12.3 (d), 10p14 (e), 11q13.3 (f), 13q22.1 (g) and 15q24.1 (h). Each point represents a SNP plotted with its −log10P value as a function of genomic position (hg19). Imputation analysis is shown with circles and direct genotyping with squares. In each regional plot, the purple symbol denotes the lead SNP, showing its name on the top of each plot. The colour coding of the rest of the SNPs showed in LD with the lead SNP: red, r2≥0.8; gold, 0.6≤r2<0.8; green, 0.4≤r2<0.6; cyan, 0.2≤r2<0.4; blue, r2<0.2; grey, r2 unknown. Recombination rates were estimated from ASN population of 1KG project (Mar 2012). Gene annotations were taken from the UCSC genome browser.
Most significantly associated risk variants with craniofacial microsomia.
| Chr. (band) | Lead SNPs | Position | Alleles | GWAS (939 versus 2,012) | Replication (443 versus 1,669) | Combined (1,382 versus 3,681) | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| OR (95% CI) | OR (95% CI) | Freq. | OR (95% CI) | |||||||
| 2p21 | rs17802111 | 46509657 | A/G | 1.44 × 10−11 | 1.48 (1.32–1.66) | 5.45 × 10−7 | 1.47 (1.26–1.71) | 0.46 | 9.57 × 10−18 | 1.48 (1.35–1.62) |
| 2q33.3 | rs7420812 | 206435709 | G/A | 2.01 × 10−6 | 1.33 (1.18–1.49 ) | 1.32 × 10−4 | 1.35 (1.16–1.58) | 0.37 | 6.74 × 10−10 | 1.33 (1.22–1.46) |
| 2q37.2 | rs3754648 | 237021346 | A/G | 6.33 × 10−9 | 1.41 (1.26–1.58) | 2.53 × 10−5 | 1.38 (1.19–1.61) | 0.42 | 5.09 × 10−13 | 1.39 (1.27–1.53) |
| 3p12.3 | rs13089920 | 78552232 | G/A | 1.06 × 10−70 | 5.21 (4.34–6.25) | 2.62 × 10−46 | 5.05 (4.05–6.31) | 0.25 | 2.15 × 10−120 | 5.18 (4.51–5.95) |
| 10p14 | rs10905359 | 8449891 | A/C | 6.58 × 10−9 | 0.71 (0.63–0.80) | 2.98 × 10−3 | 0.79 (0.68–0.92) | 0.33 | 5.11 × 10−9 | 0.76 (0.69–0.83) |
| 11q13.3 | rs11263613 | 69661334 | A/G | 7.91 × 10−12 | 1.71 (1.47–2.00) | 5.44 × 10−6 | 1.60 (1.31–1.96) | 0.19 | 3.61 × 10−17 | 1.68 (1.49–1.89) |
| 15q24.1 | rs10459648 | 74865440 | A/G | 2.86 × 10−12 | 0.67 (0.59–0.75) | 1.12 × 10−10 | 0.60 (0.52–0.70) | 0.40 | 1.05 × 10−23 | 0.63 (0.58–0.69) |
| 13q22.1 | rs17090300 | 74157451 | A/G | 6.26 × 10−9 | 2.42 (1.80–3.26) | 1.36 × 10−3 | 2.04 (1.32–3.15) | 1.04 × 10−11 | 2.31 (1.81–2.93) | |
| 4q21.1 | rs3923380 | 77468594 | C/A | 1.05 × 10−6 | 0.75 (0.67–0.84 ) | 5.32 × 10−2 | 0.86 (0.74–1.00) | 0.38 | 8.19 × 10−8 | 0.78 (0.71–0.86) |
| 10p13 | rs4750407 | 13795471 | G/A | 3.02 × 10−6 | 1.32 (1.18–1.49) | 2.69 × 10−2 | 1.19 (1.02–1.39) | 0.36 | 2.49 × 10−7 | 1.27 (1.16–1.40) |
| 13q22.3 | rs9574113 | 78418131 | G/A | 4.12 × 10−6 | 0.73 (0.64–0.84 ) | 8.62 × 10−2 | 0.86 (0.72–1.02) | 0.22 | 8.16 × 10−6 | 0.79 (0.71–0.88) |
| 14q22.1 | rs754423 | 52527187 | G/A | 4.14 × 10−7 | 1.32 (1.18–1.47 ) | 2.32 × 10−2 | 1.19 (1.02–1.37) | 0.53 | 1.19 × 10−7 | 1.27 (1.16–1.38) |
| 17q21.31 | rs7222240 | 43136195 | A/G | 6.13 × 10−6 | 2.01 (1.48–2.75 ) | 1.05 × 10−1 | 1.39 (0.93–2.08) | 0.04 | 9.49 × 10−6 | 1.73 (1.36–2.20) |
Chr. (band), cytogenetic band; CI, confidence interval; DEPICT, candidate genes predicted by DEPICT; Freq., frequency of the effect allele in cases; nearby genes, genes (including LincRNAs (large intergenic non-coding RNAs)) spanning or flanking (<200-kb away from) the lead SNP from UCSC genome browser; GRAIL, candidate genes predicted by GRAIL; GWAS, genome-wide association study; NCC; neural crest cell; OR, odds ratio for the minor allele; Position, physical position of human genome version of hg19; SNP, (single-nucloetide polymorphism) rsID of the lead variant; Varexpl, variance in liability to microtia explained by the locus at the prevalence rate of 1.4/10,000 in China.
*The former is for cases number and the latter is for controls number.
Figure 2Graphic display of the similarity among the 25 gene ontology terms and their P values.
The P value (Fisher's exact test) of each term and kappa similarity among terms were derived from the Database for Annotation, Visualization and Integrated Discovery (DAVID). The 25 nodes represents 25 gene ontology terms. The P values of the 25 nodes are indicated by the gradations in the colour red. The similarities between them are indicated by edges scaled according to their correlation (only correlations with a Kappa >0.5 are shown; the correlation are divided into 5 levels equally from 0.5 to 1) and node size represents connection times among nodes.