| Literature DB >> 31314787 |
Anshuman Sewda1, A J Agopian1, Elizabeth Goldmuntz2,3, Hakon Hakonarson2,4, Bernice E Morrow5, Deanne Taylor2,6, Laura E Mitchell1.
Abstract
Conotruncal heart defects (CTDs) are among the most common and severe groups of congenital heart defects. Despite evidence of an inherited genetic contribution to CTDs, little is known about the specific genes that contribute to the development of CTDs. We performed gene-based genome-wide analyses using microarray-genotyped and imputed common and rare variants data from two large studies of CTDs in the United States. We performed two case-parent trio analyses (N = 640 and 317 trios), using an extension of the family-based multi-marker association test, and two case-control analyses (N = 482 and 406 patients and comparable numbers of controls), using a sequence kernel association test. We also undertook two meta-analyses to combine the results from the analyses that used the same approach (i.e. family-based or case-control). To our knowledge, these analyses are the first reported gene-based, genome-wide association studies of CTDs. Based on our findings, we propose eight CTD candidate genes (ARF5, EIF4E, KPNA1, MAP4K3, MBNL1, NCAPG, NDFUS1 and PSMG3). Four of these genes (ARF5, KPNA1, NDUFS1 and PSMG3) have not been previously associated with normal or abnormal heart development. In addition, our analyses provide additional evidence that genes involved in chromatin-modification and in ribonucleic acid splicing are associated with congenital heart defects.Entities:
Year: 2019 PMID: 31314787 PMCID: PMC6636758 DOI: 10.1371/journal.pone.0219926
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Summary of conotruncal heart defects data cohorts.
The participants were recruited from the Children’s Hospital of Philadelphia (CHOP) and the Pediatric Cardiac Genomics Consortium (PCGC). CHOP-Trios and PCGC-Trios were analyzed using eFBAT-MM whereas SKAT-C was used to analyze the two case-control cohorts (CHOP-CC1 and CHOP-CC2). The cases in CHOP-CC1 are the Caucasian subset of cases in CHOP-Trios. One of 483 Caucasian cases was excluded during QC procedures prior to SKAT-C analysis.
Characteristics of patients with conotruncal defects in the Children’s Hospital of Philadelpia (CHOP) and Pediatric Cardiac Genomics Consortium (PCGC) datasets.
| N (%) | ||||||
|---|---|---|---|---|---|---|
| CHOP-Trios / | CHOP-CC2 | PCGC-Trios | ||||
| | 483 | (75.5) | 406 | (100.0) | 244 | (70.1) |
| | 157 | (24.5) | 0 | (0.0) | 73 | (29.9) |
| | 387 | (60.5) | 236 | (58.1) | 192 | (60.6) |
| | 253 | (39.5) | 170 | (41.9) | 125 | (39.4) |
| | 250 | (39.1) | 134 | (33.0) | 104 | (32.8) |
| | 125 | (19.5) | 80 | (19.7) | 64 | (20.2) |
| | 133 | (20.8) | 109 | (26.8) | 44 | (13.9) |
| | 66 | (10.3) | 25 | (6.2) | 46 | (14.5) |
| | 30 | (4.7) | 22 | (5.4) | 7 | (2.2) |
| | 18 | (2.8) | 19 | (4.7) | 13 | (4.1) |
| | 11 | (1.7) | 10 | (2.5) | 9 | (2.8) |
| | 7 | (1.1) | 7 | (1.7) | 30 | (9.5) |
aThe cases used in CHOP-CC1 are the subset of the cases included in the CHOP-Trios (i.e. the non-Hispanic Caucasian cases, N = 483).
Summary of eFBAT-MM and SKAT-C analyses and results.
| eFBAT-MM | SKAT-C | |||
|---|---|---|---|---|
| CHOP-Trios | PCGC Trios | CHOP-CC1 | CHOP-CC2 | |
| 5,578,860 | 6,812,971 | 5,601,587 | 5,601,152 | |
| 3,446,735 | 4,502,285 | 3,502,419 | 3,495,988 | |
| 21,256 | 22,247 | 21,212 | 21,269 | |
| 1.03 | 1.04 | 1.09 | 1.09 | |
| 13 | 13 | 25 | 27 | |
a Variants with minor allele frequency <0.05
b Genes are listed by p-value (lowest to highest). Specific p-values are provided in Tables B-E in S1 File.
Fig 2Quantile-quantile plot of eFBAT-MM test gene-level meta-analysis p-values.
Fig 3Quantile-quantile plot of SKAT-C test gene-level meta-analysis p-values.
Genes with suggestive evidence of association (p<1E-03) in the trio-based meta-analysis.
| CHOP-Trios | PCGC-Trios | Meta-analysis | ||||
|---|---|---|---|---|---|---|
| Gene | Function | Number of variants | p-value | Number of variants | p-value | p-value |
| Protein coding | 2,662 | 8.9E-02 | 2,738 | 8.7E-05 | 9.8E-05 | |
| Protein coding | 534 | 3.3E-02 | 789 | 3.7E-04 | 1.5E-04 | |
| Protein coding | 37 | 4.1E-03 | 53 | 3.1E-03 | 1.6E-04 | |
| Pseudogene | 38 | 9.8E-01 | 83 | 2.3E-05 | 2.7E-04 | |
| Protein coding | 1,020 | 1.6E-03 | 1,063 | 2.1E-02 | 3.7E-04 | |
| Protein coding | 39 | 2.4E-04 | 52 | 1.5E-01 | 4.1E-04 | |
| Protein coding | 1,585 | 2.4E-02 | 1,954 | 2.0E-03 | 5.3E-04 | |
| RNA gene | 26 | 2.0E-02 | 25 | 2.9E-03 | 6.2E-04 | |
| Protein coding | 317 | 5.4E-02 | 352 | 1.5E-03 | 8.4E-04 | |
| Protein coding | 81 | 2.9E-02 | 104 | 2.9E-03 | 8.6E-04 | |
| Pseudogene | 7 | 4.2E-01 | 15 | 2.0E-04 | 8.8E-04 | |
a gene-level p-values from eFBAT-MM test
Genes with suggestive evidence of association (p<1E-03) in the case-control based meta-analysis.
| CHOP-CC1 | CHOP-CC2 | Meta-analysis | ||||
|---|---|---|---|---|---|---|
| Gene | Function | Number of variants | p-value | Number of variants | p-value | p-value |
| Protein coding | 41 | 6.5E-02 | 40 | 2.6E-05 | 2.4E-05 | |
| RNA gene | 6 | 3.9E-05 | 7 | 5.3E-02 | 2.9E-05 | |
| Protein coding | 252 | 9.9E-05 | 272 | 9.5E-02 | 1.2E-04 | |
| Protein coding | 208 | 2.5E-04 | 185 | 4.2E-02 | 1.3E-04 | |
| Protein coding | 28 | 6.9E-04 | 31 | 1.8E-02 | 1.5E-04 | |
| Protein coding | 150 | 5.1 E-01 | 162 | 3.0E-05 | 1.9E-04 | |
| Protein coding | 12 | 6.3E-03 | 15 | 2.5E-03 | 1.9E-04 | |
| RNA gene | 29 | 1.8 E-01 | 28 | 1.1E-04 | 2.3E-04 | |
| Protein coding | 59 | 1.2E-03 | 67 | 2.3E-02 | 3.1E-04 | |
| Protein coding | 16 | 3.1E-04 | 14 | 9.1E-02 | 3.2E-04 | |
| Protein coding | 237 | 2.6E-02 | 271 | 1.2E-03 | 3.6E-04 | |
| RNA gene | 66 | 1.8 E-01 | 80 | 1.8E-04 | 3.6E-04 | |
| Protein coding | 323 | 1.5 E-01 | 406 | 2.2E-04 | 3.7E-04 | |
| Protein coding | 11 | 5.3E-03 | 11 | 6.7E-03 | 4.0E-04 | |
| Protein coding | 1,101 | 2.8E-02 | 1,396 | 1.3E-03 | 4.2E-04 | |
| Protein coding | 76 | 6.8E-04 | 84 | 5.5E-02 | 4.2E-04 | |
| Protein coding | 43 | 3.3 E-01 | 44 | 1.2E-04 | 4.4E-04 | |
| Protein coding | 79 | 8.1 E-01 | 91 | 5.2E-05 | 4.7E-04 | |
| RNA gene | 14 | 4.7E-04 | 12 | 9.5E-02 | 4.9E-04 | |
| Protein coding | 769 | 1.1E-04 | 784 | 5.7 E-01 | 6.4E-04 | |
| Protein coding | 22 | 1.7E-03 | 28 | 4.0E-02 | 7.2E-04 | |
| Protein coding | 21 | 9.5E-02 | 20 | 7.5E-04 | 7.5E-04 | |
| Protein coding | 181 | 8.6E-02 | 192 | 8.8E-04 | 8.0E-04 | |
| Protein coding | 66 | 2.2E-03 | 52 | 3.7E-02 | 8.4E-04 | |
| Protein coding | 25 | 3.8E-03 | 24 | 2.2E-02 | 8.7E-04 | |
| Protein coding | 114 | 1.7 E-01 | 112 | 5.4E-04 | 9.5E-04 | |
| Protein coding | 234 | 7.8E-04 | 251 | 1.2 E-01 | 9.9E-04 | |
a gene-level p-values from SKAT-C
Genes with suggestive evidence of association (p<1E-03) in either dataset and with a meta-analysis p-value that is lower than that obtained in either of the contributing analyses.
| Gene | Gene Name | Function | Gene-based Test | Dataset 1 p-value | Dataset 2 p-value | Meta-analysis p-value | Day | Day |
|---|---|---|---|---|---|---|---|---|
| ADP-ribosylation factor 5 | Protein-coding | SKAT-C | 3.84E-03 | 2.17E-02 | 8.67E-04 | 95.3 | 88.8 | |
| Chondrolectin | Protein-coding | SKAT-C | 2.80E-02 | 1.32E-03 | 4.15E-04 | 5.8 | 10.4 | |
| DDB1 and CUL4 associated factor 16 | Protein-coding | SKAT-C | 1.70E-03 | 3.98E-02 | 7.18E-04 | No data | No | |
| Eukaryotic translation initiation factor 4E | Protein-coding | SKAT-C | 2.55E-02 | 1.24E-03 | 3.60E-04 | 81.7 | 76.4 | |
| Family with sequence similarity 225 member 1 | RNA gene | SKAT-C | 3.86E-05 | 5.28E-02 | 2.87E-05 | -- | -- | |
| F-box only protein 47 | Protein-coding | SKAT-C | 2.17E-03 | 3.69E-02 | 8.36E-04 | 43.8 | 28.3 | |
| Glioma pathogenesis-related protein 1 | Protein-coding | eFBAT-MM | 2.85E-02 | 2.88E-03 | 8.55E-04 | 57.6 | 34.9 | |
| Karyopherin alpha-1 | Protein-coding | eFBAT-MM | 5.37E-02 | 1.50E-03 | 8.41E-04 | 82.4 | 77.8 | |
| Chromosome 1 open reading frame 177 | Protein-coding | SKAT-C | 2.52E-04 | 4.19E-02 | 1.32E-04 | 41.4 | 16.1 | |
| Long intergenic non-protein coding RNA 207 | RNA gene | eFBAT-MM | 2.01E-02 | 2.86E-03 | 6.17E-04 | -- | -- | |
| Uncharacterized LOC100287036 | Protein-coding | SKAT-C | 6.34E-03 | 2.49E-03 | 1.90E-04 | No | No | |
| Mitogen-activated protein kinase kinase kinase 3 | Protein-coding | eFBAT-MM | 1.56E-03 | 2.11E-02 | 3.74E-04 | 76.1 | 76.9 | |
| Muscleblind-like splicing regulator 1 | Protein-coding | eFBAT-MM | 3.29E-02 | 3.70E-04 | 1.50E-04 | 66.9 | 78.2 | |
| Non-SMC condensin 1 complex subunit G | Protein-coding | SKAT-C | 6.79E-04 | 5.54E-02 | 4.21E-04 | 83.5 | 70.5 | |
| NADH-ubiquinone oxidoreductase Fe-S protein 1 | Protein-coding | SKAT-C | 8.63E-02 | 8.79E-04 | 7.95E-04 | 94 | 97.1 | |
| Protein kinase D2 | Protein-coding | SKAT-C | 1.17E-03 | 2.31E-02 | 3.12E-04 | 48.1 | 66.9 | |
| Proteasome assembly chaperone 3 | Protein-coding | SKAT-C | 6.50E-02 | 2.55E-05 | 2.37E-05 | 79 | 54.1 | |
| Regulator of G protein signaling | Protein-coding | SKAT-C | 5.31E-03 | 6.65E-03 | 3.97E-04 | 56.4 | 38.9 | |
| Receeptor tyrosine kinase-like orphan receptor 1 | Protein-coding | eFBAT-MM | 2.42E-02 | 2.02E-03 | 5.33E-04 | 65.6 | 63.5 | |
| SH2 domain-containing protein D | Protein-coding | SKAT-C | 6.86E-04 | 1.82E-02 | 1.53E-04 | 65.5 | 36.2 | |
| Sialic acid-binding immunoglobulin-like lectin 11 | Protein-coding | eFBAT-MM | 4.09E-03 | 3.10E-03 | 1.56E-04 | 36.2 | No |
a When meta-analysis with p<1E-03 is FBAT, dataset 1 = CHOP Trios and dataset 2 is PCGC Trios. When meta-analysis with p<1E-03 is SKAT, dataset 1 = CHOP-CC1 patients with a CTD and controls and dataset 2 is CHOP-CC2 patients with a CTD and controls.
b Heart expression percentile rank [18].