| Literature DB >> 27670201 |
David M McKean1,2, Jason Homsy1,2,3, Hiroko Wakimoto1, Neil Patel4, Joshua Gorham1, Steven R DePalma1,5, James S Ware1,6,7, Samir Zaidi8, Wenji Ma9, Nihir Patel4, Richard P Lifton8,10, Wendy K Chung11, Richard Kim12, Yufeng Shen9,13, Martina Brueckner8, Elizabeth Goldmuntz14, Andrew J Sharp4,15, Christine E Seidman1,2,5, Bruce D Gelb4,15,16, J G Seidman1.
Abstract
Congenital heart disease (CHD), a prevalent birth defect occurring in 1% of newborns, likely results from aberrant expression of cardiac developmental genes. Mutations in a variety of cardiac transcription factors, developmental signalling molecules and molecules that modify chromatin cause at least 20% of disease, but most CHD remains unexplained. We employ RNAseq analyses to assess allele-specific expression (ASE) and biallelic loss-of-expression (LOE) in 172 tissue samples from 144 surgically repaired CHD subjects. Here we show that only 5% of known imprinted genes with paternal allele silencing are monoallelic versus 56% with paternal allele expression-this cardiac-specific phenomenon seems unrelated to CHD. Further, compared with control subjects, CHD subjects have a significant burden of both LOE genes and ASE events associated with altered gene expression. These studies identify FGFBP2, LBH, RBFOX2, SGSM1 and ZBTB16 as candidate CHD genes because of significantly altered transcriptional expression.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27670201 PMCID: PMC5052634 DOI: 10.1038/ncomms12824
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Identification of extreme ASE genes in subjects with CHD.
Shown are both alleles of a gene that differ by the SNP haploblocks ‘CC' (blue) and ‘GT' (red), as identified by WES, WGS or SNP-array genotyping. RNAseq analysis (read counts at heterozygous positions) reveals the expression of both alleles (biallelic RNA expression) or the disproportionate expression of one allele over another (ASE). RNAseq expression analyses (comparing each sample to the average of all other samples within a tissue group) identify relative loss and gain of expression. Variant analysis, in conjunction with RNAseq analysis, can further identify LOF mutations in the expressed allele (*).
Figure 2Extreme ASE genes preferentially identified in WGS subjects.
Shown in a are the number of genes with a minimum expression of 2 r.p.m. (grey bars), and the number of expressed genes that contain heterozygous SNPs (black bars) for CHD WGS (n=30 tissues) and CHD WES probands (n=142 tissues), GTEx donors (n=113 tissues) and mouse C57Bl6/Castaneus F1 hybrids (n=7 tissues). s.d. is indicated. b The distribution of extreme ASE events per subject by genotyping platform. Extreme ASE events were identified in >70% of WGS subjects (n=14). However, extreme ASE events were identified in only ∼20% of WES subjects (n=130) and ∼45% of GTEx donors (n=95).
ASE of imprinted genes in cardiovascular tissues.
| chr2:207139522-207179148 | 4/7 | 15/15 | 86 | C | P | 8.2 | ASE | |
| chr4:89617065-89619023 | 9/10 | 14/14 | 96 | C | P | 18.3 | NE | |
| chr6:3849631-3851551 | 4/5 | 22/23 | 93 | C | P | 37.9 | NE | |
| chr6:144261436-144385735 | 16/16 | 33/34 | 98 | C | P | 39.7 | ASE | |
| chr7:94285636-94299006 | 1/1 | 0/0 | 100 | C | P | 98.8 | ASE | |
| chr7:130126015-130146138 | 7/8 | 1/1 | 89 | C | P | 47.7 | ASE | |
| chr11:2016405-2170833chr11:2016405-2170833 | 12/124/4 | 68/691/1 | 99100 | NCC | MP | 32872677 | ASEASE | |
| chr14:101193201-101373305chr14:101193201-101373305chr14:101193201-101373305 | 17/175/51/1 | 39/402/80/0 | 9854100 | CNCC | PMM | 76.4285.33.2 | ASEASEASE | |
| chr15:23888695-25244225chr15:23888695-25244225chr15:23888695-25244225chr15:23888695-25244225 | 2/255/5560/600/0 | 0/0NS48/484/4 | 100100100100 | CCCC | PPPP | 3.963.868.999.1 | ASE | |
| chr19:57321444-57352094 | 14/15 | 14/17 | 88 | C | P | 103.2 | ASE | |
| chr20:36149606-36152090 | 1/2 | 0/0 | 50 | C | P | 21.1 | NS |
ASE, allele-specific expression; CHD, congenital heart disease; FHE, fetal heart expression (reads per million aligned reads); GTEx, Genotype-Tissue Expression Consortium; M, maternal; NE, not expressed; NS, no SNP; P, paternal; SNP, single-nucleotide polymorphism.
*Number subjects with silenced allele/number of informative subjects.
†Mouse ASE observed in pulmonary artery.
‡PEG10 and RTL1 are not expressed in postnatal cardiac tissues and are silenced in the only fetal subject in the study.
§Mouse ASE observed in skeletal muscle.
Extreme ASE genes with LOF variants in silenced allele or damaging variants in expressed allele.
*Reaction conditions:1/LiHMDS/2/[Pd(η3-C3H5)Cl]2/S-IPr·HCl=200/200/100/2.5/5; 0.1 M of ketone 1; T=30oC; B/L and dr was determined by 1H NMR, dr is the ratio of (±)-(syn,anti)-3/other diastereoisomers; Isolated yield. †T=50 oC. ‡Solvent=THF. §OBoc of 2 was replaced with OP(OEt)2. ||The yield was determined by 1H NMR.
Extreme ASE and biallelic LOE events with significantly altered gene expression.
| 1-05368 | DuctArt | 189.1 | 296±44 (15) | 0.64 | 1.6 × 10−2 | 1.4 × 10−3 | 0 | 1.8 × 10−5 | ||
| 1-01019 | RA | 60.3 | 133±51 (18) | 0.45 | 4.7 × 10−2 | 9.4 × 10−4 | 0 | 9.1 × 10−5 | ||
| 1-00384 | IVS | 65.9 | 108±12 (7) | 0.61 | 3.0 × 10−4 | 0 | 0 | 0 | ||
| 1-00713 | LV | 7.7 | 42±17 (9) | 0.18 | 4.4 × 10−2 | 0 | 0 | 9.1 × 10−5 | ||
| C417-01C417-01 | IVSLV | 169.6124.0 | 302±25 (7)224±46 (9) | 0.560.55 | <1.0 × 10−43.0 × 10−2 | 0 | 0 | 8.2 × 10−5 | ||
| CHD-1548CHD-1548CHD-1548 | LALVRA | 12.915.110.6 | 29±6.0 (10)25±3.0 (7)26±3.4 (9) | 0.440.600.41 | 6.1 × 10−38.0 × 10−4<1.0 × 10−4 | 0 | 2.8 × 10−4 | 2.6 × 10−3 | ||
| 1-01024 | RA | 5.6 | 1.0±1.7 (15) | 5.48 | 6.7 × 10−3 | 9.5 × 10−4 | 5.6 × 10−4 | 4.5 × 10−4 | ||
| 1-01984 | LA | 9.2 | 1.4±0.4 (3) | 6.86 | <1.0 × 10−4 | 9.5 × 10−4 | 5.6 × 10−4 | 4.5 × 10−4 | ||
| 1-02697 | RV | 26.9 | 4.2±3.5 (16) | 6.45 | <1.0 × 10−4 | 0 | 2.7 × 10−4 | 3.5 × 10−4 | ||
| 1-03051 | AO | 5.4 | 55±15 (6) | 0.10 | 8.0 × 10−4 | 0 | 0 | 0 | ||
| 1-03316 | AO | 1.2 | 22±5.8 (6) | 0.06 | 3.0 × 10−4 | 0 | 0 | 0 | ||
| 1-03948 | RV | 0.6 | 13±4.1 (20) | 0.05 | 1.9 × 10−3 | 0 | 0 | 0 | ||
| 1-00596 | RA | 2.6 | 30±8.7 (47) | 0.09 | 1.5 × 10−3 | 0 | 0 | 1.6 × 10−5 | ||
| 1-04119 | PA | 0.3 | 5.4±1.1 (10) | 0.06 | <1.0 × 10−4 | 0 | 0 | 0 | ||
| 1-02921 | RV | 0.0 | 5.5±1.3 (20) | 0 | <1.0 × 10−4 | 0 | 0 | 2.6 × 10−4 | ||
| 1-03948 | RV | 0.6 | 15±8.1 (20) | 0.04 | 4.0 × 10−2 | 0 | 0 | 1.6 × 10−5 | ||
| 1-02697 | PV | 0.5 | 14±3.4 (3) | 0.04 | 1.0 × 10−4 | 0 | 0 | 1.6 × 10−5 | ||
AF, allele frequency; AO, aorta; ASE, allele-specific expression; CHD, congenital heart disease; DuctArt, ductus arteriosus; ExAC, Exome Aggregation Consortium; IVS, interventricular septum; LA, left atrium; LOE, loss-of-expression; LOF, loss-of-function; LV, left ventricle; PA, pulmonary artery; PCGC, Pediatric Cardiac Genomics Consortium; PV, pulmonary valve; RA, right atrium; RV, right ventricle.
*Expression in reads per million aligned reads.
†P value calculated from z-score.
‡Includes de novo LOF mutations.
§Strong candidate CHD genes (identified in this study).
||Frequency of homozygous (ExAC) or combined homozygous and compound heterozygous (PCGC cases and controls) LOF mutations.
¶KMT2D de novo mutations identified in both 1-03948 (LOF) and 1-00596 (damaging-missense).
#LOE genes PHKG1 (1-03948, P>2.7 × 10−3) and FGFBP2 (1-02697, n=3) are not significant on their own, but occur in genes significant in other subjects.