| Literature DB >> 27058611 |
James R Priest1,2, Kazutoyo Osoegawa3, Nebil Mohammed4, Vivek Nanda5, Ramendra Kundu6, Kathleen Schultz4, Edward J Lammer4, Santhosh Girirajan7, Todd Scheetz8, Daryl Waggott6, Francois Haddad2,6, Sushma Reddy1,2, Daniel Bernstein1,2, Trudy Burns9, Jeffrey D Steimle10, Xinan H Yang10, Ivan P Moskowitz10, Matthew Hurles11, Richard P Lifton12,13, Debbie Nickerson14, Michael Bamshad14,15, Evan E Eichler13,14, Seema Mital16, Val Sheffield13,17, Thomas Quertermous2,6, Bruce D Gelb18, Michael Portman15, Euan A Ashley2,6.
Abstract
Congenital heart disease (CHD) has a complex genetic etiology, and recent studies suggest that high penetrance de novo mutations may account for only a small fraction of disease. In a multi-institutional cohort surveyed by exome sequencing, combining analysis of 987 individuals (discovery cohort of 59 affected trios and 59 control trios, and a replication cohort of 100 affected singletons and 533 unaffected singletons) we observe variation at novel and known loci related to a specific cardiac malformation the atrioventricular septal defect (AVSD). In a primary analysis, by combining developmental coexpression networks with inheritance modeling, we identify a de novo mutation in the DNA binding domain of NR1D2 (p.R175W). We show that p.R175W changes the transcriptional activity of Nr1d2 using an in vitro transactivation model in HUVEC cells. Finally, we demonstrate previously unrecognized cardiovascular malformations in the Nr1d2tm1-Dgen knockout mouse. In secondary analyses we map genetic variation to protein-interaction networks suggesting a role for two collagen genes in AVSD, which we corroborate by burden testing in a second replication cohort of 100 AVSDs and 533 controls (p = 8.37e-08). Finally, we apply a rare-disease inheritance model to identify variation in genes previously associated with CHD (ZFPM2, NSD1, NOTCH1, VCAN, and MYH6), cardiac malformations in mouse models (ADAM17, CHRD, IFT140, PTPRJ, RYR1 and ATE1), and hypomorphic alleles of genes causing syndromic CHD (EHMT1, SRCAP, BBS2, NOTCH2, and KMT2D) in 14 of 59 trios, greatly exceeding variation in control trios without CHD (p = 9.60e-06). In total, 32% of trios carried at least one putatively disease-associated variant across 19 loci,suggesting that inherited and de novo variation across a heterogeneous group of loci may contribute to disease risk.Entities:
Mesh:
Year: 2016 PMID: 27058611 PMCID: PMC4825975 DOI: 10.1371/journal.pgen.1005963
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Analytical approach for disease gene discovery in a cohort of AVSD trios.
Protein altering variants meeting allele frequency, quality, and read depth cutoffs from 59 trios were sorted with an inheritance model consistent with rare disease which combines de novo, compound heterozygous, and rare homozygous variants to collate variants into a list of genes [16]. Three analytical approaches were applied to the final dataset. Primary Analysis. De novo variants in the AVSD trios were cross-referenced with the genes in a module highly enriched for CHD and cardiac development obtained from unsupervised weighted-gene coexpression network analysis to identify a novel AVSD gene. Secondary Analyses. Variants from the AVSD trios were mapped onto a protein interaction network followed by burden testing in a replication cohort (details in Fig 3). In a final analysis employing inheritance modeling, de novo, compound heterozygous, and rare homozygous loci observed in the AVSD trios were compared with a predefined list of genes associated with human or mouse cardiac malformations. Statistical comparisons were performed in a control group of 59 control trios without CHD.
Fig 3An illustration of discovery of protein interaction networks for AVSD and validation by burden testing in separate a replication cohort.
Filtered SNPs and CNVs from the AVSD trio probands are mapped to a protein interaction network (represented by grey dots and black lines), and the network is pruned to yield subnetworks (green dots and black lines). The subnetworks represent variants in genes which have verified interactions at the protein level, often constituting a portion of a signaling pathway or enzymatic complex [22]. To validate the disease association of the individual subnetworks discovered in the trios, we performed burden testing for each subnetwork in a replication cohort of 100 AVSD cases originating largely from a separate study of AVSDs [10] along with 533 controls without congenital heart disease.
Fig 2Mutation at a highly conserved arginine residue in the DNA binding domain of NR1D2 is associated with AVSD.
(a) Multiple alignment of amino acid residues in the C-terminal aspect of the DNA binding domain of NR1D2 show strong conservation through evolution to the Drosophila paralog Eip75B. (b) A 3d representation of the crystal structure of NR1D1 complexed to DNA. The NR1D1 and NR1D2 DNA binding domains display 96% sequence identity (S3 Fig), and the altered arginine residue (black arrow) contacts the minor groove of DNA in the C-terminal portion of the DNA binding domain (RCSB 1A6Y) [82]. (c) The p.R175W mutation shows increased transcriptional activity in a cell culture assay (p = 0.0022, two-tailed t-test). A wild-type or mutant Nr1d2 construct was transfected in HUVEC cells along with a vector containing a synthetic response element driving a GFP reporter. Mean and standard deviation for each condition are displayed. (d) Two representative cardiac lesions in Nr1d2 -/- mouse hearts. An oblique coronal section of the heart of a spontaneously deceased P0 Nr1d2 -/- pup reveals a primum ASD and suggests a common AV-valve (white arrow) indicative of AVSD (star). A coronal section of an e17.5 Nr1d2 -/- heart shows an inlet VSD (plus sign) which is a type of VSD closely related to AVSDs in cardiac development [78,83]. Abbreviations: LV-left and RV-right ventricles, RA-right atrium, IVS- interventricular septum, VSD-ventricular septal defect, ASD-atrial septal defect.
Fig 4Protein interaction networks identify collagen genes putatively associated with atrioventricular septal defects.
(a) A network diagram displaying protein-protein interactions between genes. A greater number of networks (n = 86) and genes (n = 231) are detected by protein-interaction analysis in AVSD-trio subnetworks (green nodes) relative to the networks (n = 26) and genes (n = 60) detected in the Control-trio subnetworks (yellow nodes). A median p-value from a protein interaction network permutation procedure is reported for each group of subnetworks, which shows an enrichment of true protein-protein interactions in the AVSD-trio subnetworks (p = 0.01) that is not observed in the control trio subnetworks (p = 1). The black lines indicate a protein-protein interaction between two nodes in the subnetwork and the weight of the line corresponds to “heat” value output by the Hotnet2 algorithm. The number of protein interactions for each observed gene corresponds to the size of the node. The two collagen genes in the AVSD-trio subnetworks (COL2A1 and COL9A1) are labeled and the nodes highlighted in red. The networks from each group of trios are arranged in a circular layout. (b) A Venn diagram comparing genes/variants in the AVSD-trio subnetworks (green circle) and Control-trio subnetworks (yellow circle) to genes expressed during early mouse cardiac development (grey circle). The number of genes in each dataset and their overlap with other datasets is indicated and the p-value from a hypergeometric test is reported. A greater proportion of genes in the AVSD-trio subnetworks (60 of 231 genes, p = 9e-09) are found to be expressed in the early cardiac development dataset, compared to the control-trio subnetworks (10 of 60 genes, p = 0.34). (c) The subnetwork comprised of COL2A1 and COL9A1 are highlighted in a Manhattan plot of test statistics from the SKAT linear weight test for each of the 86 AVSD-trio subnetworks identifies an elevated burden of variation in a replication cohort of 100 AVSD subjects and 533 control subjects. The p-value is plotted for the chromosomal position of each gene within the subnetwork. A significance cutoff of 5.81e-04 was derived from the Bonferroni correction of 86 tests performed.
AVSD trio probands display de novo and inherited variation in genes related to human and mouse cardiac malformations.
| Gene | Proband | Inheritance | Amino Acid Changes | Prior Evidence |
|---|---|---|---|---|
| 3721 | compound heterozygous | p.L97R & p.S90L | Mouse Phenotype [ | |
| 1131 | compound heterozygous | p.E30V & p.D4500H | Mouse Phenotype [ | |
| 7952 | compound heterozygous | p.A159V & p.T362I | Mouse Phenotype [ | |
| 3731 | compound heterozygous | p.V372I & p.E173K | Mouse Phenotype [ | |
| 8522 | compound heterozygous | p.E1065K & p.V108M | Mouse Phenotype [ | |
| 28 | homozygous rare | p.L71F | Mouse Phenotype [ | |
| 2933 | compound heterozygous | p.V2285I & p.R509H | Clinical Association | |
| 155 | compound heterozygous | p.A933P & p.R361S | Clinical Association | |
| 2020 | compound heterozygous | p.E30G & p.S388G | Clinical Association | |
| 2023 | compound heterozygous | p.A936V & p.D208N | Clinical Association | |
| 54 | compound heterozygous | p.V81I & p.E199G | Clinical Association | |
| 54 | homozygous rare | p.E1295Q | Clinical Association | |
| 7952 | compound heterozygous | p.S377N & p.E1697G | Syndrome with CHD [ | |
| 2002 | compound heterozygous | p.R652H & p.R83Q | Syndrome with CHD [ | |
| 155 | compound heterozygous | p.L2408H & p.D1327G | Syndrome with CHD [ | |
| 135 | compound heterozygous | p.G688R & p.L62fs | Syndrome with CHD [ | |
| 3731 | compound heterozygous | p.A504V & p.K231E | Syndrome with CHD [ | |
| 2028 | p.R260Q | Syndrome with CHD [ |
Among 710 variants in 399 genes displaying de novo, homozygous, compound heterozygous inheritance in the AVSD-trio probands we observed 16 genes associated with human or mouse cardiac malformations. Additional variant annotations are found in S4 Table.
Discovered CNVs associated with AVSDs and other cardiac malformations.
| 22:21989140–23627391 | 1,638,251 | Duplication, paternally inherited | 28 genes including | Overlaps with 22q11.2 duplication region [ |
| 8:8175651–11853818 | 3,678,167 | Deletion, | 43 genes including | Canonical CHD gene [ |
| Y:119064–2562294 | 2,443,230 | Deletion | 13 genes, pseudoautosomal region | Previously associated with conotruncal CHD [ |
| 1:145645983–145763756 | 117,773 | Duplication | 5 genes including | 1q21.1 risk locus [ |
| 4:5692835–5800635 | 107,800 | Deletion | Causal gene in CHD associated Ellis-van-Creveld syndrome | |
| 8:10755273–11929256 | 1,173,983 | Deletion | 14 genes including | Canonical CHD gene [ |
Reported CNVs were larger than 100,000 bp, and detected by at least two of three independent methodologies (arrayCGH, consensus read depth based analysis; Conifer & XHMM, or a TaqMan CNV assay) and overlapped known CHD loci and were absent from controls. Coordinates refer to the hg19 build of the human genome reference.