| Literature DB >> 30532227 |
Hongjian Qi1,2, Lan Yu3, Xueya Zhou1,3, Julia Wynn3, Haoquan Zhao1,4, Yicheng Guo1, Na Zhu1,3, Alexander Kitaygorodsky1,4, Rebecca Hernan3, Gudrun Aspelund5, Foong-Yen Lim6, Timothy Crombleholme6, Robert Cusick7, Kenneth Azarow8, Melissa E Danko9, Dai Chung9, Brad W Warner10, George B Mychaliska11, Douglas Potoka12, Amy J Wagner13, Mahmoud ElFiky14, Jay M Wilson15,16, Debbie Nickerson17, Michael Bamshad17, Frances A High15,16,18, Mauro Longoni16,18, Patricia K Donahoe16,18, Wendy K Chung3,19,20, Yufeng Shen1,4,21.
Abstract
Congenital diaphragmatic hernia (CDH) is a severe birth defect that is often accompanied by other congenital anomalies. Previous exome sequencing studies for CDH have supported a role of de novo damaging variants but did not identify any recurrently mutated genes. To investigate further the genetics of CDH, we analyzed de novo coding variants in 362 proband-parent trios including 271 new trios reported in this study. We identified four unrelated individuals with damaging de novo variants in MYRF (P = 5.3x10(-8)), including one likely gene-disrupting (LGD) and three deleterious missense (D-mis) variants. Eight additional individuals with de novo LGD or missense variants were identified from our other genetic studies or from the literature. Common phenotypes of MYRF de novo variant carriers include CDH, congenital heart disease and genitourinary abnormalities, suggesting that it represents a novel syndrome. MYRF is a membrane associated transcriptional factor highly expressed in developing diaphragm and is depleted of LGD variants in the general population. All de novo missense variants aggregated in two functional protein domains. Analyzing the transcriptome of patient-derived diaphragm fibroblast cells suggest that disease associated variants abolish the transcription factor activity. Furthermore, we showed that the remaining genes with damaging variants in CDH significantly overlap with genes implicated in other developmental disorders. Gene expression patterns and patient phenotypes support pleiotropic effects of damaging variants in these genes on CDH and other developmental disorders. Finally, functional enrichment analysis implicates the disruption of regulation of gene expression, kinase activities, intra-cellular signaling, and cytoskeleton organization as pathogenic mechanisms in CDH.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30532227 PMCID: PMC6301721 DOI: 10.1371/journal.pgen.1007822
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Clinical summary of patients.
| Number | Percent | |
|---|---|---|
| Gender | ||
| Male | 212 | 58.6% |
| Female | 150 | 41.4% |
| CDH classification | ||
| Isolated | 208 | 57.5% |
| Complex | 149 | 41.2% |
| Unknown | 5 | 1.4% |
| Lesion side | ||
| Left | 270 | 74.6% |
| Right | 56 | 15.5% |
| Eventration/Morgagni/Agenesis | 11 | 3.0% |
| Unknown | 25 | 6.9% |
| CDH type | ||
| Bochdaleck | 294 | 81.2% |
| Other# | 22 | 6.1% |
| Unknown | 46 | 12.7% |
| DHREAMS cohort (n = 283): Time of recruitment | ||
| Neonatal | 229 | 80.9% |
| Fetal | 9 | 3.2% |
| Child | 45 | 15.9% |
| Discharge vital status (n = 283) | ||
| Survived | 241 | 85.2% |
| Deceased | 42 | 14.8% |
| Development assessment | ||
| At 2 years follow-up | 152 | 53.7% |
| At 5 years follow-up | 70 | 24.7% |
| No assessment at either 2 or 5 years | 128 | 45.2% |
| Additional anomalies in complex cases (n = 149) | ||
| Cardiovascular | 66 | 44.3% |
| Neurodevelopmental | 37 | 24.8% |
| Skeletal | 26 | 17.4% |
| Genitourinary | 14 | 9.4% |
| Gastrointestinal | 13 | 8.7% |
¶ Development assessment at 2 years follow-up include Vineland Adaptive Behavior Assessment (Vineland-II) and/or Bayley Scales of Toddler Development (Bayley-III); tests at 5 years follow-up include Vineland-II and/or Wechsler Preschool and Primary Scale of Intelligence (WPPSI).
§Neurodevelopmental conditions include congenital abnormalities in central nerves system, and developmental delay or neuropsychiatric disorders based on the follow-up developmental evaluations.
Burden of de novo coding variants.
| Gene Sets | Variant class | Number of variants | Baseline expectation | Fold enrichment | P-value |
|---|---|---|---|---|---|
| All Genes | Synonymous | 110 | 109.1 | 1.01 | 0.48 |
| Missense | 295 | 250.6 | 1.18 | 3.42E-03 | |
| D-mis | 138 | 93.7 | |||
| LGD | 57 | 32.9 | |||
| Constrained Genes | Synonymous | 34 | 38.8 | 0.88 | 0.80 |
| Missense | 112 | 88.1 | 1.27 | 7.91E-03 | |
| D-mis | 59 | 38.0 | |||
| LGD | 30 | 12.0 | |||
| Other Genes | Synonymous | 76 | 70.3 | 1.08 | 0.26 |
| Missense | 184 | 162.6 | 1.13 | 0.053 | |
| D-mis | 80 | 55.7 | 1.44 | 1.28E-03 | |
| LGD | 27 | 20.9 | 1.29 | 0.11 |
Constrained genes are defined by pLI metrics ≥0.5. LGD: likely gene disrupting, including frameshift, stop-gain, stop-loss, and variants at canonical splice sites; D-mis: predicted deleterious missense variants defined by CADD Phred score ≥25. The baseline expectations for different types of variants were calculated by the previous published method[23, 28]. The enrichment of observed number of variants was evaluated by a one-sided Poisson test.
Phenotype characteristics of patients with de novo coding variants in MYRF.
| Study | Sample ID | Genetic Sex | CADD Phred | Diaphragm defect | Cardiovascular defect | Urogenital defect | Other malformations | |
|---|---|---|---|---|---|---|---|---|
| Current study | 01–1008 | XY | c.235dupG: | - | L-CDH | ASD,VSD,ToF | Bilateral undescended testes | No |
| 01–0429 | XX | c.1303G>A:p.G435R | 32 | L-CDH | VSD | No internal genital organs, blind-ending vagina | Accessory spleen | |
| 04–0042 | XY | c.2036T>C:p.V679A | 25.9 | L-CDH | ASD,VSD | Unknown | Unknown (Deceased) | |
| 05–0050 | XY | c.2084G>A:p.R695H | 34 | CDH | HLHS | Ambiguous genitalia, undescended testes | Intellectual disability and motor delay at 2 years old | |
| 01–0033 | XX | c.1904-1G>A | 25 | R-CDH | Scimitar syndrome | Unknown | Unknown (Deceased) | |
| 01–0591 | XX | No | HLHS | Unknown | Unknown (Deceased) | |||
| CHU-11 | XY | c.1786C>T:p.Q596 | 37 | No | Dextrocardia | Swyer syndrome with female genitalia | Right pulmonary hypoplasia | |
| PCGC[ | 1–02264 | XY | c.1160T>C:p.F387S | 27.9 | No | AAH, CoA, HLHS | Ambiguous genitalia, hypospadias, undescended testis | No |
| 1–03160 | XY | c.1209G>C:p.Q403H | 27.6 | Right hemi-diaphragm eventration | Scimitar syndrome, AAH, ASD, BAV, HLHS, MS, VSD | Undescended testis | Lung hypoplasia | |
| 1–07403 | XY | c.1435C>G:p.L479V | 23.9 | No | BAV, CoA | Swyer syndrome with female genitalia | Short stature | |
| Pinz et al.[ | Case 1 | XY | c.2336+1G>A | 26.8 | No | Scimitar syndrome, cor triatriatum | Penoscrotal hypospadias, micropenis, unilateral cryptorchidism | Mild speech delay, pulmonary hypoplasia, tracheal anomalies |
| Case 2 | XY | c.2518C>T:p.R840 | 44 | R-CDH | Scimitar syndrome | Persistent urachus, Undescended testis | Cleft spleen, thymic involution, thyroid fibrosis | |
| Chitayat et al. [ | Fetus case | XY | c.1254_1255dupGA: p.T419RfsX14 | - | No | HLHS | Ambiguous external genitalia, right hepato-testicular fusion and left spleno-testicular fusion | Mild pulmonary hypoplasia, intestinal malrotation |
Abbreviations: L/R-CDH, (lef/right)-congenital diaphragmatic hernia; AAH, aortic arch hypoplasia; ASD, atrial septal defect; BAV, bicommissural aortic valve; CoA, coarctation of aorta; VSD, ventricular septal defects; ToF, Tetralogy of Fallot; MS, mitral stenosis; PCGC, Pediatric Cardiovascular Genetics Consortium.
* 01–0591 is the monozygotic twin of 01–0033.
# Genetic sex is based upon the chromosome complement.
Fig 1De novo coding variants in MYRF and their functional impact on transcriptome.
(a) Schematic diagram of the MYRF protein structure. DBD: DNA binding domain; ICA: Intramolecular chaperone auto-processing domain; Pro-rich: proline-rich region; TM: transmembrane helix. The position of DBD and ICA were based on the annotation from InterPro, and Pro-Rich and TM were from SwissProt. The coordinates are given with respect to the canonical isoform (1151 amino acids). The relative position of 12 de novo coding variants are displayed, including 6 discovered in the current study (shown in red), and five from published studies of congenital heart disease (CHD) [29, 30] (shown in blue). LGD variants are shown on top of the protein; and missense variants are on the other side. Shown below the protein structure is the density of missense variants in gnomAD (http://gnomad.broadinstitute.org/). A missense constraint region [37] is highlighted in red (observed/expected number of missense variants = 0.31) (b) Z-score for each gene is the standardized expression level across samples. Mean Z-scores of MYRF target genes in three MYRF variant carriers were shifted to the lower end as compared with other genes. (c) Gene-set enrichment analysis (GESA) was applied to genes ranked by the estimated fold change of expression level comparing MYRF variant carriers with other cases. The MYRF target genes tend to have lower ranks and majority of them were down-regulated in MYRF variant carriers (NES = -2.10, P<5.0E-4).
Fig 2Genetic overlap with other developmental disorders.
(a) Venn diagram shows 25 genes implicated in developmental disorders and congenital heart disease (DD and CHD genes, Materials and Methods) that are affected by damaging variants in CDH. (b) Enrichment of LGD and D-mis variants in DD and CHD genes. Enrichment was evaluated by comparing the observed number of de novo damaging variants in DD and CHD genes with the expected number of hits by randomly scattering the same number of variants to the exome while controlling for the number of variants, gene length, and sequence context (Materials and Methods). (c) Expression percentile ranks in the developing diaphragm [34], heart [70] and brain [70] are shown for all genes (green density), highlighting DD and CHD genes listed in (b). Smaller ranks correspond to higher expression levels.
Fig 3Burden of damaging de novo variants in different gene sets and sub classes of CDH.
(a) In complex cases, LGD variants were dramatically (4.6 fold) enriched in genes highly expressed (ranked in the top quartile) in mouse developing diaphragm (MDD) [34], and showed no enrichment in other quartiles. By comparison in isolated cases, LGD variants showed similar enrichment (~2 fold) across expression levels. (b) D-mis variants carried by complex cases also showed highest (2.4 fold) enrichment in the top quartile of MDD expression. Enrichment was evaluated by comparing observed number of variants to the baseline expectation[23, 70] using a one-sided Poisson test. Bars represent the 95% confidence intervals of estimated fold enrichment.
Fig 4A functional enrichment map of genes affected by de novo damaging variants in CDH.
(a) Enrichment results were visualized by a network of gene sets, where node size is proportional to the number of genes in each gene set and the thickness of edge represents the overlaps between gene sets. The significance of enrichment (p-value) is indicated by the color gradient. Functionally related gene sets are circled and manually labeled. Sub-clusters of network with similar functional annotations are grouped together as functional modules. (b) Mapping genes affected by damaging variants in CDH to the enriched functional groups shown in (a).