| Literature DB >> 35910219 |
Meihua Tan1,2, Xinrui Wang3,4, Hongjie Liu1, Xiaoyan Peng3, You Yang2, Haifei Yu3, Liangpu Xu5,6, Jia Li2,7, Hua Cao3,4.
Abstract
Congenital heart disease (CHD) is the most common congenital malformation in fetuses and neonates, which also represents a leading cause of mortality. Although significant progress has been made by emerging advanced technologies in genetic etiology diagnosis, the causative genetic mechanisms behind CHD remain poorly understood and more than half of CHD patients lack a genetic diagnosis. Unlike carefully designed large case-control cohorts by multicenter trials, we designed a reliable strategy to analyze case-only cohorts to utilize clinical samples sufficiently. Combined low-coverage whole-genome sequencing (WGS) and whole-exome sequencing (WES) were simultaneously conducted in a patient-only cohort for identifying genetic etiologies and exploring candidate, or potential causative CHD-related genes. A total of 121 sporadic CHD patients were recruited and 34.71% (95% CI, 26.80 to 43.56) was diagnosed with genetic etiologies by low-coverage WGS and WES. Chromosomal abnormalities and damaging variants of CHD-related genes could explain 24.79% (95% CI, 17.92 to 33.22) and 18.18% (95% CI, 12.26 to 26.06) of CHD patients, separately, and 8.26% (95% CI, 4.39 to 14.70) of them have simultaneously detected two types of variants. Deletion of chromosome 22q11.2 and pathogenic variants of the COL3A1 gene were the most common recurrent variants of chromosomal abnormalities and gene variants, respectively. By in-depth manual interpretation, we identified eight candidate CHD-causing genes. Based on rare disease-causing variants prediction and interaction analysis with definitive CHD association genes, we proposed 86 genes as potential CHD-related genes. Gene Ontology (GO) enrichment analysis of the 86 genes revealed regulation-related processes were significantly enriched and processes response to regulation of muscle adaptation might be one of the underlying molecular mechanisms of CHD. Our findings and results provide new insights into research strategies and underlying mechanisms of CHD.Entities:
Keywords: CHD-related genes; congenital heart disease; diagnostic yield; genetic etiology; whole-exome sequencing; whole-genome sequencing
Year: 2022 PMID: 35910219 PMCID: PMC9326225 DOI: 10.3389/fgene.2022.941364
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Primary cohort characteristics (n = 121 sporadic CHD).
| Variable | Outcome |
|---|---|
| Male, | 69 (57.02%) |
| Female, | 52 (42.98%) |
| Sample type, | |
| Amniotic fluid | 11 (9.09%) |
| Cord blood | 42 (34.71%) |
| Tissue | 68 (56.20%) |
| The latest ultrasonic testing time period, | |
| Fetus | 79 (65.29%) |
| Neonate | 42 (34.71%) |
| Clinical features, | |
| Isolated | 51 (42.15%) |
| Non-isolated | 68 (56.20%) |
| Lack of feature details | 2 (1.65%) |
Functional effects of SNP or InDels and variant classification of CNVs.
| Functional effects of SNP or InDels | Variant number (proportion) | Variant classification of CNVs | Variant number (proportion) |
|---|---|---|---|
| Frameshift | 606 (83.70%) | Benign | 36 (17.65%) |
| Nonsense | 80 (11.05%) | Likely Benign | 4 (1.96%) |
| Span | 7 (0.97%) | Variant of Uncertain Significance | 139 (68.14%) |
| Splice-3 | 16 (2.21%) | Likely Pathogenic | 1 (0.49%) |
| Splice-5 | 10 (1.38%) | Pathogenic | 24 (11.76%) |
| Stop-gain | 5 (0.69%) | Total | 204 (100%) |
| Total | 724 (100%) |
Definitive or candidate causative variants of WES and low coverage WGS.
| Gene | Sample N | Recurrence variants (samples N) | Chromosome abnormalities | Sample N |
|---|---|---|---|---|
|
| 4 | c.5delT (4), c.7_8insC (3) | del (22q11.21) | 6 |
|
| 3 | c.1548_1549delCT (3), c.1551_1552insAG (3) | T18 | 3 |
|
| 3 | NA | T21 | 3 |
|
| 3 | c.4883_4884insCC(2), c.4892_4893delGG (2) | XO | 3 |
|
| 2 | NsA | del (15q11.2) | 2 |
|
| 2 | c.1319delA (2), c.1325C > G (2) | Others | 15 |
|
| 2 | c.1296_1299delTCTT (2) | ||
|
| 2 | c.3730_3733delCAAA (2), c.4125delC (2), c.4127_4128insG (2), c.5104G > T (2) | ||
| Others | 17 | NA |
33 genes: ZEB2 (1), WDR35 (1), TTC37 (1), TRPM4 (1), TCAP (1), TAB2 (1), SYNE2(1), SON(1), SHANK3(1), SETBP1(1), RYR2 (1), RAD21 (1), PKP2(1), PEX12 (1), PEX1 (1), NSD1 (1), NONO(1), NODAL (1), NEK1 (1), MYLK (1), MIB1(1), MEGF8 (1), KMT2D (1), KMT2A (1), FREM2 (1), DSP(1), COL5A2 (1), CCDC114 (1), BMPR2 (1), BICC1(1), BBS9(1), BBS2(1), and ARID1A (1).
17 samples: 19S5253078R, 20S2200077, 20B10556839, 19B4152086, 19B4152090, 19B4152105, 19B4152110, 19D5253044, 19S5253038R, 19S5253106, 19S5347586, 19S5347592, 20B10556837 (1), 20B10556859, 20S10556845, 20S10556851, and 20S10556862.
16 chromosome abnormalities: T9 (1), T13 (1), T16 (1), del (1q42.13q44) (1), dup (4q12q12) (1), del (4q25q25) (1), del (7q11.23) (1), dup (7p12.3) (1), del (7q36.2q36.3) (1), del (11q22.3) (1), dup (12q21.31) (1), dup (14q21.3) (1), dup (17q25.3) (1), del (18q22.3q23) (1), del (18q21.33q23) (1), and dup (Xq28q28) (1).
15 samples: 20B10556837, 19B4152096, 19B4152103, 19B4152105, 19B4152086, 19D5253226, 19S5253093, 19S5253099, 20S10556850, 20S10556862, 19B5347543-1, 19S5253089R, 20S10556845, 20S2200077, and 20S2200078.
Candidate CHD-causing genes by manually in-depth interpretation of WES results.
| Gene symbol (OMIM ID) | Gene description | Associated phenotype (OMIM ID) | Inheritance | Genotype level | Variants | Sample ID (Zygosity) | ACMG rules | Pathogenicity |
|---|---|---|---|---|---|---|---|---|
|
| Encodes a nuclear outer membrane protein that binds cytoplasmic F-actin. This binding tethers the nucleus to the cytoskeleton and aids in the maintenance of the structural integrity of the nucleus | Emery-Dreifuss muscular dystrophy 5 (#612999) | AD | 3 | c.5961-2A > G | 19B4152105 (Het) | PVS1 PM2 PP3 | P |
|
| A muscle member of the immunoglobulin gene superfamily encodes myosin light chain kinase which is a calcium/calmodulin-dependent enzyme. This kinase phosphorylates myosin regulatory light chains to facilitate myosin interaction with actin filaments to produce contractile activity | Aortic aneurysm, familial thoracic 7 (#613780) | AD | 3 | c.589-2A > G | 19S5253038R (Het) | PVS1 PM2 PP3 | P |
|
| Encodes a member of the arm-repeat (armadillo) and plakophilin gene families. Plakophilin proteins contain numerous armadillo repeats, localize to cell desmosomes and nuclei, and participate in linking cadherins to intermediate filaments in the cytoskeleton. This gene product may regulate the signaling activity of beta-catenin | Arrhythmogenic right ventricular dysplasia 9 (#609040) | AD | 3 | c.1170+1G > A | 19S5253106 (Het) | PVS1 PM2 PP3 PP5 | P |
|
| Encodes a calcium-activated nonselective ion channel that mediates transport of monovalent cations across membranes, thereby depolarizing the membrane. The activity of the encoded protein increases with increasing intracellular calcium concentration, but this channel does not transport calcium | Progressive familial heart block, type IB (#604559) | AD | 3 | c.3504_3514delCGAACAGCGCC(p.Gln1170Serfs*10) | 20B10556837 (Het) | PVS1 PM2 | LP |
|
| Encodes a protein containing multiple ankyrin repeats and RING finger domains that functions as an E3 ubiquitin ligase. The encoded protein positively regulates Notch signaling by ubiquitinating the Notch receptors, thereby facilitating their endocytosis. This protein may also promote the ubiquitination and degradation of death-associated protein kinase 1 (DAPK1) | Left ventricular noncompaction 7 (#615092) | AD | 3 | c.813G > A (p.Trp271*) | 20B10556859 (Het) | PVS1 PM2 PP3 | P |
|
| Encodes a protein found in striated and cardiac muscle that binds to the titin Z1-Z2 domains and is a substrate of titin kinase, interactions thought to be critical to sarcomere assembly | Cardiomyopathy, hypertrophic, 25 (#607487) | AD | 3 | c.460_472delCGCTCCCTGTCCC(p.Arg154Alafs*30) | 20S10556845 (Het) | PVS1 PM2 PP3 | P |
|
| Encodes a protein that contains multiple simple repeats. The encoded protein binds RNA and promotes pre-mRNA splicing, particularly of transcripts with poor splice sites. The protein also recognizes a specific DNA sequence found in the human hepatitis B virus (HBV) and represses HBV core promoter activity | ZTTK syndrome (#617140) | AD | 3 | c.6754A > T (p.Lys2252*) | 20S10556862 (Het) | PVS1 PM2 PP3 | P |
|
| Encodes a protein that anchors intermediate filaments to desmosome plaques and forms an obligate component of functional desmosomes | Arrhythmogenic right ventricular dysplasia 8 (#607450) | AD | 3 | c.3384_3388delGGAAG (p.Glu1129Glnfs*3) | 19S5253078R (Het) | PVS1 PM2 PP3 | P |
| Dilated cardiomyopathy with woolly hair, keratoderma, and tooth agenesis (#615821) | AD | 3 |
AD, Autosomal dominant; LP, likely pathogenic; P, pathogenic.
Note: Genotype level 3 means the molecular basis for the disorder is known and at least a mutation has been found in the gene.
FIGURE 1Methodological workflows of genetic etiological diagnosis and CHD-related gene prediction. Each case had conducted low-coverage WGS and WES. The unique reads mean depth of whole-genome by low-coverage WGS was 0.37X. A total of 204 CNVs were identified in 95 samples, and the ACMG classifications of these 204 CNVs were demonstrated in a stacked bar chart. Six aneuploidy variants, seven LP|P CNVs, and eight candidates CHD pathogenic VUS CNVs were proposed as CHD causative CNVs. These 21 causative CNVs all explained 30 samples and some of the causative CNVs were diagnosed repeatedly in multiple samples (Table 3). The mean depth in the capture region of WES was 171.04X. Total 724 P|LP variants were identified in 268 genes involving 100 samples and their functional effect was shown in a stacked bar chart. Among these 724 P|LP variants there were 94 variants belonging to 33 definitive CHD genes and 8 variants belonging to candidate CHD-related genes. Combined with low-coverage WGS and WES, there were 42 samples identified with genetic etiology with a 34.71% diagnostic yield. We also explored the potential CHD association genes by KGGSeq software based on WES data. A total of 1,133 variants on 886 genes were predicted as rare disease-causing. 36 of the 886 were definitive CHD-related genes. The left 850 genes were further narrowed down based on interactions with curated CHD genes. Finally, 86 genes were proposed as CHD-related genes. Abbreviations: UR, unique reads; LP|P, likely pathogenic or pathogenic; CR, capture region.
FIGURE 2Landscape of distribution of genetic etiology and clinical features (N = 121). The heatmap on the left demonstrated the distribution of genetic etiologies detected in each sample. Each row represents a sample and the columns are five categories of genetic etiology. The correspondence colors of genetic etiology were summarized in the right legend. Four clinical characteristics including sample types, gender, latest ultrasonic testing, and clinical features of each sample were demonstrated in the right heatmaps separately. At the same time, we also displayed the distribution statistics of clinical characteristics on the top of the heatmap.
FIGURE 3Protein interactions and Genetic causes of CHD identified by WES. (A–C): protein interactions of gene FN1, IRS1, and CCT5. (D–E): Enriched biological processes of 886 genes (D) set and 86 genes (E) set.