| Literature DB >> 32128068 |
Xin Shi1, Li Zhang2, Kai Bai1, Huilin Xie1, Tieliu Shi3, Ruilin Zhang4, Qihua Fu5, Sun Chen1, Yanan Lu1, Yu Yu1,6, Kun Sun1.
Abstract
Pulmonary atresia (PA) is a rare congenital heart defect (CHD) with complex manifestations and a high mortality rate. Since the genetic determinants in the pathogenesis of PA remain elusive, a thorough identification of the genetic factors through whole exome sequencing (WES) will provide novel insights into underlying mechanisms of PA. We performed WES data from PA/VSD (n = 60), PA/IVS (n = 20), TOF/PA (n = 20) and 100 healthy controls. Rare variants and novel genes were identified using variant-based association and gene-based burden analysis. Then we explored the expression pattern of our candidate genes in endothelium cell lines, pulmonary artery tissues, and embryonic hearts. 56 rare damage variants of 7 novel candidate genes (DNAH10, DST, FAT1, HMCN1, HNRNPC, TEP1, and TYK2) were certified to have function in PA pathogenesis for the first time. In our research, the genetic pattern among PA/VSD, PA/IVS and TOF/PA were different to some degree. Taken together, our findings contribute new insights into the molecular basis of this rare congenital birth defect.Entities:
Keywords: ACMG, American College of Medical Genetics; CHD, congenital heart defect; CTD, Conotruncal defect; Congenital heart defect; ExAC, Exome Aggregation Consortium; FDR, False discovery rates; GEO, Gene Expression Omnibus; GSEA, gene set enrichment analysis; Gene mutations; HPAECs, Human Pulmonary Artery Endothelial Cells; LOF, loss-of-function; MAF, minor allele frequency; PA, Pulmonary atresia; PA/IVS, Pulmonary atresia with intact ventricular septum; PA/VSD, Pulmonary atresia with ventricular septal defect; PPI, protein–protein interactions; Pulmonary atresia; RT-qPCR, Reverse Transcription Quantitative PCR; RV, right ventricle; Rare variants; SNP, single nucleotide polymorphism; STRING, Search Tool for the Retrieval of Interacting Genes; TOF, tetralogy of Fallot; WES, whole exome sequencing; Whole-exome sequencing; gnomAD, Genome Aggregation Database
Year: 2020 PMID: 32128068 PMCID: PMC7044470 DOI: 10.1016/j.csbj.2020.01.011
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1The schematic diagram of systematic data analysis of the PA-associated genes/variants. An analytical strategy workflow of the different steps taken during whole-exome sequencing analysis with gene expression detection is shown. After variant calling and annotation, variants were then filtered by variant-based association analysis, gene-based burden analysis. Candidate genes were collected by gene expression analysis and network analysis. WES, whole-exome sequencing; PA/VSD, pulmonary atresia with ventricular septal defect; PA/IVS, pulmonary atresia with intact ventricular septum; TOF, tetralogy of Fallot; SNP, single nucleotide polymorphism; MAF, minor allele frequency; FDR, false discovery rate.
Summary of demographic and clinical information of 100 study patients.
| Patient characteristics | PA/VSD | PA/IVS | TOF |
|---|---|---|---|
| Mean age at diagnosis (years) | 4.4 ± 3.55 | 0.84 ± 0.43 | 1.66 ± 2.6 |
| Female, n, (%) | 21(35) | 10(50) | 8(40) |
| Male, | 39(65) | 10(50) | 12(60) |
| Total, n | 60 | 20 | 20 |
| Female-to-male ratio | 1:1.9 | 1:1 | 1:1.5 |
| Associated cardiac lesion, n, (%) | |||
| Left superior vena cava (LSVC) | 1(5) | 0 | 2(10) |
| Right aortic arch (RAA) | 1(1.7) | 2(10) | 3(15) |
| Total anomalous pulmonary vein connection (TAPVC) | 0 | 1(5) | 0 |
| Aortic stenosis (AS) | 0 | 1(5) | 1(5) |
Fig. 2The comparisons of the rare damaging variants between the PA and control groups. The number of variants in each variant classification, variant type, and SNV class are presented in (A), (B), and (C), respectively. (D) PA-associated variants by Fisher’s exact test. Each node represents a variant, and the y-axis represent the statistical significance level. The threshold of the P-value is 0.0001, which is represented by the red line. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3The PA-associated genes identified by gene-based burden analysis. (A) The word-cloud of the PA-associated genes (P < 0.05 & mutated in more than 3 samples). (B) The landscape of the PA-associated genes in cases and controls. The bars on the top and right of the panel represent the number of variant types for each sample and each gene, respectively. The grey cells indicate that no variants were detected in the corresponding samples. (C) The corrplot of 20 genes were identified by gene-based burden analysis, where *P < 0.05 and *P < 0.1. (D) The protein–protein interaction (PPI) network of known (green nodes) and novel (blue nodes) PA-associated genes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4The expression levels of the PA-associated genes in heart. The expression levels of novel PA-associated genes were investigated in cardiomyocytes of GSE67492 (A), GSE30428 (B), and human embryonic hearts (C). RT-qPCR was conducted to detect the expression levels of these novel PA-associated genes in HPAECs (D) and human pulmonary artery (E).
Rare variant of 7 candidate genes associated with PA.
| Chromosome | Position | Base change | Gene | AAChange | TxChange | ExAC | 1000G | gnomAD | SIFT | Polyphen2 | Mutation |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chr12 | 124270362 | A > G | DNAH10 | p.M373V | c.A1117G | – | – | – | D | D | D |
| Chr12 | 124311277 | A > G | DNAH10 | p.N1290S | c.A3869G | 6.63E −05 | – | – | D | P | D |
| Chr12 | 124317860 | A > G | DNAH10 | p.E1464G | c.A4391G | – | – | – | D | D | D |
| Chr12 | 124333343 | C > T | DNAH10 | p.R1888X | c.C5662T | 2.88E−05 | – | 3.23E−05 | – | – | A |
| Chr12 | 124341713 | C > - | DNAH10 | p.C2065fs | c.6195delC | 9.47E−06 | – | – | – | – | – |
| Chr12 | 124349245 | C > T | DNAH10 | p.R2220W | c.C6658T | 1.91E−05 | – | 6.46E−05 | D | D | D |
| Chr12 | 124399076 | C > T | DNAH10 | p.T3400M | c.C10199T | 0.0004 | 0.01 | 0.0005 | D | D | D |
| Chr12 | 124409730 | T > C | DNAH10 | p.V3849A | c.T11546C | – | – | – | D | D | D |
| Chr12 | 124409745 | G > A | DNAH10 | p.G3854E | c.G11561A | 0.0005 | 0.01 | 0.0006 | D | D | D |
| Chr6 | 56357793 | T > C | DST | p.N4711S | c.A14132G | 1.07E−05 | – | 3.23E−05 | T | D | – |
| Chr6 | 56362244 | T > C | DST | p.H4600R | c.A13799G | 2.85E−05 | – | – | T | D | – |
| Chr6 | 56376149 | T > C | DST | p.N4090S | c.A12269G | – | – | – | T | D | – |
| Chr6 | 56391232 | C > T | DST | p.R3900H | c.G11699A | 1.90E−05 | – | 3.23E−05 | D | D | – |
| Chr6 | 56417016 | G > A | DST | p.T3406M | c.C10217T | 2.87E−05 | – | 3.23E−05 | T | D | – |
| Chr6 | 56434698 | A > G | DST | p.S2493P | c.T7477C | – | – | – | T | D | – |
| Chr6 | 56475225 | C > A | DST | p.K1728N | c.G5184T | – | – | – | D | D | – |
| Chr6 | 56494178 | T > C | DST | p.K1416E | c.A4246G | – | – | – | D | D | – |
| Chr6 | 56494186 | T > A | DST | p.E1413V | c.A4238T | – | – | – | D | D | – |
| Chr6 | 56499727 | A > G | DST | p.Y1038H | c.T3112C | – | – | – | D | D | – |
| Chr6 | 56504561 | G > A | DST | p.T844I | c.C2531T | 0.0004 | – | 0.0004 | T | D | – |
| Chr6 | 56504843 | G > A | DST | p.R802C | c.C2404T | 4.74E−05 | – | 6.46E−05 | D | D | – |
| Chr6 | 56569155 | C > T | DST | p.V234M | c.G700A | – | – | 3.23E−05 | D | D | – |
| Chr6 | 56819282 | A > C | DST | p.F35C | c.T104G | 6.76E−05 | – | 6.46E−05 | D | D | – |
| Chr4 | 187517784 | C > T | FAT1 | p.V4304M | c.G12910A | 9.72E−06 | – | – | D | D | D |
| Chr4 | 187525089 | T > A | FAT1 | p.I3531F | c.A10591T | – | – | – | D | P | D |
| Chr4 | 187527348 | G > A | FAT1 | p.T3409M | c.C10226T | 0.0001 | 0.0017 | 3.24E−05 | T | D | D |
| Chr4 | 187535411 | T > C | FAT1 | p.N3055D | c.A9163G | 0.0001 | – | 9.68E−05 | D | D | D |
| Chr4 | 187539275 | A > G | FAT1 | p.L2822P | c.T8465C | 0.0007 | 0.01 | 0.0003 | D | P | D |
| Chr4 | 187539671 | T > C | FAT1 | p.Y2690C | c.A8069G | 4.74E−05 | – | 6.46E−05 | T | D | D |
| Chr4 | 187540271 | G > A | FAT1 | p.P2490L | c.C7469T | – | – | – | D | D | D |
| Chr4 | 187542491 | G > C | FAT1 | p.T1750R | c.C5249G | – | – | – | D | D | D |
| Chr4 | 187549895 | G > T | FAT1 | p.T1449K | c.C4346A | – | – | – | D | D | D |
| Chr4 | 187584696 | C > T | FAT1 | p.D1113N | c.G3337A | 0.0002 | 0.01 | 0.0002 | D | D | D |
| Chr1 | 186008985 | G > C | HMCN1 | p.V2052L | c.G6154C | – | – | – | T | D | D |
| Chr1 | 186038870 | C > T | HMCN1 | p.A2652V | c.C7955T | 0.0001 | – | 6.46E−05 | T | D | D |
| Chr1 | 186056393 | G > A | HMCN1 | p.G3031S | c.G9091A | 4.72E−05 | 0.0017 | 9.71E−05 | D | D | D |
| Chr1 | 186060046 | T > G | HMCN1 | p.I3295S | c.T9884G | 9.51E−06 | – | 9.72E−05 | D | P | D |
| Chr1 | 186064443 | A > G | HMCN1 | p.M3455V | c.A10363G | 1.89E−05 | – | 3.23E−05 | D | P | D |
| Chr1 | 186106797 | C > T | HMCN1 | p.P4584S | c.C13750T | 2.83E−05 | – | 3.23E−05 | D | D | D |
| Chr1 | 186147652 | G > T | HMCN1 | p.D5350Y | c.G16048T | – | – | – | D | D | D |
| Chr1 | 186157125 | C > T | HMCN1 | p.R5509W | c.C16525T | 6.60E−05 | – | – | D | D | D |
| Chr14 | 21679599 | T > C | HNRNPC | p.D268G | c.A803G | 0.0025 | – | 0.0022 | T | D | D |
| Chr14 | 21679600 | C > A | HNRNPC | p.D268Y | c.G802T | 0.0025 | – | 0.0022 | T | D | D |
| Chr14 | 20836703 | A > G | TEP1 | p.C2593R | c.T7777C | – | – | – | D | D | D |
| Chr14 | 20846278 | G > A | TEP1 | p.R1876W | c.C5626T | 4.71E−05 | – | 9.70E−05 | D | D | N |
| Chr14 | 20850789 | T > C | TEP1 | p.Y1378C | c.A4133G | 0.0002 | 0.0017 | 9.71E−05 | D | D | N |
| Chr14 | 20852074 | A > T | TEP1 | p.S1180T | c.T3538A | 2.24E−05 | – | – | T | D | D |
| Chr14 | 20854615 | C > T | TEP1 | p.R951H | c.G2852A | 2.94E−05 | – | 3.23E−05 | T | D | N |
| Chr14 | 20854642 | C > T | TEP1 | p.R942H | c.G2825A | 9.59E−06 | – | 3.23E−05 | D | D | D |
| Chr19 | 10468778 | G > A | TYK2 | p.R738W | c.C2212T | 7.42E−05 | – | – | D | D | D |
| Chr19 | 10469928 | G > A | TYK2 | p.R700W | c.C2098T | 3.98E−05 | – | 3.23E−05 | D | D | D |
| Chr19 | 10472598 | C > T | TYK2 | p.V603M | c.G1807A | 0.0003 | – | 0.0002 | D | D | N |
| Chr19 | 10473102 | G > A | TYK2 | p.R503X | c.C1507T | 1.22E−05 | – | – | – | – | A |
| Chr19 | 10476375 | C > T | TYK2 | p.V277M | c.G829A | 5.98E−05 | – | 3.23E−05 | D | D | N |
| Chr19 | 10477205 | C > T | TYK2 | p.E173K | c.G517A | 5.65E−05 | – | – | D | D | D |
| Chr19 | 10489024 | T > G | TYK2 | p.Q20P | c.A59C | 5.70E−05 | – | 3.24E−05 | T | P | D |
Fig. 5Mutational spectrum of five known PA genes. (A) The distribution of the five known PA genes, NOTCH1, DOCK6, FANCD2, FGD5, and FLT4, in the PA cases. (B) The specific amino acid sites of variants on the encoded proteins or protein domains.
Fig. 6PA-subtype associated genes/variants and variant types. (A) The PA-subtype-specific genes that only present in one of the three subtypes, and absent in controls. (B) The PPI subnetwork constructed by mapping the known PA genes and the PA-subtype associated genes. (C) The comparisons between the three PA-subtypes about the number of total variants, nonsynonymous SNVs, and LOF variants.