| Literature DB >> 36187926 |
Huiyao Chen1, Xiang Chen2, Liyuan Hu2, Chang Ye2, Jiantao Zhang2, Guoqiang Cheng2, Lin Yang1,3, Yulan Lu4, Xinran Dong4, Wenhao Zhou1,2,4.
Abstract
Background: Acute respiratory distress syndrome (ARDS) could account for a considerable proportion of neonatal death, while the genetic etiology and pathophysiology of neonatal ARDS remain elusive. In this case-control study, 515 neonates were enrolled in the China Neonatal Genomes Project (CNGP, NCT03931707) from August 2016 to June 2021, including 196 ARDS and 319 non-ARDS matched by sex, gestational age, birth weight, perinatal asphyxia, pneumonia, sepsis, and necrotizing enterocolitis. Clinical exome sequencing was used to detect genetic variants. Collapsing analyses together with permutation tests were used to identify ARDS risk genes enriched for rare variants in ARDS samples. In silico functional interaction analysis and expression pattern studies at different stages of lung development were used to investigate the biological functions of the risk genes.Entities:
Keywords: Acute respiratory distress syndrome; Collapsing analyses; Neonates; Rare variant; Risk gene
Year: 2022 PMID: 36187926 PMCID: PMC9486038 DOI: 10.1016/j.csbj.2022.08.055
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1Study design. Based on a case-control study with matched clinical information, we searched for strong confidence risk genes of neonatal ARDS through clinical exome sequencing and rare variant collapsing analyses. The specific process is divided into five steps. First, 515 samples were sequenced by clinical exome sequencing after propensity score matching by clinical characteristics (step a). Variant calling and quality control were performed by GATK and an in-home pipeline (step b). Ensembl Variant Effect Predictor (VEP) and ANNOVAR were used to annotate all variants, including protein-truncating variants (PTVs) and missense or non-synonymous variants (MISs) (step c). Rare-variant collapsing analyses were applied, and 55 ARDS potential risk genes were identified (P value < 0.05) (step d). Next, the results were evaluated using a quantile–quantile (QQ) plot (step e). Finally, we focused on eight strong confidence risk genes.
Clinical information of both ARDS and non-ARDS groups.
| No. (%) with data | 374 | 645 | 196 | 319 | ||
| Sex, Male | 126 (33.7) | 381 (59.1) | <0.001 | 107 (54.6) | 186 (58.3) | 0.462 |
| Gestational age, mean (SD), weeks | 34.10 (2.73) | 36.53 (1.94) | <0.001 | 35.92 (1.94) | 36.03 (1.88) | 0.501 |
| Birth weight, mean (SD), kg | 2.40 (0.66) | 2.81 (0.67) | <0.001 | 2.71 (0.63) | 2.63 (0.74) | 0.220 |
| Perinatal asphyxia | 90 (24.1) | 68 (10.5) | <0.001 | 33 (16.8) | 42 (13.2) | 0.309 |
| Pneumonia | 76 (20.3) | 29 (4.5) | <0.001 | 24 (12.2) | 28 (8.8) | 0.264 |
| Sepsis | 135 (36.1) | 136 (21.1) | <0.001 | 69 (35.2) | 91 (28.5) | 0.136 |
| Necrotizing enterocolitis | 33 (8.8) | 24 (3.7) | 0.001 | 5 (2.6) | 8 (2.8) | 1.000 |
Fig. 2Evaluation of the gene sets and strong confidence risk genes (SCGs) for ARDS. (a) The significantly enriched gene sets in rare variants. “N Genes Hit by Variants” refer to the number of genes in the set and the number of genes that had at least one rare variant hit, respectively. CI, confidence interval; OR, odds ratio. (b) Differential expression of eight SCGs in the pseudoglandular and canalicular stages of human lung development. (c) Expression patterns of eight SCGs in the five canonical stages of mouse lung development (EMB, embryonic; PSG, pseudoglandular; CAN, canalicular; SAC, saccular; ALV, alveolar). (d) Differential expression of EDNRB in the nasal cytology brushings of pediatric ARDS and non-ARDS. Expression levels were normalized for processing. * P < 0.05; ** P < 0.01; *** P < 0.001.
The number of variants in eight SCGs and the results of rare-variant collapsing analyses.
| 0 | 14 | 0 | 2 | – | – | 5.21E-05*** | 0.107 | |
| 0 | 13 | 0 | 4 | – | – | 1.25E-03** | 1 | |
| 0 | 20 | 0 | 11 | – | – | 1.91E-03** | 1 | |
| 2 | 29 | 2 | 21 | 4.91E-01 | 1 | 2.11E-03** | 1 | |
| 0 | 13 | 2 | 5 | – | – | 2.97E-03** | 1 | |
| 1 | 16 | 1 | 8 | – | – | 3.49E-03** | 1 | |
| 0 | 10 | 0 | 3 | – | – | 4.66E-03** | 1 | |
| 0 | 7 | 1 | 1 | – | – | 5.83E-03** | 1 | |
SCGs, strong confidence risk genes; P value is the significance of the Fisher’s exact test (one-sided) and adjP is the adjusted p value with Bonferroni correction; * P < 0.05; ** P < 0.01; *** P < 0.001.
Fig. 3EDNRB targets genes and their functions. (a-b) The 101 positively regulated genes (a) and 50 negatively regulated genes (b) of EDNRB were predicted by in silico functional interaction analysis. The thickness of the line represents the Spearman correlation. (c) Correlation of EDNRB with SFTPB and SFTPC expression in two stages of human lung development. (d) The top ten items of Gene Ontology enrichment analyses of the 151 EDNRB target genes were sorted by P value. The rich factor is the number of genes belonging to this term out of 151 NDNRB target genes divided by the number of genes belonging to this term out of 16,648 genes included in the human lung expression profile data.