| Literature DB >> 34032881 |
Mulong Du1,2, Joe G N Garcia3, Jason D Christie4, Junyi Xin5, Guoshuai Cai6, Nuala J Meyer4, Zhaozhong Zhu1, Qianyu Yuan1, Zhengdong Zhang5, Li Su1, Sipeng Shen2,7, Xuesi Dong1,2,7, Hui Li8, John N Hutchinson8, Paula Tejera1, Xihong Lin8,9, Meilin Wang10,11, Feng Chen12,13, David C Christiani14,15.
Abstract
PURPOSE: Acute respiratory distress syndrome (ARDS) is accompanied by a dysfunctional immune-inflammatory response following lung injury, including during coronavirus disease 2019 (COVID-19). Limited causal biomarkers exist for ARDS development. We sought to identify novel genetic susceptibility targets for ARDS to focus further investigation on their biological mechanism and therapeutic potential.Entities:
Keywords: ARDS; Biomarkers; COVID-19 severity; Causal inference; Multi-omics
Mesh:
Year: 2021 PMID: 34032881 PMCID: PMC8144871 DOI: 10.1007/s00134-021-06410-5
Source DB: PubMed Journal: Intensive Care Med ISSN: 0342-4642 Impact factor: 41.787
Fig. 1Flowchart of study design. First, in the identification phase, ancestry-specific and trans-ancestry GWAS meta-analyses of ARDS were performed to compare two distinct ancestry populations to identify novel susceptibility genes for all-cause ARDS. Second, in the annotation phase, in silico analyses combined with transcriptome data were used to functionally annotate novel loci to decipher corresponding biological and genetic effects on ARDS pathophysiology. Last, in the application phase, a certain omics features and clinical observations were shared between all-cause ARDS and COVID-19 severity, suggesting clinical care of ARDS development, possibly as COVID-19 progresses. Flowchart was created online with BioRender.com
Association between rs7967111 and ARDS in European populations
| Variant | Chr | BP | Locus | Non-effect/effect allele | EAF | Multiple models for genetic association | OR (95% CI) | |
|---|---|---|---|---|---|---|---|---|
| rs7967111 | 12 | 12,601,953 | A/G | 0.444 | Model 1: rs7967111 genotypes | 1.30 (1.17–1.45) | 9.67 × 10–7 | |
| Model 2: model 1 + age + sex | 1.33 (1.19–1.48) | 2.57 × 10–7 | ||||||
| Model 3: model 2 + PCs | 1.35 (1.21–1.50) | 7.63 × 10–8 | ||||||
| Model 4: model 3 + cohort | 1.36 (1.22–1.52) | 3.58 × 10–8 | ||||||
| Model 5: model 4 + pneumonia | 1.38 (1.23–1.55) | 2.15 × 10–8 |
ARDS acute respiratory distress syndrome; Chr chromosome; BP base pair position in GRCh37/hg19; OR odds ratio; CI confidence interval; EAF effect allele frequency; PCs population ancestry principal components 1–3
OR, CI, and P-values were calculated using logistic regression model with adjustments for confounders as appropriate. The result of model 3 was equivalent to GWAS meta-analysis in Europeans
Fig. 2Gene expression patterns across genotypes and groups. A eQTL analysis for genetic effects of rs7967111 on BORCS5 (left) and DUSP16 (right) expression. We used 46 pairs of genotyping and RNA-Seq data from in-house ARDS blood and calculated P-values using linear regression analysis. B Dynamic gene expression patterns in preclinical lipopolysaccharide (LPS)-induced lung injury models. Gene expression was extracted from Gene Expression Omnibus datasets of GSE9314 in mice and GSE5883 in humans. P-values were calculated by ANOVA test. C Correlation matrix plot for pairwise similarity (Spearman correlation) among candidate genes and abundances of 22 immune cell types, estimated from the blood transcriptome (B) using CIBERSORTx
Fig. 3Omics transferability assessment of all-cause ARDS. A Differential polygenic risk score (PRS) between ARDS cases and controls. PRS was calculated via weighting effect size derived from three severe COVID-19 GWAS assigned to each ARDS case and control in Europeans. P-value was calculated via t-test. B,C Differential expression analysis for BORCS5 and DUSP16 in dendritic cells derived from two single-cell RNA-Seq datasets of upper respiratory tract samples [21] (B) and peripheral blood mononuclear cells [22] (C) in COVID-19 patients and healthy controls, respectively. Dots represent cells expressing candidate genes and are colored by the severity of COVID-19. Healthy samples from (B) dataset were removed because only one cell expressed candidate genes. P-values were calculated from a Wilcoxon test. The y-axis is on a log-10 scale to show gene expression, and “n” indicates the number of cells detected with candidate gene expression. The solid line of each plot indicates the median of gene expression for each group, and box edges mark lower and upper quartiles of gene expression
Fig. 4Results of Mendelian Randomization analysis for all-cause ARDS. a Schematic diagram for assumptions of Mendelian Randomization analysis. Assumption 1: genetic instruments selected should be robustly associated with the exposure, usually underlying P < 5 × 10–8 (P < 1 × 10–6 or P < 1 × 10–4 as suggestive significance level); Assumption 2: genetic variants should not be associated with potential confounders; and Assumption 3: genetic variants of an exposure should affect the outcome risk only through the risk factor, not via alternative pathways. b Scatter plots for putative causal associations between six diseases/traits and all-cause ARDS under evidence score of ≥ 2. P-values were calculated with IVW or MR-Egger regression
| Integration analyses of genome and transcriptome data reveal a novel functional locus possibly involved in the regulation of immune-inflammatory response in ARDS pathophysiology, and causal inference indicates several clinical interventions of ARDS development. Our findings inform further investigation of host–pathogen biology and therapeutic targets for ARDS, especially during the COVID-19 pandemic. |