| Literature DB >> 33604698 |
Ana I Hernández Cordero1, Xuan Li2, Stephen Milne2,3,4, Chen Xi Yang2, Yohan Bossé5, Philippe Joubert5, Wim Timens6, Maarten van den Berge7, David Nickle8,9, Ke Hao10, Don D Sin2,3.
Abstract
SARS-CoV-2 is responsible for the coronavirus disease 2019 (COVID-19) and the current health crisis. Despite intensive research efforts, the genes and pathways that contribute to COVID-19 remain poorly understood. We, therefore, used an integrative genomics (IG) approach to identify candidate genes responsible for COVID-19 and its severity. We used Bayesian colocalization (COLOC) and summary-based Mendelian randomization to combine gene expression quantitative trait loci (eQTLs) from the Lung eQTL (n = 1,038) and eQTLGen (n = 31,784) studies with published COVID-19 genome-wide association study (GWAS) data from the COVID-19 Host Genetics Initiative. Additionally, we used COLOC to integrate plasma protein quantitative trait loci (pQTL) from the INTERVAL study (n = 3,301) with COVID-19 loci. Finally, we determined any causal associations between plasma proteins and COVID-19 using multi-variable two-sample Mendelian randomization (MR). The expression of 18 genes in lung and/or blood co-localized with COVID-19 loci. Of these, 12 genes were in suggestive loci (PGWAS < 5 × 10-05). LZTFL1, SLC6A20, ABO, IL10RB and IFNAR2 and OAS1 had been previously associated with a heightened risk of COVID-19 (PGWAS < 5 × 10-08). We identified a causal association between OAS1 and COVID-19 GWAS. Plasma ABO protein, which is associated with blood type in humans, demonstrated a significant causal relationship with COVID-19 in the MR analysis; increased plasma levels were associated with an increased risk of COVID-19 and, in particular, severe COVID-19. In summary, our study identified genes associated with COVID-19 that may be prioritized for future investigations. Importantly, this is the first study to demonstrate a causal association between plasma ABO protein and COVID-19.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33604698 PMCID: PMC7892327 DOI: 10.1007/s00439-021-02264-5
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 5.881
Fig. 1Study overview. The diagram summarizes the genomics datasets and analytic pipeline of the study. First, publicly available omics datasets were obtained, which were later processed using integrative genomics (IG) methods (Bayesian Colocalization and Summary-based Mendelian Randomization) to identify potential candidate genes for COVID-19 phenotypes. Lastly, using a Bayesian Colocalization and Mendelian Randomization approaches, we explored the causal association between the plasma protein levels of the most promising candidate gene and the risk of COVID-19
Fig. 2COVID-19 genomics and gene expression integration. a Colocalization of COVID-19 (hospitalization vs population) with gene expression in lung and blood tissues, respectively. The circles represent the probability (y axis) that a gene colocalizes (PP) with COVID-19 plotted against its chromosomal position (x axis). The red dashed horizontal line represents the threshold of significance (PP > 0.80). Red and green circles highlight the genes within previously identified COVID-19 loci, and those within suggestive COVID-19 loci (PGWAS < 5 × 10–05), respectively. b Results from the Summary Based Mendelian Randomization are displayed in this mirror Manhattan plot. The circles represent the association between susceptibility to COVID-19 (diagnosis vs population) and the gene expression (lung and blood) multiplied by the direction of the effect (y axis) plotted against the genes chromosomal position (x axis). The dotted horizontal lines (red) represent the threshold of significance (P < 0.001). SMR plot only shows the results that passed the heterogeneity test (see methods). Labels represent those genes that were also co-localized. c Regional plots for severe COVID-19 (COVID-19 hospitalization vs population) and lung tissue cis-eQTL at the ABO (left panel) and SLC22A5 (right panel) loci. Severe GWAS signals (top of the panel c) colocalize with the cis-QTL region (bottom of the panel c) at ABO (chromosome 9) and SLC22A5 (chromosome 5). The circles represent the −log10 association P values (y axis) of SNPs plotted against their chromosomal position (x axis). The co-localized SNP is shown inside each plot and PPH4 is at the left top corner of the lung eQTL regional plots. Pairwise linkage disequilibrium (r2, calculated from the 1000 Genomes European population) in each region is colored with respect to the lead GWAS SNP within each region
Fig. 3Bayesian Colocalization (a) and Mendelian Randomization (MR) (b) of ABO plasma protein and COVID-19. a Regional plot describing severe COVID-19 GWAS (COVID-19 hospitalization vs population) (top of a) and plasma protein pQTL (bottom of a) at the ABO locus. The circles represent the −log10 association P values (y axis) of SNPs plotted against their chromosomal position (x axis). The co-localized SNP is shown inside each plot and PPH4 is at the left top corner of the plasma protein pQTL regional plot. Pairwise linkage disequilibrium (r2, calculated from the 1000 Genomes European population) in each region is colored with respect to the lead GWAS SNP within each region. b Inverse variance weighting (IVW) MR (IVW-MR) of ABO plasma proteins on risk of severe COVID-19 (COVID-19 hospitalization vs population). The x axis represents the SNP effect on the plasma protein levels, and the y axis the SNP effect on severe COVID-19. The variants used for the MR are shown inside the plot with error bars that represent the 95% confidence intervals. The slope of the solid red line is the instrumental variables regression estimate of the effect of the protein on severe COVID-19, with dashed red lines representing the 95% confident interval. IVW-MR P value and estimate are shown in the top left corner