| Literature DB >> 32637948 |
Maik Pietzner1, Eleanor Wheeler1, Julia Carrasco-Zanini1, Johannes Raffler2, Nicola D Kerrison1, Erin Oerton1, Victoria P W Auyeung1, Jian'an Luan1, Chris Finan3,4, Juan P Casas5,6, Rachel Ostroff7, Steve A Williams7, Gabi Kastenmüller2, Markus Ralser8,9, Eric R Gamazon1,10, Nicholas J Wareham1,11, Aroon D Hingorani3,4,12, Claudia Langenberg1,8,11.
Abstract
Strategies to develop therapeutics for SARS-CoV-2 infection may be informed by experimental identification of viral-host protein interactions in cellular assays and measurement of host response proteins in COVID-19 patients. Identification of genetic variants that influence the level or activity of these proteins in the host could enable rapid 'in silico' assessment in human genetic studies of their causal relevance as molecular targets for new or repurposed drugs to treat COVID-19. We integrated large-scale genomic and aptamer-based plasma proteomic data from 10,708 individuals to characterize the genetic architecture of 179 host proteins reported to interact with SARS-CoV-2 proteins or to participate in the host response to COVID-19. We identified 220 host DNA sequence variants acting in cis (MAF 0.01-49.9%) and explaining 0.3-70.9% of the variance of 97 of these proteins, including 45 with no previously known protein quantitative trait loci (pQTL) and 38 encoding current drug targets. Systematic characterization of pQTLs across the phenome identified protein-drug-disease links, evidence that putative viral interaction partners such as MARK3 affect immune response, and establish the first link between a recently reported variant for respiratory failure of COVID-19 patients at the ABO locus and hypercoagulation, i.e. maladaptive host response. Our results accelerate the evaluation and prioritization of new drug development programmes and repurposing of trials to prevent, treat or reduce adverse outcomes. Rapid sharing and dynamic and detailed interrogation of results is facilitated through an interactive webserver ( https://omicscience.org/apps/covidpgwas/ ).Entities:
Year: 2020 PMID: 32637948 PMCID: PMC7337378 DOI: 10.1101/2020.07.01.182709
Source DB: PubMed Journal: bioRxiv
Figure 1Flowchart of the identification of candidate proteins and coverage by the SomaScan v4 platform within the Fenland cohort. More details for each protein targeted are given in Supplemental Table S1.
Figure 2Manhattan plot of cis-associations statistics (encoding gene ±500kb) for 179 proteins. The most significant regional sentinel protein quantitative trait loci (pQTL) acting in cis are annotated by larger dots for 104 unique protein targets (dashed line; p<5×10−8). Starred genes indicate those targeted by multiple aptamers (n=9 genes).
Figure 3Genetic correlation matrix of 86 unique proteins targeted by 93 aptamers with reliable heritability estimates (see Methods). Aptamers were clustered based on absolute genetic correlations to take activation as well repression into account and protein encoding genes were used as labels. The column on the far left indicates relevance to SARS-CoV-2 infection. Strong correlations (|r|>0.5) are indicated by black frames.
Figure 4Stacked bar chart showing the results from variance decomposition of plasma abundances of 106 aptamers targeting candidate proteins. For each candidate protein a model was fitted to decompose the variance in plasma levels including all 16 factors noted in the legend. cis/trans-GRS = weighted genetic risk score based on all single nucleotide polymorphisms associated with the aptamer of interest acting in cis and trans, respectively. BMI (body mass index), WHR (waist-to-hip ratio), HDL (high-density lipoprotein), LDL (low-density lipoprotein), eGFR (estimated glomerular filtration rate), ALT (alanine amino transaminase), BP (blood pressure)
Figure 5Results of predicted gene expression in each of five tissues and plasma abundances of 102 aptamers with at least one cis-pQTL on one of the autosomes using PrediXcan. Each panel displays results for a tissue. Each column contains results across successful gene expression models for the association with the aptamer listed on the x-axis. Red indicates nominally significant (p<0.05) positive z-scores (y-axis) and blue nominally significant inverse z-scores for associated aptamers. Protein encoding genes are highlighted by larger black circles. Orange background indicates all examples of significant associations between the protein encoding gene and protein abundance in plasma regardless if this was the most significant one. Top genes were annotated if those differed from the protein encoding gene.
Figure 6Circos plot summarizing genome-wide significant associations between 74 cis-pQTLs and 239 traits[31] in the inner ring and results from statistical colocalisation in the outer ring. The dashed line in the outer ring indicates a posterior probability of 75% of shared genetic signal between the protein and a phenotypic trait. Protein targets are classified on the basis of their reported relation to SARS-CoV-2 and COVID-19. Each slice contains any cis-pQTLs associated with the target protein annotated and effect estimates were aligned to the protein increasing allele, i.e. bars with a positive −log10(p-values) indicate positive associations with a trait from the database and vice versa. Clinical traits are grouped by higher-level categories and coloured accordingly. GIT = gastrointestinal tract, Misc = Miscellaneous, No coloc. pos. = colocalisation for secondary signals was not possible