| Literature DB >> 33730015 |
David Holcomb1, Aikaterini Alexaki1, Nancy Hernandez1, Ryan Hunt1, Kyle Laurie1, Jacob Kames1, Nobuko Hamasaki-Katagiri1, Anton A Komar2, Michael DiCuccio3, Chava Kimchi-Sarfaty1.
Abstract
Thrombosis is a recognized complication of Coronavirus disease of 2019 (COVID-19) and is often associated with poor prognosis. There is a well-recognized link between coagulation and inflammation, however, the extent of thrombotic events associated with COVID-19 warrants further investigation. Poly(A) Binding Protein Cytoplasmic 4 (PABPC4), Serine/Cysteine Proteinase Inhibitor Clade G Member 1 (SERPING1) and Vitamin K epOxide Reductase Complex subunit 1 (VKORC1), which are all proteins linked to coagulation, have been shown to interact with SARS proteins. We computationally examined the interaction of these with SARS-CoV-2 proteins and, in the case of VKORC1, we describe its binding to ORF7a in detail. We examined the occurrence of variants of each of these proteins across populations and interrogated their potential contribution to COVID-19 severity. Potential mechanisms, by which some of these variants may contribute to disease, are proposed. Some of these variants are prevalent in minority groups that are disproportionally affected by severe COVID-19. Therefore, we are proposing that further investigation around these variants may lead to better understanding of disease pathogenesis in minority groups and more informed therapeutic approaches.Entities:
Year: 2021 PMID: 33730015 PMCID: PMC8007013 DOI: 10.1371/journal.pcbi.1008805
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Graphic summary of ORF7a-VKORC1 interaction and possible effects.
The interaction between ORF7a and VKORC1 and possible effects of this interaction.
Sequence homology of selected SARS and SARS-CoV-2 proteins.
| Protein | Fraction Matching | Loglikelihood |
|---|---|---|
| N | 0.888626 | -0.04824 |
| ORF7a | 0.827869 | -0.05688 |
| nsp14 | 0.884393 | -0.19572 |
| ORF7b | 0.795455 | -0.01103 |
| nsp3 | 0.75078 | -0.46736 |
| nsp2 | 0.681818 | -0.44764 |
| nsp8 | 0.969697 | -0.12897 |
| nsp13 | 0.948767 | -0.1215 |
MSA fraction matching is the fraction of positions in the SARS-CoV-2 protein matching the homologous SARS protein, when both are aligned in an MSA. Higher number indicates more conserved position and the range is between 0 and 1.
MSA likelihood is the fraction of sequences in an MSA matching SARS-CoV-2 for a given column. Assuming all columns are independent, ∏P(x) gives the probability of finding the SARS-CoV-2 sequence in the MSA sequences, which ranges between 0 and 1. Taking log of this value gives log(∏P(x)) = ∑(log P(x)), an additive loglikelihood score which is nonpositive, with lower values indicating more positions in the SARS-CoV-2 sequence that differ from the MSA sequences.
Possible predicted effect of variants in VKORC, SERPING1 and PABPC4.
| Transcript | Location | Fraction matching in MSA | Change in splicing | Average change in mRNA MFE (Z-score) | miRNA summary |
|---|---|---|---|---|---|
| VKORC1 | |||||
| NM_024006.4:c.-4931C>T | 5’ UTR | 0.431818 | 2.20025 | ||
| NM_024006.4:c.-4851C>T | 5’ UTR | 0.754967 | 1.06847 | ||
| NM_024006.4:c.-2834C>A | 5’ UTR | ||||
| NM_024006.4:c.-1639G>A | 5’ UTR | 0.0625 | Possible splicing change | ||
| NM_024006.4:c.174-136C>T | Intron | 0.020228 | 0.83549 | ||
| NM_024006.4:c.283+124G>C | Intron | 0.133333 | -1.44871 | ||
| NM_024006.4:c.283+837T>C | Intron | 0.146727 | 1.35367 | ||
| SERPING1 | |||||
| NM_000062.2:c.-3537C>G | 5’ UTR | 0.009615 | |||
| NM_000062.2:c.-2415G>A | 5’ UTR | 0.033708 | Possible splicing change | 0.77565 | |
| NM_000062.2:c.-1675G>A | 5’ UTR | 0.426901 | |||
| NM_000062.2:c.52-696C>T | Intron | 0.068027 | 0.30003 | ||
| NM_000062.2:c.52-130C>T | Intron | 0.833333 | |||
| NM_000062.2:c.52-130C>T | Intron | 0.833333 | -0.28155 | ||
| NM_000062.2:c.550+794C>A | Intron | 0.693694 | 0.71906 | ||
| NM_000062.2:c.685+88G>A | Intron | 0.769231 | -0.75902 | ||
| Intron | 0.581818 | -0.47852 | miRNA decrease | ||
| Intron | 0.679245 | -0.68662 | |||
| Intron | 0.772313 | ||||
| Intron | 0.841912 | ||||
| Intron | |||||
| Intron | 0.793594 | 0.84106 | |||
| Intron | 0.015723 | miRNA gained | |||
| NM_000062.2:c.1029+1443G>C | Intron | 0.595745 | miRNA gained | ||
| Intron | 0.393617 | ||||
| NM_000062.2:c.1029+2111G>A | Intron | 0.687117 | miRNA gained | ||
| NM_000062.2:c.1030-2243T>G | Intron | 0.026616 | -1.68665 | ||
| NM_000062.2:c.1030-1975G>C | Intron | 0.02551 | |||
| NM_000062.2:c.1030-1436T>C | Intron | 0.823529 | |||
| NM_000062.2:c.1030-20A>G | Intron | 0.25 | -0.76751 | ||
| NM_000062.2:c.1438G>A | Exon | 0.399177 | Possible splicing change | ||
| NM_000062.2:c.*1323G>A | 3’ UTR | 0.878788 | miRNA lost | ||
| NM_000062.2:c.*1521G>T | 3’ UTR | 0.65873 | |||
| NM_000062.2:c.*2614A>T | 3’ UTR | 0.016129 | miRNA gained | ||
| PABPC4 | |||||
| NM_003819.3:c.-5600T>C | 5’ UTR | 0.48 | |||
| NM_003819.3:c.-4432G>A | 5’ UTR | 0.009317 | |||
| NM_003819.3:c.-4428A>G | 5’ UTR | 0.221505 | |||
| NM_003819.3:c.-3677T>G | 5’ UTR | 0.021645 | Possible splicing change | -0.59763 | |
| NM_003819.3:c.-3636G>A | 5’ UTR | 0.022378 | 2.31727 | ||
| NM_003819.3:c.-3198T>C | 5’ UTR | 0.856079 | Possible splicing change | ||
| NM_003819.3:c.-2286T>G | 5’ UTR | 0.210526 | |||
| NM_003819.3:c.-650C>T | 5’ UTR | 0.829978 | |||
| NM_003819.3:c.193+796C>G | Intron | 0.666667 | |||
| NM_003819.3:c.504-254C>A | Intron | 0.247191 | 0.26322 | ||
| NM_003819.3:c.738+85T>C | Intron | 0.333333 | miRNA lost | ||
| NM_003819.3:c.877-387C>T | Intron | miRNA lost | |||
| NM_003819.3:c.972+53A>T | Intron | ||||
| NM_003819.3:c.972+704C>G | Intron | 0.5 | |||
| NM_003819.3:c.1333+26C>G | Intron | 0.3125 | |||
| NM_003819.3:c.1621-348C>G | Intron | ||||
| NM_003819.3:c.*765C>A | 3’ UTR | ||||
| NM_003819.3:c.*1261C>T | 3’ UTR | 0.771242 | -0.91257 | miRNA decrease | |
| NM_003819.3:c.*4685A>G | 3’ UTR | 0.054945 | -3.4524 | miRNA decrease | |
| NM_003819.3:c.*5316C>T | 3’ UTR | 0.696181 | Possible splicing change | miRNA lost | |
Change in splicing is presented when all tools find a change in splicing and all hexamer scores are greater than one standard deviation from the mean, and is marked in red when the variant appears in an intron. mRNA MFE changes are normalized (converted into a Z-score) for KineFold, remuRNA, and mFold, then averaged. When all three mRNA MFE changes are above one standard deviation, we mark the value in underline. miRNA summaries are presented when all miRNA changes agree in direction, and the total change is at least 5. miRNA changes are underlined when the variant appears upstream.
Population frequencies (gnomAD) of GWAS variants of VKORC1, SERPING1, and PABPC4.
| Transcript | Global | African | American | Ashkenazi Jewish | East Asian | European | Other |
|---|---|---|---|---|---|---|---|
| VKORC1 | |||||||
| NM_024006.4:c.-4931C>T | 0.5758 | 0.5408 | 0.542 | 0.514 | 0.63146 | 0.619 | |
| NM_024006.4:c.-4851C>T | |||||||
| NM_024006.4:c.-2834C>A | 0.0049 | 0.0001 | 0.001 | 0 | 0 | 0.00762 | 0.007 |
| NM_024006.4:c.-1639G>A | 0.326 | 0.444 | 0.476 | 0.37236 | 0.369 | ||
| NM_024006.4:c.174-136C>T | 0.3261 | 0.443 | 0.476 | 0.37264 | 0.37 | ||
| NM_024006.4:c.283+124G>C | 0.4163 | 0.2564 | 0.442 | ||||
| NM_024006.4:c.283+837T>C | 0.6431 | 0.7907 | 0.546 | 0.517 | 0.62682 | 0.628 | |
| NM_014699.3:c.*2082G>C | 0.0049 | 0.0001 | 0.001 | 0 | 0 | 0.00763 | 0.007 |
| NM_014699.3:c.*2737G>T | 0.0048 | 0.017 | 0.002 | 0 | 0 | 0.00005 | 0 |
| SERPING1 | |||||||
| NM_000062.2:c.-3537C>G | 0.0275 | 0.0078 | 0.021 | 0.017 | 0 | 0.03846 | 0.041 |
| NM_000062.2:c.-2415G>A | 0.0939 | 0.0866 | 0.059 | 0.141 | 0.1113 | 0.09708 | 0.087 |
| NM_000062.2:c.-1675G>A | 0.0937 | 0.0862 | 0.059 | 0.141 | 0.1105 | 0.09697 | 0.087 |
| NM_000062.2:c.52-696C>T | 0.3927 | 0.4767 | 0.529 | 0.452 | 0.7655 | 0.31673 | 0.386 |
| NM_000062.2:c.52-130C>T | 0.385 | 0.448 | 0.525 | 0.455 | 0.7668 | 0.31739 | 0.385 |
| NM_000062.2:c.52-130C>T | 0.385 | 0.448 | 0.525 | 0.455 | 0.7668 | 0.31739 | 0.385 |
| NM_000062.2:c.550+794C>A | 0.3936 | 0.4761 | 0.531 | 0.451 | 0.7697 | 0.31779 | 0.388 |
| NM_000062.2:c.685+88G>A | 0.2225 | 0.1006 | 0.15 | 0.262 | 0.1157 | 0.28733 | 0.273 |
| NM_000062.2:c.685+1391C>T | 0.0248 | 0.0059 | 0.022 | 0.035 | 0 | 0.03499 | 0.038 |
| NM_000062.2:c.685+659C>T | 0.3901 | 0.4743 | 0.541 | 0.455 | 0.769 | 0.31084 | 0.381 |
| NM_000062.2:c.685+659C>T | 0.3901 | 0.4743 | 0.541 | 0.455 | 0.769 | 0.31084 | 0.381 |
| NM_000062.2:c.685+1100C>T | 0.2253 | 0.1124 | 0.147 | 0.262 | 0.1208 | 0.28765 | 0.274 |
| NM_000062.2:c.685+1550G>T | 0.2251 | 0.1127 | 0.152 | 0.264 | 0.1207 | 0.28734 | 0.269 |
| NM_000062.2:c.685+1770C>T | 0.2216 | 0.0992 | 0.15 | 0.262 | 0.1184 | 0.28696 | 0.27 |
| NM_000062.2:c.1029+926G>T | 0.2279 | 0.1 | 0.15 | 0.264 | 0.1224 | 0.29523 | 0.284 |
| NM_000062.2:c.1029+1443G>C | 0.2282 | 0.1004 | 0.15 | 0.269 | 0.1198 | 0.29577 | 0.284 |
| NM_000062.2:c.1029+2110T>C | 0.612 | 0.5191 | 0.469 | 0.538 | 0.2347 | 0.69271 | 0.624 |
| NM_000062.2:c.1029+2111G>A | 0.227 | 0.1003 | 0.15 | 0.264 | 0.1183 | 0.29407 | 0.283 |
| NM_000062.2:c.1030-2243T>G | 0.6129 | 0.5195 | 0.47 | 0.541 | 0.2387 | 0.69335 | 0.626 |
| NM_000062.2:c.1030-1975G>C | 0.0113 | 0.0022 | 0.008 | 0.024 | 0 | 0.01647 | 0.008 |
| NM_000062.2:c.1030-1436T>C | 0.0045 | 0.0014 | 0.001 | 0.003 | 0 | 0.00645 | 0.004 |
| NM_000062.2:c.1030-20A>G | 0.6134 | 0.5197 | 0.472 | 0.541 | 0.2461 | 0.69353 | 0.623 |
| NM_000062.2:c.1438G>A | 0.2282 | 0.1007 | 0.15 | 0.269 | 0.1202 | 0.29561 | 0.285 |
| NM_000062.2:c.*1323G>A | 0.2283 | 0.1009 | 0.151 | 0.269 | 0.1175 | 0.29578 | 0.285 |
| NM_000062.2:c.*1521G>T | 0.1496 | 0.0855 | 0.166 | 0.0942 | |||
| NM_000062.2:c.*2614A>T | 0.6058 | 0.4936 | 0.463 | 0.538 | 0.2277 | 0.69504 | 0.626 |
| PABPC4 | |||||||
| NM_003819.3:c.-5600T>C | 0.8127 | 0.5571 | 0.918 | 0.945 | 0.9909 | 0.90666 | 0.891 |
| NM_003819.3:c.-4432G>A | 0.0403 | 0.024 | 0.02506 | 0.047 | |||
| NM_003819.3:c.-4428A>G | 0.0432 | 0.024 | 0.02918 | 0.052 | |||
| NM_003819.3:c.-3677T>G | 0.1792 | 0.122 | 0.247 | 0.1111 | 0.24258 | 0.219 | |
| NM_003819.3:c.-3636G>A | 0.0052 | 0.0027 | 0.004 | 0.007 | 0 | 0.0068 | 0.007 |
| NM_003819.3:c.-3198T>C | 0.0025 | 0.001 | 0.002 | 0 | 0 | 0.00329 | 0.004 |
| NM_003819.3:c.-2286T>G | 0.0136 | 0.0031 | 0.014 | 0.01 | 0 | 0.01952 | 0.015 |
| NM_003819.3:c.-650C>T | 0.0079 | 0.0027 | 0.002 | 0 | 0 | 0.01172 | 0.008 |
| NM_003819.3:c.193+796C>G | 0.8013 | 0.913 | 0.945 | 0.9904 | 0.90714 | 0.895 | |
| NM_003819.3:c.504-254C>A | 0.1438 | 0.079 | 0.2 | 0.1093 | 0.19797 | 0.183 | |
| NM_003819.3:c.738+85T>C | 0.0573 | 0.012 | 0.01 | 0.014 | |||
| NM_003819.3:c.877-387C>T | 0.113 | 0.0243 | 0.065 | 0.131 | 0.1086 | 0.15452 | 0.146 |
| NM_003819.3:c.972+53A>T | 0.0018 | 0.0065 | 0 | 0 | 0 | 0 | 0 |
| NM_003819.3:c.972+704C>G | 0.0025 | 0.001 | 0.002 | 0 | 0 | 0.00328 | 0.004 |
| NM_003819.3:c.1333+26C>G | 0.0006 | 0.0001 | 0 | 0 | 0 | 0.00095 | 0 |
| NM_003819.3:c.1621-348C>G | 0.0003 | 0.0001 | 0 | 0 | 0 | 0.00042 | 0 |
| NM_003819.3:c.*765C>A | 0.0403 | 0.024 | 0.3656 | 0.02561 | 0.049 | ||
| NM_003819.3:c.*1261C>T | 0.0073 | 0.0256 | 0 | 0 | 0.0006 | 0.00005 | 0.003 |
| NM_003819.3:c.*4685A>G | 0.7986 | 0.911 | 0.945 | 0.991 | 0.90696 | 0.894 | |
| NM_003819.3:c.*5316C>T | 0.007 | 0.0028 | 0.008 | 0 | 0 | 0.00955 | 0.008 |
Population frequencies are taken from dbSNP. Populations with greater distance from global distribution are underlined.
Fig 2Predicted dock of VKORC1 and ORF7a transmembrane domain.
A. Five protein-protein docks depict one main binding site (teal, grey, yellow, green, blue). B. The lowest interface-energy model is shown as a surface representation. C. The lowest interface-energy model, with side chains shown in wheat for amino acids at the interface. D. Another view of the lowest interface-energy model, with side chains shown in wheat at the interface and hydrophobics shown in blue. Amino acids of VKORC1 necessary for vitamin K binding (83F, 80N, 135C, 55F) or warfarin binding (134V, 133I) are given in green.
Fig 3Plots of interface energy (I_sc) against interface root mean square error (I_rms).
Each point represents a complex formed from one of the top 5 ZDock outputs of VKORC1 and ORF7a proteins, using 10,000 decoys. All plots form energy funnels.
Variations’ clinical impact.
| Variation | Clinical Impact based on Literature |
|---|---|
| NM_024006.4: c.283+837T>C | South Indians carrying the C nucleotide require lower warfarin dosages relative to WT (T) [ |
| NM_024006.4: c.283+124G>C | European Americans carrying the G nucleotide require lower warfarin dosages relative to WT (C) [ |
| NM_024006.4: c.174-136C>T | Turkish carrying the T nucleotide require lower warfarin dosages [ |
| NM_024006.4: c.-1639G>A | Chinese carrying the A nucleotide require lower warfarin dosages relative to WT (G) [ |
| NM_024006.4: c.-4931C>T | South Indians carrying the T nucleotide require increased warfarin dosages relative to WT (C) [ |
| NM_000062.2: c.52-130C>T | Patients carrying the T nucleotide depicted worsened progression for age-related macular degeneration relative to WT (C) [ |
| NM_000062.2: c.1029+2110T>C | European and Mediterranean patients carrying the C nucleotide did not depict a higher association with hereditary angioedema relative to WT [ |
| NM_000062.2: c.1030-1975G>C | The intronic polymorphism 1030 +1975G>C has no pathogenic influence on hereditary angioedema although predicted as pathogenic [ |
| NM_000062.2: c.1030-20A>G | Association of the G allele with age-related macular degeneration was predicted to decrease the variant splicing form SERPING1, decrease protein expression and potentially limit the regulation of the compliment system [ |
| NM_000062.2: c.-2415G>A | Chinese Han patients carrying the A nucleotide did not demonstrate an increased risk of polypoidal choroidal vasculopathy relative to WT (G) [ |
| NM_000062.2: c.52-696C>T | Patients carrying the T nucleotide did not display an increased risk for anterior uveitis relative to WT (C) [ |
| NM_000062.2: c.52-130C>T | Chinese carrying the T nucleotide did not display a different association with age-related macular degeneration relative to WT (C) [ |
| NM_000062.2: c.685+659C>T | Caucasians carrying the A nucleotide failed to depict a greater association with AMD relative to WT (G) [ |
| NM_000062.2: c.685+1100C>T | European and Mediterranean patients carrying the T nucleotide failed to show a greater association with hereditary angioedema relative to WT (C) [ |
| NM_000062.2: c.1029+926G>T | European and Mediterranean patients carrying the T nucleotide failed to show a greater association with hereditary angioedema relative to WT (G) [ |
| NM_000062.2: c.1029+1443G>C | European and Mediterranean patients carrying the C nucleotide failed to show a greater association with hereditary angioedema relative to WT (G) [ |
| NM_000062.2: c.1029+2111G>A | European and Mediterranean patients carrying the A nucleotide failed to show a greater association with hereditary angioedema relative to WT (G) [ |
| NM_000062.2: c.1438G>A | Patients carrying the A nucleotide did not depict a change in Tacrolimus dosage requirements for transplant operations relative to WT (G) [ |
| NM_003819.3: c.504-254C>A | Increased risk for type 2 diabetes with the 40035928G>T polymorphism based on GWAS studies [ |
Population frequencies of missense and synonymous VKORC1 variants.
| VKORC1 variant | Warfarin Sensitivity | Prevalence | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Overall | African | Latino | Ashkenazi Jewish | East Asian | Finnish | Non-Finnish European | Other groups | South Asian | |||
| Missense | 106GT | Resistance | 0.00241 | 0 | 0.00166 | 0.03857 | 0 | 5.6E-05 | 0.00069 | 0.00460 | 0.00091 |
| 203AG | 0.00044 | 0 | 0 | 0 | 0.00571 | 0 | 8.8E-06 | 0.00016 | 9.8E-05 | ||
| 352GC | 0.00036 | 0 | 0 | 0 | 0.00359 | 0 | 0 | 0.00016 | 0.00078 | ||
| 202CT | 0.00036 | 6.2E-05 | 0 | 0 | 0 | 0.00083 | 0.00059 | 0.00065 | 0 | ||
| 79CG | 0.00031 | 0 | 0 | 0 | 0 | 0 | 6.8E-05 | 0 | 0.00217 | ||
| 196GA | 0.00020 | 0.00265 | 0.00014 | 0 | 0 | 0 | 0 | 0 | 3.3E-05 | ||
| 427GA | 0.00017 | 0 | 0.00116 | 0 | 0 | 0 | 8.8E-06 | 0.00016 | 0 | ||
| 390TG | 0.00012 | 0 | 0 | 0 | 0.00169 | 0 | 0 | 0 | 0 | ||
| 157CA | 0.00011 | 0 | 0 | 0 | 0.00139 | 0 | 0 | 0.00017 | 0 | ||
| 163TC | 1.0E-04 | 0 | 0.00067 | 0 | 0 | 0 | 0 | 0.00017 | 0 | ||
| Synonymous | 358CT | Resistance | 0.01558 | 0.19520 | 0.01186 | 0.00626 | 0.00011 | 0 | 0.00170 | 0.01044 | 0.00065 |
| 36GA | Resistance* | 0.01511 | 0.00210 | 0.01583 | 0.02262 | 5.6E-05 | 0.07348 | 0.01171 | 0.02296 | 0.00200 | |
| 129CT | 0.003643 | 0.00115 | 0.00093 | 0.00115 | 0 | 0.00504 | 0.00567 | 0.00415 | 0.00282 | ||
| 54GT | 7.7E-05 | 0 | 0.00030 | 0 | 0.00023 | 0 | 2.9E-05 | 0 | 3.3E-05 | ||
| 234AG | 3.6E-05 | 0 | 0.00012 | 0 | 0 | 0 | 3.5E-05 | 0.00016 | 0 | ||
| 54GC | 2.6E-05 | 7.1E-05 | 3.0E-05 | 0 | 0 | 0 | 2.9E-05 | 0 | 3.3E-05 | ||
| 18GA | 1.7E-05 | 0 | 9.0E-05 | 0 | 0 | 0 | 9.8E-06 | 0 | 0 | ||
| 111GA | 1.7E-05 | 0 | 3.0E-05 | 0 | 0.00011 | 0 | 9.8E-06 | 0 | 0 | ||
| 72CT | 1.7E-05 | 0 | 0 | 0 | 0 | 0 | 3.9E-05 | 0 | 0 | ||
| 186TG | 1.6E-05 | 0 | 0 | 0 | 0 | 0 | 3.5E-05 | 0 | 0 | ||
Warfarin sensitivity is determined by literature review.
Fig 4Locations of warfarin dosage affecting nonsynonymous variants in VKORC1.
VKORC1 is shown in salmon, while ORF7a is shown in grey. Warfarin dosage affecting nonsynonymous variants are shown in blue. Active site amino acids are shown in green.
Fig 5Structural alignment of ORF7a, CXADR, and PCDH1 proteins.
The alignment is largely confined to the beta sheets.