Literature DB >> 35636966

Transmembrane protease serine 2 (TMPRSS2) rs75603675, comorbidity, and sex are the primary predictors of COVID-19 severity.

Gonzalo Villapalos-García1, Pablo Zubiaur2,3, Rebeca Rivas-Durán4, Pilar Campos-Norte4, Cristina Arévalo-Román4, Marta Fernández-Rico4, Lucio García-Fraile Fraile4, Paula Fernández-Campos1, Paula Soria-Chacartegui1, Sara Fernández de Córdoba-Oñate5, Pablo Delgado-Wicke5, Elena Fernández-Ruiz5, Isidoro González-Álvaro6, Jesús Sanz4, Francisco Abad-Santos7,3, Ignacio de Los Santos8.   

Abstract

By the end of December 2021, coronavirus disease 2019 (COVID-19) produced more than 271 million cases and 5.3 million deaths. Although vaccination is an effective strategy for pandemic control, it is not yet equally available in all countries. Therefore, identification of prognostic biomarkers remains crucial to manage COVID-19 patients. The aim of this study was to evaluate predictors of COVID-19 severity previously proposed. Clinical and demographic characteristics and 120 single-nucleotide polymorphisms were analyzed from 817 patients with COVID-19, who attended the emergency department of the Hospital Universitario de La Princesa during March and April 2020. The main outcome was a modified version of the 7-point World Health Organization (WHO) COVID-19 severity scale (WHOCS); both in the moment of the first hospital examination (WHOCS-1) and of the severest WHOCS score (WHOCS-2). The TMPRSS2 rs75603675 genotype (OR = 0.586), dyslipidemia (OR = 2.289), sex (OR = 0.586), and the Charlson Comorbidity Index (OR = 1.126) were identified as the main predictors of disease severity. Consequently, these variables might influence COVID-19 severity and could be used as predictors of disease development.
© 2022 Villapalos-García et al.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35636966      PMCID: PMC9152129          DOI: 10.26508/lsa.202201396

Source DB:  PubMed          Journal:  Life Sci Alliance        ISSN: 2575-1077


Introduction

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic started in Wuhan, China, in December 2019. This virus causes the new coronavirus disease 2019 (COVID-19). By the end of December 2021, more than 271 million cases and 5.3 million deaths had been reported (1). Although vaccination is a proved effective strategy for pandemic control, it is not yet equally available in all countries of the world (2). Even in some developed countries, the slowdown in vaccination hinders the achievement of herd immunity (3, 4). Hence, reaching worldwide herd immunity seems highly unlikely in the medium to long term, and it is expected that the virus will remain a health problem in the following months and years. Therefore, the authorization of effective therapies and identification of prognostic biomarkers remains crucial to manage COVID-19 patients more rationally. This is of particular importance because strains emerge that may be potentially more infectious, could cause more severe disease and, above all, could escape the protection of vaccines. This could be the case for the emerging strain omicron (B.1.1.529), a novel variant of concern (5). Although several studies were published to date evaluating genetic biomarkers associated with COVID-19 severity, most were exploratory, showing heterogenic results, and still nowadays, a clinically relevant genetic biomarker was not described. The first were genes involved in virus entrance to the host, such as angiotensin converting enzyme 2 gene (ACE2) and transmembrane serine protease 2 gene (TMPRSS2). Different research groups have suggested several candidate variants of ACE2, namely rs2285666 or rs4646116 (6, 7). On the other hand, for the TMPRSS2, variants such as rs2298659, rs17854725, rs12329760, and rs75603675 were found to be different in the frequency in populations more affected by the disease (8, 9). However, a later genome-wide association study (GWAS) of severe COVID-19 with respiratory failure reported two clusters of genes associated with two different polymorphisms: rs11385942 in leucine zipper transcription factor like 1 gene (LZTFL1) and rs657152 in the ABO gene (10). Because of the disparity of the observed findings, additional confirmatory and exploratory studies are warranted. The aim of this work was to perform a review of the published single-nucleotide polymorphisms (SNPs) related to COVID-19 prognosis or severity by the end of 2020 and to evaluate them in an independent validation cohort. For this purpose, we genotyped 817 patients managed at Hospital Universitario de La Princesa, for a panel of 120 SNPs selected based on an extensive literature search.

Results

The population consisted on 817 patients, 453 (55.45%) males and 364 (44.55%) females. The range of age was 19 to 97 yr, where the mean age was 60 yr. The baseline characteristics of the study population are shown in Table 1. Biogeographical origin of patients was inferred by their country of birth: 636 were European, 161 were American, 7 were East Asian, 6 were Near Eastern, and 1 was Central/South Asian. Most patients were symptomatic and required hospitalization with oxygen supplementation at the moment of the first hospital visit (WHOCS-1 = 4, 51.38%), followed by asymptomatic or mild patients (WHOCS-1 = 1 and 2, 27.92%) and by symptomatic without need for oxygen supplementation (WHOCS-1 = 3, 19.74%). As for the severest clinical situation, 77 died or required ICU admission (WHOCS-2 = 6–7, 9.27%), and the remaining severity groups were distributed similarly like in WHOCS-1. Most patients presented a CCI of 2–8 (93.50%), patients with CCI = 1 accounted for 3.97%, and patients with CCI between 9 and 13 accounted for 2.53% of the population.
Table 1.

World Health Organization COVID-19 score at admission (WHOCS-1), maximum World Health Organization COVID-19 score (WHOCS-2), and Charlson Comorbidity Index (CCI).

MaleFemaleNMaleFemaleN
WHOCS-1WHOCS-2
 1,296 (45.28%)116 (54.72%)212 (25.59%)1,296 (44.5%)116 (55.5%)212 (25.59%)
 388 (53.33%)77 (46.67%)165 (20.2%)365 (52%)60 (48%)125 (15.3%)
 4263 (61.16%)167 (38.84%)430 (52.63%)4211 (56.42%)163 (43.58%)374 (45.78%)
 53 (75%)1 (25%)4 (0.49%)523 (79.31%)6 (20.69%)29 (3.55%)
 63 (50%)3 (50%)6 (0.73%)645 (78.95%)12 (21.05%)57 (6.98%)
713 (65%)7 (35%)20 (2.45%)
CCI
715 (62.5%)9 (37.5%)24 (2.94%)
 072 (49.32%)74 (50.68%)146 (17.87%)84 (57.14%)3 (42.86%)7 (0.86%)
 175 (49.67%)76 (50.33%)151 (18.48%)97 (100%)0 (0%)7 (0.86%)
 2106 (12.97%)80 (9.79%)186 (22.77%)102 (66.67%)1 (33.33%)3 (0.37%)
 366 (55.93%)52 (44.07%)118 (14.44%)111 (100%)0 (0%)1 (0.12%)
 444 (55.7%)35 (44.3%)79 (9.67%)120 (0%)1 (100%)1 (0.12%)
 536 (61.02%)23 (38.98%)59 (7.22%)132 (100%)0 (0%)2 (0.24%)
 623 (69.7%)10 (30.3%)33 (4.04%)Total453 (55.45%)364 (44.55%)817 (100%)
World Health Organization COVID-19 score at admission (WHOCS-1), maximum World Health Organization COVID-19 score (WHOCS-2), and Charlson Comorbidity Index (CCI). Treatments received before the first emergency room visit and during the admission in the hospital are described in Table 2. The most frequently prescribed treatments prior first emergency room were ACE inhibitors (ACEIs) and angiotensin II receptor blockers (ARA-II), received by 13.22% and 15.06% of patients, respectively. The most frequently used treatments for the management of the disease were hydroxychloroquine or chloroquine (74.30%), heparin (56.18%), lopinavir/ritonavir combination (49.69%), and corticoids (43.33%).
Table 2.

Treatments used (a) before first emergency room visit and (b) for the treatment of COVID-19 during admission.

Before first emergency room visit (n = 190, 23.25%)
 ACE inhibitors108 (13.22%)
 Angiotensin II receptor blocker123 (15.06%)
 Aldosterone antagonist8 (0.98%)
 Anticoagulants32 (3.92%)
 Corticoids22 (2.69%)
 Immunosuppressants21 (2.57%)
For the treatment of COVID-19 (n = 810, 99.14%)
 Hydroxychloroquine/chloroquine607 (74.30%)
 Corticoids354 (43.33%)
 Tocilizumab107 (13.10%)
 Heparin459 (56.18%)
 Remdesivir8 (0.98%)
 Lopinavir/ritonavir406 (49.69%)
 Transfusion of hyperimmune plasma9 (1.10%)
Treatments used (a) before first emergency room visit and (b) for the treatment of COVID-19 during admission. The univariate analysis of severity 1 and 2 variables is shown in Table S1, including a summary of nominally significant variables and those who reached a corrected P′ < 0.05, which were included in the multivariate analysis. Biogeographical group resulted nonsignificant. Males compared with females (OR = 0.586), a higher CCI (OR = 1.126) (covariates), dyslipidemia (OR = 2.289), and TMPRSS2 rs75603675 (C/C or C/A diplotypes, compared with the A/A diplotype) (OR =2.140) were significantly related to a higher severity 1 and 2 status, after multivariate analysis and Bonferroni correction for multiple comparisons (Fig 1).
Figure 1.

Forest-plot showing statistically significant associations and their odds ratio (OR) between the Charlson Comorbidity Index (CCI), sex, dyslipidemia, and the TMPRSS2 rs75603675 genotype and COVID-19 severity at the moment of the first hospital emergency room visit (severity 1) and the severest COVID-19 status (severity 2).

Table S1 Summary of nominally significant associations and whole results of the univariate analysis of (A) the severity 1 and severity 2 variables and (B) the WHOCS-1 and WHOCS-2 variables. The univariate analysis of WHOCS-1 and WHOCS-2 is shown in Table S1, including a summary of nominally significant variables and those who reached a corrected P′ < 0.05, which were included in the multivariate analysis. The same variables identified in the multivariate analysis of severities 1 and 2 were now observed in the multivariate analysis of WHOCS-1 and WHOCS-2 (Table 3).
Table 3.

Multivariate analysis of WHOCS-1 and WHOCS-2 variability.

Multivariate
WHOCS-1WHOCS-2
EstimateSEPPEstimateSE P P
(Intercept)1.9620.156<0.001<0.001(Intercept)2.5970.141<0.001<0.001
Dyslipidemia0.5790.137<0.001<0.001Dyslipidemia0.5510.124<0.001<0.001
TMPRSS2 rs75603675 C/C + C/A versus A/A0.5910.138<0.001<0.001TMPRSS2 rs75603675 C/C + C/A versus A/A0.4050.1250.0010.005
CCI0.1010.0300.0010.003CCI0.1200.027<0.001<0.001
Sex−0.3260.1220.0080.032Sex-0.4150.111<0.0010.001

WHOCS-1, modified World Health Organization COVID-19 severity scale at first hospital examination; WHOCS-2, highest score on the modified World Health Organization COVID-19 severity scale; CCI, Charlson Comorbidity Index; SE, standard error; P, nominal significance; P’, significance after Bonferroni correction for multiple comparisons.

Multivariate analysis of WHOCS-1 and WHOCS-2 variability. WHOCS-1, modified World Health Organization COVID-19 severity scale at first hospital examination; WHOCS-2, highest score on the modified World Health Organization COVID-19 severity scale; CCI, Charlson Comorbidity Index; SE, standard error; P, nominal significance; P’, significance after Bonferroni correction for multiple comparisons. Based on the estimates obtained from the multivariate analysis, the following equations were proposed to calculate WHOCS-1 and WHOCS-2 scales in infected in Table 4.
Table 4.

Equations for the prediction of WHOCS-1 and WHOCS-2 for COVID-19 severity of patients.

WHOCS-1
1.962Basal severity
 +0.579If patient has dyslipidemia
 +0.591If TMPRSS2 rs75603675 C/C or C/A genotypes are present
 +0.101x CCI
 −0.326If patient is female
Total:
WHOCS-2
2.597Basal severity
 +0.551If patient has dyslipidemia
 +0.405If TMPRSS2 rs75603675 C/C or C/A genotypes are present
 +0.120x CCI
 −0.415If patient is female
Total:
Equations for the prediction of WHOCS-1 and WHOCS-2 for COVID-19 severity of patients. The remaining variables, that is, drugs used before COVID-19 infection, drugs used for the treatment of the disease, the remaining polymorphisms, etc., were unrelated to disease severity in both analyses.

Discussion

The scientific community’s effort to investigate biomarkers for predicting the risk and severity of infection was strenuous since December 2019 to date. A huge amount of works were published in this regard, including reviews and systematic reviews (11). Although there is some consensus on which biomarkers can track disease progression (mainly pro-inflammatory cytokines, ferritin, etc.), there is no biomarker that can predict disease progression from baseline. Baseline health status and demographic characteristics, including sex, age, and comorbidities have been described as the main predisposing factors (12, 13, 14, 15, 16). However, a percentage of severity is not explained by the latter factors (12, 13). Genetic polymorphism may explain part of this susceptibility, which resulted in dozens of publications proposing several SNPs and other genetic alterations associated with susceptibility to COVID-19 infection and severity (Table 5). However, to our knowledge, very few studies validated these associations and their potential clinical relevance. Our intention was, therefore, to design a panel of polymorphisms to validate their usefulness in an independent set of patients.
Table 5.

Genes, single-nucleotide polymorphism (SNPs) identifiers, maximum allele frequencies in Iberians and Americans, and the impact of the reviewed and included variants.

GeneSNPs (rs)MAF IBSMAF AMRVariant impactReferences
ABCB1 rs10456420.46 (A)0.43 (A)Synonymous variant 17
rs11285030.38 (A)0.40 (A)Synonymous variant 17
rs20325820.41 (A)0.37 (A)Missense variant 17
rs20325820.02 (T)0.06 (T)Missense variant 17
ABO rs6571520.37 (A)0.30 (A)Intron variant 18
ACE rs42910.36 (T)0.28 (T)Regulatory region variant 19
rs1799752Not availableNot availableIntron variant 20
rs43430.44 (A)0.39 (G)Synonymous variant 7
ACE2 rs1436953100.04 (A)0.03 (A)Intergenic variant 21
rs21068090.28 (G)0.32 (G)Intron variant 22
rs19781240.43 (T)0.29 (T)Intron variant 23
rs59360290.47 (C)0.29 (C)Intron variant 21
rs19962250.47 (T)0.45 (C)Intron variant 21
rs46461560.34 (A)0.25 (A)Intron variant 7
rs22856660.25 (T)0.34 (T)Splice region variant 24
rs20741920.40 (T)0.40 (T)Intron variant 7
rs358033180.10 (T)0.07 (T)Synonymous variant 25
rs48305420.35 (C)0.28 (C)Intergenic variant 7
rs46461160.01 (C)<0.01 (C)Missense variant 26
rs46461880.14 (G)0.03 (G)Intron variant 27
rs413031710.03 (C)<0.01 (C)Missense variant 21
ADAM17 rs557906760.24 (T)0.13 (T)5′ UTR variant 28
rs126923860.30 (G)0.48 (G)5′ UTR variant 29
AGT rs6990.58 (G)0.37 (A)Missense variant 30
ApoE rs74120.06 (T)0.05 (T)Missense variant 31
rs4293580.14 (C)0.10 (C)Missense variant 31
CCL2 rs10246110.29 (G)0.49 (G)Regulatory region variant 32
CCL5 rs21075380.12 (T)0.23 (T)5′ UTR variant 32
CD14 rs25691900.49 (G)0.47 (G)Intron variant 32
CD147 rs82590.33 (A)0.38 (A)3′ UTR variant 33
CD69 rs110528770.36 (G)0.34 (G)3′ UTR variant 34
CLEC2D rs15600110.41 (A)0.36 (A)Intron variant 34
CRP rs11308640.32 (A)0.33 (A)3′ UTR variant 35
CSF3 rs22273220.40 (G)0.33 (G)5′ UTR variant 36
CYP2C19 rs122485600.21 (T)0.12 (T)Intron variant 17
rs42442850.15 (A)0.10 (A)Synonymous variant 17
CYP2C9 rs17998530.14 (T)0.10 (T)Missense variant 17
rs10579100.08 (C)0.04 (C)Missense variant 17
CYP3A4 rs67666821<0.01 (T x 6)<0.01 (T x 6)Frameshift variant 17
rs355993670.04 (A)0.03 (A)Intron variant 17
CYP3A5 rs7767460.08 (T)0.20 (T)Splice acceptor variant 17
CYP4V2 rs131462720.40 (A)0.47 (A)Missense variant 37
DPP4 rs561791290.01 (T)0.01 (T)Missense variant9, 38, and 39
rs1163027580.01 (C)0.016 (C)Splice acceptor variant9, 38, and 39
rs175740.38(G)0.17(G)Missense variant9, 38, and 39
rs178489160.01(C)0.03 (C)Intron variant9, 38, and 39
ENOX1 rs95949870.48 (C)0.49 (C)Intron variant 34
EPHX1 rs10517400.29 (C)0.32 (C)Missense variant 40
F11 rs22892520.41 (T)0.34 (T)Noncoding transcript exon variant 41
rs20369140.46 (T)0.46 (T)Intron variant 42
FGG rs20668650.22 (A)0.21 (A)Intergenic variant 43
G6PD rs1050828<0.01 (T)0.01 (T)Missense variant 17
rs10508290.01 (C)0.03 (C)Missense variant 17
rs5030868<0.01 (A)<0.01 (A)Missense variant 17
GC (DBP) rs70410.43 (A)0.46 (A)Missense variant 44
rs45880.27 (T)0.20 (T)Missense variant 44
HCP5 rs23950290.02 (G)0.02 (G)Noncoding transcript exon variant 45
HO-1 rs20717460.42 (T)0.31 (T)Intron variant 46
IFITM3 rs122520.03 (G)0.18 (G)Splice region variant47, 48, and 49
IFNL3 rs129798600.30 (T)0.40 (T)Intron variant 17
IL10 rs18008960.41 (C)0.30 (C)Intron variant 32
rs18008710.26 (A)0.33 (A)5′ UTR variant 32
IL13 rs18009250.18 (T)0.23 (T)Noncoding transcript exon variant 50
IL17A rs38190250.07 (A)0.34 (A)Intron variant51 and 52 Preprint
rs22759130.07 (A)0.21 (A)Intergenic variant 53
IL1B rs11436270.32 (G)0.45 (A)5′ UTR variant 50
rs11436340.20 (A)0.13 (A)Synonymous variant 50
IL1RN rs3159520.25 (C)0.21 (C)Missense variant 9
IL6 rs18007950.35 (C)0.18 (C)Intron variant50, 54, 55, and 56
rs18007960.05 (C)0.30 (C)Noncoding transcript exon variant32, 54, and 57
rs18188790.33 (A)0.48 (A)Regulatory region variant 58
IL6R rs22281450.40 (C)0.46 (A)Missense variant 59
rs43295050.15 (C)0.12 (C)Intron variant 17
rs75292290.37 (T)0.41 (T)Intron variant 60
rs112656180.16 (T)0.13 (T)Intron variant 17
rs120835370.20 (G)0.16 (G)Intron variant 17
INFL4 rs113227830.30 (T)0.40 (T)Frameshift variant 61
Intergenic regionrs101082100.49 (C)0.28 (C)Intron variant 62
rs7032970.48 (T)0.36 (C)Regulatory region variant 62
KCNMB1 rs7035050.38 (G)0.47 (G)Intron variant 62
LZTFL1 rs350445620.05 (G)0.05 (G)Intron variant 63
MTHFR rs18011310.27 (G)0.15 (G)Missense variant 64
rs18011330.44 (A)0.47 (A)Missense variant 64
NFKB rs283624910.42 (-)0.49 (-)Noncoding transcript exon variant 65
NLRP3 rs107545550.39 (G)0.42 (G)Intron variant66 and 67
PEAR1 rs120413310.15 (A)0.20 (A)Intron variant 68
PTGS1 rs103061140.06 (G)0.03 (G)Regulatory region variant 69
SLCO1B1 rs41490560.12 (C)0.13 (C)Missense variant17 and 70
TLR1 rs57435510.29 (C)0.47 (C)Intron variant 32
TLR2 rs18988300.29 (G)0.50 (G)Intron variant 32
rs76564110.36 (G)0.24 (G)Regulatory region variant 32
rs119382280.33 (A)0.49 (C)Intron variant 71
rs38040990.43 (C)0.33 (C)Synonymous variant 72
rs18167020.08 (T)0.16 (T)Noncoding transcript exon variant73 and 74
TLR4 rs19279110.22 (A)0.33 (A)Intron variant 32
TLR4 rs50307280.27 (A)0.24 (A)Intron variant75 and 76
TLR9 rs1870840.43 (G)0.43 (G)Intron variant 32
rs3521620.45 (T)0.46 (T)Noncoding transcript exon variant 32
TMPRSS2 rs559645360.50 (T)0.29 (T)Intron variant 32
rs3835100.50 (T)0.38 (T)Intron variant32, 77, and 78
rs4643970.47 (T)0.29 (T)Noncoding transcript exon variant 78
rs4637270.49 (A)0.27 (A)Intergenic variant 32
rs7134000.12 (T)0.11 (T)5′ UTR variant 79
rs81343780.09 (A)0.05 (A)Intron variant 80
rs4693900.36 (G)0.45 (A)Missense variant 78
rs7340560.47 (C)0.32 (A)Intron variant 32
rs20707880.50 (G)0.49 (G)Intron variant32, 77, and 78
rs123297600.18 (T)0.15 (T)Missense variant8, 79, and 81
rs776754060.08 (A)0.09 (A)3′ UTR variant 79
rs756036750.38 (A)0.27 (A)Missense variant 9
TNF rs18006100.13 (A)0.18 (A)Intron variant 82
rs17999640.18 (C)0.20 (C)Regulatory region variant 83
rs3615250.06 (A)0.08 (A)Regulatory region variant 84
TNF/TNKA rs18006290.15 (A)0.07 (A)Regulatory region variant32 and 54
TNKA rs18006300.13 (A)0.13 (A)Regulatory region variant 32
TRAF3IP2 rs131909320.07 (G)0.06 (G)Missense variant 85
rs339805000.08 (T)0.12 (T)Missense variant 62
rs131963770.06 (G)0.07 (G)Intron variant 86
VDR rs22285700.33 (A)0.48 (A)Start lost 32

MAF, maximum allele frequency; IBS, Iberians; AMR, Americans, UTR, untranslated region. Frequency data were obtained from Ensembl (87) and dbSNP (88).

Genes, single-nucleotide polymorphism (SNPs) identifiers, maximum allele frequencies in Iberians and Americans, and the impact of the reviewed and included variants. MAF, maximum allele frequency; IBS, Iberians; AMR, Americans, UTR, untranslated region. Frequency data were obtained from Ensembl (87) and dbSNP (88). To prevent bias, first, we proposed a very strict statistical analysis to avoid obtaining spurious results. Second, we proposed to correct for confounding factors in all the statistical tests performed and therefore decided to consider the CCI, which included known COVID-19 severity predictors such as age and obesity, and the sex as covariates. Third, we decided not to analyze some variables dependent on the pandemic situation at the time of recruitment. For example, we did not consider ICU admission as a valid measure of COVID-19 severity as this was restricted because of hospital collapse. In other words, some patients who reached sufficient severity to merit admission to an intensive care unit were not admitted because there was no bed available. Our findings on the predictors of COVID-19 severity are consistent with previous publications in the literature. The CCI was previously related to COVID-19 prognosis (89, 90), which is consistent with the correlation observed in this work between a higher score and higher WHOCS-1 and -2 scores. Furthermore, males get infected, require ICU admission, mechanical ventilation, and die more frequently than women (91), which is consistent with the protective effect observed here for the female sex regarding WHOCS-1 and more intensely with WHOCS-2. Additional studies are required to determine the underlying differences behind this sexual dimorphism. Moreover, dyslipidemia was previously related to severe COVID-19 prognosis (92), which is consistent with our findings, where the presence of dyslipidemia was related to a 0.579 and 0.551 higher WHOCS-1 or WHOCS-2 scores, respectively. Finally, TMPRSS2 rs75603675 C/C or C/A diplotypes, compared with the A/A diplotype, were related to 0.591 higher WHOCS-1 and 0.405 higher WHOCS-2 scores, respectively. The contribution of this polymorphism requires further discussion. The transmembrane serine protease 2, encoded by the TMPRSS2 gene, participates in several physiological and pathological situations, being up- and down-regulated by several hormonal processes. It is used by several viruses to enter host cells, including the Influenza virus and the human coronaviruses HCoV-229E, MERS-CoV, SARS-CoV, and SARS-CoV-2 (93, 94). Genetic polymorphism of this gene was described to affect disease severity. Particularly, the P.Val197Met (rs12329760) variant is defined as deleterious and previously reported to have a protective effect on the patients (95). Here, this variant had no effect on WHOCS-1 or -2 variability, whereas the P.Gly8Val (rs75603675) missense variant (C > A) was related to a significantly higher WHOCS-1 and WHOCS-2. In contrast, in one study, the prevalence of the TMPRSS2 rs75603675 A allele was similar between infected patients, compared with uninfected (96). Unfortunately, no information about infection severity is provided in the latter article. Therefore, to the best of our knowledge, this is the first work to propose that this variant has a significant impact on COVID-19 prognosis and severity. Probably, the reason behind this association is explained by the down-regulation of the protein in TMPRSS2 rs75603675 A allele carriers or the expression of a protein with a structural change that causes a less specific or impaired interaction with viral proteins, causing a less efficient internalization of viruses inside human cells. This is congruent with Latini et al observations: TPMRSS2 rs75603675 (C > A) A allele was in significantly lower frequency in populations that suffered more severe cases of COVID-19 (57). This suggested a possible protective effect toward COVID-19 infection. Other nominally significant associations were established between WHOCS scales and: IFNL4 rs12979860, ACEIs, obesity, ARA-II, HCP5 rs2395029, ACE rs1799752, DPP4 rs17574, HMOX1 rs2071746, IL10 rs1800896, IL6 rs1818879, NFKB1 rs28362491, and MX1 rs469390. Although some of these associations were previously observed, others were also persistently rejected. For instance, the use of renin-angiotensin-aldosterone system (RAAS) inhibitors was proposed to cause the up-regulation of the ACE2 receptor, the receptor for SARS-CoV-2 protein S, which enables cell infection; therefore, the use of this drugs was related to a higher risk for COVID-19 infection and worse prognosis (97, 98, 99, 100, 101, 102, 103). However, a bias was identified in this assumption as it were the conditions associated with the use of RAAS inhibitors (e.g., hypertension or heart disease) which actually led to a higher risk of worse prognosis (104). Consequently, we decided to use a sufficiently strict statistical approximation to control for this bias. Hence, we considered all associations mentioned above as negative, and not worth discussing, because this would contribute to confusion. One noteworthy disparity with our study is the GWAS study by the Severe COVID-19 GWAS Group (18). They report different genetic markers to ours. The explanation that we can elucidate to address this discrepancy is that our outcome variable is different. Our analysis focus in all COVID-19 severity stages, whereas they focus in the most severe cases. Thus, our findings may explain the susceptibility to infection in the early stages of the disease, whereas other factors such as 3p21.31 gene cluster and the ABO blood-group system may determine terminal status of COVID-19. Definitely, more research must be performed in other to clarify the true effect of the TMPRSS2 rs75603675 polymorphism in the outcome of this disease. With this work, we integrate multiple clinical factors and genetic factors to COVID-19 severity prediction. A variable that excellently captures the comorbidity of patients is used, the CCI index. The presence or absence of dyslipidemia complements the CCI index. Only with this information, for instance, a significantly higher WHOCS-1 and -2 scores can be predicted for a patient with dyslipidemia and a CCI of five versus a patient without dyslipidemia and CCI of 1. Although this was previously known, that is, patients with higher comorbidities are related to worse COVID-19 prognosis, our scale signifies a clinically relevant tool to better manage patients because it is clear and numerical. Furthermore, sex and TMPRSS2 rs75603675 stand as additional clinically relevant predictors of COVID-19 severity, which were included in the scales. Continuing with the previous example, if the first patient carried the C/C diplotype and was male and the second patient was female and carried the A/A diplotype, the predicted WHOCS-1 and WHOCS-2 scales would be: 3.637 and 4.153 for the first patient and 1.737 and 2.302 for the second. These predictions are related to specific clinical requirements (e.g., hospitalization or oxygen supplementation). This quantitative measurement could help in the optimization of clinical resources. Despite the merits of this work, it presents limitations that should be considered. First, patients were recruited from the first COVID-19 wave, which might be considered more than a limitation; it can be considered a strength. As this work was carried with patients infected during March–April 2020, the circulating strains were different from those circulating now. In this sense, the conclusions regarding TMPRSS2 rs75603675 should be confirmed in the currently circulating strains. Nonetheless, new strains, such as the omicron variant, could be more infectious or pathogenic, being the underlying mechanism of such pathogenicity an enhanced interaction between viral antigens and TMPRSS2. This interaction could be affected by genetic variants of this gene and cause an even greater difference of severity with new strains. Nevertheless, new studies shall confirm the relevance of TMPRSS2 rs75603675 in new circulating variants. Another limitation is that hospital protocols were severely affected by the emergency healthcare situation of the first COVID-19 wave, and adequacy of the therapeutic effort was needed. This, in combination with the retrospective nature of the study produces relative scarcity of severe and asymptomatic individuals. Consequently, the severity distribution might be skewed. In addition, the nature of the WHOCS-1 and WHOCS-2 analysis by means of the generalized linear model assumes a normal distribution of these variables. To avoid bias, we performed a strict statistical analysis, in exchange for assuming greater type II error. Moreover, the fact that the exact same variables related to severity 1 and 2 variables were identified is reassuring, along with the strict control for type-1 error.

Conclusions

The TMPRSS2 rs75603675 C/C or C/A genotypes compared with A/A, males compared with females, the presence of dyslipidemia, and a higher CCI score were associated with more severe COVID-19, at the first hospital visit and at the most severe point of disease progression. The integration of all these variables into the proposed equations could be a useful clinical tool for the rational management of patients with COVID-19. To our knowledge, this is the first work to propose a similar tool that integrates genetic data capable of predicting the prognosis of COVID-19.

Materials and Methods

Study design and participants

This study was designed with an observational and retrospective approach. A total of 817 patients with COVID-19, who attended the emergency department of the Hospital Universitario de La Princesa between 29 March and 29 April 2020, were recruited. Both inpatients and outpatients were considered. Patients were recruited consecutively according to their first visit to the emergency department. To avoid imbalances in the proportions of the severity groups because of the retrospective and consecutive nature of recruitment, checkpoints were applied to prioritize the selection of under-represented patients, that is, severe and mild patients. The first checkpoint was performed at the beginning of May 2021, with 617 patients recruited: 83 (13.5%) mild, 466 (75.5%) moderate, and 68 (11%) severe; and the second was performed at the end of May 2021, with 743 patients: 159 (21.5%) mild, 502 (67.5%) moderate, and 82 (11%) severe. The Ethics Research Committee of Hospital Universitario de La Princesa approved the study protocol. All subjects provided informed consent, except for the deceased. They were scheduled for sampling at the Department of Internal Medicine of Hospital Universitario de La Princesa; stored samples were retrieved from the deceased patients. Research compiled with Spanish and European legislation on biomedical research and with the revised Declaration of Helsinki.

Variables

Hospital and primary care medical records were used to retrieve disease and clinical information. The main outcome (dependent variable) was a modified version of the 7-point World Health Organization (WHO) COVID-19 severity scale (WHOCS) (105). Briefly, individuals are classified according to the following COVID-19 severity groups: (1) infected, asymptomatic; (2) symptomatic not requiring hospitalization; (3) COVID-19 requiring hospitalization without oxygen supplementation; (4) oxygen supplementation with mask or nasal prongs; (5) noninvasive ventilation or high flow oxygen; (6) intubation and mechanical ventilation in an intensive care unit (ICU); and (7) death. This scale was evaluated in the moment of the first hospital examination (WHOCS-1) and of the severest WHOCS score (WHOCS-2). For this work, WHOCS levels were grouped as follows: levels 1–2 were considered mild severity, levels 3–4 were considered moderate severity, and levels 5 or greater were considered severe COVID-19. The resulting variables were named “severity-1” and “severity-2,” that is, a transformed variable of the severity at admission and at the worst severity status. As independent variables or covariates, the following ones were analyzed: demographic characteristics (age and sex), comorbidity (obesity, dyslipidemia, tobacco or alcohol use, and the Charlson Comorbidity Index, CCI), biogeographical group (106) (inferred from the country of birth), relevant drugs used before COVID-19 infection, that is, angiotensin converter enzyme inhibitors (ACEIs) (e.g., enalapril, lisinopril), angiotensin receptor II antagonists (ARA-II) (e.g., losartan, irbesartan), aldosterone antagonists (e.g., eplerenone, spironolactone), oral anticoagulants (e.g., acenocoumarol, dabigatran), systemic corticosteroids (e.g., dexamethasone) or systemic immunosuppressants (e.g., tacrolimus or methotrexate), drugs for COVID-19 treatment (hydroxychloroquine or chloroquine, remdesivir, corticosteroids, tocilizumab, heparin, lopinavir/ritonavir, plasma transfusion), and SNPs.

Genotyping

Published articles evaluating SNPs and COVID-19 were addressed since January 2020 until December 2020. Moreover, articles evaluating polymorphism in important genes related to COVID-19 (e.g., ACE2, ACE, IL-6, or IFNs) or the coagulation cascade (e.g., F11, CRP) were screened however not necessarily published during the pandemic. Finally, variants related to safety and effectiveness of the drugs used for the treatment of the disease were included, that is, pharmacogenetic variants. Sample collection occurred in a period after discharge of the patients, during the months of January to May 2021. Patients who gave informed consent provided 5 ml of blood collected in an EDTA K2 tube. For deceased patients, samples were retrieved from stored collection. Genomic DNA was extracted from peripheral blood samples with a Maxwell RSC automated DNA extractor (Promega) following the manufacturer’s instructions. For genotype analysis, a QuantStudio 12K flex thermal cycler along with an OpenArray thermal block was used (Thermo Fisher Scientific). A customized genotyping array was designed with variants shown in Table 5. The references justifying the inclusion of these variants are included.

Statistical analysis

An online tool for sample size calculation was used (Sample Size Calculator available at https://riskcalc.org/samplesize/). The study was a retrospective observational cohort study. The α or type-1 error rate was set at 0.05, and the power, or 1-β, was set at 0.9. Exposure was defined as the presence or absence of one or several SNPs related to severe COVID-19. The event to be analyzed was severe COVID-19, defined as the event, that is, WHO COVID-19 severities 5, 6, and 7. Depending on the SNP prevalence, the k value (unexposed to exposed ratio) was set. For SNPs with low prevalence (e.g., 5%), k = 20. For SNPs with 20% prevalence, k = 4. For SNPs with 40% prevalence, k = 1.5. Assuming a probability of the event in the unexposed group (P0) of 0.4 (40%) and of 0.55 (55%) in the risk group (P1), the following sample sizes were required: 2,667 for k = 20 (127 exposed and 2,540 unexposed patients), 760 for k = 4 (127 exposed and 608 unexposed patients), and 510 for k = 1.5 (204 exposed and 306 unexposed). Therefore, a sample size between 510 and 2,667 was considered. Assuming greater differences (e.g., P0 = 0.3 and P = 0.7), significantly lower sample sizes were required. Finally, the sample size was determined according to the capacity of the genotyping platform, the available budget, and the latter estimations. A genotyping array containing 120 SNPs with a capacity for 920 samples was designed and purchased. Full coverage of the 920 samples genotyping capacity could not be achieved because of failures or the need for repetitions. Statistical analysis was conducted in R (107). To analyze severity 1 and 2 variables, a univariate analysis was initially performed by ordinal logistic regression with the MASS package (108). As independent variables, the following ones were explored: genetic variables (all SNPs were transformed into dummy variables, that is, grouping heterozygous subjects with the most frequent homozygous diplotype), biogeographic group, previous treatments and other comorbidities (e.g., dyslipidemia or tobacco use); CCI and sex were included as covariates, and the level of significance was adjusted based on the Bonferroni correction for multiple comparisons. Those variables with corrected P′ < 0.05 were introduced as independent variables in a multiple logistic regression analysis of severities 1 and 2, that is, the multivariate analysis. In this analysis, significance was similarly adjusted based on the Bonferroni correction for multiple comparisons. A secondary analysis of WHOCS-1 and WHOCS-2 was performed to provide a predictive equation of disease severity. For the univariate analysis, a generalized lineal model was performed by means of individual linear regression, with the CCI and sex as covariates and applying the Bonferroni correction for multiple comparisons. Those variables with P′ < 0.05 were introduced as independent variables in the multivariate analysis, in this case, by means of a generalized lineal model with CCI and sex as covariates again.

Availability of Data and Materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
  99 in total

1.  Systemic low-grade inflammation-associated lifestyle, diet, and genetic factors: A population-based cross-sectional study.

Authors:  Marina M Norde; Regina M Fisberg; Dirce M L Marchioni; Marcelo Macedo Rogero
Journal:  Nutrition       Date:  2019-09-19       Impact factor: 4.008

Review 2.  Structure, Function, and Evolution of Coronavirus Spike Proteins.

Authors:  Fang Li
Journal:  Annu Rev Virol       Date:  2016-08-25       Impact factor: 10.431

3.  The major genetic risk factor for severe COVID-19 is inherited from Neanderthals.

Authors:  Hugo Zeberg; Svante Pääbo
Journal:  Nature       Date:  2020-09-30       Impact factor: 49.962

4.  Reply to: Safety and efficacy of secukinumab treatment in a patient with ankylosing spondylitis and concomitant multiple sclerosis: a commentary.

Authors:  Federica Macaluso; Giuliana Guggino; Francesco Ciccia
Journal:  Clin Exp Rheumatol       Date:  2020-05-20       Impact factor: 4.862

5.  ACE2 and TMPRSS2 variants and expression as candidates to sex and country differences in COVID-19 severity in Italy.

Authors:  Rosanna Asselta; Elvezia Maria Paraboschi; Alberto Mantovani; Stefano Duga
Journal:  Aging (Albany NY)       Date:  2020-06-05       Impact factor: 5.682

Review 6.  Higher mortality of COVID-19 in males: sex differences in immune response and cardiovascular comorbidities.

Authors:  Laura A Bienvenu; Jonathan Noonan; Xiaowei Wang; Karlheinz Peter
Journal:  Cardiovasc Res       Date:  2020-12-01       Impact factor: 10.787

7.  Host Polymorphisms May Impact SARS-CoV-2 Infectivity.

Authors:  Patrick Brest; Sadal Refae; Baharia Mograbi; Paul Hofman; Gerard Milano
Journal:  Trends Genet       Date:  2020-08-10       Impact factor: 11.639

8.  COVID-19 and Genetic Variants of Protein Involved in the SARS-CoV-2 Entry into the Host Cells.

Authors:  Andrea Latini; Emanuele Agolini; Antonio Novelli; Paola Borgiani; Rosalinda Giannini; Paolo Gravina; Andrea Smarrazzo; Mario Dauri; Massimo Andreoni; Paola Rogliani; Sergio Bernardini; Manuela Helmer-Citterich; Michela Biancolella; Giuseppe Novelli
Journal:  Genes (Basel)       Date:  2020-08-27       Impact factor: 4.096

Review 9.  Important Pharmacogenetic Information for Drugs Prescribed During the SARS-CoV-2 Infection (COVID-19).

Authors:  Pablo Zubiaur; Dora Koller; Miriam Saiz-Rodríguez; Marcos Navares-Gómez; Francisco Abad-Santos
Journal:  Clin Transl Sci       Date:  2020-09-16       Impact factor: 4.689

View more
  1 in total

Review 1.  Human genetic basis of severe or critical illness in COVID-19.

Authors:  Xiao-Shan Ji; Bin Chen; Bi Ze; Wen-Hao Zhou
Journal:  Front Cell Infect Microbiol       Date:  2022-09-20       Impact factor: 6.073

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.