Literature DB >> 32835700

Variability in genes related to SARS-CoV-2 entry into host cells (ACE2, TMPRSS2, TMPRSS11A, ELANE, and CTSL) and its potential use in association studies.

Gilberto Vargas-Alarcón1, Rosalinda Posadas-Sánchez2, Julian Ramírez-Bello3.   

Abstract

BACKGROUND: The prevalence and mortality of the outbreak of the COVID-19 pandemic show marked geographic variation. The presence of several subtypes of the coronavirus and the genetic differences in the populations could condition that variation. Thus, the objective of this study was to propose variants in genes that encode proteins related to the SARS-CoV-2 entry into the host cells as possible targets for genetic associations studies.
METHODS: The allelic frequencies of the polymorphisms in the ACE2, TMPRSS2, TMPRSS11A, cathepsin L (CTSL), and elastase (ELANE) genes were obtained in four populations from the American, African, European, and Asian continents reported in the 1000 Genome Project. Moreover, we evaluated the potential biological effect of these variants using different web-based tools.
RESULTS: In the coding sequences of these genes, we detected one probably-damaging polymorphism located in the TMPRSS2 gene (rs12329760) that produces a change of amino acid. Furthermore, forty-eight polymorphisms with possible functional consequences were detected in the non-coding sequences of the following genes: three in ACE2, seventeen in TMPRSS2, ten in TMPRSS11A, twelve in ELANE, and six in CTSL. These polymorphisms produce binding sites for transcription factors and microRNAs. The minor allele frequencies of these polymorphisms vary in each community; indeed, some of them are high in specific populations.
CONCLUSION: In summary, using data of the 1000 Genome Project and web-based tools, we propose some polymorphisms, which, depending on the population, could be used for genetic association studies.
Copyright © 2020. Published by Elsevier Inc.

Entities:  

Keywords:  ACE2; COVID19; Cathepsin; Elastase; Polymorphisms; SARS-CoV2; TMPRSS11A; TMPRSS2

Mesh:

Substances:

Year:  2020        PMID: 32835700      PMCID: PMC7441892          DOI: 10.1016/j.lfs.2020.118313

Source DB:  PubMed          Journal:  Life Sci        ISSN: 0024-3205            Impact factor:   5.037


Introduction

The coronavirus disease 2019 (COVID-19) pandemic, as declared by the World Health Organization on 11 March 2020 [1], has accumulated 6057.853 confirmed cases globally until June 1 [2]. The etiologic agent of COVID-19 is a novel beta coronavirus [3], which was named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses [4]. Zhou et al. established that the SARS-CoV-2 is 96% identical at the whole-genome level to a bat SARS-like coronavirus and 79.5% identical to SARS-CoV [5]. Coronaviruses possess an enveloped, single, positive-stranded RNA genome that encodes for four membrane polypeptides, namely spike (S), envelope (E), membrane (M), and nucleocapsid (N) proteins [6]. The spike glycoprotein (S) present in the coronavirus envelope is used to bind and penetrate the host cells. The S protein is composed of two subunits: S1 and S2; the S1 subunit allows the virus to bind the host cell receptors, while S2 enables the fusion of viral and cellular membranes. The SARS-CoV-2 entry into target cells requires S protein priming by cellular proteases, which entails S protein cleavage at the S1/S2 and S2' sites [7]. Depending on virus strains and cell types, coronavirus (CoV) S proteins may be cleaved by one or several host proteases, including furin, cathepsins, transmembrane protease serine protease-2 (TMPRSS-2), neutrophil elastase (ELANE), and probably TMPRSS11A [[8], [9], [10], [11], [12], [13], [14], [15]]. The availability of these proteases on target cells largely determines whether CoV particles enter cells through plasma membrane or endocytosis. Hoffmann et al. demonstrated that SARS-CoV-2 uses the SARS-CoV receptor angiotensin-converting enzyme 2 (ACE2) for entry into target cells and the transmembrane protease 2 (TMPRSS2) for S protein priming [16]. In the same way, Ou et al. found that cathepsin L (CTSL) is critical for virus entry [17]. It has also been reported that the S protein of the A2a subtype has an additional elastase-specific proteolytic cleavage site that endows the virus with an increased ability to penetrate host cells [10]. This virus subtype was reported in China and spread rapidly in Europe and North America [18,19]. ACE2 is a carboxypeptidase that converts angiotensin II to angiotensin-(1-7) [Ang-(1-7)], which evokes anti-fibrosis, anti-hypertrophy, vasodilatation, and other beneficial effects [[20], [21], [22]]. Tissue-bound or membrane-bound ACE2 is a kind of transmembrane protein with a single metalloprotease active site and a transmembrane domain [23,24]. The ACE2 receptor is expressed at high levels not only in alveolar type-2 cells in the lung, but also in liver cholangiocytes, myocardial cells, esophagus keratinocytes, kidney proximal tubules, bladder urothelial cells, and gastrointestinal epithelial cells [25,26]. In the lung, ACE2 is abundantly expressed in Clara cells, type I and II alveolar epithelial cells, macrophages, endothelium, vascular smooth muscle cells, and bronchial epithelia [27]. ACE2 is encoded in the chromosome Xp22 and spans 39.98 kb of genomic DNA. This gene generates two transcripts that originate the same 805-amino-acid-residue protein; one transcript consists of 18 exons and 17 introns (transcript length: 3339 bps) and the other is composed of 19 exons and 18 introns (transcript length: 3507 bps). The ACE2 gene exhibits a high level of polymorphism; in fact, some single nucleotide polymorphisms (SNPs) have been associated with susceptibility to diseases, such as type 2 diabetes and hypertension [28,29]. The transmembrane serine protease TMPRSS2 is an essential enzyme that can cleave hemagglutinin of many subtypes of the influenza virus and the coronavirus S protein [30,31]. It has been reported that TMPRSS2 deficiency protects mice against H1N1, mouse-adapted H1N1, and H7N9 influenza A virus infections [30,32]. Recently, it has been shown that TMPRSS2 can help SARS-CoV-2 enter host cells by cleaving the S protein [16]. Matsuyama et al. demonstrated that TMPRSS2-expressing cell lines are highly susceptible to SARS-CoV, MERS-CoV, and SARS-CoV-2 [33]. The gene that encodes TMPRSS2 is polymorphic and is considered a susceptibility gene for H1N1 and H7N9 influenza [34]. Similarly, TMPRSS11A is another member of the subfamily of type II transmembrane serine proteases. This enzyme is synthesized as a zymogen and can be activated upon auto-proteolytic cleavage at a site located between the protease domain and the stem region [35]. Zmora et al. demonstrated that TMPRSS11A cleaves and activates the MERS-CoV spike protein and the influenza A virus hemagglutinin [36]. Elastase is known to be secreted by neutrophils as part of an inflammatory response to a viral infection and is also produced by opportunistic bacteria that can colonize virally infected respiratory tissue [37]. The increase of elastase activity as a result of an extreme inflammatory process produces an important pulmonary injury contributing significantly to the pathogenesis of chronic obstructive pulmonary disease, cystic fibrosis, acute respiratory distress syndrome, and pulmonary fibrosis [38,39]. Cathepsin L is a peptidase that preferentially cleaves peptide bonds with aromatic residues in the P2 position and hydrophobic residues in the P3 position [40]. It has been previously reported that cathepsin L participates in the viral glycoprotein processing of Ebola and SARS-CoV. It is well established that this viral process is important for cell membrane fusion and host cell entry [41]. Using inhibitors of cathepsin B and L in HEK 293/hACE2 cells, Ou et al. [17] demonstrated that the treatment with cathepsin L inhibitor decreases the entry of SARS-CoV-2 into the cells. This result suggests that cathepsin L could be very important for S protein priming in lysosome for viral entry. The outbreak of the COVID-19 pandemic shows marked geographic variation in its prevalence and mortality. This variability could be due to both the presence of several subtypes of the virus and the genetic differences in the human populations [18,19,[42], [43], [44]]. Considering this fact and the important role of the ACE2, TMPRSS2, TMPRSS11A, cathepsin L, and elastase in the process of virus entry into the host cell, the present study aims to propose possible variants in these loci for genetic association studies in patients with SARS-CoV-2 infection.

Methods

To identify the different single nucleotide variants (or insertions/deletions; INDELs) and their allelic frequencies, ACE2 (ID 59272), TMPRSS2 (ID 7113, protein sequence: NP_005647.3), TMPRSS11A (ID 339967, protein sequence; NP_001107859.1), ELANE (ID 1991, protein sequence: NP 001963.1)), and CTSL (ID 1514) SNPs were retrieved from dbSNPs (GRCh38), Ensembl Genome Browser, and 1000 Genome Project (phase 3). The number and origin of the subjects that comprise the four populations included in our analysis are as follows: (a) Mexican individuals (Americans) from Los Angeles: 50, 64, and 67 samples reported in dbSNPs, Ensembl Genome Browser, and 1000 Genome Project, respectively; (b) Han Chinese from Beijing (Asians): 43, 103, and 103 samples reported in dbSNPs, Ensembl Genome Browser, and 1000 Genome Project, respectively; (c) Yoruba from Ibadan, Nigeria (Africans): 113, 109, and 108 samples reported in dbSNPs, Ensembl Genome Browser, and 1000 Genome Project, respectively; (d) British (Europeans): 91 and 92 samples reported in Ensembl Genome Browser and 1000 Genome Project, respectively. All genes were identified by their ID and we took 2000 bp upstream of the transcription start site as the promoter region. The exons were divided into 5′ UTR and 3′ UTR regions, and the coding sequence. As for our in silico analysis, we used web-based tools (i) to identify the potential functional impact of the variants included in the tables, (ii) to test for linkage disequilibrium (LD), and (iii) to determine if these variants were tagSNPs among them. The SNPs function prediction of the SNPinfo Web Server (https://snpinfo.niehs.nih.gov/snpinfo/snpfunc.html) was used for the in silico analysis; this web-based tool identifies whether alleles create binding sites for microRNAs, for transcription factors or proteins that regulate splicing. In addition, SNPinfo predicts the effect of nonsynonymous and synonymous alleles on protein function. We conducted a prediction of deleterious SNPs (with a minor allele frequency greater than 1.5% in any of the four populations included in our study) using Polyphen-2 (http://genetics.bwh.harvard.edu/pph2/) and SIFT (https://sift.bii.a-star.edu.sg/). These web-based tools predict a possible impact of non-synonymous substitutions on the protein structure and function. Additionally, we used the ModPred server, which is a sequence-based predictor of potential post-transcriptional modification sites in proteins (http://montana.informatics.indiana.edu/ModPred/index.html). Of note, we used both sequence and structure-based prediction tools of proteins. After sequence alignment, we identified the potential deleterious effect of alleles of non-synonymous SNPs in TMPRSS2, TMPRSS11A, and ELANE. Linkage disequilibrium (LD) among SNPs included in our analysis is described at the bottom of each table. Several polymorphisms are TagSNPs because they are either in strong LD or have an r2 >0.95. Hence, it is enough to analyze a single TagSNP to capture other SNPs (because of the high LD between them), which are potentially associated with COVID-19. Genotype information of ACE2, TMPRSS2, TMPRSS11A, ELANE, and CTSL SNPs was downloaded from the Ensembl Genome Browser (https://www.ensembl.org/index.html). To obtain the LD between the SNPs of the 5 genes, we used the Haploview (V. 4.2) program (https://www.broadinstitute.org/haploview/downloads).

Results

Among all the genes analyzed, we included polymorphisms with frequencies greater than 1.5% in at least one of the four populations from the American, African, European, and Asian continents.

ACE2 polymorphisms

Table 1 shows thirteen ACE2 polymorphisms with a frequency greater than 1.5% in at least one of the populations reviewed. As can be seen, two of these polymorphisms (rs35803318 and rs4646179) were located in the coding sequence without a change of amino acid. On the other hand, three SNPs (located in the promoter or in the 5′ region near the gene) had a possible functional effect: rs7885856 (both alleles can create binding sites for AP2alpha, BCL6, CEBP, and ETS transcription factors), rs9698134 (the C allele can produce a binding site for HIC1 transcription factor), and rs9698150 (both alleles produced binding sites for BRCA, DBP, ETF, MYB, RFX, and WT1).
Table 1

ACE2 polymorphisms.

ACE2

MAF (%) in populations with different ancestry

Potential functional effect
Variant IDMinor alleleAMER (MXL)AFR (YRI)EUR (GBR)EAS (CHB)Amino acid position and changeYN
Coding sequence
rs35803318C/TT8.304.40Val749ValX
rs4646179A/GG012.200Asn690AsnX



Promoter and 5′ near the gene
rs113009615AAAAAA/AAAAAAA (INDEL)A2.117.70.70X
rs7885856G/AA2.1a7.300Both alleles can create binding sites for AP2ALPHA, BCL6, CEBP, and ETS.
rs112621533T/CC08.50.7c0X
rs11336754ATTT/ATT (INDEL)ATT4.214.60.72.5X
rs760084155G/AA18.7000X
rs765471058A/TT17.7000X
rs9698134C/TT2.1a17.7b0.7c0Allele C can create a binding site for HIC1
rs9698150G/CC2.1a17.7b0.7c0Both alleles can create binding sites for BRCA, DBP, ETF, MYB, RFX, and WT1
rs112593415A/GG2.1a17.7b0.7c0X
rs184697926A/CC00011.9X
rs142049267A/GG04.300X

ACE2; Angiotensin I Converting Enzyme 2, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, LD; Linkage disequilibrium. CCDS; consensus coding sequence.

ACE2 is located on chromosome Xp22.2. Five transcripts have been reported for ACE2, two of them synthesize the CCDS of 805 amino acids. The first transcript consists of 18 exons and 17 introns, 18 exons encode this protein, transcript length; 3339 bps. The second transcript consists of 19 exons and 18 introns, the CCDS consist of 18 exons, transcript length; 3507 bps.

Variants in high LD or tagSNPs between them in an American population.

Variants in high LD or tagSNPs between them in an African population.

Variants in high LD or tagSNPs between them in a European population.

ACE2 polymorphisms. ACE2; Angiotensin I Converting Enzyme 2, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, LD; Linkage disequilibrium. CCDS; consensus coding sequence. ACE2 is located on chromosome Xp22.2. Five transcripts have been reported for ACE2, two of them synthesize the CCDS of 805 amino acids. The first transcript consists of 18 exons and 17 introns, 18 exons encode this protein, transcript length; 3339 bps. The second transcript consists of 19 exons and 18 introns, the CCDS consist of 18 exons, transcript length; 3507 bps. Variants in high LD or tagSNPs between them in an American population. Variants in high LD or tagSNPs between them in an African population. Variants in high LD or tagSNPs between them in a European population.

TMPRSS2 polymorphisms

The TMPRSS2 polymorphisms are shown in Table 2 . In this case, thirty-nine polymorphisms had a frequency higher than 1.5% in at least one of the populations; four of them were located in the coding sequence and only one (rs12329760) produces an amino acid change (Val160Met). This change was probably damaging (PolyPhen-2, score 0.989, sensitivity 0.72, specificity 0.97). Indeed, using SIFT, we identified that this variant was deleterious (SIFT score 0.009). Our analysis, based in the ModPred server, showed that neither Val160 nor 160Met undergoes any possible post-translational modification (e.g., acetylation, proteolytic cleavage, glycosylation, phosphorylation). As for this gene, seventeen polymorphisms located in the promoter, in the 5′ region near the gene, and the 3′ UTR region had a possible functional effect. Ten polymorphisms located in the promoter and the 5′ region near the gene produced binding sites for several transcription factors, whereas seven located in the 3′ UTR region created potential binding sites for several microRNAs.
Table 2

TMPRSS2 polymorphisms.

TMPRSS2

MAF (%) in populations with different ancestry

Potential functional effect
Variant IDMinor alleleAMER (MXL)AFR (YRI)EUR (GBR)EAS (CHB)Amino acid position and changeYN
Coding sequence
rs61735794C/TT0.802.80Gly385GlyX
rs2298659G/AA31.216.221.429.6Gly290GlyX
rs17854725A/GG47.736.156.417.5Ile256IleX
rs61735789G/AA1.600.50Tyr180TyrX
rs12329760C/TT18.025.520.941.3Val160MetProbably Damaging (by PolyPhen-2)Deleterious (by SIFT)Without post-translational modification (by ModPred server)
rs3787950T/CC1.630.17.111.7Thr75ThrX
rs61735792G/AA001.10Pro63ProX



Promoter and 5′ near the gene
rs4303794A/CC28.1*41.2¤¤41.81±The C allele creates binding sites for AP2, and SP1, and WT1
rs11088551A/GG28.1*41.2¤¤41.81±The A allele creates inding sites for BRCA, MYB, NF1, and RFX
rs66492316GGCGCAGCGC/C (INDEL)C28.141.2¤¤41.81X
rs4303795A/GG28.1*41.241.21±The G allele creates binding sites for HNF4 and KID3
rs5844077G/GA (INDEL)G29.625.325.88.3X
rs76833541G/AA31.3010.40X
rs4283504G/TT16.437.012.623.3The T allele creates binding sites for DBP, HSF1, and NKX25
rs12481984T/CC27.3*39.4¤40.71±The C allele creates a binding site for HAND1E47
rs28707508G/AA25.8*38.4¤40.11±The A allele creates binding sites for HNF3, ALPHA, and TBP
rs552257429C/CT (INDEL)CT26.638.439.01.5X
rs12626358C/GG26.826.49.356.8The G allele creates a binding site for KAISO
rs8128074C/TT16.42.812.623.8The C allele creates binding sites for ETF, KROX, LRF, and SPZ1
rs56218846G/AA25.8*40.3¤40.11The A allele creates a binding site for PPAR_DR1
rs11281229T/TCCAGG (INDEL)TCCAGG25.840.340.10.9X
rs8127674A/GG25.8*40.3¤40.11The G allele creates binding sites for AP2ALPHA, ETF, and SPZ1



3′ near the gene
rs11088550G/AA12.5**09.30X
rs463727T/AA26.64.246.20.5X
rs462471G/AA36.7***34.3¤¤¤13.7‡‡53.4±±X
rs76000363G/AA12.5**5.612.1‡‡6.3**X



3′ UTR
rs143680939GA/G (INDEL)G12.55.612.16.8X
rs456142C/TT36.7***34.3¤¤¤13.7‡‡53.4±±The C allele creates a binding site for hsa-miR-548c-3p
rs112657409C/TT0.87.906.3±±±±X
rs2838038A/TT12.5**4.612.1‡‡6.3±±±The T allele can create a binding site for hsa-miR-943
rs462574G/AA24.214.8¤¤¤¤1.747.1±±The A allele can create a binding site for hsa-miR-1324
rs456298A/TT37.5***34.3¤¤¤13.7‡‡53.4±±The A allele can create a binding site for hsa-miR-450b-5p
rs17001042G/AA0.813.900The A alleles can create a binding site for hsa-miR-220b
rs11910678T/CC1.613.9¤¤¤¤06.3±±±±X
rs77675406G/AA12.5**4.612.1‡‡6.3±±±X
rs12627374C/TT00013.6The C allele can create a binding site for hsa-miR-345
rs62217525C/TT3.906.00The C allele can create a binding site for hsa-miR-1226
rs77996454G/AA4.60.800X
rs149695119TG/T (DELETION)T22.70.800X



Among MX1 and TMPRSS2
rs35074065AC/A (INDEL)A or delC26.64.643.40.5It has been reported that delC affects the TMPRSS2 and MX1 expressionRef 10

*, * *, * * * Variants in high LD or tagSNPs between them in an American population. ¤, ¤¤,¤¤¤, ¤¤¤¤ Variants in high LD or tagSNPs between them in an African population.

‡, ‡‡, Variants in high LD or tagSNPs between them in a European population. ±, ±±, ±±±, ±±±± Variants in high LD or tagSNPs between them in an Asian population.

TMPRSS2; Transmembrane protease, serine 2, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; Untranslated region, LD; Linkage disequilibrium, MX1; MX Dynamin Like GTPase 1. CCDS; Consensus coding sequence.

TMPRSS2 is located on chromosome 21q22.3.10 transcripts have been reported for TMPRSS2, six encode proteins, three of them are involved with CCDS. The first transcript consists of 14 exons and 13 introns, 13 exons encode this 492 amino-acid protein, transcript length; 3450 bps. The second transcript consists of 14 exons and 13 introns, 14 exons encode this 529 amino-acid protein, transcript length 3240 bp. The third transcript consists of 14 exons and 13 introns, 13 exons encode this 492 amino-acid protein, transcript length 1877 bps.

TMPRSS2 polymorphisms. *, * *, * * * Variants in high LD or tagSNPs between them in an American population. ¤, ¤¤,¤¤¤, ¤¤¤¤ Variants in high LD or tagSNPs between them in an African population. ‡, ‡‡, Variants in high LD or tagSNPs between them in a European population. ±, ±±, ±±±, ±±±± Variants in high LD or tagSNPs between them in an Asian population. TMPRSS2; Transmembrane protease, serine 2, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; Untranslated region, LD; Linkage disequilibrium, MX1; MX Dynamin Like GTPase 1. CCDS; Consensus coding sequence. TMPRSS2 is located on chromosome 21q22.3.10 transcripts have been reported for TMPRSS2, six encode proteins, three of them are involved with CCDS. The first transcript consists of 14 exons and 13 introns, 13 exons encode this 492 amino-acid protein, transcript length; 3450 bps. The second transcript consists of 14 exons and 13 introns, 14 exons encode this 529 amino-acid protein, transcript length 3240 bp. The third transcript consists of 14 exons and 13 introns, 13 exons encode this 492 amino-acid protein, transcript length 1877 bps.

TMPRSS11A polymorphisms

Out of twenty polymorphisms in the TMPRSS11A gene, six were in the coding sequence; three of these generated a nonsynonymous substitution (rs353163-Arg290Gln, rs139010197-Lys48Arg, rs977728-Met1Ile). According to PolyPhen-2 results, rs353163 was possibly benign (Polyphen-2 score 0.015, sensitivity 0.96, specificity 0.79) and tolerated (SIFT score 1). Using ModPred, we found that 290Gln was not affected; in contrast, variant Arg290 was predicted to undergo a translational modification: a proteolytic cleavage (score 0.71 and medium confidence). A similar result was observed with the 48Arg variant (rs139010197); this variant was predicted to be benign (PolyPhen-2 score 0.02, sensitive 0.95, specificity 0.8) and tolerated (SIFT score 0.53). Using ModPred, we identified that the 48Arg variant might undergo a proteolytic cleavage (score 0.57, low confidence). Alternatively, we did not identify any effect of Met1 or 1IIe (rs977728). Both five polymorphisms located in the promoter and the 5′ region near the gene and the two located in the 5′UTR region had a possible functional effect; these SNPs produced binding sites for several transcription factors. Finally, three out of five polymorphisms located in the 3′-UTR region created binding sites for some microRNAs (Table 3 ).
Table 3

TMPRSS11A polymorphisms.

TMPRSS11A

MAF (%) in populations with different ancestry

Potential functional effect
Variant IDMinor alleleAMER (MXL)AFR (YRI)EUR (GBR)EAS (CHB)Amino acid position and changeYN
Coding sequence
rs1371932A/GG46.132.449.531.1±Asp334AspX
rs353163C/TT46.113.440.115.5Arg290GlnBenign (by PolyPhen-2)Tolerated (by SIFT)Arg290 originates a proteolytic cleavage (by ModPred server)
rs1370840G/AA41.4*55.620.310.7±±Thr81ThrX
rs139010197T/CC0.804.40Lys48ArgBenign (by PolyPhen-2)Tolerated (by SIFT)48Arg originates a proteolytic cleavage (by ModPred server)
rs11930532T/CC41.4*78.720.310.7±±Val6ValX
rs977728C/TT39.1*10.620.99.7±±Met1IleBenign (by PolyPhen-2)Tolerated (by SIFT)Without post-translational modification (by ModPred server)



Promoter and 5′ near the gene
rs17088849T/CC15.613.422.053.4The C allele creates binding sites for BRCA and MYB
rs200058897TA/T (INDEL)T5.513.95.00X
rs536791104C/GG5.5**13.9¤5.0‡‡0X
rs6552135A/GG37.50.552.236.9The C allele creates binding sites for CEBPA, CEBPDELTA, and CEBP
rs17088850A/GG5.5**20.45.0‡‡0The C allele creates binding sites for BRCA and MYB
rs17088851T/CC5.5**11.6¤5.0‡‡0The C allele creates binding sites for AREB6, ETF, KID3, and SPZ1
rs720009T/AA1.624.100The C allele creates a binding site for GATA6



5′ UTR
rs6552134A/GG46.979.225.89.7±±The G allele creates a binding site for AP2ALPHA
rs11947613G/AA2.347.700The G allele creates a binding site for TBP



3′ UTR
rs4860265A/GG47.737.434.130.1±The G allele creates a binding site for hsa-miR-658
rs9998258T/CC2.3***1.86.6‡‡‡0The T allele creates binding sites for hsa-miR-1, hsa-miR-613, and hsa-miR-148b
rs33929303C/TT20.325.931.39.2X
rs28648375T/AA2.3***06.6‡‡‡0The T allele creates a binding site for hsa-miR-1244
rs12646286C/TT25.85.118.160.7X

*, * *, * * * Variants in high LD or tagSNPs between them in an American population. ¤, Variants in high LD or tagSNPs between them in an African population.

‡, ‡‡, ‡‡‡ Variants in high LD or tagSNPs between them in a European population. ±, ±± Variants in high LD or tagSNPs between them in an Asian population.

TMPRSS11A; Transmembrane Serine Protease 11A, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; untranslated region, LD; Linkage disequilibrium. CCDS; consensus coding sequence.

TMPRSS11A is located on chromosome 4q13.2. Three transcripts have been reported for TMPRSS11A, two produce CCDS. The first transcript consists of 10 exons and 9 introns, 10 exons encode this 421 amino-acid protein, transcript length; 3054 bps. The second transcript consists of 10 exons and 9 introns, 10 exons encode this 418 amino-acid protein, transcript length; 3247.

TMPRSS11A polymorphisms. *, * *, * * * Variants in high LD or tagSNPs between them in an American population. ¤, Variants in high LD or tagSNPs between them in an African population. ‡, ‡‡, ‡‡‡ Variants in high LD or tagSNPs between them in a European population. ±, ±± Variants in high LD or tagSNPs between them in an Asian population. TMPRSS11A; Transmembrane Serine Protease 11A, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; untranslated region, LD; Linkage disequilibrium. CCDS; consensus coding sequence. TMPRSS11A is located on chromosome 4q13.2. Three transcripts have been reported for TMPRSS11A, two produce CCDS. The first transcript consists of 10 exons and 9 introns, 10 exons encode this 421 amino-acid protein, transcript length; 3054 bps. The second transcript consists of 10 exons and 9 introns, 10 exons encode this 418 amino-acid protein, transcript length; 3247.

ELANE polymorphism

The polymorphisms of the ELANE gene are shown in Table 4 . As can be seen, two of them (rs17223045 and rs17216663) were located in the coding region. In fact, according to the bioinformatic analysis, rs17216663 provoked a change of amino acid (Pro257Leu), which is benign (PolyPhen-2 score 0.01, sensitivity 0.96, specificity 0.77) and tolerated (SIFT score 0.197). Using ModPred, we found that Pro257 undergoes hydroxylation (score 0.80, medium confidence), while 257Leu is not predicted to be post-translationally modified. In this gene, 12 polymorphisms had a possible functional effect. These polymorphisms were located in several regions of the gene and spawned binding sites for some transcription factors.
Table 4

ELANE polymorphisms.

ELANE

MAF (%) in populations with different ancestry

Potential functional effect
Variant IDMinor alleleAMER (MXL)AFR (YRI)EUR (GBR)EAS (CHB)Amino acid position and changeYN
Coding sequence
rs17223045C/TT0.811.61.10Asn130AsnX
rs17216663C/TT1.600.60Pro257LeuBenign (by PolyPhen-2)Tolerated (by SIFT)Pro257 undergoes a hydroxylation (by ModPred server)



Promoter and 5′ near the gene
rs74876755C/TT05.600X
rs10413889G/AA4.718.112.60.5The A allele creates a binding site for SPZ1
rs3761007G/AA4.707.725.7The G allele creates a binding site for DR4
rs3761006G/AA5.500.518.0The A allele creates binding sites for OCT and P53
rs10409474C/GG10.228.212.628.6The G allele creates a binding site for YY1
rs3761005T/AA44.568.531.359.7The A allele creates binding sites for CEBPDELTA and YY1
rs351107T/GG0.89.31.70The G allele creates a binding site for DBP
rs3761001G/AA14.856.025.3*29.6The G allele creates binding sites for USF and LRF
rs2007647G/AA7.09.724.2*1.0The A allele creates binding sites for ETS, HMGIY, NFAT, and OCT1
rs17216593C/TT0.87.40.60The T allele creates binding sites for PAX8 and SREBP
rs740021C/AA7.027.31.128.2The A allele creates a binding site for CEBPGAMMA



3′ near the gene
rs187713106T/AA0.811.13.30X
rs113311784T/TA (INDEL)TA12.56.516.538.8X
rs6510983C/TT15.638.025.81.9The A allele creates a binding site for CEBPA
rs17223066G/AA54.723.144.533.5The G allele creates a binding site for CREB

*, ‡, Variants in high LD or tagSNPs between them in a European, and Asian population, respectively.

ELANE; Elastase, neutrophil expressed, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, LD; Linkage disequilibrium. CCDS; consensus coding sequence.

ELANE is located on chromosome 19p13.3. Two transcripts have been reported for this gene, which produce CCDS. The first transcript consists of 5 exons and 4 introns, 5 exons encode this 267 amino-acid protein, transcript length; 909 bps. The second transcript consists of 6 exons and 5 introns, 5 exons encode this 267 amino-acid protein, transcript length; 1028.

ELANE polymorphisms. *, ‡, Variants in high LD or tagSNPs between them in a European, and Asian population, respectively. ELANE; Elastase, neutrophil expressed, MAF; Minor allele frequency, AMER; Americans, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, LD; Linkage disequilibrium. CCDS; consensus coding sequence. ELANE is located on chromosome 19p13.3. Two transcripts have been reported for this gene, which produce CCDS. The first transcript consists of 5 exons and 4 introns, 5 exons encode this 267 amino-acid protein, transcript length; 909 bps. The second transcript consists of 6 exons and 5 introns, 5 exons encode this 267 amino-acid protein, transcript length; 1028.

CTSL polymorphisms

In this gene, one polymorphism (rs11541204) was in the coding region, without a change of amino acid. Four out of nine polymorphisms located in the promoter region and the 5′ region near the gene, presented a possible functional effect: a binding site for some transcription factors. Both the polymorphism located in the 5′ UTR region and the one located in the 3′ region near the gene generated binding sites for transcriptional factors (Table 5 ).
Table 5

Cathepsin L polymorphisms.

CTSL (Cathepsin L)

MAF (%) in populations with different ancestry

Potential functional effect
Variant IDMinor alleleAMER (MXL)AFR (YRI)EUR (GBR)EAS (CHB)Amino acid position and changeYN
Coding sequence
rs11541204G/AA005.00Gln134GlnX



Promoter and 5′ near the gene
rs78985072G/AA4.7*0015.5±X
rs142421833C/TT4.7*0015.5±X
rs3128509G/AA45.313.041.83.4Both alleles can create binding sites for BRCA, GATA4, MYB, and RFX
rs111786311T/GG1.616.22.80X
rs11389221C/CAAA (INDEL)CAAA43.815.351.776.7X
rs56952354A/TT2.34.600Both alleles can create binding sites for GATA, GFI1, and TEL2
rs75567776G/CC2.37.900X
rs3118869C/AA39.846.847.332.5The C allele creates binding sites for SREBP, AHR, and AHRHIF
rs41307457C/AA3.123.62.80Both alleles can create binding sites for BRCA, DBP, LRF, MYB, and STAT4



5′ UTR
rs41312184C/TT1.60.511.00Both alleles can create binding sites for STAT, and RFX. The C allele can create a binding site for SF2ASF1



3′ near the gene
rs59063901G/AA0.83.72.80Both alleles can create binding sites for STAT, SPZ1, and GABP

*, ‡, ± Variants in high LD or are tagSNPs between them in an American, European, and Asian population, respectively.

MAF; Minor allele frequency, AMER; American, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; untranslated region, LD; Linkage disequilibrium. CCDS; consensus coding sequence.

CTSL is located on chromosome 9q21.33. Six transcripts have been reported for CTSL, three produce CCDS, and two of them synthesize the 333 amino acid protein. The first transcript consists of 8 exons and 7 introns, 7 exons encode this protein, transcript length; 1436 bps. The second transcript consists of 8 exons and 7 introns, 7 exons encode this protein, transcript length; 1654 pb.

Cathepsin L polymorphisms. *, ‡, ± Variants in high LD or are tagSNPs between them in an American, European, and Asian population, respectively. MAF; Minor allele frequency, AMER; American, AFR; Africans, EUR; Europeans, EAS; East Asia, MXL; Mexicans from Los Angeles, YRI; Yoruba in Ibadan, Nigeria, CHB; Han Chinese in Beijing, China, GBR; British in England and Scotland, Y; Yes, N; No, INDEL; Insertion/Deletion, UTR; untranslated region, LD; Linkage disequilibrium. CCDS; consensus coding sequence. CTSL is located on chromosome 9q21.33. Six transcripts have been reported for CTSL, three produce CCDS, and two of them synthesize the 333 amino acid protein. The first transcript consists of 8 exons and 7 introns, 7 exons encode this protein, transcript length; 1436 bps. The second transcript consists of 8 exons and 7 introns, 7 exons encode this protein, transcript length; 1654 pb.

LD between SNPs of ACE2, TMPRSS2, TMPRSS11A, ELANE, and CTSL

We conducted an LD analysis between the SNPs in ACE2 (Fig. 1 ), TMPRSS2 (Fig. 2 ), TMPRSS11A (Fig. 3 ), ELANE (Fig. 4 ), and CTSL (Fig. 5 ) proposed here. Thus, our analysis of LD included SNPs in these 5 genes but not INDELs. We observed several non-informative SNPs (minor allele frequency = 0%) in the 4 populations included in our analysis. For example, for TMPRSS2, 2, 7, 5, and 8 SNPs were eliminated in the American, African, European, and Asian populations included in our study, respectively. This reflects the heterogeneity between the populations. Similar results can be observed for ACE2, TMPRSS11A, ELANE, and CTSL.
Fig. 1

Linkage disequilibrium (r2) in the ACE2 gene in the included populations. Of the 13 variants shown in Table 1, two were INDELs and in two of them no information was found, so they were not added to the Haploview program. Of the remaining 9, some were not polymorphic in the different populations. Linkage disequilibrium (LD) between variants is shown in the figures, 5 in Americans (Fig. 1A), 7 in Africans (Fig. 1B), 5 in Europeans (Fig. 1C). In Asians, none of the variants were in LD.

Fig. 2

Linkage disequilibrium (r2) in the TMPRSS2 gene in the included populations. Of the 39 variants shown in Table 2, five were INDELs and one a deletion, so they were not added to the Haploview program. Of the remaining 33, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 31 in Americans (Fig. 2A), 26 in Africans (Fig. 2B), 28 in Europeans (Fig. 2C), and 24 in Asians (Fig. 2D).

Fig. 3

Linkage disequilibrium (r2) in the TMPRSS11A gene in the included populations. Of the 20 variants shown in Table 3, one was INDEL and was not added to the Haploview program. Of the remaining 19, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 19 in Americans (Fig. 3A), 17 in Africans (Fig. 3B), 17 in Europeans (Fig. 3C), and 11 in Asians (Fig. 3D).

Fig. 4

Linkage disequilibrium (r2) in the ELANE gene in the included populations. Of the 17 variants shown in Table 4, one was INDEL and was not added to the Haploview program. Of the remaining 16, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 15 in Americans (Fig. 4A), 13 in Africans (Fig. 4B), 15 in Europeans (Fig. 4C), and 10 in Asians (Fig. 4D).

Fig. 5

Linkage disequilibrium (r2) in the CTSL gene in the included populations. Of the 12 variants shown in Table 5, one was INDEL and was not added to the Haploview program. Of the remaining 11, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 10 in Americans (Fig. 5A), 8 in Africans (Fig. 5B), 7 in Europeans (Fig. 5C), and 4 in Asians (Fig. 5D).

Linkage disequilibrium (r2) in the ACE2 gene in the included populations. Of the 13 variants shown in Table 1, two were INDELs and in two of them no information was found, so they were not added to the Haploview program. Of the remaining 9, some were not polymorphic in the different populations. Linkage disequilibrium (LD) between variants is shown in the figures, 5 in Americans (Fig. 1A), 7 in Africans (Fig. 1B), 5 in Europeans (Fig. 1C). In Asians, none of the variants were in LD. Linkage disequilibrium (r2) in the TMPRSS2 gene in the included populations. Of the 39 variants shown in Table 2, five were INDELs and one a deletion, so they were not added to the Haploview program. Of the remaining 33, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 31 in Americans (Fig. 2A), 26 in Africans (Fig. 2B), 28 in Europeans (Fig. 2C), and 24 in Asians (Fig. 2D). Linkage disequilibrium (r2) in the TMPRSS11A gene in the included populations. Of the 20 variants shown in Table 3, one was INDEL and was not added to the Haploview program. Of the remaining 19, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 19 in Americans (Fig. 3A), 17 in Africans (Fig. 3B), 17 in Europeans (Fig. 3C), and 11 in Asians (Fig. 3D). Linkage disequilibrium (r2) in the ELANE gene in the included populations. Of the 17 variants shown in Table 4, one was INDEL and was not added to the Haploview program. Of the remaining 16, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 15 in Americans (Fig. 4A), 13 in Africans (Fig. 4B), 15 in Europeans (Fig. 4C), and 10 in Asians (Fig. 4D). Linkage disequilibrium (r2) in the CTSL gene in the included populations. Of the 12 variants shown in Table 5, one was INDEL and was not added to the Haploview program. Of the remaining 11, some were not polymorphic in the different populations. Linkage disequilibrium between variants is shown in the figures, 10 in Americans (Fig. 5A), 8 in Africans (Fig. 5B), 7 in Europeans (Fig. 5C), and 4 in Asians (Fig. 5D).

Discussion

Using the information about allelic frequencies obtained from dbSNPs, Ensembl Genome Browser, and the 1000 Genome Project, as well as different wed-based tools, we defined some polymorphic variants in the ACE2, TMPRSS2, TMPRSS11A, ELANE, and CTSL genes that could be important for association studies in the SARS-CoV-2 infection. SARS-Cov-2 enters the cell by binding its S protein with cellular receptors (e.g., ACE2 membrane-bound protein) [16]. Some proteases, such as TMPRSS2, cathepsin L, neutrophil elastase, and probably TMPRSS11A participate in this process [[8], [9], [10], [11], [12], [13], [14], [15]]; in fact, polymorphisms in their encoding genes could not only have an impact in the expression and/or structure of these proteases but also be associated with SARS-CoV-2 infection susceptibility. Even though most of the ACE2 variants occur at low frequencies in human populations, we detected three polymorphisms with a possible functional effect: binding site generation for some transcription factors. AP2alpha, BCL6, CEBP, ETS (rs7885856), HIC1 (rs9698134), BRCA, DBP, ETF, MYB, RFX, and WT1 (rs9698150) are some of these factors, which could have a role in the virus infection. It has been reported that BCL6 modulates tissue neutrophil survival and exacerbates pulmonary inflammation following influenza virus infection [45]. Han et al. [46] demonstrated that the CEBP alpha participates in the activation of hfg12 prothrombinase during SARS-CoV infection, thus having an important role in the development of thrombosis in SARS. The three ACE2 polymorphisms with possible functional effects have a high frequency of its minor allele only in the African population. Thus, these polymorphisms could be genetic targets for association studies in this population. Two recent studies have analyzed the association of ACE2 polymorphisms with susceptibility to SARS-CoV-2 infection [42,43]; however, the evidence stating that low-frequency variants can participate in SARS-CoV-2 infection is not convincing. In the same way, Cao et al. [47] systematically investigated the candidate functional-coding variants in ACE2 and the allele frequency differences between several populations. The results of this analysis suggested that there are no variants in the ACE2 gene resistant to coronavirus S-protein binding in the study populations. It was recently suggested that a renin-angiotensin system (RAS) imbalance impacts all stages of SARS-CoV-2 infection and clinical findings thereof, placing RAS molecules at the center of COVID-19 pathophysiology. The imbalance between the ACE/Ang II/AT1R and ACE2/Ang-(1-7)/MasR axes results in multiple organ dysfunction and uncontrolled inflammatory response [48]. The insertion/deletion (I/D) polymorphism of the ACE1 gene is associated with plasma and tissue levels of ACE. In this context, Delanghe et al. [49] analyzed not only the prevalence and mortality data (per 1,000,000 inhabitants) of the COVID-19 infection of several countries but also the frequency of several polymorphisms in genes of some human plasma proteins, including the ACE I/D polymorphism. The results of this study suggest that the prevalence of COVID-19 is significantly correlated with the ACE1 polymorphism. Contrary to the ACE2 gene, the polymorphisms in the TMPRSS2 gene had a considerable variation in its frequencies between human populations. In this gene, we detected one polymorphism (rs12329760) located in the coding sequence that created a nonsynonymous substitution (Val160Met). Our in silico analysis using ModPred did not show a possible effect of the TMPRSS2 rs12329760 polymorphism on any post-translational modification (e.g., proteolytic cleavage, acetylation, glycosylation, phosphorylation, and sulfation). However, this variant was predicted to be damaging by PolyPhen-2 and deleterious by SIFT. It has been recently reported that the TMPRSS2 Val160Met variant decreases the stability of the protein, which might impede viral entry [50]. In a previous in silico analysis of the TMPRSS2 gene, it was found that this polymorphism creates a de novo pocket protein [44]. The frequency of the minor allele of this polymorphism was high in the four study populations. Seventeen TMPRSS2 polymorphisms (located in the promoter, in the 5′ region near the gene, and the 3′ UTR region) generated a possible functional effect: the binding of different transcription factors and microRNAs. Two of them had a high frequency of its minor allele in the four populations (rs4283504 and rs12626358) and created binding sites for the DBP, HSF1, NKX25, and KAISO factors. It has been reported that heat shock factor 1 (HSF1) is an innate repressor of HIV-induced inflammation [51]. The frequency of the minor allele of 7 of these polymorphisms was high in populations from the American, African, and European continents. However, in the Asian population, only 3 (rs4283504, rs12626358, and rs8128074) out of the ten polymorphisms were observed with a minor allele frequency higher than 10%. In the same TMPRSS2 gene, we detected 7 polymorphisms with a functional effect: all of them producing binding sites for microRNAs and two of them (rs456142 and rs456298) with high frequencies of its minor allele in the four study populations. The rs12627374 polymorphism produces a binding site for the microRNA-345. This polymorphism only was present in the Han Chinese population. Using computational analysis, we observed that it can affect a wide spectrum of microRNAs profile [44]. Although we identified that the three non-synonymous variants of TMPRSS11A are benign (according to PolyPhen-2) or tolerated (using SIFT), two of them (rs353163-Arg290Gln and rs13901019-Lys48Arg) possibly undergo a post-translational modification: proteolytic cleavage (according to ModPred server). Moreover, since this variant is located in the catalytic domain, it has been suggested that its activity is reduced because of the impact on the protein three-dimensional structure [52]. On the other hand, the rs353163 (Arg290Gln) polymorphism has been associated with the risk of esophageal squamous cell carcinoma (52). Therefore, it is possible that these variants could affect viral entry; however, future functional studies should be carried out to establish its role on SARS-CoV-2 susceptibility. The minor allele of the rs353163 polymorphism was present in a high frequency in the four study populations. Another 10 polymorphisms in this gene evoke a possible functional effect. Five located in the promoter and the 5′ region near the gene and two in the 5′ UTR region produced binding sites for several transcription factors. On the other hand, three polymorphisms located in the 3′ UTR region created microRNAs binding sites. Out of these polymorphisms, the minor alleles of only 3 (rs17088849, rs6552134, and rs4860265) were present in high frequencies in the four populations. The minor alleles of three (rs17088850, rs17088851, rs720009) out of ten TMPRSS11A gene polymorphisms with a possible functional effect were seen in a high frequency only in the African population. It could be interesting to study the association of these polymorphisms with SARS-CoV-2 infection in African populations to define if they are related to the low infection rate in the continent. In the ELANE gene that encodes the neutrophil elastase, 12 polymorphisms with possible functional effects were detected: ten in the promoter and the 5′ region near the gene, and two in the 3′ region near the gene. These twelve polymorphisms produced binding sites for several transcription factors and microRNAs. The minor allele of four of these polymorphisms (rs10409474, rs3761005, rs3761001, rs17223066) was present in high frequency in the four populations. The minor allele of two polymorphisms (rs3761007 and rs3761006) had a high frequency only in the Han Chinese population. In a like manner, the minor allele frequency of rs2007647 was high only in the European (British) population. As for the SARS-CoV-2 infection, these polymorphisms could be relevant in the Asian and European populations. The nonsynonymous ELANE Pro257Leu (rs17216663) variant is predicted to be benign (PolyPhen-2) and tolerated (SIFT). Nevertheless, we found with the ModPred server that Pro257 may undergo hydrolyzation, which could affect the function of the ELANE protein. Previously, one study reported that Pro257Leu (located in the ELANE carboxyl terminus) is a risk factor for severe congenital neutropenia; however, the biological significance of this variant remains uncertain [53]. Therefore, future functional studies are essential to determine its effect. In the CTSL gene, six polymorphisms with possible functional effects were detected. These polymorphisms were located in several regions of the gene and created binding sites for transcription factors. The minor allele of one of these polymorphisms (rs41307457) showed a high frequency only in the African population. Similarly, the minor allele of rs41312184 was present in high frequency only in the European population. The association of these polymorphisms with the SARS-CoV-2 infection should be analyzed in these populations. It is important to note the high heterogeneity in the different populations, which is evident in the linkage disequilibrium analysis that we carried out. Different linkage disequilibrium patterns were observed for each gene in each population. The above requires an adequate selection of the SNPs to be studied in each of the populations. Of note, we did not evaluate potentially important variants located in the gene introns. Admittedly, some of these variants could have a role in producing different mRNAs and protein isoforms on these 5 genes. Even though the synonymous variants (substitutions that do not lead to an amino acid change) seem to have no functional effect on proteins, some authors have published an effect on the structure and function of them [54,55]. In our study, we included only information from the dbSNPs, Ensembl Genome Browser, and 1000 Genome Project databases. Discrete sequence databases of individuals infected with SARS-CoV2 were not analyzed. The phenotypic classification was not linked with the allelic patterns. In summary, using web-based tools, we identified herein some polymorphisms in the genes that encode proteins related to the SARS-CoV-2 entry into the host cells that could be used for genetic association studies.

Declaration of competing interest

The authors declare no competing interests.
  18 in total

1.  Common variants at 21q22.3 locus influence MX1 and TMPRSS2 gene expression and susceptibility to severe COVID-19.

Authors:  Immacolata Andolfo; Roberta Russo; Vito Alessandro Lasorsa; Sueva Cantalupo; Barbara Eleni Rosato; Ferdinando Bonfiglio; Giulia Frisso; Pasquale Abete; Gian Marco Cassese; Giuseppe Servillo; Gabriella Esposito; Ivan Gentile; Carmelo Piscopo; Romolo Villani; Giuseppe Fiorentino; Pellegrino Cerino; Carlo Buonerba; Biancamaria Pierri; Massimo Zollo; Achille Iolascon; Mario Capasso
Journal:  iScience       Date:  2021-03-17

2.  ACE2 and FURIN variants are potential predictors of SARS-CoV-2 outcome: A time to implement precision medicine against COVID-19.

Authors:  Fahd Al-Mulla; Anwar Mohammad; Ashraf Al Madhoun; Dania Haddad; Hamad Ali; Muthukrishnan Eaaswarkhanth; Sumi Elsa John; Rasheeba Nizam; Arshad Channanath; Mohamed Abu-Farha; Rasheed Ahmad; Jehad Abubaker; Thangavel Alphonse Thanaraj
Journal:  Heliyon       Date:  2021-01-28

Review 3.  Oral Symptoms Associated with COVID-19 and Their Pathogenic Mechanisms: A Literature Review.

Authors:  Hironori Tsuchiya
Journal:  Dent J (Basel)       Date:  2021-03-11

Review 4.  Genetic and epigenetic factors associated with increased severity of Covid-19.

Authors:  Zafer Yildirim; Oyku Semahat Sahin; Seyhan Yazar; Vildan Bozok Cetintas
Journal:  Cell Biol Int       Date:  2021-03-01       Impact factor: 4.473

5.  Transmembrane serine protease 2 Polymorphisms and Susceptibility to Severe Acute Respiratory Syndrome Coronavirus Type 2 Infection: A German Case-Control Study.

Authors:  Kristina Schönfelder; Katharina Breuckmann; Carina Elsner; Ulf Dittmer; David Fistera; Frank Herbstreit; Joachim Risse; Karsten Schmidt; Sivagurunathan Sutharsan; Christian Taube; Karl-Heinz Jöckel; Winfried Siffert; Andreas Kribben; Birte Möhlendick
Journal:  Front Genet       Date:  2021-04-21       Impact factor: 4.599

Review 6.  Genetics Insight for COVID-19 Susceptibility and Severity: A Review.

Authors:  Ingrid Fricke-Galindo; Ramcés Falfán-Valencia
Journal:  Front Immunol       Date:  2021-04-01       Impact factor: 7.561

Review 7.  Molecular Modeling Targeting Transmembrane Serine Protease 2 (TMPRSS2) as an Alternative Drug Target Against Coronaviruses.

Authors:  Igor José Dos Santos Nascimento; Edeildo Ferreira da Silva-Júnior; Thiago Mendonça de Aquino
Journal:  Curr Drug Targets       Date:  2022       Impact factor: 2.937

8.  Expression and co-expression analyses of TMPRSS2, a key element in COVID-19.

Authors:  Francesco Piva; Berina Sabanovic; Monia Cecati; Matteo Giulietti
Journal:  Eur J Clin Microbiol Infect Dis       Date:  2020-11-27       Impact factor: 3.267

9.  The Pursuit of COVID-19 Biomarkers: Putting the Spotlight on ACE2 and TMPRSS2 Regulatory Sequences.

Authors:  Ayelet Barash; Yossy Machluf; Ilana Ariel; Yaron Dekel
Journal:  Front Med (Lausanne)       Date:  2020-10-30

Review 10.  Human genetic factors associated with susceptibility to SARS-CoV-2 infection and COVID-19 disease severity.

Authors:  Cleo Anastassopoulou; Zoi Gkizarioti; George P Patrinos; Athanasios Tsakris
Journal:  Hum Genomics       Date:  2020-10-22       Impact factor: 4.639

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.