Literature DB >> 35122525

Evolutionary history of type II transmembrane serine proteases involved in viral priming.

Diego Forni1, Manuela Sironi1, Rachele Cagliani2.   

Abstract

Type II transmembrane serine proteases (TTSPs) are a family of trypsin-like membrane-anchored serine proteases that play key roles in the regulation of some crucial processes in physiological conditions, including cardiac function, digestion, cellular iron homeostasis, epidermal differentiation, and immune responses. However, some of them, in particular TTSPs expressed in the human airways, were identified as host factors that promote the proteolytic activation and spread of respiratory viruses such as influenza virus, human metapneumovirus, and coronaviruses, including SARS-CoV-2. Given their involvement in viral priming, we hypothesized that members of the TTSP family may represent targets of positive selection, possibly as the result of virus-driven pressure. Thus, we investigated the evolutionary history of sixteen TTSP genes in mammals. Evolutionary analyses indicate that most of the TTSP genes that have a verified role in viral proteolytic activation present signals of pervasive positive selection, suggesting that viral infections represent a selective pressure driving the evolution of these proteases. We also evaluated genetic diversity in human populations and we identified targets of balancing selection in TMPRSS2 and TMPRSS4. This scenario may be the result of an ancestral and still ongoing host-pathogen arms race. Overall, our results provide evolutionary information about candidate functional sites and polymorphic positions in TTSP genes.
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35122525      PMCID: PMC8817155          DOI: 10.1007/s00439-022-02435-y

Source DB:  PubMed          Journal:  Hum Genet        ISSN: 0340-6717            Impact factor:   5.881


Introduction

In order to initiate a successful infection, viruses must deliver their genome to the host cell cytosol. To achieve this, enveloped viruses trigger a sequence of events that lead to the fusion of their membrane with that of the target host cell, either by direct fusion with the plasma membrane or by an internalization process into endosomes (Barrett and Dutch 2020; Harrison 2008). Usually, this process is mainly mediated by the interaction of specific viral attachment proteins expressed on the surface of the virion (envelope glycoproteins) with viral receptors. These latter are cell surface molecules that perform specific host cellular functions but are hijacked by viruses to assist infection. Multiple types of molecules, including proteins, carbohydrates, and lipids, can be used as viral receptors. However, proteins are considered to be more suitable receptors because of their stronger affinity and higher specificity for viral attachment, resulting in an increase in the efficiency of viral entry and in an expansion of the viral host range and tropism (Zhang et al. 2019a, b; Maginnis 2018). Viral receptors favor the recognition and attachment of the virus to the host cell, but the activation of the membrane fusion machinery often requires the involvement of additional host factors. In fact, most viral fusion proteins are synthesized as a precursor and require processing by host cell proteases to trigger the fusion of the viral lipid envelope with the cellular membrane. Several cellular proteases, including proprotein convertases (i.e., furin), trypsin, cathepsins, and type II transmembrane serine proteases (TTSPs), have been described to participate in the proteolytic activation of viral proteins. TTSPs are a family of trypsin-like membrane-anchored serine proteases that play a key role in the regulation of some crucial processes, including cardiac function, digestion, cellular iron homeostasis, epidermal differentiation, and immune responses (Bugge et al. 2009). Consequently, deregulation of these enzymes is an important feature of several pathological conditions (List et al. 2002; Du et al. 2008; Scott et al. 2001), including tumorigenesis (Choi et al. 2009). TTSPs expressed in the human airways were identified as host factors that promote the proteolytic activation and spread of respiratory viruses such as influenza virus, human metapneumovirus, paramyxoviruses, human parainfluenza viruses, Sendai virus, orthoreoviruses, Group A rotaviruses and coronaviruses (Shirogane et al. 2008; Bertram et al. 2010, 2013; Shirato et al. 2013; Sakai et al. 2014; Sasaki et al. 2021; Nygaard et al. 2012), including SARS-CoV-2, the causative agent of the current COVID-19 pandemic (Zang et al. 2020; Hoffmann et al. 2020b). For a successful infection, influenza viruses depend on the proteolytic activation of their entry-mediating surface protein, hemagglutinin (HA). HA may be activated by diverse host trypsin-like proteases in vitro or in vivo. In particular, influenza viruses (e.g. H1N1 and H7N9 strains) hijack transmembrane serine proteases (including TMPRSS2, TMPRSS4, TMPRSS13, TMPRSS11D, TMPRSS11E, and ST14) (Laporte and Naesens 2017), that cleave HA into subunits HA1 and HA2, a prerequisite for conformational changes that trigger fusion. In coronaviruses, it is the spike (S) protein that promotes cell entry via membrane fusion (Shang et al. 2020; Li 2016). After receptor engagement, the coronavirus S protein is enzymatically processed by cellular proteases, mediating efficient membrane fusion with host cells (Bertram et al. 2013; Shirato et al. 2013; Hoffmann et al. 2020a, b; Matsuyama et al. 2020; Simmons et al. 2005). Human coronaviruses can use a variety of cellular proteases to prime their S protein, including TMPRSS2 and other members of the TTSP family (e.g., TMPRSS4 and TMPRSS11) (Zang et al. 2020; Millet and Whittaker 2015). Likewise, coronaviruses can utilize different receptors to promote their entry into the host cells: both SARS-CoV and SARS-CoV-2 use angiotensin-converting enzyme 2 (ACE2), whereas MERS-CoV and HCoV-229E exploit DPP4 (dipeptidyl peptidase 4) and ANPEP (aminopeptidase N or CD13), respectively (Hoffmann et al. 2020b; Raj et al. 2013; Yeager et al. 1992; Forni et al. 2017). Furthermore, previous studies have shown that SARS-CoV and other coronaviruses interact with multiple cell-surface molecules, such as CD209 (DC-SIGN), CLEC4M (L-SIGN/CD299), CLECG (LSECtin), and basigin (BSG or CD147) (Yang et al. 2004; Marzi et al. 2004; Gramberg et al. 2005; Vankadari and Wilce 2020; Wang et al. 2020a). Interestingly, many of these receptors are targets of positive selection (Cui et al. 2013; Bibiana et al. 2020; Guo et al. 2020; Damas et al. 2020; Ortiz et al. 2008; Enard et al. 2016; Sironi et al. 2015; Wang et al. 2020b). In line with its role in the activation of viral glycoproteins, TMPRSS2 variants were associated with a higher risk of severe disease and increased susceptibility to human influenza (Cheng et al. 2015). Variants in the same gene they may also play a significant role in the interindividual differences, particularly in the gender-related bias, of COVID-19 susceptibility and severity (Alshahawey et al. 2020). Moreover, TMPRSS2 polymorphisms may explain the higher mortality rate among Italians and the association to SARS-CoV-2 infection susceptibility in different populations (Asselta et al. 2020; Irham et al. 2020; Piva et al. 2021; Vargas-Alarcón et al. 2020; Hou et al. 2020). Given their involvement in viral priming, we reasoned that members of the TTSP family may also represent targets of positive selection, possibly as the result of virus-driven pressure. To test this hypothesis, we systematically analyzed the evolutionary patterns of TTSP genes in a large phylogeny of mammals. We also evaluated their genetic diversity in human populations.

Material and methods

Sequences, alignments, and phylogenetic trees

Coding sequences for TTSPs were retrieved from the UCSC server (http://genome.ucsc.edu/). Mammalian homologs of human genes were included only if they represented 1-to-1 orthologs, as reported in the EnsemblCompara GeneTrees (Vilella et al. 2009). Sequences that carry stop codons or have a sequence coverage of less than 50% were excluded. The list of species analyzed for each gene is reported in Supplementary Table S1. The RevTrans 2.0 utility was used to generate multiple sequence alignments (http://www.cbs.dtu.dk/services/RevTrans/, MAFFT v6.240 as an aligner) (Wernersson and Pedersen 2003). Phylogenetic trees were reconstructed using the phyML program with gamma-distributed rates, 4 substitution rate categories, and estimation of transition/transversion ratio and proportion of invariable sites (Guindon et al. 2009).

Evolutionary analysis in mammals

All alignments were screened for the presence of recombination using genetic algorithm recombination detection (GARD) (Kosakovsky Pond et al. 2006). GARD is a genetic algorithm implemented in the HYPHY suite, which uses phylogenetic incongruence among segments in the alignment to detect the best-fit number and location of recombination breakpoints. To obtain an overview of the selective patterns of mammalian TTSPs coding genes, the average nonsynonymous substitution (dN)/synonymous substitution (dS) rate ratio was calculated using the single-likelihood ancestor counting (SLAC) method (Kosakovsky Pond and Frost 2005). To detect positive selection, the codon-based codeml program implemented in the phylogenetic analysis by maximum likelihood (PAML) suite was applied (Yang 2007). Specifically, a model (M8, positive selection model) that allows a class of sites to evolve with dN/dS > 1 was compared to two models (M7 and M8a, neutral models) that do not allow dN/dS > 1. To assess statistical significance, twice the difference of the likelihood (ΔlnL) for the models (M8a vs M8 and M7 vs M8) is compared to a χ2 distribution (1 or 2 degrees of freedom for M8a vs M8 and M7 vs M8 comparisons, respectively). To assure reliability, two different codon substitution models were used (F3 × 4 and F61). Positively selected sites were identified using the following methods: (1) the Bayes empirical Bayes (BEB) analysis (with a cutoff of 0.90), which calculates the posterior probability that each codon is from the site class of positive selection (under model M8) (Anisimova et al. 2002); (2) fast unbiased Bayesian AppRoximation (FUBAR) (Murrell et al. 2013), an approximate hierarchical Bayesian method that generates an unconstrained distribution of selection parameters to estimate the posterior probability of positive diversifying selection at each site in a given alignment (with a cutoff ≥ 0.90); (3) the fixed effects likelihood (FEL) (Kosakovsky Pond and Frost 2005), a maximum-likelihood (ML) approach to infer dN/dS on a per-site basis, assuming that the selection pressure for each site is constant along the entire phylogeny (with a cutoff of ≥ 50). To be conservative, a site was considered under positive selection if it was detected by at least two methods. Protein structures were visualized with the PyMOL software (Schrödinger 2017), which was also used to create 3D protein figures. GARD, FEL, FUBAR, and SLAC analyses were run locally through the HyPhy suite (Pond et al. 2005). In order to identify specific branches with a proportion of sites evolving with dN/dS > 1, the adaptive Branch-Site Random Effects Likelihood method (aBS-REL) was used. This method applies sequential likelihood ratio tests to identify branches under positive selection without a priori knowledge about which lineages are of interest (Smith et al. 2015); branches identified using this approach were cross-validated using the branch-site likelihood ratio tests from PAML (models MA and MA1).

Population genetic analyses

Data from the 1000 Genomes Phase 1 Project (1000 Genomes Project Consortium et al. 2010) for individuals with East Asian (AS; Japanese plus Chinese) or European ancestry (CEU), and Yoruba subjects from Nigeria (YRI), were used to calculate nucleotide diversity and site frequency spectrum statistics for seven TTPS genes. Nucleotide diversity over the whole gene was measured as the average number of pairwise sequence nucleotide differences, π (Nei and Li 1979) and the expected per site heterozygosity, θW (Watterson 1975). The same statistics were also calculated in 5 kb sliding windows moving with a step of 250 bp. Three site frequency spectrum (SFS) statistics (Tajima's D, DT) (Tajima 1989), Fu and Li's D* and F* (Fu and Li 1993)] plus nucleotide diversity indicators were calculated for gene regions showing high level of diversity in the sliding-windows approach. In order to obtain a statistical significance for these values, the same tests were performed for a control set of ~ 2000 genes, as previously described (Forni et al. 2014, 2013). Briefly, ~ 2000 genes were randomly selected, with the only limitations to be longer than 5000 bp and to have more than 80% human-outgroup (chimpanzee or orangutan genomes) aligning bases (Forni et al. 2014, 2013). This approach allowed us to generate a distribution of the above statistics and the calculation of empirical percentiles. We considered as statistically significant those values with a percentile rank higher than the 95th. For the gene region analysis, we selected 5 kb independent gene regions from the 2000 random genes and we used them to generate SFS distributions. Orthologous genomic regions in chimpanzee, orangutan or macaque were retrieved using the liftOver tool (Kuhn et al. 2013).

Results and discussion

Evolutionary analysis of TTSPs in mammals

In humans, the TTSP family comprises 17 known members that share a number of common structural features (Fig. 1A). These include a short N-terminal cytoplasmic domain, a transmembrane domain, a variable stem region, and a C-terminal serine protease domain with a catalytic triad (Ser-His-Asp) (Böttcher-Friebertshäuser 2018). The stem region contains an assortment of modular structural domains that contribute to TTSP activation and interaction with substrates (Hooper et al. 2001; Antalis et al. 2010). Based on the composition of the extracellular protein portion and similarity in domain structure, chromosomal localization, and phylogenetic analysis of the serine protease domains, TTSPs are grouped in four subfamilies known as HAT/DESC, hepsin/TMPRSS (transmembrane protease/serine S1), matriptase, and corin (Böttcher-Friebertshäuser 2018; Szabo and Bugge 2008) (Fig. 1A).
Fig. 1

HYPERLINK "sps:id::fig1||locator::gr1||mediaobject::0" Evolutionary Analysis of TTSPs in Mammals. A Schematic representation of Type II Transmembrane Serine Proteases (TTSPs). TTPSs are grouped into four subfamilies based on domain composition and phylogenetic analyses (Szabo and Bugge. 2008) The name and the localization of the domains in the protein sequences are taken from the Uniprot database (https://www.uniprot.org/). LDL-receptor class A: Low-density lipoprotein receptor domain class A; SRCR: Scavenger receptor Cys-rich; SEA: sea urchin sperm protein, enterokinase, agrin; CUB: C1r, C1s, uEGF, and bone morphogenetic protein; MAM: domain in meprin, A5, receptor protein tyrosine phosphatase mu; SRCR: Scavenger receptor Cys-rich; FZ: Frizzled. Red arrow heads indicate the position of positively selected sites (see Table 1). The CORIN regions obtained on the basis of the recombination breakpoints are also reported. Viruses target of proteolytic activation by TTSPs are also reported (see Supplementary Table S2). B. Comparison of dN/dS values. Boxplot comparison of average dN/dS values calculated for TTSP genes. p values from the Wilcoxon rank sum test and average dN/dS values for each gene are reported. TTSPs involved (red) or not involved (blue) in the activation of viral glycoproteins are indicated. C. Branch-site analysis of positive selection. aBS-REL analysis for TMPRSS2 in mammals identified one branch showing evidence of positive selection (pink box). Branch lengths are scaled to the expected number of substitutions per nucleotide. The p value (red) from aBS-REL is reported. See Supplementary Table 1 for the full name of the species reported in the tree. See also Figure S1

HYPERLINK "sps:id::fig1||locator::gr1||mediaobject::0" Evolutionary Analysis of TTSPs in Mammals. A Schematic representation of Type II Transmembrane Serine Proteases (TTSPs). TTPSs are grouped into four subfamilies based on domain composition and phylogenetic analyses (Szabo and Bugge. 2008) The name and the localization of the domains in the protein sequences are taken from the Uniprot database (https://www.uniprot.org/). LDL-receptor class A: Low-density lipoprotein receptor domain class A; SRCR: Scavenger receptor Cys-rich; SEA: sea urchin sperm protein, enterokinase, agrin; CUB: C1r, C1s, uEGF, and bone morphogenetic protein; MAM: domain in meprin, A5, receptor protein tyrosine phosphatase mu; SRCR: Scavenger receptor Cys-rich; FZ: Frizzled. Red arrow heads indicate the position of positively selected sites (see Table 1). The CORIN regions obtained on the basis of the recombination breakpoints are also reported. Viruses target of proteolytic activation by TTSPs are also reported (see Supplementary Table S2). B. Comparison of dN/dS values. Boxplot comparison of average dN/dS values calculated for TTSP genes. p values from the Wilcoxon rank sum test and average dN/dS values for each gene are reported. TTSPs involved (red) or not involved (blue) in the activation of viral glycoproteins are indicated. C. Branch-site analysis of positive selection. aBS-REL analysis for TMPRSS2 in mammals identified one branch showing evidence of positive selection (pink box). Branch lengths are scaled to the expected number of substitutions per nucleotide. The p value (red) from aBS-REL is reported. See Supplementary Table 1 for the full name of the species reported in the tree. See also Figure S1
Table 1

Likelihood ratio test statistics for models of variable selective pressure among sites (F3 × 4 and F61 codon frequency model) for membrane-associated type II transmembrane serine protease genes

Gene/LRT modeln. of speciesF3 × 4F61Positively selected sitesc
− 2ΔlnLap valueb− 2ΔlnLap valueb
TMPRSS11D570.408
M8a vs M819.92.81 × 10–1029.291.95 × 10–1475T, 156E, 215S, 248F, 254R, 259L, 260I, 265 K, 314Q
M7 vs M824.572.13 × 10–1135.036.12 × 10–16
TMPRSS11E57
M8a vs M815.622.28 × 10–817.682.74 × 10–998E, 252K, 324Y, 347A
M7 vs M827.331.35 × 10–1227.121.67 × 10–12
TMPRSS11A54
M8a vs M82.950.01513.730.006314N, 39V, 62I, 413N
M7 vs M89.378.52 × 10–58.420.00022
TMPRSS11F56
M8a vs M80101
M7 vs M8011.390.25
HPN57
M8a vs M80101
M7 vs M85.190.005572.770.063
TMPRSS252
M8a vs M82.120.03954.360.0031103C,140G, 301P, 433V, 489I
M7 vs M89.66.67 × 10–511.638.9 × 10–6
TMPRSS354
M8a vs M8010.160.57
M7 vs M810.622.44 × 10–55.530.004
TMPRSS456
M8a vs M89.261.68 × 10–58.563.51 × 10–5103A, 161D, 166E, 179S, 298F, 319T
M7 vs M812.713.02 × 10–613.741.08 × 10–6
TMPRSS13
M8a vs M81.330.1-0.181
M7 vs M86.920.00098830.05
TMPRSS547
M8a vs M81.170.130.030.81
M7 vs M82.760.0630.090.91
TMPRSS1557
M8a vs M833.742.13 × 10–1639.039.99 × 10–1912H, 26A, 389A, 486I, 499Y, 545S, 621 V, 669E, 701 T, 918 V, 997L
M7 vs M859.311.75 × 10–2661.511.93 × 10–27
ST1458
M8a vs M80.120.629.681.08 × 10–5
M7 vs M813.171.91 × 10–617.053.94 × 10–8
TMPRSS657
M8a vs M8011.150.13
M7 vs M811.867.07 × 10–617.123.67 × 10–8
TMPRSS756
M8a vs M80.120.620.820.2
M7 vs M87.790.0004149.010.00012
TMPRSS956
M8a vs M80.5460.36.580.0003
M7 vs M86.6660.0012711.668.63 × 10–6
CORIN-reg157
M8a vs M88.960.002814.865.9 × 10–412R, 13C, 21P, 90E, 99 T, 118 V
M7 vs M823.821.06 × 10–637.288.03 × 10–9
CORIN-reg257
M8a vs M86.980.008213.080.0014138A, 163V, 252V, 274N, 278A,375V, 380H, 520 V, 558S, 588R
M7 vs M821.842.96 × 10–637.527.12 × 10–9
CORIN-reg357
M8a vs M80.120.72912.640.0018
M7 vs M80.920.33713.40.0012

Note: − 2ΔlnL values and p values ​​of genes target of positive selection are shown in bold

aTwice the difference of likelihood for the two models compared

bp value of rejecting the neutral models (M8a and M7) in favor of the positive selection model (M8)

cPositively selected sites detected by at least two methods among BEB, FEL, and FUBAR

TTSPs expressed in the human airways have been identified as host cell factors that may support the proteolytic activation and spread of respiratory viruses, including influenza viruses and coronaviruses. In particular, in vitro studies have demonstrated the involvement of 8 TTSP proteases (TMPRSS11D, TMPRSS11E, TMPRSS11A, TMPRSS2, TMPRSS4, TMPRSS13, TMPRSS15, and ST14) in the activation of HA and of spike proteins (Zang et al. 2020; Hoffmann et al. 2020b; Böttcher-Friebertshäuser 2018; Hayashi et al. 2018). Other viruses also require TTSPs (at least in vitro) for entry into the host cell. Indeed mammalian orthoreovirus, paramyxoviruses, human metapneumovirus, and human parainfluenza viruses require proteolytic activation through TMPRSS2 and/or TMPRSS11D at the viral entry step (Nygaard et al. 2012; Böttcher-Friebertshäuser 2018). More recently, Sasaki and colleagues demonstrated that TMPRSS2 and TMPRSS11D also mediate proteolytic activation and trypsin-independent infection in group A rotavirus (Sasaki et al. 2021) (Supplemetary Table S2, Fig. 1A). To comprehensively analyze the selective pressure acting on TTSPs and to obtain an estimate of the extent of constraint acting on these genes, we retrieved coding sequence information for all available mammalian species in the UCSC database (Supplementary Table 1). In particular, we retrieved sequences for at least 46 species for 16 TTSP genes. We excluded TMPRSS11B from the analysis, as the existence of a human pseudogene (TMPRSS11BNL) could affect evolutionary inference. Concerning TMPRSS13, we excluded from the analysis the region encoding the first 89 amino acids, as it showed limited similarity across species. Because recombination can generate false positive inferences of positive selection (Sironi et al. 2015; Anisimova et al. 2003), all alignments were screened for the presence of recombination using genetic algorithm recombination detection (GARD) (Kosakovsky Pond et al. 2006). Evidence of recombination was detected only for the CORIN gene. The CORIN coding alignment was thus split on the basis of the recombination breakpoints in three sub-regions (CORIN-reg1-3) and then used for the analyses. For coding genes, the strength of selection can be quantified by comparing the rate of non-synonymous nucleotide substitutions per non-synonymous site (dN) with that of synonymous substitution per synonymous site (dS) (Hughes and Nei 1989). We thus calculated the average dN/dS ratio (also referred to as ω) using the single-likelihood ancestor counting (SLAC) method (Kosakovsky Pond and Frost 2005). dN/dS values greater than 1 imply positive (diversifying) selection, whereas ratios lower than 1 indicate purifying selection (functional constraint). The expected dN/dS under selective neutrality is 1. In the 16 TTSPs, the average dN/dS ratio varied from 0.092 to 0.408 (Fig. 1B). Thus, as is the case of most mammalian genes (Sironi et al. 2015), purifying selection is the major force acting on these proteases. However, comparison of dN/dS values between TTSPs not involved (as of present knowledge) in the activation of viral glycoproteins and those with a verified role in this process indicated that the latter show generally faster evolutionary rates based on overall dN/dS (Wilcoxon test, p value = 0.02) (Fig. 1B). This is consistent with the idea that positive selection may act on specific sites in proteins that are otherwise selectively constrained. To test this hypothesis in TTSP genes, we applied the “site models” implemented in the codeml tool from the PAML suite (see methods). Using likelihood ratio tests (LRTs), codeml compares models of gene evolution that allow (model M8) or disallow (models M8a and M7) a class of codons to evolve with dN/dS  > 1. Specifically, M8 represents the positive selection model that is tested against the neutral M8a and M7 models. To assess statistical significance, twice the difference of the likelihood (ΔlnL) for the models (M7 vs M8 and M8a vs M8) was compared to a χ2 distribution (2 degrees of freedom for the M7 vs M8 comparison, 1 degree of freedom for M8a vs M8). To be conservative and to obtain robust results we applied two different codon frequency models (F3 × 4 and F61) and we called a gene as a target of positive selection only if it was detected by both comparisons (M7 versus M8, M8a versus M8) with both models. Applying these criteria, we found seven TTSP genes (TMPRSS11D, TMPRSS11E, TMPRSS11A, TMPRSS15, TMPRSS2, TMPRSS4, CORIN-reg1 and CORIN-reg2) to be targets of positive selection in mammals (Table 1). To identify specific sites targeted by positive selection, we again applied a conservative strategy and we called a site as positively selected only if it was detected by at least two of the following methods: BEB analysis from the M8 model, FUBAR and FEL methods (Kosakovsky Pond and Frost 2005; Murrell et al. 2013). Positively selected sites were identified in each of the seven genes and, as evident in Fig. 1A, they were not clustered, but rather distributed along the protein sequences. The only exception was TMPRSS11D, where a preponderance of selected sites was observed in the protease S1 domain. Likelihood ratio test statistics for models of variable selective pressure among sites (F3 × 4 and F61 codon frequency model) for membrane-associated type II transmembrane serine protease genes Note: − 2ΔlnL values and p values ​​of genes target of positive selection are shown in bold aTwice the difference of likelihood for the two models compared bp value of rejecting the neutral models (M8a and M7) in favor of the positive selection model (M8) cPositively selected sites detected by at least two methods among BEB, FEL, and FUBAR We next extended our analysis to explore possible variations in selective pressure across lineages. In particular, we aimed to assess whether specific internal branches in the mammalian phylogenies evolved under episodic positive selection. Because we did not want to make any a priori assumption about which lineages were more likely to have experienced adaptive evolution, we applied the branch site-random effects likelihood (aBS-REL) method (Smith et al. 2015) to the seven positively selected TTSPs. For three genes, aBS-REL identified the presence of a few positively selected branches with borderline p values (Fig. 1C and Supplementary Fig. S1). In particular, evidence of selection was detected for the branch leading to Eptesicus fuscus and the Myotis clade for TMPRSS11D, to the myomorpha suborder of Rodentia for TMPRSS11A, and to some lineages of the carnivora order for TMPRSS2. However, these results were not confirmed using the codeml branch-site LRT models (Zhang et al. 2005), suggesting that positive selection in TTSP genes is mainly pervasive rather than episodic. As mentioned above, an involvement of TTSP genes in the activation and spread of influenza viruses and/or coronaviruses was verified for 8 TTSP genes (Böttcher-Friebertshäuser 2018; Hayashi et al. 2018). Notably, six of the seven positively selected genes (TMPRSS11D, TMPRSS11E, TMPRSS11A, TMPRSS15, TMPRSS2, and TMPRSS4) are involved in the processing of viral glycoproteins, strongly suggesting that viral infections could represent the selective pressure driving the evolution of these TTSPs. Conversely, proteases with no known role in viral cell entry displayed no positive selection signals. The only exception was the CORIN gene, in which positive selection signals were identified in two of the 3 analyzed sub-regions. Corin is a cardiac membrane serine protease and plays a key role in maintaining cardiac function and regulating blood pressure (Yan et al. 2000). Corin deficiency causes spontaneous hypertension and cardiac hypertrophy in mice (Chan et al. 2005), and variants in the human CORIN gene can alter corin protein conformation, inhibit corin zymogen activation, and contribute to hypertension and cardiovascular diseases (Zhou and Wu 2014; Peng et al. 2015). The N-terminal cytoplasmic tail and the transmembrane domain are unnecessary for corin expression and catalytic activity (Knappe et al. 2003). However truncation of the cytoplasmic tail can create a new N-terminus that inhibits corin trafficking in the Golgi (He et al. 2020). Interestingly, three out of the 16 positively selected sites localized to the N-cytoplasmic tail. In particular, positively selected site Cys13 is polymorphic in human populations (rs2289433, p.Cys13Tyr) and it has been associated with serum corin levels (Zhang et al. 2019a, b). Taking into consideration the physiological functions of this protein and that, up to date, the involvement of corin in viral activation and spread has not been reported, the signal of positive selection is most likely driven by non-viral factors. For the other six positively selected genes, their role in the processing of viral glycoproteins has been widely described (Böttcher-Friebertshäuser 2018; Hayashi et al. 2018; Zmora et al. 2018; Kishimoto et al. 2021; Zang et al. 2020), supporting the hypothesis that viral infections represented the main selective force in the evolutionary history of these genes. For instance, in swine and mouse, TMPRSS2 can activate HA with monobasic cleavage site, indicating that homologous proteases are involved in HA cleavage in different host species (Tarnow et al. 2014; Peitsch et al. 2014). In addition to the canonical activator TMPRSS2, other TTSPs have been reported to function as entry cofactors of different viral fusion proteins, at least in vitro. For example, it was recently reported that TMPRSS4 works synergistically with TMPRSS2 to promote SARS-CoV-2 infection of small intestinal enterocytes (Zhang et al. 2020), and three members of the HAT/DESC subfamily (TMPRSS11D, TMPRSS11A, and TMPRSS11E) are able to activate the SARS-CoV, SARS-CoV-2, and MERS-CoV spike proteins, as well as influenza HA (Böttcher-Friebertshäuser 2018; Hayashi et al. 2018; Zmora et al. 2018; Kishimoto et al. 2021; Zhang et al. 2020; Bertram et al. 2011). The redundancy of function of the different proteases may appear to be in contrast with the presence of positive selection signals in the TTSP genes. However, we wish to mention that several lines of evidence suggest that this might not be the case. First, not all TTSPs have the same tissue/cell expression and not all of them may have the same cleavage efficiency. Second, protease activation may occur through different mechanisms in cells, as recently suggested by Liu and colleagues based on proteomic interaction data (Liu et al. 2021). Third, most data suggesting redundancy of TTSP function in terms of viral protein cleavage have been obtained in vitro and do not necessarily translate to the situation in vivo or in natural infections. Finally, since more proteases are able to cleave proteins of different viruses and the same viral protein can be cleaved by more proteases, different infectious agents may have exerted distinct selection pressures on the same gene, and this could explain, at least in part, the presence of selected sites distributed in different domains. As stated above, positively selected sites in TTSPs genes are not clustered, but rather distributed along the protein sequences. However, all TTSPs involved in the processing of viral glycoproteins have at least one selected site in the peptidase domain (Fig. 1A). Although such sites are not located near the active site (catalytic triad) of the protease domain (Fig. 2), we cannot exclude that they may have a role in modulating the proteolytic activity of these proteins. Likewise, we cannot rule out the possibility that the sites identified in the protease domains, as well as in other protein regions, are involved in direct interactions with the viral target proteins (e.g., spike protein in coronavirus, HA in influenza viruses, VP4 protein in group A rotaviruses). It is currently unknown whether proteases are able to establish direct interactions with their substrates.
Fig. 2

Positive selection acting on TTSP serine protease domains. Cartoon 3D representation of the serine protease domain for TMPRSS11E (a), TMPRSS15 (b), and TMPRSS2 (c). Positively selected sites are colored in red and refer to the human protein; the catalytic triad (Ser-His-Asp) is shown in blue. TMPRSS11E, TMPRSS15, and TMPRSS2 serine protease domain structures are derived from the Protein Data Bank (PDB) ids 2OQ5, 4DGJ, and 7MEQ, respectively. For all other TTSP domains with positively selected sites (see Fig. 1A and Table 1), no 3D structures were available in the PDB database

Positive selection acting on TTSP serine protease domains. Cartoon 3D representation of the serine protease domain for TMPRSS11E (a), TMPRSS15 (b), and TMPRSS2 (c). Positively selected sites are colored in red and refer to the human protein; the catalytic triad (Ser-His-Asp) is shown in blue. TMPRSS11E, TMPRSS15, and TMPRSS2 serine protease domain structures are derived from the Protein Data Bank (PDB) ids 2OQ5, 4DGJ, and 7MEQ, respectively. For all other TTSP domains with positively selected sites (see Fig. 1A and Table 1), no 3D structures were available in the PDB database In summary, evolutionary analyses indicate that most of the TTSP genes that have a verified role in proteolytic activation and spread of influenza viruses and/or coronaviruses present pervasive positive selection signals. This evidence strongly suggests that viral infections represent a major selective pressure driving the evolution of these type II transmembrane serine proteases in mammals. Nonetheless, we cannot exclude that the selection signals are due, at least partially, to other selective pressures associated with the physiological functions of these proteins. Future studies are required to clarify this aspect and to evaluate the functional relevance and selective advantages associated with these genetic changes in TTSP proteins during evolution.

Population genetic analysis in humans

Infectious diseases have exerted strong selective pressures throughout human history (Haldane 1932; Fumagalli et al. 2011). Indeed, a number of human loci have been shown to evolve in response to such pressures; the best documented example is the major histocompatibility complex (MHC) (Bernatchez and Landry. 2003), which is highly polymorphic in several species as a consequence of balancing selection. This is a process whereby genetic variability is maintained in populations due to some selective pressure. Molecular signals of balancing selection have also been described for several genes involved in immune defense, including toll-like receptors (Fumagalli et al. 2009; Ferrer-Admetlla et al. 2008), various cytokine genes (Fumagalli et al. 2009; Ferrer-Admetlla et al. 2008), antimicrobial peptides (AMPs) (Cagliani et al. 2008; Hollox and Armour. 2008), and TRIM5 (Cagliani et al. 2010). Thus, we addressed the role of natural selection in the shaping of genetic diversity at the seven positively selected TTSP genes in human populations. To this aim, we calculated θW (an estimate of the expected per site heterozygosity) (Watterson 1975) and π (the average number of pairwise sequence nucleotide differences between haplotypes) (Nei and Li 1979) in individuals with European ancestry (CEU), in Yoruba subjects from Nigeria (YRI), and in East Asian subjects (AS; Japanese plus Chinese). We then compared the values with the distribution obtained from 2000 randomly selected genes (see Methods for details). As reported in Supplementary Table S3, only TMPRSS2 and TMPRSS4 showed evidence of high nucleotide diversity in all populations, although the rank of θW and π did not reach the 95th percentile in all of them. High nucleotide diversity might be suggestive of balancing selection. Yet, as a result of mutations and recombination, balancing selection signatures tend to extend over relatively short genomic intervals (Wiuf et al. 2004; Charlesworth. 2006). Thus, in order to map the region(s) responsible for high nucleotide diversity in TMPRSS2 and TMPRSS4, we calculated θW and π in 5 kb sliding-windows moving along these genes. In order to analyze potential regulatory regions, we extended the analysis to 20 kb upstream and downstream each gene. Overall, the same trend of θW and π was evident in all populations for both genes (Figs. 3, 4). In TMPRSS2, sliding-windows analysis highlighted 3 regions with high nucleotide diversity in at least two populations (Fig. 3 and Table 2): one located in the 3′ region of TMPRSS2 (TMPRSS2-reg1) and falling in the MX1 gene, one in TMPRSS2 intron 8 (TMPRSS2-reg2), and one covering the promoter region (TMPRSS2-reg3). In the case of TMPRSS4, only two intragenic regions were identified (Fig. 4 and Table 2): one covering the 2 kb surrounding exon 2 (TMPRSS4-reg1) and one covering the entire 3′UTR (TMPRSS4-reg2).
Fig. 3

Human population genetic analysis of TMPRSS2. A Screenshot from the UCSC genome browser (http://genome.ucsc.edu/, GRCh37/hg19). The panel shows the analyzed genomic region (chr21:42816232–42900085), polymorphic variants associated with COVID-19 and with a potential functional effects (Asselta et al. 2020; Piva et al. 2021; Vargas-Alarcón et al. 2020), as well as a track with genetic variants likely affecting proximal gene expression in human tissues from the genotype-tissue expression (GTEx, V6 data release). The eQTL item color indicates the effect size attributed to the eQTL: red, high positive; light red, moderate positive; light blue, moderate negative; blue, high negative. B Nucleotide diversity analysis in human populations. Data from 1000 genomes Project were used to calculate θW and π in sliding windows of 5 kb moving with a step of 250 bp. Horizontal lines represent the 95th percentiles in the distribution of θW (blue) and π (red). Vertical dashed lines indicate the start and the end of TMPRSS2. Regions with θW and π greater than the 95th percentile in at least two populations are shaded in gray

Fig. 4

Human population genetic analysis of TMPRSS4. A Screenshot from the UCSC genome browser (http://genome.ucsc.edu/, GRCh37/hg19). The panel shows the analyzed genomic region (chr11:117927793–118012605) and a track with genetic variants likely affecting proximal gene expression in human tissues from the genotype-tissue expression (GTEx, V6 data release). The eQTL item color indicates the effect size attributed to the eQTL: red, high positive; light red, moderate positive; light blue, moderate negative; blue, high negative. B Nucleotide diversity analysis in human populations. Data from 1000 genomes Project were used to calculate θW and π in sliding windows of 5 kb moving with a step of 250 bp. Horizontal lines represent the 95th percentiles in the distribution of θW (blue) and π (red). Vertical dashed lines indicate the start and the end of TMPRSS4. Regions with θW and π greater than the 95th percentile in at least two populations are shaded in gray

Table 2

Nucleotide diversity and neutrality tests for the TMPRSS2 and TMPRSS4 gene regions

Gene (chromosomal position)PopulationSaπ (× 10–4)θW (× 10–4)Tajima's DFu and Li's D*Fu and Li's F*
ValueRankbValueRankbValueRankbValueRankbValueRankb
TMPRSS2-reg1CEU3829.969916.57992.38971.5972.2599
(chr21:42,818,732–42,822,732)YRI5838.959925.24991.65990.98801.5397
AS3329.129914.12993.05992.06992.9899
TMPRSS2-reg2CEU8650.549928.57992.39981.55972.3299
(chr21:42,843,982–42,849,232)YRI9954.689932.82992.08991.88992.3799
AS7847.479925.43992.66981.23932.2599
TMPRSS2-reg3CEU927.719912.56962.79991.28932.1799
(chr21:42,886,982–42,888,232)YRI1025.39813.93831.9499-0.19360.6981
AS1028.529913.7982.54970.56771.5293
TMPRSS4-reg1CEU2120.039818.31990.2656-0.15480.0151
(chr11:117,964,293–117,966,293)YRI2117.419218.2895-0.13670.81740.5375
AS2124.059917.97990.91730.3690.6674
TMPRSS4-reg2CEU3123.799915.45991.55891.65981.9397
(chr11: 117,988,543–117,992,043)YRI3125.769815.42891.93991.65972.1299
AS2927.329914.18992.62980.39731.5894

Significant values are in bold

aNumber of segregating sites

bPercentile rank relative to a distribution of 5-kb regions from 2000 random genes

Human population genetic analysis of TMPRSS2. A Screenshot from the UCSC genome browser (http://genome.ucsc.edu/, GRCh37/hg19). The panel shows the analyzed genomic region (chr21:42816232–42900085), polymorphic variants associated with COVID-19 and with a potential functional effects (Asselta et al. 2020; Piva et al. 2021; Vargas-Alarcón et al. 2020), as well as a track with genetic variants likely affecting proximal gene expression in human tissues from the genotype-tissue expression (GTEx, V6 data release). The eQTL item color indicates the effect size attributed to the eQTL: red, high positive; light red, moderate positive; light blue, moderate negative; blue, high negative. B Nucleotide diversity analysis in human populations. Data from 1000 genomes Project were used to calculate θW and π in sliding windows of 5 kb moving with a step of 250 bp. Horizontal lines represent the 95th percentiles in the distribution of θW (blue) and π (red). Vertical dashed lines indicate the start and the end of TMPRSS2. Regions with θW and π greater than the 95th percentile in at least two populations are shaded in gray Human population genetic analysis of TMPRSS4. A Screenshot from the UCSC genome browser (http://genome.ucsc.edu/, GRCh37/hg19). The panel shows the analyzed genomic region (chr11:117927793–118012605) and a track with genetic variants likely affecting proximal gene expression in human tissues from the genotype-tissue expression (GTEx, V6 data release). The eQTL item color indicates the effect size attributed to the eQTL: red, high positive; light red, moderate positive; light blue, moderate negative; blue, high negative. B Nucleotide diversity analysis in human populations. Data from 1000 genomes Project were used to calculate θW and π in sliding windows of 5 kb moving with a step of 250 bp. Horizontal lines represent the 95th percentiles in the distribution of θW (blue) and π (red). Vertical dashed lines indicate the start and the end of TMPRSS4. Regions with θW and π greater than the 95th percentile in at least two populations are shaded in gray Nucleotide diversity and neutrality tests for the TMPRSS2 and TMPRSS4 gene regions Significant values are in bold aNumber of segregating sites bPercentile rank relative to a distribution of 5-kb regions from 2000 random genes An effect of balancing selection is a distortion of the site frequency spectrum (SFS) toward intermediate frequency alleles. Deviation in the SFS is identified by neutrality tests, including Tajima's D (DT) (Tajima 1989) and Fu and Li's D* and F* (Fu and Li. 1993). DT tests the departure from neutrality by comparing θW and π, and positive values indicate an excess of intermediate frequency variants. Fu and Li's F* and D* are also based on SNP frequency spectra and differ from DT in that they also take into account whether mutations occur in external or internal branches of a genealogy. For TMPRSS2-reg1 and TMPRSS2-reg2, tests indicated departure from neutrality with significantly positive values for most statistics in all populations (Table 2). Conversely, for TMPRSS2-reg3, neutrality tests showed more significant values in the CEU population. With respect to TMPRSS4, significantly positive values for most statistics were observed only in the region covering the 3′-UTR (Table 2). Overall, these analyses indicate that both TMPRSS2 and TMPRSS4 may be targets of ongoing balancing selection in human populations. The action of natural selection obviously implies the presence of a functional variant with an effect on fitness and a selective pressure to act on it. Recently, variants in TMPRSS2 with a potential functional role and/or associated to viral infection susceptibility have been identified (Asselta et al. 2020; Irham et al. 2020; Piva et al. 2021; Vargas-Alarcón et al. 2020). Specifically, two variants in TMPRSS2-reg2 (rs55964536 and rs7364083) define two distinct haplotypes with different frequencies among populations and were predicted to modulate TMPRSS2 expression (Asselta et al. 2020) (Fig. 3). The rs55964536 polymorphism was reported as an eQTL in lung tissue (genotype-tissue expression, GTEx) (GTEx Consortium et al. 2017) and the haplotype to which rs55964536 belongs is functionally linked to another eQTL variant (rs8134378) located at a known androgen-responsive enhancer for TMPRSS2 (Asselta et al. 2020). Based on these observations, Asselta and colleagues suggested that upregulation of TMPRSS2 expression level through androgens could explain the sex bias in COVID-19 severity. On the other hand, rs7364083 belongs to the second haplotype. This latter is also defined by a variant (rs2070788) associated with increased gene expression, a higher risk of severe disease with the 2009 H1N1 pandemic influenza, and increased susceptibility to human H7N9 flu (Cheng et al. 2015). In TMPRSS2-reg2, another variant (rs2298661) in association with both COVID-19 disease susceptibility and with increased risk of worse disease outcomes was identified through a trans-ancestry meta-analysis (Horowitz et al. 2021). The rs2298661 polymorphism is in tight linkage disequilibrium (LD R2 > 0.8) with a non-synonymous variant (rs12329760, p.V160M). V160M falls in an exonic splicing enhancer site (Srp40) and is also predicted to decrease protein stability in structural models, with a plausible role in viral entry (Vishnubhotla et al. 2020). Taken together, these evidences suggest that the balancing selection signal in TMPRSS2-reg2 could be secondary to the presence of variants with a potential functional effect, most likely on gene expression regulation. Overall, these variants with different allele frequencies in different populations could play an important role in viral entry, helping to explain the heterogeneity observed in different ethnic groups with regard to incidence, severity, progression, and therapeutic response to COVID-19 (Alshahawey et al. 2020). As reported in Fig. 3, other eQTL variants fall in TMPRSS2-reg1 and -reg3. In particular, a cluster of eQTLs modulating expression levels of TMPRSS2 and MX1 in lung and tyroid falls in TMPRSS2-reg1, whereas whole-blood MX2 eQTLs are present in TMPRSS2-reg3. MX1 and MX2 are involved in antiviral response. In particular, MX1 can restrict a wide variety of viruses, including orthomyxoviruses (e.g., influenza and Thogoto), bunyaviruses (e.g., La Crosse and rift valley viruses), and hepatitis B virus (Verhelst et al. 2013). Conversely, MX2 is known to counteract retroviruses (Kane et al. 2013), and to exert a weak effect against vescicular stomatitis virus infection (Liu et al. 2012). Moreover, MX1 expression is higher in COVID-19 patients compared to non-infected subjects and tends to decrease with age (Bizzotto et al. 2020). Variants in TMPRSS2-reg1 might thus contribute to modulate both TMPRSS2 and MX1 expression, acting on the ability to infect and to trigger an effective antiviral response. As stated above, TMPRSS2-reg1 falls within the MX1 gene, hence deviation from neutrality with significantly positive values for most SFS statistics in human populations may be attributable to a specific selection signal for the MX1 gene rather than TMPRSS2. However, within this region, there are eQTL variants for TMPRSS2; these variants may act to determine a different expression of the protease and, consequently, a different ability to activate the viral proteins that are TMPRSS2 protease targets. With respect to TMPRSS4, information on its regulation is more limited. However, eQTLs are present in TMPRSS4-reg2. Specifically, different eQTLs for the esophagous mucosa map to this genomic region. In summary, high nucleotide diversity and a shift in the SFS toward intermediate-frequency alleles suggest that the natural selection signals in these TMPRSS2 and TMPRSS4 regions are driven by the presence of polymorphic variants with a functional role that could contribute to infection with and spread of viruses, and/or to the modulation of disease clinical course.

Conclusions

Type II transmembrane serine proteases (TTSPs) are involved in the proteolitic activation of different viruses. In particular, the cleavage of surface glycoproteins by some proteases has been reported for influenza viruses and coronaviruses, but also for other respiratory viruses. These observations led us to hypothesize that the members of this family may be targets of virus-driven selective pressure. It is known that natural selection events acted on the receptors used by viruses to promote their entry into the cell during the early stages of infection. From this point of view, coronavirus receptors, such as ACE2, DPP4, and ANPEP have been extensively studied. However, until now, systematic evolutionary studies had never been carried out on TTSP gene family. In this study, the application of specific molecular evolution tests showed that 7 TTSP genes evolved adaptively in mammals. Six of these are involved in the proteolytic activation of different viral proteins, suggesting that host–pathogen arms races drive the evolution of these genes. Since more proteases are able to prime proteins of different viruses, we may speculate that they play an important role in determining viral tropism and disease progression, explaining, at least partially, the apparent redundancy of the function of these proteases. Regarding the positively selected sites, we observed that all the TTSP genes have at least one site in the protease domain, that these sites are not in close proximity to the catalytic domain and that in general the selected sites are distributed along the gene. Taken together, these results suggest that different infectious agents may have exerted distinct selection pressures on the same gene and that sites targeted by positive selection in the protease and in other domains could participate in the formation of bonds or in stabilizing the reciprocal position of the host and viral proteins. Of particular interest, especially at the present time, is TMPRSS2. This protease represents the canonical activator of SARS-CoV-2, although its role in priming has been confirmed for many other respiratory viruses. Notably, TMPRSS2 presents pervasive positive selection signals in the mammalian phylogeny and could be target of ongoing balancing selection in human populations. In the latter case, the signals of natural selection could be driven by the presence of polymorphic variants with a functional role that may contribute to the infection with and the spread of viruses, and/or to the modulation of the clinical course of the disease. As the world continues to wrestle with the COVID-19 pandemic, research on the virus aims to develop therapies to control the health and socio-economic emergency. Different vaccine platforms are being used for vaccine development (Kyriakidis et al. 2021), most of them aiming to induce neutralizing antibodies against the spike protein (S). Some of these are already used in vaccination campaigns worldwide, others are close to receiving approval or authorization for large-scale vaccinations. However, the emergence of new lineages of SARS-CoV-2 carrying variants in the spike protein raises concerns about a reduction in the efficacy of the vaccines currently in use. Furthermore, vaccination during the pandemic could exert selective pressures that might favor the expansion of vaccine-resistant SARS-CoV-2 lineages. In this context, it is important to combine vaccinations with alternative therapies aimed at impairing the virus capacity to spread through the host cells. Thus, several studies have identified serine proteases as potential drug targets for the treatment of SARS-CoV-2 infection (Baughn et al. 2020; Qiao et al. 2020; Tang et al. 2020; Zarubin et al. 2020; Deng et al. 2021). TMPRSS2 is a well-characterized TTSP and represents one of the essential host factors in viral activation and cell entry of influenza viruses, such as pandemic influenza H1N1, and different coronaviruses, including SARS-CoV-2. TMPRSS2 is dispensable for development and homeostasis in mice (Kim et al. 2006). Thus, this protease is potentially one of the most promising drug targets for COVID-19 therapy. Therapeutics targeting TMPRSS2 expression and/or activity may be beneficial not only for coronavirus-infected patients but also for those infected with influenza viruses. Indeed, promising data have been obtained from the use of TMPRSS2 inhibitors such as camostat and nafamostat mesylate, which seem to have good efficacy against coronaviruses (Hoffmann et al. 2020b, c). The design of selective inhibitors will be facilitated by a better understanding of functional genetic diversity, both in humans and in other mammals that are used as model organisms for drug development. Data reported in this study indicate that, in analogy to other genes with a role in viral infections, TTSPs have evolved adaptively in mammals (Sironi et al. 2015). In particular a continuum of selective pressure acting on TMPRSS2 and TMPRSS4 was observed as these genes also represent selection targets in human populations. This scenario may be the result of an ancestral and still ongoing host–pathogen arms race. Overall, our results provide evolutionary information about candidate functional sites that may shape infection outcomes across mammals and within humans. Below is the link to the electronic supplementary material. Supplementary file1 (DOCX 13 KB) Supplementary file2 (DOCX 10 KB) Supplementary file3 (DOCX 8 KB) Supplementary Figure S1. Branch-site analysis of positive selection. aBS-REL analysis for TMPRSS11D and TMPRSS11A in mammals. Branches showing evi-dence of positive selection (pink boxes) and the corresponding p values (red) are al-so reported. Branch lengths are scaled to the expected number of substitutions per nucleotide. See supplementary Table 1 for the full name of the species reported in the tree (PDF 28 KB)
  61 in total

Review 1.  MHC studies in nonmodel vertebrates: what have we learned about natural selection in 15 years?

Authors:  L Bernatchez; C Landry
Journal:  J Evol Biol       Date:  2003-05       Impact factor: 2.411

Review 2.  The cutting edge: membrane-anchored serine protease activities in the pericellular microenvironment.

Authors:  Toni M Antalis; Marguerite S Buzza; Kathryn M Hodge; John D Hooper; Sarah Netzel-Arnett
Journal:  Biochem J       Date:  2010-06-15       Impact factor: 3.857

Review 3.  Type II transmembrane serine proteases.

Authors:  Thomas H Bugge; Toni M Antalis; Qingyu Wu
Journal:  J Biol Chem       Date:  2009-06-01       Impact factor: 5.157

4.  Long-term balancing selection maintains trans-specific polymorphisms in the human TRIM5 gene.

Authors:  R Cagliani; M Fumagalli; M Biasin; L Piacentini; S Riva; U Pozzoli; M C Bonaglia; N Bresolin; M Clerici; Manuela Sironi
Journal:  Hum Genet       Date:  2010-09-02       Impact factor: 4.132

5.  Cleavage and activation of the severe acute respiratory syndrome coronavirus spike protein by human airway trypsin-like protease.

Authors:  Stephanie Bertram; Ilona Glowacka; Marcel A Müller; Hayley Lavender; Kerstin Gnirss; Inga Nehlmeier; Daniela Niemeyer; Yuxian He; Graham Simmons; Christian Drosten; Elizabeth J Soilleux; Olaf Jahn; Imke Steffen; Stefan Pöhlmann
Journal:  J Virol       Date:  2011-10-12       Impact factor: 5.103

6.  TMPRSS2 and TMPRSS4 facilitate trypsin-independent spread of influenza virus in Caco-2 cells.

Authors:  Stephanie Bertram; Ilona Glowacka; Paulina Blazejewska; Elizabeth Soilleux; Paul Allen; Simon Danisch; Imke Steffen; So-Young Choi; Youngwoo Park; Heike Schneider; Klaus Schughart; Stefan Pöhlmann
Journal:  J Virol       Date:  2010-07-14       Impact factor: 5.103

7.  TMPRSS2 activates the human coronavirus 229E for cathepsin-independent host cell entry and is expressed in viral target cells in the respiratory epithelium.

Authors:  Stephanie Bertram; Ronald Dijkman; Matthias Habjan; Adeline Heurich; Stefanie Gierer; Ilona Glowacka; Kathrin Welsch; Michael Winkler; Heike Schneider; Heike Hofmann-Winkler; Volker Thiel; Stefan Pöhlmann
Journal:  J Virol       Date:  2013-03-27       Impact factor: 5.103

Review 8.  Balancing selection and its effects on sequences in nearby genome regions.

Authors:  Deborah Charlesworth
Journal:  PLoS Genet       Date:  2006-04       Impact factor: 5.917

9.  The signature of long-standing balancing selection at the human defensin beta-1 promoter.

Authors:  Rachele Cagliani; Matteo Fumagalli; Stefania Riva; Uberto Pozzoli; Giacomo P Comi; Giorgia Menozzi; Nereo Bresolin; Manuela Sironi
Journal:  Genome Biol       Date:  2008-09-25       Impact factor: 13.583

10.  SARS-CoV-2 Infection Boosts MX1 Antiviral Effector in COVID-19 Patients.

Authors:  Juan Bizzotto; Pablo Sanchis; Mercedes Abbate; Sofía Lage-Vickers; Rosario Lavignolle; Ayelén Toro; Santiago Olszevicki; Agustina Sabater; Florencia Cascardo; Elba Vazquez; Javier Cotignola; Geraldine Gueron
Journal:  iScience       Date:  2020-09-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.