Literature DB >> 28035001

Structural and Molecular Evidence Suggesting Coronavirus-driven Evolution of Mouse Receptor.

Guiqing Peng1,2, Yang Yang1, Joseph R Pasquarella1, Liqing Xu1,3, Zhaohui Qian4,5, Kathryn V Holmes5, Fang Li6.   

Abstract

Hosts and pathogens are locked in an evolutionary arms race. To infect mice, mouse hepatitis coronavirus (MHV) has evolved to recognize mouse CEACAM1a (mCEACAM1a) as its receptor. To elude MHV infections, mice may have evolved a variant allele from the Ceacam1a gene, called Ceacam1b, producing mCEACAM1b, which is a much poorer MHV receptor than mCEACAM1a. Previous studies showed that sequence differences between mCEACAM1a and mCEACAM1b in a critical MHV-binding CC' loop partially account for the low receptor activity of mCEACAM1b, but detailed structural and molecular mechanisms for the differential MHV receptor activities of mCEACAM1a and mCEACAM1b remained elusive. Here we have determined the crystal structure of mCEACAM1b and identified the structural differences and additional residue differences between mCEACAM1a and mCEACAM1b that affect MHV binding and entry. These differences include conformational alterations of the CC' loop as well as residue variations in other MHV-binding regions, including β-strands C' and C'' and loop C'C''. Using pseudovirus entry and protein-protein binding assays, we show that substituting the structural and residue features from mCEACAM1b into mCEACAM1a reduced the viral receptor activity of mCEACAM1a, whereas substituting the reverse changes from mCEACAM1a into mCEACAM1b increased the viral receptor activity of mCEACAM1b. These results elucidate the detailed molecular mechanism for how mice may have kept pace in the evolutionary arms race with MHV by undergoing structural and residue changes in the MHV receptor, providing insight into this possible example of pathogen-driven evolution of a host receptor protein.
© 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

Entities:  

Keywords:  X-ray crystallography; evolution; receptor structure-function; virus; virus entry

Mesh:

Substances:

Year:  2016        PMID: 28035001      PMCID: PMC5313091          DOI: 10.1074/jbc.M116.764266

Source DB:  PubMed          Journal:  J Biol Chem        ISSN: 0021-9258            Impact factor:   5.157


Introduction

According to the Red Queen hypothesis, hosts and pathogens are in an evolutionary arms race to keep pace with each other for fitness and survival (1, 2). Coronaviruses are a large family of ancient and diverse RNA virus pathogens that infect many mammalian and avian species (3, 4). Different coronaviruses use a variety of cell surface receptors for entry into host cells through the activities of virus-surface spike proteins (5, 6). The host receptor-adapting evolution of coronavirus spike proteins has been extensively studied (3–6), but coronavirus-driven evolution of host receptors is much less well understood. The current study investigates how a host receptor may undergo molecular changes under possible selective pressure from lethal coronavirus infections and how these changes may help the host to resist death from coronavirus infections. As the prototypic member of the coronavirus family, mouse hepatitis coronavirus (MHV) presents a good model system for studying the co-evolutionary relationship between viruses and hosts. Depending on the strain, MHV can cause enteric, respiratory, or brain infections in mice. The enterotropic strains of MHV spread widely in susceptible mouse populations and are lethal in infant mice of many inbred strains (up to 100% fatality) (7). Infection with MHV is a major concern in laboratory mice because it can disrupt mouse-based research through clinical disease and/or alteration of immunologic responses (8). MHV uses a cell-surface protein, mouse carcinoembryonic antigen-related cell adhesion molecule 1a (mCEACAM1a), as its host receptor (9, 10). The CEACAM1 protein is widely expressed in all mammals on the membranes of epithelial cells, endothelial cells, and leukocytes (11). It mediates cell-cell adhesion and signaling and participates in the differentiation and arrangement of tissue three-dimensional structure, angiogenesis, apoptosis, tumor suppression, cancer metastasis, and the modulation of innate and adaptive immune responses (12). The envelope-anchored MHV spike glycoprotein specifically recognizes mCEACAM1a through the N-terminal domain of its S1 subunit (S1-NTD) (9, 13, 14). Our previous structural studies revealed that coronavirus S1-NTDs have the same tertiary structural fold as human galectins (galactose-binding lectins), and that whereas MHV S1-NTD recognizes mCEACAM1a, bovine coronavirus (BCoV) S1-NTD recognizes sugar (15, 16). Thus, we proposed that coronaviruses acquired a host galectin gene and inserted it into their spike protein gene, and that whereas bovine coronavirus S1-NTD has kept its original sugar-binding lectin activity, MHV S1-NTD has evolved novel mCEACAM1a-binding affinity and lost its original sugar-binding lectin activity. These studies have provided insight into the host receptor-adapting evolution of coronaviruses (5, 6). To respond to the selective pressure from lethal MHV infections, mice may have evolved a variant allele from the Ceacam1a gene, called Ceacam1b; of the two gene products, mCEACAM1b is a much less efficient MHV receptor than mCEACAM1a (17–19). Correspondingly, mice homozygous for Ceacam1b (1b/1b) are resistant to death from MHV infections, whereas mice homozygous for Ceacam1a (1a/1a) are highly susceptible to lethal MHV infections (7, 9, 10, 20, 21). Other than their different MHV receptor activities, mCEACAM1a and mCEACAM1b appear to be functionally equivalent: neither 1a/1a mice nor 1b/1b mice show any growth defects, whereas a dysfunctional Ceacam1 gene leads to impaired insulin clearance, abnormal weight gain, and reduced fertility (22). Our previous structural studies of mCEACAM1a and its complex with MHV S1-NTD have delineated detailed interactions between mCEACAM1a and MHV S1-NTD (15, 23). Moreover, previous studies identified the CC′ loop (loop that connects β-strands C and C′) in mCEACAM1a as critical for MHV binding; the sequence of this loop diverges in mCEACAM1b, partially accounting for the low MHV receptor activity of mCEACAM1b (17, 24). However, due to the lack of structural information about mCEACAM1b, it was not known what structural differences between mCEACAM1a and mCEACAM1b or whether additional residue differences between mCEACAM1a and mCEACAM1b account for the MHV resistance in mice homozygous for Ceacam1b. In this study we have determined the crystal structure of mCEACAM1b and elucidated the structural differences and additional residue differences between mCEACAM1a and mCEACAM1b that impede the binding of MHV S1-NTD to mCEACAM1b. Moreover, we have performed structure-guided mutagenesis studies on mCEACAM1a and mCEACAM1b to investigate the significance of their structural and sequence differences upon their MHV receptor activities. These results provide insight into the possibility that MHV has driven the evolution of the mCEACAM1 protein in mice.

Results

Due to alternative mRNA splicing, mCEACAM1 contains either two [D1 and D4] or four [D1-D4] Ig-like domains in tandem, in addition to a transmembrane anchor and a short intracellular tail at its C terminus (12). mCEACAM1b[D1,D4] (residues 1–202) without the membrane anchor or the intracellular tail was expressed and purified as previous described for mCEACAM1a[D1,D4] (15). It was subsequently crystallized in space group P3121, a = 113.1 Å, b = 113.1 Å, and c = 64.4 Å. Although each asymmetric unit of the crystal contains two mCEACAM1b[D1,D4] molecules, the protein is a monomer in solution based on gel filtration chromatography. The structure was determined by molecular replacement using the structure of mCEACAM1a[D1,D4] as the search template and refined at 3.1 Å resolution (Fig. 1A; Table 1). The final model contains all of the residues in domains D1 and D4 and a glycan N-linked to Asn-270.
FIGURE 1.

Overall structure of mouse CEACAM1b. A, crystal structure of mCEACAM1b[D1,D4] containing the D1 and D4 domains. Secondary structures in domain D1 of mCEACAM1b[D1,D4] are named after the mCEACAM1a[D1,D4] structure (23). B, crystal structure of mCEACAM1a[D1,D4] (PDB 1L67) (23). VBMs are in blue. C, crystal structure of mCEACAM1a[D1,D4] complexed with MHV S1-NTD (PDB 3R4D) (16). MHV S1-NTD is in cyan, with receptor-binding motifs (RBMs) in red. D, sequence alignment of mCEACAM1b[D1,D4] and mCEACAM1a[D1,D4]. β-strands are shown as arrows. VBMs in mCEACAM1a and the corresponding regions in mCEACAM1b are in blue. Asterisks indicate positions that have fully conserved residues; colons indicate positions that have strongly conserved residues; periods indicate positions that have weakly conserved residues. The boundary between domains D1 and D4 is indicated by a black line. Sequence alignment was done using ClustalW (48). Structural illustrations were done using PyMOL (49).

TABLE 1

Data collection and refinement statistics

mCEAMCAM1b
Data collection
    Space groupP3121
    Cell dimensions
        a, b, c (Å)113.097, 113.097, 64.380
        α, β, γ (°)90, 90, 120
    Resolution (Å)16.97-3.10 (3.21-3.10)a
    Rsym0.114 (0.507)
    II9.15 (1.78)
    Completeness (%)96.3 (86.4)
    Redundancy2.5 (2.1)

Refinement
    Resolution (Å)16.97-3.10
    No. reflections8557
    Rwork/Rfree0.216/0.273
    No. atoms1579
        Protein1541
        Ligand28
        Water10
    B-Factors (Å2)40.49
        Protein40.57
        Ligand41.59
        Water23.83
    r.m.s.d.
        Bond lengths (Å)0.015
        Bond angles (°)1.46
    Ramachandran plot
        Favored (%)93.4
        Outliers (%)0.51

Values in parentheses are for highest resolution shell.

Overall structure of mouse CEACAM1b. A, crystal structure of mCEACAM1b[D1,D4] containing the D1 and D4 domains. Secondary structures in domain D1 of mCEACAM1b[D1,D4] are named after the mCEACAM1a[D1,D4] structure (23). B, crystal structure of mCEACAM1a[D1,D4] (PDB 1L67) (23). VBMs are in blue. C, crystal structure of mCEACAM1a[D1,D4] complexed with MHV S1-NTD (PDB 3R4D) (16). MHV S1-NTD is in cyan, with receptor-binding motifs (RBMs) in red. D, sequence alignment of mCEACAM1b[D1,D4] and mCEACAM1a[D1,D4]. β-strands are shown as arrows. VBMs in mCEACAM1a and the corresponding regions in mCEACAM1b are in blue. Asterisks indicate positions that have fully conserved residues; colons indicate positions that have strongly conserved residues; periods indicate positions that have weakly conserved residues. The boundary between domains D1 and D4 is indicated by a black line. Sequence alignment was done using ClustalW (48). Structural illustrations were done using PyMOL (49). Data collection and refinement statistics Values in parentheses are for highest resolution shell. The overall structure of mCEACAM1b[D1,D4] is similar to that of mCEACAM1a[D1,D4], but the structural similarity is uneven in different regions of the two proteins. In both mCEACAM1a and mCEACAM1b, the two Ig-like domains, D1 and D4, are arranged in tandem without any significant interactions with each other (Fig. 1, A and B). In mCEACAM1a, the D1 domain binds to MHV S1-NTD, whereas the D4 domain has no contact with MHV S1-NTD (Fig. 1C). In the D1 domain of mCEACAM1a, several loops (CC′, C′C′′, C′′D, and FG) and β-strands (βC, βC′, and βC′′) are directly involved in MHV binding, and thus these regions have been called the virus-binding motifs (VBMs) (Fig. 1, C and D). Interestingly, the D1 domains of mCEACAM1a and mCEACAM1b are significantly more divergent in both primary structure (sequence identity = 74%) and tertiary structure (main chain r.m.s.d. = 1.11 Å) than the D4 domains (sequence identity = 98%; main chain r.m.s.d. = 0.77 Å) (Fig. 2). Furthermore, within the D1 domain, the VBMs of mCEACAM1a and mCEACAM1b are more divergent in both primary structure (sequence identity = 56%) and tertiary structure (main chain r.m.s.d. = 1.36 Å) than are the non-virus-binding regions (Fig. 2, C and D). These results suggest that compared with the rest of the protein, the VBMs in the D1 domains of mCEACAM1a have been under strong selective pressure possibly from MHV infections.
FIGURE 2.

Structural comparisons of mCEACAM1a and mCEACA1b. A, overlay of mCEACAM1a and mCEACAM1b in domain D1. mCEACAM1a is in black, and mCEACAM1b in yellow. VBMs in mCEACAM1a and the corresponding regions in mCEACAM1b are in blue. B, overlay of mCEACAM1a and mCEACAM1b in domain D4. C, structure of mCEACAM1b showing distribution of the residues that differ between mCEACAM1a and mCEACAM1b. Regions corresponding to VBMs in mCEACAM1a are in blue. Residues that differ between mCEACAM1a and mCEACAM1b are shown as balls and sticks. D, sequence identities and r.m.s.d. between mCEACAM1a and mCEACAM1b in different regions. See Fig. 1D for the residue ranges of domains and VBMs. r.m.s.d. were calculated using Coot (50).

Structural comparisons of mCEACAM1a and mCEACA1b. A, overlay of mCEACAM1a and mCEACAM1b in domain D1. mCEACAM1a is in black, and mCEACAM1b in yellow. VBMs in mCEACAM1a and the corresponding regions in mCEACAM1b are in blue. B, overlay of mCEACAM1a and mCEACAM1b in domain D4. C, structure of mCEACAM1b showing distribution of the residues that differ between mCEACAM1a and mCEACAM1b. Regions corresponding to VBMs in mCEACAM1a are in blue. Residues that differ between mCEACAM1a and mCEACAM1b are shown as balls and sticks. D, sequence identities and r.m.s.d. between mCEACAM1a and mCEACAM1b in different regions. See Fig. 1D for the residue ranges of domains and VBMs. r.m.s.d. were calculated using Coot (50). Further inspection of the structures of mCEACAM1a and mCEACAM1b has identified detailed structural divergence between the VBMs of the two proteins. In mCEACAM1a, a critical CC′ loop (loop that connects β-strands C and C′) in the D1 domain interacts extensively with MHV S1-NTD and thus plays a prominent role in the virus/receptor binding interactions (Fig. 3A). These interactions include the multiple hydrophobic interactions between the side chain of receptor Ile-41 and the side chains of MHV Tyr-15, Leu-89, and Leu-160 as well as the hydrogen bonds between the carbonyl oxygen of receptor Thr-39 and the side chain of MHV Arg-20. Compared with mCEACAM1a, mCEACAM1b has undergone significant structural changes in loop CC′ (Fig. 3, B and C), which result from residue changes in this loop. For example, residue 38 is a threonine in mCEACAM1a but a proline in mCEACAM1b. This residue change likely has a significant impact on the conformation of loop CC′ because prolines are known to cause changes to protein secondary structures. Additionally, a number of residues in other VBM regions of mCEACAM1a also form critical interactions with MHV S1-NTD but have been substituted with different residues in mCEACAM1b. For example, in mCEACAM1a, the side chain of receptor Arg-47 in strand βC′ forms hydrogen bonds with the carbonyl oxygens of MHV Gln-23 and Val-25, the side chains of receptor Met-54 and Phe-56 in strand βC′ are part of a hydrophobic cluster at the S1-NTD/receptor interface, and the side chain of receptor Asn-59 in strand βC′ forms hydrophobic stacking with the Cα of S1-NTD Gly-29 (Fig. 4, A and B). However, Arg-47, Met-54, Phe-56, and Asn-59 in mCEACAM1a have been substituted with His-47, Lys-54, Thr-56, and Pro-59, respectively, in mCEACAM1b (Fig. 4, C and D). The above structural and residue changes in the VBMs from mCEACAM1a to mCEACAM1b would lead to the loss of numerous energetically favorable interactions at the S1-NTD/receptor interface and disrupt the virus/receptor binding interactions. These structural analyses further suggest that the VBMs in the D1 domain of mCEACAM1 have been under strong selective pressure possibly from MHV infections.
FIGURE 3.

Structural comparisons of mCEACAM1a and mCEACAM1b in MHV-binding loop CC′. A, interactions between MHV S1-NTD (from MHV strain A59) and loop CC′ of mCEACAM1a. mCEACAM1 residues are in green, and S1-NTD residues are in magenta. Hydrophobic interactions are indicated as arrows, and hydrogen bonds are indicated as dotted lines. B, structure of loop CC′ of mCEACAM1b. C, overlay of the CC′ loops from mCEACAM1a and mCEACAM1b.

FIGURE 4.

Structural comparisons of mCEACAM1a and mCEACAM1b in MHV-binding strands βC′ and βC′′. A, interactions between MHV S1-NTD and Arg-47 in strand βC′ of mCEACAM1a. B, interactions between MHV S1-NTD and Met-54, Phe-56, and Asn-59 in strand βC′′ of mCEACAM1a. C, conformation of the side chain of His-47 in strand βC′ of mCEACAM1b. D, conformations of the side chains of Lys-54, Thr-56, and Pro-59 in strand βC′′ of mCEACAM1b.

Structural comparisons of mCEACAM1a and mCEACAM1b in MHV-binding loop CC′. A, interactions between MHV S1-NTD (from MHV strain A59) and loop CC′ of mCEACAM1a. mCEACAM1 residues are in green, and S1-NTD residues are in magenta. Hydrophobic interactions are indicated as arrows, and hydrogen bonds are indicated as dotted lines. B, structure of loop CC′ of mCEACAM1b. C, overlay of the CC′ loops from mCEACAM1a and mCEACAM1b. Structural comparisons of mCEACAM1a and mCEACAM1b in MHV-binding strands βC′ and βC′′. A, interactions between MHV S1-NTD and Arg-47 in strand βC′ of mCEACAM1a. B, interactions between MHV S1-NTD and Met-54, Phe-56, and Asn-59 in strand βC′′ of mCEACAM1a. C, conformation of the side chain of His-47 in strand βC′ of mCEACAM1b. D, conformations of the side chains of Lys-54, Thr-56, and Pro-59 in strand βC′′ of mCEACAM1b. To investigate how the structural and residue differences between mCEACAM1a and mCEACAM1b affect their functions as MHV receptor, we carried out structure-guided mutagenesis and introduced structural and residue features from mCEACAM1b into mCEACAM1a. These structural and residue changes include replacing loop CC′ in mCEACAM1a with its counterpart from mCEACAM1b and substituting residues 47, 54, 56, and 59 in mCEACAM1a with the corresponding residues from mCEACAM1b. A pseudovirus entry assay was performed where a lentiviral vector pseudotyped with the MHV spike protein was used to enter mammalian cells expressing either wild-type or mutant mCEACAM1a on their surface. The results demonstrated that each of the structural and residue features from mCEACAM1b introduced into mCEACAM1a significantly reduced the efficiency of pseudovirus entry (Fig. 5A), reflecting a weaker binding affinity between the MHV spike protein and the mutant mCEACAM1a. Thus, the structural and residue changes from mCEACAM1a to mCEACAM1b reduced the capability of mCEACAM1a to serve as the MHV receptor. These loss-of-function experiments mimic the possible loss-of-function evolution of the mouse Ceacam1a gene under the selective pressure from MHV infections.
FIGURE 5.

Structure-guided mutational and functional characterizations of mCEACAM1a and mCEACAM1b. A, pseudovirus entry efficiency. Lentiviruses pseudotyped with MHV spike protein (from MHV strain A59) were used to enter HEK293T cells expressing mCEACAM1a or mCEACAM1b (wild type or mutant). The relative expression of each receptor was used to normalize pseudovirus entry efficiency. The pseudovirus entry mediated by wild-type mCEACAM1a was taken as 100%. Error bars indicate S.E. (n = 3). Comparisons between wild-type mCEACAM1a and mutant mCEACAM1a or between wild-type mCEACAM1b and mutant mCEACAM1b in their capabilities to mediate pseudovirus entry were done using a two-tailed t test (**, p < 0.01; ***, p < 0.001). The mutant mCEACAM1a molecules contain single mutations R47H, M54K, F56T, Q59P, or loop CC′ from mCEACAM1b (residues 38–43). The mutant mCEACAM1b molecules contain loop CC′ from mCEACAM1a (residues 38–43), loop CC′ and strand βC′ from mCEACAM1a (residues 38–51) or loop CC′, strand βC′, loop C′C′′, and strand βC′′ from mCEACAM1a (residues 38–59). B, protein-protein binding assay. The interactions between MHV S1-NTD and mCEACAM1a or mCEACAM1b (wild type or mutant) were measured using AlphaScreen assay. MHV S1-NTD with a C-terminal His6 tag and mCEAMCAM1a or mCEACAM1b (wild type or mutant) with a C-terminal human IgG4 Fc tag were attached to AlphaScreen Nickel Chelate Donor Beads and Alpha Screen protein A acceptor beads, respectively. Error bars indicate S.E. (n = 3). Comparisons between wild-type mCEACAM1b and mutant mCEACAM1b in their binding affinity for MHV S1-NTD were done using two-tailed t test (***, p < 0.001).

Structure-guided mutational and functional characterizations of mCEACAM1a and mCEACAM1b. A, pseudovirus entry efficiency. Lentiviruses pseudotyped with MHV spike protein (from MHV strain A59) were used to enter HEK293T cells expressing mCEACAM1a or mCEACAM1b (wild type or mutant). The relative expression of each receptor was used to normalize pseudovirus entry efficiency. The pseudovirus entry mediated by wild-type mCEACAM1a was taken as 100%. Error bars indicate S.E. (n = 3). Comparisons between wild-type mCEACAM1a and mutant mCEACAM1a or between wild-type mCEACAM1b and mutant mCEACAM1b in their capabilities to mediate pseudovirus entry were done using a two-tailed t test (**, p < 0.01; ***, p < 0.001). The mutant mCEACAM1a molecules contain single mutations R47H, M54K, F56T, Q59P, or loop CC′ from mCEACAM1b (residues 38–43). The mutant mCEACAM1b molecules contain loop CC′ from mCEACAM1a (residues 38–43), loop CC′ and strand βC′ from mCEACAM1a (residues 38–51) or loop CC′, strand βC′, loop C′C′′, and strand βC′′ from mCEACAM1a (residues 38–59). B, protein-protein binding assay. The interactions between MHV S1-NTD and mCEACAM1a or mCEACAM1b (wild type or mutant) were measured using AlphaScreen assay. MHV S1-NTD with a C-terminal His6 tag and mCEAMCAM1a or mCEACAM1b (wild type or mutant) with a C-terminal human IgG4 Fc tag were attached to AlphaScreen Nickel Chelate Donor Beads and Alpha Screen protein A acceptor beads, respectively. Error bars indicate S.E. (n = 3). Comparisons between wild-type mCEACAM1b and mutant mCEACAM1b in their binding affinity for MHV S1-NTD were done using two-tailed t test (***, p < 0.001). To further explore the functional significance of the structural and residue differences between mCEACAM1a to mCEACAM1b, we introduced the reverse substitutions (i.e. the features from mCEACAM1b introduced into mCEACAM1a). These structural and residue changes include replacing (i) loop CC′, (ii) both loop CC′ and strand βC′, or (iii) all of loop CC′, strand βC′, loop C′C′′, and strand βC′′ in mCEACAM1b with the corresponding regions from mCEACAM1a. In addition to the pseudovirus entry assay, protein-protein binding assays were also performed between the MHV S1-NTD and wild-type or mutant mCEACAM1b. The results showed that all of the structural and residue changes introduced into mCEACAM1b significantly enhanced both the pseudovirus entry efficiency and protein-protein binding affinity (Fig. 5, A and B). Among the mutant mCEACAM1b molecules, the one containing changes in loop CC′, strand βC′, loop C′C′′, and strand βC′′ all together demonstrated the highest MHV receptor activity. More specifically, introduction of the above structural and residue changes into mCEACAM1b restored the receptor activity of mCEACAM1b up to ∼67% of mCEACAM1a based on the pseudovirus entry efficiency and ∼83% of mCEACAM1a based on the protein-protein binding affinity. It is worth noting that incorporation of the above structural and residue features from mCEACM1a did not fully restore the MHV receptor activity of mCEACAM1b to the same level as mCEACAM1a, suggesting that structural and/or residue differences elsewhere in domain D1 may account for the remaining difference between mCEACAM1a and mCEACAM1b in their MHV receptor activities. Nevertheless, these gain-of-function experiments represent the reverse course of the possible loss-of-function evolution of the mouse Ceacam1 gene under the selective pressure from MHV infections.

Discussion

The Red Queen hypothesis states that hosts and pathogens are constantly in an evolutionary arms race. Previous structural studies of the coronavirus/receptor interactions have revealed how coronaviruses have evolved a variety of strategies to recognize different host receptors for host range expansion and cross-species infections (5, 6, 15, 16, 23, 25–31). One of these strategies would be for coronaviruses to steal a host galectin, which became the S1-NTD of the coronavirus spike protein, and use it to bind sugar on host cell surfaces for viral attachment to host cells. Although the S1-NTDs of many contemporary coronaviruses still recognize sugar receptors (32–38), MHV S1-NTD has evolved novel binding affinity for mCEACAM1a protein, which greatly enhanced the infection efficiency of MHV in mouse cells (15, 16) (Fig. 6). Through the above evolution of its spike protein, MHV appears to have gained a significant edge in the evolutionary arms race with mice and become a highly infectious and pathogenic virus for mice.
FIGURE 6.

Proposed evolutionary arms race between MHV and mice. The race includes both MHV-driven evolution of mice (top) and mouse-adapting evolution of MHV (bottom). For a detailed explanation, see the “Discussion” section.

Proposed evolutionary arms race between MHV and mice. The race includes both MHV-driven evolution of mice (top) and mouse-adapting evolution of MHV (bottom). For a detailed explanation, see the “Discussion” section. How have mice evolved to keep pace in the evolutionary arms race with MHV? An interesting observation is that the mouse Ceacam1 gene has diversified into two alleles, Ceacam1a and Ceacam1b. Their protein products, mCEACAM1a and mCEACAM1b, demonstrate different MHV receptor activities; mCEACAM1b is a much poorer MHV receptor than mCEACAM1a. Consequently, mice homozygous for Ceacam1b are highly resistant to death from MHV infections. The selective pressures that drive the evolution of mammalian Ceacam1 genes could come from several sources. Mammalian CEACAM1 functions in many physiological processes including cell-cell adhesion, cell signaling, and cell development (12). In addition, human CEACAM1 is a receptor for a variety of bacterial pathogens (39–41). Thus, the physiological functions of CEACAM1, host evasion of bacterial infections, or some other unknown selective pressures could potentially drive the evolution of mammalian Ceacam1 genes. Although the physiological functions of mCEACAM1b need to be investigated further, mice homozygous for Ceacam1b apparently retain all the normal phenotypes of Ceacam1a, suggesting no major alterations of the physiological functions of mCEACAM1b. Moreover, mouse CEACAM1a is not a receptor for those bacterial pathogens that use human CEACAM1 as the receptor (42). On the other hand, MHV infections can be devastating to infant mice that express mCEACAM1a. Therefore, although other selective pressures cannot be ruled out, MHV infection is likely one of the major driving forces for the evolution of mouse Ceacam1 gene. There are insufficient data on mouse genomes to prove which one of the mouse Ceacam1 alleles evolved first. Based on the above discussion, we suggest that in the mouse population the Ceacam1a allele preceded the appearance and maintenance of the Ceacam1b allele in the presence of MHV epidemics. This study investigates the structural and residue differences between mCEACAM1a and mCEACAM1b that render mCEACAM1b a less efficient MHV receptor. Previous studies identified a critical MHV-binding loop CC′ that diverges in sequence between mCEACAM1a and mCEACAM1b, partially accounting for the different MHV receptor activities of the two proteins (17, 24). The current study reveals the altered conformation of loop CC′ in the crystal structure of mCEACAM1b, providing a structural basis for the critical role of loop CC′ in the different MHV receptor activities of the two mCEACAM1 molecules. Furthermore, this study identifies residue variations in several other MHV-binding regions in mCEACAM1b that render mCEACAM1b a poor MHV receptor. These regions include β-strands C′ and C″ and loop C′C″. Using structure-guided mutational and functional assays, this study shows that the structural and residue substitutions from mCEACAM1a into mCEACAM1b cause a loss of receptor function in mCEACAM1a, whereas the reverse substitutions cause a gain of receptor function in mCEACAM1b. The structural and residue changes from mCEACAM1a to mCEACAM1b mimic the possible loss-of-function evolution in mCEACAM1a during pathogen-driven host evolution, which would result in less severe MHV infections in mice and partial alleviation of the selective pressure from MHV infections (Fig. 6). Therefore, it is likely that through divergent evolution of its Ceacam1 gene to generate the Ceacam1b allele, mice may have gained the ability to keep pace in the evolutionary arms race with MHV for fitness and survival. Overall, the current study provides insight into a possible example of coronavirus-driven evolution of mouse receptor protein, lending molecular evidence to the Red Queen hypothesis.

Materials and Methods

Protein Preparation and Crystallization

mCEACAM1b[D1,D4] (residues 1–202) was expressed and purified as previously described for mCEACAM1a[D1,D4] (15). Briefly, mCEACAM1b[D1,D4] containing a C-terminal His6 tag was expressed in sf9 insect cells using the Bac-to-Bac expression system (Life Technologies) and was secreted into cell culture medium. The protein was harvested and loaded onto a nickel-nitrilotriacetic acid (Ni-NTA) column, eluted from the Ni-NTA column with imidazole, and further purified by gel filtration chromatography on Superdex 200 (GE Healthcare). The protein was concentrated to 10 mg/ml and stored in buffer containing 20 mm Tris, pH 7.2, and 200 mm NaCl. Crystallization of mCEACAM1b[D1,D4] was set up using the sitting drop vapor diffusion method, with 1 μl of protein solution added to 1 μl of well buffer containing 0.1 m Tris, pH 6.2, 10% PEG4000 (v/v), and 1 m NaCl at 20 °C. Crystals of mCEACAM1b[D1,D4] appeared in 2–3 days and were allowed to grow for another 2 weeks before they were harvested and flash-frozen in liquid nitrogen.

Data Collection and Structure Determination

X-ray diffraction data were collected at the Advanced Light Source beamline 4.2.2 and processed using HKL2000 (43). The structure of mCEACAM1b[D1,D4] was determined by molecular replacement using mCEACAM1a[D1,D4] (PDB 3R4D) as the search template. The model was built using Coot (44) and refined with Refmac (45) to a final Rwork and Rfree of 0.216 and 0.273, respectively.

Pseudovirus Entry Assay

Lentiviruses pseudotyped with MHV spike protein were produced as previously described (15). Briefly, pcDNA3.1(+) plasmid encoding MHV spike protein (from MHV strain A59) was co-transfected into HEK293T cells with helper plasmid psPAX2 and reporter plasmid pLenti-GFP at a molar ratio of 1:1:1 using Lipofectamine 2000 (Life Technologies). 48 h post-transfection, the produced pseudovirus particles were harvested and inoculated onto the HEK293T cells expressing mCEACAM1a or mCEACAM1b (wild type or mutant). 48 h post-infection, cells were observed under a fluorescent microscope, and the percentage of GFP-expressing cells was calculated using ImageJ (National Institutes of Health). The expression levels of mCEACAM1a and mCEACAM1b in HEK293T cells were measured by Western blotting using antibodies against their C-terminal C9 tag, quantified using ImageJ, and presented as relative expressions in comparison to wild-type mCEACAM1a. The relative expression of each receptor was used to normalize pseudovirus entry efficiency. The experiments were further repeated twice, and similar results were obtained.

Protein-Protein Binding Assay Using AlphaScreen

The interactions between recombinant MHV S1-NTD and recombinant mCEACAM1a or mCEACAM1b (wild type or mutant) were measured using AlphaScreen as previously described (46, 47). Briefly, 300 nm MHV S1-NTD with a C-terminal His6 tag was mixed with 30 nm mCEAMCAM1a or mCEACAM1b (wild type or mutant) with a C-terminal human IgG4 Fc tag in OptiPlate-96 (PerkinElmer Life Sciences) for 1 h at room temperature. AlphaScreen Nickel Chelate Donor Beads and AlphaScreen protein A acceptor beads (PerkinElmer Life Sciences) were added to the mixture at final concentrations of 20 μg/ml. The mixture was incubated at room temperature for 1 h and protected from light. The assay plates were read in an EnSpire plate reader (PerkinElmer Life Sciences). The experiments were further repeated twice, and similar results were obtained.

Author Contributions

G. P., Y. Y., Z. Q., K. V. H., and F. L. designed the experiments and wrote the manuscript. G. P., Y. Y., J. R. P., and L. X. performed the experiments. G. P., Y. Y., J. R. P., L. X., Z. Q., K. V. H., and F. L. analyzed the data.
  48 in total

1.  Difference in virus-binding activity of two distinct receptor proteins for mouse hepatitis virus.

Authors:  N Ohtsuka; Y K Yamada; F Taguchi
Journal:  J Gen Virol       Date:  1996-08       Impact factor: 3.891

2.  Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor.

Authors:  Guiqing Peng; Dawei Sun; Kanagalaghatta R Rajashankar; Zhaohui Qian; Kathryn V Holmes; Fang Li
Journal:  Proc Natl Acad Sci U S A       Date:  2011-06-13       Impact factor: 11.205

3.  Expression of pathogen-like Opa adhesins in commensal Neisseria: genetic and functional analysis.

Authors:  M Toleman; E Aho; M Virji
Journal:  Cell Microbiol       Date:  2001-01       Impact factor: 3.715

4.  CGM1a antigen of neutrophils, a receptor of gonococcal opacity proteins.

Authors:  T Chen; E C Gotschlich
Journal:  Proc Natl Acad Sci U S A       Date:  1996-12-10       Impact factor: 11.205

5.  Arms races between and within species.

Authors:  R Dawkins; J R Krebs
Journal:  Proc R Soc Lond B Biol Sci       Date:  1979-09-21

Review 6.  Receptor recognition mechanisms of coronaviruses: a decade of structural studies.

Authors:  Fang Li
Journal:  J Virol       Date:  2014-11-26       Impact factor: 5.103

Review 7.  Structure, Function, and Evolution of Coronavirus Spike Proteins.

Authors:  Fang Li
Journal:  Annu Rev Virol       Date:  2016-08-25       Impact factor: 10.431

8.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

9.  Structure of MERS-CoV spike receptor-binding domain complexed with human receptor DPP4.

Authors:  Nianshuang Wang; Xuanling Shi; Liwei Jiang; Senyan Zhang; Dongli Wang; Pei Tong; Dongxing Guo; Lili Fu; Ye Cui; Xi Liu; Kelly C Arledge; Ying-Hua Chen; Linqi Zhang; Xinquan Wang
Journal:  Cell Res       Date:  2013-07-09       Impact factor: 25.617

10.  Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer.

Authors:  Alexandra C Walls; M Alejandra Tortorici; Berend-Jan Bosch; Brandon Frenz; Peter J M Rottier; Frank DiMaio; Félix A Rey; David Veesler
Journal:  Nature       Date:  2016-02-08       Impact factor: 49.962

View more
  8 in total

1.  An Evolutionary Insertion in the Mxra8 Receptor-Binding Site Confers Resistance to Alphavirus Infection and Pathogenesis.

Authors:  Arthur S Kim; Ofer Zimmerman; Julie M Fox; Christopher A Nelson; Katherine Basore; Rong Zhang; Lorellin Durnell; Chandni Desai; Christopher Bullock; Sharon L Deem; Jonas Oppenheimer; Beth Shapiro; Ting Wang; Sara Cherry; Carolyn B Coyne; Scott A Handley; Michael J Landis; Daved H Fremont; Michael S Diamond
Journal:  Cell Host Microbe       Date:  2020-02-18       Impact factor: 21.023

2.  Human CEACAM1 N-domain dimerization is independent from glycan modifications.

Authors:  Meagan Belcher Dufrisne; Nicole Swope; Marissa Kieber; Jeong-Yeh Yang; Ji Han; Jason Li; Kelley W Moremen; James H Prestegard; Linda Columbus
Journal:  Structure       Date:  2022-02-25       Impact factor: 5.871

3.  Cryo-EM structure of infectious bronchitis coronavirus spike protein reveals structural and functional evolution of coronavirus spike proteins.

Authors:  Jian Shang; Yuan Zheng; Yang Yang; Chang Liu; Qibin Geng; Chuming Luo; Wei Zhang; Fang Li
Journal:  PLoS Pathog       Date:  2018-04-23       Impact factor: 6.823

Review 4.  SARS-CoV-2 Evolutionary Adaptation toward Host Entry and Recognition of Receptor O-Acetyl Sialylation in Virus-Host Interaction.

Authors:  Cheorl-Ho Kim
Journal:  Int J Mol Sci       Date:  2020-06-26       Impact factor: 5.923

Review 5.  Research progress on coronavirus S proteins and their receptors.

Authors:  Hao-Wen Yuan; Hong-Ling Wen
Journal:  Arch Virol       Date:  2021-03-28       Impact factor: 2.574

Review 6.  Kathryn V. Holmes: A Career of Contributions to the Coronavirus Field.

Authors:  Aurelio Bonavia; Samuel R Dominguez; Gabriela Dveksler; Sara Gagneten; Megan Howard; Scott Jeffers; Zhaohui Qian; Mary Kathryn Smith; Larissa B Thackray; Dina B Tresnan; David E Wentworth; David R Wessner; Richard K Williams; Tanya A Miura
Journal:  Viruses       Date:  2022-07-20       Impact factor: 5.818

7.  Glycine 29 Is Critical for Conformational Changes of the Spike Glycoprotein of Mouse Hepatitis Virus A59 Triggered by either Receptor Binding or High pH.

Authors:  Dan Mi; Xiuyuan Ou; Pei Li; Guiqing Peng; Yan Liu; Ruixuan Guo; Zhixia Mu; Fang Li; Kathryn Holmes; Zhaohui Qian
Journal:  J Virol       Date:  2019-09-30       Impact factor: 5.103

8.  Evidence for the loss and recovery of SLAMF9 during human evolution: implications on Dollo's law.

Authors:  Maegan K Murphy; Justin T Moon; Alexis T Skolaris; Joseph A Mikulin; Timothy J Wilson
Journal:  Immunogenetics       Date:  2021-02-22       Impact factor: 2.846

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.