Dhanusha Yesudhas1, Ambuj Srivastava1, Masakazu Sekijima2, M Michael Gromiha1,2. 1. Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India. 2. School of Computing, Tokyo Tech World Research Hub Initiative (WRHI), Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan.
Abstract
The 2019-novel coronavirus also known as severe acute respiratory syndrome-coronavirus 2 (SARS-CoV-2) is a common threat to animals and humans, and is responsible for the human SARS pandemic in 2019 to 2021. The infection of SARS-CoV-2 in humans involves a viral surface glycoprotein named as spike proteins, which bind to the human angiotensin-converting enzyme 2 (ACE2) proteins. Particularly, the receptor binding domains (RBDs) mediate the interaction and contain several disordered regions, which help in the binding. Investigations on the influence of disordered residues/regions in stability and binding of spike protein with ACE2 help to understand the disease pathogenesis, which has not yet been studied. In this study, we have used molecular-dynamics simulations to characterize the structural changes in disordered regions of the spike protein that result from ACE2 binding. We observed that the disordered regions undergo disorder-to-order transition (DOT) upon binding with ACE2, and the DOT residues are located at functionally important regions of RBD. Although the RBD is having rigid structure, DOT residues make conformational rearrangements for the spike protein to attach with ACE2. The binding is strengthened via hydrophilic and aromatic amino acids mainly present in the DOTs. The positively correlated motions of the DOT residues with its nearby residues also explain the binding profile of RBD with ACE2, and the residues are observed to be contributing more favorable binding energies for the spike-ACE2 complex formation. This study emphasizes that intrinsically disordered residues in the RBD of spike protein may provide insights into its etiology and be useful for drug and vaccine discovery.
The 2019-novel coronavirus also known as severe acute respiratory syndrome-coronavirus 2 (SARS-CoV-2) is a common threat to animals and humans, and is responsible for the human SARS pandemic in 2019 to 2021. The infection of SARS-CoV-2 in humans involves a viral surface glycoprotein named as spike proteins, which bind to the human angiotensin-converting enzyme 2 (ACE2) proteins. Particularly, the receptor binding domains (RBDs) mediate the interaction and contain several disordered regions, which help in the binding. Investigations on the influence of disordered residues/regions in stability and binding of spike protein with ACE2 help to understand the disease pathogenesis, which has not yet been studied. In this study, we have used molecular-dynamics simulations to characterize the structural changes in disordered regions of the spike protein that result from ACE2 binding. We observed that the disordered regions undergo disorder-to-order transition (DOT) upon binding with ACE2, and the DOT residues are located at functionally important regions of RBD. Although the RBD is having rigid structure, DOT residues make conformational rearrangements for the spike protein to attach with ACE2. The binding is strengthened via hydrophilic and aromatic amino acids mainly present in the DOTs. The positively correlated motions of the DOT residues with its nearby residues also explain the binding profile of RBD with ACE2, and the residues are observed to be contributing more favorable binding energies for the spike-ACE2 complex formation. This study emphasizes that intrinsically disordered residues in the RBD of spike protein may provide insights into its etiology and be useful for drug and vaccine discovery.
Human respiratory disease with symptoms of pneumonia has been diagnosed at the end of 2019, in Wuhan city, China, which is identified as COVID‐19 caused by severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2). Coronaviruses have been known to cause respiratory diseases and gastroenteritis in animals and humans.
The SARS‐CoV and SARS‐CoV‐2 belong to the betacoronavirus family, and share 80% whole genome sequence similarity.At the time of viral invasion, spike protein (~1300 amino acid residues) is cleaved by host cell proteases into S1 (the peripheral fragment) and S2 (membrane‐spanning fragment) subunits. The S1 subunit contains the receptor binding domain (RBD), which facilitates the interactions with host cell receptors whereas, the S2 subunit fuses with host cell membrane and releases the viral genome into the cell
,
(Figure S1). The heptads repeat regions N‐HR, C‐HR and the fusion peptides are involved in membrane fusion. The viral entry to the host cells takes place by an endocytic and nonendosomal pathways.
The spike protein priming by furin at the S1/S2 site is important and unique in SARS‐CoV‐2 membrane fusion, which is absent in SARS‐CoV.
Spike proteins have a good binding affinity for sugar receptors of human cells, which uses them as a mechanism of cell entry.
Notably, an SARS‐CoV‐2 has a stronger affinity to human ACE2 than an SARS‐CoV. SARS‐CoV‐2 also has a higher pathogenicity and mortality rate than SARS‐CoV. The clinical spectrum of SARS‐CoV‐2 is broader than SARS‐CoV including asymptomatic infections, mild upper respiratory tract illness, severe viral pneumonia with respiratory failure and death. In addition, SARS‐CoV‐2 infected patients suffer prolonged illness and have a low neutralizing antibody capacity, suggesting a greater need for vaccine interventions.The intrinsic disorder regions (IDRs) are the regions, which have a dynamic ensemble of conformations that do not adopt stable three‐dimensional structure in physiological conditions.
IDRs are known to be involved in many biological functions, including signaling, regulation and binding molecules including proteins, DNA, RNA and small molecules. IDPs are known to be associated with various diseases including neurological disorders and cancer.
,
IDPs also tune their binding affinity for specific and non‐specific binding with other molecules by changing its conformation.
,
They also act as hub proteins in various signaling networks, bind multiple partners and are involved in many pathways.
,IDRs are reported in SARS and other coronaviruses, for example, in human coronavirus 229E (hCoV‐229E) an intrinsically disordered domain present near the C‐terminal tail peptide, plays an important role in dimer‐dimer association. Similarly, coronavirus HKU1 has a large structurally conserved linker loop (amino acids 428‐587), which is partially disordered.
The S2 subunit of HKU1 also shows the presence of disordered residues at its protease cleavage site.
The structure reported for the recent SARS‐CoV‐2 spike protein, shows that S1/S2 junction is in a disordered, solvent‐exposed loop.
Based on the literature reports, more number of databases and tools have been developed for predicting the disordered regions in proteins.
,
,
For example, the DisProt database contains the disorder content of spike and other viral protein structures.Wrapp et al determined the structure of the pre‐fusion spike trimeric protein using cryogenic electron microscopy at a resolution of 3.46 Å (PDB ID: 6VSB).
In this trimeric protein, many residues are missing their structural coordinate information in the RBD (residues 329‐334, 444‐448, and 455‐490), especially in “A” chain, which has an elevated structure and is primarily responsible for binding. The 329 to 334 residues are present at the N‐terminal of RBD and disordered in free protein as well as in the complex and are termed as DR (disordered residues). However, the intrinsic disordered residues 444 to 448 and 455 to 490 have stable structure upon ACE2 binding (PDB ID: 6M0J)
and are termed as disorder‐to‐order transition (DOT) residues. Despite the lack of a stable three‐dimensional structure of these regions in free protein, the DOTs facilitate the spike‐ACE2 recognition and help for both the proteins interactions.On the other hand, coronavirus spike protein is a glycoprotein, and the glycosylation site has high flexibility and variable secondary structures.
SARS‐CoV‐2 spike protein exists in both closed and open states, referred as inactive and active states of spike protein. Casalino et al suggested that the glycans help in conformational change from closed‐to‐open state, allowing the RBD to bind with ACE2 receptors.The infection and pathogenicity are often associated with the disordered regions in the viral proteins.
,
,
,
However, the role of disordered residues/regions in stability and binding of spike protein with ACE2, which could help in understanding the disease pathogenesis, has not yet been explored. In this work, we investigate the importance of disordered residues in spike‐ACE2 recognition using molecular dynamics (MD) simulations. We observed that the DOTs in the free form of the spike protein experience very high fluctuations more than the complex structure. The DOTs undergo a structural transition upon binding with ACE2, which facilitates the stable compact complex structure. In addition, DOT regions are also contributing significant binding energy upon interactions with ACE2 receptors. Our study helps in understanding the functionality of IDRs in spike protein, and its interactions with host cell proteins.
METHODS
Disordered residues in spike protein
A region is considered to be disordered if at least five continuous non‐terminal residues are missing in the PDB structure.
,
The DOT residues in RBD domain are obtained by comparing the missing residues from free protein (PDB ID: 6VSB) and spike‐ACE2 complex (PDB ID: 6M0J). We observed that the A chain of 6VSB is an elevated structure and it contains missing residues from 444 to 448 and 455 to 490, which obtained an ordered structure upon binding to the ACE2 receptor (PDB ID: 6M0J). Thus, in this study, the residues 444 to 448 are considered as DOT1 and 455 to 490 as DOT2.
Modeling
To obtain the structures for missing residues of the RBD present in free spike protein we have used modeler version 9.24
software and generated 10 models and selected the model, which has the lowest discrete optimized protein energy score. The missing residues (DOT residues) of the prefusion structure were modeled using 6VSB as a template.
MD simulations
MD simulations are performed for free spike protein and the complex with ACE2 for 100 ns using AMBER18 tools with FF14SB
and GAFF force fields.
We have repeated the simulations three times to evaluate the consistency of results. The repetitions have been carried out with a seed generated from a random number generator and hence the initial velocities for each system are different. Molecule is solvated using TIP3PBOX water model with edge of the box at least 10 Å away from the solute molecules.
The Na + ions are added to neutralize the system in order to maintain the physiological condition. Energy minimization of each molecule is done for 50 000 steps using the steepest descent minimization with the step size of 2 fs at constant temperature (300 K), and pressure (1 atm). The description of the simulation systems has been listed in Table S1.
Trajectory analysis
The trajectories obtained from simulations are visualized using VMD,
Pymol,
and Chimera.
Properties such as root mean square fluctuation (RMSF) and root mean square deviation (RMSD) are calculated using “rms” and “atomicfluct” commands available in CPPTRAJ program, respectively. The average minimum distance is calculated by plotting the distance of each residue in spike RBD with ACE2 during the simulation using “nativecontacts” command and generating average value using a python program. The interaction energy between spike and ACE2 is calculated using the the molecular mechanics‐Poisson Boltzmann surface area (MMPBSA)
program for the last 2 ns. The dynamic residue movement is monitored via the dynamic cross‐correction map (DCCM) using the Bio3D package.
Binding free energy calculation
MM‐PBSA method is applied to calculate the binding free energy between RBD of spike protein and the ACE2 receptor. The energy contribution of each residue toward binding is measured.
where E
, E
, and E
are the molecular mechanics (MM) energy terms from bonded (bond length, angle, and dihedral), electrostatic, and van der Waals interactions, respectively. G
and G
are the polar and nonpolar contributions, respectively, to the solvation free energies. TS is the absolute temperature, T, multiplied by the entropy, S.
where ∆G
represents the free energy of a protein‐protein complex, and ∆G
and G
are the free energies of RBD and ACE2 proteins, respectively.
RESULTS AND DISCUSSION
Recognition and interaction of spike with ACE2 is regulated by the RBD (319‐541), in which residues from 436 to 506 are termed as receptor binding motif (RBM) because these residues are actively participating in the interactions. The DOT1 (residues 444‐448) and DOT2 (residues 455‐490) are grouped in the RBM motif which is responsible for ACE2 recognition. As shown in Figures 1 and S1, the DOT1 and DOT2 are located in the interface of the spike protein and ACE2. The crucial interacting DOT residues as well the residues, which are important to maintain the binding conformation, are heighted in Figure 1. The DOT regions help in providing the shape complementarity, strength and stability. In addition, the results from the previous studies state that the disordered regions of viral proteins are associated with their virulence.
,
,
,
Hence, atomic‐level MD simulations studies of disordered regions in spike protein can provide better insights into the host tropism and transmission capacity of SARS‐CoV‐2.
FIGURE 1
Structure of spike receptor binding domain (RBD)‐ACE2 complex. A, Important interacting residues (Leu455, Glu484, Phe486, and Tyr489) at the interface, disorder‐to‐order transition (DOT) regions (DOT1 and DOT2) are highlighted. B, The interface residues between spike RBD and angiotensin‐converting enzyme (ACE). Gly, Phe, and Tyr residues in DOT1 and DOT2 are represented in sticks and colored with silver, red, and green, respectively; the conserved residues are marked in yellow
Structure of spike receptor binding domain (RBD)‐ACE2 complex. A, Important interacting residues (Leu455, Glu484, Phe486, and Tyr489) at the interface, disorder‐to‐order transition (DOT) regions (DOT1 and DOT2) are highlighted. B, The interface residues between spike RBD and angiotensin‐converting enzyme (ACE). Gly, Phe, and Tyr residues in DOT1 and DOT2 are represented in sticks and colored with silver, red, and green, respectively; the conserved residues are marked in yellow
Flexibility of disordered residues in free and complex structure of spike protein
The flexibility for the overall structure of RBD spike protein in the free and complex form is observed from the backbone RMSD during the simulations. For the free protein, a major global conformational change occurs around 50 ns, increasing the RMSD up to 8 Å (Figure S2). This conformational change is principally caused by the C‐terminal loop structures of the DOT regions. We confirm this observation by plotting the RMSF values and superimposed structures of RBD domain at different time frames (30, 50 and 80 ns). The RMSF calculations showed that the C‐terminal of DOT2 region is highly flexible from 48 to 52 ns of simulations, and the superimposed structures show that DOT regions are highly deviated while the other regions are marginally deviating in different time frames (Figure S3).Overall, both the simulations reached an equilibrium state, and the RMSD value of the complex structure shows lesser fluctuations (3 Å) than the free form (8 Å) (Figure S2). The unconstrained DOT regions have an impact on the overall structural stability when the protein binds with its partner. Thus, the RMSD of the DOT residues alone was calculated throughout the trajectories in order to observe its dynamic nature during simulation. Figure 2A,B displays the RMSDs of DOT1 and DOT2, respectively, of the complex and free protein structures. The RMSD of the DOTs in complex structure showed less dynamics, and maintains the stable trajectories throughout the simulation. However, the DOTs of the free protein show fluctuating movement, especially the DOT2 residues experience deviations up to 8 Å distance (Figure 2B). In addition, the RMSD and RMSF results from each simulation repetition have been provided for better insights (Figure S4). Moreover, the compactness of the structures is computed from the radius of gyration analysis and the results showed that RBD in both free and complex structures have similar compactness throughout the simulations. The results are consistent in all three repetitions of simulations (Figure S5).
FIGURE 2
Overall backbone fluctuation of disorder‐to‐order transition (DOT) residues throughout the simulation. A, Root mean square deviation (RMSD) of DOT1 in free and complex structure. B, RMSD of DOT2 in free and complex structure. The blue and gray shades represent the standard deviations
Overall backbone fluctuation of disorder‐to‐order transition (DOT) residues throughout the simulation. A, Root mean square deviation (RMSD) of DOT1 in free and complex structure. B, RMSD of DOT2 in free and complex structure. The blue and gray shades represent the standard deviationsSimilarly, the RMSF analysis explains the fluctuations of each residue during the simulation. The fluctuations of residues show that the DOTs in the free protein structure are experiencing high fluctuations, especially in the DOT2, some of the residues (470‐490) are fluctuating up to 9 Å distance (Figure 3). However, the DOTs in the complex structure show low fluctuations, particularly the DOT2 experiences a maximum of 2.5 Å fluctuation for few residues (476‐483) (Figure 3). Based on the RMSD and RMSF calculations, it clearly explains that the DOT regions greatly alter the dynamic stability of structure. The interactions of DOT residues with ACE2 make the complex structure compact and stable, thereby experiencing less fluctuation.
FIGURE 3
The residue fluctuation of spike protein receptor binding domain (RBD). The DOT1 and DOT2 residues are marked by black line below the X‐axis. The, secondary structure information is displayed above the X‐axis. The symbols α, β, η, and T represent α‐helix, β‐strand, 310‐helix, and turn, respectively
The residue fluctuation of spike protein receptor binding domain (RBD). The DOT1 and DOT2 residues are marked by black line below the X‐axis. The, secondary structure information is displayed above the X‐axis. The symbols α, β, η, and T represent α‐helix, β‐strand, 310‐helix, and turn, respectively
Contributions of ordered residues vs disordered residues
SARS‐CoV and SARS‐CoV‐2 do not fall in the same clade and the whole genome sequence similarity is less than 80%. The major sequence differences arise from their RBD sequences.
Regardless of this sequence diversity, the SARS‐CoV and SARS‐CoV‐2 are having 100% structure identities in the RBD domain, and they share few conserved residues. Based on the interactions that exist between the RBD domain and ACE2, residues such as Tyr449, Gly502, and Tyr505 are observed to be conserved among SARS‐CoV, NL‐63, and SARS‐CoV‐2.
,
All these conserved residues are located at the flanking regions of DOTs. Studies have been reported that often the DOTs are located near the actual binding sites.The average minimum distance calculation of each residue of a spike protein with ACE2 receptor is measured (Figure 4). Most of the DOT residues are maintaining less than 5 Å distance from the corresponding ACE2 residues throughout the simulation. Apart from the DOTs, the ordered residues in the interface (436‐443; 449‐454, and 491‐506), which are located adjacent to the DOTs, are showing lesser distance with ACE2 (Table 1). Particularly, the hot spot residues (Lue455, Glu484, Phe486, Asn487, Tyr489, Gln493, Gly496, Asn501, and Gly502), which are mentioned in the literature
,
,
,
maintain less than 3 Å distance from ACE2 (Table 1 and Figure 1). The average minimum distance between the spike RBD and ACE2 has been computed for the three repetitions of simulations, and the results are observed to be consistent (Figure S6). The location and the flexibility of these DOT residues provide better shape complementary for the spike‐ACE2 recognition, and maintain its binding conformation for a longer time
demonstrated that RBM is an intrinsically flexible region, which is recognized by the ACE2 receptor, and the binding region is either in a closed (active) or open (inactive) state. The DOTs, which are responsible for the compact and stable structure, and their interactions with ACE2 support the open conformation.
FIGURE 4
A, The average minimum distance (deviations are indicated with error bars) and, B, the decomposition energy of each residue between the receptor binding domain (RBD) residues against angiotensin‐converting enzyme 2 (ACE2) throughout the simulations. Secondary structures of residues are the same as in Figure 3
TABLE 1
Residues, which are less than 3 Å distance from ACE2 and their energetic contribution upon binding
The key residues reported in the literature. Conserved residues among SARS‐CoV‐2, NL‐63, and SARS‐CoV are in bold.
A, The average minimum distance (deviations are indicated with error bars) and, B, the decomposition energy of each residue between the receptor binding domain (RBD) residues against angiotensin‐converting enzyme 2 (ACE2) throughout the simulations. Secondary structures of residues are the same as in Figure 3Residues, which are less than 3 Å distance from ACE2 and their energetic contribution upon bindingAbbreviations: ACE2, angiotensin‐converting enzyme 2; DOT, disorder‐to‐order transition; SARS‐CoV‐2, severe acute respiratory syndrome‐coronavirus 2.The key residues reported in the literature. Conserved residues among SARS‐CoV‐2, NL‐63, and SARS‐CoV are in bold.Likewise, the binding free energy contribution from each residue of the RBD domain is calculated using AMBER‐MMPBSA. As expected, the residues, which are maintaining lesser distance with ACE2 are showing more favorable interaction energies. The ordered and the disordered residues and their interaction energies are given in Figure 4B and are also listed in Table 1. Particularly, the residues located at the flanking regions of DOTs are shown to be contributing more favorable interaction energies. Among the ordered residues in the interface, seven residues are maintaining less than 3 Å distance with ACE2 and contributing more for the binding energy whereas among DOT residues, six of them are contributing favorable binding free energies. The alanine scanning mutagenesis of DOT residues has been performed to examine the change in binding free energy of the complex. The mutation of DOT residues Leu455, Phe456, Phe486, and Tyr489 decreased the binding affinity between spike RBD and ACE2, which confirms the importance of these residues in maintaining the binding affinity of the complex (Figure S7).
Interactions of DOTs with ACE2 receptor
The DOT2, which makes the longest interacting loop region, is rich in Phe (11%) residues. This aromatic amino acid makes bonded and nonbonded interactions with the other aromatic residues of the partner ACE2 protein (Figure 1B). The residues involved in hydrogen bond interactions between the DOTs and ACE2 receptors are reported in Table 2, and the other hydrophobic interactions are listed in Table S2. Mainly, the residues present in RBD of spike protein Ala475, Gly476, Asn487, and Tyr489 are making interactions with Ser19, Gln24 and Tyr83 of ACE2 (Table 2). Part of the experimentally determined interacting residues which are reported in previous studies are present in the DOT regions (Glu484, Asn487, and Tyr489), and in the flanking region of the DOTs (Figure 1).
TABLE 2
Hydrogen bond and salt bridge interacting residues between DOTs and ACE2
The experimentally reported residues. The donor and the acceptor atoms are also included.
Hydrogen bond and salt bridge interacting residues between DOTs and ACE2Lys458 (NZ)‐Glu23 (OE2)Glu484 (OE1)
‐Lys31 (NZ)Abbreviations: ACE2, angiotensin‐converting enzyme 2; DOT, disorder‐to‐order transition; SARS‐CoV‐2, severe acute respiratory syndrome‐coronavirus 2.The experimentally reported residues. The donor and the acceptor atoms are also included.Both SARS‐CoV and SARS‐CoV‐2 RBM (436‐506) are enriched in Gly (11%), Tyr (8%), and Phe (7%) residues. Based on previous studies, these three residues are playing an important role in interactions with ACE2.
In the current study, these residues form hydrogen bonds and other non‐covalent interactions (Tables 2 and S2). Notably, the decomposition energy calculations also stated that glycine, tyrosine, and phenylalanine residues are contributing favorable affinities for the complex structure. However, a recent study revealed that in SARS‐CoV RBD, the tyrosine residues, which are present at the interface (Tyr442, Tyr475, and Tyr491), are not actively taking part in the interaction with ACE2.
Whereas in SARS‐CoV‐2, even though the Tyr and Phe residues of the DOT2 are present in the loop, they are still stable via pi‐pi interactions of their phenyl ring structures, for example, Phe486 of spike protein interacts with Tyr83 of ACE2. These residues are actively taking part in intraprotein and interprotein interactions. Figure 1B represents the binding interface, in which all the Gly, Phe, and Tyr residues in DOT2 are marked in silver, red, and green, respectively, and the conserved residues are also marked in yellow. The side chains of Tyr and Phe of spike protein make favorable electrostatic interactions with α1 helix of ACE2, especially with Tyr and Gln amino acids (Table 2). Followed by Tyr and Phe, Gly is also abundant in the RBD domain of both SARS‐CoV and SARS‐CoV‐2 spike proteins and is located at the binding site. The prevalence of aromatic‐aromatic interactions and presence of DOT regions in SARS‐CoV‐2 makes it different from SARS coronavirus.Moreover, recently, Ju et al identified antibodies that potentially neutralize SARS‐CoV‐2, and the residues Lys444, Gly446, Gly447, Asn448, Tyr449, Asn450, Lys452, Val483, Glu484, Gly485, Phe490, and Ser494 are epitope residues.
Interestingly, all these residues are in the DOT regions of the RBD receptor‐binding motif. This implies the importance of targeting DOT regions/residues in designing the drugs and vaccines.
Cross‐correlation map of DOTs of spike protein
In addition, the DCCM of RBD residues in free and complex structures revealed that most of the residues in the complex structure experience positive correlation motion with nearby residues which are represented in blue boxes. The pink and white color regions show negative and no correlation motion, respectively. The DOTs, especially DOT2 in the complex structure, experience positive correlation with the nearby residues, (highlighted with a red circle) which shows its stability and compactness upon ACE2 binding (Figure 5). Other than the DOT residues, the N‐terminal region of the spike RBD domain in complex structure shows positive correlation with other residues. Since the N‐terminal also took part in interactions with the ACE2 receptor, the ordered residues in that N‐terminal are also showing positive correlated motion. However, in Figure 5B, the DOT residues of the free protein (marked in red circle) are not having positively correlated motion with the nearby residues, which explains its higher structural flexibility (Figure 2). Similarly, the residue contact plot also explained that the crucial DOT residues are maintaining the interactions with ACE2 throughout the simulations (Figure S8).
FIGURE 5
A, Dynamic cross‐correction map (DCCM) of receptor binding domain (RBD) residues in the complex structure and it represents that the disorder‐to‐order transition (DOT) residues exist in positive correlative motions with nearby residues, which suggest the stable interactions and compact structure of the complex. B, DCCM of RBD residues in the free form structure
A, Dynamic cross‐correction map (DCCM) of receptor binding domain (RBD) residues in the complex structure and it represents that the disorder‐to‐order transition (DOT) residues exist in positive correlative motions with nearby residues, which suggest the stable interactions and compact structure of the complex. B, DCCM of RBD residues in the free form structureThe host cell invasion of SARS‐CoV‐2 is happening by glycosylated spike protein. This fusion protein exists in metastable prefusion form, which undergoes more structural rearrangements to fuse with the host cell membrane. Receptor binding destabilizes the prefusion trimer, and the S1 subunit undergoes open and close conformational changes which represent the receptor accessible (less stable conformation) and inaccessible states, respectively.
,
,
,
,
,
,
These conformational rearrangements could be possibly controlled by the DOT residues which are present in the interface between the RBD domain and the ACE2. The positive correlation motion of residues in the complex structure, especially the DOTs and N‐terminal region of RBD provides the precise geometry for the binding and explains the stable and compact structure of spike‐ACE2 complex.Hence, focusing on the behavior of disordered residues of spike protein in free and complex forms may shed some light to identify the reason for transmission and higher binding affinities. The flexibility and the conformational changes adapted by these DOTs upon binding with ACE2 have to be focused while identifying drugs.
SUMMARY
The intrinsically flexible RBD recognizes the ACE2 receptor, and the binding region is either in an open or a closed state. The ACE2 recognition by the RBD is mainly based on the conformational state adapted by the RBD, and the conformational switch is governed by the highly flexible residues. Due to the highly flexible nature of DOTs, the DOT residues can govern the precise geometry for ACE2 binding. The DOT of the residues and their higher number of aromatic and ionic interactions make the complex structure more stable and experience higher binding energies. The positive correlative motion of DOTs with the nearby residues also explains the compact and stable structure of the RBD with ACE2. The uniquely reported DOTs in SARS‐CoV‐2 spike protein, which are located in the binding interface, require attention for the development of small molecule inhibitors. In addition, these spike protein DOT residues act as potential epitope residues for designing new antibodies. Thus, an atomic level characterization of the spike protein and its unique DOTs structure can guide for mechanism study, which is associated with its transmission. The inhibition of viral entry needs to focus on the structural aspects of DOTs in spike RBD domain, which may help in therapeutic applications for COVID‐19.
CONFLICT OF INTEREST
The authors declare no conflicts of interest.
AUTHOR CONTRIBUTIONS
Dhanusha Yesudhas and Ambuj Srivastava: Designed and carried out the work. Dhanusha Yesudhas, Ambuj Srivastava, Masakazu Sekijima, and M. Michael Gromiha: Contributed in discussions. Dhanusha Yesudhas and Ambuj Srivastava: Wrote the manuscript. Masakazu Sekijima and M. Michael Gromiha: Edited the manuscript. All authors read and approved the manuscript.
PEER REVIEW
The peer review history for this article is available at https://publons.com/publon/10.1002/prot.26088.Appendix
S1: Supporting InformationClick here for additional data file.
Authors: Barry J Grant; Ana P C Rodrigues; Karim M ElSawy; J Andrew McCammon; Leo S D Caves Journal: Bioinformatics Date: 2006-08-29 Impact factor: 6.937
Authors: James M Krieger; Giuliana Fusco; Marc Lewitzky; Philip C Simister; Jan Marchant; Carlo Camilloni; Stephan M Feller; Alfonso De Simone Journal: Biophys J Date: 2014-04-15 Impact factor: 4.033