| Literature DB >> 33527091 |
Gajendra Kumar Azad1, Parimal Kumar Khan1.
Abstract
Severe acquired respiratory syndrome coronavirus 2 (SARS-CoV-2) rapidly spread worldwide and acquired multiple mutations in its genome. Orf3a, an accessory protein encoded by the genome of SARS-CoV-2, plays a significant role in viral infection and pathogenesis. In the present in-silico study, 15,928 sequences of Orf3a reported worldwide were compared to identify variations in this protein. Our analysis revealed the occurrence of mutations at 173 residues of Orf3a protein. Subsequently, protein modelling was performed that revealed twelve mutations which can considerably affect the stability of Orf3a. Among the 12 mutations, three mutations (Y160H, D210Y and S171L) also lead to alterations in secondary structure and protein disorder parameters of the Orf3a protein. Further, we used predictive tools to identify five promising epitopes of B-cells, which resides in the mutated regions of Orf3a. Altogether, our study sheds light on the variations occurring in Orf3a that might contribute to alteration in protein structure and function.Entities:
Keywords: B-cell epitopes; Mutations; Orf3a; Protein disorder; Protein stability; SARS-CoV-2
Year: 2021 PMID: 33527091 PMCID: PMC7839395 DOI: 10.1016/j.bbrep.2021.100933
Source DB: PubMed Journal: Biochem Biophys Rep ISSN: 2405-5808
Fig. 1A) The cartoon structure of Orf3a. The red color represents helical structure while yellow color represents beta sheets. B) The various properties of residues of Orf3a. The data were obtained from Innovagen's peptide calculator (https://pepcalc.com/). The colour coded display of amino acid classification and peptide hydropathy plot are shown. C) The sequence of SARS-CoV-2 Orf3a protein. The mutated amino acids are shown in red color. The number denotes the position of residues in Orf3a. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
List of Orf3a mutations identified in this study. The sequence of SARS-CoV-2 Orf3a protein reported till 14th Sept 2020 was aligned with the sequence from Wuhan (wet sea food market) SARS-CoV-2. The mutations were recognized by amino acid sequence alignment by CLUSTAL Omega.
| S. No. | Mutation | S. No. | Mutation | S. No. | Mutation |
|---|---|---|---|---|---|
| 1 | D2G, D2Y | 59 | L83F | 117 | S171L |
| 2 | M5V | 60 | L85F | 118 | G172V, G172C |
| 3 | R6T | 61 | L86W | 119 | G174D |
| 4 | I7T | 62 | F87L | 120 | T175K, T175I |
| 5 | T9K, T9I | 63 | V88L, V88A | 121 | T176I |
| 6 | T12N | 64 | T89I | 122 | S177I |
| 7 | V13L, V13A, V13I | 65 | V90F, V90I | 123 | P178S |
| 8 | T14I | 66 | S92L | 124 | H182Y |
| 9 | L15F | 67 | H93Y | 125 | Y184H |
| 10 | K16N | 68 | L94P, L94I, L94F | 126 | Q185H |
| 11 | Q17R | 69 | L95F | 127 | G188C |
| 12 | G18S, G18C, G18V, G18D | 70 | L96F | 128 | Y189C |
| 13 | I20T | 71 | V97A, V97F | 129 | E191K |
| 14 | K21Q, K21N | 72 | A99T, A99S, A99V | 130 | W193C, W193R |
| 15 | D22Y | 73 | G100L, G100C, G100F, G100V | 131 | E194Q |
| 16 | A23S | 74 | L101F | 132 | S195Y |
| 17 | P25L, P25S | 75 | A103S, A103V | 133 | G196V, G196R |
| 18 | S26L, S26P | 76 | P104L, P104S | 134 | V197L, V197I |
| 19 | D27Y, D27H | 77 | F105L | 135 | V201I |
| 20 | A31T | 78 | L106F | 136 | V202L |
| 21 | T32I | 79 | L108F | 137 | T208A |
| 22 | A33E, A33S | 80 | A110S, A110V | 138 | D210Y |
| 23 | T34A | 81 | L111S | 139 | Y211C |
| 24 | I35T | 82 | V112F, V112L | 140 | Q213H |
| 25 | P36L | 83 | F114C | 141 | Y215H |
| 26 | Q38E, Q38P | 84 | Q116H | 142 | S216P |
| 27 | A39T | 85 | I118V | 143 | T217A |
| 28 | S40P, S40L | 86 | N119H | 144 | Q218R |
| 29 | L41I, L41H, L41F | 87 | F120L | 145 | L219F, L219S, L219V |
| 30 | P42S, P42L, P42R | 88 | R122K, R122I | 146 | S220N |
| 31 | F43Y | 89 | I123V | 147 | D222G |
| 32 | G44V | 90 | M125I | 148 | T223I |
| 33 | W45L, W45R | 91 | R126 M, R126S | 149 | G224C, G224V |
| 34 | L46F | 92 | L127F, L127I | 150 | V225L, V225F |
| 35 | V48F | 93 | W128C, W128L | 151 | T229I |
| 36 | G49D, G49S, G49V | 94 | L129F | 152 | V237A, V237F |
| 37 | V50A, V50I | 95 | W131S, W131R, W131L, W131V | 153 | D238N, D238E |
| 38 | A51S | 96 | C133F | 154 | E239D, E239G |
| 39 | L52F, L52I | 97 | R134H, R134L, R134C | 155 | P240L, P240S |
| 40 | L53F, L53H | 98 | S135P | 156 | E241A |
| 41 | A54T, A54S, A54V | 99 | L140F, L140I | 157 | E242A |
| 42 | V55G, V55F | 100 | A143S, A143V | 158 | Q245L |
| 43 | F56C | 101 | N144Y | 159 | H247Y |
| 44 | Q57Y, Q57H | 102 | L147F | 160 | G251C, G251V |
| 45 | A59V | 103 | C148Y, C148S | 161 | G254R |
| 46 | K61 N | 104 | W149L, W149C | 162 | N257S |
| 47 | T64I | 105 | T151I | 163 | V259L, V259E |
| 48 | L65F | 106 | N152S, N152I | 164 | M260K, M260I |
| 49 | K66N | 107 | C153Y | 165 | P262L, P262S |
| 50 | K67N, K67R | 108 | Y154C | 166 | I263M |
| 51 | R68I | 109 | D155Y | 167 | Y264C |
| 52 | W69C, W69L, W69R | 110 | I158V | 168 | P267S, P267L |
| 53 | A72S | 111 | Y160H | 169 | T268K, T268M |
| 54 | L73F | 112 | V163L | 170 | T269M |
| 55 | S74F, S74P | 113 | S165F, S165I | 171 | T271I |
| 56 | K75R, K75E | 114 | S166L | 172 | S272I |
| 57 | V77I, V77F | 115 | V168I | 173 | V273L |
| 58 | H78Q, H78Y | 116 | T170S |
Calculations of ΔΔG between wild-type and mutant Orf3a. The top rank mutations are listed in the table. DynaMut webserver was used to calculate the predicted ΔΔG. The negative values indicate the destabilization of protein upon mutation.
| S. No | Wild type residue | Residue position | Mutant residue | ΔΔG DynaMut (kcal/mol) |
|---|---|---|---|---|
| 1 | G | 49 | V | 1.74 |
| 2 | V | 88 | L | 1.507 |
| 3 | V | 90 | F | 1.512 |
| 4 | V | 112 | F | 1.30 |
| 5 | R | 126 | S | −2.024 |
| 6 | C | 148 | S | −1.741 |
| 7 | I | 158 | V | −1.583 |
| 8 | Y | 160 | H | −1.527 |
| 9 | S | 171 | L | 1.619 |
| 10 | D | 210 | Y | 1.442 |
| 11 | G | 224 | V | −1.528 |
| 12 | G | 224 | C | −1.717 |
Fig. 2Analysis of the secondary structure and intrinsic disorder predisposition of the unique mutations of SARS-CoV2 Orf3a in comparison with the reference Orf3a protein (YP_009724391) from China, Wuhan. (A, C and E) Secondary structure predictions, the amino acid sequences near the mutation site were uploaded on CFSSP web tool that predict secondary structure. Each panel (A, C and E) shows the secondary structure of the wild type and mutated input sequences. The panel (i) represents the wild type or Wuhan sequence while panel (ii) represents the mutated Indian sequence. The mutation site is highlighted in the rectangular box. (B,D and F) protein disorder prediction, the analysis was conducted using PONDR-VSL2 algorithm. A disorder threshold is depicted at a score of 0.5). Residues/regions with the disorder scores >0.5 are considered as disordered.
Fig. 3Prediction of B-cell epitopes. A) On the graphs, the Y-axis depicts for each residue the correspondent BepiPred score (averaged in the specified window); while the X-axis depicts the residue positions in the sequence. The larger score for the residues might be interpreted as that the residue might have a higher probability to be part of epitope (those residues are colored in yellow on the graphs). B) The top five peptides of Orf3a that showed the highest score. The sequences of all five peptides are shown. The number in parentheses represents the location of the peptide in the primary sequence of Orf3a. The red font shows the location of mutant residues of Orf3a. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)