| Literature DB >> 19851853 |
E Sreekumar1, Aneesh Issac, Sajith Nair, Ramkumar Hariharan, M B Janki, D S Arathy, R Regu, Thomas Mathew, M Anoop, K P Niyas, M R Pillai.
Abstract
Chikungunya virus (CHIKV), a positive-stranded alphavirus, causes epidemic febrile infections characterized by severe and prolonged arthralgia. In the present study, six CHIKV isolates (2006 RGCB03, RGCB05; 2007 RGCB80, RGCB120; 2008 RGCB355, RGCB356) from three consecutive Chikungunya outbreaks in Kerala, South India, were analyzed for genetic variations by sequencing the 11798 bp whole genome of the virus. A total of 37 novel mutations were identified and they were predominant in the 2007 and 2008 isolates among the six isolates studied. The previously identified E1 A226V critical mutation, which enhances mosquito adaptability, was present in the 2007 and 2008 samples. An important observation was the presence of two coding region substitutions, leading to nsP2 L539S and E2 K252Q change. These were identified in three isolates (2007 RGCB80 and RGCB120; 2008 RGCB355) by full-genome analysis, and also in 13 of the 31 additional samples (42%), obtained from various parts of the state, by sequencing the corresponding genomic regions. These mutations showed 100% co-occurrence in all these samples. In phylogenetic analysis, formation of a new genetic clade by these isolates within the East, Central and South African (ECSA) genotypes was observed. Homology modeling followed by mapping revealed that at least 20 of the identified mutations fall into functionally significant domains of the viral proteins and are predicted to affect protein structure. Eighteen of the identified mutations in structural proteins, including the E2 K252Q change, are predicted to disrupt T-cell epitope immunogenicity. Our study reveals that CHIK virus with novel genetic changes were present in the severe Chikungunya outbreaks in 2007 and 2008 in South India.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19851853 PMCID: PMC7088544 DOI: 10.1007/s11262-009-0411-9
Source DB: PubMed Journal: Virus Genes ISSN: 0920-8569 Impact factor: 2.332
Fig. 1Locations in Kerala from where samples were collected for analysis in the study during CHIKV outbreaks in 2006, 2007, and 2008
Primers used for whole genome sequencing
| Name | Forward primer sequence (5′ → 3′) | Position (with respect to S27) | Name | Reverse primer sequence (5′ → 3′) in the complementary strand |
|---|---|---|---|---|
|
|
| 1–19 | chr1 | Nil |
| chf2 | cgtctatgctgtacacgca | 532–550 | chr2 | tgcgtgtacagcatagacg |
| chf3 | gacggattcctgatgtgc | 1004–1021 | chr3 | gcacatcaggaatccgtc |
| chf4 | gacagctttgtggtaccga | 1418–1436 | chr4 | tcggtaccacaaagctgtc |
|
|
| 1863–1880 | chr5 | cgtcgtacgcttcgaccg |
| chf6 | ctgcacgtacggttgactcgc | 2370–2390 |
|
|
| chf7 | cgtgttaacgtgcttcagag | 2746–2765 | chr7 | ctctgaagcacgttaacacg |
| chf8 | cctatcctcgaaacagcg | 3134–3151 | chr8 | cgctgtttcgaggatagg |
|
|
| 3619–3636 | chr9 | gtgactctcttagtaggc |
| chf10 | ggatgtgcaccgtcgtac | 4070–4087 |
|
|
| chf11 | gacccagtcactgaacca | 4438–4455 | chr11 | tggttcagtgactgggtc |
| chf12 | gaaatgcccggtggatga | 4807–4823 | chr12 | tcatccaccgggcatttc |
|
|
| 5236–5253 | chr13 | acccagtcagacacggcc |
| chf14 | gccggtgaacaccctggag | 5731–5749 |
|
|
| chf15 | gagcgacattcaatccgtc | 6147–6165 | chr15 | gacggattgaatgtcgctc |
| chf16 | atacaggcggctgaaccct | 6599–6617 | chr16 | agggttcagccgcctgtat |
|
|
| 7025–7043 | chr17 | acgcggattttgtcagacg |
| chf18 | ggtacgcactacagctacc | 7501–7519 |
|
|
| chf19 | cgaagtcaagcacgaaggt | 7911–7929 | chr19 | accttcgtgcttgacttcg |
| chf20 | agagtggagtcttgccatc | 8343–8361 | chr20 | gatggcaagactccactct |
|
|
| 8832–8849 | chr21 | gccaggatgaagtgtccc |
| chf22 | gtttccgctggcaaatgt | 9315–9332 |
|
|
| chf23 |
| 9751–9769 | chr23 | aagggacggtagctcctgg |
| chf24 | acagctgtaaggtcttcaccgg | 10220–10241 |
|
|
| chf25 | gtacggtacacgtgccatact | 10673–10693 | chr25 | agtatggcacgtgtaccgtac |
| chf26 | aggaccacatagtcaactacc | 11144–11164 | chr26 | ggtagttgactatgtggtcct |
| chf27 | gtcccctaagagacacattg | 11486–11505 | chr27 | caatgtgtctcttaggggac |
| Chf28 | Nil | 11780–11798 |
|
|
All the primers are newly designed in the study. From each location, one forward and one reverse primer were designed based on the S-27 sequence
Primers used for generating the clones are highlighted in bold in the table
Fig. 2Cloning strategy adopted for full-genome sequencing
CHIKV proteins and their regions used in molecular modeling
| Sl. no. | CHIKV protein/domains modeled (target) | Template | PDB id | % identity with template | Region of target modeled (amino acids) |
|---|---|---|---|---|---|
| 1 | Helicase (nsP2) | Hydrolase | 3E1s chain A | 26.3 | 1–102 |
| 2 | Peptidase C9 (nsP2) | Helicase nsp2. Chain: a. Venezuelan equine encephalitis virus | 2hwk chain A | 44.4 | 41–202 |
| 3 | Macrodomain (nsP3) | Non-structural protein 3. Chain: a Human coronavirus 229e | 3ewr chain A | 23 | 6–139 |
| 4 | S3Peptidase (Capsid) | Semliki forest virus capsid protein | 1Vcp chain C | 93.2 | 8–156 |
| 5 | E1 (Structural protein) | Glycoprotein e1 from Semliki forest virus. | 1rer chain A | 63.5 | 67–456 |
Nucleotide changes in RGCB isolates with respect to S-27 reference strain
| Sl. no. | Isolate | Year of collection | GenBank accession no. | Number of nucleotide changes | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Coding region (synonymous/non-synonymous) | 5′NTR | 3′NTR | ||||||||||||
| Non-structural protein | Structural protein | |||||||||||||
| nsP1 | nsP2 | nsP3 | nsP4 | C | E3 | E2 | 6K | E1 | ||||||
| 1 | RGCB03/KL06 | 2006 | GQ428210 | 31/9 | 51/5 | 47/12 | 44/7 | 16/3 | 3/1 | 29/15 | 2/2 | 28/6 | 1 | 16 |
| 2 | RGCB05/KL06 | 2006 | GQ428211 | 32/9 | 50/5 | 48/12 | 42/6 | 15/4 | 3/1 | 31/14 | 2/2 | 28/4 | 1 | 17 |
| 3 | RGCB80/KL07 | 2007 | GQ428212 | 33/10 | 55/9 | 47/12 | 42/7 | 16/3 | 3/1 | 31/15 | 2/2 | 29/5 | 1 | 19 |
| 4 | RGCB120/KL07 | 2007 | GQ428213 | 33/9 | 49/8 | 48/11 | 44/6 | 16/3 | 3/1 | 32/17 | 2/3 | 29/5 | 1 | 19 |
| 5 | RGCB355/KL08 | 2008 | GQ428214 | 33/9 | 51/7 | 50/13 | 45/9 | 16/3 | 3/1 | 31/16 | 2/2 | 29/6 | 1 | 19 |
| 6 | RGCB356/KL08 | 2008 | GQ428215 | 35/10 | 52/6 | 47/12 | 42/6 | 16/2 | 3/2 | 29/17 | 2/2 | 28/6 | 1 | 17 |
Comparison of amino acid substitutions identified in Kerala isolates with that of S-27 and other closely related ECSA genotypes of CHIKV
| Protein | Position | Isolates | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Identified protein domains in the corresponding region | Polypeptide | Protein |
| IND-00-MH-4 (Yawat) | LR2006-OPY1 | DRDE-06 | IND-06-RJ1 | IND-06-MH2 | IND-06-AP3 | IND-06-KA15 | IND-06-TN1 | DRDE-07 |
|
|
|
|
|
| ITA07-RA1 | |
| nsP1 | Viral methyl transferase | 105 | 105 | G | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . |
| 128 | 128 | T | . | . | K | K | K | K | K | K | K | K | K | K | K | K | K | K | ||
| 258 | 258 | W | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| 314 | 314 | M | . | . | L | . | . | . | . | . | L | . | . | . | . | . | . | . | ||
| 376 | 376 | T | . | . | M | M | M | M | M | M | M | M | M | M | M | M | M | M | ||
| 488 | 488 | Q | . | R | R | R | R | R | R | R | R | R | R | R | R | R | R | R | ||
| 536 | 536 | D | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | ||
| nsP2 | 583 | 48 | V | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | |
| 589 | 54 | S | . | N | N | N | N | N | N | N | N | N | N | N | N | N | N | N | ||
| Viral helicase | 716 | 181 | V | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | |
| 772 | 237 | L | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| 773 | 238 | L | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| 864 | 329 | K | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | ||
| 946 | 411 | N | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| Peptidase C9 | 1074 | 539 | L | . | . | . | . | . | . | . | . | . | . | . |
|
|
| . | . | |
| 1243 | 708 | L | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 1328 | 793 | A | . | V | V | V | V | V | V | V | V | V | V | V | V | V | V | V | ||
| nsP3 | Macrodomain | 1359 | 26 | P | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . |
| 1427 | 94 | K | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | ||
| 1534 | 201 | L | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 1583 | 250 | D | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | ||
| 1709 | 376 | I | T | T | T | T | T | T | T | T | T | T | T | T | T | T | T | . | ||
| 1773 | 440 | E | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 1794 | 461 | L | . | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P | ||
| 1804 | 471 | P | . | S | S | S | S | S | S | S | S | S | S | S | S | S | S | S | ||
| nsP4 | 1938 | 75 | T | . | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | |
| 1979 | 116 | S | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| RNA dependent RNA polymerase (RdRP) | 2115 | 252 | N | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | |
| 2117 | 254 | T | . | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||
| 2152 | 289 | V | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | ||
| 2342 | 479 | M | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 2352 | 489 | M | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | |||
| Capsid | 23 | 23 | P | . | S | S | S | S | S | S | S | S | S | S | S | S | S | S | S | |
| 27 | 27 | V | . | . | I | I | I | . | I | I | . | I | I | I | I | I | . | I | ||
| 28 | 28 | I | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| S3 Peptidase | 199 | 199 | G | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | |
| E3 | 305 | 44 | R | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | |
| E2 | 397 | 72 | N | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | |
| 554 | 229 | V | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| 577 | 252 | K | . | . | . | . | . | . | . | . | . | . | . |
|
|
| . | . | ||
| 641 | 316 | E | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | ||
| 675 | 350 | G | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| 686 | 361 | E | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| 700 | 375 | S | . | T | T | T | T | T | T | T | T | T | T | T | T | T | T | T | ||
| 711 | 386 | V | . | A | A | A | A | A | A | A | A | A | A | A | A | A | A | A | ||
| 741 | 416 | C | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| 6K | 756 | 8 | V | . | I | I | I | I | I | I | I | I | I | I | I | I | I | I | I | |
| 808 | 60 | S | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| E1 | 813 | 04 | V | . | . | . | . | . | . | . | . | A | . | . | . | . | . |
| . | |
| 926 | 127 | E | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | ||
| 983 | 184 | D | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | ||
| 994 | 195 | Y | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | ||
| 996 | 197 | M | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | ||
| 1035 | 226 | A | . | V | . | . | . | . | . | . | V | . | . | V | V | V | V | V | ||
| 1057 | 248 | G | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 1093 | 284 | D | . | E | E | E | E | E | E | E | E | E | E | E | E | E | E | E | ||
| 1113 | 304 | P | . | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | ||
| 1205 | 396 | T | . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | ||
| 1220 | 411 | K | . | . | . | . | . | . | . | . | . | . | . | . | . |
| . | . | ||
| 1242 | 433 | C | . | . | . | . | . | . | . | . | . | . | . |
| . | . | . | . | ||
Identical amino acids with S-27 strain are indicated by “.”
Isolates of the present study are indicated in italics. Unique mutations are identified in bold
Fig. 3Phylogenetic analysis based on full-length coding region nucleotide sequences (11237 bp) of selected ECSA strains with Kerala strains (filled triangle). GenBank Accession numbers and strain name are given. GenBank accession nos. of RGCB isolates are given in Table 3. Boot-strap values (>50%) are indicated at the nodes. Scale bar represent the number of substitutions per site
Fig. 4Mapping the positions of selected major mutations identified in CHIKV Kerala isolates on to the structural regions of the corresponding protein. The sequence of the reference strain S-27 was modeled using suitable templates as described in “Materials and Methods”
Association of unique structural protein mutations identified in RGCB isolates with potential T-cell epitopes
| Sl. no. | Viral protein | Position in the polypeptide | Mutation | Associated HLA allelesa | Isolates that show the presence of the mutation | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| RGCB03/2006 | RGCB05/2006 | RGCB80/2007 | RGCB120/2007 | RGCB355/2008 | RGCB356/2008 | |||||
| 1 | Capsid | 28 | I → T | HLA-B*27, HLA-B*51, HLA-B*53 | – | – | – | – | – | Y |
| 2 | 199 | G → D | HLA-A*0301, HLA-A*1101, HLA-A*24, HLA-A*6801, HLA-B*07, HLA-B*27 | – | Y | – | – | – | – | |
| 3 | E3 | 305 | R → G | HLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-B*27 | – | – | – | – | – | Y |
| 4 | E2 | 554 | V → I | HLA-A*0101, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*6802 | – | – | – | – | – | Y |
| 5 | 577 | K → Q | HLA-A*0301, HLA-A*1101, HLA-B*40, HLA-B*44 | – | – | Y | Y | Y | – | |
| 6 | 641 | E → K | HLA-A*0201, HLA-A*0202, HLA-A*6802, HLA-B*40, HLA-B*44 | Y | – | – | – | – | – | |
| 7 | 675 | G → S | HLA-B*40, HLA-B*44, HLA-B*51 | – | – | – | – | – | Y | |
| 8 | 686 | E → G | HLA-A*0101, HLA-A*0201, HLA-A*0203, HLA-A*0301, HLA-A*24, HLA-B*40, HLA-B*44 | – | – | – | Y | – | – | |
| 9 | 741 | C → G | HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*24, HLA-A*3101, HLA-A*6802 | – | – | – | Y | – | – | |
| 10 | 6K | 808 | S → G | HLA-A*0101, HLA-A*0201, HLA- A*0202, HLA-A*0206, HLA-B*51 | – | – | – | Y | – | – |
| 11 | E1 | 813 | V → A | HLA-A*0201, HLA-A*0202, HLA-A*0206, HLA-A*24, HLA-A*6802, HLA-B*51 | – | – | – | – | – | Y |
| 12 | 926 | E → G | HLA-A*0301, HLA-A*1101, HLA-A*6801, HLA-B*40, HLA-B*44 | Y | – | – | – | – | – | |
| 13 | 983 | D → G | HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*24, HLA-A*6801, HLA-A*6802, HLA-B*51 | – | – | – | Y | – | – | |
| 14 | 994 | Y → H | HLA-A*0101, HLA-A*24, HLA-B*07 | Y | – | – | – | – | – | |
| 15 | 1035 | A → V | HLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-B*07, HLA-B*27, HLA-B*51 | – | – | Y | Y | Y | Y | |
| 16 | 1057 | G → E | HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*24 | – | – | – | – | Y | – | |
| 17 | 1205 | T → A | HLA-A*0201, HLA-A*0203, HLA-A*0206, HLA-A*24, HLA-B*51 | Y | – | – | – | – | – | |
| 18 | 1220 | K → R | HLA-A*0202, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*6802, HLA-B*51 | – | – | – | – | Y | – | |
aPredicted epitopes are given in Supplementary Tables 2 and 3. Y Indicates presence of mutation