Literature DB >> 19851853

Genetic characterization of 2006-2008 isolates of Chikungunya virus from Kerala, South India, by whole genome sequence analysis.

E Sreekumar1, Aneesh Issac, Sajith Nair, Ramkumar Hariharan, M B Janki, D S Arathy, R Regu, Thomas Mathew, M Anoop, K P Niyas, M R Pillai.   

Abstract

Chikungunya virus (CHIKV), a positive-stranded alphavirus, causes epidemic febrile infections characterized by severe and prolonged arthralgia. In the present study, six CHIKV isolates (2006 RGCB03, RGCB05; 2007 RGCB80, RGCB120; 2008 RGCB355, RGCB356) from three consecutive Chikungunya outbreaks in Kerala, South India, were analyzed for genetic variations by sequencing the 11798 bp whole genome of the virus. A total of 37 novel mutations were identified and they were predominant in the 2007 and 2008 isolates among the six isolates studied. The previously identified E1 A226V critical mutation, which enhances mosquito adaptability, was present in the 2007 and 2008 samples. An important observation was the presence of two coding region substitutions, leading to nsP2 L539S and E2 K252Q change. These were identified in three isolates (2007 RGCB80 and RGCB120; 2008 RGCB355) by full-genome analysis, and also in 13 of the 31 additional samples (42%), obtained from various parts of the state, by sequencing the corresponding genomic regions. These mutations showed 100% co-occurrence in all these samples. In phylogenetic analysis, formation of a new genetic clade by these isolates within the East, Central and South African (ECSA) genotypes was observed. Homology modeling followed by mapping revealed that at least 20 of the identified mutations fall into functionally significant domains of the viral proteins and are predicted to affect protein structure. Eighteen of the identified mutations in structural proteins, including the E2 K252Q change, are predicted to disrupt T-cell epitope immunogenicity. Our study reveals that CHIK virus with novel genetic changes were present in the severe Chikungunya outbreaks in 2007 and 2008 in South India.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19851853      PMCID: PMC7088544          DOI: 10.1007/s11262-009-0411-9

Source DB:  PubMed          Journal:  Virus Genes        ISSN: 0920-8569            Impact factor:   2.332


Introduction

Increase in the virulence of emerging pathogens, such as Chikungunya virus (CHIKV), Dengue virus, and other arboviruses, is of grave concern [1-3]. Chikungunya (CHIK), a mosquito-transmitted zoonotic viral infection, is characterized by high fever, headache, myalgia, severe and prolonged arthralgia, and erythematous skin rashes. The main mosquito vectors are the members of Aedes family, Ae. aegypti and Ae. albopictus. Recent outbreaks of CHIK have exhibited unusual severity and suspected mortality among the affected people in several parts of the world [4-6]. CHIK was reported from India as early as in 1963 [7] and later it was thought to have disappeared from the sub-continent [8]. Among the three genotypes of Chikungunya virus [Asian; West African; and East, Central and South African (ECSA)], earlier outbreaks in India were caused by the Asian genotype. With the identification of the ECSA genotype of the virus from a field-caught mosquito in 2000 [9], the disease gained attention as a re-emerging infection. A number of CHIK outbreaks have been reported from several parts of the country during the period 2005–2008 [10, 11]. One of the worst affected states was Kerala in South India. As per the statistics available with the Department of Health and Family Welfare, Government of Kerala, the number of CHIKV suspected cases were 70,780 in 2006; 24,052 in 2007; and 24,667 in 2008. The disease in 2007 and 2008 was unusual with a rapid spread, lymphadenitis, hemorrhages, hepatic involvement, severe, prolonged arthralgia, and even suspected incidences of mortality [6, 12, 13]. Genetic changes in the virus were speculated as the plausible reason for this [12, 13]. CHIKV is a positive-stranded RNA virus belonging to the alphavirus of the Togaviridae family [1]. The 11.8 kb viral genome of the CHIKV codes for four non-structural proteins (nsP1, nsP2, nsP3, and nsP4), a core protein, and three envelope proteins (E1, E2, E3; E1 and E2 connected by a short 6K region). Previous studies show that in alphaviruses, the infectivity and virulence-associated gene mutations are distributed both in the structural and non-structural protein coding regions of the viral genome [14-16]. Several mutations in the structural and non-structural protein coding regions were detected in the CHIKV isolates from the Re’Union outbreak [4]. A point mutation in the E1 protein (A226V) has been proven to be the main reason for altered vector specificity and epidemic potential of the re-emerging CHIKV [17, 18]. This mutation has been documented in the recent viral isolates (2007) from the Indian subcontinent, including those from Kerala [6, 12]. Only few studies have looked into the variations in the genome regions other than the envelope protein coding region of the CHIKV isolates from India. Mutations in the nsP1 region (T128K and T376M) and capsid region (P23S) of 2006 isolates [11], and in the E1 region (V14A and A226V) and in the nsP1 region (M184T) of a 2007 isolate [6] have been previously reported. The present study was carried out to understand the gene mutations in the virus that caused an explosive febrile infection in three consecutive CHIK outbreaks in Kerala (2006–2008). It revealed novel mutations, phylogenetic evolution of a new viral genetic clade, and in silico predicted amino acid substitutions affecting several potential T-cell epitopes.

Materials and methods

Virus

CHIKV was isolated from serum samples of patients suffering from the classical symptoms such as fever, polyarthralgia, and skin rashes. Samples were collected during the July–December 2006, May–October, 2007, and May–September, 2008 outbreaks in Kerala. The presence of virus was detected by RT-PCR as previously reported [19]. The viral RNA was isolated using QiaAmp Viral RNA isolation kit (Qiagen, GmBH, Germany) and the reverse-transcription polymerase chain reaction (RT-PCR) was carried out using Genei One-step AMV RT-PCR kit (Bangalore Genei, Bangalore, India). Virus isolation was carried out using confluent monolayer culture of Vero cells. CHIKV in 200-μl patient serum was used for infection as per standard protocols [4, 6]. Infected cells were incubated till the cytopathic effects were visible.

Full-genome sequencing

Two isolates from the same locality for each of the 2006, 2007, and 2008 outbreaks were subjected to sequencing. The details of the samples, place, and date of collection are given in Fig. 1 and the Online Resource Table 1. Seven overlapping genomic regions were RT-PCR amplified from the isolated viral RNA. The details of the primer sequences and cloning strategy are given in Table 1 and Fig. 2. RT-PCR was carried out using Fidelitaq RT-PCR kit (USB, Cleveland, Ohio). The conditions were a reverse transcription step at 50°C for 45 min and 35 cycles of thermal cycling, which included denaturation at 95°C for 1 min, annealing at 55°C for 1 min, and an extension at 68°C for 4 min. The fragments were subsequently cloned into pGEM-T easy vector (Promega) and were sequenced using specific primers, using the Big-dye Terminator Cycle sequencing kit in an ABI 3730 Genetic Analyzer automated DNA sequencer (PE Applied Biosystems, Foster City, CA). The sequence contigs were assembled using CAP contig assembly program in Bio-Edit software [20].
Fig. 1

Locations in Kerala from where samples were collected for analysis in the study during CHIKV outbreaks in 2006, 2007, and 2008

Table 1

Primers used for whole genome sequencing

NameForward primer sequence (5′ → 3′)Position (with respect to S27)NameReverse primer sequence (5′ → 3′) in the complementary strand
chf1 atggctgcgtgagacacac 1–19chr1Nil
chf2cgtctatgctgtacacgca532–550chr2tgcgtgtacagcatagacg
chf3gacggattcctgatgtgc1004–1021chr3gcacatcaggaatccgtc
chf4gacagctttgtggtaccga1418–1436chr4tcggtaccacaaagctgtc
chf5 cggtcgaagcgtacgacg 1863–1880chr5cgtcgtacgcttcgaccg
chf6ctgcacgtacggttgactcgc2370–2390 chr6 gcgagtcaaccgtacgtgcag
chf7cgtgttaacgtgcttcagag2746–2765chr7ctctgaagcacgttaacacg
chf8cctatcctcgaaacagcg3134–3151chr8cgctgtttcgaggatagg
chf9 gcctactaagagagtcac 3619–3636chr9gtgactctcttagtaggc
chf10ggatgtgcaccgtcgtac4070–4087 chr10 gtacgacggtgcacatcc
chf11gacccagtcactgaacca4438–4455chr11tggttcagtgactgggtc
chf12gaaatgcccggtggatga4807–4823chr12tcatccaccgggcatttc
chf13 ggccgtgtctgactgggt 5236–5253chr13acccagtcagacacggcc
chf14gccggtgaacaccctggag5731–5749 chr14 ctccagggtgttcaccggc
chf15gagcgacattcaatccgtc6147–6165chr15gacggattgaatgtcgctc
chf16atacaggcggctgaaccct6599–6617chr16agggttcagccgcctgtat
chf17 cgtctgacaaaatccgcgt 7025–7043chr17acgcggattttgtcagacg
chf18ggtacgcactacagctacc7501–7519 chr18 ggtagctgtagtgcgtacc
chf19cgaagtcaagcacgaaggt7911–7929chr19accttcgtgcttgacttcg
chf20agagtggagtcttgccatc8343–8361chr20gatggcaagactccactct
chf21 gggacacttcatcctggc 8832–8849chr21gccaggatgaagtgtccc
chf22gtttccgctggcaaatgt9315–9332 chr22 acatttgccagcggaaac
chf23 ccaggagctaccgtccctt 9751–9769chr23aagggacggtagctcctgg
chf24acagctgtaaggtcttcaccgg10220–10241 chr24 ccggtgaagaccttacagctgt
chf25gtacggtacacgtgccatact10673–10693chr25agtatggcacgtgtaccgtac
chf26aggaccacatagtcaactacc11144–11164chr26ggtagttgactatgtggtcct
chf27gtcccctaagagacacattg11486–11505chr27caatgtgtctcttaggggac
Chf28Nil11780–11798 Chr28 tacgtccctgtgggttcggagaat

All the primers are newly designed in the study. From each location, one forward and one reverse primer were designed based on the S-27 sequence

Primers used for generating the clones are highlighted in bold in the table

Fig. 2

Cloning strategy adopted for full-genome sequencing

Locations in Kerala from where samples were collected for analysis in the study during CHIKV outbreaks in 2006, 2007, and 2008 Primers used for whole genome sequencing All the primers are newly designed in the study. From each location, one forward and one reverse primer were designed based on the S-27 sequence Primers used for generating the clones are highlighted in bold in the table Cloning strategy adopted for full-genome sequencing

RT-PCR amplification and sequencing of partial nsP2, E1, and E2 sequences

Partial nsP2, E1, and E2 sequences were amplified from 31 samples (Online Resource Table 1) by RT-PCR. The primers used for amplification were ch8F and ch9R for nsP2 region; ch24F and ch25R for the E1 region; and ch21F and ch22R for the E2 region (Table 1). The RT-PCR was done as described above. The amplified products were purified using GFX Gel band purification system (GE Healthcare) and were subjected to automated DNA sequencing.

Phylogenetic analysis

CHIKV full-length coding sequences (11237 bp) were aligned with corresponding sequences obtained from NCBI GenBank using Clustal W program of MEGA3.1 [21] software, with Kimura-2 distance correction. The phylogeny was reconstructed by Neighbor-Joining method with 10000 bootstrap replications.

Mapping of mutations

Homology models for the CHIKV proteins were made from structural templates that were identified by BLAST search against the PDB chain database. Details of the selected templates, along with the regions showing reliable alignments, are given in Table 2. To avoid problems with automated alignments, the target sequences were manually threaded into the template structures using Deepview (Swiss-pdb Viewer) [22]. The resulting project files were sent to the SWISSMODEL automated homology model-building server for model calculation. Stereochemical quality checks were performed on the returned models using VADAR server [23]. A few problematic side chain conformations were identified and rectified. The resulting structures were energy minimized using DeepView iteratively using 200 cycles of steepest descent. Structural representations of the models were generated with PyMOL (http://www.pymol.org/) and the mutations identified in the study were mapped.
Table 2

CHIKV proteins and their regions used in molecular modeling

Sl. no.CHIKV protein/domains modeled (target)TemplatePDB id% identity with templateRegion of target modeled (amino acids)
1Helicase (nsP2)Hydrolase ( Deinococcus radiodurans recd2)3E1s chain A26.31–102
2Peptidase C9 (nsP2)Helicase nsp2. Chain: a. Venezuelan equine encephalitis virus2hwk chain A44.441–202
3Macrodomain (nsP3)Non-structural protein 3. Chain: a Human coronavirus 229e3ewr chain A236–139
4S3Peptidase (Capsid)Semliki forest virus capsid protein1Vcp chain C93.28–156
5E1 (Structural protein)Glycoprotein e1 from Semliki forest virus.1rer chain A63.567–456
CHIKV proteins and their regions used in molecular modeling

T-cell epitope prediction

T-cell epitopes of the CHIKV structural proteins were predicted using the EpiJen online server [24]. The entire structural polyprotein sequence of CHIKV was used as input to EpiJen and the server was run in the default mode. The appropriate proteasomal and Tap cut-off values were selected while getting epitope predictions for the 18 different HLA alleles that are currently being offered at EpiJen.

Results

RT-PCR and virus isolation

In RT-PCR of the clinical samples, 9 out of 11 (81.18%) samples from 2006, 33 out of 36 (91.6%) samples from 2007, and 11 out of 19 (57.8%) samples from 2008 were positive for the presence of viral RNA. During virus isolation in Vero cells from RT-PCR-positive samples, cytopathic effect (CPE) development was observed in 48–96 h. The CPE was characterized by the roundening of cells, increased granularity and vacuolation, followed by cell death and disruption of the monolayer by detachment of the dead cells. RNA isolation and RT-PCR amplification from the culture supernatant further confirmed the presence of the virus.

Full-genome sequencing of 2006, 2007, and 2008 isolates

Single passage virus and high-fidelity PCR enzymes were used to minimize experimentally induced genetic alterations in whole genome analysis. Six isolates—RGCB03, RGCB05, RGCB80, RGCB120, RGCB355, and RGCB356—were subjected to full genome sequencing. The year of collection of these isolates, GenBank accession numbers of the sequences and the number of nucleotide changes observed with respect to the reference sequence of S-27 African strain (GenBank accession No. AF369024) are given in Table 3. The full-length assembled CHIKV genome consisted of 11,798 base pairs and corresponds to the positions 1–11,798 of the reference sequence. The Kerala isolates had an average whole genome nucleotide identity of 96.95% with the S-27 sequence, 98.07% with the 2000 Indian (Yawat) isolate, and 99.68-99.8% with the 2006 Indian and Re’Union isolates. The sequences had 99.7% identity with the 2007 ITA07-RA strain, which was isolated in Italy. The comparative analysis of the predicted amino acid substitutions observed in the six Kerala isolates with that of other Indian isolates, Re’Union and Italian ECSA isolates are given in Table 4.
Table 3

Nucleotide changes in RGCB isolates with respect to S-27 reference strain

Sl. no.IsolateYear of collectionGenBank accession no.Number of nucleotide changes
Coding region (synonymous/non-synonymous)5′NTR3′NTR
Non-structural proteinStructural protein
nsP1nsP2nsP3nsP4CE3E26KE1
1RGCB03/KL062006GQ42821031/951/547/1244/716/33/129/152/228/6116
2RGCB05/KL062006GQ42821132/950/548/1242/615/43/131/142/228/4117
3RGCB80/KL072007GQ42821233/1055/947/1242/716/33/131/152/229/5119
4RGCB120/KL072007GQ42821333/949/848/1144/616/33/132/172/329/5119
5RGCB355/KL082008GQ42821433/951/750/1345/916/33/131/162/229/6119
6RGCB356/KL082008GQ42821535/1052/647/1242/616/23/229/172/228/6117
Table 4

Comparison of amino acid substitutions identified in Kerala isolates with that of S-27 and other closely related ECSA genotypes of CHIKV

ProteinPositionIsolates
Identified protein domains in the corresponding regionPolypeptideProtein S-27 IND-00-MH-4 (Yawat)LR2006-OPY1DRDE-06IND-06-RJ1IND-06-MH2IND-06-AP3IND-06-KA15IND-06-TN1DRDE-07 RGCB03-06 RGCB 05-06 RGCB 80-07 RGCB 120-07 RGCB 355-08 RGCB 356-08 ITA07-RA1
nsP1Viral methyl transferase105105G.............. R .
128128T..KKKKKKKKKKKKKK
258258W.............. R .
314314M..L.....L.......
376376T..MMMMMMMMMMMMMM
488488Q.RRRRRRRRRRRRRRR
536536D........... G ....
nsP258348V........... A ....
58954S.NNNNNNNNNNNNNNN
Viral helicase716181V........... A ....
772237L............ R ...
773238L............ P ...
864329K........... E ....
946411N.............. D .
Peptidase C91074539L........... S S S ..
1243708L............. P ..
1328793A.VVVVVVVVVVVVVVV
nsP3Macrodomain135926P.......... S .....
142794K........... Q ....
1534201L............. R ..
1583250D......... E ......
1709376ITTTTTTTTTTTTTTT.
1773440E............. G ..
1794461L.PPPPPPPPPPPPPPP
1804471P.SSSSSSSSSSSSSSS
nsP4193875T.AAAAAAAAAAAAAAA
1979116S............. P ..
RNA dependent RNA polymerase (RdRP)2115252N............. D ..
2117254T.AAAAAAAAAAAAAAA
2152289V........... A ....
2342479M............. V ..
2352489M........ K ......
Capsid2323P.SSSSSSSSSSSSSSS
2727V..III.II.IIIII.I
2828I.............. T .
S3 Peptidase199199G.......... D .....
E330544R.............. G .
E239772N............. S ..
554229V.............. I .
577252K........... Q Q Q ..
641316E......... K ......
675350G.............. S .
686361E............ G ...
700375S.TTTTTTTTTTTTTTT
711386V.AAAAAAAAAAAAAAA
741416C............ G ...
6K7568V.IIIIIIIIIIIIIII
80860S............ G ...
E181304V........A..... A .
926127E......... G ......
983184D............ G ...
994195Y......... H ......
996197M.......... V .....
1035226A.V......V..VVVVV
1057248G............. E ..
1093284D.EEEEEEEEEEEEEEE
1113304P.............. L .
1205396T......... A ......
1220411K............. R ..
1242433C........... R ....

Identical amino acids with S-27 strain are indicated by “.”

Isolates of the present study are indicated in italics. Unique mutations are identified in bold

Nucleotide changes in RGCB isolates with respect to S-27 reference strain Comparison of amino acid substitutions identified in Kerala isolates with that of S-27 and other closely related ECSA genotypes of CHIKV Identical amino acids with S-27 strain are indicated by “.” Isolates of the present study are indicated in italics. Unique mutations are identified in bold

Sequence analysis of the non-structural protein coding region

Several nucleotide changes were found in the non-structural protein coding region in comparison with the S-27 prototype strain. Some of these nucleotide substitutions were non-synonymous and resulted in amino acid change (Tables 3 and 4). Most of these changes were also found in other CHIKV isolates from 2006 and 2007 outbreaks in India and other parts of the world. However, some nucleotide changes were unique to the Kerala isolates studied (Online Resource Table 2). Six nucleotide changes (T1381G; T3297C; T3397C; C5014T; A6076G; C7450T) were common to the 2007 isolates (RGCB80, RGCB120) and to one of the 2008 isolates (RGCB355).

Sequence analysis of structural protein coding region

Structural protein coding region also showed several nucleotide changes (Table 3). As in the non-structural protein coding region, some of the changes were present only in RGCB isolates (Online Resource Table 2). Among this, five unique nucleotide changes (C7983T; A9295C; G9654A; G9681A; A10356G) were uniformly found in RGCB80, RGCB120, and RGCB355. The A9295C change caused a K252Q mutation in the E2 coding region of the three isolates. The nucleotide substitution C10670T causing the A226 V mutation in the E1 protein was present only in 2007 and 2008 RGCB isolates (viz. RGCB80, RGCB120, RGCB355, and RGCB356). Predicted amino acid changes observed in the structural protein region are shown in Table 4.

5′ and 3′ non-translated regions (NTRs)

5′NTR region in the Kerala isolates did not reveal any nucleotide change compared to the S-27 strain, except for G68T substitution, which was observed also in all the recent Indian and Re’ Union viral isolates. There were no deletions or insertions. However, the 3′NTR showed nucleotide substitutions (Table 3; Online Resource Table 2). The internal polyadenylation stretch at nucleotides 11445–11458, observed within the 3′NTR in the S-27 strain, was absent in the recent isolates of CHIKV ECSA strains [6]. This stretch of 14 adenine nucleotides was absent in the six isolates sequenced in this study also. Most of the nucleotide changes observed in the 3′NTR were common to the RGCB isolates and recent CHIKV isolates from India and other parts of the world. The C11614T substitution was observed only in three RGCB isolates (RGCB80, RGCB120, RGCB355), ITA07-RA1 and in IND-06-RJ1 strains. Another common nucleotide variation observed in Indian isolates (DRDE06, ITA-07-RA, IND-06-RJ1, IND-06-KA15, IND-06-TN1) and in RGCB isolates (RGCB05, RGCB80, RGCB120, and RGCB355) was the A11655G substitution. 3′NTR did not reveal any deletions or insertions.

Phylogenetic analysis based on full-length coding region

Phylogenetic analysis using 11237 bp full-length coding region nucleotide sequences of the six RGCB isolates, eight other Indian isolates available in GenBank, one isolate from Italy, one from Re’Union Island, and three from Mauritius indicated clear clustering of the 2006, 2007, and 2008 isolates (Fig. 3). The isolates Yawat-2000 and S-27 African prototype formed separate branches. Within the recent ECSA genotypes, the three Kerala isolates (RGCB80, RGCB120, and RGCB355) and the ITA-07-RA1 Italian isolate showed very close clustering. The three RGCB isolates formed a separate clade, with 100% boot-strap support. It was found that the presence of five unique nucleotide variations in the genome of these RGCB isolates (T3297C, C5014T, A6076G, G9654A, and A10356G) contributed to the formation of this clade.
Fig. 3

Phylogenetic analysis based on full-length coding region nucleotide sequences (11237 bp) of selected ECSA strains with Kerala strains (filled triangle). GenBank Accession numbers and strain name are given. GenBank accession nos. of RGCB isolates are given in Table 3. Boot-strap values (>50%) are indicated at the nodes. Scale bar represent the number of substitutions per site

Phylogenetic analysis based on full-length coding region nucleotide sequences (11237 bp) of selected ECSA strains with Kerala strains (filled triangle). GenBank Accession numbers and strain name are given. GenBank accession nos. of RGCB isolates are given in Table 3. Boot-strap values (>50%) are indicated at the nodes. Scale bar represent the number of substitutions per site

Sequence analysis of partial nsP2, E1, and E2 coding region

In 31 additional samples collected from different parts of the state from 2007 and 2008 outbreaks, nucleotide changes causing the nsP2 L539S and E2 K252Q were present in 13 samples (42%); and the nucleotide change causing the E1 A226V mutation was present in 28 of the samples (Online Resource, Table 1). Interestingly, the L539S change in the non-structural protein coding region and K252Q change in the structural protein coding region showed 100% co-occurrence. Since we resorted to direct sequencing of the PCR product in all these cases, the possibility of these viral strains being quasispecies circulating in the patients had to be excluded. In careful analysis of the sequencing chromatogram, all samples showed only one clear peak, which corresponded to the changed nucleotide. This confirmed the presence of a single species of virus in such samples.

Molecular modeling

Homology-based models of five CHIKV proteins are given in Fig. 4 with the mutations that have been mapped onto the three dimensional structure. Only five of the CHIKV proteins had reliable templates for modeling (Table 2). The models represent energy minimized structures and were not subjected to dynamic perturbation. Target-template identities ranged from 23% for the CHIKV macrodomain to about 93% for the peptidase S3. As shown, many of the mutations were mapped to areas with major secondary structures, and could affect the local protein structure.
Fig. 4

Mapping the positions of selected major mutations identified in CHIKV Kerala isolates on to the structural regions of the corresponding protein. The sequence of the reference strain S-27 was modeled using suitable templates as described in “Materials and Methods”

Mapping the positions of selected major mutations identified in CHIKV Kerala isolates on to the structural regions of the corresponding protein. The sequence of the reference strain S-27 was modeled using suitable templates as described in “Materials and Methods” Results from the T-cell epitope prediction server, EpiJen are shown in Table 5 and the actual peptide-epitope sequences are shown in Online Resource Tables 3 and 4. While ranking the epitopes for mutation mapping, epitopes with calculated IC50 values above 100 were not included since such peptides are unlikely to yield significant immunogenicity. It is interesting to note that majority of the identified mutations fall within predicted T-cell epitopes, and are therefore predicted to influence immunogenicity. The unique mutation K577Q in the E2 structural protein was linked to epitopes presented by HLA-A and HLA-B alleles. The E1 A226V mutation, implicated in the increased virulence of Re’Union isolates, also could be mapped to epitopes presented by a number of HLA alleles.
Table 5

Association of unique structural protein mutations identified in RGCB isolates with potential T-cell epitopes

Sl. no.Viral proteinPosition in the polypeptideMutationAssociated HLA allelesa Isolates that show the presence of the mutation
RGCB03/2006RGCB05/2006RGCB80/2007RGCB120/2007RGCB355/2008RGCB356/2008
1Capsid28I → THLA-B*27, HLA-B*51, HLA-B*53Y
2199G → DHLA-A*0301, HLA-A*1101, HLA-A*24, HLA-A*6801, HLA-B*07, HLA-B*27Y
3E3305R → GHLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-B*27Y
4E2554V → IHLA-A*0101, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*6802Y
5577K → QHLA-A*0301, HLA-A*1101, HLA-B*40, HLA-B*44YYY
6641E → KHLA-A*0201, HLA-A*0202, HLA-A*6802, HLA-B*40, HLA-B*44Y
7675G → SHLA-B*40, HLA-B*44, HLA-B*51Y
8686E → GHLA-A*0101, HLA-A*0201, HLA-A*0203, HLA-A*0301, HLA-A*24, HLA-B*40, HLA-B*44Y
9741C → GHLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*24, HLA-A*3101, HLA-A*6802Y
106K808S → GHLA-A*0101, HLA-A*0201, HLA- A*0202, HLA-A*0206, HLA-B*51Y
11E1813V → AHLA-A*0201, HLA-A*0202, HLA-A*0206, HLA-A*24, HLA-A*6802, HLA-B*51Y
12926E → GHLA-A*0301, HLA-A*1101, HLA-A*6801, HLA-B*40, HLA-B*44Y
13983D → GHLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*24, HLA-A*6801, HLA-A*6802, HLA-B*51Y
14994Y → HHLA-A*0101, HLA-A*24, HLA-B*07Y
151035A → VHLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-B*07, HLA-B*27, HLA-B*51YYYY
161057G → EHLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*24Y
171205T → AHLA-A*0201, HLA-A*0203, HLA-A*0206, HLA-A*24, HLA-B*51Y
181220K → RHLA-A*0202, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*6802, HLA-B*51Y

aPredicted epitopes are given in Supplementary Tables 2 and 3. Y Indicates presence of mutation

Association of unique structural protein mutations identified in RGCB isolates with potential T-cell epitopes aPredicted epitopes are given in Supplementary Tables 2 and 3. Y Indicates presence of mutation

Discussion

Kerala was one of the South Indian states that witnessed wide-spread outbreak of CHIKV in the years 2006, 2007, and 2008. These outbreaks differed in the geographical region involved (coastal region in 2006 versus hilly areas with rubber plantations and abundance of Ae. albopictus mosquitoes in 2007 and 2008) and in disease severity (the 2007 and 2008 outbreaks more severe than the 2006 outbreak) [6, 12, 13]. In the outbreaks in 2007 and 2008, one of the unique complications observed was joint swelling (61.3% cases) along with painful arthralgia, which persisted for long periods [13, 25]. Another major difference in the clinical symptoms compared to the outbreaks in other parts of the world was the increased presence of skin rashes/itching. In one of the studies, these symptoms were observed in 80.8% patients in the Kerala outbreak in 2007–2008 [13]. Similar symptoms were seen only in 19.3% of the patients in the Re’Union outbreak [26]. Skin manifestations were predominant in pediatric patients also. 75% of the CHIKV affected infants exhibited peripheral cyanosis and dermatological manifestations like generalized erythema, maculopapular rash, vesiculobullous lesions or skin peeling, as characteristic symptoms [27]. A rapid spread of the disease was observed in 2007 and 2008, than in 2006, which has been linked to the outbreak of the disease in areas with abundance of Ae. albopictus. This vector permits rapid replication of the E1 A226V mutant virus [17, 18], that was circulating in Kerala during this period [12]. Though mortalities were observed, they were not conclusively attributed to CHIKV infection [25]. The increased severity of recent CHIK infections in other parts of the world has been linked to the genetic alterations in the virus [4, 17, 18]. In order to identify possible correlation of the clinical severity of the disease in Kerala with CHIKV genomic changes, we analyzed full-genome sequences of the viral isolates obtained from these outbreaks. Our study exposed greater genetic heterogeneity in the CHIKV circulating in Kerala, than that was observed in earlier studies. Both coding region changes and changes in the non-coding region, especially 3′NTR were observed in the study. Except for the 4 mutations—E1 (V04A) [6]; E2 (K252Q), and nsP2 (V48A, L539S) [28]—all the 18 mutations in the structural protein coding region and 19 mutations in the non-structural protein coding region identified in this study are new. Mutations were present throughout the genome. However, they were predominant in nsP2, E1, and E2 coding regions, which play major role in alphavirus infection and replication. Interestingly, they were more in the 2007 and 2008 isolates (Table 4). Along with the A226V mutation in the E1 protein, K252Q mutation in the E2 protein and L539S in the nsP2 protein were common features of most of the 2007–2008 Kerala isolates (Table 4). These changes and possibly other mutations in the viral proteins of the Kerala isolates might have contributed to the pathogenesis of CHIKV in these outbreaks. The non-structural protein nsP2 plays a critical role in the cytopathogenicity of alphaviruses [16, 29, 30]. The Viral Helicase and Peptidase C9 cysteine-protease domains of this protein had predicted amino acid changes in the RGCB CHIKV isolates (Table 4). In protein model, the two observed mutations in the viral helicase domain, L237R and L238P in RGCB120 were mapped to an α-helix forming region (Fig. 4). These are non-conservative changes that could alter the conformation of this region. Mutation that was observed in the cysteine-protease domain was the L539S in the RGCB80, RGCB120, and RGCB355 (Fig. 4), which fell closer to the H548. The H548 residue forms the catalytic diad along with C478 in other alphaviruses [31]. This change from a hydrophobic amino acid to polar amino acid also has the potential to affect the secondary structure of this region. Among the structural proteins, E1 and E2 proteins play major role in the cell entry of alphavirus. Domains I and II of the E1 protein are involved in E1 trimerization during the viral fusion process at the time of infection. Domain II mediates the E1–E2 interaction during the virus maturation and budding from infected cells [32-34]. In RGCB CHIKV isolates, the observed mutations in the E1 protein coding region were spread across all these functional regions (Fig. 4). Most of the mutations were in the loops of the domain II, indicating that this region plays major role in the biology of CHIKV E1 protein. Some of the changes were non-conservative (E127G, D184G, G248E), which might affect the mobility and strength of E1–E1 and E1–E2 interactions. The folding back of domain III and its interactions with the E1 trimer during fusion are critical events in alphavirus entry [35]. Domain III in one of the RGCB CHIKV isolates (RGCB356) had a major mutation (P304L) in one of the loops that could destabilize the local structure. In the E2 protein, most of the mutations fell in the region between amino acids (aa) 229 and 361. This stretch of the protein (aa 1–364) in alphaviruses forms a highly hydrophilic domain. In Hopps and Woods [36] hydrophilicity prediction, the CHIKV E2 peptide 249GDRKGKIH256 within this region was the most hydrophilic. Hence it could be a potential antigenic epitope. The K252Q basic to neutral amino acid change, observed consistently in many Kerala isolates, involved this epitope and was found to reduce the predicted hydrophilicity. This, in turn, indicates altered immunogenicity of the E2 protein of the isolates. The residues corresponding to the receptor binding region of E2 in alphaviruses (aa 170–220) [37] did not show mutation in the six isolates studied. The 391–423 aa stretch is identified as cytoplasmic domain of E2 protein in Sindbis virus [38], and the C416G mutation that was observed in RGCB120 falls within this region. This mutation changes one of the two conserved cysteine residues. Palmitoylation of these cysteine residues are thought to be important in anchoring of the E2 to the plasma membrane, and favor interaction with the capsid protein during Sindbis virus assembly [38, 39]. The non-coding region changes were mostly in the 3′NTR region, which play role in the viral genome replication during minus strand synthesis in alphaviruses [40]. The 3′NTR of alphaviruses contain repeat sequence elements (RSEs) (Pfeffer et al., 1998) and the CHIKV 3′NTR has three RSEs at positions 11382–11416, 11525–11559, and 11611–11646 [41]. Interestingly, one of the 3′NTR nucleotide changes (C11614T) observed in three RGCB isolates (RGCB80, RGCB120, and RGCB355) and in two other isolates (ITA07-RA and IND-06-RJ1) fell within the third RSE region. However, this did not change the predicted RNA secondary structure at the region. Analysis of the changes possibly affecting the immunogenicity of these isolates indicated that several mutations in the structural protein region fell within the predicted T-cell epitopes (Table 5). Epitopes presented by major HLA alleles, such as HLA-A*02 and HLA-A*11 [42], were shown to be involved. The latter is one of the main HLA types present in populations world over and is highly prevalent in South East Asia [43]. Classical CHIKV infections are known to offer long-term protection [44]. However, the variations in these epitopes have the potential to affect the cell-mediated immune response to CHIKV antigens. In phylogenetic analysis, the 2007 and 2008 isolates showed closer clustering, even though the 2008 isolates were from a geographically distant location. This indicates that the viral strains causing the latter outbreaks were similar to the strains that were circulating in 2007. The timelines of the outbreak with an inter-epidemic period of almost a year (May–October 2007 and May–September 2008) clearly points out that the viral strains were maintained in active form in the region during this period. Further investigations are essential to understand whether this was contributed by sporadic incidences of the disease, which was prevailing undetected in the population, or through natural reservoirs of the Chikungunya virus such as non-human primates or vector mosquitoes [1]. In conclusion, our study reveals circulation of CHIKV with novel genetic changes in Kerala, South India, in the 2007 and 2008 disease outbreaks. Evolvement of a new genetic clade among these viruses was also observed in phylogenetic analysis. The samples analyzed represent only a small cross section of the cases from massive Chikungunya outbreaks in the state. However, the study points out genome microevolution of the viral strains, which might have affected the disease profile. Functional studies on these mutant viruses would help to understand the correlation of these genetic changes to CHIKV virulence and pathogenesis. Below is the link to the electronic supplementary material. (PDF 36 kb)
  43 in total

1.  Complete nucleotide sequence of chikungunya virus and evidence for an internal polyadenylation site.

Authors:  Afjal Hossain Khan; Kouichi Morita; Maria Del Carmen Parquet; Futoshi Hasebe; Edward G M Mathenge; Akira Igarashi
Journal:  J Gen Virol       Date:  2002-12       Impact factor: 3.891

2.  VADAR: a web server for quantitative evaluation of protein structure quality.

Authors:  Leigh Willard; Anuj Ranjan; Haiyan Zhang; Hassan Monzavi; Robert F Boyko; Brian D Sykes; David S Wishart
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Genetic divergence of Chikungunya viruses in India (1963-2006) with special reference to the 2005-2006 explosive epidemic.

Authors:  Vidya A Arankalle; Shubham Shrivastava; Sarah Cherian; Rashmi S Gunjikar; Atul M Walimbe; Santosh M Jadhav; A B Sudeep; Akhilesh C Mishra
Journal:  J Gen Virol       Date:  2007-07       Impact factor: 3.891

4.  A single amino acid change in the nuclear localization sequence of the nsP2 protein affects the neurovirulence of Semliki Forest virus.

Authors:  John K Fazakerley; Amanda Boyd; Marja L Mikkola; Leevi Kääriäinen
Journal:  J Virol       Date:  2002-01       Impact factor: 5.103

Review 5.  The alphaviruses: gene expression, replication, and evolution.

Authors:  J H Strauss; E G Strauss
Journal:  Microbiol Rev       Date:  1994-09

6.  A study on chikungunya outbreak during 2007 in Kerala, south India.

Authors:  M Kannan; R Rajendran; I P Sunish; R Balasubramaniam; N Arunachalam; R Paramsivan; S C Tewari; P Philip Samuel; B K Tyagi
Journal:  Indian J Med Res       Date:  2009-03       Impact factor: 2.375

Review 7.  Chikungunya: an overview.

Authors:  A B Sudeep; D Parashar
Journal:  J Biosci       Date:  2008-11       Impact factor: 1.826

8.  Combined detection and genotyping of Chikungunya virus by a specific reverse transcription-polymerase chain reaction.

Authors:  F Hasebe; M C Parquet; B D Pandey; E G M Mathenge; K Morita; V Balasubramaniam; Z Saat; A Yusop; M Sinniah; S Natkunam; A Igarashi
Journal:  J Med Virol       Date:  2002-07       Impact factor: 2.327

9.  Two Chikungunya isolates from the outbreak of La Reunion (Indian Ocean) exhibit different patterns of infection in the mosquito, Aedes albopictus.

Authors:  Marie Vazeille; Sara Moutailler; Daniel Coudrier; Claudine Rousseaux; Huot Khun; Michel Huerre; Julien Thiria; Jean-Sébastien Dehecq; Didier Fontenille; Isabelle Schuffenecker; Philippe Despres; Anna-Bella Failloux
Journal:  PLoS One       Date:  2007-11-14       Impact factor: 3.240

10.  Mutations in the nuclear localization signal of nsP2 influencing RNA synthesis, protein expression and cytotoxicity of Semliki Forest virus.

Authors:  Kristi Tamm; Andres Merits; Inga Sarand
Journal:  J Gen Virol       Date:  2008-03       Impact factor: 3.891

View more
  21 in total

1.  The synergistic effect of nsP2-L618, nsP3-R117, and E2-K187 on the large plaque phenotype of chikungunya virus.

Authors:  Benjawan Thoka; Thitigun Jaimipak; Supachoke Onnome; Sutee Yoksan; Sukathida Ubol; Rojjanaporn Pulmanausahakul
Journal:  Virus Genes       Date:  2017-11-28       Impact factor: 2.332

2.  Neutralization activity of patient sera collected during the 2008-2009 Chikungunya outbreak in Thailand.

Authors:  Natsuko Kishishita; Mikiko Sasayama; Naokazu Takeda; Areerat Sa-Ngasang; Atchareeya Anuegoonpipat; Surapee Anantapreecha
Journal:  J Clin Microbiol       Date:  2014-11-05       Impact factor: 5.948

3.  Genetic characterization of Chikungunya virus circulating in individuals from Paraná, Brazil.

Authors:  Deborah de Castro Moreira; Francisco Ferreira Duarte Junior; José Renato Pattaro Júnior; Fernando Américo Jorge; Fabiana Dos Santos Rando; Mateus Vaillant Thomazella; Mayra Marinho Presibella; Irina Nastassja Riediger; Maria Aparecida Fernandez; Dennis Armando Bertolini
Journal:  Braz J Microbiol       Date:  2022-01-14       Impact factor: 2.214

4.  Interferon regulated gene (IRG) expression-signature in a mouse model of chikungunya virus neurovirulence.

Authors:  Sreeja R Nair; Rachy Abraham; Sankar Sundaram; Easwaran Sreekumar
Journal:  J Neurovirol       Date:  2017-10-24       Impact factor: 2.643

5.  Inhibition of chikungunya virus replication by harringtonine, a novel antiviral that suppresses viral protein expression.

Authors:  Parveen Kaur; Meerra Thiruchelvan; Regina Ching Hua Lee; Huixin Chen; Karen Caiyun Chen; Mah Lee Ng; Justin Jang Hann Chu
Journal:  Antimicrob Agents Chemother       Date:  2012-10-22       Impact factor: 5.191

6.  Molecular characterization of Chikungunya virus isolates from clinical samples and adult Aedes albopictus mosquitoes emerged from larvae from Kerala, South India.

Authors:  Kudukkil P Niyas; Rachy Abraham; Ramakrishnan Nair Unnikrishnan; Thomas Mathew; Sajith Nair; Anoop Manakkadan; Aneesh Issac; Easwaran Sreekumar
Journal:  Virol J       Date:  2010-08-13       Impact factor: 4.099

7.  Molecular characterization and phylogenetic analysis of Chikungunya virus from Delhi, India.

Authors:  Kundan Tandel; Mahadevan Kumar; S P S Shergill; Kavita Sahai; R M Gupta
Journal:  Med J Armed Forces India       Date:  2018-05-24

8.  Induction of cytopathogenicity in human glioblastoma cells by chikungunya virus.

Authors:  Rachy Abraham; Prashant Mudaliar; Aiswaria Padmanabhan; Easwaran Sreekumar
Journal:  PLoS One       Date:  2013-09-25       Impact factor: 3.240

9.  A novel 2006 Indian outbreak strain of Chikungunya virus exhibits different pattern of infection as compared to prototype strain.

Authors:  Abhishek Kumar; Prabhudutta Mamidi; Indrani Das; Tapas K Nayak; Sameer Kumar; Jagamohan Chhatai; Subhasis Chattopadhyay; Amol R Suryawanshi; Soma Chattopadhyay
Journal:  PLoS One       Date:  2014-01-20       Impact factor: 3.240

10.  Assessment of packed bed bioreactor systems in the production of viral vaccines.

Authors:  Ramya Rajendran; Rajendra Lingala; Siva Kumar Vuppu; Bala Obulapathi Bandi; Elaiyaraja Manickam; Sankar Rao Macherla; Stéphanie Dubois; Nicolas Havelange; Kapil Maithal
Journal:  AMB Express       Date:  2014-04-25       Impact factor: 3.298

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.