| Literature DB >> 30059496 |
Yi Tan1, Brett E Pickett1, Susmita Shrivastava1, Lionel Gresh2, Angel Balmaseda3, Paolo Amedeo1, Lihui Hu1, Vinita Puri1, Nadia B Fedorova1, Rebecca A Halpin1, Matthew P LaPointe1, Marshall R Cone4, Lea Heberlein-Larson4, Laura D Kramer5, Alexander T Ciota5, Aubree Gordon6, Reed S Shabman1, Suman R Das1, Eva Harris7.
Abstract
Chikungunya virus (CHIKV) has been detected sporadically since the 1950s and includes three distinct co-circulating genotypes. In late 2013, the Asian genotype of CHIKV was responsible for the Caribbean outbreak (CO) that rapidly became an epidemic throughout the Americas. There is a limited understanding of the molecular evolution of CHIKV in the Americas during this epidemic. We sequenced 185 complete CHIKV genomes collected mainly from Nicaragua in Central America and Florida in the United States during the 2014-2015 Caribbean/Americas epidemic. Our comprehensive phylogenetic analyses estimated the epidemic history of the Asian genotype and the recent Caribbean outbreak (CO) clade, revealed considerable genetic diversity within the CO clade, and described different epidemiological dynamics of CHIKV in the Americas. Specifically, we identified multiple introductions in both Nicaragua and Florida, with rapid local spread of viruses in Nicaragua but limited autochthonous transmission in Florida in the US. Our phylogenetic analysis also showed phylogeographic clustering of the CO clade. In addition, we identified the significant amino acid substitutions that were observed across the entire Asian genotype during its evolution and examined amino acid changes that were specific to the CO clade. Deep sequencing analysis identified specific minor variants present in clinical specimens below-consensus levels. Finally, we investigated the association between viral phylogeny and geographic/clinical metadata in Nicaragua. To date, this study represents the largest single collection of CHIKV complete genomes during the Caribbean/Americas epidemic and significantly expands our understanding of the emergence and evolution of CHIKV CO clade in the Americas.Entities:
Mesh:
Year: 2018 PMID: 30059496 PMCID: PMC6085065 DOI: 10.1371/journal.pntd.0006670
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Regions of sampling countries.
| Genotype | Region | Sampling country | No. of samples |
|---|---|---|---|
| ECSA | CAfrica: Central Africa | Central African Republic, Democratic Republic of the Congo, Gabon | 6 |
| EAfrica: East Africa | Kenya, Tanzania, Uganda | 7 | |
| EAsia: East Asia | China, Japan | 20 | |
| Euro: Europe | France, Italy | 6 | |
| IndiaO: Indian Ocean | Comoros, Madagascar, Mauritius, Mayotte, La Reunion island | 15 | |
| MLSEA: Mainland Southeast Asia | Thailand, Cambodia, Myanmar | 30 | |
| MTSEA: Maritime Southeast Asia | Indonesia, Malaysia, Singapore | 16 | |
| SAfrica: South Africa | South Africa, Angola | 3 | |
| SAmerica: South America | Brazil | 4 | |
| SAsia: South Asia | India, Sri Lanka, Bangladesh | 63 | |
| WAfrica: West Africa | Senegal | 1 | |
| WAsia: West Asia | Yemen | 1 | |
| Asian | CAmerica: Central America | Nicaragua, EI Salvador, Guatemala, Honduras, Panama | 120 |
| NAmerica: North America | USA, Mexico | 17 | |
| SAmerica: South America | Brazil, Colombia, Guyana | 10 | |
| CariB: Caribbean | Saint Barthelemy, Dominican Republic, Haiti, Jamaica, Saint Lucia, Martinique, Puerto Rico, British Virgin Islands, US Virgin Islands, Trinidad and Tobago | 84 | |
| PaciO: Pacific Ocean | Micronesia, New Caledonia, French Polynesia, Samoa | 7 | |
| EAsia: East Asia | China | 2 | |
| SAsia: South Asia | India | 5 | |
| MLSEA: Mainland Southeast Asia | Thailand | 6 | |
| MTSEA: Maritime Southeast Asia | Indonesia, Malaysia, Philippines | 19 |
aRegions were assigned based on the country of origin for 442 sequences with detailed travel history information.
Fig 1Maximum likelihood phylogenetic tree of complete coding nucleotide sequences of CHIKV.
Well-supported nodes by bootstrap values over 70% are marked in the tree. Three genotypes (West African, ECSA, and Asian) and clades (Indian Ocean and Caribbean outbreaks) are described in the trees. New sequences sampled from the Americas are colored in red. The phylogenetic tree is midpoint-rooted, and the scale bar represents the number of nucleotide substitutions per site. Arrows indicate known recent travel of the human hosts to Africa or Asia in the ECSA genotype.
Fig 2Phylodynamics of CHIKV Asian genotype epidemics.
Time-scaled maximum clade credibility (MCC) tree of the CHIKV Asian genotype was inferred by Bayesian analysis. Sequences from different regions are colored as described in the key. The Caribbean outbreak (CO) clade is supported by Bayesian posterior probabilities (BPP) as 1 in BMCMC analysis and bootstrap values 96% in the maximum likelihood phylogeny. Subclades CO1 and CO2, supported by BPP over 0.98, are also marked in the tree. A small group of Nicaraguan sequences that are relatively distinct from other Nicaraguan sequences are highlighted by a red hash symbol (#). The major ancestral geographic states (state probabilities over 0.5) at the backbone of phylogenetic tree are colored by region and labeled in the tree. The nodes with state probabilities greater than 0.9 are marked by asterisks. The mean and 95% highest posterior density (HPD) of the most recent common ancestor (tMRCA) of the entire Asian genotype and tMRCAs of virus movement between regions are also indicated. The scale bar represents chronological time (in years).
Fig 3Phylogeographic clustering of Caribbean/Americas epidemic CHIKV strains.
The Bayesian maximum clade credibility (MCC) tree of CHIKV Asian genotype was annotated to reflect the geographic origin of samples collected in the Americas. The CO clade was split into three parts, magnified, and indicated by gray shading: subclade CO1 (right side of the tree), subclade CO2 (left middle panel), and non-CO1/CO2 strains (left lower panel). Sequences from different regions are colored in branches as described in the key. The original countries of infections (either indigenous or travel-related) are depicted with colored ovals next to the complete strain names and described in the key as well.
Fig 4Maximum likelihood tree of CHIKV Asian genotype.
The major nodes with bootstrap values above 70% are indicated. The CO clade, supported by bootstrap value 96.6%, is colored in red. Amino acid substitutions that contribute to clade diversity are mapped to the branches of the tree. CO-clade-specific amino acid substitutions are marked in red. Monophyletic groups within CO clade and associated amino acid changes are marked in blue. The scale bar represents genetic distance.
Significant residue diversity between CO clades.
| Polyprotein Name | Compared Clades | Polyprotein Position | Mature Peptide Position | P-value | Residue Diversity (Clade 1) | Residue Diversity (Clade 2) | Residue Diversity (Clade 3) | Residue Diversity (Clade 4) |
|---|---|---|---|---|---|---|---|---|
| nsp | CO1 vs CO2 | NSP3, 48 | 1.39E-08 | 142 S | 13 C, 42 S | |||
| CO2.1 vs CO2.2 | NSP3, 48 | 6.89E-05 | 13 C | 7 S | ||||
| CO1.1 vs CO1.4 vs CO1.6 vs CO1.7 | 940 | NSP2, 405 | 5.68E-05 | 38 S | 9 S | 11 S | 7 L, 13 S | |
| 1157 | NSP2, 622 | 2.63E-05 | 38 E | 6 E, 3 G | 11 E | 20 E | ||
| 1720 | NSP3, 387 | 2.63E-05 | 38 A | 6 A, 3 T | 11 A | 20 A | ||
| 1827 | NSP3, 494 | 2.63E-05 | 38 L | 6 L, 3 P | 11 L | 20 L | ||
| 2452 | NSP4, 589 | 2.63E-05 | 38 F | 6 F, 3 L | 11 F | 20 F | ||
| sp | CO1 vs CO2 | 2984 | E2, 185 | 0.03109 | 142 S | 3 C, 52 S | ||
| E1, 155 | 9.63E-05 | 142 T | 7 I, 48 T | |||||
| CO2.1 vs CO2.2 | E1, 155 | 6.89E-05 | 13 T | 7 I | ||||
| CO1.1 vs CO1.4 vs CO1.6 vs CO1.7 | 3167 | E2, 368 | 0.01345 | 29 A, 9 V | 9 A | 11 A | 20 A |
aPosition in RefSeq strain S27, GenBank Accession: NC_004162.
bUnderlined positions indicate significant differences between multiple clades.
cWe did not find this site in maximum likelihood phylogeny.
Fig 5Structural analysis of clade-specific amino acid residues.
Threonine 155 in the E1 protein (magenta) and Serine 185 in the E2 protein (green) were visualized on the three-dimensional hetero-multimeric structure (PDB: 3J2W). The colored insets show a magnified view of the regions surrounding T155 and S185 (magenta and green, respectively).
Results of the phylogeny-trait association tests for demographic and clinical characteristics of chikungunya patients in Nicaragua.
| Comparison | Statistic | Observed value | Null value | |
|---|---|---|---|---|
| Sex of patient | AI | 5.20 (4.27–6.11) | 5.31 (4.27–6.29) | 0.42 |
| Male: 38/108 | PS | 30.09 (28.0–32.0) | 31.48 (28.25–34.21) | 0.20 |
| Age of patient | AI | 8.32 (7.40–9.26) | 8.02 (6.94–9.08) | 0.67 |
| <5 years: 28/108 | PS | 52.17 (50.0–55.0) | 51.12 (47.05–54.78) | 0.64 |
| 6–10 years: 32/108 | ||||
| 11–18 years: 45/108 | ||||
| >18 years: 3/108 | ||||
| Dehydration: 29/95 | AI | 3.01 (2.29–3.73) | 5.04 (4.15–5.89) | |
| PS | 22.08 (20.0–24.0) | 29.65 (26.99–32.0) | ||
| Headache: 43/95 | AI | 4.01 (3.21–4.81) | 5.04 (4.02–5.98) | 0.05 |
| PS | 28.43 (26.0–31.0) | 31.49 (27.93–34.59) | 0.05 | |
| Vomiting: 22/95 | AI | 3.16 (2.46–3.88) | 3.58 (2.79–4.34) | 0.17 |
| PS | 20.13 (19.0–21.0) | 20.04 (18.23–21.45) | 0.43 | |
| Myalgia: 49/95 | AI | 3.67 (2.85–4.47) | 5.60 (4.62–6.53) | |
| PS | 28.65 (26.0–31.0) | 34.84 (31.33–37.96) | ||
| Rash: 24/95 | AI | 2.89 (2.21–3.59) | 3083 (3.02–4.58) | |
| PS | 19.77 (18.0–21.0) | 21.59 (19.35–23.23) | 0.1 | |
| Abdominal pain: 10/95 | AI | 2.19 (1.66–2.72) | 1.87 (1.34–2.40) | 0.85 |
| PS | 9.99 (10.0–10.0) | 9.69 (8.82–10.0) | 1 | |
| Arthralgia: 76/95 | AI | 2.53 (1.92–3.15) | 3.22 (2.46–3.94) | 0.07 |
| PS | 15.15 (14.0–16.0) | 17.56 (15.93–18.72) | ||
| Retro-orbital pain: 10/95 | AI | 1.56 (1.02–2.13) | 1.87 (1.36–2.38) | 0.15 |
| PS | 9.96 (10.0–10.0) | 9.67 (8.82–10.0) | 1 | |
| Fever: 93/95 | AI | 0.71 (0.67–0.75) | 0.41 (0.18–0.64) | 0.99 |
| PS | 2.0 (2.0–2.0) | 1.99 (2.0–2.0) | 1 |
aNo. of cases/total No. of patients
bAI, association index; PS, parsimony score.
cValue with 95% confidence interval.
dStatistically significant results are in bold.