| Literature DB >> 28480053 |
N S D Sahadeo1, O M Allicock1, P M De Salazar2,3, A J Auguste4, S Widen4, B Olowokure2, C Gutierrez2, A M Valadere2, K Polson-Edwards2, S C Weaver4, C V F Carrington1.
Abstract
Local transmission of chikungunya virus (CHIKV) was first detected in the Americas in December 2013, after which it spread rapidly throughout the Caribbean islands and American mainland, causing a major chikungunya fever epidemic. Previous phylogenetic analysis of CHIKV from a limited number of countries in the Americas suggests that an Asian genotype strain was responsible, except in Brazil where both Asian and East/Central/South African (ECSA) lineage strains were detected. In this study, we sequenced thirty-three complete CHIKV genomes from viruses isolated in 2014 from fourteen Caribbean islands, the Bahamas and two mainland countries in the Americas. Phylogenetic analyses confirmed that they all belonged to the Asian genotype and clustered together with other Caribbean and mainland sequences isolated during the American outbreak, forming an 'Asian/American' lineage defined by two amino acid substitutions, E2 V368A and 6K L20M, and divided into two well-supported clades. This lineage is estimated to be evolving at a mean rate of 5 × 10-4 substitutions per site per year (95% higher probability density, 2.9-7.9 × 10-4) and to have arisen from an ancestor introduced to the Caribbean (most likely from Oceania) in about March 2013, 9 months prior to the first report of CHIKV in the Americas. Estimation of evolutionary rates for individual gene regions and selection analyses indicate that (in contrast to the Indian Ocean Lineage that emerged from the ECSA genotype followed by adaptive evolution and with a significantly higher substitution rate) the evolutionary dynamics of the Asian/American lineage are very similar to the rest of the Asian genotype and natural selection does not appear to have played a major role in its emergence. However, several codon sites with evidence of positive selection were identified within the non-structural regions of Asian genotype sequences outside of the Asian/American lineage.Entities:
Keywords: Americas; Caribbean; chikungunya virus; complete genome; evolution; selection analysis
Year: 2017 PMID: 28480053 PMCID: PMC5413804 DOI: 10.1093/ve/vex010
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.MCC phylogeny based on the complete ORFs of 149 Asian lineage sequences (Asian149), including all Caribbean sequences. Taxon labels include year of isolation, accession number/sequence ID and country of isolation, except in collapsed clades, which are comprised of two or more sequences from the same country and year. The Asian/American lineage comprises all Caribbean sequences, including the thirty-three newly generated sequences, as well as Asian sequences from Central and South America and a 2015 sequence from French Polynesia (Genbank Accession No. KR559473). The most closely related sequences to the Asian/American lineage were sequences from American Samoa (2014), the Philippines (2013) and Micronesia (2013). Posterior probabilities ≥ 99 per cent are indicated at relevant nodes.
Figure 2.The Asian/American lineage and its most closely related sequence from American Samoa (2014). Taxon labels include year of isolation, accession number/sequence ID and country of isolation except in collapsed clades, which are comprised of two or more sequences from the same country and year. The number of sequences in each collapsed clade is shown in parentheses, (the accession numbers for sequences in these collapsed clades are shown in Supplementary Figure S2). Amino acid changes defining clades within the phylogeny are indicated by coloured bars on the relevant branches (blue = 6K L20M and E2 V368A; red= nsP4 R99Q; green = nsP4 H179Y; orange = E2 V113A, purple = nsP3 Stop521R; pink = E1 T101M). Terminal branches of the tree are colored according to the region to which the country from which the sequence at the tip was derived, i.e. Central America (Mexico); Greater Antilles (Cayman Islands, Dominican Republic, Haiti, Jamaica, Puerto Rico); Leeward Islands (Anguilla, Antigua, British Virgin Islands, Guadeloupe, Montserrat, Saint Barthélemy, St. Kitts and Saint Martin); Lucayan Archipelago (Bahamas and Turks & Caicos); Oceania (American Samoa, French Polynesia, Micronesia and New Caledonia); South America (Brazil, Colombia, Guyana and Suriname); Windward Islands (Barbados, Dominica, Grenada, Martinique, St. Lucia, St. Vincent and Trinidad). Internal branches are colored according to the most probable (model) location of their parental nodes and the location state probability is indicated as a percentage at selected nodes. Also at two selected nodes, in square brackets, are node ages. and corresponding 95% HPDs. Nodes with posterior probabilities (clade credibilities) ≥0.99 are labelled.
Non-synonymous amino acid changes that distinguish CHIKV sequences from the Americas from other closely related Asian genotype sequences (American Samoa 2014, Philippines 2013, Micronesia 2013).
| Gene | Nt change | AA change | Sequence ID | Country (year) |
|---|---|---|---|---|
| Capsid | T7735 C | 56 V → A | 14.03985 | Barbados (2014) |
| C8158T | 197 S → L | 14.01349 | St. Lucia (2014) | |
| E2 | C8824T | 94 T → I | KR046233.1 | Trinidad (2014) |
| T8881C | 113 V → A | 14.00309 | BVI (2014) | |
| KJ451624.1 | BVI (2014) | |||
| CH0008 | Mexico (2014) | |||
| CH0045 | Mexico (2014) | |||
| CH0072 | Mexico (2014) | |||
| LI0036 | Mexico (2014) | |||
| TA0006 | Mexico (2014) | |||
| T9646C | 368 V → A | All | All | |
| 6K | C9870A | 20 L → M | All | All |
| T9909A | 33 C → S | 14.03985 | Barbados (2014) | |
| E1 | C10219T | 101 T → M | KR559491 | Colombia (2014) |
| G10452A | 153 A → T | KR559497 | St. Barts 2014) | |
| 14.04425 | Barbados (2014) | |||
| C10498T | 168 S → L | 14.06638 | Trinidad (2014) | |
| T10867C | 291 V → A | 14.01152 | St. Kitts (2014) | |
| A10890T | 299 M → L | 14.03985 | Barbados (2014) | |
| nsP1 | G590A | 171 R → Q | 14.02961 | Bahamas (2014) |
| 14.02086 | Guyana (2014) | |||
| G603A | 176 V → I | 14.0444 | Suriname (2014) | |
| G765A | 230 G → R | 14.0444 | Suriname (2014) | |
| T775A | 233 L → Q | KR046230.1 | Trinidad (2014) | |
| nsP2 | C3389T | 569 R → C | KR046232.1 | Trinidad (2014) |
| C2416T | 245 P → L | 14.0444 | Suriname (2014) | |
| G2481T | 267 A → S | 14.02086 | Guyana (2014) | |
| C1932A | H644Q | KP164571 | Brazil (2014) | |
| nsP3 | G4430A | 118 G → R | 14.01526 | Antigua (2014) |
| T5646C | 521 stop → R | 14.05081 | Cayman Is (2014) | |
| 14.02961 | Bahamas (2014) | |||
| 14.05085 | Cayman Is (2014) | |||
| 14.01507 | Haiti (2014) | |||
| 14.04561 | Jamaica (2014) | |||
| 14.06350 | Suriname (2014) | |||
| CNR20235H | Saint Martin (2013) | |||
| nsP4 | G5962A | 99 R → Q | 14.0444 | Suriname (2014) |
| KR046233.1 | Trinidad (2014) | |||
| 14.02086 | Guyana (2014) | |||
| KR559490.1 | Guyana (2014) | |||
| KR559496.1 | Guyana (2014) | |||
| C6201T | 179 H → Y | 14.06638 | Trinidad (2014) | |
| 14.02346 | St. Vincent (2014) | |||
| 14.03562 | Grenada (2014) | |||
| 14.0256 | Grenada (2014) | |||
| G6762A | 366 A → T | 14.05085 | Cayman Is (2014) | |
| C7276T | 537 A → V | 14.02961 | Bahamas (2014) |
Nucleotide positions are numbered beginning at the start codon for the complete coding sequence; amino acid positions are numbered from the first amino acid of each gene product.
Substitution rates estimated for the complete coding sequence and for individual genes of CHIKV Asian lineages.
| Substitution rate (95% HPD)/ × 10−4 subs/site/yr | |||
|---|---|---|---|
| Data set | Asian149 | Asian50 | |
| Region | Asian genotype | Asian/ American lineage | Asian lineages paraphyletic to Asian/ American lineage |
| Complete cds | 4.6 (4.1–5.2) | 5.0 (2.9–7.9) | 4.4 (3.8–5.0) |
| CP1 + 2 of complete cds | 0.6 (0.6–0.7) | 0.8 (0.6–0.9) | 0.6 (0.5–0.6) |
| CP3 of complete cds | 5.5 (5.2–5.7) | 4.9 (4.4–5.5) | 5.8 (5.5–6.0) |
| C | 4.2 (3.0–6.0) | 5.0 (2.0–9.0) | 3.7 (2.3–5.3) |
| 6k | 10 .0 (6.0–20.0) | 11.0 (3.0–20.0) | 2.6 (0.3–8.1) |
| E1 | 6.0 (4.0–8.0) | 8.0 (1.0–19.0) | 5.3 (3.5–7.2) |
| E2 | 7.0 (5.0–8.0) | 9.0 (3.0–13.0) | 5.7 (4.2–7.3) |
| E3 | 6.0 (3.0–9.9) | − | 6.6 (2.3–12.2) |
| nsP1 | 4.0 (3.0– 5.0) | 5.0 (2.0–8.0) | 3.7 (2.6–4.8) |
| nsP2 | 4.0 (3.0– 5.0) | 4.0 (3.0–6.0) | 4.2 (3.2–5.0) |
| nsP3 | 7.0 (6.0–9.0) | − | 6.0 (4.5–7.5) |
| nsP4 | 4.0 (3.0– 5.0) | − | 4.5 (2.5 – 17.1) |
aThe Asian/American lineage was not apparent as a distinct clade in phylogenies based on these gene regions.
Positively selected sites within CHIKV Asian lineages.
| Method | Data set | No. of sites | Codon position | P value | AA position | Mutation | Sequences with derived amino acid state |
|---|---|---|---|---|---|---|---|
| Structural region | |||||||
| SLAC | Asian149 | 0 | – | – | – | – | – |
| Asian/American | 0 | – | – | – | – | – | |
| FEL | Asian149 | 0 | – | – | – | – | – |
| Asian/American | 0 | – | – | – | – | – | |
| MEME | Asian149 | 1 | 546 | 0.01 | 221 E2 | K → G | Thailand 1958 (HM045810) |
| Asian/American | – | – | – | ||||
| FUBAR | Asian149 | 2 | 323 | 116.04 | 62 E3 | Q → R | Malaysia 2006 (FN295483-4) |
| 546 | 59.83 | 221 E2 | K → G | Malaysia 2006 (EU703759-63) | |||
| Malaysia 2009 (KX168429) | |||||||
| Thailand 1958 (HM045810) | |||||||
| Asian/American | 0 | – | – | – | – | ||
| Non-structural region | |||||||
| SLAC | Asian149 | 1 | 1,728 | 0.02 | 395 nsP3 | V → P | Malaysia 2006 (FN295483, FN295484) |
| V → S | New Caledonia 2011 (HE806461) | ||||||
| Malaysia 2006 (EU703759-62) | |||||||
| Malaysia 2007 (KM923917-20) | |||||||
| Philippines 1985 (HM045790, HM045800) | |||||||
| Indonesia 1983 (HM045791) | |||||||
| Indonesia 1985 (HM045797) | |||||||
| India 1963 (HN045803) | |||||||
| India 1963 (HM045813) | |||||||
| India 1973 (HM045788) | |||||||
| Asian/American | 0 | – | – | – | – | – | |
| FEL | Asian149 | 0 | – | – | – | – | – |
| Asian/American | 0 | – | – | – | – | – | |
| MEME | Asian149 | 2 | 1,727 | 0.00 | 394 nsP3 | P → M | Malaysia 2007 (KM923917-20) |
| 1,728 | 0.00 | 395 nsP3 | V → P | Malaysia 2006 (EU70359-62) | |||
| V → S | Thailand 1995 (HM045787, HM045796) | ||||||
| Thailand 1996 (KX262987) | |||||||
| Philippines 1985 (HM045790, HM045800) | |||||||
| Indonesia 1983 (HM045791) | |||||||
| Indonesia 1985 (HM)45797) | |||||||
| Thailand 1978 (HM045808) | |||||||
| Thailand 1975 (HM045814) | |||||||
| Thailand 1958 (HM045810) | |||||||
| India 1963 (HM045803, HM045813) | |||||||
| India 1973 (HM045788) | |||||||
| Malaysia 2006 (FN295483, FN295484) | |||||||
| New Caledonia 2011 (HE806461) | |||||||
| Malaysia 2006 (EU703759-62) | |||||||
| Malaysia 2007 (KM923917-20) | |||||||
| Philippines 1985 (HM045790, HM045800) | |||||||
| Indonesia 1983 (HM045791) | |||||||
| Indonesia 1985 (HM045797) | |||||||
| India 1963 (HN045803) | |||||||
| India 1963 (HM045813) | |||||||
| India 1973 (HM045788) | |||||||
| Asian/American | 1 | 1,510 | 0.01 | 177 nsP3 | R → Q | Guadeloupe (2014) LN898098 | |
| FUBAR | Asian149 | 1 | 1728 | 153.20 | 395 nsP3 | V → P | Malaysia 2006 (FN295483, FN295484) |
| V → S | New Caledonia 2011 (HE806461) | ||||||
| Malaysia 2006 (EU703759-62) | |||||||
| Malaysia 2007 (KM923917-20) | |||||||
| Philippines 1985 (HM045790, HM045800) | |||||||
| Indonesia 1983 (HM045791) | |||||||
| Indonesia 1985 (HM045797) | |||||||
| India 1963 (HN045803) | |||||||
| India 1963 (HM045813) | |||||||
| India 1973 (HM045788) | |||||||
| Asian/American | 0 | – | – | – | – | – | |