Literature DB >> 26414178

The Complete Genome Phylogeny of Geographically Distinct Dengue Virus Serotype 2 Isolates (1944-2013) Supports Further Groupings within the Cosmopolitan Genotype.

Akhtar Ali1, Ijaz Ali1.   

Abstract

Dengue virus serotype 2 (DENV-2) isolates have been implicated in deadly outbreaks of dengue fever (DF) and dengue hemorrhagic fever (DHF) in several regions of the world. Phylogenetic analysis of DENV-2 isolates collected from particular countries has been performed using partial or individual genes but only a few studies have examined complete whole-genome sequences collected worldwide. Herein, 50 complete genome sequences of DENV-2 isolates, reported over the past 70 years from 19 different countries, were downloaded from GenBank. Phylogenetic analysis was conducted and evolutionary distances of the 50 DENV-2 isolates were determined using maximum likelihood (ML) trees or Bayesian phylogenetic analysis created from complete genome nucleotide (nt) and amino acid (aa) sequences or individual gene sequences. The results showed that all DENV-2 isolates fell into seven main groups containing five previously defined genotypes. A Cosmopolitan genotype showed further division into three groups (C-I, C-II, and C-III) with the C-I group containing two subgroups (C-IA and C-IB). Comparison of the aa sequences showed specific mutations among the various groups of DENV-2 isolates. A maximum number of aa mutations was observed in the NS5 gene, followed by the NS2A, NS3 and NS1 genes, while the smallest number of aa substitutions was recorded in the capsid gene, followed by the PrM/M, NS4A, and NS4B genes. Maximum evolutionary distances were found in the NS2A gene, followed by the NS4A and NS4B genes. Based on these results, we propose that genotyping of DENV-2 isolates in future studies should be performed on entire genome sequences in order to gain a complete understanding of the evolution of various isolates reported from different geographical locations around the world.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26414178      PMCID: PMC4587552          DOI: 10.1371/journal.pone.0138900

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Dengue is an emerging and re-emerging infectious disease caused by a mosquito-borne, single-stranded, positive-sense RNA virus named the dengue virus (DENV) (genus Flavivirus, family Flaviviridae). DENV has four antigenically related but genetically different serotypes: DENV-1, DENV-2, DENV-3 and DENV-4. The genome of DENV is approximately 11 kb, containing a single open reading frame (ORF) flanked by 5´ and 3´ UTRs. Translation of the ORF produces a large polyprotein that is cleaved into 10 mature proteins. The N-terminal of the polyprotein encodes three structural proteins: capsid (C), premembrane/membrane (PrM/M), and envelope (E), as well as seven non-structural (NS) proteins: NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5, which are flanked by 5´ and 3´-non-translated regions (5´ NTR/3´NTR) [1, 2]. The first reported epidemics of dengue fever occurred from 1779–1780 in Asia, Africa, and North America; however, the first global pandemic began after World War II [3]. Over the next 60 years, the geographic distribution of dengue expanded considerably, and now all four serotypes of the virus are circulating in Asia, Africa, and the Americas [4]. Different DENV serotypes (DENV-1, DENV-2, DENV-3 and DENV-4) are important with respect to their association with sylvatic cycles, DF outbreaks, and low to high transmission to humans, as well as DHF or dengue shock syndrome (DSS) [1]. Our study focuses solely on the DENV-2 isolates as this serotype is more prevalent worldwide and has been associated with a number of epidemics. In addition, large numbers of complete genome sequences of DENV-2 from diverse geographical locations are available in GenBank, as compared to serotypes 1, 3, and 4. DENV-2 is also the most frequently circulating serotype in Pakistan, as reported in several outbreaks from 2005–2013 in Pakistan, and 10 complete genome sequences of Pakistan DENV-2 isolates are available in the GenBank database. Therefore, we restricted this study strictly to DENV-2 isolates, and future studies will be focused on the remaining serotypes, depending on the availability of complete genome sequences from different countries. The evolutionary history of dengue viruses is recent, but DENV-2 is believed to have emerged 120 to 215 years ago [5, 6, 7, 8]. DENV-2 has been linked to severe epidemics of DHF in various geographical regions of the world [9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21]. Recently, severe epidemics of DENV-2 caused high morbidity and mortality in South Asia [11, 12, 22]. Being a Flavivirus, DENV-2 is prone to rapid mutation as it replicates in known hosts, such as human beings and mosquitoes of the Aedes genus. It has also been recently detected in bats [23]. Reverse transcription polymerase chain reaction (RT-PCR) and real-time PCR have been used for many years for the identification of DENV serotypes [24]. However, increasing viral intra-genetic diversity requires a more effective method for genotype identification [1]. Phylogenetic analysis based on individual gene sequences has, therefore, recently proved useful for the genotyping of DENV-2 [1, 7, 25]. Based on envelope gene sequences, DENV-2 has been divided into five distinct genotypes: American, Asian-American, Asian-I, Asian-II, and Cosmopolitan [7]. Various genotypes of DENV-2 have been the causative agents of the worst epidemics, resulting in high morbidity and mortality in a number of different countries [9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 26]. A plethora of phylogenetic studies have previously used partial DENV-2 genomic sequences of the capsid, PrM, envelope, or other genes for finding viral factors responsible for pathogenicity and evolutionary and transmission trends [8, 9, 11, 12, 13, 22, 27, 28, 29, 30, 31, 32, 33]. Several studies have used a partial or truncated 5´ or 3´ C-PrM region for conducting phylogenetic analysis [11, 29, 33]. A few studies have recently used ORF or complete genome sequences for finding the phylogenetic relationship of DENV-2 isolates restricted to a particular region or country [10, 13, 22, 25, 34]. Partial genomic sequences are frequently being used for evolutionary analysis of DENV-2. Thus, there is a need to determine whether focusing on a particular gene or utilizing entire genome sequences is best suited for the genotyping of DENV-2 isolates worldwide. In this study, we compared the complete genome sequences of 50 DENV-2 isolates (isolated from 1944 to 2013), including 10 Pakistan isolates, in order to determine genetic diversity, selection pressure on particular genes, and evolutionary distances over time.

Material and Methods

Source of sequences and phylogenetic tree

Entire genome sequences of 50 DENV-2 isolates (Table 1) were selected and retrieved from the GenBank NCBI database, as they were representative of diverse geographical locations in 19 different countries spanning South Asia, Southeast Asia, the Far-East, Africa, Australia, North America, and South America. Minor and major dengue outbreaks had previously been recorded in these regions and whole-genome sequences of DENV-2 isolates had been characterized. In addition, these 50 DENV-2 isolates were divided into seven temporal classes, which included one isolate each from 1944 and 1964, five isolates from 1970–1980, six isolates from 1981–1990, five isolates from 1991–2000, 24 isolates from 2001–2010, and eight isolates from 2011–2013. (The number of DENV-2 isolates selected fluctuates decade by decade according to the availability of complete genome sequences in the GenBank database). We randomly selected representative isolates of DENV-2 from South and North America that included isolates from Columbia, Peru and the USA. Therefore, we did not include isolates from Brazil and Venezuela. Previous studies have used a total of either 22 [1] or nine [25] DENV-2 isolates for classification and evolutionary analysis, including the New Guinea C isolate (NGC), which is generally used as the standard for temporal analysis targeting evolutionary divergence. We included representative sequences from a number of geographical regions in order to have a broader picture and clearer understanding of the spatio-temporal evolution and classification of DENV-2. Although there are temporal standards for analysis, there is no consensus on criteria for selecting spatial representative sequences.
Table 1

Locations and names of DENV-2 isolates collected worldwide during 1944–2013.

CountryName of isolateGenome (bp)YearNucleotide AccessionProtein AccessionReference
AustraliaTSV01107231993AY037116AAK67712[46]
BruneiDS09-280106107092006EU179859ABW06614[47]
DS31-291005107092005EU179857ABW06612[47]
Burkina Faso1349107231983EU056810ABW74619[48]
ChinaGD01/03107232003FJ196853ACN54392[49]
44107231989AF204177AAF18446Direct submission
43107231987AF204178AAF18447Direct submission
China 04107231985AF119661AAD18036Direct submission
FJ11/99107231999AF359579AAK49562Direct submission
FJ-10107232000AF276619AAF86463Direct submission
China/IndiaQHD13CAIQ107232013KF479233AHA42535[32]
ColombiaCO/BID-V3358106671986GQ868592ACW82882Direct submission
FijiFJ/UH21/1971107141971HM582099ADM26218[50]
GuamGU/BID-V2950109672001HM488257ADK26435Direct submission
GuatemalaAmerican Asian107252009HQ999999AER45462[34]
IndiaGWL18106702001DQ448231ABE02262Direct submission
IN/BID-V2961106692006FJ898454ACQ44493Direct submission
Od2112106702011JQ955624AFZ40226[22]
RR44106702009JQ955623AFZ40225[22]
1392106702009JX475906AFZ40227[22]
Indonesia1016DN107231975GQ398258ADK37474[51]
1017DN107231976GQ398259ADK37475[51]
1070DN107231976GQ398260ADK37476[51]
98900663DHF107231998AB189122BAD42415Direct submission
BA05i107232004AY858035AAW51406[52]
1022DN107241975GQ398268ADK37484[51]
New GuineaNGC107241944AF038403AAC59275[53]
PakistanPak-L-2011107232011KF041234AHC72406[33]
Pak-L-2011107232011KF041232AHC72404[33]
Pak-K-2009107232009KF041237AHC72409[33]
Pak-K-2009107232009KF041235AHC72407[33]
Pak-M-2011107232011KF041233AHC72405[33]
PakL-2013106292013KJ010186AHM25910Direct submission
Pak—L-2011106292011KJ010185AHM25909Direct submission
Pak-L-2010106292010KF360005AHA80987Direct submission
Pak-L-2008107232008KF041236AHC72408[33]
PeruPE/NFI1159107232010KC294223AGX15388[54]
PE/IQA 2080107232010KC294221AGX15386[54]
SingaporeSG/D2Y98P-PP1107232009JF327392AEI29060[56]
SG/05K3295DK1107232005EU081177ABW82013[57]
Sri LankaLK/BID-V2421106292003GQ252676ACS32038Direct submission
LK/BID-V2422106282004GQ252677ACS32039Direct submission
LK/BID-V2416106281996FJ882602ACQ44409Direct submission
Taiwan1222-DF-06106712002DQ645546ABG29081Direct submission
TW/BID-V5056106152008HQ891024AEH59344Direct submission
ThailandTH/BID-V3357106781964GQ868591ACW82881Direct submission
USAUS/BID-V5412104842007JF730050AEH59345Direct submission
US/BID-V5055104882008JN796245AET72454Direct submission
IQT1797106741998AF100467AAD32962[55]
VietnamVN/BID-V735106782006EU482672ACA48939Direct submission
USA-DENV1US/Hawaii/1944107341944EU848545ACF49259Direct submission
Complete sequences of each isolate were manually fragmented into 13 segments that included 5´UTR, C, PrM/M, E, NS1, NS2A, NS2B, NS3, NS4A, NS4B, NS5, 3´UTR and the complete ORF. Nucleotide sequences of all individual genes, 5´ and 3´ UTRs, ORFs, and whole-genome nt or aa sequences were aligned using the Clustal X program [35]. Phylogenetic analysis was performed with the MEGA5 program [36] using the maximum-likelihood (ML) method, based on the general time reversible (GTR) or GTR+I+G nucleotide substitution models. The robustness of all ML trees was tested with 1000 bootstrap replications.

Bayesian MCMV evolutionary analysis

After phylogenetic analysis using ML trees, the Bayesian Markov chain Monte Carlo (MCMC) approach, as implemented in the BEAST package v1.8.2 (Available online at http://tree.bio.ed.ac.uk/software/) was used to analyze the complete genome sequences of the 50 DENV-2 isolates. The data were analyzed using the Bayesian Skyline speciation model and the GTR+G model of evolution with empirical base frequencies and lognormal relaxed clock with 20 million generations. We set a burn-in of 20% for posterior probabilities and then examined the results using Tree Annotator followed by TRACER v1.6 programs from the BEAST package. The tree was visualized in Fig tree v1.4.2. The complete genome of the DENV-1 US/Hawaii isolate (Table 1) was also downloaded from GenBank and was used as an out-group in the BEAST analysis (Fig 1B).
Fig 1

Phylogenetic analysis of DENV-2 complete genome nucleotide sequences.

(A) Maximum-likelihood trees were constructed using MEGA V5.05 software with bootstrap support of 1000 replicates. All nucleotide sequences were downloaded from the GenBank database for analysis (Table 1). The phylogenetic tree was constructed using the General Time Reversible (GTR) model. (B) Bayesian Maximum Clade Credibility tree of the 50 DENV-2 isolates. Seven groups including five major genotypes were identified. In the Cosmopolitan genotype, there were three groups (C-I, C-II and C-III) while C-I was sub-grouped into C-IA and C-IB. The DENV-1 US/Hawaii isolate was used as an out-group.

Phylogenetic analysis of DENV-2 complete genome nucleotide sequences.

(A) Maximum-likelihood trees were constructed using MEGA V5.05 software with bootstrap support of 1000 replicates. All nucleotide sequences were downloaded from the GenBank database for analysis (Table 1). The phylogenetic tree was constructed using the General Time Reversible (GTR) model. (B) Bayesian Maximum Clade Credibility tree of the 50 DENV-2 isolates. Seven groups including five major genotypes were identified. In the Cosmopolitan genotype, there were three groups (C-I, C-II and C-III) while C-I was sub-grouped into C-IA and C-IB. The DENV-1 US/Hawaii isolate was used as an out-group.

Evolutionary distances among DENV-2 isolates

Once the sequences were aligned for either the whole-genome or individual genes, those files were used in the MEGA5 program to determine first the best model and then the overall evolutionary distances among the DENV-2 isolates (Table 2). Bootstrap resampling analyses were performed using 1000 replicates.
Table 2

Evolutionary distances among the complete genome or individual genes of 50 DENV-2 isolates collected from various countries during 1944 to 2013.

Genome/geneSize (Nucleotides)Size (Amino Acids)Mean Distance
Complete genome1072333910.06036 ± 0.00124
ORF102720.06103 ± 0.00136
5´-UTRMinimum (64)
Maximum (96)
Capsid3421140.05316 ± 0.00728
PrM/M4981660.05855 ± 0.00689
Envelope14854950.05316 ± 0.00740
NS110563520.06114 ± 0.00418
NS2A6542180.08272 ± 0.00768
NS2B3901300.06500 ± 0.00819
NS318546180.06113 ± 0.00320
NS4A4501500.06866 ± 0.00782
NS4B7442480.06511 ± 0.00580
NS527039010.05800 ± 0.00275
3´ UTRMinimum (120)0.03885 ± 0.0115
Maximum (643)

Determination of group-specific amino acid patterns

Amino acid sequences of the 50 DENV-2 isolates (Table 1) were retrieved from the GenBank NCBI data base and manually fragmented into individual protein sequences. All the respective amino acid sequences were aligned using the Clustal X program, as above. Group-specific amino acid patterns were determined manually from the aligned sequences in the respective genotype groups.

Results

The size of the genome varied from 10484 to 10724 nucleotides among the 50 DENV-2 isolates with a diverse geographical background encompassing South Asia, East Asia, Africa, Australia, and North and South America. However, the ORF size (10240) was the same among all the DENV-2 isolates (Table 1). The main differences in the sizes of the genomes were due to variations in the length of 5´ or 3´ UTRs rather than in the length of the ORFs.

Phylogenetic analysis based on complete genome nucleotide sequences

Using the ML method, phylogenetic analysis of the 50 DENV-2 isolates showed seven main groups containing five previously defined genotypes (Cosmopolitan, American, Asian-American, Asian-I, and Asian-II). Out of these 50, 35 DENV-2 isolates were clustered in the Cosmopolitan group, three in the American, four in the Asian-American, seven in the Asian-II, and one isolate in the Asian-I group (Fig 1A). Phylogenetic analysis showed three main groups within the Cosmopolitan genotype, which were designated as C-I, C-II, and C-III, with a bootstrap support of 100 (Fig 1A). The C-I group was further divided into two subgroups, which were named Cosmopolitan IA (C-IA) and Cosmopolitan IB (C-IB). The two Sri Lankan and all the Pakistani isolates except one (Accession # KF041236) clustered in C-IA, which is a distinct subgroup but closely related to C-IB, that contains 10 DENV-2 isolates from China, India and Sri Lanka. The only other Pakistani DENV-2 isolate, reported in 2008 from Karachi (Accession # KF041236), clustered with isolates in the C-IB subgroup and was different from the rest of the Pakistani isolates, which clustered in C-1A. This same DENV-2 isolate from Karachi, Pakistan, was more closely related to the Chinese isolates (Fig 1A) than to the other Pakistani isolates. The second main group of the Cosmopolitan genotype (C-II) contained four isolates, three Indonesian DENV-2 isolates from 1975/76 and one being from Burkina Faso originally identified in 1943. Cosmopolitan III (C-III) also made up a distinct major group containing 11 isolates from East Asia, Southeast Asia, Africa and Australia. Both the C-II and C-III groups were supported by a bootstrap value of 100 (Fig 1A). The phylogeny based on the entire genome sequences revealed that the 8 Asian isolates clustered into two distinct groups (Asian-I and Asian-II), and were genetically closer to the Asian-American DENV-2 genotypes (Fig 1A).

BEAST analysis

The phylogenetic tree reconstructed by Bayesian analysis is shown in Fig 1B. The Bayesian tree topology was highly similar to that recovered using ML methods (Fig 1A). The phylogeny also showed that the Cosmopolitan genotype has three main groups: C-I, C-II, and C-III. The C-I group contains subgroups C-IA and C-IB, and the distribution of isolates is exactly the same as obtained from the ML trees (Fig 1A). The posterior probability of each node is denoted by 1 (100%). These results shows that two different methods confirm our conclusions that the Cosmopolitan genotype should be further divided into three groups. The Pakistani DENV-2 isolates are estimated to have emerged in the last 10 years (Fig 1B).

Phylogenetic analysis based on amino acid sequences

Phylogenetic analysis of the 50 DENV-2 isolates based on complete genome amino acid sequences also showed seven distinct main groups (Fig 2). The Cosmopolitan genotype fell into three major groups (C-I, C-II and C-III) with >90 bootstrap support, while C-I was subdivided into C-IA and C-IB. This matches the results of both genotype analyses.
Fig 2

Phylogenetic maximum-likelihood trees of DENV-2 complete genome amino acid sequences.

Trees were constructed using MEGA V5.05 software with bootstrap support of 1000 replicates. All amino acid sequences were downloaded from the GenBank database for analysis, and the respective DENV-2 isolates with their accession numbers are listed in Table 1.

Phylogenetic maximum-likelihood trees of DENV-2 complete genome amino acid sequences.

Trees were constructed using MEGA V5.05 software with bootstrap support of 1000 replicates. All amino acid sequences were downloaded from the GenBank database for analysis, and the respective DENV-2 isolates with their accession numbers are listed in Table 1.

Phylogenetic analysis based on ORFs

Phylogenetic analysis of all 50 isolates based on complete ORFs showed almost exactly the same results as obtained on the basis of entire genome nucleotide sequences (Fig 3). The topology of the ORF-based tree was highly similar to the complete genome nt or aa trees and showed the same distribution of DENV-2 types (Fig 3).
Fig 3

Phylogenetic maximum-likelihood trees of DENV-2 ORF nucleotide sequences.

Trees were constructed using MEGAV5.05 software with bootstrap support of 1000 replicates. All sequences of the ORF were manually separated from the whole-genome sequences that were downloaded from the GenBank database for analysis. The phylogenetic tree was constructed using the GTR model.

Phylogenetic maximum-likelihood trees of DENV-2 ORF nucleotide sequences.

Trees were constructed using MEGAV5.05 software with bootstrap support of 1000 replicates. All sequences of the ORF were manually separated from the whole-genome sequences that were downloaded from the GenBank database for analysis. The phylogenetic tree was constructed using the GTR model.

Group-specific patterns of aa mutations among various genotypes

When amino acids sequences were aligned from the 50 DENV-2 isolates, group-specific patterns of aa mutations were observed among the various groups. The nature of specific patterns common to various groups (Table 3) and the number of patterns of mutations found in individual groups (Table 4) revealed that the highest number of specific patterns (n = 24) was present in the American isolates, followed by the Asian-American (n = 10), C-IA (n = 7) and C-III (n = 6). The lowest number of type-specific patterns was observed in the case of the Asian-American isolates. No type-specific aa mutations were observed in C-IB, C-II, or Asian-I, alone; however, C-IB shared 14 group-specific mutations with C-IA.
Table 3

Type-specific amino acid mutations among the various groups of DENV-2 isolates.

CosmopolitanAsian-IIAsian-IAsian/AmericanAmerican
C-IAC-IBC-IIC-III
AAMut--AAMutAAMutAAMutAAMut
130R→K - - 108L→M1266N→K - 130R→G275V→I
506T→I--143D→N1301V→A - 1231T→A351A→D
862T→S--892T→A1662R→K - 1236V→A361S→I
1243I→T--906Q→H1820N→S - 1444I→V670N→D
1301V→I--1047K→R2355L→F - 1504K→R848P→S
2891T→A--1298I→T1536K→R887K→R
3352K→R--2418V→I1032N→S
2647V→I1242T→S
2736R→K1407E→D
3327I→V1842A→T
1856K→R
1941K→R
2042I→T
2043K→R
2444I→V
2670C→V
2684K→V
3044V→I
3049E→A
3129T→S
3138L→V
3219V→T
3291R→S
3310Q→L
Total700651052

Footnote: AA: Amino acid, C-IA: Cosmopolitan IA, C-IB: Cosmopolitan IB, C-II: Cosmopolitan II, C-III: Cosmopolitan III, Mut; mutation

Table 4

Total number of type-specific amino acid mutations in individual genes among various genotypes of DENV-2 isolates.

CosmopolitanAsian-IIAsian-IAsian/AmericanAmericanTotal
C-IC-IIC-III
C-IAC-IB
No. of isolates1010411804350
Capsid000100001
PrM/M100100114
E100000034
NS1100300037
NS2A200120218
NS2B000000112
NS3000020237
NS4A000000022
NS4B000010113
NS52000003914
Total 7 0 0 6 5 0 10 24 52
Footnote: AA: Amino acid, C-IA: Cosmopolitan IA, C-IB: Cosmopolitan IB, C-II: Cosmopolitan II, C-III: Cosmopolitan III, Mut; mutation The subgroup C-IA contained seven distinct patterns of aa mutations, distinguishing it from isolates in C-IB and the rest of the DENV-2 isolates used for comparison. Although subgroup C-II shared some aa substitutions with C-III, the latter had six unique patterns that clearly distinguished the isolates in C-II from those in C-III (Tables 3 & 4). The most frequent pattern found at different positions among the isolates was K→R followed by V→I (Table 3). The fewest number of group-specific patterns of mutations (2 each) was found in the C and NS4A genes, while the highest was found in NS5 (n = 14), followed by NS2A (n = 8), with n = 7 in the case of the NS1 and NS3 genes (Table 3). The NS2A gene was the only one that contained group-specific mutations in a majority of the groups (Table 4), which effectively differentiated the DENV-2 genotypes. The nature of the type-specific mutations observed in the NS2A gene is given in Table 3.

Evolutionary distances across the entire genome, ORFs or individual genes of DENV-2

Evolutionary distances across the amino acids sequences of the entire genome, the ORFs, the individual genes and the UTRs were determined. Minimum distances were noted in the 5´ UTR (0.02355 ± 0.05397) and 3´ UTR (0.03885 ± 0.0115), while maximum distances were found in the NS2A gene (0.08272 ± 0.00768), followed by the NS4A (0.06866 ± 0.00782) and NS4B genes (0.06511 ± 0.00580). Evolutionary distances observed in NS5 were less than all other non-structural genes and were most similar to those observed in structural genes (Table 2).

Phylogenetic analysis based on individual genes

The ML trees constructed on the basis of individual structural and non-structural genes indicated that topologies of the structural genes and NS4A were dissimilar to the complete genome nt, aa, and ORF-based trees, either in terms of bootstrap support for different groups of DENV-2 isolates, or distribution of the isolates into various groups. The topologies of the NS5, NS3, NS1, and NS2A-based phylogenetic trees were relatively closer to the whole-genome or ORF trees than all other non-structural or structural genes. However, a clear distinction could be made between C-IA (South Asian) and C-IB (Southeast Asian) genes with considerably high (>95) bootstrap support, except for the structural genes (C, PrM/M, E) and NS4A, where the bootstrap support was not significant (<90), even though the groups were consistent (Fig 4A–4J).
Fig 4

Phylogenetic maximum-likelihood trees of DENV-2 individual gene nucleotide sequences.

All sequences of the individual genes were manually separated from the complete genome sequences that were downloaded from the GenBank database for analysis. The phylogenetic trees were constructed using the GTR model. (A) C gene; (B) PrM/M gene; (C) E gene; (D) NS1 gene; (E) NS2A gene; (F) NS2B gene; (G) NS3; (H) NS4A; (I) NS4B; (J) NS5; (K) 3´ UTR; and (L) 5´UTR.

Phylogenetic maximum-likelihood trees of DENV-2 individual gene nucleotide sequences.

All sequences of the individual genes were manually separated from the complete genome sequences that were downloaded from the GenBank database for analysis. The phylogenetic trees were constructed using the GTR model. (A) C gene; (B) PrM/M gene; (C) E gene; (D) NS1 gene; (E) NS2A gene; (F) NS2B gene; (G) NS3; (H) NS4A; (I) NS4B; (J) NS5; (K) 3´ UTR; and (L) 5´UTR. DENV-2 C-II and C-III grouped separately from both C-IA and C-IB in the ML trees of all genes with high (>70) bootstrap support, except for the C gene, where the support was lower (<70) (Fig 4A–4J). Analysis of the ML trees revealed that NS1, NS2A, NS2B, NS3, and NS5 genes were distinct in most of the Asian and Asian-American types with high bootstrap support (>70), while support for the same groups was lower (<70) in the case of NS4A and the structural genes (Fig 4A–4J). Moreover, the Asian-I DENV-2 isolate (Thailand-1964 #GQ868591) either grouped with the Asian-American types (Fig 4H; ML tree of NS4A) or formed a distinct cluster with a bootstrap support of >90 in the case of the PrM/M gene (Fig 4B). Similarly, an Asian-II isolate (Indonesiona-1975 #GQ398268) was found at a distinct position in the case of the NS4A ML tree (Fig 4H).

Phylogenetic analysis based on the UTR regions

Phylogenies based on the 5´ UTR or 3´ UTR from all 50 isolates was not informative about the distribution of various DENV-2 types, as the isolates changed their positions among various groups containing different DENV-2 isolates (Fig 4K and 4L). The reason for this is that the actual sequence of the 5´ and the 3´ UTRs is unknown, because of the use of conserved primers in sequencing by the scientists who submitted the sequences to GenBank.

Diversity among DENV-2 isolates from Pakistan

Analysis of the nine whole-genome DENV-2 sequences from Pakistan revealed that they have recently evolved from the Sri Lankan isolates and have formed a unique and distinct pattern compared to the rest of the DENV-2 isolates. All Pakistani DENV-2 isolates except one (Pak-K-2008, Accession # KF041236) clustered in the C-IA group with bootstrap support between 90 and 100, based on complete genome nt, aa, or ORF, as well as individual gene-based phylogenies (Figs 1–4J). Seven group-specific patterns of aa mutations (Table 3) were observed in the structural (PrM/M and E) and non-structural (NS1, NS2A, and NS5) genes (Table 4) of the Pakistani isolates in the C-IA subgroup, which do not exist in other DENV-2 isolates reported worldwide.

Discussion

Previously, individual gene sequences of DENV-2 have been used for phylogenetic analysis in order to group them into various genotypes [1, 7, 21, 25, 37]. Most of the studies used E gene sequences for genotyping of DENV-2 [7, 12, 26, 27, 38, 39, 40, 41]; although, other genes have been used by some investigators [1, 11, 22, 25, 29, 42]. Based on the sequences of the E gene, DENV-2 has been divided into 5 genotypes: Cosmopolitan, Asian-I, Asian-II, Asian-American and American [7, 37]. Previous studies suggested that either the PrM/M, E, NS1, NS3, NS4A, and NS5 genes [1], or the ORFs [25], were suitable for the genotyping of DENV-2; however, these studies either did not use a phylogeny based upon whole-genome nt or aa sequences for validation of their result, or used only a limited number of local isolates in a specific country. For example, the most recent study [25] used only nine Chinese whole-genome DENV-2 sequences, or their partial gene sequences, for phylogenetic analysis encompassing only mainland China. In our study, for the first time, 50 complete genomes of DENV-2 isolates reported from geographically distinct regions of the world were chosen for phylogenetic analysis with whole-genome sequences (both nucleotide and amino acid), ORFs, complete sequences of individual genes, and 5´ or 3´ UTRs. In addition, group-specific aa mutations prevalent in various groups of DENV-2 isolates were also observed that had not been reported in previous studies. Our results showed that the recently evolved Pakistani DENV-2 isolates form a separate and distinct subgroup (C-IA) within the main Cosmopolitan genotype, supported by a bootstrap value of 100. Similarly, other individual genes also demonstrate the existence of a distinct subgroup of Pakistani isolates, except for the E, C and NS4A genes, where the bootstrap support was less than 50. However, several investigators have previously used structural genes for the genotyping of DENV-2 isolates [7, 11, 12, 21, 26, 27, 29, 38, 39, 40]. Seven group-specific amino acid mutations in the PrM/M, E, NS1, NS2A, and NS5 genes of Pakistani isolates (C-IA) also differentiated them from the rest of the Cosmopolitan DENV-2 isolates. The C-IB subgroup did not have a group specific amino acid mutation, but as a part of the C-I group (i.e., both C-IA and C-IB), it shared 14 group-specific mutations that distinguished the C-I group from the rest of the isolates. These results suggest that Pakistani DENV-2 isolates have diverged and are evolving distinctly, probably due to several unprecedented outbreaks of DENV-2 in Pakistan since 2005 [11, 12, 29, 43].Only one Pakistani DENV-2 isolate (originally identified in 2008 in the port city of Karachi) fell into the C-IB group, which contained isolates from India, China, and Sri Lanka. There is no recent genetic evidence for the propagation of similar isolates in Pakistan during any recent outbreaks of dengue in Karachi, Lahore, or Swat, Pakistan [12, 43]; whereas, genetically similar types have been reported in India and China (Figs 1–4J). A previous study describing the phylogenetic relationship of the Indian DENV-2 isolates [22] reported on the prevalence of a distinct South-Asian DENV-2 clade in India, which is consistent with our results for the C-IB group that contains all the Indian isolates clustered with isolates from South Asia and Southeast Asia. However, that study [22] used only E gene or ORF sequences for its phylogeny. Results obtained in our study confirmed the unique identity of the South Asian DENV-2 isolates (C-IB) by using whole-genome nt, aa, and ORFs, or all of the individual genes. Previously, Picket et al. [44] reported 19 mutations at various positions in the NS1 gene among various DENV serotypes. However, the seven mutations we identified in the NS1 gene are different from those reported by Picket et al. [44]. In our study, type-specific patterns of amino acid substitutions were identified among various groups as diagnostic markers in order to provide further support for the existence of various groups or genotypes. Substitutions that were consistent within a single group, and not shared by other groups or genotypes, may be helpful in understanding particular evolutionary trends in a region and useful for differential diagnosis. Southeast Asian, East Asian, Australian, and African Cosmopolitan isolates also made up two distinct groups (C-II and C-III) based on whole-genome nt or aa sequences, ORFs, or individual gene-based phylogenies. The existence of the two groups was supported with a bootstrap value between 70 and 100 on the basis of the whole-genome nt and aa trees, as well as all the individual genes (Figs 1–4J). Isolates in C-III contained six group-specific amino acid mutations in the C, PrM/M, NS1, and NS2A genes, with a maximum of three specific patterns found in the NS1 gene. These type-specific amino acid mutations separated them into two distinct groups (Tables 3 and 4). Previously described Asian (Asian-I & Asian-II) and Asian-American Cosmopolitan genotypes [1, 7] also formed distinct groups based on the whole-genome nt, aa, and ORF-based phylogenetic trees with high bootstrap support (>90). However, the bootstrap values of individual genes varied with <70 for structural genes and >90 for nonstructural genes (Fig 4A–4J), indicating that some of the structural genes reflect the same evolutionary trend and distribution as the Asian and Asian-American types, as was observed in the whole-genome phylogeny. Although many investigators have used the E gene for the genotyping of DENV-2 isolates [7, 12, 26, 27, 37, 38, 39, 40, 41], our study indicates that this may not be suitable for the genotyping of geographically distinct DENV-2 isolates. The phylogenetic tree based on the E gene showed low bootstrap values (>30) among the American, Asian, and Asian-American types, as well as the Cosmopolitan groups. Similar results have been reported recently for Chinese isolates based on the E gene [25]. A striking feature of the ML trees was the group-displacement of the previously described Asian-I DENV-2 isolate (Thailand-1964 #GQ868591), which either grouped with the Asian-American types (Fig 4H) or claimed a distinct place with a bootstrap support of >90, based on the PrM/M tree (Fig 4B). Similarly, an Asian II isolate (Indonesia-1975 #GQ398268) was found at a distinct position in the ML tree based on the NS4A gene (Fig 4H). Displacement of these Asian-I and Asian-II genotypes in the ML trees of NS4A and PrM/M genes indicates that they may have gone through recombination events in some geographical locations and may no longer be usable for typing of the Asian genotypes, as has been reported earlier for dengue virus [45]. Among the Asian genotypes (Asian-I & Asian-II), five distinct group-specific amino acid patterns were observed in Asian-I, while the Asian-American types had a total of 10 type-specific patterns that effectively separated them into two separate groups (Tables 3 and 4). Although evolutionary distances recorded over time in the NS4A gene (Table 2) were more than those of the structural genes, the lowest number (two) of group-specific amino acid mutations was observed in the same gene among the American isolates (Table 4). The whole-genome phylogeny divided Asian and American DENV-2 isolates into separate groups with high bootstrap support (>98), while individual gene phylogenies of C, E, NS4A, and NS4B genes also revealed the same groups but with lower support (>70). Interestingly, the American genotypes always formed a separate group and had the maximum number of group-specific amino acid mutations distinguishing them from all other groups. Nine unique patterns of aa mutations were observed solely in the NS5 genes of American isolates, indicating that this particular gene has gone through extensive selection pressure over time. This might be one of the reasons for the lower degree of fitness and subsequent replacement of the American types by the Asian-American DENV-2 (Tables 3 and 4) [8, 10, 26, 41]. Individual full-length gene phylogenies revealed that the NS5, NS3, NS1, and NS2A genes reflect comparatively similar evolutionary trends, as well as the same distribution of DENV-2 isolates with high (>90) bootstrap support as observed on the basis of whole-genome nt, aa, or ORF trees. These could therefore potentially be used for genotyping (Fig 4J, 4G, 4D and 4E). However, all structural genes and one non-structural gene (NS4A) had considerably different topologies of ML trees than did their whole-genome nt or aa trees, and thus do not seem to be suitable for the genotyping of DENV-2 or for finding evolutionary relationships among the isolates. Among the 50 isolates of DENV-2, maximum evolutionary distances were observed in the NS2A gene, followed by the NS4A, NS4B, NS2B, NS1, and NS3 genes, which are all non-structural and important with respect to various enzymatic functions needed during the viral life cycle (Table 2). With the exception of the American isolates, mean evolutionary distances in the NS5 gene were similar to the structural genes (C, PrM/M, and E), suggesting comparatively less evolutionary pressure on the NS5 gene over time. It is possible that structural integrity of the NS5 gene is essential in viral replication of DENV-2 isolates. The maximum number of group-specific mutations was also detected in the NS5 gene, which distinguished the C-IA, C-IB, Asian-American, and American types. The majority of the group-specific mutations were found in the American isolates, which have long since been replaced with other DENV-2 isolates. These group-specific mutations could therefore be used as an important tool for the molecular detection and typing of individual isolates; however, mining of sequencing data from various geographical regions of the world on a much larger scale is needed to devise more accurate assays.

Conclusions

Our analysis of the phylogenetic trees based on complete genomes, ORFs, or individual genes indicated that whole-genome nt, aa, and ORFs are the best options for the classification of DENV-2 isolates into various genotypes or groups, which significantly supports a further subdivision of the Cosmopolitan genotype into C-I (C-IA and C-IB), C-II, and C-III subgroups. Among the individual genes, however, some full-length NS5, NS3, NS1, and NS2A genes comparatively reflect closely related evolutionary trends but do not entirely reflect the same evolutionary trends for all the groups of DENV-2 isolates. For instance, bootstrap support in the case of Cosmopolitan II and III, Asian and Asian-American types, as well as the South Asian isolates, differs between the whole-genome ML trees and the ORF ML trees. In addition, group-specific amino acid mutations identified in this study effectively distinguish different genotypes or groups and could also be used as diagnostic tools for the identification of various DENV-2 isolates.

Recommendations

Firstly, only complete-genome nt or aa sequences, and ORFs should be used for classification and recombination of DENV-2 isolates into genotypes or groups, due to the lower predictive value for individual genes, but not for diagnostic purposes. For diagnostic purposes individual genes such as the NS5 gene phylogeny are sufficient for genotyping. Geographically-distinct, individual DENV-2 isolates currently grouped on the basis of individual genes should be re-assigned to their specific groups based on a complete-genome or ORF phylogeny. The use of partial sequences for determining phylogenetic relationships should be discouraged, in order to refine evolutionary trends. Group-specific patterns of amino acid mutations should be explored in other geographically-distinct DENV-2 isolates, as they could serve as valuable markers for rapid identification and typing.
  52 in total

1.  A single amino acid in nonstructural protein NS4B confers virulence to dengue virus in AG129 mice through enhancement of viral RNA synthesis.

Authors:  Dixon Grant; Grace K Tan; Min Qing; Jowin K W Ng; Andy Yip; Gang Zou; Xuping Xie; Zhiming Yuan; Mark J Schreiber; Wouter Schul; Pei-Yong Shi; Sylvie Alonso
Journal:  J Virol       Date:  2011-06-01       Impact factor: 5.103

2.  Phylogeography and molecular evolution of dengue 2 in the Caribbean basin, 1981-2000.

Authors:  Jerome E Foster; Shannon N Bennett; Christine V F Carrington; Helen Vaughan; W Owen McMillan
Journal:  Virology       Date:  2004-06-20       Impact factor: 3.616

3.  Molecular evolution and distribution of dengue viruses type 1 and 2 in nature.

Authors:  R Rico-Hesse
Journal:  Virology       Date:  1990-02       Impact factor: 3.616

4.  Origins of dengue type 2 viruses associated with increased pathogenicity in the Americas.

Authors:  R Rico-Hesse; L M Harrison; R A Salas; D Tovar; A Nisalak; C Ramos; J Boshell; M T de Mesa; R M Nogueira; A T da Rosa
Journal:  Virology       Date:  1997-04-14       Impact factor: 3.616

5.  Partial nucleotide sequence and deduced amino acid sequence of the structural proteins of dengue virus type 2, New Guinea C and PUO-218 strains.

Authors:  A Gruenberg; W S Woo; A Biedrzycka; P J Wright
Journal:  J Gen Virol       Date:  1988-06       Impact factor: 3.891

6.  Nucleotide sequence of yellow fever virus: implications for flavivirus gene expression and evolution.

Authors:  C M Rice; E M Lenches; S R Eddy; S J Shin; R L Sheets; J H Strauss
Journal:  Science       Date:  1985-08-23       Impact factor: 47.728

7.  Inhibition of interferon signaling by dengue virus.

Authors:  Jorge L Muñoz-Jordan; Gilma G Sánchez-Burgos; Maudry Laurent-Rolle; Adolfo García-Sastre
Journal:  Proc Natl Acad Sci U S A       Date:  2003-11-11       Impact factor: 11.205

8.  Sequence variation of dengue type 2 virus isolated from clinical cases in Thailand.

Authors:  Takeshi Kurosu; Panjaporn Chaichana; Supranee Phanthanawiboon; Chidchanok Khamlert; Akifumi Yamashita; Atchareeya A-nuegoonpipat; Kazuyoshi Ikuta; Surapee Anantapreecha
Journal:  Jpn J Infect Dis       Date:  2014       Impact factor: 1.362

9.  Molecular epidemiology of dengue viruses in southern China from 1978 to 2006.

Authors:  Weili Wu; Zhijun Bai; Houqing Zhou; Zeng Tu; Meiyu Fang; Boheng Tang; Jinhua Liu; Licheng Liu; Jianwei Liu; Weijun Chen
Journal:  Virol J       Date:  2011-06-26       Impact factor: 4.099

10.  Spatiotemporal characterizations of dengue virus in mainland China: insights into the whole genome from 1978 to 2011.

Authors:  Hao Zhang; Yanru Zhang; Rifat Hamoudi; Guiyun Yan; Xiaoguang Chen; Yuanping Zhou
Journal:  PLoS One       Date:  2014-02-14       Impact factor: 3.240

View more
  10 in total

1.  Validation of the Pockit Dengue Virus Reagent Set for Rapid Detection of Dengue Virus in Human Serum on a Field-Deployable PCR System.

Authors:  Jih-Jin Tsai; Li-Teh Liu; Ping-Chang Lin; Ching-Yi Tsai; Pin-Hsing Chou; Yun-Long Tsai; Hsiao-Fen Grace Chang; Pei-Yu Alison Lee
Journal:  J Clin Microbiol       Date:  2018-04-25       Impact factor: 5.948

2.  A Pan-Dengue Virus Reverse Transcription-Insulated Isothermal PCR Assay Intended for Point-of-Need Diagnosis of Dengue Virus Infection by Use of the POCKIT Nucleic Acid Analyzer.

Authors:  Yun Young Go; R P V Jayanthe Rajapakse; Senanayake A M Kularatne; Pei-Yu Alison Lee; Keun Bon Ku; Sangwoo Nam; Pin-Hsing Chou; Yun-Long Tsai; Yu-Lun Liu; Hsiao-Fen Grace Chang; Hwa-Tang Thomas Wang; Udeni B R Balasuriya
Journal:  J Clin Microbiol       Date:  2016-03-30       Impact factor: 5.948

3.  Beneath the surface: Amino acid variation underlying two decades of dengue virus antigenic dynamics in Bangkok, Thailand.

Authors:  Angkana T Huang; Henrik Salje; Ana Coello Escoto; Nayeem Chowdhury; Christian Chávez; Bernardo Garcia-Carreras; Wiriya Rutvisuttinunt; Irina Maljkovic Berry; Gregory D Gromowski; Lin Wang; Chonticha Klungthong; Butsaya Thaisomboonsuk; Ananda Nisalak; Luke M Trimmer-Smith; Isabel Rodriguez-Barraquer; Damon W Ellison; Anthony R Jones; Stefan Fernandez; Stephen J Thomas; Derek J Smith; Richard Jarman; Stephen S Whitehead; Derek A T Cummings; Leah C Katzelnick
Journal:  PLoS Pathog       Date:  2022-05-02       Impact factor: 7.464

4.  Phylogenetic Insight into Zika and Emerging Viruses for a Perspective on Potential Hosts.

Authors:  Diana S Weber; Karen A Alroy; Samuel M Scheiner
Journal:  Ecohealth       Date:  2017-04-18       Impact factor: 3.184

5.  Flavivirus and Filovirus EvoPrinters: New alignment tools for the comparative analysis of viral evolution.

Authors:  Thomas Brody; Amarendra S Yavatkar; Dong Sun Park; Alexander Kuzin; Jermaine Ross; Ward F Odenwald
Journal:  PLoS Negl Trop Dis       Date:  2017-06-16

6.  Global evolutionary history and spatio-temporal dynamics of dengue virus type 2.

Authors:  Kaifa Wei; Yuhan Li
Journal:  Sci Rep       Date:  2017-04-05       Impact factor: 4.379

7.  Co-circulation of the dengue with chikungunya virus during the 2013 outbreak in the southern part of Lao PDR.

Authors:  Viengvaly Phommanivong; Seiji Kanda; Takaki Shimono; Pheophet Lamaningao; Andrew Waleluma Darcy; Nobuyuki Mishima; Bounthanh Phaytanavanh; Toshimasa Nishiyama
Journal:  Trop Med Health       Date:  2016-08-04

8.  Complete Coding Sequences of Five Dengue Virus Type 2 Clinical Isolates from Venezuela Obtained through Shotgun Metagenomics.

Authors:  Erley Lizarazo; Natacha Couto; Maria Vincenti-Gonzalez; Erwin C Raangs; Thomas Jaenisch; Alex W Friedrich; Adriana Tami; John W Rossen
Journal:  Genome Announc       Date:  2018-06-21

9.  Complete Genome Sequences of Dengue Virus Type 2 Epidemic Strains from Reunion Island and the Seychelles.

Authors:  Hervé Pascalis; Leon Biscornet; Céline Toty; Sarah Hafsia; Marjolaine Roche; Philippe Desprès; Célestine Atyame Nten; Jastin Bibi; Meggy Louange; Jude Gedeon; Patrick Mavingui
Journal:  Microbiol Resour Announc       Date:  2020-01-23

10.  Emergence of dengue virus serotype 2 in Mauritania and molecular characterization of its circulation in West Africa.

Authors:  Toscane Fourié; Ahmed El Bara; Audrey Dubot-Pérès; Gilda Grard; Sébastien Briolant; Leonardo K Basco; Mohamed Ouldabdallahi Moukah; Isabelle Leparc-Goffart
Journal:  PLoS Negl Trop Dis       Date:  2021-10-25
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.