Literature DB >> 18446239

Genomic diversity and evolution of the lyssaviruses.

Olivier Delmas1, Edward C Holmes, Chiraz Talbi, Florence Larrous, Laurent Dacheux, Christiane Bouchier, Hervé Bourhy.   

Abstract

Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as 'Lagos Bat'. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses.

Entities:  

Mesh:

Year:  2008        PMID: 18446239      PMCID: PMC2327259          DOI: 10.1371/journal.pone.0002057

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Lyssaviruses (LYSSAV) are RNA viruses with single-stranded, negative-sense genomes of the family Rhabdoviridae [1] that infect a variety of mammals causing rabies-like diseases. Rabies is an ancient disease that may have been reported in the Old World before 2300 B.C. [2]. However, the absence of effective control measures in animal reservoir populations combined with a widespread lack of human access to vaccination means that more than 50,000 people annually die of rabies, particularly in Asia and Africa [3], [4]. Currently, there are seven recognised genotypes (GT) of LYSSAV defined on the basis of their genetic similarity [5], [6]: rabies virus (RABV, GT1) responsible for classical rabies in terrestrial mammals globally and in bats on the American continent, as well as the cause of most rabies-related human deaths worldwide [3]; Lagos bat virus (LBV, GT2); Mokola virus (MOKV, GT3); Duvenhage virus (DUVV, GT4); European bat lyssavirus type 1 (EBLV-1, GT5); European bat lyssavirus type 2 (EBLV-2, GT 6); and Australian bat lyssavirus (ABLV, GT7). All genotypes except MOKV (where the host species is unknown) have bat reservoirs, hinting that lyssaviruses originated in these mammals [7]. Additionally, four new lyssavirus genotypes that infect bats in central and southeast Asia have been proposed: Aravan virus, Khujand virus, Irkut virus and West Caucasian Bat virus [8], [9]. The negative-sense LYSSAV genome encodes five proteins: the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G) and RNA polymerase (L) in the order 3′-N-P-M-G-L-5′ [10]. Despite the importance of LYSSAV for human and wildlife populations, the number of complete genome sequences of field isolates of LYSSAV is sparse, with only eight currently available for limited type species [11]–[15]. Herein, we present the first genomic and evolutionary analysis of the seven known genotypes of LYSSAV, therein significantly increasing the extent of available genome sequence data available for these important mammalian pathogens.

Materials and Methods

Viruses and RNA isolation

Total RNA (Table 1) was isolated from original specimens or from suckling mice brain after early passage using Tri-Reagent (Euromedex). The only exception was the 8743THA isolate that was adapted on BSR cells (passage 22). For this isolate, total RNA was isolated from infected BSR cells infected at a low multiplicity of infection (0,1). Reverse transcription was performed with random hexamer primer (Roche Boehringer) using Superscript II (Invitrogen) following the manufacturer instructions.
Table 1

Isolates of lyssavirus analysed in this study.

Genus and nameReference no.Host species/vectorOriginYear of first isolationGenBank accession no.
Lyssavirus Genotype 1
Rabies virus8743THAHumanThailand1983EU293121
Rabies virus8764THAHumanThailand1983EU293111
Rabies virus9147FRAFoxFrance1991EU293115
Rabies virus9001FRADog bitten by a batFrench Guyana1990EU293113
Rabies virus9704ARGBat (Tadarida brasiliensis)Argentina1997EU293116
Rabies virusSHBRV-18Bat (L. noctivagans)USA1983AY705373
Rabies virusNNV-RAB-HHumanIndia2006EF437215
Rabies virusRABVHumanIndia2004AY956319
Rabies virusSADB19VaccineM31046
Rabies virusPVVaccineNC_001542
Genotype 2
Lagos bat virus8619NGABat (Eidolon helvum)Nigeria1956EU293110
Lagos bat virus0406SENBat (Eidolon helvum)Senegal1985EU293108
Genotype 3
Mokola virusMOKVCatZimbabwe1981NC_006429
Mokola virus86100CAMShrewCameroun1974EU293117
Mokola virus86101RCARodentRCA1981EU293118
Genotype 4
Duvenhage virus94286SABat (Minopterus sp.)South Africa1981EU293120
Duvenhage virus86132SAHumanSouth Africa1971EU293119
Genotype 5
European bat lyssavirus 18918FRABat (Eptesicus serotinus)France1989EU293112
European bat lyssavirus 103002FRABat (Eptesicus serotinus)France2003EU293109
European bat lyssavirus 1RV9Bat (Eptesicus serotinus)Germany1968EF157976
Genotype 6
European lyssavirus 29018HOLBat (Myotis dasycneme)Holland1986EU293114
European lyssavirus 2RV1333HumanScotland2002EF157977
Genotype 7
Australian bat lyssavirusABLhHumanAustralia1986AF418014
Australian bat lyssavirusABLbBat (Pteropus species)Australia1996NC_003243

Genbank accession numbers for the newly acquired sequences are designated EU293108-EU293121.

Genbank accession numbers for the newly acquired sequences are designated EU293108-EU293121.

PCR and sequence determination

Long-range PCR products were obtained using ExTaq (Takara) and specific primers (Table S1) using manufacturer recommendations. For sequence determination we used a shotgun base approach called LoPPS (Long PCR Product Sequencing) [16], [17]. 3′ genomic ends were generated by RACE protocol [14] using a 5′ phosphorylated reverse complementary T7 primer. T7 cDNAs were further used for heminested-PCR with ExTaq using T7 and two strain specific primers designed in the N coding region (supplementary Table 1). To determine the 5′ sequence of the genomic RNA we used a 5′RACE version 2.0 kit from Invitrogen following manufacturer instructions. The PCR products (5′ or 3′ RACE) were then purified on gel using Qiaquick gel extraction kit (Qiagen) and cloned in PCR 2.1 TOPO T/A (Invitrogen) for sequencing. Each position of the consensus nucleotide sequence was determined from at least three independent sequences. All consensus sequences obtained using Sequencher 4.7 (Gene Codes) software were aligned using ClustalX 1.83.1 [18]. The untranslated regions were further aligned manually using the SE-AL program (http://tree.bio.ed.ac.uk/). GenBank accession numbers for the sequences newly acquired here are designated EU293108-EU293121.

Phylogenetic analysis

Phylogenetic analysis of LYSSAV genomes was based on a multiple alignment of concatenated coding region sequences (12105 nt). A maximum likelihood (ML) phylogenetic analysis of these data was undertaken using PAUP* [19] employing the best-fit GTR+I+Γ4 model of nucleotide substitution inferred by ModelTest [20]. To determine the extent of support for different groupings on the tree a bootstrap resampling analysis was undertaken employing 1000 replicate neighbor-joining trees estimated under the ML substitution model.

Results and Discussion

In total we determined 14 new complete genome sequences of field isolates representing six (GT1, GT2, GT3, GT4, GT5 and GT6) of the seven genotypes of LYSSAV, with complete genome from GT4 obtained for the first time. These genomes were combined with eight genomes described previously (with the exception of one Australian bat lyssavirus for which leader and trailer sequences are unavailable). Eight field isolates of viruses isolated from humans, canids and bats were chosen as representative of the diversity of GT1. Two vaccine strains (SAD-B19 and PV) were included in all sequence comparisons but not in the phylogenetic analysis. Our study also represent the first analysis of the intrinsic genetic diversity of GT2, GT3, GT4, GT5 and GT6 based on full length genomes. All genomes have the same structural organization although their lengths varied between 11918 nt. (GT7) and 12016 nt. (GT2) (Table 2). The predicted size of the coding regions is similar among genotypes, with the M protein identical in length across all genotypes and the P protein the most variable [14], [21], 22. As observed in other RNA viruses, all genotypes show a bias toward G+C richness [23], with the lowest G+C content observed in GT2 and the highest in GT1 (Table 2). All genomes have a polycistronic genome organization surrounded by untranslated regions (Table S2) similar to that already described [10], [14]. The extent of genetic diversity, reflected in percentage identity, varies within and among proteins (Figure 1), in the order N>L>M>G>P (95.2, 94.2, 92.3, 85.8, 81.5% amino acid identity, respectively). A similar pattern was previously observed using more limited data sets [14]. This same order was also observed in terms of overall selection pressure, measured as the mean ratio of nonsynonymous (dN) to synonymous substitutions (dS) per site (dN/dS), estimated using the maximum likelihood SLAC (Single Likelihood Ancestor Counting; http://www.datamonkey.org/) method [24]: N = 0.048; L = 0.055; M = 0.078; G = 0.119; P = 0.187. This approximately four-fold difference in mean dN/dS reflects major differences in selective constraint among proteins. This trend was also reflected in previous analyses of full length genomes of vaccine strains [22] and through partial gene comparisons [25], [26].
Table 2

Coding potential, genome size (in nucleotides) and G+C content of 24 genomes representing the 7 genotypes of the lyssavirus genus.

Genotype1234567All genotypes
3′UTR 7070707070707070
N protein 13531353135313561356135613531353–1356
N-P 90–94101100–1029090–9610193–9490–102
P protein 894918912897894894894894–918
P-M 87–907580–838383888775–90
M protein 609609609609609609609609
M-G 211–215204203–204191211205–210207–209191–215
G protein 1575156915691602157515751578–15811569–1602
G-L 515–525578–588546–562562–563560511–512508–509508–588
L protein 6384–642963846381–63846384638463846384–63876381–6429
5′UTR 86–131145112–114131130–13113113186–145
Genome 11923–1192812006–1201611940–1195711975–1197611966–1197111924–119301191811918–12016
G+C% 44,9–45,440,9–43,544,1–44,944,1–44,244,6–45,044,843,4–44,240,9–45,4

Upper and lower size ranges are indicated.

Figure 1

Schematic representation of lyssavirus genome organization and sequence similarity among 24 aligned genomes.

A. The 3′ leader, N-, P-, M-, G- and L-coding regions and the 5′ trailer region are shown. B. Sequence similarity is calculated by moving a window of 60 nucleotides along the aligned sequences. C. Sequence similarity is calculated by moving a window of 20 amino acids along the aligned sequences. Within each window, the similarity of any one position is taken to be the average of all the possible pairwise scores at that position and is calculated using PLOTCON (available at http://bioweb.pasteur.fr/seqanal/interfaces/plotcon.html).

Schematic representation of lyssavirus genome organization and sequence similarity among 24 aligned genomes.

A. The 3′ leader, N-, P-, M-, G- and L-coding regions and the 5′ trailer region are shown. B. Sequence similarity is calculated by moving a window of 60 nucleotides along the aligned sequences. C. Sequence similarity is calculated by moving a window of 20 amino acids along the aligned sequences. Within each window, the similarity of any one position is taken to be the average of all the possible pairwise scores at that position and is calculated using PLOTCON (available at http://bioweb.pasteur.fr/seqanal/interfaces/plotcon.html). Upper and lower size ranges are indicated. Our study represents the largest analysis of the 3′ and 5′ UTR of the lyssavirus genomes undertaken to date. The 3′ UTR comprises 70 nt and includes the leader regions potentially transcribed into the leader RNA. The 5′UTR region comprises 86–145 nt and contains the trailer regions of size 68–69 nt. Both the 3′ and 5′ UTR have conserved signals that play a role to modulate replication and transcription (Figure S1) [27]. Our data also reveals a strict complementarity limited to the 9 terminal nucleotides as well as nucleotide positions 14 and 16 from both ends of the genome [14], [28], [29]. There have been several attempts to estimate the evolutionary relationships among lyssaviruses, with most utilizing only one or two genes [1], [7], [21], [26], [30]–[34]. We therefore undertook a phylogenetic analysis of 22 genomes representative of the seven genotypes of LYSSAV based on a multiple alignment of concatenated coding sequences. Our phylogenetic analysis reveals the separation of LYSSAV into two major branches previously defined as different ‘phylogroups’ [7] and 7 component lineages defined as genotypes [5], [35]. Phylogroup 1 comprised GT1, 4, 5, 6 and 7, while phylogroup 2 contains only GT2 and GT3 (Figure 2). Notably, phylogroup 2 contains viruses of sampled exclusively from Africa – LBV and MOKV – while a third African genotype (DUVV) is found within phylogroup 1 [32], [36]. Also of note was the observation that although GT5 and GT6 both circulate in European insectivorous bats [32], the former is more closely related to the African GT4 viruses [32]. Hence, there has clearly been an independent origin of genotypes 5 and 6 in European bats, as previously documented in analyses of the N and G genes in isolation [32]. Finally, that bats appear as the principle host species across such a large phylogeographic range indicates that the association between lyssaviruses and bats is likely to be the ancestral condition (with a secondary loss of bat transmission in GT3), such that the movement of bats is likely to be responsible for the global dissemination of these viruses [7].
Figure 2

Phylogenetic relationships of 22 complete coding regions of LYSSAV genomes representatives of the 7 genotypes.

The phylogeny was inferred using an ML procedure, and all horizontal branches are scaled according to the number of substitutions per site. Boot strap values (>95%) are shown for key nodes. The tree is mid-point rooted for purposes of clarity only.

Phylogenetic relationships of 22 complete coding regions of LYSSAV genomes representatives of the 7 genotypes.

The phylogeny was inferred using an ML procedure, and all horizontal branches are scaled according to the number of substitutions per site. Boot strap values (>95%) are shown for key nodes. The tree is mid-point rooted for purposes of clarity only. Notably, our study represents the first analysis of the genetic diversity of four complete genomes of GT2 and GT4, both of which are African in origin. While little variation is seen within GT4, the degree of divergence among the two GT2 isolates (Lagos bat virusLBV) is striking (23.7% and 12.1% at the nucleotide and amino acid levels, respectively) and greater than that seen within any other genotype. Hence, although 0406SEN and 8619NGA are related according to the arbitrary classification system based on nucleotide identity between N coding regions (80.3% between 0406SEN and 8619NGA compared to a cut-off of 80%) [6], [21], this classification system will likely need to be revised as expanded surveys of LYSSAV in Africa (this study and [37]) and in Eurasia [8], [9], [21] reveal greater genetic diversity. More fundamentally, if, as we suggest, complete genomes represent the best tools for genotyping, we propose that 0406SEN should constitute a new GT8 different from GT2 (8619NGA) and that the genotype division should be set at 76.4 to 81.6% nucleotide identity at coding sequences for all five viral proteins. Such a cut-off would provide more discriminatory power than systems that utilize the N gene in isolation (Table 3).
Table 3

Minimum intra-genotype and maximum inter-genotype sequence similarities among 24 lyssaviruses.

Coding regionsNumber of genotypesMinimum intragenotype similarityMaximum intergenotype similarity
N 8* 83,380,3
780,379,8
N,P,M,G,L 8* 81,676,4
776,376,4

when 0406SEN is considered as the representative isolate of a new GT8, with 8619 the representative isolate of GT2.

when 0406SEN is considered as the representative isolate of a new GT8, with 8619 the representative isolate of GT2. Finally, we suggest that the phylogenetic methods used here – based on a realistic model of nucleotide substitution, a robust phylogenetic method, and rigorous bootstrap resampling – represent a more powerful method of lyssavirus classification than those based on pairwise genetic diversity alone, particularly as they account for any lineage-specific rate variation that will compromise all distance-based approaches used to date. This method has also been proposed for HIV to try to standardize viral classification [38] confirming the interest of this method for viral classification. List of primers (0.06 MB PDF) Click here for additional data file. Transcription and termination signals for all lyssavirus genotypes. (0.05 MB PDF) Click here for additional data file. Comparison of the 5′ and the reverse complementary 3′ genomic termini of the antigenomic (+) sense RNA of lyssaviruses. Identical nucleotides are indicated by a vertical line. A, 23 lyssaviruses representing the 7 genotypes. B, consensus sequences. Only regions corresponding to 3′ and 5′ UTR sequences are shown. TTS: transcription termination signal. (0.27 MB DOC) Click here for additional data file.
  36 in total

1.  Datamonkey: rapid detection of selective pressure on individual sites of codon alignments.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Bioinformatics       Date:  2005-02-15       Impact factor: 6.937

2.  Phylogenetic relationships of Irkut and West Caucasian bat viruses within the Lyssavirus genus and suggested quantitative criteria based on the N gene sequence for lyssavirus genotype definition.

Authors:  Ivan V Kuzmin; Gareth J Hughes; Alexandr D Botvinkin; Lillian A Orciari; Charles E Rupprecht
Journal:  Virus Res       Date:  2005-04-08       Impact factor: 3.303

3.  Phylogeography, population dynamics, and molecular evolution of European bat lyssaviruses.

Authors:  Patricia L Davis; Edward C Holmes; Florence Larrous; Wim H M Van der Poel; Kirsten Tjørnehøj; Wladimir J Alonso; Hervé Bourhy
Journal:  J Virol       Date:  2005-08       Impact factor: 5.103

4.  LoPPS: a long PCR product sequencing method for rapid characterisation of long amplicons.

Authors:  Sébastien Emonet; Gilda Grard; Nadège Brisbarre; Gregory Moureau; Sarah Temmam; Rémi Charrel; Xavier de Lamballerie
Journal:  Biochem Biophys Res Commun       Date:  2006-04-19       Impact factor: 3.575

5.  MODELTEST: testing the model of DNA substitution.

Authors:  D Posada; K A Crandall
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

6.  Phylogenetic relationships among rhabdoviruses inferred using the L polymerase gene.

Authors:  H Bourhy; J A Cowley; F Larrous; E C Holmes; P J Walker
Journal:  J Gen Virol       Date:  2005-10       Impact factor: 3.891

7.  Virus promoters determine interference by defective RNAs: selective amplification of mini-RNA vectors and rescue from cDNA by a 3' copy-back ambisense rabies virus.

Authors:  S Finke; K K Conzelmann
Journal:  J Virol       Date:  1999-05       Impact factor: 5.103

Review 8.  Re-evaluating the burden of rabies in Africa and Asia.

Authors:  Darryn L Knobel; Sarah Cleaveland; Paul G Coleman; Eric M Fèvre; Martin I Meltzer; M Elizabeth G Miranda; Alexandra Shaw; Jakob Zinsstag; François-Xavier Meslin
Journal:  Bull World Health Organ       Date:  2005-06-24       Impact factor: 9.408

9.  Composition bias and genome polarity of RNA viruses.

Authors:  Prasert Auewarakul
Journal:  Virus Res       Date:  2004-11-18       Impact factor: 3.303

10.  Assessment of automated genotyping protocols as tools for surveillance of HIV-1 genetic diversity.

Authors:  Robert Gifford; Tulio de Oliveira; Andrew Rambaut; Richard E Myers; Catherine V Gale; David Dunn; Robert Shafer; Anne-Mieke Vandamme; Paul Kellam; Deenan Pillay
Journal:  AIDS       Date:  2006-07-13       Impact factor: 4.177

View more
  65 in total

1.  Genome-wide networks of amino acid covariances are common among viruses.

Authors:  Maureen J Donlin; Brandon Szeto; David W Gohara; Rajeev Aurora; John E Tavis
Journal:  J Virol       Date:  2012-01-11       Impact factor: 5.103

2.  The full-length genome analysis of a street rabies virus strain isolated in Yunnan province of China.

Authors:  Jian Zhang; Hai-lin Zhang; Xiao-yan Tao; Hao Li; Qing Tang; Xiu-yun Jiang; Guo-dong Liang
Journal:  Virol Sin       Date:  2012-06-09       Impact factor: 4.327

3.  Conservation of a unique mechanism of immune evasion across the Lyssavirus genus.

Authors:  L Wiltzer; F Larrous; S Oksayan; N Ito; G A Marsh; L F Wang; D Blondel; H Bourhy; D A Jans; G W Moseley
Journal:  J Virol       Date:  2012-06-27       Impact factor: 5.103

4.  Application of broad-spectrum resequencing microarray for genotyping rhabdoviruses.

Authors:  Laurent Dacheux; Nicolas Berthet; Gabriel Dissard; Edward C Holmes; Olivier Delmas; Florence Larrous; Ghislaine Guigon; Philip Dickinson; Ousmane Faye; Amadou A Sall; Iain G Old; Katherine Kong; Giulia C Kennedy; Jean-Claude Manuguerra; Stewart T Cole; Valérie Caro; Antoine Gessain; Hervé Bourhy
Journal:  J Virol       Date:  2010-07-07       Impact factor: 5.103

5.  Gene order rearrangement of the M gene in the rabies virus leads to slower replication.

Authors:  Xian-Feng Yang; Jiao-Jiao Peng; Hong-Ru Liang; You-Tian Yang; Yi-Fei Wang; Xiao-Wei Wu; Jiao-Jiao Pan; Yong-Wen Luo; Xiao-Feng Guo
Journal:  Virusdisease       Date:  2014-06-07

6.  Application of high-throughput sequencing to whole rabies viral genome characterisation and its use for phylogenetic re-evaluation of a raccoon strain incursion into the province of Ontario.

Authors:  Susan A Nadin-Davis; Adam Colville; Hannah Trewby; Roman Biek; Leslie Real
Journal:  Virus Res       Date:  2017-02-17       Impact factor: 3.303

7.  Lyssavirus detection and typing using pyrosequencing.

Authors:  Paola De Benedictis; Cristian De Battisti; Laurent Dacheux; Sabrina Marciano; Silvia Ormelli; Angela Salomoni; Silvia Tiozzo Caenazzo; Anthony Lepelletier; Hervé Bourhy; Ilaria Capua; Giovanni Cattoli
Journal:  J Clin Microbiol       Date:  2011-03-09       Impact factor: 5.948

8.  Structure of the nucleoprotein binding domain of Mokola virus phosphoprotein.

Authors:  René Assenberg; Olivier Delmas; Jingshan Ren; Pierre-Olivier Vidalain; Anil Verma; Florence Larrous; Stephen C Graham; Frédéric Tangy; Jonathan M Grimes; Hervé Bourhy
Journal:  J Virol       Date:  2009-11-11       Impact factor: 5.103

9.  Intergenotypic replacement of lyssavirus matrix proteins demonstrates the role of lyssavirus M proteins in intracellular virus accumulation.

Authors:  Stefan Finke; Harald Granzow; Jose Hurst; Reiko Pollin; Thomas C Mettenleiter
Journal:  J Virol       Date:  2009-12-02       Impact factor: 5.103

10.  Fatal human rabies due to Duvenhage virus from a bat in Kenya: failure of treatment with coma-induction, ketamine, and antiviral drugs.

Authors:  Pieter-Paul A M van Thiel; Rob M A de Bie; Filip Eftimov; Robert Tepaske; Hans L Zaaijer; Gerard J J van Doornum; Martin Schutten; Albert D M E Osterhaus; Charles B L M Majoie; Eleonora Aronica; Christine Fehlner-Gardiner; Alex I Wandeler; Piet A Kager
Journal:  PLoS Negl Trop Dis       Date:  2009-07-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.