| Literature DB >> 35500026 |
Kathryn Campbell1, Robert J Gifford2, Joshua Singer2, Verity Hill3, Aine O'Toole3, Andrew Rambaut3, Katie Hampson1, Kirstyn Brunker1.
Abstract
The availability of pathogen sequence data and use of genomic surveillance is rapidly increasing. Genomic tools and classification systems need updating to reflect this. Here, rabies virus is used as an example to showcase the potential value of updated genomic tools to enhance surveillance to better understand epidemiological dynamics and improve disease control. Previous studies have described the evolutionary history of rabies virus, however the resulting taxonomy lacks the definition necessary to identify incursions, lineage turnover and transmission routes at high resolution. Here we propose a lineage classification system based on the dynamic nomenclature used for SARS-CoV-2, defining a lineage by phylogenetic methods for tracking virus spread and comparing sequences across geographic areas. We demonstrate this system through application to the globally distributed Cosmopolitan clade of rabies virus, defining 96 total lineages within the clade, beyond the 22 previously reported. We further show how integration of this tool with a new rabies virus sequence data resource (RABV-GLUE) enables rapid application, for example, highlighting lineage dynamics relevant to control and elimination programmes, such as identifying importations and their sources, as well as areas of persistence and routes of virus movement, including transboundary incursions. This system and the tools developed should be useful for coordinating and targeting control programmes and monitoring progress as countries work towards eliminating dog-mediated rabies, as well as having potential for broader application to the surveillance of other viruses.Entities:
Mesh:
Year: 2022 PMID: 35500026 PMCID: PMC9162366 DOI: 10.1371/journal.ppat.1010023
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 7.464
Fig 1Illustration of rabies virus clades, subclades and lineages.
Maximum likelihood tree of all publicly available whole genome rabies virus sequences (n = 650) coloured by previously identified subclades [13]. The NEE subclade has been expanded to show major and minor lineages from the updated MADDOG classification system.
Fig 2Schematic of the online capabilities of RABV-GLUE.
Fig 3Summary of lineage designation steps.
Fig 4Lineage designations of whole genome and N gene sequences from the RABV Cosmopolitan clade.
A: Maximum likelihood tree showing the lineage positions, rooted using an outgroup of 10 Asian SEA1b sequences, with scale indicating substitutions per site per year, and hierarchical relationships with dashed lines indicating 5 lineage iterations (n = 2608 total sequences for tree and sunburst plot, combing both WGS and N gene sequences). B: Progression of classification definition from 22 previously defined subclades (left) to 52 lineages using only whole genome sequences (middle) to 147 lineages adding N gene sequences, with lineages seen at whole genome level in blue and additional lineages only seen at N gene level in red (right).
Details of numbers of lineages and sequences available for each Cosmopolitan subclade at whole genome and N gene level.
| Whole genome | N gene | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| 2 | 24 | 1984 | 10 | 377 | 1982 |
|
| 10 | 251 | 1981 | 10 | 471 | 1981 |
|
| 0 | 3 | 1986 | 0 | 6 | 1986 |
|
| 0 | 8 | 1950 | 1 | 13 | 1950 |
|
| 0 | 4 | 1982 | 6 | 58 | 1981 |
|
| 1 | 10 | 1991 | 4 | 136 | 1990 |
|
| 0 | 2 | 2009 | 0 | 24 | 1986 |
|
| 0 | 5 | 1986 | 20 | 172 | 1985 |
|
| 0 | 4 | 1986 | 10 | 108 | 1986 |
|
| 0 | 4 | 1974 | 1 | 19 | 1974 |
|
| 1 | 25 | 1974 | 9 | 224 | 1974 |
|
| 1 | 15 | 1993 | 7 | 106 | 1993 |
|
| 0 | 7 | 1991 | 6 | 45 | 1991 |
|
| 1 | 15 | 1990 | 1 | 19 | 1985 |
|
| 1 | 15 | 1986 | 1 | 47 | 1977 |
|
| 4 | 101 | 1976 | 4 | 195 | 1976 |
|
| 0 | 2 | 1993 | 0 | 7 | 1993 |
|
| 4 | 38 | 1989 | 6 | 66 | 1989 |
|
| 6 | 71 | 1986 | 6 | 78 | 1985 |
|
| 1 | 14 | 1986 | 1 | 148 | 1974 |
|
| 0 | 2 | 1978 | 0 | 6 | 1978 |
|
| 0 | 1 | 1972 | 0 | 12 | 1972 |
Fig 5Distribution of RABV lineages from the Cosmopolitan clade determined from WGS and N gene sequences.
Circle size indicates the number of sequences (sum of both WGS and N gene sequences), with circles plotted to the centroid of each country. Sunburst plot from Fig 4A included to indicate colour scheme. Base layer of map from Natural Earth (https://www.naturalearthdata.com/).
Fig 7Lineage designations of 224 rabies virus genomes from Tanzania.
Maximum likelihood tree with scale in substitutions per site per year, hierarchical relationships, and distribution of lineages. Sequences obtained from RABV-GLUE. Base layer of map from GADM (https://gadm.org/download_country.html).
Case Studies using the MADDOG lineage classification system.
|
|
|
| The higher resolution achieved through this classification system can be used to ‘zoom in’ on unusual cases and look at the historical and geographical context of a lineage. Here, we illustrate the example of a human rabies case (GenBank accession: KC737850) reported from the USA in 2011 [ | Rabies was first documented in Tanzania in the 1930’s [ |