Literature DB >> 28492506

Implementation of Objective PASC-Derived Taxon Demarcation Criteria for Official Classification of Filoviruses.

Yīmíng Bào1, Gaya K Amarasinghe2, Christopher F Basler3, Sina Bavari4, Alexander Bukreyev5, Kartik Chandran6, Olga Dolnik7, John M Dye8, Hideki Ebihara9, Pierre Formenty10, Roger Hewson11, Gary P Kobinger12, Eric M Leroy13, Elke Mühlberger14, Sergey V Netesov15, Jean L Patterson16, Janusz T Paweska17, Sophie J Smither18, Ayato Takada19, Jonathan S Towner20, Viktor E Volchkov21, Victoria Wahl-Jensen22, Jens H Kuhn23.   

Abstract

The mononegaviral family Filoviridae has eight members assigned to three genera and seven species. Until now, genus and species demarcation were based on arbitrarily chosen filovirus genome sequence divergence values (≈50% for genera, ≈30% for species) and arbitrarily chosen phenotypic virus or virion characteristics. Here we report filovirus genome sequence-based taxon demarcation criteria using the publicly accessible PAirwise Sequencing Comparison (PASC) tool of the US National Center for Biotechnology Information (Bethesda, MD, USA). Comparison of all available filovirus genomes in GenBank using PASC revealed optimal genus demarcation at the 55-58% sequence diversity threshold range for genera and at the 23-36% sequence diversity threshold range for species. Because these thresholds do not change the current official filovirus classification, these values are now implemented as filovirus taxon demarcation criteria that may solely be used for filovirus classification in case additional data are absent. A near-complete, coding-complete, or complete filovirus genome sequence will now be required to allow official classification of any novel "filovirus." Classification of filoviruses into existing taxa or determining the need for novel taxa is now straightforward and could even become automated using a presented algorithm/flowchart rooted in RefSeq (type) sequences.

Entities:  

Keywords:  Ebola; Filoviridae; ICTV; Mononegavirales; cuevavirus; ebolavirus; filovirus; marburgvirus; virus classification; virus taxonomy

Mesh:

Year:  2017        PMID: 28492506      PMCID: PMC5454419          DOI: 10.3390/v9050106

Source DB:  PubMed          Journal:  Viruses        ISSN: 1999-4915            Impact factor:   5.048


1. Introduction

The family Filoviridae, one of eight families in the order Mononegavirales [1], has eight members assigned to seven species included in three genera (Table 1) [2,3,4].
Table 1

Official filovirus taxonomy endorsed by the 2015–2017 International Committee on Taxonomy of Viruses (ICTV) Filoviridae Study Group and accepted by the ICTV.

Current Taxonomy and Nomenclature
Order Mononegavirales   Family Filoviridae    Genus Marburgvirus     Species Marburg Marburgvirus       Virus 1: Marburg virus (MARV)       Virus 2: Ravn virus (RAVV)    Genus Ebolavirus      Species Bundibugyo ebolavirus       Virus: Bundibugyo virus (BDBV)      Species Reston ebolavirus      Virus: Reston virus (RESTV)      Species Sudan ebolavirus       Virus: Sudan virus (SUDV)      Species Taï Forest ebolavirus       Virus: Taï Forest virus (TAFV)      Species Zaire ebolavirus       Virus: Ebola virus (EBOV)    Genus Cuevavirus      Species Lloviu cuevavirus      Virus: Lloviu virus (LLOV)
Traditionally, the eight currently recognized filoviruses have been classified using phenotypic characteristics of virions and/or partial filovirus genome sequences [5,6,7]. Sequence-based filovirus taxon demarcation criteria (nucleotide and amino acid sequence identity values and/or phylogenies) were officially introduced as additional demarcation criteria in 2000 [8] and further refined thereafter [9]. Yet, true filovirus genome sequence-based taxon demarcation was only introduced in 2011. At that time, the International Committee on Taxonomy of Viruses (ICTV) Filoviridae Study Group decided arbitrarily that marburgvirus genomes differ from ebolavirus genomes by ≥50% and that ebolavirus species are differentiated on the basis of glycoprotein (GP) gene sequence differences (≥30%) or genome sequence differences (≥30%) [3]. These values were used to develop a decision algorithm/flowchart for filovirus taxon assignment that could guide filovirus classification [10]. In 2012, two pairwise sequence comparison methods, PAirwise Sequence Comparison (PASC) and DivErsity pArtitioning by hieRarchical Clustering (DEmARC), confirmed that the then official filovirus taxonomy (identical to the current one shown in Table 1) is justified, but that the 50% and 30% values ought to be adjusted objectively based on the PASC and/or DEmARC results [11,12]. Both analyses were based on the available ≈50 near-complete, coding-complete or complete filovirus genomes (see [13,14] for nomenclature) in the US National Center for Biotechnology Information (NCBI, Bethesda, MD, USA) GenBank database. Yet, at the time it was unclear whether the ICTV would accept classification of viruses based on sequence analysis alone. In 2017, the ICTV members reached a consensus together with other experts that “the development of a robust framework for sequence-based virus taxonomy is indispensable for the comprehensive characterization of the global virome” [15]. Under proper oversight by, for instance, ICTV Study Groups, virus classification criteria can now be based on measurable objective criteria inferable only from viral genome sequence data. Thus, using automatic classification algorithms is possible. The number of GenBank-deposited near-complete, coding-complete, and complete filovirus genome sequences has increased substantially in recent years (from the ≈50 in 2012 to ≈1400 at the time of writing in 2017). We analyzed these sequences using PASC, a method that can be easily used by any scientist using an open-access software platform [16,17,18]. We created inferred objective filovirus taxon demarcation criteria and updated the algorithm/flowchart for filovirus taxon assignment using the recently decided type filovirus sequences (NCBI RefSeq database sequences) [10] as starting points.

2. Materials and Methods

All 1404 near-complete, coding-complete, or complete filovirus genomes available from GenBank (NCBI, Bethesda, MD, USA) on 04/16/2017 were downloaded from the NCBI viral genomes resource [19]. Redundant filovirus genome sequences (here defined as sequences with PASC identities >99.5%) were removed, leaving 112 filovirus genome sequences for further analysis [20]. PASC analysis was performed with those 112 genome sequences as previously described [18] using the open-access PASC tool (NCBI). The new taxon demarcation algorithm/flowchart was developed based on the previously developed chart presented in [10] using type filoviruses [4] and type filovirus genome sequences (RefSeq, NCBI) [10].

3. Results

PASC analysis of 112 filovirus near-complete, coding-complete, or complete genome sequences revealed clear clustering into three higher ranks (genera), with two of those genera including single species and one genus including five species (visualized in Figure 1).
Figure 1

Screenshot of the US National Center for Biotechnology Information (NCBI) PAirwise Sequence Comparison (PASC) tool result after comparing 112 distinct near-complete, coding-complete or complete filovirus genome sequences. Brown bars represent genome pairs assigned to (three) different genera; yellow bars represent genome pairs assigned to (seven) separate species; and green bars represent genome pairs assigned to the same species. BLAST: Basic Local Alignment Search Tool.

Unblinding of input sequences revealed the three genera and seven species to correspond to those already established and depicted in Table 1, raising confidence in PASC as a method to adequately recreate current knowledge on filovirus diversity. However, the analysis indicated an ideal genus demarcation threshold range of 55–58% sequence divergence rather than the currently used 50% threshold and an ideal species demarcation threshold range of 23–36% rather than the currently used 30% threshold.

4. Discussion

Using the new filovirus taxon demarcation criteria established here using PASC, the earliest discovered filovirus (Marburg virus; MARV) as the type virus for the family Filoviridae [4], the RefSeq MARV genome sequence as the MARV type sequence, and the remaining filovirus RefSeq genome sequences as additional anchor points, we created a filovirus classification decision matrix in form of an algorithm/flowchart (Figure 2). Using the NCBI PASC tool and Figure 2, any user can now quickly assess whether a novel filovirus sequence of interest represents a filovirus already classified in one of the established filovirus taxa or whether establishment of a new taxon/new taxa may be necessary. PASC requires at least near-complete or coding-complete genome input sequences. Therefore, the ICTV Filoviridae Study Group decided that moving forward, at least a coding-complete filovirus genome sequence will be minimally required for filovirus classification into novel filovirus taxa. Partial filovirus-like nucleic acids, for instance, those recently discovered in Chinese bats [21,22], may point towards the existence of novel filoviruses but will not suffice for official recognition of novel filoviruses or establishment of novel filovirus taxa. The Study Group recommends that such sequences be referred to as “filovirus-like sequences” and not as “filoviruses.” Likewise, a virus for which a partial filovirus-like sequence information exists ought to be referred to as a “putative filovirus” until at least coding-complete genome sequence information is available.
Figure 2

Algorithm/flow chart for filovirus classification based on genomics sequence information (modified from [10]) and PASC-derived sequence demarcation criteria. A putative filovirus genome of interest is compared to the type filovirus RefSeq genome sequence (i.e., that of Marburg virus/H.sapiens-tc/KEN/1980/Mt. Elgon-Musoke [10]) and then sequentially moved through the process until its proper placement in a species is revealed. If the sequence comparison reveals the need for the creation of a novel genus and/or species, official taxonomic proposals ought to be submitted to the ICTV.

Importantly, PASC analysis followed by use of the algorithm/flowchart (Figure 2) alone does not constitute official classification, and the Study Group sees PASC results as highly informative, but not binding. Thus, if the PASC algorithm/flowchart indicates the need for a novel filovirus genus and/or species to a user analyzing a particular sequence, the user should follow the official pathway for ICTV classification starting with submission of an official taxonomic proposal (TaxoProp [23]). The user is recommended to engage with the ICTV Filoviridae Study Group as early as possible during that process. The Study Group and ICTV will evaluate all available data on a particular putative filovirus (e.g., host information, disease phenotype, biophysical properties of virions) and make their decisions accordingly. Phylogenetic results obtained with methods more sophisticated than PASC are always desired and may ultimately overrule PASC results.
  15 in total

1.  Proposal for a revised taxonomy of the family Filoviridae: classification, names of taxa and viruses, and virus abbreviations.

Authors:  Jens H Kuhn; Stephan Becker; Hideki Ebihara; Thomas W Geisbert; Karl M Johnson; Yoshihiro Kawaoka; W Ian Lipkin; Ana I Negredo; Sergey V Netesov; Stuart T Nichol; Gustavo Palacios; Clarence J Peters; Antonio Tenorio; Viktor E Volchkov; Peter B Jahrling
Journal:  Arch Virol       Date:  2010-10-30       Impact factor: 2.574

2.  Discussions and decisions of the 2012–2014 International Committee on Taxonomy of Viruses (ICTV) Filoviridae Study Group, January 2012–June 2013.

Authors:  Alexander A Bukreyev; Kartik Chandran; Olga Dolnik; John M Dye; Hideki Ebihara; Eric M Leroy; Elke Mühlberger; Sergey V Netesov; Jean L Patterson; Janusz T Paweska; Erica Ollmann Saphire; Sophie J Smither; Ayato Takada; Jonathan S Towner; Viktor E Volchkov; Travis K Warren; Jens H Kuhn
Journal:  Arch Virol       Date:  2014-04       Impact factor: 2.574

Review 3.  Standard finishing categories for high-throughput sequencing of viral genomes.

Authors:  J T Ladner; J H Kuhn; G Palacios
Journal:  Rev Sci Tech       Date:  2016-04       Impact factor: 1.181

4.  PAirwise Sequence Comparison (PASC) and its application in the classification of filoviruses.

Authors:  Yiming Bao; Vyacheslav Chetvernin; Tatiana Tatusova
Journal:  Viruses       Date:  2012-08-20       Impact factor: 5.048

5.  NCBI viral genomes resource.

Authors:  J Rodney Brister; Danso Ako-Adjei; Yiming Bao; Olga Blinkova
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 16.971

6.  Filovirus RNA in Fruit Bats, China.

Authors:  Biao He; Yun Feng; Hailin Zhang; Lin Xu; Weihong Yang; Yuzhen Zhang; Xingyu Li; Changchun Tu
Journal:  Emerg Infect Dis       Date:  2015-09       Impact factor: 6.883

7.  Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification.

Authors:  Yiming Bao; Vyacheslav Chetvernin; Tatiana Tatusova
Journal:  Arch Virol       Date:  2014-08-14       Impact factor: 2.574

8.  Filovirus RefSeq entries: evaluation and selection of filovirus type variants, type sequences, and names.

Authors:  Jens H Kuhn; Kristian G Andersen; Yīmíng Bào; Sina Bavari; Stephan Becker; Richard S Bennett; Nicholas H Bergman; Olga Blinkova; Steven Bradfute; J Rodney Brister; Alexander Bukreyev; Kartik Chandran; Alexander A Chepurnov; Robert A Davey; Ralf G Dietzgen; Norman A Doggett; Olga Dolnik; John M Dye; Sven Enterlein; Paul W Fenimore; Pierre Formenty; Alexander N Freiberg; Robert F Garry; Nicole L Garza; Stephen K Gire; Jean-Paul Gonzalez; Anthony Griffiths; Christian T Happi; Lisa E Hensley; Andrew S Herbert; Michael C Hevey; Thomas Hoenen; Anna N Honko; Georgy M Ignatyev; Peter B Jahrling; Joshua C Johnson; Karl M Johnson; Jason Kindrachuk; Hans-Dieter Klenk; Gary Kobinger; Tadeusz J Kochel; Matthew G Lackemeyer; Daniel F Lackner; Eric M Leroy; Mark S Lever; Elke Mühlberger; Sergey V Netesov; Gene G Olinger; Sunday A Omilabu; Gustavo Palacios; Rekha G Panchal; Daniel J Park; Jean L Patterson; Janusz T Paweska; Clarence J Peters; James Pettitt; Louise Pitt; Sheli R Radoshitzky; Elena I Ryabchikova; Erica Ollmann Saphire; Pardis C Sabeti; Rachel Sealfon; Aleksandr M Shestopalov; Sophie J Smither; Nancy J Sullivan; Robert Swanepoel; Ayato Takada; Jonathan S Towner; Guido van der Groen; Viktor E Volchkov; Valentina A Volchkova; Victoria Wahl-Jensen; Travis K Warren; Kelly L Warfield; Manfred Weidmann; Stuart T Nichol
Journal:  Viruses       Date:  2014-09-26       Impact factor: 5.048

9.  Genetics-based classification of filoviruses calls for expanded sampling of genomic sequences.

Authors:  Chris Lauber; Alexander E Gorbalenya
Journal:  Viruses       Date:  2012-08-31       Impact factor: 5.048

10.  Standards for sequencing viral genomes in the era of high-throughput sequencing.

Authors:  Jason T Ladner; Brett Beitzel; Patrick S G Chain; Matthew G Davenport; Eric F Donaldson; Matthew Frieman; Jeffrey R Kugelman; Jens H Kuhn; Jules O'Rear; Pardis C Sabeti; David E Wentworth; Michael R Wiley; Guo-Yun Yu; Shanmuga Sozhamannan; Christopher Bradburne; Gustavo Palacios
Journal:  mBio       Date:  2014-06-17       Impact factor: 7.867

View more
  6 in total

Review 1.  Jingchuvirales: a New Taxonomical Framework for a Rapidly Expanding Order of Unusual Monjiviricete Viruses Broadly Distributed among Arthropod Subphyla.

Authors:  Nicholas Di Paola; Jens H Kuhn; Nolwenn M Dheilly; Sandra Junglen; Sofia Paraskevopoulou; Thomas S Postler; Mang Shi
Journal:  Appl Environ Microbiol       Date:  2022-02-02       Impact factor: 5.005

2.  Virus taxonomy: the database of the International Committee on Taxonomy of Viruses (ICTV).

Authors:  Elliot J Lefkowitz; Donald M Dempsey; Robert Curtis Hendrickson; Richard J Orton; Stuart G Siddell; Donald B Smith
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

3.  A Diacylglycerol Kinase Inhibitor, R-59-022, Blocks Filovirus Internalization in Host Cells.

Authors:  Corina M Stewart; Stephanie S Dorion; Marie A F Ottenbrite; Nicholas D LeBlond; Tyler K T Smith; Shirley Qiu; Morgan D Fullerton; Darwyn Kobasa; Marceline Côté
Journal:  Viruses       Date:  2019-03-01       Impact factor: 5.048

4.  Minimum Information about an Uncultivated Virus Genome (MIUViG).

Authors:  Simon Roux; Evelien M Adriaenssens; Bas E Dutilh; Eugene V Koonin; Andrew M Kropinski; Mart Krupovic; Jens H Kuhn; Rob Lavigne; J Rodney Brister; Arvind Varsani; Clara Amid; Ramy K Aziz; Seth R Bordenstein; Peer Bork; Mya Breitbart; Guy R Cochrane; Rebecca A Daly; Christelle Desnues; Melissa B Duhaime; Joanne B Emerson; François Enault; Jed A Fuhrman; Pascal Hingamp; Philip Hugenholtz; Bonnie L Hurwitz; Natalia N Ivanova; Jessica M Labonté; Kyung-Bum Lee; Rex R Malmstrom; Manuel Martinez-Garcia; Ilene Karsch Mizrachi; Hiroyuki Ogata; David Páez-Espino; Marie-Agnès Petit; Catherine Putonti; Thomas Rattei; Alejandro Reyes; Francisco Rodriguez-Valera; Karyna Rosario; Lynn Schriml; Frederik Schulz; Grieg F Steward; Matthew B Sullivan; Shinichi Sunagawa; Curtis A Suttle; Ben Temperton; Susannah G Tringe; Rebecca Vega Thurber; Nicole S Webster; Katrine L Whiteson; Steven W Wilhelm; K Eric Wommack; Tanja Woyke; Kelly C Wrighton; Pelin Yilmaz; Takashi Yoshida; Mark J Young; Natalya Yutin; Lisa Zeigler Allen; Nikos C Kyrpides; Emiley A Eloe-Fadrosh
Journal:  Nat Biotechnol       Date:  2018-12-17       Impact factor: 54.908

5.  Novel Filoviruses, Hantavirus, and Rhabdovirus in Freshwater Fish, Switzerland, 2017.

Authors:  Melanie M Hierweger; Michel C Koch; Melanie Rupp; Piet Maes; Nicholas Di Paola; Rémy Bruggmann; Jens H Kuhn; Heike Schmidt-Posthaus; Torsten Seuberlich
Journal:  Emerg Infect Dis       Date:  2021-12       Impact factor: 6.883

6.  Reassessing species demarcation criteria in viroid taxonomy by pairwise identity matrices.

Authors:  Michela Chiumenti; Beatriz Navarro; Thierry Candresse; Ricardo Flores; Francesco Di Serio
Journal:  Virus Evol       Date:  2021-01-25
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.