Literature DB >> 30992348

Can Sequence Phylogenies Safely Infer the Origin of the Global Virome?

Edward C Holmes1,2,3, Sebastián Duchêne4.   

Abstract

Entities:  

Keywords:  RNA polymerase; evolution; phylogeny; virus

Mesh:

Substances:

Year:  2019        PMID: 30992348      PMCID: PMC6469966          DOI: 10.1128/mBio.00289-19

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.867


× No keyword cloud information.

LETTER

Resolving the origins of RNA viruses is one of the most important questions in viral evolution, but research on this question has been hampered by a lack of primary sequence similarity among the most divergent groups of RNA viruses (1). In a recent paper, Wolf et al. (2) seemingly overcame these problems and presented a comprehensive picture of the origins and deep evolutionary relationships among divergent RNA viruses. While many of the ideas presented in this study (2) were illuminating, we contend that they cannot be supported by the phylogenetic analysis performed, which is still based on sequences that often share no recognizable similarity and cannot be safely aligned. Central to the study of Wolf et al. (2) is the quality of their sequence alignment of the most conserved RNA-dependent RNA polymerase (RdRp) protein. To obtain this, Wolf et al. (2) followed an iterative procedure, resulting in a main alignment comprising 4,627 taxa with a length of 12,220 amino acids. However, even a cursory inspection of this alignment indicates that it is highly unlikely to be accurate among the most distantly related RNA viruses. For example, (i) every site in the alignment contains at least one gap, including the canonical GDD motif, and there are no contiguous stretches of clearly aligned sequence across all viruses. (ii) Only 3.6% (441 residues) of the alignment remains after sites that harbor a majority (>50%) of gaps are removed. (iii) The pairwise identity between the aligned sequences is often less than the 5% expected by chance alone (mean value of 7.7% across the alignment) and was as low as 1%, with a mean pairwise distance of 0.93 (from a maximum of 1) substitutions per site. (iv) A total of 812 sites contain all 20 amino acids, and (v) 95.9% of sequences failed a χ2 test of compositional heterogeneity in IQ-TREE (3). (vi) Only six sites can be safely aligned according to Gblocks (4), whereas TrimAl (5) could not align any sites, with both programs employing their least stringent settings. Points iii to v imply that even sophisticated substitution models cannot reliably estimate evolutionary divergence in these data (6), and point vi is particularly important for phylogenetic inference because the inclusion of poorly aligned regions results in biased tree estimates, with high bootstrap support for the incorrect topology (4). While clusters of clearly related sequences are present in these data, the deepest parts of the phylogeny reflect sequences that are so divergent in sequence that any attempt to depict their relationship, including through bootstrap analysis, is meaningless. Despite the presence of a number of putative sequence motifs that we agree are indicative of common ancestry, the sequence alignment presented by Wolf et al. (2) is not sufficiently robust for a comprehensive phylogenetic analysis or to draw conclusions about the earliest moments of RNA virus evolution. We urge that caution be exercised in all studies that utilize sequences as divergent as those analyzed here, as phylogenies are meaningful only when they are estimated in the case of clear primary sequence similarity. Unfortunately, this is unlikely ever to be realistic in the case of RNA viruses.
  6 in total

1.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments.

Authors:  Gerard Talavera; Jose Castresana
Journal:  Syst Biol       Date:  2007-08       Impact factor: 15.683

2.  A reevaluation of the higher taxonomy of viruses based on RNA polymerases.

Authors:  P M Zanotto; M J Gibbs; E A Gould; E C Holmes
Journal:  J Virol       Date:  1996-09       Impact factor: 5.103

3.  Substitution Model Adequacy and Assessing the Reliability of Estimates of Virus Evolutionary Rates and Time Scales.

Authors:  Sebastián Duchêne; Francesca Di Giallonardo; Edward C Holmes
Journal:  Mol Biol Evol       Date:  2015-09-28       Impact factor: 16.240

4.  IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors:  Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

5.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  Origins and Evolution of the Global RNA Virome.

Authors:  Yuri I Wolf; Darius Kazlauskas; Jaime Iranzo; Adriana Lucía-Sanz; Jens H Kuhn; Mart Krupovic; Valerian V Dolja; Eugene V Koonin
Journal:  mBio       Date:  2018-11-27       Impact factor: 7.867

  6 in total
  15 in total

1.  Bacterial Origin and Reductive Evolution of the CPR Group.

Authors:  Rijja Hussain Bokhari; Nooreen Amirjan; Hyeonsoo Jeong; Kyung Mo Kim; Gustavo Caetano-Anollés; Arshan Nasir
Journal:  Genome Biol Evol       Date:  2020-03-01       Impact factor: 3.416

Review 2.  Global Organization and Proposed Megataxonomy of the Virus World.

Authors:  Eugene V Koonin; Valerian V Dolja; Mart Krupovic; Arvind Varsani; Yuri I Wolf; Natalya Yutin; F Murilo Zerbini; Jens H Kuhn
Journal:  Microbiol Mol Biol Rev       Date:  2020-03-04       Impact factor: 11.056

Review 3.  Diversity and evolution of the animal virome.

Authors:  Erin Harvey; Edward C Holmes
Journal:  Nat Rev Microbiol       Date:  2022-01-04       Impact factor: 60.633

4.  Resolving deep evolutionary relationships within the RNA virus phylum Lenarviricota.

Authors:  Sabrina Sadiq; Yan-Mei Chen; Yong-Zhen Zhang; Edward C Holmes
Journal:  Virus Evol       Date:  2022-06-14

5.  Structure Unveils Relationships between RNA Virus Polymerases.

Authors:  Heli A M Mönttinen; Janne J Ravantti; Minna M Poranen
Journal:  Viruses       Date:  2021-02-17       Impact factor: 5.048

6.  Hadaka Virus 1: a Capsidless Eleven-Segmented Positive-Sense Single-Stranded RNA Virus from a Phytopathogenic Fungus, Fusarium oxysporum.

Authors:  Yukiyo Sato; Wajeeha Shamsi; Atif Jamal; Muhammad Faraz Bhatti; Hideki Kondo; Nobuhiro Suzuki
Journal:  mBio       Date:  2020-05-26       Impact factor: 7.867

7.  The virome from a collection of endomycorrhizal fungi reveals new viral taxa with unprecedented genome organization.

Authors:  Suvi Sutela; Marco Forgia; Eeva J Vainio; Marco Chiapello; Stefania Daghino; Marta Vallino; Elena Martino; Mariangela Girlanda; Silvia Perotto; Massimo Turina
Journal:  Virus Evol       Date:  2020-10-08

8.  Understanding evolution of SARS-CoV-2: A perspective from analysis of genetic diversity of RdRp gene.

Authors:  Sunitha M Kasibhatla; Meenal Kinikar; Sanket Limaye; Mohan M Kale; Urmila Kulkarni-Kale
Journal:  J Med Virol       Date:  2020-06-02       Impact factor: 20.693

Review 9.  The origin and evolution of viruses inferred from fold family structure.

Authors:  Fizza Mughal; Arshan Nasir; Gustavo Caetano-Anollés
Journal:  Arch Virol       Date:  2020-08-03       Impact factor: 2.574

Review 10.  Investigating the Concept and Origin of Viruses.

Authors:  Arshan Nasir; Ethan Romero-Severson; Jean-Michel Claverie
Journal:  Trends Microbiol       Date:  2020-12       Impact factor: 17.079

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.