Literature DB >> 24860690

Phylogenetic Analysis of Guinea 2014 EBOV Ebolavirus Outbreak.

Gytis Dudas1, Andrew Rambaut2.   

Abstract

Members of the genus Ebolavirus have caused outbreaks of haemorrhagic fever in humans in Africa. The most recent outbreak in Guinea, which began in February of 2014, is still ongoing. Recently published analyses of sequences from this outbreak suggest that the outbreak in Guinea is caused by a divergent lineage of Zaire ebolavirus. We report evidence that points to the same Zaire ebolavirus lineage that has previously caused outbreaks in the Democratic Republic of Congo, the Republic of Congo and Gabon as the culprit behind the outbreak in Guinea.

Entities:  

Keywords:  Guinea; disease outbreak; ebolavirus; zoonoses

Year:  2014        PMID: 24860690      PMCID: PMC4024086          DOI: 10.1371/currents.outbreaks.84eefe5ce43ec9dc0bf0670f7b8b417d

Source DB:  PubMed          Journal:  PLoS Curr        ISSN: 2157-3999


Introduction

A recent article1 suggests that the currently ongoing outbreak in Guinea is caused by a divergent variant of the Zaire ebola (EBOV) lineage. The EBOV strain has previously caused ebola outbreaks in the Democratic Republic of Congo (DRC), the Republic of Congo (RC) and Gabon. The authors publish three complete genome sequences from the Guinea outbreak and perform a phylogenetic analysis using 24 sequences of the Zaire and other representative lineages. One finding is that the 2014 sequences fall as a divergent lineage outside the Zaire lineage suggesting that this may be a pre-existing endemic virus in West Africa rather than the result of spread of the EBOV lineage from the Central African countries that have had previous human outbreaks. Previously, a dynamic re-interpretation of EBOV emergence in Central Africa has been suggested, citing correlations between time, geographic distance and genetic distance of Ebola haemorrhagic fever outbreaks2 and the recent ancestry of related EBOV lineages in fruit bats3.

Materials and Methods

All complete genome sequences from the genus Ebolavirus (which includes Bundibugyo BDBV, Reston RESTV, Sudan SUDV, Tai Forest TAFV and Zaire ebolavirus EBOV species) were collated from genbank including the sequences from the Guinea outbreak. Genbank accessions and sources for the sequences can be found at http://epidemic.bio.ed.ac.uk/ebolavirus_sequences. The Ebolavirus genome consists of a single strand of negative sense RNA and contains 7 protein coding genes (in order 3'-NP-VP35-VP40-GP-VP30-VP24-L, separated by various intergenic regions)4. We collated the protein coding regions of each gene (alignment length 14647 nucleotides) and, in a separate alignment, the non-coding intergenic regions. Phylogenetic trees were inferred in PhyML5 or MrBayes6 using the GTR7+Γ substitution model. We were able to replicate the analysis presented in Baize et al.1 only when omitting the accommodation of rate heterogeneity modelled as a discretized Γ distribution. We suspect the difficulty in replicating the analysis is due to a combination of using different sequences, a different alignment and the inherently unreliable rooting of the EBOV clade using highly divergent sequences from other ebolavirus clades. We have uploaded the alignments we used (whole genome, coding and non-coding) to a GitHub repository at https://github.com/evogytis/ebolaGuinea2014. We also compiled a dataset containing only the glycoprotein (GP) sequences, for which more sequences are available. Many of the extra sequences come from wild ape carcasses8 in Gabon and RC. These sequences were analyzed in BEAST9 to establish a time frame for the split of the Guinea viruses from other EBOV lineages. The data were analyzed using the GTR+Γ nucleotide substitution model, an uncorrelated relaxed molecular clock (following a lognormal distribution)10 and under different demographic models (constant population size, exponential growth or the non-parametric Bayesian skyride11). GP sequence results were recovered from a relaxed molecular clock analysis, under an exponential growth tree prior (as it can accommodate a constant population size scenario when the growth rate is 0) but the analysis was found to be quite robust to different demographic models.

Analysis

An alignment of complete genomes and a maximum likelihood tree (PhyML) appears to confirm the phylogenetic position shown in the recent paper1 (Figure 1), albeit the position of the Guinea outbreak sequences is not very well supported. ML tree of complete genomes without accommodating for rate heterogeneity shows the Guinea outbreak sequences (highlighted) as belonging to a divergent EBOV lineage. Tips belonging to the EBOV lineage are not collapsed. Numbers above key nodes in the EBOV clade are bootstrap values (100 replicates). When the intergenic sequences are removed, however, the Guinea outbreak sequences fall within the diversity of Zaire ebolavirus (Figure 2). When only the coding sequences are used, the Guinea outbreak sequences appear to be derived from within the diversity of Gabon/DRC EBOV lineages. Expanding the EBOV region of the tree (same tree as Figure 2, but with the divergent ebolavirus species cropped out) we see that the Guinea outbreak sequences are nested within the EBOV clade. Intergenic regions show a similar picture with the Guinea sequences nested within EBOV. EBOV lineages are rather poorly sampled and sequences from most outbreaks, because of the nature of the outbreaks, have nearly identical sequences. The branch leading to the Guinea outbreak is long, not because it is a divergent lineage but because it is the most recently sampled so has had the most time to evolve. Combined with a very divergent outgroup this leads to a situation where the root position of the EBOV clade is unreliably estimated. Figures 3 and 4 show MrBayes trees from protein coding and intergenic regions of the EBOV genome, respectively, with more divergent ebolavirus strains cropped out. Note that trees in Figures 3 and 4 are essentially identical but differ by where the other ebolavirus species root the EBOV clade (on the 2007 Gabon outbreak for the coding regions in Figure 3 and on the 1995 Kikwit outbreak for the intergenic regions in Figure 4). This shows that the rooting of this clade using the highly divergent other ebolavirus species is very problematic. However, EBOV is estimated to evolve at about 7×10-4 substitutions per site per year12 which means that the virus will accumulate significant amounts of substitutions over the nearly 40 years since the first recorded outbreak in 1976. We can use this to root the EBOV tree and look at where the Guinea outbreak lies. Path-O-Gen (available at http://tree.bio.ed.ac.uk/software/pathogen/) was used to find the root that gave the best association between genetic divergence and time. The relationship between genetic divergence and time after rooting the tree using least squares regression is shown in Figure 5. Sequences from the 1976 Zaire outbreak are very close to the root. The Bayesian posterior support for all the groupings between the outbreaks are 1.0 including for the grouping of Guinea 2014 with DRC 2007 and Gabon 2002. This demonstrates that the uncertainty about the position of the Guinea 2014 lineage in the complete ebolavirus trees was down to the rooting of the EBOV clade (i.e., where the divergent outgroups connect to the EBOV tree). The relationships of the EBOV outbreaks is completely consistent for the simple whole genome alignment, the coding regions only and the intergenic regions only but the position of the root changes. In the figure A) denotes the position of the root for the full genome maximum likelihood tree, B) for the Bayesian coding-sequence only tree, C) the Bayesian intergenic regions only tree and D) the combined coding-sequence and intergenic region accommodating different rates of evolution. Figure 6 shows the phylogeny of the coding sequences recovered by MrBayes (a maximum likelihood tree using PhyML gave an almost identical tree) rooted by least squares regression. The root of this tree is very close to the earliest sequences from the 1976 Zaire outbreak.

Estimating the date of introduction of EBOV into Guinea

The analysis of GP sequences in BEAST revealed rooting consistent with that found in Figure 6 as well as a nucleotide substitution rate (mean of lognormal distribution from which the rates were drawn is 1.07×10-3 substitutions per site per year, 95% HPD interval 5.99×10-4 - 1.75×10-3) on a scale expected, given previously published rates12 and the fact that GP codes for a surface glycoprotein. In Figure 7 the estimate of the split between the lineage now causing an outbreak in Guinea and the Central African lineage that had caused outbreaks in DRC and Gabon is late 2002 (95% HPD interval 2000 - 2006). This gives us a lower boundary on the introduction of Central African lineage of EBOV into Guinea, although these estimates should be interpreted with caution. We also find very good support for the common ancestry of Guinea and DRC/Gabon lineages (posterior probability = 1.0). Figure 7 also highlights the importance of environmental sampling - many sequences in the tree come from ape carcasses and are more diverse (not shown) than sequences from human outbreaks, giving this dataset much better resolution. Although the closest relatives of the Guinea lineage are not entirely certain (posterior probability 0.92), its relationship with Central African EBOV lineages is well-supported (posterior probability 1.0).

Conclusion

The phylogenetic analysis of the five ebolavirus species here does not substantially improve on that presented by Baize et al.1 in that even when partitioning the alignment into coding and non-coding regions we get inconsistent rooting positions for the EBOV clade. We believe that at present no suitable outgroup sequences to root the EBOV phylogeny exist and that a temporal rooting gives the most consistent results. This approach indicates that the outbreak in Guinea is likely caused by a Zaire ebolavirus lineage that has spread from Central Africa into Guinea and West Africa in recent decades, and does not represent the emergence of a divergent and endemic virus. As the GP sequences show, without more diverse sequences, especially those from the animal reservoir, it is difficult to narrow down the estimates of when and through what means the Central African EBOV lineage has been introduced into West Africa.

Competing Interests

The authors have declared that no competing interests exist.
  11 in total

1.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

2.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

3.  Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics.

Authors:  Vladimir N Minin; Erik W Bloomquist; Marc A Suchard
Journal:  Mol Biol Evol       Date:  2008-04-11       Impact factor: 16.240

4.  Emergence of Zaire Ebola virus disease in Guinea.

Authors:  Sylvain Baize; Delphine Pannetier; Lisa Oestereich; Toni Rieger; Lamine Koivogui; N'Faly Magassouba; Barrè Soropogui; Mamadou Saliou Sow; Sakoba Keïta; Hilde De Clerck; Amanda Tiffany; Gemma Dominguez; Mathieu Loua; Alexis Traoré; Moussa Kolié; Emmanuel Roland Malano; Emmanuel Heleze; Anne Bocquin; Stephane Mély; Hervé Raoul; Valérie Caro; Dániel Cadar; Martin Gabriel; Meike Pahlmann; Dennis Tappe; Jonas Schmidt-Chanasit; Benido Impouma; Abdoul Karim Diallo; Pierre Formenty; Michel Van Herp; Stephan Günther
Journal:  N Engl J Med       Date:  2014-04-16       Impact factor: 91.245

5.  Wave-like spread of Ebola Zaire.

Authors:  Peter D Walsh; Roman Biek; Leslie A Real
Journal:  PLoS Biol       Date:  2005-10-25       Impact factor: 8.029

6.  Sequence analysis of the Ebola virus genome: organization, genetic elements, and comparison with the genome of Marburg virus.

Authors:  A Sanchez; M P Kiley; B P Holloway; D D Auperin
Journal:  Virus Res       Date:  1993-09       Impact factor: 3.303

7.  Isolates of Zaire ebolavirus from wild apes reveal genetic lineage and recombinants.

Authors:  Tatiana J Wittmann; Roman Biek; Alexandre Hassanin; Pierre Rouquet; Patricia Reed; Philippe Yaba; Xavier Pourrut; Leslie A Real; Jean-Paul Gonzalez; Eric M Leroy
Journal:  Proc Natl Acad Sci U S A       Date:  2007-10-17       Impact factor: 11.205

8.  Bayesian phylogenetics with BEAUti and the BEAST 1.7.

Authors:  Alexei J Drummond; Marc A Suchard; Dong Xie; Andrew Rambaut
Journal:  Mol Biol Evol       Date:  2012-02-25       Impact factor: 16.240

9.  Relaxed phylogenetics and dating with confidence.

Authors:  Alexei J Drummond; Simon Y W Ho; Matthew J Phillips; Andrew Rambaut
Journal:  PLoS Biol       Date:  2006-03-14       Impact factor: 8.029

10.  Recent common ancestry of Ebola Zaire virus found in a bat reservoir.

Authors:  Roman Biek; Peter D Walsh; Eric M Leroy; Leslie A Real
Journal:  PLoS Pathog       Date:  2006-10       Impact factor: 6.823

View more
  37 in total

1.  Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus.

Authors:  Kristian G Andersen; B Jesse Shapiro; Christian B Matranga; Rachel Sealfon; Aaron E Lin; Lina M Moses; Onikepe A Folarin; Augustine Goba; Ikponmwonsa Odia; Philomena E Ehiane; Mambu Momoh; Eleina M England; Sarah Winnicki; Luis M Branco; Stephen K Gire; Eric Phelan; Ridhi Tariyal; Ryan Tewhey; Omowunmi Omoniwa; Mohammed Fullah; Richard Fonnie; Mbalu Fonnie; Lansana Kanneh; Simbirie Jalloh; Michael Gbakie; Sidiki Saffa; Kandeh Karbo; Adrianne D Gladden; James Qu; Matthew Stremlau; Mahan Nekoui; Hilary K Finucane; Shervin Tabrizi; Joseph J Vitti; Bruce Birren; Michael Fitzgerald; Caryn McCowan; Andrea Ireland; Aaron M Berlin; James Bochicchio; Barbara Tazon-Vega; Niall J Lennon; Elizabeth M Ryan; Zach Bjornson; Danny A Milner; Amanda K Lukens; Nisha Broodie; Megan Rowland; Megan Heinrich; Marjan Akdag; John S Schieffelin; Danielle Levy; Henry Akpan; Daniel G Bausch; Kathleen Rubins; Joseph B McCormick; Eric S Lander; Stephan Günther; Lisa Hensley; Sylvanus Okogbenin; Stephen F Schaffner; Peter O Okokhere; S Humarr Khan; Donald S Grant; George O Akpede; Danny A Asogun; Andreas Gnirke; Joshua Z Levin; Christian T Happi; Robert F Garry; Pardis C Sabeti
Journal:  Cell       Date:  2015-08-13       Impact factor: 41.582

2.  Puzzling Origins of the Ebola Outbreak in the Democratic Republic of the Congo, 2014.

Authors:  Tommy Tsan-Yuk Lam; Huachen Zhu; Yee Ling Chong; Edward C Holmes; Yi Guan
Journal:  J Virol       Date:  2015-07-22       Impact factor: 5.103

Review 3.  The evolution of Ebola virus: Insights from the 2013-2016 epidemic.

Authors:  Edward C Holmes; Gytis Dudas; Andrew Rambaut; Kristian G Andersen
Journal:  Nature       Date:  2016-10-13       Impact factor: 49.962

4.  Genetic diversity and evolutionary dynamics of Ebola virus in Sierra Leone.

Authors:  Yi-Gang Tong; Wei-Feng Shi; Di Liu; Jun Qian; Long Liang; Xiao-Chen Bo; Jun Liu; Hong-Guang Ren; Hang Fan; Ming Ni; Yang Sun; Yuan Jin; Yue Teng; Zhen Li; David Kargbo; Foday Dafae; Alex Kanu; Cheng-Chao Chen; Zhi-Heng Lan; Hui Jiang; Yang Luo; Hui-Jun Lu; Xiao-Guang Zhang; Fan Yang; Yi Hu; Yu-Xi Cao; Yong-Qiang Deng; Hao-Xiang Su; Yu Sun; Wen-Sen Liu; Zhuang Wang; Cheng-Yu Wang; Zhao-Yang Bu; Zhen-Dong Guo; Liu-Bo Zhang; Wei-Min Nie; Chang-Qing Bai; Chun-Hua Sun; Xiao-Ping An; Pei-Song Xu; Xiang-Li-Lan Zhang; Yong Huang; Zhi-Qiang Mi; Dong Yu; Hong-Wu Yao; Yong Feng; Zhi-Ping Xia; Xue-Xing Zheng; Song-Tao Yang; Bing Lu; Jia-Fu Jiang; Brima Kargbo; Fu-Chu He; George F Gao; Wu-Chun Cao
Journal:  Nature       Date:  2015-05-13       Impact factor: 49.962

5.  Structures of protective antibodies reveal sites of vulnerability on Ebola virus.

Authors:  Charles D Murin; Marnie L Fusco; Zachary A Bornholdt; Xiangguo Qiu; Gene G Olinger; Larry Zeitlin; Gary P Kobinger; Andrew B Ward; Erica Ollmann Saphire
Journal:  Proc Natl Acad Sci U S A       Date:  2014-11-17       Impact factor: 11.205

Review 6.  Emerging Concepts of Data Integration in Pathogen Phylodynamics.

Authors:  Guy Baele; Marc A Suchard; Andrew Rambaut; Philippe Lemey
Journal:  Syst Biol       Date:  2017-01-01       Impact factor: 15.683

Review 7.  Genomic Analysis of Viral Outbreaks.

Authors:  Shirlee Wohl; Stephen F Schaffner; Pardis C Sabeti
Journal:  Annu Rev Virol       Date:  2016-08-03       Impact factor: 10.431

Review 8.  Ebola Virus Infection: Review of the Pharmacokinetic and Pharmacodynamic Properties of Drugs Considered for Testing in Human Efficacy Trials.

Authors:  Vincent Madelain; Thi Huyen Tram Nguyen; Anaelle Olivo; Xavier de Lamballerie; Jérémie Guedj; Anne-Marie Taburet; France Mentré
Journal:  Clin Pharmacokinet       Date:  2016-08       Impact factor: 6.447

9.  Ebola viral load at diagnosis associates with patient outcome and outbreak evolution.

Authors:  Marc-Antoine de La Vega; Grazia Caleo; Jonathan Audet; Xiangguo Qiu; Robert A Kozak; James I Brooks; Steven Kern; Anja Wolz; Armand Sprecher; Jane Greig; Kamalini Lokuge; David K Kargbo; Brima Kargbo; Antonino Di Caro; Allen Grolla; Darwyn Kobasa; James E Strong; Giuseppe Ippolito; Michel Van Herp; Gary P Kobinger
Journal:  J Clin Invest       Date:  2015-11-09       Impact factor: 14.808

10.  Virus genomes reveal factors that spread and sustained the Ebola epidemic.

Authors:  Gytis Dudas; Luiz Max Carvalho; Trevor Bedford; Andrew J Tatem; Guy Baele; Nuno R Faria; Daniel J Park; Jason T Ladner; Armando Arias; Danny Asogun; Filip Bielejec; Sarah L Caddy; Matthew Cotten; Jonathan D'Ambrozio; Simon Dellicour; Antonino Di Caro; Joseph W Diclaro; Sophie Duraffour; Michael J Elmore; Lawrence S Fakoli; Ousmane Faye; Merle L Gilbert; Sahr M Gevao; Stephen Gire; Adrianne Gladden-Young; Andreas Gnirke; Augustine Goba; Donald S Grant; Bart L Haagmans; Julian A Hiscox; Umaru Jah; Jeffrey R Kugelman; Di Liu; Jia Lu; Christine M Malboeuf; Suzanne Mate; David A Matthews; Christian B Matranga; Luke W Meredith; James Qu; Joshua Quick; Suzan D Pas; My V T Phan; Georgios Pollakis; Chantal B Reusken; Mariano Sanchez-Lockhart; Stephen F Schaffner; John S Schieffelin; Rachel S Sealfon; Etienne Simon-Loriere; Saskia L Smits; Kilian Stoecker; Lucy Thorne; Ekaete Alice Tobin; Mohamed A Vandi; Simon J Watson; Kendra West; Shannon Whitmer; Michael R Wiley; Sarah M Winnicki; Shirlee Wohl; Roman Wölfel; Nathan L Yozwiak; Kristian G Andersen; Sylvia O Blyden; Fatorma Bolay; Miles W Carroll; Bernice Dahn; Boubacar Diallo; Pierre Formenty; Christophe Fraser; George F Gao; Robert F Garry; Ian Goodfellow; Stephan Günther; Christian T Happi; Edward C Holmes; Brima Kargbo; Sakoba Keïta; Paul Kellam; Marion P G Koopmans; Jens H Kuhn; Nicholas J Loman; N'Faly Magassouba; Dhamari Naidoo; Stuart T Nichol; Tolbert Nyenswah; Gustavo Palacios; Oliver G Pybus; Pardis C Sabeti; Amadou Sall; Ute Ströher; Isatta Wurie; Marc A Suchard; Philippe Lemey; Andrew Rambaut
Journal:  Nature       Date:  2017-04-12       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.