Literature DB >> 29876136

Easy and accurate reconstruction of whole HIV genomes from short-read sequence data with shiver.

Chris Wymant1,2, François Blanquart2, Tanya Golubchik1,3, Astrid Gall4,5, Margreet Bakker6, Daniela Bezemer7, Nicholas J Croucher2, Matthew Hall1,2, Mariska Hillebregt7, Swee Hoe Ong5, Oliver Ratmann2,8, Jan Albert9,10, Norbert Bannert11, Jacques Fellay12,13, Katrien Fransen14, Annabelle Gourlay15,16, M Kate Grabowski17, Barbara Gunsenheimer-Bartmeyer18, Huldrych F Günthard19,20, Pia Kivelä21, Roger Kouyos19,20, Oliver Laeyendecker22, Kirsi Liitsola21, Laurence Meyer23, Kholoud Porter15, Matti Ristola21, Ard van Sighem7, Ben Berkhout6, Marion Cornelissen6, Paul Kellam24,25, Peter Reiss7,26, Christophe Fraser1,2.   

Abstract

Studying the evolution of viruses and their molecular epidemiology relies on accurate viral sequence data, so that small differences between similar viruses can be meaningfully interpreted. Despite its higher throughput and more detailed minority variant data, next-generation sequencing has yet to be widely adopted for HIV. The difficulty of accurately reconstructing the consensus sequence of a quasispecies from reads (short fragments of DNA) in the presence of large between- and within-host diversity, including frequent indels, may have presented a barrier. In particular, mapping (aligning) reads to a reference sequence leads to biased loss of information; this bias can distort epidemiological and evolutionary conclusions. De novo assembly avoids this bias by aligning the reads to themselves, producing a set of sequences called contigs. However contigs provide only a partial summary of the reads, misassembly may result in their having an incorrect structure, and no information is available at parts of the genome where contigs could not be assembled. To address these problems we developed the tool shiver to pre-process reads for quality and contamination, then map them to a reference tailored to the sample using corrected contigs supplemented with the user's choice of existing reference sequences. Run with two commands per sample, it can easily be used for large heterogeneous data sets. We used shiver to reconstruct the consensus sequence and minority variant information from paired-end short-read whole-genome data produced with the Illumina platform, for sixty-five existing publicly available samples and fifty new samples. We show the systematic superiority of mapping to shiver's constructed reference compared with mapping the same reads to the closest of 3,249 real references: median values of 13 bases called differently and more accurately, 0 bases called differently and less accurately, and 205 bases of missing sequence recovered. We also successfully applied shiver to whole-genome samples of Hepatitis C Virus and Respiratory Syncytial Virus. shiver is publicly available from https://github.com/ChrisHIV/shiver.

Entities:  

Keywords:  HIV; bioinformatics; diversity; genome assembly; mapping; next-generation sequencing

Year:  2018        PMID: 29876136      PMCID: PMC5961307          DOI: 10.1093/ve/vey007

Source DB:  PubMed          Journal:  Virus Evol        ISSN: 2057-1577


  54 in total

1.  Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID.

Authors:  Cassandra B Jabara; Corbin D Jones; Jeffrey Roach; Jeffrey A Anderson; Ronald Swanstrom
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-30       Impact factor: 11.205

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  Identification and characterization of conserved and variable regions in the envelope gene of HTLV-III/LAV, the retrovirus of AIDS.

Authors:  B R Starcich; B H Hahn; G M Shaw; P D McNeely; S Modrow; H Wolf; E S Parks; W P Parks; S F Josephs; R C Gallo
Journal:  Cell       Date:  1986-06-06       Impact factor: 41.582

4.  VirVarSeq: a low-frequency virus variant detection pipeline for Illumina sequencing using adaptive base-calling accuracy filtering.

Authors:  Bie M P Verbist; Kim Thys; Joke Reumers; Yves Wetzels; Koen Van der Borght; Willem Talloen; Jeroen Aerssens; Lieven Clement; Olivier Thas
Journal:  Bioinformatics       Date:  2014-08-31       Impact factor: 6.937

5.  Concomitant emergence of the antisense protein gene of HIV-1 and of the pandemic.

Authors:  Elodie Cassan; Anne-Muriel Arigon-Chifolleau; Jean-Michel Mesnard; Antoine Gross; Olivier Gascuel
Journal:  Proc Natl Acad Sci U S A       Date:  2016-09-28       Impact factor: 11.205

6.  De novo assembly of highly diverse viral populations.

Authors:  Xiao Yang; Patrick Charlebois; Sante Gnerre; Matthew G Coole; Niall J Lennon; Joshua Z Levin; James Qu; Elizabeth M Ryan; Michael C Zody; Matthew R Henn
Journal:  BMC Genomics       Date:  2012-09-13       Impact factor: 3.969

7.  Disease progression despite protective HLA expression in an HIV-infected transmission pair.

Authors:  Jacqui Brener; Astrid Gall; Rebecca Batorsky; Lynn Riddell; Soren Buus; Ellen Leitman; Paul Kellam; Todd Allen; Philip Goulder; Philippa C Matthews
Journal:  Retrovirology       Date:  2015-06-30       Impact factor: 4.602

8.  CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community.

Authors:  Thomas R Connor; Nicholas J Loman; Simon Thompson; Andy Smith; Joel Southgate; Radoslaw Poplawski; Matthew J Bull; Emily Richardson; Matthew Ismail; Simon Elwood- Thompson; Christine Kitchen; Martyn Guest; Marius Bakke; Samuel K Sheppard; Mark J Pallen
Journal:  Microb Genom       Date:  2016-09-20

9.  Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe.

Authors:  François Blanquart; Chris Wymant; Marion Cornelissen; Astrid Gall; Margreet Bakker; Daniela Bezemer; Matthew Hall; Mariska Hillebregt; Swee Hoe Ong; Jan Albert; Norbert Bannert; Jacques Fellay; Katrien Fransen; Annabelle J Gourlay; M Kate Grabowski; Barbara Gunsenheimer-Bartmeyer; Huldrych F Günthard; Pia Kivelä; Roger Kouyos; Oliver Laeyendecker; Kirsi Liitsola; Laurence Meyer; Kholoud Porter; Matti Ristola; Ard van Sighem; Guido Vanham; Ben Berkhout; Paul Kellam; Peter Reiss; Christophe Fraser
Journal:  PLoS Biol       Date:  2017-06-12       Impact factor: 8.029

10.  Genomics Virtual Laboratory: A Practical Bioinformatics Workbench for the Cloud.

Authors:  Enis Afgan; Clare Sloggett; Nuwan Goonasekera; Igor Makunin; Derek Benson; Mark Crowe; Simon Gladman; Yousef Kowsar; Michael Pheasant; Ron Horst; Andrew Lonie
Journal:  PLoS One       Date:  2015-10-26       Impact factor: 3.240

View more
  21 in total

1.  Measurement error and variant-calling in deep Illumina sequencing of HIV.

Authors:  Mark Howison; Mia Coetzer; Rami Kantor
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

2.  Evaluation of Phylogenetic Methods for Inferring the Direction of Human Immunodeficiency Virus (HIV) Transmission: HIV Prevention Trials Network (HPTN) 052.

Authors:  Yinfeng Zhang; Chris Wymant; Oliver Laeyendecker; M Kathryn Grabowski; Matthew Hall; Sarah Hudelson; Estelle Piwowar-Manning; Marybeth McCauley; Theresa Gamble; Mina C Hosseinipour; Nagalingeswaran Kumarasamy; James G Hakim; Johnstone Kumwenda; Lisa A Mills; Breno R Santos; Beatriz Grinsztejn; Jose H Pilotto; Suwat Chariyalertsak; Joseph Makhema; Ying Q Chen; Myron S Cohen; Christophe Fraser; Susan H Eshleman
Journal:  Clin Infect Dis       Date:  2021-01-23       Impact factor: 9.079

3.  Performance of a high-throughput next-generation sequencing method for analysis of HIV drug resistance and viral load.

Authors:  Jessica M Fogel; David Bonsall; Vanessa Cummings; Rory Bowden; Tanya Golubchik; Mariateresa de Cesare; Ethan A Wilson; Theresa Gamble; Carlos Del Rio; D Scott Batey; Kenneth H Mayer; Jason E Farley; James P Hughes; Robert H Remien; Chris Beyrer; Christophe Fraser; Susan H Eshleman
Journal:  J Antimicrob Chemother       Date:  2020-12-01       Impact factor: 5.790

4.  Deep-sequence phylogenetics to quantify patterns of HIV transmission in the context of a universal testing and treatment trial - BCPP/Ya Tsie trial.

Authors:  Lerato E Magosi; Yinfeng Zhang; Tanya Golubchik; Victor DeGruttola; Eric Tchetgen Tchetgen; Vladimir Novitsky; Janet Moore; Pam Bachanas; Tebogo Segolodi; Refeletswe Lebelonyane; Molly Pretorius Holme; Sikhulile Moyo; Joseph Makhema; Shahin Lockman; Christophe Fraser; Myron Max Essex; Marc Lipsitch
Journal:  Elife       Date:  2022-03-01       Impact factor: 8.140

5.  Phylogenetic estimation of the viral fitness landscape of HIV-1 set-point viral load.

Authors:  Lele Zhao; Chris Wymant; François Blanquart; Tanya Golubchik; Astrid Gall; Margreet Bakker; Daniela Bezemer; Matthew Hall; Swee Hoe Ong; Jan Albert; Norbert Bannert; Jacques Fellay; M Kate Grabowski; Barbara Gunsenheimer-Bartmeyer; Huldrych F Günthard; Pia Kivelä; Roger D Kouyos; Oliver Laeyendecker; Laurence Meyer; Kholoud Porter; Ard van Sighem; Marc van der Valk; Ben Berkhout; Paul Kellam; Marion Cornelissen; Peter Reiss; Christophe Fraser; Luca Ferretti
Journal:  Virus Evol       Date:  2022-03-16

6.  Recombination Analysis of Near Full-Length HIV-1 Sequences and the Identification of a Potential New Circulating Recombinant Form from Rakai, Uganda.

Authors:  Adam A Capoferri; Susanna L Lamers; Mary Kate Grabowski; Rebecca Rose; Maria J Wawer; David Serwadda; Ronald H Gray; Thomas C Quinn; Godfrey Kigozi; Joseph Kagaayi; Oliver Laeyendecker
Journal:  AIDS Res Hum Retroviruses       Date:  2020-03-02       Impact factor: 2.205

7.  V-pipe: a computational pipeline for assessing viral genetic diversity from high-throughput data.

Authors:  Susana Posada-Céspedes; David Seifert; Ivan Topolsky; Kim Philipp Jablonski; Karin J Metzner; Niko Beerenwinkel
Journal:  Bioinformatics       Date:  2021-01-20       Impact factor: 6.937

8.  HIV-1 full-genome phylogenetics of generalized epidemics in sub-Saharan Africa: impact of missing nucleotide characters in next-generation sequences.

Authors:  Oliver Ratmann; Chris Wymant; Caroline Colijn; Siva Danaviah; M Essex; Simon D W Frost; Astrid Gall; Simani Gaiseitsiwe; Mary Grabowski; Ronald Gray; Stephane Guindon; Arndt von Haeseler; Pontiano Kaleebu; Michelle Kendall; Alexey Kozlov; Justen Manasa; Bui Quang Minh; Sikhulile Moyo; Vladimir Novitsky; Rebecca Nsubuga; Sureshnee Pillay; Thomas C Quinn; David Serwadda; Deogratius Ssemwanga; Alexandros Stamatakis; Jana Trifinopoulos; Maria Wawer; Andrew Leigh Brown; Tulio de Oliveira; Paul Kellam; Deenan Pillay; Christophe Fraser
Journal:  AIDS Res Hum Retroviruses       Date:  2017-05-25       Impact factor: 2.205

9.  SARS-CoV-2 within-host diversity and transmission.

Authors:  Katrina A Lythgoe; Matthew Hall; Luca Ferretti; Mariateresa de Cesare; George MacIntyre-Cockett; Amy Trebes; Monique Andersson; Newton Otecko; Emma L Wise; Nathan Moore; Jessica Lynch; Stephen Kidd; Nicholas Cortes; Matilde Mori; Rebecca Williams; Gabrielle Vernet; Anita Justice; Angie Green; Samuel M Nicholls; M Azim Ansari; Lucie Abeler-Dörner; Catrin E Moore; Timothy E A Peto; David W Eyre; Robert Shaw; Peter Simmonds; David Buck; John A Todd; Thomas R Connor; Shirin Ashraf; Ana da Silva Filipe; James Shepherd; Emma C Thomson; David Bonsall; Christophe Fraser; Tanya Golubchik
Journal:  Science       Date:  2021-03-09       Impact factor: 47.728

Review 10.  Quality Control of Next-Generation Sequencing-Based HIV-1 Drug Resistance Data in Clinical Laboratory Information Systems Framework.

Authors:  Rupert Capina; Katherine Li; Levon Kearney; Anne-Mieke Vandamme; P Richard Harrigan; Kristel Van Laethem
Journal:  Viruses       Date:  2020-06-14       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.