Literature DB >> 27194763

Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling.

John T McCrone1, Adam S Lauring2.   

Abstract

UNLABELLED: With next-generation sequencing technologies, it is now feasible to efficiently sequence patient-derived virus populations at a depth of coverage sufficient to detect rare variants. However, each sequencing platform has characteristic error profiles, and sample collection, target amplification, and library preparation are additional processes whereby errors are introduced and propagated. Many studies account for these errors by using ad hoc quality thresholds and/or previously published statistical algorithms. Despite common usage, the majority of these approaches have not been validated under conditions that characterize many studies of intrahost diversity. Here, we use defined populations of influenza virus to mimic the diversity and titer typically found in patient-derived samples. We identified single-nucleotide variants using two commonly employed variant callers, DeepSNV and LoFreq. We found that the accuracy of these variant callers was lower than expected and exquisitely sensitive to the input titer. Small reductions in specificity had a significant impact on the number of minority variants identified and subsequent measures of diversity. We were able to increase the specificity of DeepSNV to >99.95% by applying an empirically validated set of quality thresholds. When applied to a set of influenza virus samples from a household-based cohort study, these changes resulted in a 10-fold reduction in measurements of viral diversity. We have made our sequence data and analysis code available so that others may improve on our work and use our data set to benchmark their own bioinformatics pipelines. Our work demonstrates that inadequate quality control and validation can lead to significant overestimation of intrahost diversity. IMPORTANCE: Advances in sequencing technology have made it feasible to sequence patient-derived viral samples at a level sufficient for detection of rare mutations. These high-throughput, cost-effective methods are revolutionizing the study of within-host viral diversity. However, the techniques are error prone, and the methods commonly used to control for these errors have not been validated under the conditions that characterize patient-derived samples. Here, we show that these conditions affect measurements of viral diversity. We found that the accuracy of previously benchmarked analysis pipelines was greatly reduced under patient-derived conditions. By carefully validating our sequencing analysis using known control samples, we were able to identify biases in our method and to improve our accuracy to acceptable levels. Application of our modified pipeline to a set of influenza virus samples from a cohort study provided a realistic picture of intrahost diversity and suggested the need for rigorous quality control in such studies.
Copyright © 2016, American Society for Microbiology. All Rights Reserved.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27194763      PMCID: PMC4944299          DOI: 10.1128/JVI.00667-16

Source DB:  PubMed          Journal:  J Virol        ISSN: 0022-538X            Impact factor:   5.103


  55 in total

1.  Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID.

Authors:  Cassandra B Jabara; Corbin D Jones; Jeffrey Roach; Jeffrey A Anderson; Ronald Swanstrom
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-30       Impact factor: 11.205

2.  Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus.

Authors:  Kristian G Andersen; B Jesse Shapiro; Christian B Matranga; Rachel Sealfon; Aaron E Lin; Lina M Moses; Onikepe A Folarin; Augustine Goba; Ikponmwonsa Odia; Philomena E Ehiane; Mambu Momoh; Eleina M England; Sarah Winnicki; Luis M Branco; Stephen K Gire; Eric Phelan; Ridhi Tariyal; Ryan Tewhey; Omowunmi Omoniwa; Mohammed Fullah; Richard Fonnie; Mbalu Fonnie; Lansana Kanneh; Simbirie Jalloh; Michael Gbakie; Sidiki Saffa; Kandeh Karbo; Adrianne D Gladden; James Qu; Matthew Stremlau; Mahan Nekoui; Hilary K Finucane; Shervin Tabrizi; Joseph J Vitti; Bruce Birren; Michael Fitzgerald; Caryn McCowan; Andrea Ireland; Aaron M Berlin; James Bochicchio; Barbara Tazon-Vega; Niall J Lennon; Elizabeth M Ryan; Zach Bjornson; Danny A Milner; Amanda K Lukens; Nisha Broodie; Megan Rowland; Megan Heinrich; Marjan Akdag; John S Schieffelin; Danielle Levy; Henry Akpan; Daniel G Bausch; Kathleen Rubins; Joseph B McCormick; Eric S Lander; Stephan Günther; Lisa Hensley; Sylvanus Okogbenin; Stephen F Schaffner; Peter O Okokhere; S Humarr Khan; Donald S Grant; George O Akpede; Danny A Asogun; Andreas Gnirke; Joshua Z Levin; Christian T Happi; Robert F Garry; Pardis C Sabeti
Journal:  Cell       Date:  2015-08-13       Impact factor: 41.582

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

Review 4.  The role of replicates for error mitigation in next-generation sequencing.

Authors:  Kimberly Robasky; Nathan E Lewis; George M Church
Journal:  Nat Rev Genet       Date:  2013-12-10       Impact factor: 53.242

5.  Intrahost dynamics of antiviral resistance in influenza A virus reflect complex patterns of segment linkage, reassortment, and natural selection.

Authors:  Matthew B Rogers; Timothy Song; Robert Sebra; Benjamin D Greenbaum; Marie-Eve Hamelin; Adam Fitch; Alan Twaddle; Lijia Cui; Edward C Holmes; Guy Boivin; Elodie Ghedin
Journal:  MBio       Date:  2015-04-07       Impact factor: 7.867

6.  Subclonal variant calling with multiple samples and prior knowledge.

Authors:  Moritz Gerstung; Elli Papaemmanuil; Peter J Campbell
Journal:  Bioinformatics       Date:  2014-01-16       Impact factor: 6.937

7.  Frequency of acute respiratory illnesses and circulation of respiratory viruses in households with children over 3 surveillance seasons.

Authors:  Arnold S Monto; Ryan E Malosh; Joshua G Petrie; Mark G Thompson; Suzanne E Ohmit
Journal:  J Infect Dis       Date:  2014-06-06       Impact factor: 5.226

8.  Estimation of sequencing error rates in short reads.

Authors:  Xin Victoria Wang; Natalie Blades; Jie Ding; Razvan Sultana; Giovanni Parmigiani
Journal:  BMC Bioinformatics       Date:  2012-07-30       Impact factor: 3.169

9.  Viral genome sequencing by random priming methods.

Authors:  Appolinaire Djikeng; Rebecca Halpin; Ryan Kuzmickas; Jay Depasse; Jeremy Feldblyum; Naomi Sengamalay; Claudio Afonso; Xinsheng Zhang; Norman G Anderson; Elodie Ghedin; David J Spiro
Journal:  BMC Genomics       Date:  2008-01-07       Impact factor: 3.969

10.  An extensive evaluation of read trimming effects on Illumina NGS data analysis.

Authors:  Cristian Del Fabbro; Simone Scalabrin; Michele Morgante; Federico M Giorgi
Journal:  PLoS One       Date:  2013-12-23       Impact factor: 3.240

View more
  43 in total

1.  Limited Intrahost Diversity and Background Evolution Accompany 40 Years of Canine Parvovirus Host Adaptation and Spread.

Authors:  Ian E H Voorhees; Hyunwook Lee; Andrew B Allison; Robert Lopez-Astacio; Laura B Goodman; Oyebola O Oyesola; Olutayo Omobowale; Olusegun Fagbohun; Edward J Dubovi; Susan L Hafenstein; Edward C Holmes; Colin R Parrish
Journal:  J Virol       Date:  2019-12-12       Impact factor: 5.103

2.  Model-based estimation of superinfection prevalence from limited datasets.

Authors:  Daniel B Reeves; Amalia S Magaret; Alex L Greninger; Christine Johnston; Joshua T Schiffer
Journal:  J R Soc Interface       Date:  2018-02       Impact factor: 4.118

Review 3.  Within-Host Evolution of Human Influenza Virus.

Authors:  Katherine S Xue; Louise H Moncla; Trevor Bedford; Jesse D Bloom
Journal:  Trends Microbiol       Date:  2018-03-10       Impact factor: 17.079

4.  Influenza B Viruses Exhibit Lower Within-Host Diversity than Influenza A Viruses in Human Hosts.

Authors:  Andrew L Valesano; William J Fitzsimmons; John T McCrone; Joshua G Petrie; Arnold S Monto; Emily T Martin; Adam S Lauring
Journal:  J Virol       Date:  2020-02-14       Impact factor: 5.103

5.  Influenza Viruses in Mice: Deep Sequencing Analysis of Serial Passage and Effects of Sialic Acid Structural Variation.

Authors:  Brian R Wasik; Ian E H Voorhees; Karen N Barnard; Brynn K Alford-Lawrence; Wendy S Weichert; Grace Hood; Aitor Nogales; Luis Martínez-Sobrido; Edward C Holmes; Colin R Parrish
Journal:  J Virol       Date:  2019-11-13       Impact factor: 5.103

6.  Narrow transmission bottlenecks and limited within-host viral diversity during a SARS-CoV-2 outbreak on a fishing boat.

Authors:  William W Hannon; Pavitra Roychoudhury; Hong Xie; Lasata Shrestha; Amin Addetia; Keith R Jerome; Alexander L Greninger; Jesse D Bloom
Journal:  Virus Evol       Date:  2022-06-16

Review 7.  Teaching an Old Virus New Tricks: A Review on New Approaches to Study Age-Old Questions in Influenza Biology.

Authors:  Seema S Lakdawala; Nara Lee; Christopher B Brooke
Journal:  J Mol Biol       Date:  2019-04-30       Impact factor: 5.469

8.  The Number of Target Molecules of the Amplification Step Limits Accuracy and Sensitivity in Ultradeep-Sequencing Viral Population Studies.

Authors:  Romain Gallet; Frédéric Fabre; Yannis Michalakis; Stéphane Blanc
Journal:  J Virol       Date:  2017-07-27       Impact factor: 5.103

9.  Patterns of within-host genetic diversity in SARS-CoV-2.

Authors:  Gerry Tonkin-Hill; Inigo Martincorena; Roberto Amato; Andrew R J Lawson; Moritz Gerstung; Ian Johnston; David K Jackson; Naomi Park; Stefanie V Lensing; Michael A Quail; Sónia Gonçalves; Cristina Ariani; Michael Spencer Chapman; William L Hamilton; Luke W Meredith; Grant Hall; Aminu S Jahun; Yasmin Chaudhry; Myra Hosmillo; Malte L Pinckert; Iliana Georgana; Anna Yakovleva; Laura G Caller; Sarah L Caddy; Theresa Feltwell; Fahad A Khokhar; Charlotte J Houldcroft; Martin D Curran; Surendra Parmar; Alex Alderton; Rachel Nelson; Ewan M Harrison; John Sillitoe; Stephen D Bentley; Jeffrey C Barrett; M Estee Torok; Ian G Goodfellow; Cordelia Langford; Dominic Kwiatkowski
Journal:  Elife       Date:  2021-08-13       Impact factor: 8.140

10.  The Mutational Robustness of Influenza A Virus.

Authors:  Elisa Visher; Shawn E Whitefield; John T McCrone; William Fitzsimmons; Adam S Lauring
Journal:  PLoS Pathog       Date:  2016-08-29       Impact factor: 6.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.