MOTIVATION: Pyrosequencing technology provides an important new approach to more extensively characterize diverse sequence populations and detect low frequency variants. However, the promise of this technology has been difficult to realize, as careful correction of sequencing errors is crucial to distinguish rare variants (∼1%) in an infected host with high sensitivity and specificity. RESULTS: We developed a new approach, referred to as Indel and Carryforward Correction (ICC), to cluster sequences without substitutions and locally correct only indel and carryforward sequencing errors within clusters to ensure that no rare variants are lost. ICC performs sequence clustering in the order of (i) homopolymer indel patterns only, (ii) indel patterns only and (iii) carryforward errors only, without the requirement of a distance cutoff value. Overall, ICC removed 93-95% of sequencing errors found in control datasets. On pyrosequencing data from a PCR fragment derived from 15 HIV-1 plasmid clones mixed at various frequencies as low as 0.1%, ICC achieved the highest sensitivity and similar specificity compared with other commonly used error correction and variant calling algorithms. AVAILABILITY AND IMPLEMENTATION: Source code is freely available for download at http://indra.mullins.microbiol.washington.edu/ICC. It is implemented in Perl and supported on Linux, Mac OS X and MS Windows.
MOTIVATION: Pyrosequencing technology provides an important new approach to more extensively characterize diverse sequence populations and detect low frequency variants. However, the promise of this technology has been difficult to realize, as careful correction of sequencing errors is crucial to distinguish rare variants (∼1%) in an infected host with high sensitivity and specificity. RESULTS: We developed a new approach, referred to as Indel and Carryforward Correction (ICC), to cluster sequences without substitutions and locally correct only indel and carryforward sequencing errors within clusters to ensure that no rare variants are lost. ICC performs sequence clustering in the order of (i) homopolymer indel patterns only, (ii) indel patterns only and (iii) carryforward errors only, without the requirement of a distance cutoff value. Overall, ICC removed 93-95% of sequencing errors found in control datasets. On pyrosequencing data from a PCR fragment derived from 15 HIV-1 plasmid clones mixed at various frequencies as low as 0.1%, ICC achieved the highest sensitivity and similar specificity compared with other commonly used error correction and variant calling algorithms. AVAILABILITY AND IMPLEMENTATION: Source code is freely available for download at http://indra.mullins.microbiol.washington.edu/ICC. It is implemented in Perl and supported on Linux, Mac OS X and MS Windows.
Authors: Marcel Margulies; Michael Egholm; William E Altman; Said Attiya; Joel S Bader; Lisa A Bemben; Jan Berka; Michael S Braverman; Yi-Ju Chen; Zhoutao Chen; Scott B Dewell; Lei Du; Joseph M Fierro; Xavier V Gomes; Brian C Godwin; Wen He; Scott Helgesen; Chun Heen Ho; Chun He Ho; Gerard P Irzyk; Szilveszter C Jando; Maria L I Alenquer; Thomas P Jarvie; Kshama B Jirage; Jong-Bum Kim; James R Knight; Janna R Lanza; John H Leamon; Steven M Lefkowitz; Ming Lei; Jing Li; Kenton L Lohman; Hong Lu; Vinod B Makhijani; Keith E McDade; Michael P McKenna; Eugene W Myers; Elizabeth Nickerson; John R Nobile; Ramona Plant; Bernard P Puc; Michael T Ronan; George T Roth; Gary J Sarkis; Jan Fredrik Simons; John W Simpson; Maithreyan Srinivasan; Karrie R Tartaro; Alexander Tomasz; Kari A Vogt; Greg A Volkmer; Shally H Wang; Yong Wang; Michael P Weiner; Pengguang Yu; Richard F Begley; Jonathan M Rothberg Journal: Nature Date: 2005-07-31 Impact factor: 49.962
Authors: Christine M Rousseau; Brian A Birditt; Angela R McKay; Julia N Stoddard; Tsan Chun Lee; Sherry McLaughlin; Sarah W Moore; Nice Shindo; Gerald H Learn; Bette T Korber; Christian Brander; Philip J R Goulder; Photini Kiepiela; Bruce D Walker; James I Mullins Journal: J Virol Methods Date: 2006-05-15 Impact factor: 2.014
Authors: Birgitte B Simen; Jan Fredrik Simons; Katherine Huppler Hullsiek; Richard M Novak; Rodger D Macarthur; John D Baxter; Chunli Huang; Christine Lubeski; Gregory S Turenchalk; Michael S Braverman; Brian Desany; Jonathan M Rothberg; Michael Egholm; Michael J Kozal Journal: J Infect Dis Date: 2009-03-01 Impact factor: 5.226
Authors: Wei Shao; Valerie F Boltz; Jonathan E Spindler; Mary F Kearney; Frank Maldarelli; John W Mellors; Claudia Stewart; Natalia Volfovsky; Alexander Levitsky; Robert M Stephens; John M Coffin Journal: Retrovirology Date: 2013-02-13 Impact factor: 4.602
Authors: Paul Hughes; Wenjie Deng; Scott C Olson; Robert W Coombs; Michael H Chung; Lisa M Frenkel Journal: AIDS Res Hum Retroviruses Date: 2015-12-15 Impact factor: 2.205
Authors: Robert Lücking; James D Lawrey; Patrick M Gillevet; Masoumeh Sikaroodi; Manuela Dal-Forno; Simon A Berger Journal: J Mol Evol Date: 2013-12-17 Impact factor: 2.395
Authors: Ingrid A Beck; Wenjie Deng; Rachel Payant; Robert Hall; Roger E Bumgarner; James I Mullins; Lisa M Frenkel Journal: J Clin Microbiol Date: 2014-04-16 Impact factor: 5.948
Authors: Michael H Chung; Ingrid A Beck; Sandra Dross; Kenneth Tapia; James N Kiarie; Barbra A Richardson; Julie Overbaugh; Samah R Sakr; Grace C John-Stewart; Lisa M Frenkel Journal: J Acquir Immune Defic Syndr Date: 2014-11-01 Impact factor: 3.731
Authors: Scott C Olson; Nicole Ngo-Giang-Huong; Ingrid Beck; Wenjie Deng; Paula Britto; David E Shapiro; Roger E Bumgarner; James I Mullins; Russell B Van Dyke; Gonzague Jourdain; Lisa M Frenkel Journal: AIDS Date: 2015-07-31 Impact factor: 4.177
Authors: Shyamala Iyer; Eleanor Casey; Heather Bouzek; Moon Kim; Wenjie Deng; Brendan B Larsen; Hong Zhao; Roger E Bumgarner; Morgane Rolland; James I Mullins Journal: PLoS One Date: 2015-08-28 Impact factor: 3.240
Authors: Brendan B Larsen; Lennie Chen; Brandon S Maust; Moon Kim; Hong Zhao; Wenjie Deng; Dylan Westfall; Ingrid Beck; Lisa M Frenkel; James I Mullins Journal: PLoS One Date: 2013-10-02 Impact factor: 3.240
Authors: Joanne D Stekler; Ross Milne; Rachel Payant; Ingrid Beck; Joshua Herbeck; Brandon Maust; Wenjie Deng; Kenneth Tapia; Sarah Holte; Janine Maenza; Claire E Stevens; James I Mullins; Ann C Collier; Lisa M Frenkel Journal: PLoS Med Date: 2018-03-27 Impact factor: 11.069