Literature DB >> 26790373

Characterization of sequence-specific errors in various next-generation sequencing systems.

Sunguk Shin1, Joonhong Park1.   

Abstract

Next-generation sequencing (NGS) is a popular method for assessing the molecular diversity of microbial communities without cultivation, for identifying polymorphisms in populations, and for comparing genomes and transcriptomes. However, sequence-specific errors (SSEs) by NGS systems can result in genome mis-assembly, overestimation of diversity in microbial community analyses, and false polymorphism discovery. SSEs can be particularly problematic due to rich microbial biodiversity and genomes containing frequent repeats. In this study, SSEs in public data from all popular NGS systems were discovered using a Markov chain model and hotspots for sequence errors were identified. Deletion errors were frequently preceded by homopolymers in non-Illumina NGS systems, such as GS FLX+. Substitution errors were often related to high GC contents and long G/C homopolymers in Illumina sequencing systems such as HiSeq. After removal of long G/C homopolymers in HiSeq, the average lengths of contigs and average SNP quality increased. SSEs were selectively removed from our mock community data by quality filtering, and a bias against specific microbes was identified. Our findings provide a scientific basis for filtering poor-quality reads, correcting deletion errors, preventing genome mis-assembly, and accurately assessing microbial community compositions and polymorphisms.

Entities:  

Mesh:

Year:  2016        PMID: 26790373     DOI: 10.1039/c5mb00750j

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  11 in total

1.  Complete nucleotide sequence characterization of DRB5 alleles reveals a homogeneous allele group that is distinct from other DRB genes.

Authors:  Konstantinos Barsakis; Farbod Babrzadeh; Anjo Chi; Kalyan Mallempati; William Pickle; Michael Mindrinos; Marcelo A Fernández-Viña
Journal:  Hum Immunol       Date:  2019-04-05       Impact factor: 2.850

2.  Pan-cancer analysis of systematic batch effects on somatic sequence variations.

Authors:  Ji-Hye Choi; Seong-Eui Hong; Hyun Goo Woo
Journal:  BMC Bioinformatics       Date:  2017-04-11       Impact factor: 3.169

3.  Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease.

Authors:  Allison A Dilliott; Sali M K Farhan; Mahdi Ghani; Christine Sato; Eric Liang; Ming Zhang; Adam D McIntyre; Henian Cao; Lemuel Racacho; John F Robinson; Michael J Strong; Mario Masellis; Dennis E Bulman; Ekaterina Rogaeva; Anthony Lang; Carmela Tartaglia; Elizabeth Finger; Lorne Zinman; John Turnbull; Morris Freedman; Rick Swartz; Sandra E Black; Robert A Hegele
Journal:  J Vis Exp       Date:  2018-04-04       Impact factor: 1.355

4.  Evaluation of Oxford Nanopore's MinION Sequencing Device for Microbial Whole Genome Sequencing Applications.

Authors:  Andrea D Tyler; Laura Mataseje; Chantel J Urfano; Lisa Schmidt; Kym S Antonation; Michael R Mulvey; Cindi R Corbett
Journal:  Sci Rep       Date:  2018-07-19       Impact factor: 4.379

Review 5.  Challenges in the introduction of next-generation sequencing (NGS) for diagnostics of myeloid malignancies into clinical routine use.

Authors:  Ulrike Bacher; Evgenii Shumilov; Johanna Flach; Naomi Porret; Raphael Joncourt; Gertrud Wiedemann; Martin Fiedler; Urban Novak; Ursula Amstutz; Thomas Pabst
Journal:  Blood Cancer J       Date:  2018-11-12       Impact factor: 11.037

6.  CAGm: a repository of germline microsatellite variations in the 1000 genomes project.

Authors:  Nicholas Kinney; Kyle Titus-Glover; Jonathan D Wren; Robin T Varghese; Pawel Michalak; Han Liao; Ramu Anandakrishnan; Arichanah Pulenthiran; Lin Kang; Harold R Garner
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

7.  Is reliance on an inaccurate genome sequence sabotaging your experiments?

Authors:  Rodrigo P Baptista; Jessica C Kissinger
Journal:  PLoS Pathog       Date:  2019-09-12       Impact factor: 6.823

8.  Deep Parallel Characterization of AAV Tropism and AAV-Mediated Transcriptional Changes via Single-Cell RNA Sequencing.

Authors:  David Brown; Michael Altermatt; Tatyana Dobreva; Sisi Chen; Alexander Wang; Matt Thomson; Viviana Gradinaru
Journal:  Front Immunol       Date:  2021-10-21       Impact factor: 7.561

9.  Bioinformatic data processing pipelines in support of next-generation sequencing-based HIV drug resistance testing: the Winnipeg Consensus.

Authors:  Hezhao Ji; Eric Enns; Chanson J Brumme; Neil Parkin; Mark Howison; Emma R Lee; Rupert Capina; Eric Marinier; Santiago Avila-Rios; Paul Sandstrom; Gary Van Domselaar; Richard Harrigan; Roger Paredes; Rami Kantor; Marc Noguera-Julian
Journal:  J Int AIDS Soc       Date:  2018-10       Impact factor: 5.396

10.  Next-generation sequencing in the diagnosis of viral encephalitis: sensitivity and clinical limitations.

Authors:  Karol Perlejewski; Iwona Bukowska-Ośko; Małgorzata Rydzanicz; Agnieszka Pawełczyk; Kamila Caraballo Cortѐs; Sylwia Osuch; Marcin Paciorek; Tomasz Dzieciątkowski; Marek Radkowski; Tomasz Laskus
Journal:  Sci Rep       Date:  2020-09-30       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.