Literature DB >> 32122347

Contaminant DNA in bacterial sequencing experiments is a major source of false genetic variability.

Galo A Goig1, Silvia Blanco2, Alberto L Garcia-Basteiro2,3, Iñaki Comas4,5.   

Abstract

BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic repositories. Strikingly, analysis workflows for whole-genome sequencing (WGS) data commonly do not account for errors potentially introduced by contamination, which could lead to the wrong assessment of allele frequency both in basic and clinical research.
RESULTS: We used a taxonomic filter to remove contaminant reads from more than 4000 bacterial samples from 20 different studies and performed a comprehensive evaluation of the extent and impact of contaminant DNA in WGS. We found that contamination is pervasive and can introduce large biases in variant analysis. We showed that these biases can result in hundreds of false positive and negative SNPs, even for samples with slight contamination. Studies investigating complex biological traits from sequencing data can be completely biased if contamination is neglected during the bioinformatic analysis, and we demonstrate that removing contaminant reads with a taxonomic classifier permits more accurate variant calling. We used both real and simulated data to evaluate and implement reliable, contamination-aware analysis pipelines.
CONCLUSION: As sequencing technologies consolidate as precision tools that are increasingly adopted in the research and clinical context, our results urge for the implementation of contamination-aware analysis pipelines. Taxonomic classifiers are a powerful tool to implement such pipelines.

Entities:  

Year:  2020        PMID: 32122347     DOI: 10.1186/s12915-020-0748-z

Source DB:  PubMed          Journal:  BMC Biol        ISSN: 1741-7007            Impact factor:   7.431


  23 in total

Review 1.  Clinical Aspergillus Signatures in COPD and Bronchiectasis.

Authors:  Pei Yee Tiew; Kai Xian Thng; Sanjay H Chotirmall
Journal:  J Fungi (Basel)       Date:  2022-05-05

2.  Benchmarking the empirical accuracy of short-read sequencing across the M. tuberculosis genome.

Authors:  Maximillian Marin; Roger Vargas; Michael Harris; Brendan Jeffrey; L Elaine Epperson; David Durbin; Michael Strong; Max Salfinger; Zamin Iqbal; Irada Akhundova; Sergo Vashakidze; Valeriu Crudu; Alex Rosenthal; Maha Reda Farhat
Journal:  Bioinformatics       Date:  2022-01-10       Impact factor: 6.931

3.  In-host population dynamics of Mycobacterium tuberculosis complex during active disease.

Authors:  Roger Vargas; Luca Freschi; Maximillian Marin; L Elaine Epperson; Melissa Smith; Irina Oussenko; David Durbin; Michael Strong; Max Salfinger; Maha Reda Farhat
Journal:  Elife       Date:  2021-02-01       Impact factor: 8.140

4.  Dynamics of within-host Mycobacterium tuberculosis diversity and heteroresistance during treatment.

Authors:  Camus Nimmo; Kayleen Brien; James Millard; Alison D Grant; Nesri Padayatchi; Alexander S Pym; Max O'Donnell; Richard Goldstein; Judith Breuer; François Balloux
Journal:  EBioMedicine       Date:  2020-04-28       Impact factor: 8.143

5.  Whole genomic sequencing based genotyping reveals a specific X3 sublineage restricted to Mexico and related with multidrug resistance.

Authors:  Ana Cristina Jiménez-Ruano; Carlos Francisco Madrazo-Moya; Irving Cancino-Muñoz; Paulina M Mejía-Ponce; Cuauhtémoc Licona-Cassani; Iñaki Comas; Raquel Muñiz-Salazar; Roberto Zenteno-Cuevas
Journal:  Sci Rep       Date:  2021-01-21       Impact factor: 4.379

6.  Simplitigs as an efficient and scalable representation of de Bruijn graphs.

Authors:  Michael Baym; Gregory Kucherov; Karel Břinda
Journal:  Genome Biol       Date:  2021-04-06       Impact factor: 13.583

Review 7.  Metagenomics: a path to understanding the gut microbiome.

Authors:  Sandi Yen; Jethro S Johnson
Journal:  Mamm Genome       Date:  2021-07-14       Impact factor: 2.957

Review 8.  Mycobacterium bovis: From Genotyping to Genome Sequencing.

Authors:  Ana M S Guimaraes; Cristina K Zimpel
Journal:  Microorganisms       Date:  2020-05-03

9.  Assessment of databases to determine the validity of β- and γ-carbonic anhydrase sequences from vertebrates.

Authors:  Reza Zolfaghari Emameh; Marianne Kuuslahti; Hassan Nosrati; Hannes Lohi; Seppo Parkkila
Journal:  BMC Genomics       Date:  2020-05-11       Impact factor: 3.969

10.  Genomic variant-identification methods may alter Mycobacterium tuberculosis transmission inferences.

Authors:  Katharine S Walter; Caroline Colijn; Ted Cohen; Barun Mathema; Qingyun Liu; Jolene Bowers; David M Engelthaler; Apurva Narechania; Darrin Lemmer; Julio Croda; Jason R Andrews
Journal:  Microb Genom       Date:  2020-07-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.