Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Contaminant DNA in bacterial sequencing experiments is a major source of false genetic variability.

Literature DB >> 32122347

Contaminant DNA in bacterial sequencing experiments is a major source of false genetic variability.

Galo A Goig¹, Silvia Blanco², Alberto L Garcia-Basteiro^2,3, Iñaki Comas^4,5.

Abstract

BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic repositories. Strikingly, analysis workflows for whole-genome sequencing (WGS) data commonly do not account for errors potentially introduced by contamination, which could lead to the wrong assessment of allele frequency both in basic and clinical research.
RESULTS: We used a taxonomic filter to remove contaminant reads from more than 4000 bacterial samples from 20 different studies and performed a comprehensive evaluation of the extent and impact of contaminant DNA in WGS. We found that contamination is pervasive and can introduce large biases in variant analysis. We showed that these biases can result in hundreds of false positive and negative SNPs, even for samples with slight contamination. Studies investigating complex biological traits from sequencing data can be completely biased if contamination is neglected during the bioinformatic analysis, and we demonstrate that removing contaminant reads with a taxonomic classifier permits more accurate variant calling. We used both real and simulated data to evaluate and implement reliable, contamination-aware analysis pipelines.
CONCLUSION: As sequencing technologies consolidate as precision tools that are increasingly adopted in the research and clinical context, our results urge for the implementation of contamination-aware analysis pipelines. Taxonomic classifiers are a powerful tool to implement such pipelines.

Entities: CellLine Chemical Disease Species

Year: 2020 PMID： 32122347 DOI： 10.1186/s12915-020-0748-z

Source DB: PubMed Journal: BMC Biol ISSN： 1741-7007 Impact factor: 7.431

23 in total

Review 1. Clinical Aspergillus Signatures in COPD and Bronchiectasis.

Authors: Pei Yee Tiew; Kai Xian Thng; Sanjay H Chotirmall
Journal: J Fungi (Basel) Date: 2022-05-05

2. Benchmarking the empirical accuracy of short-read sequencing across the M. tuberculosis genome.

Authors: Maximillian Marin; Roger Vargas; Michael Harris; Brendan Jeffrey; L Elaine Epperson; David Durbin; Michael Strong; Max Salfinger; Zamin Iqbal; Irada Akhundova; Sergo Vashakidze; Valeriu Crudu; Alex Rosenthal; Maha Reda Farhat
Journal: Bioinformatics Date: 2022-01-10 Impact factor: 6.931

3. In-host population dynamics of Mycobacterium tuberculosis complex during active disease.

Authors: Roger Vargas; Luca Freschi; Maximillian Marin; L Elaine Epperson; Melissa Smith; Irina Oussenko; David Durbin; Michael Strong; Max Salfinger; Maha Reda Farhat
Journal: Elife Date: 2021-02-01 Impact factor: 8.140

4. Dynamics of within-host Mycobacterium tuberculosis diversity and heteroresistance during treatment.

Authors: Camus Nimmo; Kayleen Brien; James Millard; Alison D Grant; Nesri Padayatchi; Alexander S Pym; Max O'Donnell; Richard Goldstein; Judith Breuer; François Balloux
Journal: EBioMedicine Date: 2020-04-28 Impact factor: 8.143

5. Whole genomic sequencing based genotyping reveals a specific X3 sublineage restricted to Mexico and related with multidrug resistance.

Authors: Ana Cristina Jiménez-Ruano; Carlos Francisco Madrazo-Moya; Irving Cancino-Muñoz; Paulina M Mejía-Ponce; Cuauhtémoc Licona-Cassani; Iñaki Comas; Raquel Muñiz-Salazar; Roberto Zenteno-Cuevas
Journal: Sci Rep Date: 2021-01-21 Impact factor: 4.379

Contaminant DNA in bacterial sequencing experiments is a major source of false genetic variability.

Review 1. Clinical Aspergillus Signatures in COPD and Bronchiectasis.

2. Benchmarking the empirical accuracy of short-read sequencing across the M. tuberculosis genome.

3. In-host population dynamics of Mycobacterium tuberculosis complex during active disease.

4. Dynamics of within-host Mycobacterium tuberculosis diversity and heteroresistance during treatment.

5. Whole genomic sequencing based genotyping reveals a specific X3 sublineage restricted to Mexico and related with multidrug resistance.

6. Simplitigs as an efficient and scalable representation of de Bruijn graphs.

Review 7. Metagenomics: a path to understanding the gut microbiome.

Review 8. Mycobacterium bovis: From Genotyping to Genome Sequencing.

9. Assessment of databases to determine the validity of β- and γ-carbonic anhydrase sequences from vertebrates.

10. Genomic variant-identification methods may alter Mycobacterium tuberculosis transmission inferences.