Literature DB >> 26932272

Normalization of transposon-mutant library sequencing datasets to improve identification of conditionally essential genes.

Michael A DeJesus1, Thomas R Ioerger1.   

Abstract

Sequencing of transposon-mutant libraries using next-generation sequencing (TnSeq) has become a popular method for determining which genes and non-coding regions are essential for growth under various conditions in bacteria. For methods that rely on quantitative comparison of counts of reads at transposon insertion sites, proper normalization of TnSeq datasets is vitally important. Real TnSeq datasets are often noisy and exhibit a significant skew that can be dominated by high counts at a small number of sites (often for non-biological reasons). If two datasets that are not appropriately normalized are compared, it might cause the artifactual appearance of Differentially Essential (DE) genes in a statistical test, constituting type I errors (false positives). In this paper, we propose a novel method for normalization of TnSeq datasets that corrects for the skew of read-count distributions by fitting them to a Beta-Geometric distribution. We show that this read-count correction procedure reduces the number of false positives when comparing replicate datasets grown under the same conditions (for which no genuine differences in essentiality are expected). We compare these results to results obtained with other normalization procedures, and show that it results in greater reduction in the number of false positives. In addition we investigate the effects of normalization on the detection of DE genes.

Entities:  

Keywords:  Normalization; TnSeq; essentiality

Mesh:

Substances:

Year:  2016        PMID: 26932272      PMCID: PMC5654600          DOI: 10.1142/S021972001642004X

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  21 in total

1.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias.

Authors:  B M Bolstad; R A Irizarry; M Astrand; T P Speed
Journal:  Bioinformatics       Date:  2003-01-22       Impact factor: 6.937

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  Identifying essential genes in Mycobacterium tuberculosis by global phenotypic profiling.

Authors:  Jarukit E Long; Michael DeJesus; Doyle Ward; Richard E Baker; Thomas Ioerger; Christopher M Sassetti
Journal:  Methods Mol Biol       Date:  2015

4.  A purified mariner transposase is sufficient to mediate transposition in vitro.

Authors:  D J Lampe; M E Churchill; H M Robertson
Journal:  EMBO J       Date:  1996-10-01       Impact factor: 11.598

5.  A scaling normalization method for differential expression analysis of RNA-seq data.

Authors:  Mark D Robinson; Alicia Oshlack
Journal:  Genome Biol       Date:  2010-03-02       Impact factor: 13.583

6.  Genetic requirements for mycobacterial survival during infection.

Authors:  Christopher M Sassetti; Eric J Rubin
Journal:  Proc Natl Acad Sci U S A       Date:  2003-10-20       Impact factor: 11.205

Review 7.  Transposon insertion sequencing: a new tool for systems-level analysis of microorganisms.

Authors:  Tim van Opijnen; Andrew Camilli
Journal:  Nat Rev Microbiol       Date:  2013-05-28       Impact factor: 60.633

8.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

9.  ARTIST: high-resolution genome-wide assessment of fitness using transposon-insertion sequencing.

Authors:  Justin R Pritchard; Michael C Chao; Sören Abel; Brigid M Davis; Catherine Baranowski; Yanjia J Zhang; Eric J Rubin; Matthew K Waldor
Journal:  PLoS Genet       Date:  2014-11-06       Impact factor: 5.917

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  6 in total

1.  Statistical analysis of genetic interactions in Tn-Seq data.

Authors:  Michael A DeJesus; Subhalaxmi Nambi; Clare M Smith; Richard E Baker; Christopher M Sassetti; Thomas R Ioerger
Journal:  Nucleic Acids Res       Date:  2017-06-20       Impact factor: 16.971

2.  Analysis of Gene Essentiality from TnSeq Data Using Transit.

Authors:  Thomas R Ioerger
Journal:  Methods Mol Biol       Date:  2022

3.  TnSeq of Mycobacterium tuberculosis clinical isolates reveals strain-specific antibiotic liabilities.

Authors:  Allison F Carey; Jeremy M Rock; Inna V Krieger; Michael R Chase; Marta Fernandez-Suarez; Sebastien Gagneux; James C Sacchettini; Thomas R Ioerger; Sarah M Fortune
Journal:  PLoS Pathog       Date:  2018-03-05       Impact factor: 6.823

4.  A Comprehensive Overview of the Genes and Functions Required for Lettuce Infection by the Hemibiotrophic Phytopathogen Xanthomonas hortorum pv. vitians.

Authors:  Lucas Morinière; Laurène Mirabel; Erwan Gueguen; Franck Bertolla
Journal:  mSystems       Date:  2022-03-21       Impact factor: 7.324

5.  Statistical analysis of variability in TnSeq data across conditions using zero-inflated negative binomial regression.

Authors:  Siddharth Subramaniyam; Michael A DeJesus; Anisha Zaveri; Clare M Smith; Richard E Baker; Sabine Ehrt; Dirk Schnappinger; Christopher M Sassetti; Thomas R Ioerger
Journal:  BMC Bioinformatics       Date:  2019-11-21       Impact factor: 3.169

6.  Genome-wide assessment of antimicrobial tolerance in Yersinia pseudotuberculosis under ciprofloxacin stress.

Authors:  Samuel Willcocks; Kristin K Huse; Richard Stabler; Petra C F Oyston; Andrew Scott; Helen S Atkins; Brendan W Wren
Journal:  Microb Genom       Date:  2019-11
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.