Literature DB >> 26833344

A zero-inflated Poisson model for insertion tolerance analysis of genes based on Tn-seq data.

Fangfang Liu1, Chong Wang2, Zuowei Wu3, Qijing Zhang3, Peng Liu1.   

Abstract

MOTIVATION: Transposon insertion sequencing (Tn-seq) is an emerging technology that combines transposon mutagenesis with next-generation sequencing technologies for the identification of genes related to bacterial survival. The resulting data from Tn-seq experiments consist of sequence reads mapped to millions of potential transposon insertion sites and a large portion of insertion sites have zero mapped reads. Novel statistical method for Tn-seq data analysis is needed to infer functions of genes on bacterial growth.
RESULTS: In this article, we propose a zero-inflated Poisson model for analyzing the Tn-seq data that are high-dimensional and with an excess of zeros. Maximum likelihood estimates of model parameters are obtained using an expectation-maximization (EM) algorithm, and pseudogenes are utilized to construct appropriate statistical tests for the transposon insertion tolerance of normal genes of interest. We propose a multiple testing procedure that categorizes genes into each of the three states, hypo-tolerant, tolerant and hyper-tolerant, while controlling the false discovery rate. We evaluate the proposed method with simulation studies and apply the proposed method to a real Tn-seq data from an experiment that studied the bacterial pathogen, Campylobacter jejuniAvailability and implementation: We provide R code for implementing our proposed method at http://github.com/ffliu/TnSeq A user's guide with example data analysis is also available there. CONTACT: pliu@iastate.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26833344     DOI: 10.1093/bioinformatics/btw061

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Modelling RNA-Seq data with a zero-inflated mixture Poisson linear model.

Authors:  Siyun Liu; Yuan Jiang; Tao Yu
Journal:  Genet Epidemiol       Date:  2019-07-22       Impact factor: 2.135

Review 2.  A Comprehensive Overview of Online Resources to Identify and Predict Bacterial Essential Genes.

Authors:  Chong Peng; Yan Lin; Hao Luo; Feng Gao
Journal:  Front Microbiol       Date:  2017-11-27       Impact factor: 5.640

3.  TnseqDiff: identification of conditionally essential genes in transposon sequencing studies.

Authors:  Lili Zhao; Mark T Anderson; Weisheng Wu; Harry L T Mobley; Michael A Bachman
Journal:  BMC Bioinformatics       Date:  2017-07-06       Impact factor: 3.169

Review 4.  No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects.

Authors:  Antoine Danchin; Christos Ouzounis; Taku Tokuyasu; Jean-Daniel Zucker
Journal:  Microb Biotechnol       Date:  2018-05-28       Impact factor: 5.813

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.