MOTIVATION: Next-generation sequencing affords an efficient analysis of transposon insertion libraries, which can be used to identify essential genes in bacteria. To analyse this high-resolution data, we present a formal Bayesian framework for estimating the posterior probability of essentiality for each gene, using the extreme-value distribution to characterize the statistical significance of the longest region lacking insertions within a gene. We describe a sampling procedure based on the Metropolis-Hastings algorithm to calculate posterior probabilities of essentiality while simultaneously integrating over unknown internal parameters. RESULTS: Using a sequence dataset from a transposon library for Mycobacterium tuberculosis, we show that this Bayesian approach predicts essential genes that correspond well with genes shown to be essential in previous studies. Furthermore, we show that by using the extreme-value distribution to characterize genomic regions lacking transposon insertions, this method is capable of identifying essential domains within genes. This approach can be used for analysing transposon libraries in other organisms and augmenting essentiality predictions with statistical confidence scores.
MOTIVATION: Next-generation sequencing affords an efficient analysis of transposon insertion libraries, which can be used to identify essential genes in bacteria. To analyse this high-resolution data, we present a formal Bayesian framework for estimating the posterior probability of essentiality for each gene, using the extreme-value distribution to characterize the statistical significance of the longest region lacking insertions within a gene. We describe a sampling procedure based on the Metropolis-Hastings algorithm to calculate posterior probabilities of essentiality while simultaneously integrating over unknown internal parameters. RESULTS: Using a sequence dataset from a transposon library for Mycobacterium tuberculosis, we show that this Bayesian approach predicts essential genes that correspond well with genes shown to be essential in previous studies. Furthermore, we show that by using the extreme-value distribution to characterize genomic regions lacking transposon insertions, this method is capable of identifying essential domains within genes. This approach can be used for analysing transposon libraries in other organisms and augmenting essentiality predictions with statistical confidence scores.
Authors: J D McKinney; K Höner zu Bentrup; E J Muñoz-Elías; A Miczak; B Chen; W T Chan; D Swenson; J C Sacchettini; W R Jacobs; D G Russell Journal: Nature Date: 2000-08-17 Impact factor: 49.962
Authors: S Y Gerdes; M D Scholle; J W Campbell; G Balázsi; E Ravasz; M D Daugherty; A L Somera; N C Kyrpides; I Anderson; M S Gelfand; A Bhattacharya; V Kapatral; M D'Souza; M V Baev; Y Grechkin; F Mseeh; M Y Fonstein; R Overbeek; A-L Barabási; Z N Oltvai; A L Osterman Journal: J Bacteriol Date: 2003-10 Impact factor: 3.490
Authors: Gyanu Lamichhane; Matteo Zignol; Natalie J Blades; Deborah E Geiman; Annette Dougherty; Jacques Grosset; Karl W Broman; William R Bishai Journal: Proc Natl Acad Sci U S A Date: 2003-05-29 Impact factor: 11.205
Authors: S T Cole; R Brosch; J Parkhill; T Garnier; C Churcher; D Harris; S V Gordon; K Eiglmeier; S Gas; C E Barry; F Tekaia; K Badcock; D Basham; D Brown; T Chillingworth; R Connor; R Davies; K Devlin; T Feltwell; S Gentles; N Hamlin; S Holroyd; T Hornsby; K Jagels; A Krogh; J McLean; S Moule; L Murphy; K Oliver; J Osborne; M A Quail; M A Rajandream; J Rogers; S Rutter; K Seeger; J Skelton; R Squares; S Squares; J E Sulston; K Taylor; S Whitehead; B G Barrell Journal: Nature Date: 1998-06-11 Impact factor: 49.962
Authors: Joanna Lipowska; Charles Dylan Miks; Keehwan Kwon; Ludmilla Shuvalova; Heping Zheng; Krzysztof Lewiński; David R Cooper; Ivan G Shabalin; Wladek Minor Journal: Int J Biol Macromol Date: 2019-06-15 Impact factor: 6.953
Authors: Teresa A Hudock; Taylor W Foreman; Nirmalya Bandyopadhyay; Uma S Gautam; Ashley V Veatch; Denae N LoBato; Kaylee M Gentry; Nadia A Golden; Amy Cavigli; Michelle Mueller; Shen-An Hwang; Robert L Hunter; Xavier Alvarez; Andrew A Lackner; Joel S Bader; Smriti Mehra; Deepak Kaushal Journal: Am J Respir Cell Mol Biol Date: 2017-05 Impact factor: 6.914
Authors: Melisa Lázaro; Roberto Melero; Charlotte Huet; Jorge P López-Alonso; Sandra Delgado; Alexandra Dodu; Eduardo M Bruch; Luciano A Abriata; Pedro M Alzari; Mikel Valle; María-Natalia Lisa Journal: Commun Biol Date: 2021-06-03
Authors: Allison N Dammann; Anna B Chamby; Andrew J Catomeris; Kyle M Davidson; Hervé Tettelin; Jan-Peter van Pijkeren; Kathyayini P Gopalakrishna; Mary F Keith; Jordan L Elder; Adam J Ratner; Thomas A Hooven Journal: PLoS Pathog Date: 2021-03-08 Impact factor: 6.823
Authors: Michael C Chao; Justin R Pritchard; Yanjia J Zhang; Eric J Rubin; Jonathan Livny; Brigid M Davis; Matthew K Waldor Journal: Nucleic Acids Res Date: 2013-07-30 Impact factor: 16.971