Literature DB >> 29718935

Automatic identification of optimal marker genes for phenotypic and taxonomic groups of microorganisms.

Elad Segev1, Zohar Pasternak2, Tom Ben Sasson3, Edouard Jurkevitch2, Mira Gonen4.   

Abstract

Finding optimal markers for microorganisms important in the medical, agricultural, environmental or ecological fields is of great importance. Thousands of complete microbial genomes now available allow us, for the first time, to exhaustively identify marker proteins for groups of microbial organisms. In this work, we model the biological task as the well-known mathematical "hitting set" problem, solving it based on both greedy and randomized approximation algorithms. We identify unique markers for 17 phenotypic and taxonomic microbial groups, including proteins related to the nitrite reductase enzyme as markers for the non-anammox nitrifying bacteria group, and two transcription regulation proteins, nusG and yhiF, as markers for the Archaea and Escherichia/Shigella taxonomic groups, respectively. Additionally, we identify marker proteins for three subtypes of pathogenic E. coli, which previously had no known optimal markers. Practically, depending on the completeness of the database this algorithm can be used for identification of marker genes for any microbial group, these marker genes may be prime candidates for the understanding of the genetic basis of the group's phenotype or to help discover novel functions which are uniquely shared among a group of microbes. We show that our method is both theoretically and practically efficient, while establishing an upper bound on its time complexity and approximation ratio; thus, it promises to remain efficient and permit the identification of marker proteins that are specific to phenotypic or taxonomic groups, even as more and more bacterial genomes are being sequenced.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29718935      PMCID: PMC5931505          DOI: 10.1371/journal.pone.0195537

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  28 in total

1.  In vivo effect of NusB and NusG on rRNA transcription antitermination.

Authors:  Martha Torres; Joan-Miquel Balada; Malcolm Zellars; Craig Squires; Catherine L Squires
Journal:  J Bacteriol       Date:  2004-03       Impact factor: 3.490

2.  By their genes ye shall know them: genomic signatures of predatory bacteria.

Authors:  Zohar Pasternak; Shmuel Pietrokovski; Or Rotem; Uri Gophna; Mor N Lurie-Weinberger; Edouard Jurkevitch
Journal:  ISME J       Date:  2012-11-29       Impact factor: 10.302

Review 3.  Pathogen typing in the genomics era: MLST and the future of molecular epidemiology.

Authors:  Marcos Pérez-Losada; Patricia Cabezas; Eduardo Castro-Nallar; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2013-01-26       Impact factor: 3.342

4.  An optimized binary typing panel improves the typing capability for Campylobacter jejuni.

Authors:  Bixing Huang; Dong Zhao; Ning-Xia Fang; Ashleigh Hall; Sofroni Eglezos; Barry Blair
Journal:  Diagn Microbiol Infect Dis       Date:  2013-10-17       Impact factor: 2.803

5.  Obscured phylogeny and possible recombinational dormancy in Escherichia coli.

Authors:  Shana R Leopold; Stanley A Sawyer; Thomas S Whittam; Phillip I Tarr
Journal:  BMC Evol Biol       Date:  2011-06-27       Impact factor: 3.260

6.  MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data.

Authors:  Ikuo Uchiyama; Motohiro Mihara; Hiroyo Nishide; Hirokazu Chiba
Journal:  Nucleic Acids Res       Date:  2014-11-14       Impact factor: 16.971

Review 7.  Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?

Authors:  Roy M Robins-Browne; Kathryn E Holt; Danielle J Ingle; Dianna M Hocking; Ji Yang; Marija Tauschek
Journal:  Front Cell Infect Microbiol       Date:  2016-11-18       Impact factor: 5.293

8.  Signature genes as a phylogenomic tool.

Authors:  Bas E Dutilh; Berend Snel; Thijs J G Ettema; Martijn A Huynen
Journal:  Mol Biol Evol       Date:  2008-05-19       Impact factor: 16.240

9.  Systematic identification of gene families for use as "markers" for phylogenetic and phylogeny-driven ecological studies of bacteria and archaea and their major subgroups.

Authors:  Dongying Wu; Guillaume Jospin; Jonathan A Eisen
Journal:  PLoS One       Date:  2013-10-17       Impact factor: 3.240

10.  Ensembl Genomes 2016: more genomes, more complexity.

Authors:  Paul Julian Kersey; James E Allen; Irina Armean; Sanjay Boddu; Bruce J Bolt; Denise Carvalho-Silva; Mikkel Christensen; Paul Davis; Lee J Falin; Christoph Grabmueller; Jay Humphrey; Arnaud Kerhornou; Julia Khobova; Naveen K Aranganathan; Nicholas Langridge; Ernesto Lowy; Mark D McDowall; Uma Maheswari; Michael Nuhn; Chuang Kee Ong; Bert Overduin; Michael Paulini; Helder Pedro; Emily Perry; Giulietta Spudich; Electra Tapanari; Brandon Walts; Gareth Williams; Marcela Tello-Ruiz; Joshua Stein; Sharon Wei; Doreen Ware; Daniel M Bolser; Kevin L Howe; Eugene Kulesha; Daniel Lawson; Gareth Maslen; Daniel M Staines
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

View more
  1 in total

1.  Identifying protein function and functional links based on large-scale co-occurrence patterns.

Authors:  Zohar Pasternak; Noam Chapnik; Roy Yosef; Naama M Kopelman; Edouard Jurkevitch; Elad Segev
Journal:  PLoS One       Date:  2022-03-03       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.