Literature DB >> 23142964

Density parameter estimation for finding clusters of homologous proteins--tracing actinobacterial pathogenicity lifestyles.

Richard Röttger1, Prabhav Kalaghatgi, Peng Sun, Siomar de Castro Soares, Vasco Azevedo, Tobias Wittkop, Jan Baumbach.   

Abstract

MOTIVATION: Homology detection is a long-standing challenge in computational biology. To tackle this problem, typically all-versus-all BLAST results are coupled with data partitioning approaches resulting in clusters of putative homologous proteins. One of the main problems, however, has been widely neglected: all clustering tools need a density parameter that adjusts the number and size of the clusters. This parameter is crucial but hard to estimate without gold standard data at hand. Developing a gold standard, however, is a difficult and time consuming task. Having a reliable method for detecting clusters of homologous proteins between a huge set of species would open opportunities for better understanding the genetic repertoire of bacteria with different lifestyles.
RESULTS: Our main contribution is a method for identifying a suitable and robust density parameter for protein homology detection without a given gold standard. Therefore, we study the core genome of 89 actinobacteria. This allows us to incorporate background knowledge, i.e. the assumption that a set of evolutionarily closely related species should share a comparably high number of evolutionarily conserved proteins (emerging from phylum-specific housekeeping genes). We apply our strategy to find genes/proteins that are specific for certain actinobacterial lifestyles, i.e. different types of pathogenicity. The whole study was performed with transitivity clustering, as it only requires a single intuitive density parameter and has been shown to be well applicable for the task of protein sequence clustering. Note, however, that the presented strategy generally does not depend on our clustering method but can easily be adapted to other clustering approaches. AVAILABILITY: All results are publicly available at http://transclust.mmci.uni-saarland.de/actino_core/ or as Supplementary Material of this article. CONTACT: roettger@mpi-inf.mpg.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2012        PMID: 23142964     DOI: 10.1093/bioinformatics/bts653

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Comparing the performance of biomedical clustering methods.

Authors:  Christian Wiwie; Jan Baumbach; Richard Röttger
Journal:  Nat Methods       Date:  2015-09-21       Impact factor: 28.547

2.  Guiding biomedical clustering with ClustEval.

Authors:  Christian Wiwie; Jan Baumbach; Richard Röttger
Journal:  Nat Protoc       Date:  2018-05-24       Impact factor: 13.491

3.  CMRegNet-An interspecies reference database for corynebacterial and mycobacterial regulatory networks.

Authors:  Vinicius A C Abreu; Sintia Almeida; Sandeep Tiwari; Syed Shah Hassan; Diego Mariano; Artur Silva; Jan Baumbach; Vasco Azevedo; Richard Röttger
Journal:  BMC Genomics       Date:  2015-06-11       Impact factor: 3.969

4.  PaPrBaG: A machine learning approach for the detection of novel pathogens from NGS data.

Authors:  Carlus Deneke; Robert Rentzsch; Bernhard Y Renard
Journal:  Sci Rep       Date:  2017-01-04       Impact factor: 4.379

5.  NRfamPred: a proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families.

Authors:  Ravindra Kumar; Bandana Kumari; Abhishikha Srivastava; Manish Kumar
Journal:  Sci Rep       Date:  2014-10-29       Impact factor: 4.379

6.  Transcriptome profile of Corynebacterium pseudotuberculosis in response to iron limitation.

Authors:  Izabela Coimbra Ibraim; Mariana Teixeira Dornelles Parise; Doglas Parise; Michelle Zibetti Tadra Sfeir; Thiago Luiz de Paula Castro; Alice Rebecca Wattam; Preetam Ghosh; Debmalya Barh; Emannuel Maltempi Souza; Aristóteles Góes-Neto; Anne Cybelle Pinto Gomide; Vasco Azevedo
Journal:  BMC Genomics       Date:  2019-08-20       Impact factor: 3.969

7.  Comparative analysis of essential genes in prokaryotic genomic islands.

Authors:  Xi Zhang; Chong Peng; Ge Zhang; Feng Gao
Journal:  Sci Rep       Date:  2015-07-30       Impact factor: 4.379

8.  The Druggable Pocketome of Corynebacterium diphtheriae: A New Approach for in silico Putative Druggable Targets.

Authors:  Syed S Hassan; Syed B Jamal; Leandro G Radusky; Sandeep Tiwari; Asad Ullah; Javed Ali; Paulo V S D de Carvalho; Rida Shams; Sabir Khan; Henrique C P Figueiredo; Debmalya Barh; Preetam Ghosh; Artur Silva; Jan Baumbach; Richard Röttger; Adrián G Turjanski; Vasco A C Azevedo
Journal:  Front Genet       Date:  2018-02-13       Impact factor: 4.599

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.