Literature DB >> 24855068

On the limits of computational functional genomics for bacterial lifestyle prediction.

Eudes Barbosa, Richard Röttger, Anne-Christin Hauschild, Vasco Azevedo, Jan Baumbach.   

Abstract

We review the level of genomic specificity regarding actinobacterial pathogenicity. As they occupy various niches in diverse habitats, one may assume the existence of lifestyle-specific genomic features. We include 240 actinobacteria classified into four pathogenicity classes: human pathogens (HPs), broad-spectrum pathogens (BPs), opportunistic pathogens (OPs) and non-pathogenic (NP). We hypothesize: (H1) Pathogens (HPs and BPs) possess specific pathogenicity signature genes. (H2) The same holds for OPs. (H3) Broad-spectrum and exclusively HPs cannot be distinguished from each other because of an observation bias, i.e. many HPs might yet be unclassified BPs. (H4) There is no intrinsic genomic characteristic of OPs compared with pathogens, as small mutations are likely to play a more dominant role to survive the immune system. To study these hypotheses, we implemented a bioinformatics pipeline that combines evolutionary sequence analysis with statistical learning methods (Random Forest with feature selection, model tuning and robustness analysis). Essentially, we present orthologous gene sets that computationally distinguish pathogens from NPs (H1). We further show a clear limit in differentiating OPs from both NPs (H2) and pathogens (H4). HPs may also not be distinguished from bacteria annotated as BPs based only on a small set of orthologous genes (H3), as many HPs might as well target a broad range of mammals but have not been annotated accordingly. In conclusion, we illustrate that even in the post-genome era and despite next-generation sequencing technology, our ability to efficiently deduce real-world conclusions, such as pathogenicity classification, remains quite limited.
© The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  actinobacteria; bioinformatics; machine learning; pathogenicity

Mesh:

Year:  2014        PMID: 24855068     DOI: 10.1093/bfgp/elu014

Source DB:  PubMed          Journal:  Brief Funct Genomics        ISSN: 2041-2649            Impact factor:   4.241


  7 in total

1.  Compensation of feature selection biases accompanied with improved predictive performance for binary classification by using a novel ensemble feature selection approach.

Authors:  Ursula Neumann; Mona Riemenschneider; Jan-Peter Sowa; Theodor Baars; Julia Kälsch; Ali Canbay; Dominik Heider
Journal:  BioData Min       Date:  2016-11-18       Impact factor: 2.522

2.  PaPrBaG: A machine learning approach for the detection of novel pathogens from NGS data.

Authors:  Carlus Deneke; Robert Rentzsch; Bernhard Y Renard
Journal:  Sci Rep       Date:  2017-01-04       Impact factor: 4.379

3.  Genomic analysis of bacteria in the Acute Oak Decline pathobiome.

Authors:  James Doonan; Sandra Denman; Justin A Pachebat; James E McDonald
Journal:  Microb Genom       Date:  2019-01

4.  Comparative analysis of essential genes in prokaryotic genomic islands.

Authors:  Xi Zhang; Chong Peng; Ge Zhang; Feng Gao
Journal:  Sci Rep       Date:  2015-07-30       Impact factor: 4.379

5.  Genotypic Prediction of Co-receptor Tropism of HIV-1 Subtypes A and C.

Authors:  Mona Riemenschneider; Kieran Y Cashin; Bettina Budeus; Saleta Sierra; Elham Shirvani-Dastgerdi; Saeed Bayanolhagh; Rolf Kaiser; Paul R Gorry; Dominik Heider
Journal:  Sci Rep       Date:  2016-04-29       Impact factor: 4.379

6.  Exploiting HIV-1 protease and reverse transcriptase cross-resistance information for improved drug resistance prediction by means of multi-label classification.

Authors:  Mona Riemenschneider; Robin Senge; Ursula Neumann; Eyke Hüllermeier; Dominik Heider
Journal:  BioData Min       Date:  2016-02-29       Impact factor: 2.522

7.  EFS: an ensemble feature selection tool implemented as R-package and web-application.

Authors:  Ursula Neumann; Nikita Genze; Dominik Heider
Journal:  BioData Min       Date:  2017-06-27       Impact factor: 2.522

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.