Literature DB >> 33383692

SeMPI 2.0-A Web Server for PKS and NRPS Predictions Combined with Metabolite Screening in Natural Product Databases.

Paul F Zierep1, Adriana T Ceci2, Ilia Dobrusin1, Sinclair C Rockwell-Kollmann1, Stefan Günther1.   

Abstract

Microorganisms produce secondary metabolites with a remarkable range of bioactive properties. The constantly increasing amount of published genomic data provides the opportunity for efficient identification of biosynthetic gene clusters by genome mining. On the other hand, for many natural products with resolved structures, the encoding biosynthetic gene clusters have not been identified yet. Of those secondary metabolites, the scaffolds of nonribosomal peptides and polyketides (type I modular) can be predicted due to their building block-like assembly. SeMPI v2 provides a comprehensive prediction pipeline, which includes the screening of the scaffold in publicly available natural compound databases. The screening algorithm was designed to detect homologous structures even for partial, incomplete clusters. The pipeline allows linking of gene clusters to known natural products and therefore also provides a metric to estimate the novelty of the cluster if a matching scaffold cannot be found. Whereas currently available tools attempt to provide comprehensive information about a wide range of gene clusters, SeMPI v2 aims to focus on precise predictions. Therefore, the cluster detection algorithm, including building block generation and domain substrate prediction, was thoroughly refined and benchmarked, to provide high-quality scaffold predictions. In a benchmark based on 559 gene clusters, SeMPI v2 achieved comparable or better results than antiSMASH v5. Additionally, the SeMPI v2 web server provides features that can help to further investigate a submitted gene cluster, such as the incorporation of a genome browser, and the possibility to modify a predicted scaffold in a workbench before the database screening.

Entities:  

Keywords:  machine learning; natural compounds; nonribosomal peptides; polyketides; secondary metabolites

Year:  2020        PMID: 33383692      PMCID: PMC7823522          DOI: 10.3390/metabo11010013

Source DB:  PubMed          Journal:  Metabolites        ISSN: 2218-1989


  51 in total

1.  NRPSsp: non-ribosomal peptide synthase substrate predictor.

Authors:  Carlos Prieto; Carlos García-Estrada; Diego Lorenzana; Juan Francisco Martín
Journal:  Bioinformatics       Date:  2011-11-29       Impact factor: 6.937

2.  Comprehensive analysis of distinctive polyketide and nonribosomal peptide structural motifs encoded in microbial genomes.

Authors:  Yohsuke Minowa; Michihiro Araki; Minoru Kanehisa
Journal:  J Mol Biol       Date:  2007-03-14       Impact factor: 5.469

3.  D3GB: An Interactive Genome Browser for R, Python, and WordPress.

Authors:  David Barrios; Carlos Prieto
Journal:  J Comput Biol       Date:  2017-02-16       Impact factor: 1.479

4.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

5.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

Review 6.  The role of transcription in heterologous expression of polyketides in bacterial hosts.

Authors:  D Cole Stevens; Taylor P A Hari; Christopher N Boddy
Journal:  Nat Prod Rep       Date:  2013-10-11       Impact factor: 13.423

7.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  ZINC: a free tool to discover chemistry for biology.

Authors:  John J Irwin; Teague Sterling; Michael M Mysinger; Erin S Bolstad; Ryan G Coleman
Journal:  J Chem Inf Model       Date:  2012-06-15       Impact factor: 4.956

9.  Classification of the adenylation and acyl-transferase activity of NRPS and PKS systems using ensembles of substrate specific hidden Markov models.

Authors:  Barzan I Khayatt; Lex Overmars; Roland J Siezen; Christof Francke
Journal:  PLoS One       Date:  2013-04-18       Impact factor: 3.240

10.  The Pfam protein families database in 2019.

Authors:  Sara El-Gebali; Jaina Mistry; Alex Bateman; Sean R Eddy; Aurélien Luciani; Simon C Potter; Matloob Qureshi; Lorna J Richardson; Gustavo A Salazar; Alfredo Smart; Erik L L Sonnhammer; Layla Hirsh; Lisanna Paladin; Damiano Piovesan; Silvio C E Tosatto; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  1 in total

Review 1.  Alkaloids in Contemporary Drug Discovery to Meet Global Disease Needs.

Authors:  Sharna-Kay Daley; Geoffrey A Cordell
Journal:  Molecules       Date:  2021-06-22       Impact factor: 4.411

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.