| Literature DB >> 30859257 |
Phuong Nguyen Tran1, Ming-Ren Yen1, Chen-Yu Chiang2, Hsiao-Ching Lin3, Pao-Yang Chen4.
Abstract
Secondary metabolites (SM) produced by fungi and bacteria have long been of exceptional interest owing to their unique biomedical ramifications. The traditional discovery of new natural products that was mainly driven by bioactivity screening has now experienced a fresh new approach in the form of genome mining. Several bioinformatics tools have been continuously developed to detect potential biosynthetic gene clusters (BGCs) that are responsible for the production of SM. Although the principles underlying the computation of these tools have been discussed, the biological background is left underrated and ambiguous. In this review, we emphasize the biological hypotheses in BGC formation driven from the observations across genomes in bacteria and fungi, and provide a comprehensive list of updated algorithms/tools exclusively for BGC detection. Our review points to a direction that the biological hypotheses should be systematically incorporated into the BGC prediction and assist the prioritization of candidate BGC.Entities:
Keywords: Bioinformatics; Biosynthetic gene cluster; Duplicate gene; Horizontal gene transfer; Secondary metabolites; Self-protection
Mesh:
Substances:
Year: 2019 PMID: 30859257 PMCID: PMC6449301 DOI: 10.1007/s00253-019-09708-z
Source DB: PubMed Journal: Appl Microbiol Biotechnol ISSN: 0175-7598 Impact factor: 4.813
Computational programs for secondary metabolite gene mining
| Category | Software | Year/version | Features | User interface | Computation platform | Target organism(s) | Reference(s) |
|---|---|---|---|---|---|---|---|
| BGC prediction | BAGEL | 2006/v1, 2010/v2, 2013/v3 | Identify bacteriocins and RiPPs using HMM search with bacteriocin database | Web | Server | Bacteria | (de Jong et al. |
| ClustScan | 2008 | Identify BGCs using HMM search and predict product structure | GUI | Local PC | Bacteria | (Starcevic et al. | |
| NP.searcher | 2009 | Identify BGCs using BLAST and construct the structure of natural products | Web/command line | Server/local PC | Bacteria | (Li et al. | |
| SMURF | 2010 | Predict secondary metabolite biosynthesis gene clusters based on their genomic context and domain content using HMM search | Web | Server | Fungi | (Khaldi et al. | |
| antiSMASH | 2011/v1, 2013/v2, 2015/v3, 2017/v4 | Identify BGCs using HMMer3 to search experimentally characterized signature proteins | Web/command line | Server/local PC | Bacteria, fungi, plants | (Blin et al. | |
| ClusterFinder | 2014 | Identify BGCs using a hidden Markov model-based probabilistic algorithm | Command line | Local PC | Bacteria | (Cimermancic et al. | |
| PRISM | 2015/PRISM,2016/RiPP-PRISM,2017/PRISM3 | Identify BGCs using BLAST and HMMER and structure prediction using HMM | Web | Server | Bacteria | (Skinnider et al. | |
| EvoMining | 2016 | Identify BGCs using phylogenomic analysis | Command line | Local PC | Actinobacteria | (Cruz-Morales et al. | |
| RODEO | 2017 | Identify BGC and RiPP precursor peptide using HMM and machine learning | Web | Server | Bacteria | (Tietz et al. | |
| ARTS | 2017 | Uses three additional selection criteria, including BGC proximity, gene duplication and horizontal gene transfer, to prioritize antiSMASH-detected BGCs | Web | Server | Bacteria | (Alanjary et al. | |
| Biosynthetic gene analysis | SBSPKS | 2010 | Analyze the 3D structure of PKS protein using BLAST and SCWRL; predict the order of substrate channeling between multiple ORFs in a modular PKS cluster based on docking domain interaction | Web | Server | Bacteria, fungi, plants | (Anand et al. |
| NaPDoS | 2012 | Predict natural products of secondary metabolite genes using BLAST and domain phylogeny | Web | Server | Bacteria | (Ziemert et al. |
Fig. 1Overview of biological aspects underlying biosynthetic gene cluster (BGC) target-directed detection. Three hypotheses, numbered a–c, are presented here. a The resistance hypothesis comprises three notable models: target-based strategies, drug efflux, and enzyme deactivation. In the target-based strategies, the resistance gene is involved in target modification, in which the encoded protein can modify the SM-targeting protein, which is a drug receptor in drug-targeting strains or a nascent target in SM-producing strains. The resistance gene involved drug efflux encodes a transporter for pumping out the SM. For enzyme deactivation, the resistance gene encoding the enzyme modifies the SM and then deactivates it. b The duplication hypothesis holds that the SM producer harbors a protein isoform (duplicate protein) of an essential protein. Therefore, it protects the essential protein that the toxic SM targets by providing excess targets or proteins with greater binding affinity. c The horizontal gene transfer hypothesis of core genes is a potential way for microorganism to gain genetic advantage for self-protection. Bioinformatics analysis is applied to scan for BGCs that contain genes matching the three hypotheses. The output BGC candidates will be validated with experiments such as refactoring BGCs, identification of the corresponding SM product, and evaluation of biological activity