| Literature DB >> 35910305 |
Julián H Calvento1, Franco Leonardo Bulgarelli2, Ana Julia Velez Rueda1.
Abstract
There are multiple tools for positive selection analysis, including vaccine design and detection of variants of circulating drug-resistant pathogens in population selection. However, applying these tools to analyze a large number of protein families or as part of a comprehensive phylogenomics pipeline could be challenging. Since many standard bioinformatics tools are only available as executables, integrating them into complex Bioinformatics pipelines may not be possible. We have developed OBI, an open-source tool aimed to facilitate positive selection analysis on a large scale. It can be used as a stand-alone command-line app that can be easily installed and used as a Conda package. Some advantages of using OBI are:•It speeds up the analysis by automating the entire process•It allows multiple starting points and customization for the analysis•It allows the retrieval and linkage of structural and evolutive data for a protein throughWe hope to provide with OBI a solution for reliably speeding up large-scale protein evolutionary and structural analysis.Entities:
Keywords: Proteins evolution; Python library; Structural bioinformatics pipeline
Year: 2022 PMID: 35910305 PMCID: PMC9334345 DOI: 10.1016/j.mex.2022.101786
Source DB: PubMed Journal: MethodsX ISSN: 2215-0161
Fig. 1A. The data preparation pipeline flow includes I) the homologous proteins search using BLAST, the sequences clustering using CD-Hit, and the sequences alignment using Clustal; II) finally produce a nucleotide alignment guided by the amino acid alignment; B. Obi command general information and usage can be accessed by using –help flag; C. Obi provides different running configurations, that allow the users to customize the pipeline running according to their preferences.
Fig. 2A. Alternative Local or remote positive selection analysis flow: OBI provides users with two different strategies for running the positive selection analysis; B. Positive selection analysis can be run separately from the alignments construction and homolog proteins retrieving, by using the first step output files; C. When running Obi in the remote mode, the analysis can be resumed with the resume command.
Fig. 3Report and deliverables example: A. OBI's command-line interface running analysis command; B. JSON report file content example; C. Phylogenetic tree generated by OBI example; D. Protein sequences alignment example.
| Subject Area | Bioinformatics |
|---|---|
| More specific subject area: | 3: Biochemistry, Genetics and Molecular Biology |
| Method name: | OBI |
| Name and reference of original method | HyPhy |
| Resource availability |