| Literature DB >> 35445164 |
Alexis M Hill1, James R Rybarski2, Kuang Hu1,2, Ilya J Finkelstein2,3, Claus O Wilke1.
Abstract
Gene clusters are sets of co-localized, often contiguous genes that together perform specific functions, many of which are relevant to biotechnology. There is a need for software tools that can extract candidate gene clusters from vast amounts of available genomic data. Therefore, we developed Opfi: a modular pipeline for identification of arbitrary gene clusters in assembled genomic or metagenomic sequences. Opfi contains functions for annotation, de-deduplication, and visualization of putative gene clusters. It utilizes a customizable rule-based filtering approach for selection of candidate systems that adhere to user-defined criteria. Opfi is implemented in Python, and is available on the Python Package Index and on Bioconda (Grüning et al., 2018).Entities:
Year: 2021 PMID: 35445164 PMCID: PMC9017871 DOI: 10.21105/joss.03678
Source DB: PubMed Journal: J Open Source Softw ISSN: 2475-9066
Figure 1:One of two type-I CRISPR-Cas systems present in the genome of Rippkaea orientalis PCC 8802. Note that the ORF beginning at position ~2500 has homology with both cas1 and cas4. These alignments have identical bitscores (i.e., the goodness of alignments is quivalent, using this metric), so both annotations appear in the diagram, even though pick_overlapping_features_by_bit_score was applied.