| Literature DB >> 32282909 |
Vadim M Gumerov1, Igor B Zhulin1.
Abstract
Key steps in a computational study of protein function involve analysis of (i) relationships between homologous proteins, (ii) protein domain architecture and (iii) gene neighborhoods the corresponding proteins are encoded in. Each of these steps requires a separate computational task and sets of tools. Currently in order to relate protein features and gene neighborhoods information to phylogeny, researchers need to prepare all the necessary data and combine them by hand, which is time-consuming and error-prone. Here, we present a new platform, TREND (tree-based exploration of neighborhoods and domains), which can perform all the necessary steps in automated fashion and put the derived information into phylogenomic context, thus making evolutionary based protein function analysis more efficient. A rich set of adjustable components allows a user to run the computational steps specific to his task. TREND is freely available at http://trend.zhulinlab.org.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32282909 PMCID: PMC7319448 DOI: 10.1093/nar/gkaa243
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Domains and Neighborhoods pipelines in TREND. Steps that can be skipped are marked by dashed lines. If identifiers provided instead of protein sequences the sequences will be fetched from NCBI and/or MiST databases. Optionally the redundancy of the provided sequences can be reduced based on the identity level specified by a user.
Figure 2.Phylogenetic tree of Pseudomonas aeruginosa PilJ chemoreceptor homologs combined with domain architecture (identified protein domains and transmembrane regions) generated by TREND. Domain nomenclature is according to the Pfam database.
Figure 3.Phylogenetic tree of a subset of CheA kinase homologs and their corresponding gene neighborhoods generated by TREND. Genes that encode proteins consisting of the same domains are shown in the same color: cheA genes are shown in blue. Genes encoding proteins from corresponding branches of the phylogenetic tree (left panel) are marked with bold borders (in this case, cheA genes).