| Literature DB >> 35610035 |
Ivan S Ilnitskiy1,2, Anastasia A Zharikova1,2, Andrey A Mironov1,2.
Abstract
Extensive amounts of data from next-generation sequencing and omics studies have led to the accumulation of information that provides insight into the evolutionary landscape of related proteins. Here, we present OrthoQuantum, a web server that allows for time-efficient analysis and visualization of phylogenetic profiles of any set of eukaryotic proteins. It is a simple-to-use tool capable of searching large input sets of proteins. Using data from open source databases of orthologous sequences in a wide range of taxonomic groups, it enables users to assess coupled evolutionary patterns and helps define lineage-specific innovations. The web interface allows to perform queries with gene names and UniProt identifiers in different phylogenetic clades and supplement presence with an additional BLAST search. The conservation patterns of proteins are coded as binary vectors, i.e., strings that encode the presence or absence of orthologous proteins in other genomes. These strings are used to calculate top-scoring correlation pairs needed for finding co-inherited proteins which are simultaneously present or simultaneously absent in specific lineages. Profiles are visualized in combination with phylogenetic trees in a JavaScript-based interface. The OrthoQuantum v1.0 web server is freely available at http://orthoq.bioinf.fbb.msu.ru along with documentation and tutorial.Entities:
Year: 2022 PMID: 35610035 PMCID: PMC9252792 DOI: 10.1093/nar/gkac385
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.(A) Input area of the OrthoQuantum web server. Users can select target species by using the series of presented dropdown menus. (B) Matrix of Pearson correlation coefficients between orthogroups. (C) Phylogenetic profile of proteins associated with chromatin supplemented by BLAST search (E-value = 1e–10). The rows show 150 species of eukaryotes evenly covering the evolutionary tree. The columns are orthologous groups with 510 groups in total. Hierarchical clustering is applied only to the columns. Cells where the protein was found in OrthoDB are shown in blue, and BLAST findings are shown in purple.
Figure 2.(A) Phylogenetic profile plot of nine epigenetic proteins in 165 eukaryotes. (B) PANTHERDB subfamily annotation for PIWI/Argonaute protein family.
Figure 3.Phylogenetic profile of the PRC2 complex members (Suz12, Eed, Jarid2, Ezh2, Phf1 and Aebp2) and PRC1 (Phc2, Pcgf1, Bmi1, Ring1 and Scmh1) in plants. The presence and absence of orthologs in the database are indicated in blue and black, respectively, and the cells in which the findings were made using BLAST (E-value = 1e–6) are marked in purple.