| Literature DB >> 22267904 |
Dimosthenis Tsagrasoulis1, Vasilis Danos, Maria Kissa, Philip Trimpalis, V Lila Koumandou, Amalia D Karagouni, Athanasios Tsakalidis, Sophia Kossida.
Abstract
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality.Entities:
Keywords: BLAST; gene fusion; protein protein interactions
Year: 2011 PMID: 22267904 PMCID: PMC3256994 DOI: 10.4137/EBO.S8018
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Screenshot of the FED database homepage, showing the available search options.
Figure 2Workflow of the SAFE software. Starting with the input FASTA files of the proteomes of the two organisms that the user wants to analyse, userspecified parameters are used in different steps to filter the data, as described in the text. The output files of the program can be downloaded as text files or visualised on the SAFE interface, as shown in Figure 3.
Figure 3The SAFE interface. On the bottom right is the graphical representation of the selected results.
Fusion events generated via SAFE.
| Composite | Components | |
|---|---|---|
| Phospholipid cytidylyltransferase | Phosphate cytidyltransferase | Phosphate cytidyltransferase |
| Exonuclease | Hypothetical protein | Exoribonuclease |
| Valyl-tRNA synthetase | Valyl-tRNA synthetase | Leucyl-tRNA synthetase |
| Elongation factor EF-G | Elongation factor 1-alpha | Elongation factor EF-2 |
| Phosphate guanyltransferase | Nucleotidyl transferase | Phosphoglucomutase |
| Formate dehydrogenase | Formate dehydrogenase | Formate dehydrogenase |
| NADH dehydrogenase | NADH dehydrogenase | NADH dehydrogenase |
| Threonyl-tRNAsynthetase | Alanyl-tRNA synthetase | Prolyl-tRNA synthetase |
| Ankyrin | Ankyrin | Hexapeptide repeat-containing transferase |
| Glycyl-tRNA synthetase | Glycyl-tRNA synthetase | Hypothetical protein |
| Elongation factor | Translation initiation factor | Elongation factor |
| Methionyl-tRNA synthetase | Methionyl-tRNA synthetase | Methionyl-tRNA synthetase |
| FAD-dependent dehydrogenase | FAD-dependent oxidoreductase | Hypothetical protein |
Figure 4Examples of fusion events detected by SAFE. (A) A protein from the species Entamoeba histolytica with suggested cytidyltransferase activity, generated by a Fusion Event. (B) A tRNA synthetase protein from the species Thermofilum pendens, generated by a Fusion Event. (C) A protein from the species Entamoeba histolytica that has methionyl-tRNA synthetase activity, generated by a Fusion Event. (D) A composite mannose transferase which is found fused in Aquifex aeolicus proteome.
Comparison of the search features and the type of data stored in the FED database and in FusionDB.9
| Navigating and searching through the two databases
| |
|---|---|
| FusionDB | FED |
By gene name By COG id By protein sequence By COG pairs | By organism name (A–Z order) By organism name (search) By protein name (search box) By protein name (filtering organism kingdom—eukaryotes, bacteria, archaea) |
Gene ontologies for the novel fusion events included in the FED databases, i.e. fusion events which were not previously reported in the literature.
| Uniprot ID | Biological process | Cellular component | Ligand | Molecular function |
|---|---|---|---|---|
| NP_212986.1 | Protein biosynthesis | Cytoplasm | GTP-binding | Elongation factor |
| NP_213493.1 | Carbohydrate metabolism | – | – | Transferase |
| NP_213709.1 | Cellular respiration | Cytoplasm | 4Fe-4S | Oxidoreductase |
| NP_213899.1 | Transport | Cell inner membrane | NAD | Oxidoreductase |
| NP_214149.1 | Protein biosynthesis | Cytoplasm | ATP-binding | Aminoacyl-tRNA synthetase |
| XP_652860.1 | – | – | – | Transferase |
| XP_656678.1 | Glycyl-tRNA aminoacylation | Cytoplasm | ATP-binding glycine-tRNA | Aminoacyl-tRNA synthetase |
| XP_655775.1 | Protein biosynthesis | – | GTP-binding | Elongation factor |
| XP_652867.1 | Methionyl-tRNA aminoacylation | Cytoplasm | ATP-binding methionine-tRNA | Aminoacyl-tRNA synthetase |
| XP_649611.2 | – | – | – | Oxidoreductase |
| XP_649803.1 | – | – | – | Nucleotidyltransferase |
| XP_649075.1 | – | – | RNA-binding | Exonuclease |
| YP_919737 | Protein biosynthesis | Cytoplasm | ATP-binding | Aminoacyl-tRNA synthetase |
| XP_001268882.1 | Amino-acid biosynthesis | Cytoplasm | ATP-binding | Kinase |
Note: Only one accession number is given per protein family.