Literature DB >> 29136215

MultitaskProtDB-II: an update of a database of multitasking/moonlighting proteins.

Luís Franco-Serrano1, Sergio Hernández1, Alejandra Calvo2, María A Severi2, Gabriela Ferragut2, JosepAntoni Pérez-Pons1, Jaume Piñol1, Òscar Pich1, Ángel Mozo-Villarias1, Isaac Amela1, Enrique Querol1, Juan Cedano2.   

Abstract

Multitasking, or moonlighting, is the capability of some proteins to execute two or more biological functions. MultitaskProtDB-II is a database of multifunctional proteins that has been updated. In the previous version, the information contained was: NCBI and UniProt accession numbers, canonical and additional biological functions, organism, monomeric/oligomeric states, PDB codes and bibliographic references. In the present update, the number of entries has been increased from 288 to 694 moonlighting proteins. MultitaskProtDB-II is continually being curated and updated. The new database also contains the following information: GO descriptors for the canonical and moonlighting functions, three-dimensional structure (for those proteins lacking PDB structure, a model was made using Itasser and Phyre), the involvement of the proteins in human diseases (78% of human moonlighting proteins) and whether the protein is a target of a current drug (48% of human moonlighting proteins). These numbers highlight the importance of these proteins for the analysis and explanation of human diseases and target-directed drug design. Moreover, 25% of the proteins of the database are involved in virulence of pathogenic microorganisms, largely in the mechanism of adhesion to the host. This highlights their importance for the mechanism of microorganism infection and vaccine design. MultitaskProtDB-II is available at http://wallace.uab.es/multitaskII.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29136215      PMCID: PMC5753234          DOI: 10.1093/nar/gkx1066

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Multitasking, or moonlighting, refers to those proteins presenting two or more functions performed by a single polypeptide chain. The term ‘moonlighting’ was coined by Constance Jeffery (1), whereas Joram Piatigorsky proposed gene sharing (2). Multitasking proteins present alternative functions, resulting from differences in cellular localization, cell type, oligomeric state, concentration of cellular ligands, substrates, cofactors, products or post-translational modifications. Although some findings suggest the involvement of a protein in extra functions, for example, finding them in different cellular localizations or in amounts exceeding those required for their canonical function, usually multitasking proteins are experimentally revealed by serendipity. The appearance of a new function can become an advantage for the cell or the organism because it reduces the number of proteins to be synthesized, making its genome more compact and coordinating cell activities better. In any case, these proteins complicate the interpretation of knock-out/knock-in, DNA array, metabolomic, systems biology, drug pharmacokinetic, pharmacodynamic and toxicity assays. In order to facilitate the work of researchers interested in this field, we decided to make our set of multitasking proteins freely available in the form of a web database, MultitaskProtDB (3). Additionally, two other databases are currently available: MoonProt (4) and MoonDB (5).

DATABASE IMPROVEMENTS

Information on multitasking proteins has been collected from the literature through the NCBI PubMed server using keywords like moonlighting/multitasking/multifunctional protein and gene sharing. When necessary, some important protein characteristics have been retrieved from the UniProt Consortium (6). In order to identify which proteins of our database are involved in human diseases, the information present in the Online Mendelian Inheritance in Man (OMIM) (7) and Human Gene Mutation Database (HGMD) (8) databases, have been carefully inspected. Moreover, in order to check which proteins of our database are a drug target, the Therapeutic Target Database (TTD) (9) and the DrugBank database (10) have been scanned. The three-dimensional (3D) structure of those proteins without a previously-solved PDB structure has been modeled by applying the ITasser (11) and Phyre (12) servers. Both methods use template-based tertiary structure modeling. Using the SCOP code associated with the PDB structure with which the sequence of the moonlighting protein aligned, a table of observed main fold frequencies was made (Table 1). In order to study any fold preference in our database of moonlighting proteins, all proteins were aligned with the astral95 database, and a protein subset was made considering only those with <95% of identity (moon95). With this strategy, we wished to prevent the abundance of the same protein of close species and avoid over-represented proteins because of the accumulation of the same protein with multiple moonlighting functions. To test whether the distribution of fold frequencies is similar to what we would see if the moonlighting proteins had the same distribution of folds as that observed in the astral95 database, the frequency distribution of subset moon95 was compared with the distribution present in the proteins of the astral95 database. This was done using a G-test calculated through a specific statistical R package (www.r-project.org). The P-value provided by R was <2.2 × 10–16, which is below the acceptance threshold of the null hypothesis. We could then conclude that the distribution of frequencies in the structural classes of both subgroups of proteins is different.
Table 1.

Most frequent folds between moonlighting proteins in which the 3D structure is available

SCOPe IDFOLDSFREQUENCY
c1TIM beta/alpha-barrel9
c2NAD(P)-binding Rossmann-fold domains9
b1Immunoglobulin-like beta-sandwich6
c47Thioredoxin fold6
c37P-loop containing nucleoside triphosphate hydrolases5
c55Ribonuclease H-like motif4
c57Molybdenum cofactor biosynthesis proteins4
d144Protein kinase-like (PK-like)4
d54Enolase N-terminal domain-like4
i1Ribosome and ribosomal fragments4
a118alpha-alpha superhelix3
c8The ‘swivelling’ beta/beta/alpha domain3
d15beta-Grasp (ubiquitin-like)3
d162LDH C-terminal domain-like3
d58Ferredoxin-like3
a127L-aspartase-like2
a45GST C-terminal domain-like2
b26SMAD/FHA domain2
b29Concanavalin A-like lectins/glucanases2
b35GroES-like2
b42beta-Trefoil2
b697-bladed beta-propeller2
b85beta-clip2
c23Flavodoxin-like2
c26Adenine nucleotide alpha hydrolase-like2
c42Arginase/deacetylase2
c58Aminoacid dehydrogenase-like, N-terminal domain2
c67PLP-dependent transferase-like2
c80SIS domain2
d2Lysozyme-like2
d41alpha/beta-Hammerhead2
d8Urease, gamma-subunit2
g37beta-beta-alpha zinc fingers2
b98Zn aminopeptidase N-terminal domain2

USER INTERFACE

Upon opening the database web page (Figure 1), a large table that contains 694 entries of multitasking proteins is shown. Fifteen columns in the table give different information regarding the main characteristics of each protein. Alphabetically ordering is available by clicking on the title of each column, and this allows to order, for example, by organisms. From left to right, the following information is shown: Column 1 is a clickable button to see the complete record details. Column 2 is a clickable button to select the entry. Column 3 (UniProt) shows the UniProt accession number, which is linked to the corresponding database information. Column 4 (Protein Name) shows the name of the protein. Column 5 (Canonical Function) contains a detailed description of the canonical function of the protein. Columns 6 and 8 (GO and GO Moon) display GO numbers related to the canonical and moonlighting protein functions, respectively. Column 7 (Moonlighting Function) contains a detailed description of the moonlighting functions. Column 9 (Organism) indicates the organism in which the protein acts as a moonlighting protein according to the bibliography. Column 10 (Human Disease) indicates the associated diseases in the case of human moonlighting proteins that are linked to the OMIM database. Column 11 (Drugs) indicates whether the protein is a known target of current drugs and is linked to the DrugBank database information. Column 12 (PDB) points to the PDB structure with which the structure was modeled (sequence identity is shown as a percentage), or to the experimentally solved PDB structure of the protein entry (homology is 100). Column 13 (Models) gives the 3D structure model with highest score according to ITasser or Phyre servers. These models were obtained by using the same sequence of the moonlighting protein and they are highly reliable, as they were modeled starting from high-homology templates. Even so, the reliability and coverage might be low in some cases, particularly if they are based on very remote homologs. Column 14 (Reference) provides a link to the PubMed bibliographic reference. Some facilities like display, print or search buttons are provided by the web page. Moreover, an export process can be easily performed to the whole database or to some selected entries. The type of data file obtained through the export option can be selected depending on the type of data file required by the user (Excel, Word, CSV or XML). The database is accessible at http://wallace.uab.es/multitaskII.
Figure 1.

A screenshot of MultitaskProtDB-II web page. Currently, the database contains information of 694 multitasking proteins that can be easily viewed with the search button and other display facilities.

A screenshot of MultitaskProtDB-II web page. Currently, the database contains information of 694 multitasking proteins that can be easily viewed with the search button and other display facilities.

CONCLUSION

An important issue included in the present version of the database is the relation between multitasking proteins and human diseases. We have seen that 78% of the human moonlighting proteins are involved in diseases. Furthermore, 48% of the human multifunctional proteins are targets of current drugs. According to UniProt, the number of human proteins with a reviewed status is 26 199, and 13.74% of them are related to human diseases (see cross-reference between UniProt and OMIM at www.uniprot.org/help/involvement_in_disease). Thus, a percentage as high as the previously mentioned 78% indicates that moonlighting proteins are prone to be involved in human diseases, probably because of the two or more exhibited molecular functions. This is the case of fumarate hydratase, in which each molecular function is related to a different disease (fumarate deficiency and leiomiomatosis). A more unexpected result is the 48% of moonlighting proteins that are targets of current drugs (9,10), since only 9.8% of the human proteins present in UniProt are specified as drug targets. Moreover, targets of current drugs represent only successful cases, because not all the theoretically druggable human genome (5000–10 000 proteins, according to Drews, (13)) can be targetable. Still, a drug acting on a moonlighting protein can trigger more complicated toxic side-effects. Another interesting issue is that 25% of the database entries correspond to proteins involved in pathogenic microorganisms’ virulence, mostly in the mechanism of host adhesion and colonization through interaction with plasminogen or extracellular matrix components. From the reverse vaccinology point of view, it is a very important fact. Several authors (14,15) have previously reviewed the involvement of moonlighting proteins in pathogen virulence. The percentages described above highlight the interest of moonlighting proteins for gaining insight into the molecular basis of genetic-based diseases, the rational drug-design upon target identification, and the infection mechanism of pathogens and vaccine design. Databases such as MultitaskProtDB (3), MoonProt (4) and MoonDB (5) can be used as a source of data to create models or validate hypotheses about these proteins. Our database contains close to 700 experimentally demonstrated moonlighting proteins, with much information related to each one, and is a valuable resource for this growing class of proteins.
  15 in total

1.  Drug discovery: a historical perspective.

Authors:  J Drews
Journal:  Science       Date:  2000-03-17       Impact factor: 47.728

Review 2.  Moonlighting proteins.

Authors:  C J Jeffery
Journal:  Trends Biochem Sci       Date:  1999-01       Impact factor: 13.807

Review 3.  Gene sharing in lens and cornea: facts and implications.

Authors:  J Piatigorsky
Journal:  Prog Retin Eye Res       Date:  1998-04       Impact factor: 21.198

4.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.

Authors:  Ada Hamosh; Alan F Scott; Joanna S Amberger; Carol A Bocchini; Victor A McKusick
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

5.  MoonProt: a database for proteins that are known to moonlight.

Authors:  Mathew Mani; Chang Chen; Vaishak Amblee; Haipeng Liu; Tanu Mathur; Grant Zwicke; Shadi Zabad; Bansi Patel; Jagravi Thakkar; Constance J Jeffery
Journal:  Nucleic Acids Res       Date:  2014-10-16       Impact factor: 16.971

6.  Extreme multifunctional proteins identified from a human protein interaction network.

Authors:  Charles E Chapple; Benoit Robisson; Lionel Spinelli; Céline Guien; Emmanuelle Becker; Christine Brun
Journal:  Nat Commun       Date:  2015-06-09       Impact factor: 14.919

7.  Physical Features of Intracellular Proteins that Moonlight on the Cell Surface.

Authors:  Vaishak Amblee; Constance J Jeffery
Journal:  PLoS One       Date:  2015-06-25       Impact factor: 3.240

8.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

9.  The Phyre2 web portal for protein modeling, prediction and analysis.

Authors:  Lawrence A Kelley; Stefans Mezulis; Christopher M Yates; Mark N Wass; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2015-05-07       Impact factor: 13.491

10.  Therapeutic target database update 2014: a resource for targeted therapeutics.

Authors:  Chu Qin; Cheng Zhang; Feng Zhu; Feng Xu; Shang Ying Chen; Peng Zhang; Ying Hong Li; Sheng Yong Yang; Yu Quan Wei; Lin Tao; Yu Zong Chen
Journal:  Nucleic Acids Res       Date:  2013-11-21       Impact factor: 16.971

View more
  15 in total

1.  Moonlighting protein prediction using physico-chemical and evolutional properties via machine learning methods.

Authors:  Farshid Shirafkan; Sajjad Gharaghani; Karim Rahimian; Reza Hasan Sajedi; Javad Zahiri
Journal:  BMC Bioinformatics       Date:  2021-05-24       Impact factor: 3.169

Review 2.  Understanding protein multifunctionality: from short linear motifs to cellular functions.

Authors:  Andreas Zanzoni; Diogo M Ribeiro; Christine Brun
Journal:  Cell Mol Life Sci       Date:  2019-08-20       Impact factor: 9.261

3.  Identification of Moonlighting Proteins in Genomes Using Text Mining Techniques.

Authors:  Aashish Jain; Hareesh Gali; Daisuke Kihara
Journal:  Proteomics       Date:  2018-10-10       Impact factor: 3.984

4.  iPCD: A Comprehensive Data Resource of Regulatory Proteins in Programmed Cell Death.

Authors:  Dachao Tang; Cheng Han; Shaofeng Lin; Xiaodan Tan; Weizhi Zhang; Di Peng; Chenwei Wang; Yu Xue
Journal:  Cells       Date:  2022-06-24       Impact factor: 7.666

Review 5.  The emerging role of mass spectrometry-based proteomics in drug discovery.

Authors:  Felix Meissner; Jennifer Geddes-McAlister; Matthias Mann; Marcus Bantscheff
Journal:  Nat Rev Drug Discov       Date:  2022-03-29       Impact factor: 112.288

6.  Multifunctional Proteins: Involvement in Human Diseases and Targets of Current Drugs.

Authors:  Luis Franco-Serrano; Mario Huerta; Sergio Hernández; Juan Cedano; JosepAntoni Perez-Pons; Jaume Piñol; Angel Mozo-Villarias; Isaac Amela; Enrique Querol
Journal:  Protein J       Date:  2018-10       Impact factor: 2.371

7.  PlantMP: a database for moonlighting plant proteins.

Authors:  Bo Su; Zhuang Qian; Tianshu Li; Yuwei Zhou; Aloysius Wong
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

8.  Pathogen Moonlighting Proteins: From Ancestral Key Metabolic Enzymes to Virulence Factors.

Authors:  Luis Franco-Serrano; David Sánchez-Redondo; Araceli Nájar-García; Sergio Hernández; Isaac Amela; Josep Antoni Perez-Pons; Jaume Piñol; Angel Mozo-Villarias; Juan Cedano; Enrique Querol
Journal:  Microorganisms       Date:  2021-06-15

9.  MoonDB 2.0: an updated database of extreme multifunctional and moonlighting proteins.

Authors:  Diogo M Ribeiro; Galadriel Briere; Benoit Bely; Lionel Spinelli; Christine Brun
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Structural and Functional Dynamics of Staphylococcus aureus Biofilms and Biofilm Matrix Proteins on Different Clinical Materials.

Authors:  Anna K Hiltunen; Kirsi Savijoki; Tuula A Nyman; Ilkka Miettinen; Petri Ihalainen; Jouko Peltonen; Adyary Fallarero
Journal:  Microorganisms       Date:  2019-11-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.