| Literature DB >> 25497125 |
Ishita Khan, Yuqian Chen, Tiange Dong, Xioawei Hong, Rikiya Takeuchi, Hirotada Mori, Daisuke Kihara.
Abstract
BACKGROUND: Moonlighting proteins perform two or more cellular functions, which are selected based on various contexts including the cell type they are expressed, their oligomerization status, and the binding of different ligands at different sites. To understand overall landscape of their functional diversity, it is important to establish methods that can identify moonlighting proteins in a systematic fashion. Here, we have developed a computational framework to find moonlighting proteins on a genome scale and identified multiple proteomic characteristics of these proteins.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25497125 PMCID: PMC4307903 DOI: 10.1186/s13062-014-0030-9
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Semantic similarity distribution. The distribution of the relevance semantic similarity SS score of GO term pairs, aconitase, yeast (Uniprot ID : P19414). (A) SS distribution of GO pairs within the primary function (function 1), the secondary function (function 2), and pairs from function 1 and 2. (B) Hierarchical clustering of GO terms in the three GO categories using pairwise SS scores.
Figure 2Average of GO term pairs for moonlighting proteins. Average SS of GO pairs within function 1, function 2, and pairs from function 1 and 2 were computed separately. (A) Moonlighting proteins in the MPR1 set. Protein 24 is presenilin in Physcomitrella patens (Uniprot ID: A9S846). This protein have one GO term each in F1 and F2 (F1 term GO:0004190, “aspartic type endopeptidase activity” and F2 term GO:0016021, “intergral to membrane”). The two GO terms are in different ontologies, MF and CC respectively, and thus the scores are zero for F1 and F2 (because there is only one term) as well as F1-F2 (because similarity of GO terms in different categories cannot be considered). (B) the MPR2 set; and (C) the MPR3 set.
Figure 3Average distribution. Box-and-whisker plots for average SS distribution of BP, MF, and CC GO pairs for the moonlighting proteins in the MPR1-3 sets excluding proteins with * in Figure 2. The top and the bottom of a box show the first and third quartiles and the line in the middle of a box is the median. The two ends of whisker show the minimum and the maximum values.
Figure 4Clustering profiles of sets of moonlighting and non-moonlighting proteins. For each protein in a dataset, GO terms were clustered using various threshold values of SS and average number of GO term clusters were plotted. The datasets plotted were experimentally known moonlighting proteins (MPR1, 2, and 3) and identified moonlighting and non-moonlighting proteins in E. coli (Ecoli-MP and Ecoli-nonMP). E. coli moonlighting proteins were also plotted separately for each evidence category, 1 to 3 (Ecoli-PosMP-Cat1-3; see Methods) as well as multi-domain multi-function proteins. (A) BP GO terms were considered. (B) MF GO terms were considered.
Moonlighting proteins identified in
|
|
|
|
|
|
|---|---|---|---|---|
| b0118/P36683/AcnB | Aconitate hydratase | Post-transcriptional regulation; mRNA binding | I | [ |
| b1019/P31545/EfeB | Peroxidase on guaiacol | Iron assimilation from heme; response to DNA damage stimulas | I | [ |
| b1276/P25516/AcnA | Aconitate hydratase | Post-transcriptional regulation; mRNA binding | I | [ |
| b1967/P31658/HchA | Molecular chaperone | Glyoxalase activity | I | [ |
| b3183/P42641/ObgE | GTPase | Role in ribosome biogenesis | I | [ |
| b4151/P0A8Q3/FrdD | Membrane bound respiratory protein (anaerobic condition) | Role in bacterial flagellar switch (aerobic conditions) | I | [ |
| b4152/P0A8Q0/FrdC | Membrane bound respiratory protein (anaerobic condition) | Role in bacterial flagellar switch (aerobic conditions) | I | [ |
| b4153/P0AC47/FrdB | Membrane bound respiratory protein (anaerobic condition) | Role in bacterial flagellar switch (aerobic conditions) | I | [ |
| b4154/P00363/FrdA | Membrane bound respiratory protein (anaerobic condition) | Role in bacterial flagellar switch (aerobic conditions) | I | [ |
| b4179/P21499/Rnr | Helicase | RNase | I | [ |
| b4260/P68767/PepA†b) | Plasmid recombination | Peptide catabolic process; DNA binding/transcriptional control | I | [ |
| b0161/P0C0V0/DegP† | Chaperone | Proteolysis | II | [ |
| b0509/P77161/GlxR | Glyoxylate metabolism | Allantoin assimilation; DNA damage response | II | [ |
| b0957/P0A910/OmpA | Transport | 1. Viral entry 2.DNA damage response | II | [ |
| b1317/P77366/YcjU | Carbohydrate metabolism | 1. Cell-to-cell plasmid transfer 2. Reduce the lethal effects of stress | II | [ |
| b1710/P06610/BtuE | Glutathione peroxidase | Non-essential role in vitamin-B12 transport | II | [ |
| b2415/P0AA04/PtsH | Phosphocarrier protein essential in sugar transport | Positive regulation of glycogen catabolism | II | [ |
| b2552/P24232/Hmp | (aerobic condition) Nitric oxide dioxygenase (NOD) | (anaerobic condition) Amplifier of superoxide stress, NO and FAD reductase | II | [ |
| b2949/P0A8I1/YqgF | Putative Holliday junction resolvase | Transcription anti-termination | II | [ |
| b3414/P63020/NfuA | Fe-S biogenesis | Necessary for the use of extracellular DNA as the sole source of carbon and energy | II | [ |
| b3463/P0A9R7/FtsE | Cell division | Salt transport by ABC-Transporter | II | [ |
| b3706/P25522/MnmE | tRNA modification | Regulating glutamate-dependent acid resistance | II | [ |
| b0135/P31058/YadC | Cell adhesion | Reduce lethal effects of stress | III | [ |
| b0284/P77489/YagR | Putative xanthine dehydrogenase | DNA damage response | III | [ |
| b0543/P23895/EmrE | Multidrug transporter | DNA damage response | III | [ |
| b1018/P0AB24/EfeO | Involved in Iron uptake | Response to lethal antimicrobial and environmental stress | III | [ |
| b2037/P37746/RfbX | Putative O-antigen transporter | DNA damage response | III | [ |
| b2147/P25889/PreA | Pyrimidine base degradation | Required for swarming motility | III | [ |
| b2290/P0A959/AlaA | Involved in biosynthesis of alanine | Response to lethal antimicrobial and environmental stress | III | [ |
| b3191/P64602/MlaB | Phospholipid ABC transporter | Response to lethal antimicrobial and environmental stress | III | [ |
| b3233/P0A9Q9/Asd | Aspartate-semialdehyde dehydrogenase | DNA damage response | III | [ |
| b4177/P0A7D4/PurA | Adenylosuccinate synthetase | DNA damage response | III | [ |
| b4383/P0A6K6/DeoB | Phosphopentomutase | DNA damage response | III | [ |
a)Moonlighting proteins are classified into four categories: I, both primary and the secondary functions have clear experimental evidences that they are independent; II, both primary and the secondary functions have experimental evidences but it is not clear if the functions are independent; III, “weak” moonlighting proteins, evidences for the secondary function is from a large scale assay or a phenotypic experiment of mutants and the relationship between the primary and the secondary function is not known. Proteins are sorted by the b number within each category.
b)Proteins included in either MoonProt or MultiTaskDB are indicated with†. PepA is included in the MoonProt database. DegP is included in both MoonProt and MultiTaskDB.
Multi-domain proteins with multiple functions identified in
|
|
|
|
|
|---|---|---|---|
| b0002/P00561/ThrA | Aspartokinase | Homoserine dehydrogenase | [ |
| b0529/P24186/FolD | Oxidation of methylenetetrahydrofolate | Hydrolysis of methenyltetrahydrofolate | [ |
| b1241/P0A9Q7/AdhE | Alcohol dehydrogenase | Acetaldehyde dehydrogenase; Pyruvate-formate-lyase deactivase | [ |
| b1888/P07363/CheA | Chematoxis sensor kinase | Regulation of protein; dephosphorylation | [ |
| b2255/P77398/ArnA | Oxidative decarboxylation of UDP-glucuronic acid | Formyltransferase | [ |
| b3052/P76658/HldE | D-beta-D-heptose 7-phosphate kinase | D-beta-D-heptose 1-phosphate adenosyltransferase | [ |
| b3368/P0AEA8/CysG | SAM-dependent methylation | NAD-dependent ring dehydrogenation; Ferrorochelation | [ |
| b3650/P0AG24/SpoT | ppGpp synthase | ppGpp hydrolase | [ |
| b3940/P00562/MetL | Aspartokinase | Homoserine dehydrogenase | [ |
| b4390/P27278/NadR† | Transcriptional regulator | Nicotinamide mononucleotide adenylyltransferase; Ribosylnicotinamide kinase | [ |
†This protein is included in MoonProt.
Figure 5Interacting proteins of moonlighting and non-moonlighting proteins. Physically interacting proteins were obtained from the STRING database. (A) Histogram of the number of interacting proteins. Five datasets are shown: known moonlighting proteins in the MPR1-3 sets (MPR-ALL), the identified moonlighting proteins in E. coli (Ecoli-MP), moonlighting proteins detected in E. coli that have clear experimental evidences for the dual functions and classified into the category 1 (Ecoli-MP-Cat1), E. coli proteins whose multi-functionality originates from different domains (Ecoli-MultiDomain) and non-moonlighting proteins in E. coli. Values on the y-axis are the fraction of the proteins among the entire proteins in each dataset. The bin size used was five. (B), average number of clusters of interacting proteins clustered using the funsim score (Eqn. 4). Seven datasets are plotted: MPR1, MPR2, MPR3, Ecoli-MP, Ecoli-MP-Cat1, Ecoli-MultiDomain, and Ecoli-nonMP. (C) Clustering was performed using the funsim score of BP terms only (Eqn. 3).
Figure 6Function similarity analysis. Functional similarity between interacting proteins and the primary and secondary functions of moonlighting proteins. 27 moonlighting proteins in the MPR1-3 sets that have physically interacting proteins in STRING database and their 575 interacting proteins were analyzed. (A) The functional similarity score is computed between GO terms of the primary (F1) or the secondary (F2) functions of a moonlighting protein against the entire GO terms of its interacting protein and the score difference was computed. Interacting proteins were classified by the range of funsim score difference between F1 and F2 GO terms from their interacting moonlighting proteins. (B) The same type of chart as panel A, using the BP-funsim score. (C) For each moonlighting protein, percentages (%) of interacting proteins sharing F1, F2, or both functions of moonlighting proteins are shown. The BP-funsim score was used to determine if proteins share functional similarity. If an interacting protein has a BP-funsim score to both F1 and F2 GO terms of the moonlighting protein, it is classified as both. An interacting protein is considered to share F1, F2, or both functions if the BP-funsim score is larger than the mean SS score of BP GO pairs of F1 or F2 in the moonlighting protein. In the case that a moonlighting protein has 0 SS score, the cutoff was set to 0.4 for an interacting protein to be considered to share F1, F2, or both functions. P47897 does not have any interacting proteins with F1 or F2 function. Its only interacting protein, RSBN1, has a BP-funsim score of 0 with F1 and F2 functions of P47897. P36024 also does not have any interacting proteins sharing F1 or F2 function. Out of its four interacting proteins, YKL088W has the highest funsim-BP score with F1/F2 GO terms of P36024 (score 0.25), which is below the funsim-BP F1/F2 cutoff for P36024 (cutoff 0.4 for both F1 and F2).
Interacting proteins that have both primary and secondary functions of moonlighting proteins in the MPR1-3 set
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| P93834/HXK2 | Glucose metabolism | Glucose signalling | Q42525/HXK1 | 1. Glycolysis | MP, I | [ |
| 2. Sugar mediated signaling programmed cell death | ||||||
| Q99798/ACO2 | TCA cycle enzyme | Iron homeostasis | P21399/ACO1 | 1. Role in TCA cycle | MP, I | [ |
| mRNA binding and role in iron homeostasis | ||||||
| P00924/ENO1 | Galactose catabolism enzyme | Homotypic vacuole fusion | P00925/ENO2 | 1. Glycolysis | MP, I | [ |
| Vacuole fusion | ||||||
| Q08438/Vhs3 | Halotorance determinant | Coenzyme A biosynthesis | P36024/SIS2 | 1. CoA biosynthesis | MP, I | [ |
| Salt tolerance | ||||||
| P28482/ERK2 | MAP kinase | Transcriptional | P01100 | 1. Regulation of transcription | MP, II | [ |
| Repressor | /FOS | 2. Activates phospholipid synthesis in growing cells (regulated by Mos/MAP kinase pathway) | ||||
| Q15796/SMAD2-5 | 1. TGF signaling protein | Multi-domain | [ | |||
| 2. Tumor suppressor, dual role in transcriptional activation | ||||||
| P05771/PRKCB | 1. Serine/threonine-protein kinase, activates transcription. | MP, II | [ | |||
| Inhibition of the insulin gene transcription. | ||||||
| O43293 | 1. Serine/threonine kinase | MP, I | [ | |||
| /DAPK3 | 2. Role in apoptosis, transcription, regulation of cell polarity, contractile processes in non-muscle or smooth muscle cells, and cytokinesis | |||||
| P14921/ETS1 | Transcription factor | nonMP | - | |||
| P19838/NFKB1 | 1. Transcription factor | MP, I | [ | |||
| 2. Cytoplasmic retention of attached NF-kappa-B proteins by p105, generation of p50 by a co-translational processing, transcriptional repressor | ||||||
| O43318 | 1. MAPK | MP, II | [ | |||
| /MAP3K7 | Regulates TF activator proteins | |||||
| Q99ML3/STAT3 | Transcription factor | Electron transport chain | Q5EG47/Prkaa1-2 | 1. Protein kinase that phosphorylates TF | MP, II | [ |
| Regulation of cellular energy | ||||||
| Q62120/Jak2-3 | 1. Tyrosine protein kinase | Multi-domain | [ | |||
| Regulation of cellular signaling and cell cycle control | ||||||
| P05480/Src | 1. Tyrosine protein kinase | Multi-domain | [ | |||
| Cytokine/cellular receptor | ||||||
| O14786/Neuropilin-1 | Vascular endothelial growth factor | Receptor for semaphorin III | Q14563/SEMA3A-G | 1. Development of the olfactory system and in neuronal control of puberty | MP, II | [ |
| 2. Ensures proper endothelial abundance of soluble and alternatively spliced form of VEGF receptor(flt1) | ||||||
| P15692/VEGFA | Vascular endothelial growth factor | nonMP | - | |||
| P17948 | 1. VEGF receptor; plays negative role in angiogenesis in the embryo most likely by trapping VEGF | MP, I | [ | |||
| /FLT1/VEGFR/ | 2. Plays positive role in adulthood in a tyrosine kinase-dependent manner | |||||
| P54279/PMS2 | Mismatch repair enzyme | Hypermutation of antibody variable chains | Q9JK91/Mlh1 | 1. Mismatch repair protein | MP, I | [ |
| Somatic hyper mutation | ||||||
| P19971/PD-ECGF | Thymidine phosphorylase | Platelet-derived endothelial cell growth factor | P04183/TK1-2 | 1. Phosphotransferase activity | nonMP | - |
| Q96B60 | Deoxyribonucleotidase, mitochondrial | nonMP | - | |||
| /NT5E, NT5M | ||||||
| P06744/Neuroleukin | Phosphoglucose Isomerase | Differentiation, maturation mediator | P52789/HK2 | 1. Hexokinase-2 | MP, III | [ |
| HK2 detachment causes apoptosis | ||||||
| P04075/ALDOA-C | 1. Glycolysis and gluconeogenesis | MP, III | [ | |||
| Regulation of cell shape | ||||||
| P30613/PKLR | 1. Pyruvate kinase | MP, III | [ | |||
| Mutation causes hemolytic anemia | ||||||
| P14618/PKM2 | 1. Pyruvate kinase | MP, III | [ | |||
| Programmed cell death | ||||||
| P06795/P-glycoprotein | P-glycoprotein (transporter) | Regulator of cell-swelling ion channel (K+/Cl-) | P41233/Abca1 | Anion transporter | nonMP | - |
| Q91XR9/Phospholipid hydroperoxide glutathione peroxidase | Antioxidant of mature sperm | Structural protein of the mitochondrial capsule | Q60928/Ggt1 | 1. Part of the cell antioxidant defense mechanism | MP, IV | [ |
| 2. Indirectly regulates multiple aspects of skeletal biology | ||||||
| Q9P2J5/Leucine-tRNA ligase | tRNA synthetases | Translocation and activation of mTORC1 to lysosomal membrane | Q9H6Q3/MARS | Methionine-tRNA ligase, cytoplasmic | nonMP | - |
| Q6P0M4/IARS | tRNA aminoacylation for protein translation | nonMP | - | |||
| P11325/Nam2p | Mitochondrial leucyl-tRNA synthetase | bI4 mitochondrial RNA splicing activity | P26637/CDC60 | Leucine-tRNA ligase, cytoplasmic | nonMP | - |
| P19414/ACO1 | TCA cycle enzyme | Mitochondrial DNA stability | P33421/SDH3 | Succinate dehydrogenase involved in mt-electron transport chain | nonMP | - |
This table corresponds to Figure 6C.
a)The name and UniProt ID of the moonlighting proteins in the MPR1-3 set.
b)Primary function and c) secondary function of the moonlighting protein.
d)The name and the UniProt ID of interacting proteins of the moonlighting protein shown in the left column.
e)Multiple functions (if any) of the interacting protein.
f)This column indicates if the interacting protein is a moonlighting protein (MP), not (non-MP), or multi-domain multi-functional protein (Multi-domain). The roman numerals, I to III, indicate the category of moonlighting proteins (see Table 1 caption).
Figure 7Gene expression profile analysis. Average number of clusters of interacting proteins relative to the number of proteins interacting by gene expression. Proteins considered to be interacting are the top 2% of proteins in the Gene Expression network of E. coli sorted in terms of the Pearson correlation coefficient. (A) Histogram of number of interacting proteins. (B) Functional clustering using Funsim (BP, MF, CC) score thresholds between 0.1 and 1.0. (C) Functional clustering using Funsim (BP) score thresholds between 0.1 and 1.0.
Figure 8Phylogenetic profile analysis. Average number of clusters of phylogenetically related proteins relative to the number of phylogenetically related proteins. Phylogenetically related proteins are taken from the STRING database. (A) The histogram of number of phylogenetically related proteins. (B) Functional clustering using Funsim (BP, MF, CC) score with thresholds between 0.1 and 1.0. (C) Functional clustering using Funsim (BP) score with thresholds from 0.1 to 1.0.
Figure 9Genetic interaction network analysis. The number of interacting proteins in the genetic interaction network of E. coli. (A) The number of interacting proteins selected with a Pearson correlation cutoff of 0.2. E. coli MP and non-MP, multi-domain multi-functional proteins, and the first category E. coli MPs are plotted. (B) The number of clusters of interacting proteins for individual E. coli moonlighting (blue) and non-moonlighting (red) proteins at BP-funsim threshold of 0.2. (C) The number of clusters of interacting proteins for individual E. coli moonlighting (blue) and non-moonlighting (red) proteins at BP-funsim threshold of 0.6.
Figure 10Disordered region of moonlighting and non-moonlighting proteins. Histograms of the disordered regions in moonlighting and non-moonlighting proteins. Five datasets are plotted: MPR1-3 (MPR-All), E. coli moonlighting proteins (Ecoli-MP), E. coli moonlighting proteins in the first category (Ecoli-MP-Cat1), multi-domain multi-functional proteins, and E. coli non-moonlighting proteins (Ecoli-nonMP). (A) Length of the disordered regions; (B) Fraction of the length of disordered regions relative to the whole sequence length of the proteins.
Figure 11Moonlighting protein structures. Tertiary structures of moonlighting proteins. (A) human dihydrolipoamide dehydrogenase (PDB ID: 1ZMC-A). It binds NAD shown in yellow at residues 208, 243, 279 (“NAD binding” classified as both F1 and F2 function) and FAD shown in cyan at residues 54, 119, 320 (“FAD binding” classified as F2 term). (B) mitogen activated protein kinase 1 (PDB ID: 4G6N). It binds ATP (related to F1 function) at residues 31–39 and 54 (shown in yellow), and DNA (related to F2 function) with residues 259–277 (purple).
The MPR3 moonlighting protein dataset
|
|
|
|
|
|
|---|---|---|---|---|
| P79149/Pinin | Canis familiaris | Induce junction formation and enhance cell aggregation | Component of the RNP structure | [ |
| P27487/DPP4 | Homo sapiens | Serine protease | 1. Cell surface glycoprotein receptor for CAV1 | [ |
| 2. Co-stimulatory protein involving in T-cell receptor-mediated T-cell activation and proliferation. | ||||
| 3. Binding collagen and fibronectin | ||||
| 4. Involvement in apoptosis | ||||
| Q91XR9/GPx-4 | Mus musculus | Antioxidant of mature sperm | Structural protein of the mitochondrial capsule | [ |
| O35242/FAN | Mus musculus | Apoptosis | Inflammatory signalling | [ |
| E3D2R2/Fructose-1, 6-bisphosphate aldolase | Neisseria meningitidis | Glycolytic enzyme | Host-cell invasion | [ |
| Q7L0Y3/MRP1 | Homo sapiens | tRNA methyltransferase | Dehydrogenase | [ |
| Q9Y7F0/Peroxiredoxin TSA1 | Candida albicans | Antioxidant against sulfur-containing radicals | Involved in morphology | [ |
| P48237/CCM1 | Saccharomyces cerevisiae | Introns removal in mRNA maturation | Maintains the steady-state levels of the mitoribosome small subunit RNA | [ |
| P11325/Nam2p | Saccharomyces cerevisiae | Mitochondrial leucyl-tRNA synthetase | Mitochondrial RNA splicing activity | [ |
| Q9P2J5/LeuRS | Homo sapiens | tRNA synthetase | Translocation and activation of mTORC1 to lysosomal membrane | [ |
| P47897/GlnRS | Homo sapiens | tRNA synthetase | Suppresses apoptotic acitivities | [ |
| Q6DRC0/SerRS | Danio rerio | tRNA synthetase | Regulates development of closed circulatory system | [ |
| P00883/Fructose-bisphosphate aldolase A | Oryctolagus cuniculus | Glycolytic enzyme | Regulation of cell mobility | [ |
| P0A518/Cpn60-1 | Mycobacterium tuberculosis | Prototypic molecular chaperone | Osteoclast-inhibitory action | [ |
| P0A518/Cpn60-2 | Mycobacterium tuberculosis | Prototypic molecular chaperone | Stimulates macrophage pro-inflammatory cytokine synthesis | [ |