| Literature DB >> 22472218 |
Khader Shameer1, Ramanathan Sowdhamini.
Abstract
BACKGROUND: 3D domain swapping is a novel structural phenomenon observed in diverse set of protein structures in oligomeric conformations. A distinct structural feature, where structural segments in a protein dimer or higher oligomer were shared between two or more chains of a protein structure, characterizes 3D domain swapping. 3D domain swapping was observed as a key mediator of numerous functional mechanisms and play pathogenic role in various diseases including conformational diseases like amyloidosis, Alzheimer's disease, Parkinson's disease and prion diseases. We report the first study with a focus on identifying functional classes, pathways and diseases mediated by 3D domain swapping in the human proteome.Entities:
Year: 2012 PMID: 22472218 PMCID: PMC3508620 DOI: 10.1186/2043-9113-2-8
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Figure 1Schematic representation of 3D domain swapping.
Figure 2Taxonomic (a) and species (b) level distribution of proteins in swapped conformation from 3DSwap knowledgebase.
Enriched Pfam domains associated with proteins involved in 3D domain swapping
| Pfam identifier | Pfam Description |
|
|---|---|---|
| PF07714 | Protein tyrosine kinase | 3.0E-6 |
| PF00031 | Cystatin domain | 1.1E-5 |
| PF01463 | Leucine rich repeat C-terminal domain | 1.9E-3 |
| PF00625 | Guanylate kinase | 3.3E-4 |
| PF07679 | Immunoglobulin I-set domain | 6.6E-3 |
Figure 3Bioinformatics pipeline employed to derive functional, pathway and diseases associations of proteins involved in 3D domain swapping.
Statistically significant Biological Process terms from GO term enrichment analysis
| GO ID | GO term |
|
|---|---|---|
| GO:0048518 | Positive regulation of biological process | 0.002 |
| GO:0016032 | Viral reproduction | 0.002 |
| GO:0048519 | Negative regulation of biological process | 0.005 |
| GO:0009987 | Cellular process | 0.006 |
| GO:0040007 | Growth | 0.008 |
| GO:0018126 | Protein amino acid hydroxylation | 0.008 |
| GO:0032501 | Multicellular organismal process | 0.009 |
| GO:0035110 | Leg morphogenesis | 0.01 |
| GO:0007154 | Cell communication | 0.01 |
| GO:0016271 | Tissue death | 0.011 |
| GO:0051704 | Multi-organism process | 0.014 |
| GO:0090046 | Regulation of transcription regulator activity | 0.014 |
| GO:0050896 | Response to stimulus | 0.015 |
| GO:0044403 | Symbiosis, encompassing mutualism through parasitism | 0.015 |
| GO:0001775 | Cell activation | 0.016 |
| GO:0065007 | Biological regulation | 0.017 |
| GO:0023052 | Signaling | 0.019 |
| GO:0032502 | Developmental process | 0.021 |
| GO:0034465 | Response to carbon monoxide | 0.021 |
| GO:0014071 | Response to cycloalkane | 0.023 |
| GO:0006793 | Phosphorus metabolic process | 0.023 |
| GO:0051098 | Regulation of binding | 0.026 |
| GO:0000003 | Reproduction | 0.032 |
| GO:0045342 | MHC class II biosynthetic process | 0.033 |
| GO:0001816 | Cytokine production | 0.037 |
| GO:0008356 | Asymmetric cell division | 0.037 |
| GO:0046417 | Chorismate metabolic process | 0.038 |
| GO:0030431 | Sleep | 0.038 |
| GO:0048610 | Reproductive cellular process | 0.039 |
| GO:0007610 | Behaviour | 0.043 |
Statistically significant Cellular Component terms from GO term enrichment analysis
| GO ID | GO term |
|
|---|---|---|
| GO:0005802 | Trans-Golgi network | 0.002 |
| GO:0071944 | Cell periphery | 0.004 |
| GO:0005737 | Cytoplasm | 0.009 |
| GO:0045121 | Membrane raft | 0.024 |
| GO:0048786 | Presynaptic active zone | 0.05 |
Statistically significant Molecular Function terms from GO term enrichment analysis
| GO ID | GO term |
|
|---|---|---|
| GO:0060089 | Molecular transducer activity | 0.008 |
| GO:0003682 | Chromatin binding | 0.008 |
| GO:0042802 | Identical protein binding | 0.011 |
| GO:0019838 | Growth factor binding | 0.011 |
| GO:0046983 | Protein dimerization activity | 0.011 |
| GO:0004713 | Protein tyrosine kinase activity | 0.013 |
| GO:0019144 | ADP-sugar diphosphatase activity | 0.02 |
| GO:0004883 | Glucocorticoid receptor activity | 0.023 |
| GO:0030545 | Receptor regulator activity | 0.035 |
| GO:0050998 | Nitric-oxide synthase binding | 0.047 |
| GO:0001871 | Pattern binding | 0.048 |
| GO:0070851 | Growth factor receptor binding | 0.049 |
KEGG pathways associated with proteins involved in 3D domain swapping in the dataset.
| KEGG Pathway ID | Pathway Name |
| KEGG BRITE class |
|---|---|---|---|
| hsa05200 | Pathways in cancer | 0.000 | Human Diseases; Cancers |
| hsa04722 | Neurotrophin signaling pathway | 0.000 | Organismal Systems; Nervous System |
| hsa05144 | Malaria | 0.000 | Human Diseases; Infectious Diseases |
| hsa04630 | Jak-STAT signaling pathway | 0.000 | Environmental Information Processing; Signal Transduction |
| hsa05120 | Epithelial cell signaling in Helicobacter pylori infection | 0.000 | Human Diseases; Infectious Diseases |
| hsa05211 | Renal cell carcinoma | 0.000 | Human Diseases; Cancers |
| hsa04510 | Focal adhesion | 0.001 | Cellular Processes; Cell Communication |
| hsa04660 | T cell receptor signaling pathway | 0.001 | Organismal Systems; Immune System |
| hsa05310 | Asthma | 0.002 | Human Diseases; Immune System Diseases |
| hsa04060 | Cytokine-cytokine receptor interaction | 0.002 | Environmental Information Processing; Signaling Molecules and Interaction |
| hsa05020 | Prion diseases | 0.002 | Human Diseases; Neurodegenerative Diseases |
| hsa05330 | Allograft rejection | 0.003 | Human Diseases; Immune System Diseases |
| hsa00620 | Pyruvate metabolism | 0.003 | Metabolism; Carbohydrate Metabolism |
| hsa04672 | Intestinal immune network for IgA production | 0.005 | Organismal Systems; Immune System |
| hsa05320 | Autoimmune thyroid disease | 0.006 | Human Diseases; Immune System Diseases |
| hsa05110 | Vibrio cholerae infection | 0.006 | Human Diseases; Infectious Diseases |
| hsa05221 | Acute myeloid leukemia | 0.006 | Human Diseases; Cancers |
| hsa04144 | Endocytosis | 0.008 | Cellular Processes; Transport and Catabolism |
| hsa05218 | Melanoma | 0.009 | Human Diseases; Cancers |
| hsa05100 | Bacterial invasion of epithelial cells | 0.009 | Human Diseases; Infectious Diseases |
| hsa05220 | Chronic myeloid leukemia | 0.010 | Human Diseases; Cancers |
| hsa04520 | Adherens junction | 0.010 | Cellular Processes; Cell Communication |
| hsa00400 | Phenylalanine, tyrosine and tryptophan biosynthesis | 0.010 | Metabolism; Amino Acid Metabolism |
| hsa04664 | Fc epsilon RI signaling pathway | 0.012 | Organismal Systems; Immune System |
| hsa05222 | Small cell lung cancer | 0.013 | Human Diseases; Cancers |
| hsa04012 | ErbB signaling pathway | 0.014 | Environmental Information Processing; Signal Transduction |
| hsa04210 | Apoptosis | 0.014 | Cellular Processes; Cell Growth and Death |
| hsa04540 | Gap junction | 0.015 | Cellular Processes; Cell Communication |
| hsa04010 | MAPK signaling pathway | 0.018 | Environmental Information Processing; Signal Transduction |
| hsa05146 | Amoebiasis | 0.020 | Human Diseases; Infectious Diseases |
| hsa04360 | Axon guidance | 0.029 | Organismal Systems; Development |
| hsa04530 | Tight junction | 0.031 | Cellular Processes; Cell Communication |
Statistically significant associations are highlighted in bold
Disease ontology terms associated with proteins involved in 3D domain swapping.
| DO Term | Genes |
|
|---|---|---|
| Breast cancer | 0.05 | |
| Lung cancer | 0.057 | |
| Adenovirus infection | 0.072 | |
| Abortion | 0.085 | |
| Autistic disorder | 0.096 | |
| Kidney disease | 0.101 | |
| Kidney failure | 0.128 | |
| Enteritis | 0.142 | |
| Autoimmune disease | 0.148 | |
| Systemic scleroderma | 0.173 | |
| Ulcerative colitis | 0.18 | |
| Multiple sclerosis | 0.184 | |
| Infection | 0.266 | |
| Dermatitis | 0.294 | |
| Cancer | 0.329 | |
| Lupus erythematosus | 0.378 | |
| Melanoma | 0.41 | |
| Alzheimer's disease | 0.713 | |
| Embryoma | 0.99 | |
| Rheumatoid arthritis | 0.99 | |
| Colon cancer | 0.99 | |
| Leukemia | 0.99 | |
| Diabetes mellitus | 0.99 | |
Statistically significant associations are highlighted in bold
Figure 4Gene Ontology enrichment analysis (Cellular Component) using unique human proteins from the dataset. Colored nodes indicate enriched terms associated with proteins involved in 3D domain swapping.
Figure 5Comparison of KEGG BRITE hierarchies in KEGG database and proteins from the human dataset mapped to KEGG BRITE hierarchy. HD = Human Diseases, OS = Organismal Systems; CP = Cellular Processes; EIP = Environmental Information Processing; GIP = Genetic Information Processing and Met = Metabolism.
Figure 6.