Literature DB >> 17597849

Protein subunit interfaces: heterodimers versus homodimers.

Cui Zhanhua1, Jacob Gah-Kok Gan, Li Lei, Meena Kishore Sakharkar, Pandjassarame Kangueane.   

Abstract

Protein dimers are either homodimers (complexation of identical monomers) or heterodimers (complexation of non-identical monomers). These dimers are common in catalysis and regulation. However, the molecular principles of protein dimer interactions are difficult to understand mainly due to the geometrical and chemical characteristics of proteins. Nonetheless, the principles of protein dimer interactions are often studied using a dataset of 3D structural complexes determined by X-ray crystallography. A number of physical and chemical properties govern protein dimer interactions. Yet, a handful of such properties are known to dominate protein dimer interfaces. Here, we discuss the differences between homodimer and heterodimer interfaces using a selected set of interface properties.

Entities:  

Year:  2005        PMID: 17597849      PMCID: PMC1891636          DOI: 10.6026/97320630001028

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Protein subunit interaction (either homodimer or heterodimer) is an important phenomenon in regulation and catalysis. Thousands of such interactions are theoretically possible in a combinatorial manner. The task of documenting each of these interactions is laborious. Therefore, prediction of subunit interaction sites either from folded structures or from primary sequences is required. However, this objective is currently ambitious due to the limited knowledge on the principles of protein subunit interactions using structural data. Therefore, it is our interest to study the nature of subunit interactions. Several studies report on these interactions. Jones & Thornton (used 59 protein complexes) [1], Xu & colleagues (used 319 protein-protein interfaces) [2], Tsai & colleagues (used 362 protein-protein interfaces) [3], Lo Conte & colleagues (used 75 hetero-complexes) [ 4 ], Chakrabart & Janin (used 70 hetero-complexes) [5], Brinda & colleagues (used 20 homodimers) [6], Bahadur and colleagues (used 122 homodimers) [7], Nooren and Thornton (used 39 protein dimers) [8], Caffrey and colleagues (used 64 protein-protein interfaces) [9] and Zhanhua and colleagues (used 65 heterodimers) [10], utilized a dataset of protein complexes determined by X-ray crystallography to examine the properties of subunit interaction. Protein subunit interfaces in these studies have been characterized using geometrical properties (interface size, planarity, sphericity and complementarity) and chemical properties (the types of amino acid chemical groups, hydrophobicity, electrostatic interactions and H-bonds). These studies are influenced by dataset size and their characteristics. However, the analyses are based on limited datasets consisting of heterogeneous (disproportionate mixture of homodimers and heterodimers) data. The analyses report on the role of inter-subunit H-bonds in protein subunit association. The numbers of H-bonds vary in different studies. [2–4– 7–8–11] On average, Bahadur & colleagues show 9.0 H-bonds per homodimer interface with an r value of 0.75 (Pearson correlation coefficient) between H-bonds and interface area. [7] Jones & Thornton (used 32 homodimers) show 0.88 H-bonds per 100 Å2 interface area with an r value of 0.77 between H-bonds and interface area. [11] Lo Conte et al. show an average of 10.1 H-bonds with one H-bond per 170 Å2 interface area and an r value of 0.84 between H-bonds and interface area. [ 4] Xu & colleagues also show 11 H-bonds per subunit with an r value of 0.89 between H-bonds and interface area. [2] The r value between H-bonds and interface area in these studies varies from 0.75 to 0.89. This variation is influenced primarily by dataset size and nature of data. Previous studies also show that hydrophobic effect plays an important role in protein association [3–7–12 ], yet not as much as in protein folding. [3] There studies showed that protein interfaces are more hydrophobic than surfaces, but less than interior. Hydrophobic effect was measured by the buried non-polar surface area (or percent burial) of residue types. [3] The study showed that the ratio between buried hydrophobic and buried hydrophilic residues is approximately 1.5. [3] Hydrophobic residues (except ALA) and the charged residue ARG are predominantly present at protein-protein interfaces with TYR and TRP having highest propensity. [4 –6–7–12–13] Interface size is yet another important property widely used to describe protein-protein interfaces and it is usually characterized by interface area. The number of interface residues is linearly correlated to interface area (r ≥ 0.96) in several studies. [5 –7] However, the mean number of interface residues varies between these studies. It is shown that the mean is 52 [7], 57 [5], 53.7 [14], 44.4 (for homodimers) and 42.2 (for heterodimers). [9] Thus, the number of interface residues vary within a narrow range of 42 and 57 in these studies. Here, we created two extended datasets of mutually exclusive homodimers and heterodimers. We believe that these exclusive datasets can reduce data bias to differentiate heterodimer and homodimer interfaces.

Methodology

Creation of heterodimer and homodimer dataset

A total of 2488 heterodimer candidates and 1324 homodimer candidates were downloaded from PDB (Protein Databank) and PQS (Protein Quaternary Structure Server). We then created a non-redundant dataset of 156 heterodimers and 170 homodimers (Table 1) such that they satisfy the following conditions. These include: (1) each chain ≥ 50 residues; (2) structures determined by x-ray crystallography; (3) resolution ≤ 2.5 Å ; (4) the structure with the highest resolution was selected where more than one structure was available; (5) redundant entries were removed at a sequence similarity cut-off of ≥ 30%. [15]
Table 1

Dataset Creation

Hetero-dimers
PDB codeResolution n (Å)Chain oneName of chain oneLengthChain twoName of chain twoLength
1YCS2.2B53BP2193AP53191
1ABR2.1BAbrin-A267ACarbohydrate251
1KU62.5AAcetylcholinesterase535BFasciculin 261
1LFD2.1BActive ras protein167ARas-interacting domain of ralgds87
1JIW1.7PAlkaline metalloproteinase470IProteinase inhibitor105
1BPL2.2BAlpha-amylase290AAlpha-amylase179
1KXV1.6AAlpha-amylase496CCamelid VHH domain cab10119
1TMQ2.5AAlpha-amylase470BRagi bifunctional inhibitor117
1BVN2.5PAlpha-amylase496TTendamistat71
1ACB2.0EAlpha-chymotrypsin241IEglin C63
1CHO1.8EAlpha-chymotrypsin238ITurkey ovomucoid third domain53
1CGI2.3EAlpha-chymotrypsinogen245ITrypsin inhibitor56
1SLU1.8BAnionic trypsin216AEcotin131
1RE02.4BARF guanine-nucleotide exchange factor 1195AADP-ribosylation factor 1162
1KSH1.8AARF-like protein 2164BCyclic phosphodiesterase delta-subunit141
1MG92.3BATP dependent CLP protease143AProtein YLJA84
1BRL2.4ABacterial luciferase340BBacterial luciferase319
1AVA1.9ABarley alpha-amylase 2403CBarley alpha-amylase/subtilisin inhibitor181
1B272.1ABarnase110DBarstar90
1LUJ2.5ABeta-catenin501BBeta-catenin-interacting protein ICAT71
1S0W2.3ABeta-lactamase tem263CBeta-lactamase inhibitory protein165
1BND2.3ABrain derived neurotrophic factor109BNeurotrophin 3108
1D4X1.8AC. Elegans actin 1/3368GGelsolin124
1G4Y1.6RCalmodulin147BCalcium-activated potassium channel RSK281
1DTD1.7ACarboxypeptidase A2303BMetallocarboxypeptidase inhibitor61
1NW92.4BCatalytic domain of caspase-9238AInhibitor of apoptosis protein 391
1OKK2.1DCell division protein265ASignal recognition particle protein290
1H1S2.0ACell division protein kinase 2296BCyclin A2258
1OHZ2.2ACellulosomal scaffolding protein A140BEndo-1 4-beta-xylanase Y56
1HL62.5ACG8781 protein119BMago nashi protein137
1P5V1.7AChaperone protein CAF1M191BF1 capsule antigen136
1PDK2.4AChaperone protein PAPD296BProtein PAPK258
1N0L2.3AChaperone protein PAPD212BMature fimbrial protein PAPE116
1FFG2.1BChemotaxis protein chea68AChemotaxis protein chey128
1EAY2AChey128CChea67
1P2M1.8AChymotrypsinogen A238BPancreatic trypsin inhibitor58
1HCG2.2ACoagulation factor236BCoagulation factor51
1V742.0AColicin D107BColicin D immunity protein87
1E442.4BColicin E396AImmunity protein84
1FR21.6BColicin E9131AColicin E9 immunity protein83
1F5Q2.5ACyclin dependent kinase 2296BGamma herpesvirus cyclin247
1FIN2.3ACyclin-dependent kinase298BCyclin A260
1BLX1.9ACyclin-dependent kinase 6305BP19ink4D160
1M9E1.7ACyclophilin A164DHIV-1 capsid135
1S6V1.9ACytochrome C peroxidase294BCytochrome C108
1R8S1.5ECytohesin 2187AADP-ribosylation factor 1160
1UJZ2.1BDesigned colicin E7 dnase127ADesigned colicin E7 immunity protein87
1NLV1.8ADictyostelium discoideum actin364GGelsolin123
1H311.5ADiheme cytochrome C260BCytochrome C138
1EM82.1ADNA polymerase III CHI subunit147BDNA polymerase III PSI subunit110
1JQL2.5ADNA polymerase III beta chain366BDNA polymerase III delta subunit140
1EAI2.4AElastase240CChymotrypsin isoinhibitor 161
1EFV2.1AElectron transfer flavoprotein alpha chain312BElectron transfer flavoprotein beta chain252
1F601.7AElongation factor EEF1A440BElongation factor EEF1BA90
1TA31.7BEndo-1 4-beta-xylanase301AXylanase inhibitor protein I274
1TE12.5BEndo-1 4-xylanase190AXylanase inhibitor protein I274
3FAP1.9AFK506-binding protein107BFKBP12-rapamycin associated protein94
1FCD2.5AFlavocytochrome C sulfide dehydrogenase401CFlavocytochrome c sulfide dehydrogenase174
1NF32.1AG25k GTP-binding protein194CPAR-6B123
1NQI2BGalactosyltransferase272AAlpha lactalbumin123
1WQ12.5GGapette320RHarvey-RAS166
1OR02.0BGlutaryl acylase beat subunit510AGlutaryl acylase alpha subunit152
1AXI2.1BGrowth hormone receptor191AGrowth hormone175
2NGR1.9BGtpase activating protein196AGTP binding protein191
1TX41.7AGtpase-activating protein rhogap196BTransforming protein RHOA174
1AY71.7AGuanyl-specific ribonuclease SA96BBarstar89
1HX11.9AHeat shock cognate 71 KDA377BBag-family molecular chaperone regulator-1112
1USU2.2AHeat shock protein HSP82246BAHA1132
2HBE2.0BHemoglobin146AHemoglobin141
1GPW2.4AHisf protein253BAmidotransferase HISF200
1CXZ2.2AHis-tagged transforming protein RHOA182BPKN86
1US72.3BHSP90 chaperone protein kinase194AHeat shock protein HSP82207
1KXP2.1DHuman vitamin D-binding protein438AActin alpha skeletal muscle349
1H2A1.8LHydrogenase534SHydrogenase267
1KA92.3FImidazole glycerol phosphtate synthase251HImidazole glycerol phosphtate synthase195
1IBR2.3BImportin beta-1 subunit458AGTP-binding nuclear protein ran169
1PVH2.5AInterleukin 6 signal transducer201BLeukemia inhibitory factor169
1IAR2.3BInterleukin-4 receptor alpha chain188AInterleukin129
1I1R2.4AInterleukin-6 receptor beta chain301BViral IL-6167
1O6S1.8AInternalin A461BE-cadherin105
1KI12.3BIntersectin long form342AG25k GTP-binding protein178
2KIN1.9AKinesin238BKinesin100
1PPF1.8ELeukocyte elastase218IOvomucoid inhibitor56
1OP91.9BLysozyme C130AHl6 camel VHH fragment121
1UUZ1.8DLysozyme C129AInhibitor of vertebrate lysozyme130
1OO01.9AMago nashi protein144BDrosophila Y1492
1SVX2.2BMaltose-binding periplasmic protein369AAnkyrin repeat protein OFF7157
1PQZ2.1AMCMV M144238BBeta-2-microglobulin99
1MEE2.0AMesentericopeptidase275IEglin-C64
1JW91.7BMolybdopterin biosynthesis moeb protein240DMolybdopterin converting factor81
1Q402.0BMrna export factor MEX67180AMrna transport regulator MTR2163
1SHW2.2BNeural kinase181AEphrin-A5138
1QAV1.9BNeuronal nitric oxide synthase115AAlpha-1 syntrophin90
1E962.4BNeutrophil cytosol factor 2185ARas-related C3 botulinum toxin substrate 1178
1NPE2.3ANidogen263BLaminin gamma-1 chain164
1GL42.0ANidogen-1273BProteoglycan core protein89
1M4U2.4ANoggin199LOsteogenic protein 1112
1FYH2.0ANterferon-gamma242BInterferon-gamma receptor alpha chain201
1STF2.4EPapain212IStefin B98
1F342.5APepsin A325BMajor pepsin inhibitor PI-3138
1UBK1.2LPeriplasmic hydrogenase large subunit534SPeriplasmic hydrogenase small subunit267
1JLT1.4BPhospholipase A2122APhospholipase A2 inhibitor122
1L4Z2.3APlasminogen248BStreptokinase125
1DHK1.9APorcine pancreatic alpha-amylase495BBean lectin-like inhibitor195
3YGS2.5PProcaspase 997CApoptotic protease activating factor 195
1FT12.3BProtein farnesyltransferase416AProtein farnesyltransferase315
1G4U2.3SProtein tyrosine phosphatase SPTP360RRas-related C3 botulinum toxin substrate 1180
1CT41.6EProteinase185IOvomucoid inhibitor51
1VG02.2ARab escort protein 1481BRas-related protein rab-7182
1F2T1.6ARad50 abc-atpase N-terminal domain145BRad50 abc-atpase C-terminal domain143
1GUA2.0ARap1A167BC-raf176
1HE12.0CRas-related C3 botulinum toxin substrate 1176AExoenzyme S135
1DS62.4ARas-related C3 botulinum toxin substrate 2181BRHO GDP-dissociation inhibitor 2179
1C1Y1.9ARas-related protein167BProto-onkogene serine77
1DFJ2.5ERibonuclease A124IRibonuclease inhibitor456
1DZB2.0ASCFV fragment 1F9224XTurkey egg-white lysozyme C129
1H2S1.9ASensory rhodopsin II225BSensory rhodopsin II transducer60
1P571.8BSerine protease hepsin heavy chain247ASerine protease hepsin light chain110
4SGB2.1ESerine proteinase B185IPotato inhibitor51
1SMP2.3ASerratia metallo proteinase468IErwinia chrysanthemi inhibitor100
1NRJ1.7BSignal recognition particle receptor191ADocking protein147
1RJ91.9ASignal recognition protein277BSignal recognition particle protein282
1JTP1.9ASingle-domain antibody135LLysozyme C129
1SGD1.8EStreptogrisin B185IOvomucoid51
1LW61.5ESubtilisin BPN281IUbtilisin-chymotrypsin inhibitor-2A63
2SIC1.8ESubtilisin BPN275IStreptomyces subtilisin inhibitor107
1SPB2.0SSubtilisin BPN264PSubtilisin BPN prosegment71
1R0R1.1ESubtilisin carlsberg274IOvomucoid51
1CSE1.2ESubtilisin carlsberg274IEglin-C63
1SCJ2.0ASubtilisin E275BSubtilisin E71
2SNI2.1ESubtilisin novo275IChymotrypsin inhibitor 264
1EUC2.1BSuccinyl-coa synthetase beta chain393ASuccinyl-coa synthetase alpha chain306
1ONQ2.2AT-cell surface glycoprotein CD1A274BBeta-2-microglobulin99
1JTD2.3ATem-1 beta-lactamase262BBeta-lactamase inhibitor protein II273
1KTZ2.2BTGF-beta type II receptor106ATransforming growth factor beta 382
2TEC2.0EThermitase279IEglin-C63
1JKG1.9BTip associating protein180ANTF2-related export protein 1139
1D4V2.2BTNF-related apoptosis inducing ligand163ADeath receptor 5117
1AVW1.8ATrypsin223BTrypsin inhibitor171
1BRB2.1ETrypsin223IBPTI51
1F5R1.7ATrypsin II216IPancreatic trypsin inhibitor57
1K9O2.3ETrypsin II anionic223IAlaserpin376
1D6R2.3ATrypsinogen223IBowman-birk proteinase inhibitor58
1OPH2.3BTrypsinogen223AAlpha-1 protease inhibitor375
1P2J1.4ATrypsinogen220IPancreatic trypsin inhibitor56
1S1Q2.0ATumor susceptibility gene 101 protein137BUbiquitin71
1ITB2.5BType 1 interleukin-1 receptor310AInterleukin-1 beta153
1J7D1.9BUbiquitin-conjugating enzyme E2-17 KDA149AMMS2140
1EUV1.3AULP1 protease221BUbitqutin-like protein SMT379
1UGH1.9EUracil-dna glycosylase223IUracil-DNA glycosylase inhibitor82
1UZX1.9AVacuolar protein sorting-associated protein135BUbiquitin75
1JTT2.1AVH single-domain antibody133LLysozyme129
1RKE2.4AVinculin262BVCL protein176
1MA92.4AVitamin D-binding protein442BActin alpha skeletal muscle356
1YVN2.1AYeast actin372GGelsolin125
1OXB2.3AYDP1P166BOsmolarity two-component system protein124
Homodimers
PDB codeResolution(Å)Name of HomodimerScientific sourceChain oneLengthChain twoLength
1CNZ1.83-isopropylmalate dehydrogenaseSalmonella typhimuriumA363B363
1AFW1.83-ketoacetyl-coa thiolaseSaccharomyces cerevisiaeA390B393
1M4I1.5AcetyltransferaseEscherichia coliA181B176
1LQ91.3Actva-orf6 monooxygenaseStreptomyces coelicolorA112B112
1ADE2Adenylosucinate synthetaseEscherichia coliA431B431
1M7H2Adenylylsulfate kinasePenicillium chrysogenumA203B200
1NA82.3ADP-ribosylation binding proteinHomo sapiensA151B145
1OR42.2Aerotactic transducer hematBacillus subtilisA169B158
1BD01.6Alanine racemaseBacillus stearothermophilusA381B380
1A4U1.9Alcohol dehydrogenaseDrosophila lebanonensisA254B254
1ALK2Alkaline phosphataseEscherichia coliA449B449
1LK91.5Alliin lyaseAllium sativumA425B427
1HSS2.1Alpha-amylase inhibitorTriticum aestivumA111B111
1S2Q2.1Amine oxidase BHomo sapiensA499B494
1EKP2.5Amino acid aminotransferaseHomo sapiensA365B365
2GSA2.4AminotransferaseSynechococcus SPA427B427
1DQT2AntigenMus musculusA117B117
1BJW1.8Aspartate aminotransferaseThermus thermophilusA381B381
1JFL1.9Aspartate racemaseEscherichia coliA228B228
1MJH1.7Atp-binding proteinMethanococcus jannaschiiA143B144
1IRI2.4Autocrine motility factorHomo sapiensA557B557
1LR51.9Auxin binding proteinZea maysA160B160
1N802.5Baseplate structural proteinBacteriophage T4A328B328
1EWZ2.4Beta lactamase oxa-10Pseudomonas aeruginosaA243C243
1EBL1.8Beta-ketoacyl-acp Synthase IIIEscherichia coliA309B309
1N1B2Bornyl diphosphate synthaseSalvia officinalisA516B519
1KSO1.7Calcium-binding protein A3Homo sapiensA93B93
1JD01.5Carbonic anhydraseHomo sapiensA260B259
1AUO1.8CarboxylesterasePseudomonas fluorescensA218B218
1CDC2CD2Rattus norvegicusA96B96
1F132.1Cellular coagulation factorHomo sapiensA722B719
1NW12Choline kinaseCaenorhabditis elegansA365B357
1R5P2.2Circadian oscillation regulatorAnabaena SPA90B93
1G642.1Cob(I) alamin adenosyltransferaseSalmonella typhimuriumA169B190
1OTV2.1Coenzyme pqq synthesis protein CKlebsiella pneumoniaeA254B254
1I0R1.5Conserved hypothetical proteinArchaeoglobus fulgidusA161B168
1OAC2Copper amine oxidaseEscherichia coliA719B722
1EAJ1.4Coxsackie virusHomo sapiensA124B120
1CHM1.9CreatinasePseudomonas putidaA401B401
1S441.6Crustacyanin A1 subunitHomarus gammarusA180B180
1GD72CSAA proteinThermus thermophilusA109B109
1L5B2Cyanovirin-NNostoc ellipsosporumA101B101
1SO22.4Cyclic Phosphodiesterase BHomo sapiensA363B363
1P3W2.1Cysteine desulfuraseEscherichia coliA385B385
1COZ2CytidylyltransferaseBacillus subtilisA126B126
1P6O1.1Cytosine deaminaseSaccharomyces cerevisiaeA156B161
2DAB2D-amino acid aminotransferaseThermophilic bacillusA280B282
1F172.3DehydrogenaseHomo sapiensA293B291
2NAC1.8DehydrogenaseMethylotrophic bacterium pseudomonasA374B374
1NFZ2Delta-isomeraseEscherichia coliA176B180
1D1G2.1Dihydrofolate reductaseThermotoga maritimaA164B164
1DOR2Dihydroorotate dehydrogenase ALactococcus lactisA311B311
1AD12.2Dihydropteroate synthetaseStaphylococcus aureusA264B251
1NU62.1Dipeptidyl peptidaseHomo sapiensA728B728
1PE01.7DJ-1Homo sapiensA187B187
1G1A2.5DTDP-D-glucose 4,6-DehydrataseSalmonella entericaA352B352
1BBH1.8Electron transportChromatium vinosumA131B131
1Q8R1.9Endodeoxyribonuclease rusaEscherichia coliA118B109
1RVE2.5EndonucleaseEscherichia coliA244B244
1M9K2Endothelial nitric-oxide synthaseHomo sapiensA400B401
1P431.8Enolase 1Saccharomyces cerevisiaeA436B436
1JR81.5Erv2 protein mitochondrialSaccharomyces cerevisiaeA105B105
1V262.5Fatty-acid-coa synthetaseThermus thermophilusA489B510
1LBQ2.4FerrochelataseSaccharomyces cerevisiaeA356B354
1RYA1.3Gdp-mannose mannosyl hydrolaseEscherichia coliA160B160
1QFH2.2Gelation factorDictyostelium discoideumA212B212
1JV32.2Glcnac1p uridyltransferaseHomo sapiensA490B484
1DPG2Glucose 6-phosphate dehydrogenaseLeuconostoc mesenteroidesA485B485
1QXR1.7Glucose-6-phosphate isomerasePyrococcus furiosusA187B187
1EOG2.1Glutathione S-transferaseEscherichia coliA208B208
1N2A1.9Glutathione S-transferaseEscherichia coliA201B187
1M0W1.8Glutathione synthetaseSaccharomyces cerevisiaeA481B479
1R9C1.8Glutathione transferaseMesorhizobium lotiA125B118
1F4Q1.9GrancalcinHomo sapiensA161B165
1DQP1.8Guanine phosphoribosyltransferaseGiardia lambliaA230B230
3SDH1.4HemoglobinScapharca inaequivalvisA145B145
1IPI2.2Holliday junction resolvasePyrococcus furiosusA114B114
1FWL2.3Homoserine kinaseMethanococcus jannaschiiA296B296
2HHM2.1HydrolaseHomo sapiensA272B272
1PP22.5HydrolaseCrotalus atroxR122L122
1FJH1.7Hydroxysteroid dehydrogenaseComamonas testosteroniA236B236
1G0S1.9Hypothetical ProteinEscherichia coliA201B202
1JOG2.4Hypothetical proteinHaemophilus influenzaeA129B129
1PT52Hypothetical proteinEscherichia coliA415B415
1QYA2Hypothetical ProteinEscherichia coliA293B307
1FUX1.8Hypothetical proteinEscherichia coliA164B163
1J301.7Hypothetical rubrerythrinSulfolobus tokodaiiA141B137
1LHZ2.3Immunoglobulin lambdaHomo sapiensA213B213
1AA72.1Influenza virus matrix mroteinInfluenza virusA158B157
8PRK1.9Inorganic pyrophosphataseSaccharomyces cerevisiaeA282B282
1R8J2KaiaSynechococcus elongatusA272B264
1CQS1.9Ketosteroid isomerasePseudomonas putidaA124B124
1AQ62L-2-haloacid dehalogenaseXanthobacter autotrophicusA245B245
1I2W1.7LactamaseBacillus licheniformisA255B256
1BH52.2Lactoylglutathione lyaseHomo sapiensA177B182
1QMJ2.2LectinGallus gallusA132B132
1K751.8L-histidinol dehydrogenaseEscherichia coliA425B425
1EHI2.4LigaseLeuconostoc mesenteroidesA360B347
1NWW1.2Limonene-1,2-epoxide hydrolaseRhodococcus erythropolisA145B146
1UC82Lysine biosynthesis enzymeThermus thermophilusA240B239
1EN52.3Manganese superoxide dismutaseEscherichia coliA205B205
1A4I1.5MethylenetetrahydrofolateHomo sapiensA285B295
1FC52.2Molybdopterin biosynthesisEscherichia coliA397B396
1JYS1.9Mta/sah nucleosidaseEscherichia coliA226B226
1LNW2.1Multidrug resistance operon repressorPseudomonas aeruginosaA137B135
1FP32N-acyl-d-glucosamineSus scrofaA402B402
1FYD2.3NAD(+) SynthetaseBacillus subtilisA271B246
1HJ31.6Nitrite reductaseParacoccus pantotrophusA544B542
1G1M2.3Nitrogenase iron proteinAzotobacter vinelandiiA287B289
1G8T1.1Nuclease SM2 isoformSeratia marcencsensA241B241
1EYV1.6N-utilizing substance proteinMycobacterium tuberculosisA131B133
1M982.1Orange carotenoid proteinArthrospira maximaA316B314
1ORO2.4Orotate phosphoribosyltransferaseEscherichia coliA213B206
1DVJ1.5Orotidine 5'-phosphate decarboxylaseMethanobacterium thermoautotrophicumA239B211
1GGQ2.5Outer surface protein CBorrelia burgdorferiA162B162
1AOR2.3OxidoreductasePyrococcus furiosisA605B605
1BMD1.9OxidoreductaseThermus flavusA327B327
1HDY2.5OxidoreductaseHomo sapiensA374B374
1N2O2.1Pantothenate synthetaseMycobacterium tuberculosisA279B279
1RN52.2Peptide deformylaseLeptospira interrogansA177B177
1PN22Peroxisomal hydrataseCandida tropicalisA269B267
1PN01.7Phenol 2-monooxygenaseTrichosporon cutaneumA652C656
1BXG2.3Phenylalanine dehydrogenaseRhodococcus SPA349B347
1M6P1.8Phosphate receptorBos TaurusA146B146
1RQL2.4Phosphonoacetaldehyde hydrolaseBacillus cereusA257B257
1O4U2.5PhosphoribosyltransferaseThermotoga maritimaA265B266
1EZ21.9PhosphotriesterasePseudomonas diminutaA328B328
1EXQ1.6Pol polyproteinEscherichia coliA147B145
1MNA1.8Polyketide synthaseStreptomyces venezuelaeA276B278
1C6X2.5ProteaseEscherichia coliA99B99
1FL12.2ProteaseEscherichia coliA192B207
1F892.4Protein YLC351CSaccharomyces cerevisiaeA271B271
1LHP2.1Pyridoxal kinaseOvis ariesA306B309
1CBK2PyrophosphokinaseHaemophilus influenzaeA160B160
1QR22.1Quinone reductase type 2Homo sapiensA230B230
1EN72.4Recombination endonucleaseBacteriophage T4A157B157
1EV72.4Restriction endonuclease naeiNocardia aerocolonigenesA295B293
1H8X2RibonucleaseHomo sapiensA125B125
1I4S2.2Ribonuclease IIIAquifex aeolicusA147B147
1KGN1.9Ribonucleotide reductase proteinCorynebacterium ammoniagenesA296B296
1TLU1.6S-adenosylmethionine decarboxylaseThermotoga maritimaA117B117
1K6Z2Secretion chaperone syceYersinia pestisA120B119
1K3S1.9SigeSalmonella entericaA106B104
1PJQ2.2Siroheme synthaseSalmonella typhimuriumA447B454
1HJR2.5Site-specific recombinaseEscherichia coliA158C158
3LYN1.7Sperm lysineHaliotis fulgensA122B124
2SQC2Squalene-hopene CyclaseAlicyclobacillus acidocaldariusA623B623
1SCF2.2Stem cell factorHomo sapiensA116B118
1OX82.2Stringent starvation protein BEscherichia coliA105B105
1M3E2.5Succinyl-coaSus scrofaA459B460
1R7A1.8Sucrose phosphorylaseBifidobacterium adolescentisA503B503
1SOX1.9Sulfite oxidaseGallus gallusA463B458
1L5X2Survival protein EPyrobaculum aerophilumA270B272
1REG1.9T4 regaBacteriophage T4X122Y120
1MKB2Thiol ester dehydraseEscherichia coliA171B171
1QHI1.9Thymidine kinaseHerpes simplex virusA304B308
1HSJ2.3Transcription/sugar binding proteinEscherichia coliA487B487
1NY52.4Transcriptional regulatorAquifex aeolicusA384B385
1ON21.6Transcriptional regulatorBacillus subtilisA135B135
1SMT2.2Transcriptional repressorSynechococcusA98B101
1TRK2TransferaseSaccharomyces cerevisiaeA678B678
7AAT1.9TransferaseGallus gallusA401B401
1KIY2.4Trichodiene synthaseFusarium sporotrichioidesA354B354
1I8T2.4Udp-galactopyranose mutaseEscherichia coliA367B367
1F6D2.5Udp-n-acetylglucosamineEscherichia coliA366B363
1JP31.8Undecaprenyl pyrophosphate synthaseEscherichia coliA210B207
1JMV1.9Universal stress protein AHaemophilus influenzaeA140B137
1HQO2.3URE2 proteinSaccharomyces cerevisiaeA221B217
9WGA1.8Wheat germ agglutininTriticum vulgarisA170B170
1MI31.8Xylose reductaseCandida tenuisA319B319

Calculation of interface parameters

Interface area

ASA (accessible surface area) was calculated using NACCESS [16] with a probe radius of 1.4 Å and interface area is defined by ΔASA (change in ASA upon complexation from monomer to dimer state) as described elsewhere. [10]

Inter-subunit H-bonds

A hydrogen bond is a polar interaction between two electronegative atoms, where a donor and an acceptor participate. The number of H-bonds formed between subunits was calculated using the program HBPLUS. [17 ]

Hydrophobicity

Interface hydrophobicity was estimated using the equation [ 18], where N is the number of interface residues, and HV is the hydrophobicity scale for each residue. [ 18]

Interface residues propensity

Interface residues show an ΔASA (change in accessibility) of ≥ 5 % upon complexation. Interface residue propensities were calculated using the percentage frequencies of 20 residues using the following functions: PIS(i)=finterface(i)/fsurface(i) PII(i)=finterface(i)/finterior(i) where PIS(i) is residue interface propensity compared to protein surface, PII(i) is residue interface propensity compared to protein interior, finterface(i) is residue frequency at the protein interface, fsurface(i) is residue frequency at the protein surface, finterior(i) is residue frequency at the protein interior.

Results and Discussion

Dimer interactions are characterized by a large combination of physical-chemical parameters. Analysis of dimer structures can provide insight into the principles of protein-protein complexation and help develop models to predict interaction sites. The multi dimensional scaling method applied in a recent study reduced a large pool of interface parameters to a small set of six critical properties for heterodimers. [10] Zhanhua et al., 2005, showed that the six selected parameters were sufficient to describe subunit interfaces instead of the complete parameter space. Here, we use these selected set of properties to discuss the interface differences between 156 heterodimers and 170 homodimers. The properties used in this study are (1) interface residues, (2) interface H-bonds, (3) interface hydrophobicity, (4) interface residue composition.

Interface H-bonds

Intermolecular hydrogen bonds between subunits are important in the association and stability of protein-protein interfaces. [3-4] H-bonds in homodimers (range 0 ­ 51) and heterodimers (range 0 ­ 98) are different. The mean H-bonds are larger for homodimers (mean = 18) than heterodimers (mean = 12). Figure 1 A and B show that there is a high correlation between H-bonds and interface residues. The correlation coefficient is 0.83 in heterodimers and 0.85 in homodimers. This is similar to the previous reports in the range of 0.75 and 0.89. [2–4 –7–8–11] However, there is a subtle difference with the previous studies and the variation is affected by structure resolution, dataset size and data type. The dataset used in this study contains structures with resolution ≤ 2.5 Å and the data is either exclusively homodimer or heterodimer. However, previous datasets contain structures with resolution ≤ 3.0 Å and the data is a mixture of heterodimers, homodimers and other oligomers. At low resolution there are fewer H-bonds and the correlation with interface area decreases.[4] Here, we show that the relation between H-bonds and interface residues is highly correlated for both heterodimers and homodimers. This is useful to evaluate inter-subunit H-bonds prediction and their involvement in interface stability. On average there are 0.24 H-bonds per interface residue in heterodimers and 0.22 H-bonds per interface residue in homodimer. The maximum number of H-bonds per interface residue is 0.65 in heterodimers and 0.44 in homodimers. Although there are more intermolecular H-bonds in homodimers, the density of H-bonds per interface residue is lower in homodimers than in heterodimers. [7]
Figure 1

Difference between heterodimer and homodimer interface properties is shown.

(A) Hydrogen bonds in heterodimer interface; (B) Hydrogen bonds in homodimer interface; (C) Interface area in heterodimer interface; (D) Interface area in homodimer interface; (E) Interface residues in heterodimers; (F) Interface residues in homodimers; (G) Hydrophobic, hydrophilic and charged residue fraction in heterodimers and homodimers; (H) Propensity difference in heterodimers and homodimers (heterodimers ­ homodimers); (I) Ratio of interface to surface & interface to interior propensity in heterodimers; (J) Ratio of interface to surface & interface to interior propensity in homodimers. FBM = Fraction below mean value; FAM = Fraction above mean value.

Interface residues

The number of interface residues is proportional to interface area.[5 –7] Stronger protein subunit associations were generally associated with larger interface areas.[11] In our study, the range of heterodimer interface residues varies from 18 to 162 with a mean value of 51. While, the range of homodimer interface residues extends from 15 to 308 with a mean value of 81. Like H-bonds, interface residues also varied with different studies and are affected by dataset size and data type. [5–7–9– 14] Hence, we created mutually exclusive datasets of homodimers and heterodimers for this analysis to reduce bias due to data type heterogeneity. Thus, we show that the amount of interface residues is significantly different for homodimers and heterodimers. The results also suggest that the previous studies are based on datasets biased with heterodimers. The relation between number of interface residues and monomer length is shown in Figure 1 E and F. They show that interface residues increase with both heterodimer and homodimer monomer length. However, the relation is causal. Figure 1 C and D show a causal relationship between interface area and monomer length for both homodimers and heterodimers. The mean interface residues are larger in homodimers than heterodimers. This is consistent with previous studies. [ 7–9]

Interface residue composition

Several studies show the prevalence of certain types of residues at the dimer interfaces.[4–6–7– 12–13] However, the significance of hydrophobic, hydrophilic, and charged residues at the interface of homodimers and heterodimers is not well documented. Figure 1 G show the fractional distribution of hydrophobic, hydrophilic and charged residues in homodimer and heterodimer interfaces. Hydrophobic residues (M, F, P, A, B, L), except for I and G are dominant in homodimer interfaces. However, hydrophilic residues (W, C, H, Q, N, Y, S), except for T, are dominant in heterodimer interfaces. This observation is interesting and not surprising because homodimers being made of identical monomer subunits tend to associate by hydrophobic interactions. This is in contrast to the observation in heterodimer interfaces being made of non-identical monomer subunits, associating generally by hydrophilic interactions. Figure 1 H, shows the ration of interface/surface and interface/interior residue propensity difference between heterodimers and homodimers. Interestingly, the ratio of interface to interior charged residues (D, E, K, R) is significantly larger in heterodimers compared to homodimers (Figures 1H, 1I, 1J). On the other hand, the ratio of interface to interior hydrophobic residues (A, V, L, M, I, F) are prevalent in homodimers than in heterodimers (Figures 1H, 1I, 1J). Similarly, hydrophilic residues (N, Q, H, Y, S, T) are prevalent in heterodimer interfaces (Figures 1H, 1I, 1J). However, the propensity difference in the ratio of interface to surface hydrophobic/hydrophilic/charged for homodimers and heterodimers is almost zero (Figure 1H).

Conclusion

We performed a comprehensive analysis on the differences between 156 heterodimers and 170 homodimers. The homodimer and heterodimer datasets are mutually exclusive and is one of the unique features of the analysis. The analysis documents the differences between homodimer and heterodimer interfaces for the first time in a comprehensive manner. Homodimer interfaces have greater number of interface residues and H-bonds on average. However, the density of H-bonds per residue is greater for heterodimer interfaces. The study also shows that charged residues (D, E, K, R) and hydrophilic residues (N, T, S, Q, H, W, Y) are dominant at the heterodimer interfaces. Nonetheless, hydrophobic residues (A, V, L, M, I, F) are predominant at the homodimer interfaces. These data find utility in the development of independent models for the prediction of homodimer and heterodimer interaction sites.
  18 in total

1.  Conservation helps to identify biologically relevant crystal contacts.

Authors:  W S Valdar; J M Thornton
Journal:  J Mol Biol       Date:  2001-10-19       Impact factor: 5.469

2.  Analysis of homodimeric protein interfaces by graph-spectral methods.

Authors:  K V Brinda; N Kannan; S Vishveshwara
Journal:  Protein Eng       Date:  2002-04

3.  Are protein-protein interfaces more conserved in sequence than the rest of the protein surface?

Authors:  Daniel R Caffrey; Shyamal Somaroo; Jason D Hughes; Julian Mintseris; Enoch S Huang
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

4.  Dissecting subunit interfaces in homodimeric proteins.

Authors:  Ranjit Prasad Bahadur; Pinak Chakrabarti; Francis Rodier; Joël Janin
Journal:  Proteins       Date:  2003-11-15

5.  Structural characterisation and functional significance of transient protein-protein interactions.

Authors:  Irene M A Nooren; Janet M Thornton
Journal:  J Mol Biol       Date:  2003-01-31       Impact factor: 5.469

6.  Identification of critical heterodimer protein interface parameters by multi-dimensional scaling in euclidian space.

Authors:  Cui Zhanhua; Jacob Gah-Kok Gan; Li Lei; Venkatarajan Subramanian Mathura; Meena Kishore Sakharkar; Pandjassarame Kangueane
Journal:  Front Biosci       Date:  2005-01-01

7.  Generation of a dataset for studying ligand effect on homodimer interface.

Authors:  Lei Li; Jacob Gah-Kok Gan; Zhanhua Cui; Meena Kishore Sakharkar; Pandjassarame Kangueane
Journal:  Front Biosci       Date:  2005-05-01

8.  The atomic structure of protein-protein recognition sites.

Authors:  L Lo Conte; C Chothia; J Janin
Journal:  J Mol Biol       Date:  1999-02-05       Impact factor: 5.469

9.  The interpretation of protein structures: estimation of static accessibility.

Authors:  B Lee; F M Richards
Journal:  J Mol Biol       Date:  1971-02-14       Impact factor: 5.469

Review 10.  Protein-protein interactions: a review of protein dimer structures.

Authors:  S Jones; J M Thornton
Journal:  Prog Biophys Mol Biol       Date:  1995       Impact factor: 3.667

View more
  26 in total

1.  Domains of Pyrococcus furiosus L-asparaginase fold sequentially and assemble through strong intersubunit associative forces.

Authors:  Dushyant K Garg; Rachana Tomar; Reema R Dhoke; Ankit Srivastava; Bishwajit Kundu
Journal:  Extremophiles       Date:  2015-04-11       Impact factor: 2.395

2.  Linking structural features of protein complexes and biological function.

Authors:  Gopichandran Sowmya; Edmond J Breen; Shoba Ranganathan
Journal:  Protein Sci       Date:  2015-07-14       Impact factor: 6.725

3.  Trichomonas vaginalis metalloproteinase TvMP50 is a monomeric Aminopeptidase P-like enzyme.

Authors:  Rodrigo Arreola; José Luis Villalpando; Jonathan Puente-Rivera; Jorge Morales-Montor; Enrique Rudiño-Piñera; María Elizbeth Alvarez-Sánchez
Journal:  Mol Biotechnol       Date:  2018-08       Impact factor: 2.695

4.  A comparison of successful and failed protein interface designs highlights the challenges of designing buried hydrogen bonds.

Authors:  P Benjamin Stranges; Brian Kuhlman
Journal:  Protein Sci       Date:  2012-11-29       Impact factor: 6.725

Review 5.  Caught in self-interaction: evolutionary and functional mechanisms of protein homooligomerization.

Authors:  Kosuke Hashimoto; Hafumi Nishi; Stephen Bryant; Anna R Panchenko
Journal:  Phys Biol       Date:  2011-05-13       Impact factor: 2.583

6.  EvoEF2: accurate and fast energy function for computational protein design.

Authors:  Xiaoqiang Huang; Robin Pearce; Yang Zhang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

7.  A decision tree model for the prediction of homodimer folding mechanism.

Authors:  Abishek Suresh; Velmurugan Karthikraja; Sajitha Lulu; Uma Kangueane; Pandjassarame Kangueane
Journal:  Bioinformation       Date:  2009-11-17

8.  Interaction modes at protein hetero-dimer interfaces.

Authors:  Ayyappan Vaishnavi; Gopichandran Sowmya; Jayaseelan Kalaivanii; Selvarajan Ilakya; Uma Kangueane; Pandjassarame Kangueane
Journal:  Bioinformation       Date:  2010-01-20

9.  Infantile hypophosphatasia secondary to a novel compound heterozygous mutation presenting with pyridoxine-responsive seizures.

Authors:  Dina Belachew; Traci Kazmerski; Ingrid Libman; Amy C Goldstein; Susan T Stevens; Stephanie Deward; Jerry Vockley; Mark A Sperling; Arcangela L Balest
Journal:  JIMD Rep       Date:  2013-03-12

10.  Amyloidogenic regions and interaction surfaces overlap in globular proteins related to conformational diseases.

Authors:  Virginia Castillo; Salvador Ventura
Journal:  PLoS Comput Biol       Date:  2009-08-21       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.