| Literature DB >> 25231912 |
Fergal J Duffy, Marc Devocelle, David R Croucher, Denis C Shields1.
Abstract
BACKGROUND: Bioactive cyclic peptides derived from natural sources are well studied, particularly those derived from non-ribosomal synthetases in fungi or bacteria. Ribosomally synthesised bioactive disulphide-bonded loops represent a large, naturally enriched library of potential bioactive compounds, worthy of systematic investigation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25231912 PMCID: PMC4262234 DOI: 10.1186/1471-2105-15-305
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Loop, surface and interface size distributions for disulphide-bonded loop containing proteins. This study examined a total of 2,380 3D loop structures from 759 unique proteins in the Protein Data Bank (PDB), along with 8,607 annotated disulphide-bonded loop sequences from 5,989 Uniprot protein entries. (a) Shows the loop size distribution of disulphide bonded loops in PDB structures. (b) Shows the distribution of the number of surface residues possessed by each PDB-derived disulphide-bonded loop (c) shows the distribution of the number of residues in each PDB-derived disulphide loop that are involved in a protein-protein contact. (d) Shows the size distribution of short disulphide-bonded loops annotated in Uniprot.
Uniprot Proteins containing a disulphide-bonded loop comprising over 50% of a PDB protein-protein interface
| Uniprot | Protein containing | Interacting | PDB ID |
|---|---|---|---|
| accession | disulphide-bonded loop | protein | |
| P02462 | Collagen alpha-1(IV) chain [Cleaved into: Arresten], | Homodimer | 1LI1 |
| Q7T1K6 | Natrin-1 (Cysteine-rich venom protein 1) (NA-CRVP1) (Protein G2a), | Homodimer | 1XTA |
| P01133 | Pro-epidermal growth factor (EGF) [Cleaved into: Epidermal growth factor (Urogastrone)], | EGF Receptor (ErbB1), | 1IVO |
| P01058 | Bowman-Birk type proteinase inhibitor, | Bovine trypsin | 1TAB |
| P01062 | Bowman-Birk type trypsin inhibitor, | Trypsin, | 1G9I |
| P17734 | Bowman-Birk type seed trypsin and chymotrypsin inhibitor (BTCI), | Trypsin, | 2G81 |
| P0AD59 | Inhibitor of vertebrate lysozyme, | Lysosome C, | 1GPQ |
| Q9JHF9 | Lymphocyte antigen 96 (Ly-96) (ESOP-1) (Protein MD-2), | Toll-like receptor 4 (TLR-4), | 2Z64 |
| A0S865 | Irditoxin subunit B (IrTxB), | Iridotoxin subunit A | 2H7Z |
| P50897 | Palmitoyl-protein thioesterase 1 (PPT-1) (EC 3.1.2.22) (Palmitoyl-protein hydrolase 1), | Homodimer | 3GRO |
| P04004 | Somatomedin-B subunit of Vitronectin (VN), | Plasminogen activator inhibitor-1 (PAI-1), | 1OC0 |
| P04004 | Somatomedin-B subunit of Vitronectin (VN), | Urokinase plasminogen activator surface receptor (uPAR), | 3BT2 |
| Q03405 | Urokinase plasminogen activator surface receptor (uPAR), | Urokinase-type plasminogen activator (uPA), | 3BT2 |
| Q8RS40 | Beta-1,3-xylanase (txyA), | Homodimer | 2COV |
Protein families containing preferentially conserved disulphide-bonded loop
| Protein family | Disulphide-bonded |
|---|---|
| loop count | |
| Somatotropin | 91 |
| Prolactin | 50 |
| Polygalacturonase/Endopolygalacturonase | 50 |
| Guanylate cyclase activator/Guanylin | 13 |
| Urotensin | 12 |
| Disintegrin | 8 |
| Calcitonin | 6 |
| Other | 57 |
| Total | 287 |
(Conservation difference between disulphide-bonded loop and juxtapeptide region of over 0.30.).
Cyclic peptides derived from EGF-domain containing proteins and tested for EGF activation/inhibition
| No. | Cyclic peptide sequence | Human parent protein |
|---|---|---|
| 1 | CVVGYIGERC | EGF, Epidermal Growth Factor* |
| 2 | CHSGYVGARC | TGFA, Transforming Growth |
| Factor Alpha* | ||
| 3 | CQQEYFGERC | AREG, Amphiregulin* |
| 4 | CDEGYIGARC | BTC, Betacellulin* |
| 5 | CHPGYHGERC | HBEGF, Heparin-binding EGF-like |
| growth factor* | ||
| 6 | CEVGYTGVRC | EREG, Epiregulin* |
| 7 | CQPGFTGARC | NRG1, Neuregulin-1** |
| 8 | CPNGFFGQRC | NRG2, Neuregulin-2** |
| 9 | CKEGYQGVRC | NRG3, Neuregulin-3** |
| 10 | CVENYTGARC | NRG4, Neuregulin-4** |
| 11 | CAQECVHGRC | PEAR1, Platelet endothelial |
| aggregation receptor 1 | ||
| 12 | CTRTGYSGPNC | PTGS1, Prostaglandin G/H |
| synthase 1 | ||
| 13 | CDPGFSGLKC | SELE, E-selectin |
| 14 | CLPAFEGRNC | F7, Coagulation factor VII |
| 15 | CDSDWTGYYC | ITGB3, Integrin Beta 3 |
*Denotes growth factors known to directly activate the EGF receptor. **Denotes growth factors known to directly activate members of the ErbB receptor family other than EGFR/ErbB1.
Figure 2Amino-acid distribution for proteins containing short disulphide-bonded loops. White bars indicate fractional amino acid frequencies across all Uniprot proteins and black bars indicate amino acid frequencies inside short disulphide-bonded loops, excluding the disulphide-bond forming cysteines.
Figure 3Conservation score distribution of amino acids in proteins containing short disulphide-bonded loops. Amino-acids inside short disulphide-bonded loops are indicated in light grey and the overall distribution is in black. Both distributions are scaled to a maximum Y-axis value of 1. P-value of <10-15 indicates that average conservation inside disulphide-bonded loops is very significantly larger than the overall average conservation, based on a single-tailed Mann-Whitney U test.
Figure 4Average amino acid conservation scores close to short loop disulphide bonds. Error-bars indicate standard error. "***" indicates significant results, with p-values < 0.001 based on a Mann-Whitney U-test, where residues at an equal distance from a disulphide bonded cysteine are more conserved inside the disulphide bond.
Figure 5Distribution of the differences between average disulphide-bonded loop conservation scores and the average conservation scores of residues located immediately outside the disulphide-bonded loop. Positive values indicate disulphide-bonded loops more conserved than the regions surrounding them. The black distribution represents all SwissProt annotated disulphide-bonded loops, the red distribution represents disulphide-bonded loops on a PDB protein model surface, and the blue distribution seen at the base of the red distribution represents disulphide-bonded loops at a PDB interface. The green ’X’s underneath the histogram represent the conservation difference of the disulphide-bonded loops in Table 1 comprising over 50% of a PDB interface.
Similar disulphide-bonded loops between human and virus
| Human | Human | Human | Virus | Virus | Virus |
|---|---|---|---|---|---|
| accession | peptide | protein name | accessions | peptide | protein name |
| O43506 | CELQWC | ADAM20: Disintegrin and metalloproteinase domain-containing protein 20 | Q1HVB5 | CELGWC | Uncharacterized protein BNLF2b: Human herpesvirus 4 |
| P22413, Q13822 | CKGRC | ENPP1 and ENPP2: Ectonucleotide pyrophosphatase/phosphodiesterase family member 1 and 2 | O41974 | CKRRC | Immediate-early protein, 73: Murid herpesvirus 4 |
| P04626 | CSKPC | ERBB2: Receptor tyrosine-protein kinase erbB-2 | P03227 | CRRPC | Single stranded DNA-binding protein: Human herpesvirus 4 |
| P00451 | CRAPC | F8: Coagulation factor VIII | P03227 | CRRPC | Single stranded DNA-binding protein: Human herpesvirus 4 |
| P18564 | CTTSTDSC | ITGB6: Integrin beta-6 | P03254 | CNSSTDSC | Early E1A 32 kDa protein: Human adenovirus 2 |
| Q96LB9 | CTVSTDC | PGLYRP3: Peptidoglycan recognition protein 3 | Q77PU6 | CTIRSDC | Protein U90: Human herpesvirus 6B |
| Q92954, P04004 | CKGRC | PRG4: Proteoglycan 4 and VN: Vitronectin | O41974 | CKRRC | Immediate-early protein, 73: Murid herpesvirus 4 |