| Literature DB >> 21569261 |
Konstantinos P Exarchos1, Themis P Exarchos, Georgios Rigas, Costas Papaloukas, Dimitrios I Fotiadis.
Abstract
BACKGROUND: In peptides and proteins, only a small percentile of peptide bonds adopts the cis configuration. Especially in the case of amide peptide bonds, the amount of cis conformations is quite limited thus hampering systematic studies, until recently. However, lately the emerging population of databases with more 3D structures of proteins has produced a considerable number of sequences containing non-proline cis formations (cis-nonPro).Entities:
Mesh:
Substances:
Year: 2011 PMID: 21569261 PMCID: PMC3097163 DOI: 10.1186/1471-2105-12-142
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Conformational isomers of a Glycine-Phenylalanine peptide bond detected in the beta-ketoacyl-acyl carrier protein synthase III (PDB id: 1HNJ).
Figure 2Overview of the employed methodological analysis.
Figure 3Construction of the .
Overview of patterns maintained after each preprocessing step and for all types of pattern discovery.
| Exact pattern discovery | |||
|---|---|---|---|
| 4815 | 1622 | 231 | |
| 100% | - | 100 | |
| 3.58 | - | 0.25 | |
| 38904 | 8251 | 235 | |
| 100 | - | 100 | |
| 6.79 | - | 0.03 | |
| 32812 | 7347 | 225 | |
| 100 | - | 100 | |
| 6.69 | - | 0.02 | |
The 20 highest scoring patterns sorted in descending order by sc_score.
| Exact pattern discovery | Chemical equivalency set | Structural equivalency set | ||||||
|---|---|---|---|---|---|---|---|---|
| KP | 1 | -37.67 | KP | 1 | -37.67 | KP | 1 | -37.67 |
| EDG | 1 | -42.25 | G.[AG][DE]. | 1 | -23.91 | S. | 1 | -19.73 |
| HAE | 1 | -44.51 | [ILMV][ILMV]. | 1 | -21.70 | E. | 1 | -18.28 |
| LG | 1 | -36.61 | G[AG].[DE] | 1 | -31.33 | MLQ...[ITV].[KMR] | 1 | -19.03 |
| 1 | -20.01 | G..[FY]W | 1 | -25.89 | [EQ]. | 1 | -20.32 | |
| ALN | 1 | -41.86 | LG | 1 | -36.61 | [KMR].. | 1 | -20.23 |
| YFT.. | 1 | -14.57 | EDG | 1 | -42.25 | EDG | 1 | -42.25 |
| CLA. | 1 | -20.96 | G | 1 | -24.88 | LG | 1 | -36.61 |
| R..DP....VV | 1 | -20.01 | 1 | -20.01 | HAE | 1 | -44.51 | |
| H. | 1 | -15.76 | [ST]..A | 1 | -18.38 | 1 | -20.01 | |
| VYL.. | 1 | -20.29 | [AG][ILMV].. | 1 | -20.11 | ALN | 1 | -41.86 |
| 1 | -15.89 | T.R. | 1 | -18.48 | GG... | 1 | -19.48 | |
| A...K | 1 | -26.56 | [ST].LN. | 1 | -23.04 | [DLN]L. | 1 | -20.75 |
| L..S | 1 | -19.77 | [AG]. | 1 | -24.32 | [EQ]..P..[FHWY]P.E | 1 | -19.33 |
| REPDP | 1 | -21.25 | MLQ | 1 | -23.90 | QL... | 1 | -23.32 |
| G.MFW | 1 | -16.96 | [AG]K | 1 | -25.89 | A[FHWY].[FHWY] | 1 | -24.56 |
| L.G.. | 1 | -24.86 | HAE | 1 | -44.51 | M[FHWY]. | 1 | -23.90 |
| 1 | -28.88 | [KR][ILMV].P. | 1 | -21.32 | [ITV]..G[ITV].T.[ITV].V | 1 | -22.12 | |
| VL.G. | 1 | -25.03 | [AG][ST].D. | 1 | -18.86 | [KMR]Y... | 1 | -19.41 |
| L..A. | 1 | -18.77 | G... | 1 | -16.47 | [FHWY]..K | 1 | -23.29 |
Figure 4Groupings of amino acids based on common physicochemical properties.
Frequencies of occurrence for each residues and character class in the retained patterns.
| Amino acid frequencies (%) | |||
|---|---|---|---|
| 8 | 5 | 6 | |
| 3 | 2 | 2 | |
| 4 | 3 | 3 | |
| 6 | 4 | 4 | |
| 1 | 1 | 1 | |
| 5 | 4 | 3 | |
| 2 | 1 | 2 | |
| 13 | 9 | 10 | |
| 3 | 2 | 2 | |
| 5 | 3 | 3 | |
| 9 | 6 | 6 | |
| 4 | 3 | 3 | |
| 2 | 1 | 1 | |
| 4 | 3 | 2 | |
| 6 | 4 | 4 | |
| 7 | 5 | 5 | |
| 6 | 3 | 3 | |
| 2 | 2 | 1 | |
| 4 | 3 | 2 | |
| 7 | 4 | 4 | |
| - | 6 | 10 | |
| - | 3 | 7 | |
| - | 3 | 5 | |
| - | 2 | 2 | |
| - | 12 | 7 | |
| - | 5 | 2 | |
| - | 3 | - | |
Frequencies of each residue occupying the position that cis-nonPro peptide bond occurs and the preceding one.
| Residues with | Preceding residue | Frequency of residues with | Frequency of preceding residue (%) | |
|---|---|---|---|---|
| 23 | 22 | 7 | 7 | |
| 15 | 14 | 5 | 4 | |
| 26 | 23 | 8 | 7 | |
| 5 | 4 | 2 | 1 | |
| 27 | 20 | 8 | 6 | |
| 9 | 14 | 3 | 4 | |
| 52 | 62 | 16 | 19 | |
| 7 | 7 | 2 | 2 | |
| 7 | 6 | 2 | 2 | |
| 9 | 11 | 3 | 3 | |
| 17 | 16 | 5 | 5 | |
| 4 | 5 | 1 | 2 | |
| 15 | 12 | 5 | 4 | |
| 0 | 24 | 0 | 8 | |
| 16 | 14 | 5 | 4 | |
| 19 | 16 | 6 | 5 | |
Figure 5Functional associations of .
Functional verification between ELM functional classes and the respective associated sequences.
| ELM | Sequences | ||
|---|---|---|---|
| LIG_14-3-3_2 | Binding | Binding, catalytic activity | |
| LIG_PP1 | Binding, enzyme regulator activity | Binding, catalytic activity, transporter activity, transcription regulator activity | |
| LIG_PP2B_1 | Binding, catalytic activity | Catalytic activity | |
| LIG_MAPK_1 | Binding | Binding, catalytic activity, | |
| LIG_SCF-TrCP1_1 | Binding, catalytic activity | Binding, catalytic activity | |
| LIG_14-3-3_3 | Binding | Binding, catalytic activity | |
| LIG_EH1_1 | Binding | Binding, catalytic activity | |
| LIG_BRCT_BRCA1_1 | Binding | Binding, catalytic activity | |
| LIG_NRBOX | Binding | Binding, catalytic activity | |
| LIG_CORNRBOX | Binding | Binding, catalytic activity, transporter activity | |
Comparison of available methodologies for the classification of amide peptide bonds.
| Author | Method | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Pahlke | Chou-Fasman parameters | 35 | 97 | 66 |
| Exarchos | SVM classifier | 77 | 65 | 71 |
| Current work | exact pattern discovery | 45 | 54 | 49 |
| chemical equivalency set | 76 | 58 | 67 | |
| structural equivalency set | 77 | 63 | 70 | |