| Literature DB >> 23523652 |
Konda Mani Saravanan1, Samuel Selvaraj.
Abstract
For the past few decades, intensive studies have been carried out in an attempt to understand how the amino acid sequences of proteins encode their three dimensional structures to perform their specific functions. In order to understand the sequence-structure relationship of proteins, several sub-sequence search studies in non-redundant sequence-structure databases have been undertaken which have given some fruitful clues. In our earlier work, we analyzed a set of 3124 non-redundant protein sequences from the Protein Data Bank (PDB) and retrieved 30 identical octapeptides having different secondary structures. These octapeptides were characterized by using different computational procedures. This prompted us to explore the presence of octapeptides with reverse sequences and to analyze whether these octapeptides would adopt similar structures as that of their parent octapeptides. Our identical reverse octapeptide search resulted in the finding of eight octapeptide pairs (octapeptide and reverse octapeptide) with similar secondary structure and 23 octapeptide pairs with different secondary structures. In the present work, the geometrical and biophysical characteristics of identical reverse octapeptides were explored and compared with unrelated octapeptide pairs by using various computational tools. We thus conclude that proteins containing identical reverse octapeptides are not very abundant and residues in the octapeptide pairs do not contribute to the stability of the protein. Furthermore, compared to unrelated octapeptides, identical reverse octapeptides do not show certain biophysical and geometrical properties.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23523652 PMCID: PMC4357837 DOI: 10.1016/j.gpb.2012.11.005
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Biophysical and geometrical properties of octapeptides with inverse sequences having different secondary structures
| No. | PDB ID | Octapeptide | Secondary structure | Stabilizing residues | Hphil (%) | Hpho (%) | Others (%) | Attribute | RMSD | CSM | Volume (Å3) | Surface area (Å2) | Quadrapole (Debyes) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | P1 | 1A8D_A(52-59) | VPGINGKA | EECCCCEE | No | 12 | 25 | 62 | Basic | 4.3 | 96.655 | 971 | 621.9 | 81 |
| P2 | 1YZY_A(88-95) | AKGNIGPV | CCCCHHHH | No | 12 | 25 | 62 | Basic | 4.3 | 94.398 | 949 | 630.15 | 42 | |
| 2 | P1 | 1D8W_A(418-425) | YEKEILSR | HHHHCCCC | No | 30 | 20 | 50 | Neutral | 0.6 | 83.482 | 1260 | 718.53 | 100 |
| P2 | 2E52_A(244-251) | RSLIEKEY | HHHHHHHH | No | 30 | 20 | 50 | Neutral | 0.6 | 85.753 | 1305 | 772.85 | 24 | |
| 3 | P1 | 1DM9_B(103-110) | KLNALTMP | HHHHCCCC | No | 38 | 0 | 62 | Basic | 3.0 | 99.401 | 959 | 580.7 | 71 |
| P2 | 1W8K_A(137-144) | PMTLANLK | CEEHHHHH | No | 38 | 0 | 62 | Basic | 3.0 | 92.286 | 1118 | 653.28 | 45 | |
| 4 | P1 | 1ED1_A(47-54) | AESLLENK | CHHHHHCH | No | 20 | 13 | 67 | Acidic | 1.6 | 80.533 | 1117 | 740.55 | 163 |
| P2 | 1R6X_A(20-27) | KNELLSEA | HHHHHHHH | No | 20 | 13 | 67 | Acidic | 1.6 | 77.565 | 1129 | 719.77 | 110 | |
| 5 | P1 | 1G4M_A(106-113) | KKLGEHAY | HHCCCCEE | No | 50 | 25 | 25 | Basic | 1.0 | 90.681 | 1178 | 802.44 | 248 |
| P2 | 2UVK_A(276-283) | YAHEGLKK | CCCCCCCC | No | 50 | 25 | 25 | Basic | 1.0 | 99.514 | 1102 | 757.62 | 27 | |
| 6 | P1 | 1GJW_A(2-9) | LLREINRY | HHHHHHHH | No | 38 | 50 | 12 | Basic | 3.5 | 81.161 | 1378 | 849.18 | 43 |
| P2 | 2ODI_A(231-238) | YRNIERLL | EEEEEECC | No | 38 | 50 | 12 | Basic | 3.5 | 59.256 | 1363 | 973.29 | 234 | |
| 7 | P1 | 1ITX_A(354-361) | QTCTGGSS | CCCCEECC | No | 0 | 0 | 100 | Neutral | 1.4 | 96.532 | 864 | 659.05 | 88 |
| P2 | 2V3I_A(20-27) | SSGGTCTQ | CCCCCEEE | No | 0 | 0 | 100 | Neutral | 1.4 | 87.245 | 960 | 637.51 | 109 | |
| 8 | P1 | 1JL1_A(139-146) | AAAMNPTL | HHHHCCCC | No | 40 | 20 | 40 | Neutral | 0.04 | 95.226 | 972 | 660.34 | 71 |
| P2 | 2NVO_A(383-390) | LTPNMAAA | CCHHHHHH | No | 40 | 20 | 40 | Neutral | 0.04 | 93.055 | 967 | 580.17 | 80 | |
| 9 | P1 | 1N1B_A(481-488) | YHDILCLA | CCHHHHHH | No | 9 | 18 | 73 | Neutral | 2.7 | 86.665 | 1140 | 686.48 | 105 |
| P2 | 2FA1_A(142-149) | ALCLIDHY | EEEEEEEE | No | 9 | 18 | 73 | Neutral | 2.7 | 90.691 | 1154 | 797.82 | 53 | |
| 10 | P1 | 1O2D_A(131-138) | VVEIPTTA | EEEEECCC | 132V | 20 | 27 | 53 | Acidic | 1.6 | 93.583 | 1023 | 724.32 | 87 |
| P2 | 1TG7_A(626-633) | ATTPIEVV | CCEEEEEE | No | 20 | 27 | 53 | Acidic | 1.6 | 93.519 | 1026 | 735.4 | 49 | |
| 11 | P1 | 1QOY_A(177-184) | AYAGAAAG | HHHCCCCC | No | 0 | 12 | 88 | Neutral | 3.2 | 99.551 | 787 | 550.79 | 57 |
| P2 | 1RWR_A(209-216) | GAAAGAYA | CCCCCCEE | No | 0 | 12 | 88 | Neutral | 3.2 | 99.532 | 776 | 590.71 | 67 | |
| 12 | P1 | 1QUS_A(277-284) | GQAPGLPN | CCCCCCCC | No | 20 | 20 | 60 | Neutral | 2.5 | 67.633 | 918 | 649.24 | 134 |
| P2 | 1QWO_A(286-293) | NPLGPAQG | CCCCHHHH | No | 20 | 20 | 60 | Neutral | 2.5 | 75.013 | 886 | 558.93 | 91 | |
| 13 | P1 | 1SU8_A(269-276) | IVSVSKEM | HHHHHHHC | No | 19 | 25 | 56 | Neutral | 3.4 | 81.048 | 1126 | 667.56 | 53 |
| P2 | 1W77_A(75-82) | MEKSVSVI | CCCCEEEE | 80S, 82I | 19 | 25 | 56 | Neutral | 3.4 | 89.778 | 1112 | 786.99 | 103 | |
| 14 | P1 | 1SVF_A(177-184) | SPAITAAN | HHHHHHCC | No | 0 | 12 | 88 | Neutral | 4.1 | 90.993 | 907 | 555.91 | 23 |
| P2 | 1YNF_A(110-117) | NAATIAPS | HCEEEECC | No | 0 | 12 | 88 | Neutral | 4.1 | 84.159 | 896 | 667.46 | 64 | |
| 15 | P1 | 1UA4_A(409-416) | IKEGIGEV | CCCCEEEE | No | 38 | 38 | 25 | Acidic | 1.9 | 52.912 | 1048 | 737.5 | 48 |
| P2 | 1X2I_A(48-55) | VEGIGEKI | CCCCCHHH | No | 38 | 38 | 25 | Acidic | 1.9 | 46.389 | 1043 | 706.78 | 19 | |
| 16 | P1 | 2ANE_A(68-75) | LFTVGTVA | CCCEEEEE | 73T, 75A | 0 | 50 | 50 | Neutral | 1.5 | 67.694 | 1000 | 724.57 | 170 |
| P2 | 3B8D_A(263-270) | AVTGVTFL | CCCEEEEC | 268T | 0 | 50 | 50 | Neutral | 1.5 | 80.077 | 1012 | 734.76 | 76 | |
| 17 | P1 | 2BKX_A(202-209) | KAEAVRKL | HHHHHHHH | No | 50 | 25 | 25 | Basic | 3.6 | 66.488 | 1204 | 797.6 | 44 |
| P2 | 2BZ1_A(3-10) | LKRVAEAK | EEEEEEEE | No | 50 | 25 | 25 | Basic | 3.6 | 47.348 | 1173 | 873.21 | 260 | |
| 18 | P1 | 2CHH_A(4-11) | GVFTLPAN | CEEECCCC | No | 0 | 38 | 62 | Neutral | 2.2 | 99.601 | 1000 | 716.51 | 97 |
| P2 | 2Q2R_A(21-28) | NAPLTFVG | CCCEEEEE | No | 0 | 38 | 62 | Neutral | 2.2 | 99.613 | 1010 | 712.11 | 154 | |
| 19 | P1 | 2ISB_A(111-118) | EEVVEAMR | HHHHHHHC | No | 47 | 20 | 33 | Acidic | 2.9 | 93.375 | 1092 | 668.75 | 39 |
| P2 | 2JDJ_A(85-92) | RMAEVVEE | HHEEEEEE | No | 47 | 20 | 33 | Acidic | 2.9 | 99.661 | 1002 | 773.7 | 117 | |
| 20 | P1 | 2NY1_A(253-260) | PVVSSQLL | CCCCCCEE | 257S, 258Q | 0 | 50 | 50 | Neutral | 4.1 | 99.461 | 1087 | 756.28 | 14 |
| P2 | 2A40_B(220-227) | LLQSSVVP | HHHHHCCC | No | 0 | 50 | 50 | Neutral | 4.1 | 99.342 | 1076 | 688.67 | 39 | |
| 21 | P1 | 2ZBL_A(14-21) | EQETDRIF | HHHHHHHH | No | 21 | 14 | 64 | Acidic | 1.1 | 82.528 | 1230 | 707.03 | 39 |
| P2 | 1M0W_A(271-278) | FIRDTEQE | EECCCCCE | No | 21 | 14 | 64 | Acidic | 1.1 | 84.776 | 1221 | 736.33 | 133 | |
| 22 | P1 | 1R6X_A(20-27) | KNELLSEA | HHHHHHHH | No | 29 | 29 | 43 | Acidic | 1.6 | 77.567 | 1129 | 719.77 | 110 |
| P2 | 1ED1_A(47-54) | AESLLENK | CHHHHHCH | No | 29 | 29 | 43 | Acidic | 1.6 | 80.533 | 1117 | 740.55 | 163 | |
| 23 | P1 | 2EIY_A(171-178) | KMEAVAAG | HHHHHHCC | No | 25 | 25 | 50 | Neutral | 0.4 | 93.512 | 941 | 619.3 | 125 |
| P2 | 1TWD_A(232-239) | GAAVAEMK | HHHHHHHH | No | 25 | 25 | 50 | Neutral | 0.4 | 91.508 | 805 | 545.36 | 30 |
Note: P1 and P2 refer to the octapeptide and its reverse octapeptide, respectively. Hphil, Hpho and others column indicates the occurrence (in percentage) of hydrophilic residues, hydrophobic residues and other residues in the octapeptide, respectively. No in Stabilizing residues column indicates that there are no stabilizing residues; if a stabilizing residue exists, number and stabilizing amino acid in a single letter code are indicated. CSM, continuous symmetry measure; RMSD, root mean square deviation.
Figure 1Occurrence of each amino acid residue in identical reverse octapeptide pairs with similar and different conformations
Figure 2Difference in dipole moment of identical reverse octapeptides with different conformations
Figure 3Distribution of GO terms of protein containing identical reverse octapeptides with similar and different secondary structures