| Literature DB >> 26920059 |
Golnaz Bahramali1, Bahram Goliaei2, Zarrin Minuchehr3, Ali Salari4.
Abstract
Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.Entities:
Keywords: Chameleon sequences; Enrichment analysis; Neurodegenerative diseases; Protein secondary structure; Sequence properties
Mesh:
Substances:
Year: 2016 PMID: 26920059 PMCID: PMC7124260 DOI: 10.1016/j.bbrc.2016.01.187
Source DB: PubMed Journal: Biochem Biophys Res Commun ISSN: 0006-291X Impact factor: 3.575
Statistics of the chameleon sequences in the dataset.
| Length of chameleon | No of non-redundant HE-Chameleon | No of non-redundant HC-Chameleon | No of non-redundant CE-Chameleon |
|---|---|---|---|
| 4-mer | 55380(1125678) | 62129(984713) | 43645(347083) |
| 5-mer | 25056 (43550) | 21022(33288) | 7878(9670) |
| 6-mer | 1423(1501) | 1067(1268) | 267(283) |
| 7-mer | 62(63) | 42(57) | 4 (9) |
| 8-mer | 0 | 6(6) | 0 |
| Total | 81921(1170792) | 84266(1019332) | 51795(357045) |
Complete helix to strand.
Complete helix to coil.
Complete strand to coil.
The sequences were shown when chameleons are detected in more than one occasion.
Fig. 1A bar graph of normalized chameleon's propensity of amino acids in groups of different chameleon types (HE, HC and CE) in dataset1. The propensity values of amino acids in different groups are represented by different bars as indicated in the legend.
Normalized neighbor-dependent propensity in HE-Chameleons group.
| Ala | Arg | Asn | Asp | Cys | Glu | Gln | Gly | His | Ile | Leu | Lys | Met | Phe | Pro | Ser | Thr | Trp | Tyr | Val | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.74 | 0.88 | 0.80 | 0.69 | 0.88 | 0.72 | 0.71 | 1.02 | 1.13 | 0.84 | 0.76 | 0.81 | 1.10 | 0.82 | 0.95 | 1.19 | 1.12 | 1.20 | ||
| Arg | 0.81 | 0.85 | 0.76 | 0.59 | 1.04 | 0.61 | 0.74 | 0.93 | 1.18 | 1.03 | 0.75 | 0.87 | 1.04 | 0.73 | 1.07 | 1.14 | 0.97 | |||
| Asn | 0.86 | 0.84 | 0.69 | 0.58 | 0.68 | 0.61 | 0.60 | 1.17 | 0.70 | 1.18 | 0.99 | 0.77 | 0.76 | 1.18 | 0.40 | 0.92 | 1.04 | 0.71 | 0.99 | |
| Asp | 0.66 | 0.60 | 0.73 | 0.51 | 0.77 | 0.45 | 0.61 | 1.05 | 0.73 | 1.07 | 0.81 | 0.60 | 0.59 | 0.96 | 0.59 | 0.75 | 0.93 | 0.59 | 0.96 | 1.14 |
| Cys | 0.94 | 0.85 | 0.57 | 0.63 | 0.32 | 0.80 | 0.70 | 0.89 | 0.56 | 1.08 | 1.18 | 0.79 | 0.66 | 1.07 | 0.20 | 0.96 | 1.06 | 0.39 | 0.88 | |
| Glu | 0.62 | 0.66 | 0.60 | 0.52 | 0.75 | 0.50 | 0.60 | 1.01 | 0.78 | 1.00 | 0.84 | 0.61 | 0.82 | 1.00 | 0.96 | 0.82 | 0.98 | 0.85 | 0.95 | 1.14 |
| Gln | 0.66 | 0.71 | 0.74 | 0.57 | 0.65 | 0.53 | 0.67 | 1.09 | 0.82 | 1.13 | 0.90 | 0.60 | 0.71 | 1.00 | 0.68 | 1.02 | 1.12 | 0.93 | 1.05 | 1.19 |
| Gly | 1.18 | 1.01 | 0.86 | 0.96 | 1.05 | 1.09 | 1.06 | 1.10 | 1.09 | 0.72 | 1.17 | 1.11 | ||||||||
| His | 1.04 | 1.04 | 0.90 | 0.66 | 0.64 | 0.87 | 0.82 | 1.19 | 0.78 | 1.19 | 1.07 | 0.89 | 0.56 | 1.16 | 0.53 | 1.09 | 1.11 | 0.71 | 1.12 | |
| Ile | 1.15 | 1.11 | 1.10 | 1.03 | 1.01 | 1.10 | 1.09 | 1.19 | 1.13 | 1.09 | 1.18 | 1.19 | 0.88 | 1.18 | 1.17 | 1.20 | 1.17 | |||
| Leu | 0.92 | 1.01 | 0.97 | 0.84 | 0.88 | 0.91 | 1.19 | 1.15 | 1.04 | 0.87 | 1.18 | 1.17 | 0.88 | 1.08 | 1.15 | |||||
| Lys | 0.75 | 0.81 | 0.68 | 0.65 | 0.72 | 0.61 | 0.67 | 1.17 | 0.69 | 1.11 | 0.92 | 0.69 | 0.89 | 1.02 | 0.65 | 0.88 | 1.06 | 0.93 | 0.91 | 1.15 |
| Met | 0.90 | 0.90 | 0.77 | 0.68 | 0.53 | 0.78 | 0.75 | 0.94 | 1.18 | 1.03 | 0.87 | 0.73 | 0.99 | 0.38 | 1.04 | 1.13 | 0.61 | 1.04 | ||
| Phe | 1.14 | 1.10 | 1.07 | 0.98 | 0.96 | 1.06 | 1.01 | 1.16 | 1.04 | 1.08 | 1.13 | 0.78 | 1.15 | 1.10 | ||||||
| Pro | 0.55 | 0.46 | 0.38 | 0.27 | 0.45 | 0.19 | 0.30 | 0.53 | 0.49 | 1.03 | 0.85 | 0.42 | 0.62 | 0.78 | 0.32 | 0.36 | 0.65 | 0.35 | 0.74 | 1.00 |
| Ser | 1.12 | 1.04 | 0.91 | 0.70 | 0.98 | 0.88 | 0.88 | 0.99 | 1.09 | 0.82 | 1.01 | 0.74 | 1.07 | 1.20 | 1.17 | 1.20 | ||||
| Thr | 1.16 | 1.14 | 1.16 | 0.97 | 0.85 | 1.04 | 1.11 | 1.16 | 1.15 | 1.16 | 1.13 | 1.14 | 1.17 | 0.78 | 1.16 | 1.17 | 1.14 | 1.17 | ||
| Trp | 1.06 | 1.00 | 0.81 | 0.70 | 0.57 | 0.85 | 0.88 | 1.04 | 0.77 | 1.07 | 1.08 | 0.95 | 0.69 | 1.06 | 0.32 | 1.01 | 1.08 | 0.61 | 1.14 | 1.19 |
| Tyr | 1.14 | 1.07 | 0.93 | 0.81 | 1.11 | 1.12 | 1.13 | 1.13 | 1.14 | 1.08 | 0.77 | 1.18 | 1.20 | |||||||
| Val | 1.20 | 1.20 | 1.12 | 1.07 | 1.12 | 1.14 | 1.20 | 1.13 | 0.86 | 1.17 | 1.19 | 1.14 |
* Amino acids in the columns precede amino acids in the rows. All the propensities >1.2 were bolded.
Normalized neighbor-dependent propensity in HC-Chameleons group.
| Ala | Arg | Asn | Asp | Cys | Glu | Gln | Gly | His | Ile | Leu | Lys | Met | Phe | Pro | Ser | Thr | Trp | Tyr | Val | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.91 | 0.95 | 1.13 | 0.99 | 0.85 | 0.88 | 0.89 | 1.00 | 1.18 | 0.89 | 0.81 | 0.95 | 0.87 | 1.13 | 0.65 | 1.09 | 1.16 | 1.09 | 1.14 | 1.02 |
| Arg | 0.97 | 1.09 | 1.16 | 1.10 | 0.79 | 0.96 | 0.99 | 1.01 | 1.14 | 1.18 | 0.97 | 1.12 | 0.97 | 1.18 | 0.55 | 1.13 | 1.03 | 1.15 | ||
| Asn | 1.13 | 0.91 | 0.93 | 0.79 | 1.06 | 0.57 | 1.01 | 1.20 | 1.10 | 0.99 | 0.20 | 0.97 | 1.13 | 0.83 | ||||||
| Asp | 1.04 | 1.12 | 0.87 | 0.86 | 0.96 | 0.92 | 1.09 | 0.61 | 1.06 | 1.19 | 1.14 | 0.97 | 1.11 | 1.20 | 0.17 | 0.91 | 1.05 | 1.08 | 1.20 | 1.15 |
| Cys | 0.95 | 0.95 | 0.73 | 0.66 | 0.32 | 0.97 | 0.95 | 0.92 | 0.73 | 0.72 | 1.03 | 0.83 | 0.34 | 0.83 | 0.15 | 0.91 | 0.94 | 0.25 | 0.63 | 1.01 |
| Glu | 0.89 | 0.98 | 0.99 | 0.98 | 0.89 | 0.93 | 0.96 | 1.03 | 1.14 | 1.00 | 0.85 | 0.99 | 0.87 | 1.18 | 0.60 | 1.05 | 1.13 | 1.04 | 1.07 | 1.17 |
| Gln | 0.96 | 1.11 | 1.12 | 0.70 | 1.03 | 0.93 | 1.10 | 1.09 | 1.06 | 0.94 | 1.09 | 0.92 | 1.17 | 0.57 | 0.90 | 1.15 | ||||
| Gly | 1.06 | 0.93 | 0.77 | 0.72 | 0.90 | 0.95 | 1.02 | 0.66 | 1.00 | 1.18 | 1.14 | 0.87 | 1.14 | 0.56 | 0.80 | 0.87 | 1.17 | 1.18 | 1.09 | |
| His | 1.09 | 1.05 | 0.61 | 1.17 | 0.97 | 0.88 | 0.91 | 0.85 | 0.21 | 1.10 | 1.27 | 0.91 | 1.30 | |||||||
| Ile | 0.88 | 0.97 | 1.14 | 0.93 | 1.00 | 1.11 | 1.18 | 0.97 | 0.85 | 1.15 | 0.88 | 0.77 | 0.83 | 1.15 | 1.17 | |||||
| Leu | 0.83 | 0.95 | 1.09 | 1.02 | 1.11 | 0.92 | 0.98 | 1.02 | 1.17 | 0.91 | 0.79 | 0.98 | 0.83 | 1.00 | 0.79 | 1.02 | 1.10 | 0.86 | 1.01 | 0.98 |
| Lys | 0.97 | 1.07 | 1.02 | 0.87 | 0.99 | 1.09 | 0.90 | 1.07 | 0.98 | 1.05 | 1.01 | 0.54 | 1.05 | 1.17 | 0.97 | 1.11 | ||||
| Met | 1.03 | 0.96 | 1.03 | 0.98 | 0.47 | 1.01 | 0.89 | 1.09 | 0.73 | 0.87 | 0.89 | 1.06 | 0.72 | 0.92 | 0.57 | 1.14 | 1.04 | 0.38 | 1.03 | 1.02 |
| Phe | 1.16 | 1.20 | 0.87 | 1.19 | 1.27 | 1.10 | 1.05 | 0.97 | 1.15 | 0.89 | 0.64 | 1.07 | ||||||||
| Pro | 0.86 | 0.73 | 0.49 | 0.46 | 0.36 | 0.64 | 0.82 | 0.41 | 0.74 | 1.16 | 1.01 | 0.64 | 0.96 | 0.98 | 0.19 | 0.56 | 0.69 | 0.84 | 1.01 | 0.98 |
| Ser | 1.11 | 1.07 | 0.98 | 0.89 | 1.05 | 0.99 | 1.13 | 0.74 | 1.03 | 1.17 | 1.09 | 1.08 | 1.18 | 1.22 | 0.24 | 0.98 | 1.12 | 1.08 | ||
| Thr | 1.16 | 1.07 | 0.95 | 0.91 | 1.09 | 0.82 | 1.10 | 1.13 | 1.13 | 0.37 | 1.13 | |||||||||
| Trp | 0.92 | 0.96 | 1.07 | 1.00 | 0.44 | 1.07 | 0.91 | 0.90 | 0.90 | 0.89 | 1.14 | 0.63 | 0.90 | 0.73 | 1.07 | 1.13 | 0.65 | 0.99 | 1.10 | |
| Tyr | 1.12 | 1.16 | 0.87 | 1.18 | 1.07 | 1.13 | 0.97 | 0.99 | 0.54 | 1.12 | ||||||||||
| Val | 1.01 | 1.16 | 1.20 | 0.90 | 1.08 | 1.19 | 1.13 | 1.00 | 1.15 | 1.07 | 0.68 | 1.20 | 1.08 |
* Amino acids in the columns precede amino acids in the rows. All the propensities >1.2 were bolded.
Normalized neighbor-dependent propensity in CE-Chameleons group.
| Ala | Arg | Asn | Asp | Cys | Glu | Gln | Gly | His | Ile | Leu | Lys | Met | Phe | Pro | Ser | Thr | Trp | Tyr | Val | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.90 | 0.79 | 0.78 | 1.16 | 0.94 | 1.13 | 1.15 | 1.17 | 1.03 | 0.56 | 1.11 | 1.11 | ||||||||
| Arg | 1.13 | 0.87 | 0.72 | 0.74 | 1.10 | 1.00 | 0.90 | 1.03 | 1.15 | 1.18 | 1.08 | 0.87 | 0.69 | 1.20 | 0.86 | 1.16 | ||||
| Asn | 1.01 | 0.90 | 0.55 | 0.48 | 0.67 | 0.80 | 0.71 | 0.57 | 0.78 | 1.14 | 0.74 | 0.73 | 0.35 | 0.75 | 0.89 | 0.83 | 1.12 | |||
| Asp | 0.86 | 0.71 | 0.52 | 0.47 | 0.69 | 0.65 | 0.79 | 0.53 | 0.71 | 1.07 | 0.68 | 0.66 | 0.36 | 0.68 | 0.92 | 0.92 | 1.15 | 1.14 | ||
| Cys | 0.78 | 0.69 | 0.50 | 0.47 | 0.15 | 0.76 | 0.66 | 0.81 | 0.40 | 0.80 | 0.94 | 0.57 | 0.29 | 0.53 | 0.44 | 0.89 | 0.97 | 0.32 | 0.57 | 0.81 |
| Glu | 1.05 | 0.72 | 0.76 | 0.70 | 1.16 | 1.16 | 0.79 | 1.13 | 1.04 | 1.11 | 0.74 | 1.03 | 1.20 | 0.98 | ||||||
| Gln | 1.02 | 0.77 | 0.70 | 0.52 | 1.08 | 1.04 | 0.82 | 0.84 | 0.96 | 0.80 | 0.58 | 1.15 | 1.18 | 0.94 | ||||||
| Gly | 0.89 | 0.84 | 0.67 | 0.54 | 0.82 | 0.83 | 0.80 | 0.78 | 0.88 | 1.09 | 1.00 | 0.68 | 1.00 | 1.14 | 0.63 | 0.86 | 0.95 | 1.05 | 1.16 | 1.04 |
| His | 1.04 | 0.74 | 0.77 | 0.35 | 1.08 | 0.62 | 0.86 | 0.69 | 1.15 | 0.94 | 0.47 | 0.53 | 1.15 | 0.73 | 1.17 | |||||
| Ile | 1.09 | 1.16 | 1.13 | 1.09 | 0.63 | 1.16 | 0.83 | 0.91 | 0.84 | 1.13 | 0.80 | 1.19 | 1.16 | 0.76 | 1.06 | 0.84 | ||||
| Leu | 1.12 | 1.00 | 1.02 | 1.12 | 1.00 | 1.01 | 1.18 | 0.64 | 1.11 | 1.10 | 0.94 | 1.20 | 0.93 | |||||||
| Lys | 1.11 | 0.69 | 0.72 | 0.71 | 1.14 | 1.10 | 0.83 | 0.89 | 1.16 | 1.18 | 0.90 | 1.08 | 0.63 | 0.96 | 1.14 | 0.98 | 1.10 | |||
| Met | 1.16 | 0.74 | 0.80 | 0.76 | 0.44 | 0.98 | 0.68 | 1.07 | 0.53 | 0.81 | 1.03 | 0.74 | 0.53 | 0.86 | 0.65 | 0.93 | 0.92 | 0.11 | 0.94 | 0.96 |
| Phe | 1.19 | 1.12 | 0.75 | 1.17 | 1.15 | 1.03 | 0.63 | 0.66 | 1.30 | 0.95 | 1.03 | |||||||||
| Pro | 0.61 | 0.53 | 0.33 | 0.26 | 0.35 | 0.39 | 0.44 | 0.37 | 0.56 | 1.01 | 0.87 | 0.45 | 0.72 | 0.91 | 0.13 | 0.41 | 0.61 | 0.41 | 0.79 | 0.98 |
| Ser | 0.80 | 0.64 | 0.80 | 1.08 | 1.08 | 0.94 | 1.03 | 1.00 | 1.01 | 0.51 | 0.99 | 1.18 | 1.19 | 1.31 | ||||||
| Thr | 0.95 | 0.82 | 0.87 | 1.11 | 1.20 | 1.13 | 1.03 | 0.63 | 1.20 | 1.15 | ||||||||||
| Trp | 1.10 | 1.07 | 0.88 | 0.90 | 0.42 | 1.19 | 0.84 | 1.05 | 0.70 | 0.77 | 1.01 | 1.03 | 0.53 | 0.74 | 0.52 | 1.07 | 0.95 | 0.63 | 0.94 | 0.98 |
| Tyr | 1.11 | 1.06 | 0.82 | 1.12 | 1.09 | 1.00 | 0.67 | 0.97 | 1.04 | |||||||||||
| Val | 1.13 | 1.11 | 1.07 | 0.71 | 1.15 | 0.79 | 0.96 | 0.99 | 1.12 | 0.79 | 1.14 | 1.13 | 0.94 | 1.06 | 0.84 |
* Amino acids in the columns precede amino acids in the rows. All the propensities >1.2 were bolded.
Solvent accessibility of chameleon pairs.
| Segments | No | No (%) (both exposed) | No (%) (mixed) | No (%) (both buried) |
|---|---|---|---|---|
| 4-mers | ||||
| HE | 53235 | 14647(27.5) | 24502(46) | 14086(26.5) |
| HC | 58633 | 33562(57.2) | 21368(36.4) | 3703(6.3) |
| CE | 41402 | 4308(10.4) | 20397(49.3) | 166974(40.3) |
| 5-mers | ||||
| HE | 24133 | 5843(24.2) | 10857(45) | 7433(30.8) |
| HC | 19940 | 12079(60.6) | 6897(34.6) | 964(4.8) |
| CE | 7444 | 3069(41.2) | 3720(50) | 655(8.8) |
| 6,7,8-mers | ||||
| HE | 1429 | 366(25.6) | 657(46) | 406(28.4) |
| HC | 1052 | 661(62.8) | 341(32.4) | 50(4.8) |
| CE | 252 | 114(45.2) | 119(47.2) | 19(7.5) |
Disease enrichment analysis of chameleon sequences in seven residues in length.
| Gene-set library | Type of chameleons | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Type of disease | |||||||||||
| Neurodegenerative disease | Blood disorder | Colorectal cancer | Eye disorder | Mental disorder | Lung disease | Liver disease | Artritis | IBD | Type 2 diabetes mellitus | CF | |
| HE-Chameleons | |||||||||||
| OMIM | √ | √ | – | – | – | – | – | – | – | – | |
| Disease Perturbations from GEO UP | √ | – | √ | √ | √ | √ | – | √ | √ | – | – |
| Disease Perturbations from GEO Down | √ | – | – | – | √ | – | – | – | – | – | – |
| HC-Chameleons | |||||||||||
| OMIM | – | √ | √ | – | – | – | – | – | – | – | |
| Disease Perturbations from GEO UP | – | √ | – | √ | √ | √ | √ | – | – | ||
| Disease Perturbations from GEO Down | √ | √ | – | – | – | – | – | √ | √ | ||
| CE-Chameleons | |||||||||||
| OMIM | – | – | – | – | – | ||||||
| Disease Perturbations from GEO UP | √ | √ | – | – | |||||||
| Disease Perturbations from GEO Down | – | – | – | – | √ | ||||||
Neurodegenerative disease including Parkinson's disease, Alzheimer's disease, Huntington's disease, Amyotrophic lateral sclerosis (ALS), Alexander disease.
Eye disease including Leber congenital amaurosis and Retinoschisis.
Mental disorders including schizophrenia, bipolar disorder, Anxiety disorders.
IBD, inflammatory bowel disease.
CF, cystic fibrosis.
Human flexible proteins.
| Name of proteins (gene names) | PDB ID | No of chameleon sequences in protein | |||
|---|---|---|---|---|---|
| HE | HC | CE | Total | ||
| Insulin-degrading enzyme(IDE) | 3CWW | 257 | 261 | 122 | 640 |
| GTP-binding nuclear protein Ran(RAN) | 4HAT | 185 | 189 | 31 | 405 |
| Cytoplasmic FMR1-interacting protein 1 (CYFP1) | 3P8C | 157 | 181 | 24 | 379 |
| DNA damage-binding protein 1 (DDB1) | 3EI3 | 157 | 73 | 136 | 366 |
| SAM domain and HD domain-containing protein 1(SAMHD1) | 4MZ7 | 114 | 128 | 48 | 290 |
| pyruvate kinase muscle isozyme (PKM) | 3GR4 | 101 | 124 | 43 | 268 |
| Xaa-Pro aminopeptidase 1(,d.d24”>XPNPEP1) | 3ctz | 87 | 104 | 67 | 258 |
| SOSS complex subunit B1(SOSB1) | 4owt | 106 | 110 | 25 | 241 |
| Serum albumin(ALBU) | 1N5U | 86 | 130 | 21 | 237 |
| Presequence protease, mitochondrial(PREP) | 4L3T | 106 | 70 | 61 | 237 |
| 6-phosphogluconate dehydrogenase, decarboxylating(PGD) | 4GWG | 91 | 114 | 31 | 236 |
| Gamma-tubulin complex component 4 (,d.d24”>TUBGCP4) | 3RIP | 98 | 123 | 15 | 236 |
| Apoptosis inhibitor 5(API5) | 30UR | 102 | 116 | 16 | 234 |
| Beta adrenergic receptor kinase (,d.d24”>ADRBK1) | 4MK0 | 85 | 94 | 55 | 234 |
Proteins with a large number of chameleon sequences.