| Literature DB >> 32476993 |
Meiqi Wu1, Pengchao Lu2, Yingxi Yang3, Liwen Liu1, Hui Wang4, Yan Xu1, Jixun Chu1.
Abstract
BACKGROUND: Lysine lipoylation which is a rare and highly conserved post-translational modification of proteins has been considered as one of the most important processes in the biological field. To obtain a comprehensive understanding of regulatory mechanism of lysine lipoylation, the key is to identify lysine lipoylated sites. The experimental methods are expensive and laborious. Due to the high cost and complexity of experimental methods, it is urgent to develop computational ways to predict lipoylation sites.Entities:
Keywords: Lysine lipoylation; amino acids; post-translational modifications; prediction; scoring matrix; support vector machine
Year: 2019 PMID: 32476993 PMCID: PMC7235397 DOI: 10.2174/1389202919666191014092843
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Fig. (2)The values of MCC with different ratio data sets and encoding schemes. The X-axis represents different encoding schemes, the Y-axis has average values of MCC and the black bars represent standard error.
Fig. (4)The proportion of different amino acids between lysine lipoylation and non-lipoylation fragments. The X-axis represents different amino acids, and the Y-axis is the percentage of different amino acids.
Fig. (5)Two Sample Logo (p<0.05) of compositional bias around the lysine lipoylation and non-lipoylation sites.
Performance of various window lengths in a 10-fold cross-validation.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 9 | 99.64 | 99.61 | 99.94 | 0.9889 | 0.9979 |
| 11 | 99.85 | 99.84 | 98.77 | 0.9931 | 0.9989 |
| 13 | 99.72 | 99.79 | 98.89 | 0.9819 | 0.9992 |
| 15 | 99.86 | 99.84 | 99.96 | 0.9959 | 0.9995 |
| 17 | 99.96 | 99.98 | 100.00 | 0.9990 | 0.9997 |
| 19 | 99.92 | 99.97 | 99.40 | 0.9965 | 0.9964 |
| 21 | 99.93 | 99.89 | 99.44 | 0.9968 | 0.9978 |
Performance of models with different ratios and encoding schemes.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1:1 | BPB | 99.84±0.21 | 99.88±0.25 | 99.81±0.28 | 0.9969±0.0041 | 0.9989±0.0031 |
| AAIndex | 99.91±0.14 | 99.88±0.23 | 99.94±0.18 | 0.9981±0.0028 | 0.9992±0.0008 | |
| PSSM | 99.96±0.09 | 99.99±0.24 | ||||
| BLOSUM62 | 99.84±0.25 | 99.91±0.18 | 99.69±0.50 | 0.9969±0.0050 | 0.9979±0.0012 | |
| Binary | 99.81±0.31 | 99.89±0.22 | 99.63±0.63 | 0.9963±0.0062 | 0.9989±0.0011 | |
| 1:2 | BPB | 99.96±0.12 | 99.97±0.09 | 99.94±0.19 | 0.9991±0.0028 | 0.9997±0.0012 |
| AAIndex | 99.75±0.39 | 99.26±1.16 | 0.9945±0.0087 | 0.9986±0.0018 | ||
| PSSM | 99.67±0.19 | 99.51±0.28 | 99.98±0.11 | 0.9927±0.0042 | 0.9958±0.0013 | |
| BLOSUM62 | 99.94±0.13 | 99.99±0.03 | 99.81±0.40 | 0.9986±0.0029 | 0.9979±0.0014 | |
| Binary | 99.96±0.12 | 99.97±0.09 | 99.94±0.19 | 0.9991±0.0028 | 0.9995±0.0020 | |
| 1:3 | BPB | 99.92±0.08 | 99.96±0.08 | 99.81±0.28 | 0.9979±0.0020 | 0.9983±0.0017 |
| AAIndex | 99.83±0.11 | 99.98±0.06 | 99.38±0.39 | 0.9955±0.0029 | 0.9979±0.0032 | |
| PSSM | 99.95±0.12 | 99.93±0.14 | 99.99±0.04 | 0.9986±0.0027 | 0.9999±0.0021 | |
| BLOSUM62 | 99.91±0.23 | 99.99±0.04 | 99.63±0.92 | 0.9975±0.0062 | 0.9979±0.0012 | |
| Binary | 99.95±0.07 | 99.99±0.05 | 99.81±0.28 | 0.9988±0.0019 | 0.9994±0.0032 | |
| 1:4 | BPB | 99.94±0.08 | 99.98±0.04 | 99.75±0.30 | 0.9981±0.0026 | 0.9987±0.0025 |
| AAIndex | 99.92±0.11 | 99.99±0.02 | 99.63±0.56 | 0.9977±0.0035 | 0.9985±0.0013 | |
| PSSM | 99.97±0.05 | 99.97±0.06 | 99.99±0.09 | 0.9992±0.0015 | 0.9994±0.0011 | |
| BLOSUM62 | 99.92±0.11 | 99.99±0.04 | 99.63±0.56 | 0.9977±0.0035 | 0.9978±0.0015 | |
| Binary | 99.88±0.16 | 99.99±0.06 | 99.38±0.83 | 0.9961±0.0052 | 0.9972±0.0026 | |
| 1:5 | BPB | 99.96±0.07 | 99.99±0.04 | 99.81±0.28 | 0.9985±0.0024 | 0.9992±0.0015 |
| AAIndex | 99.96±0.08 | 99.99±0.05 | 99.75±0.49 | 0.9985±0.0030 | 0.9982±0.0036 | |
| PSSM | 99.94±0.05 | 99.92±0.06 | 0.9978±0.0018 | 0.9979±0.0012 | ||
| BLOSUM62 | 99.93±0.10 | 99.99±0.10 | 99.57±0.62 | 0.9974±0.0037 | 0.9977±0.0035 | |
| Binary | 99.96±0.07 | 99.99±0.08 | 99.75±0.41 | 0.9985±0.0024 | 0.9987±0.0026 | |
| 1:6 | BPB | 99.95±0.04 | 99.99±0.12 | 99.63±0.30 | 0.9978±0.0018 | 0.9979±0.0019 |
| AAIndex | 99.89±0.14 | 99.98±0.04 | 99.38±0.87 | 0.9956±0.0053 | 0.9967±0.0024 | |
| PSSM | 99.91±0.00 | 99.90±0.00 | 99.99±0.04 | 0.9964±0.0000 | 0.9991±0.0012 | |
| BLOSUM62 | 99.95±0.10 | 99.99±0.09 | 99.63±0.74 | 0.9978±0.0043 | 0.9978±0.0033 | |
| Binary | 99.97±0.04 | 99.99±0.06 | 99.79±0.29 | 0.9988±0.0017 | 0.9983±0.0019 |
Performance of models with different algorithms and encoding schemes.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| SVM | BPB | 99.84±0.21 | 99.88±0.25 | 99.81±0.28 | 0.9969±0.0041 | 0.9989±0.0031 |
| AAIndex | 99.91±0.14 | 99.88±0.23 | 99.94±0.18 | 0.9981±0.0028 | 0.9992±0.0008 | |
| PSSM | 99.96±0.09 | 99.99±0.24 | ||||
| BLOSUM62 | 99.84±0.25 | 99.91±0.18 | 99.69±0.50 | 0.9969±0.0050 | 0.9979±0.0012 | |
| Binary | 99.81±0.31 | 99.89±0.22 | 99.63±0.63 | 0.9963±0.0062 | 0.9989±0.0011 | |
| KNN | BPB | 98.84±0.21 | 99.69±0.41 | 99.99±0.14 | 0.9969±0.0041 | 0.9996±0.0001 |
| AAIndex | 84.26±1.61 | 69.01±3.34 | 99.51±0.77 | 0.7198±0.0258 | 0.9603±0.0109 | |
| PSSM | 99.13±0.38 | 98.27±0.77 | 99.98±0.29 | 0.9829±0.0076 | 0.9959±0.0014 | |
| BLOSUM62 | 90.99±1.31 | 81.97±2.62 | 99.99±0.36 | 0.8336±0.0224 | 0.9818±0.0049 | |
| Binary | 83.46±1.64 | 66.91±3.28 | 99.98±0.63 | 0.7092±0.0261 | 0.9648±0.0122 | |
| Decision | BPB | 97.99±0.62 | 97.59±0.80 | 98.39±0.88 | 0.9600±0.0124 | 0.9804±0.0078 |
| Tree | AAIndex | 96.60±1.26 | 93.64±2.21 | 99.57±0.62 | 0.9339±0.0242 | 0.9674±0.0128 |
| PSSM | 97.31±0.63 | 96.23±0.97 | 98.39±1.00 | 0.9466±0.0127 | 0.9743±0.0085 | |
| BLOSUM62 | 97.72±0.96 | 95.86±1.89 | 99.57±0.39 | 0.9551±0.0184 | 0.9791±0.0087 | |
| Binary | 96.60±0.62 | 93.52±1.08 | 99.69±0.41 | 0.9339±0.0119 | 0.9684±0.0079 | |
| Logistic | BPB | 99.91±0.20 | 99.88±0.25 | 99.94±0.18 | 0.9981±0.0039 | 0.9993±0.0017 |
| Regression | AAIndex | 99.51±0.25 | 99.01±0.49 | 99.98±0.83 | 0.9902±0.0049 | 0.9989±0.0102 |
| PSSM | 99.54±0.21 | 99.07±0.41 | 99.97±0.48 | 0.9908±0.0041 | 0.9991±0.0076 | |
| BLOSUM62 | 99.72±0.29 | 99.44±0.58 | 99.98±0.62 | 0.9945±0.0058 | 0.9994±0.0076 | |
| Binary | 99.94±0.12 | 99.88±0.25 | 0.9988±0.0025 | 0.9996±0.0108 | ||
| Naïve | BPB | 99.54±0.32 | 99.57±0.62 | 99.51±0.25 | 0.9908±0.0063 | 0.9995±0.0004 |
| Bayes | AAIndex | 98.73±0.59 | 99.81±0.28 | 97.65±1.16 | 0.9750±0.0116 | 0.9964±0.0043 |
| PSSM | 99.96±0.72 | 99.99±0.35 | 0.9991±0.0140 | 0.9994±0.0091 | ||
| BLOSUM62 | 97.78±0.94 | 99.13±0.63 | 96.42±1.43 | 0.9560±0.0187 | 0.9949±0.0033 | |
| Binary | 98.86±0.59 | 99.99±0.14 | 97.72±1.17 | 0.9775±0.0115 | 0.9883±0.0061 |