| Literature DB >> 27388622 |
Monika Marczak1, Krystyna Okoniewska2, Tomasz Grabowski3.
Abstract
BACKGROUND: Total body clearance of biological drugs is for the most part dependent on the receptor mechanisms (receptor mediated clearance) and the concentration of antibodies aimed at administered drug - anti-drug-antibodies (ADA). One of the significant factors that induces the increase of ADA level after drug administration could be the aggregates present in the finished product or formed in the organism. Numerous attempts have been made to identify the sequence fragments that could be responsible for forming the aggregates - aggregate prone regions (APR).Entities:
Keywords: ADA; Aggregates; Immunogenicity; In silico; Proteins
Year: 2016 PMID: 27388622 PMCID: PMC4937009 DOI: 10.1186/s40203-016-0019-4
Source DB: PubMed Journal: In Silico Pharmacol ISSN: 2193-9616
Aggregation prone regions (no. 1–84), tregitope sequences (no. 85–106) and short sequences extracted from tregitopes - control set (no. 107–149)
| No. | Sequence | AM | AC | ROT | HBA | QPCaco [nm/s] | IP [eV] | NON | QPlogS | AEx |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ALLVN | 4 | 4 | 15 | 13.50 | 4.41 | 39.05 | 0 | −0.012 | −1.19 |
| 2 | ALVLIAFA | 5 | 5 | 19 | 15.00 | 7.17 | 48.36 | 0 | −0.163 | −3.22 |
| 3 | ALYLV | 4 | 4 | 16 | 12.75 | 4.76 | 29.11 | 0 | −0.119 | −1.10 |
| 4 | CQQYN | 5 | 5 | 23 | 21.75 | 1.71 | 38.59 | 0 | 0.174 | 2.69 |
| 5 | DDHYC | 5 | 7 | 21 | 19.25 | 2.08 | 38.91 | 0 | −0.105 | 2.65 |
| 6 | ELLFFAK | 7 | 7 | 28 | 21.00 | 5.73 | 57.80 | 0 | −0.082 | −0.75 |
| 7 | FAAFV | 3 | 3 | 11 | 9.00 | 4.20 | 28.86 | 0 | −0.096 | −2.23 |
| 8 | FALFFTIF | 7 | 7 | 28 | 21.70 | 8.87 | 67.56 | 0 | −0.226 | −3.96 |
| 9 | FAVWG | 4 | 4 | 13 | 12.00 | 4.25 | 37.33 | 0 | −0.098 | −1.33 |
| 10 | FILFAVF | 6 | 6 | 23 | 18.00 | 8.46 | 57.82 | 0 | −0.205 | −4.01 |
| 11 | FLSVFFSG | 8 | 8 | 29 | 25.40 | 8.38 | 77.56 | 0 | −0.122 | −3.29 |
| 12 | FVQWLM | 6 | 6 | 25 | 21.00 | 6.24 | 56.03 | 0 | −0.124 | −1.63 |
| 13 | GLALL | 4 | 4 | 14 | 12.00 | 4.97 | 38.83 | 0 | −0.088 | −1.92 |
| 14 | GLLYC | 5 | 5 | 19 | 15.25 | 5.09 | 38.51 | 0 | −0.077 | −0.97 |
| 15 | GSFFL | 5 | 5 | 18 | 15.70 | 5.18 | 48.43 | 0 | −0.080 | −1.34 |
| 16 | GSFFLY | 6 | 6 | 23 | 19.45 | 5.60 | 48.43 | 0 | −0.110 | −1.04 |
| 17 | GSFFLYS | 7 | 7 | 27 | 23.15 | 5.96 | 58.29 | 0 | −0.095 | −1.04 |
| 18 | IAALL | 3 | 3 | 12 | 9.00 | 4.34 | 29.09 | 0 | −0.106 | −2.04 |
| 19 | IFLFG | 5 | 5 | 18 | 15.00 | 6.27 | 48.24 | 0 | −0.130 | −2.43 |
| 20 | IFTDF | 5 | 6 | 20 | 16.70 | 4.72 | 48.68 | 0 | −0.147 | −0.39 |
| 21 | IFYFYGTTY | 9 | 9 | 37 | 30.65 | 7.07 | 58.21 | 0 | −0.184 | −1.42 |
| 22 | IGAIY | 4 | 4 | 15 | 12.75 | 3.95 | 29.02 | 0 | −0.083 | −0.39 |
| 23 | IGYIS | 5 | 5 | 19 | 16.45 | 4.31 | 38.87 | 0 | −0.068 | −0.20 |
| 24 | IGYIY | 5 | 5 | 20 | 16.50 | 4.36 | 29.02 | 0 | −0.113 | −0.03 |
| 25 | IMVTF | 5 | 5 | 20 | 16.20 | 5.91 | 47.81 | 0 | −0.125 | −1.86 |
| 26 | ISLLLIQ | 7 | 7 | 29 | 24.20 | 7.63 | 67.96 | 0 | −0.102 | −2.68 |
| 27 | IVTCVVV | 7 | 7 | 24 | 21.20 | 8.93 | 67.63 | 0 | −0.103 | −4.25 |
| 28 | IYYCV | 5 | 5 | 21 | 16.00 | 4.88 | 28.73 | 0 | −0.108 | −0.48 |
| 29 | LAILT | 4 | 4 | 16 | 12.70 | 4.83 | 38.93 | 0 | −0.106 | −1.47 |
| 30 | LFNIA | 4 | 4 | 16 | 13.50 | 4.32 | 38.90 | 0 | −0.032 | −0.95 |
| 31 | LFVEF | 5 | 6 | 20 | 17.00 | 5.67 | 48.29 | 0 | −0.132 | −1.33 |
| 32 | LGIYF | 5 | 5 | 19 | 15.75 | 5.32 | 38.65 | 0 | −0.122 | −1.20 |
| 33 | LGLLG | 5 | 5 | 16 | 15.00 | 5.61 | 48.53 | 0 | −0.070 | −2.02 |
| 34 | LGQFLLFC | 8 | 8 | 31 | 26.00 | 8.90 | 77.02 | 0 | −0.100 | −3.72 |
| 35 | LGVIW | 5 | 5 | 17 | 15.00 | 5.77 | 47.12 | 0 | −0.130 | −2.02 |
| 36 | LIGALLV | 6 | 6 | 21 | 18.00 | 7.88 | 58.18 | 0 | −0.142 | −3.58 |
| 37 | LLIYAA | 4 | 4 | 17 | 12.75 | 4.75 | 29.09 | 0 | −0.136 | −0.99 |
| 38 | LLIYAASYL | 7 | 7 | 30 | 23.20 | 6.97 | 48.66 | 0 | −0.187 | −1.85 |
| 39 | LLIYGA | 5 | 5 | 19 | 15.75 | 5.39 | 38.78 | 0 | −0.118 | −1.28 |
| 40 | LLIYSASFLY | 9 | 9 | 38 | 29.90 | 8.70 | 68.10 | 0 | −0.211 | −3.06 |
| 41 | LLMLL | 5 | 5 | 21 | 15.50 | 6.91 | 47.90 | 0 | −0.175 | −2.78 |
| 42 | LMVFFGN | 7 | 7 | 26 | 23.00 | 7.47 | 67.24 | 0 | −0.069 | −2.67 |
| 43 | LVFFA | 4 | 4 | 15 | 12.00 | 5.65 | 38.57 | 0 | −0.131 | −2.41 |
| 44 | LVYGA | 4 | 4 | 14 | 12.75 | 3.96 | 29.09 | 0 | −0.066 | −0.53 |
| 45 | NLFLLS | 6 | 6 | 24 | 20.20 | 6.12 | 58.52 | 0 | −0.053 | −1.60 |
| 46 | NVILFSVF | 8 | 8 | 30 | 26.20 | 8.97 | 77.71 | 0 | −0.093 | −3.83 |
| 47 | RGFFY | 5 | 5 | 22 | 17.75 | 3.85 | 37.66 | 0 | −0.091 | 0.50 |
| 48 | SFFLY | 5 | 5 | 21 | 16.45 | 4.96 | 38.74 | 0 | −0.128 | −0.69 |
| 49 | SFFLYS | 6 | 6 | 25 | 20.15 | 5.32 | 48.60 | 0 | −0.113 | −0.64 |
| 50 | SVFIF | 5 | 5 | 19 | 15.70 | 6.01 | 48.38 | 0 | −0.116 | −2.06 |
| 51 | SVFIFP | 6 | 6 | 20 | 19.20 | 7.90 | 57.74 | 4 | −0.128 | 0.32 |
| 52 | SVFLFP | 6 | 6 | 20 | 19.20 | 7.90 | 57.79 | 4 | −0.128 | 0.32 |
| 53 | SVFLFPP | 7 | 7 | 21 | 22.70 | 9.80 | 67.16 | 8 | −0.140 | 2.67 |
| 54 | TEYNQ | 5 | 6 | 23 | 22.45 | 1.05 | 39.13 | 0 | 0.108 | 3.60 |
| 55 | TLFLVY | 6 | 6 | 24 | 19.45 | 6.63 | 48.54 | 0 | −0.157 | −2.01 |
| 56 | TLLIIFK | 8 | 7 | 31 | 22.70 | 7.73 | 67.67 | 0 | −0.113 | −2.67 |
| 57 | TNYNQ | 5 | 5 | 22 | 21.95 | 1.08 | 39.35 | 0 | 0.186 | 3.25 |
| 58 | TTEYN | 5 | 6 | 22 | 20.65 | 1.50 | 39.34 | 0 | 0.048 | 3.08 |
| 59 | TVFIFP | 6 | 6 | 20 | 19.20 | 8.04 | 57.73 | 4 | −0.142 | 0.18 |
| 60 | VAYWYILFIG | 9 | 9 | 35 | 28.50 | 9.42 | 66.37 | 0 | −0.264 | −3.87 |
| 61 | VEALYL | 5 | 6 | 21 | 17.75 | 4.79 | 38.83 | 0 | −0.119 | −0.27 |
| 62 | VFLGMFLY | 8 | 8 | 31 | 25.25 | 9.28 | 67.02 | 0 | −0.212 | −4.04 |
| 63 | VLIYF | 5 | 5 | 20 | 15.75 | 6.14 | 38.65 | 0 | −0.158 | −1.94 |
| 64 | VLMISL | 6 | 6 | 24 | 19.20 | 7.29 | 57.67 | 0 | −0.143 | −2.76 |
| 65 | VTLFF | 5 | 5 | 19 | 15.70 | 6.14 | 48.41 | 0 | −0.131 | −2.20 |
| 66 | VTMLV | 5 | 5 | 19 | 16.20 | 6.00 | 47.97 | 0 | −0.105 | −2.04 |
| 67 | VVCFL | 5 | 5 | 18 | 14.50 | 6.88 | 48.06 | 0 | −0.105 | −3.03 |
| 68 | VVCLL | 5 | 5 | 18 | 14.50 | 6.95 | 48.19 | 0 | −0.102 | −3.11 |
| 69 | VVITL | 5 | 5 | 18 | 15.70 | 6.31 | 48.59 | 0 | −0.107 | −2.47 |
| 70 | VVSVLTVL | 8 | 8 | 28 | 25.40 | 9.59 | 77.87 | 0 | −0.129 | −4.55 |
| 71 | VVSVLTVV | 8 | 8 | 27 | 25.40 | 9.60 | 77.84 | 0 | −0.111 | −4.62 |
| 72 | VVYYSNSYWYF | 11 | 11 | 46 | 38.90 | 7.51 | 66.98 | 0 | −0.147 | −1.40 |
| 73 | YCLQYD | 6 | 7 | 27 | 22.50 | 3.50 | 38.73 | 0 | −0.063 | 1.55 |
| 74 | YCQQHNE | 7 | 8 | 32 | 31.75 | 2.19 | 57.83 | 0 | 0.162 | 3.13 |
| 75 | YCQQHY | 6 | 6 | 28 | 26.00 | 2.53 | 38.17 | 0 | 0.056 | 2.39 |
| 76 | YCQQNNN | 7 | 7 | 31 | 30.75 | 1.82 | 58.47 | 0 | 0.329 | 3.28 |
| 77 | YCQQS | 5 | 5 | 23 | 20.95 | 2.01 | 38.51 | 0 | 0.112 | 2.39 |
| 78 | YCQQYS | 6 | 6 | 28 | 24.70 | 2.43 | 38.51 | 0 | 0.082 | 2.48 |
| 79 | YFIAAV | 4 | 4 | 16 | 12.75 | 4.69 | 28.93 | 0 | −0.122 | −1.03 |
| 80 | YIFSNYWIQWV | 11 | 11 | 46 | 39.20 | 8.59 | 84.76 | 0 | −0.153 | −2.52 |
| 81 | YISQFIIMY | 9 | 9 | 40 | 32.20 | 8.08 | 67.10 | 0 | −0.163 | −2.37 |
| 82 | YSVVLLL | 7 | 7 | 27 | 22.45 | 8.03 | 58.37 | 0 | −0.158 | −3.10 |
| 83 | YVWQVL | 6 | 6 | 24 | 21.25 | 5.61 | 47.09 | 0 | −0.100 | −0.98 |
| 84 | YYWTWI | 6 | 6 | 26 | 20.20 | 4.34 | 36.24 | 0 | −0.213 | 0.49 |
| 85 | LMIYEVSNRPSGVPD | 15 | 17 | 56 | 55.15 | 12.68 | 134.38 | 8 | −0.105 | 2.04 |
| 86 | NTLYLQMNSLRAEDTAVYYCA | 18 | 20 | 80 | 68.85 | 10.95 | 144.82 | 0 | −0.047 | −3.65 |
| 87 | NSTYRVVSVLTVLH | 14 | 14 | 56 | 51.05 | 11.43 | 125.82 | 0 | −0.079 | −4.92 |
| 88 | PAVLQSSGLYSLSSVVTVPSSSLGTQ | 25 | 25 | 89 | 88.75 | 21.08 | 233.51 | 8 | 0.009 | −5.47 |
| 89 | VDTSKNQFSLRLSSVTAADTA | 19 | 20 | 77 | 67.90 | 10.42 | 175.19 | 0 | 0.049 | −3.18 |
| 90 | KVYACEVTHQGLSS | 14 | 14 | 54 | 48.85 | 8.39 | 116.08 | 0 | 0.067 | −1.91 |
| 91 | MHWVRQAPGKGLEWV | 15 | 15 | 56 | 52.50 | 10.90 | 130.77 | 4 | −0.085 | −0.31 |
| 92 | LNNFYPREAKVQWKVDNALQSGNS | 24 | 24 | 94 | 86.65 | 12.22 | 202.03 | 4 | 0.330 | −0.58 |
| 93 | VHWYQQKPGQAPVL | 14 | 13 | 49 | 51.25 | 10.67 | 113.79 | 8 | 0.071 | 3.62 |
| 94 | GTDFTLTISSLQPED | 15 | 18 | 57 | 55.50 | 10.60 | 146.32 | 4 | −0.116 | 0.20 |
| 95 | PGLVRPSQTLSLTCT | 15 | 15 | 55 | 53.50 | 13.68 | 144.30 | 8 | −0.034 | 0.86 |
| 96 | GGLVQPGGSLRLSCAASGFTF | 19 | 19 | 66 | 64.30 | 16.45 | 183.05 | 4 | −0.014 | −5.46 |
| 97 | WSWIRQPPGKGLEWI | 16 | 16 | 58 | 54.20 | 12.32 | 139.75 | 8 | −0.130 | 2.37 |
| 98 | VSWYQQLPGTAPKL | 14 | 13 | 49 | 48.15 | 11.01 | 114.36 | 8 | 0.020 | 3.28 |
| 99 | WSWVRQPPGRGLEWI | 15 | 16 | 57 | 55.20 | 12.31 | 139.09 | 8 | −0.181 | 2.36 |
| 100 | GGLVQPGRSLRLSCAASGFTF | 19 | 19 | 71 | 66.30 | 15.86 | 182.15 | 4 | −0.033 | −4.79 |
| 101 | LAWYQQKPGKAPKL | 15 | 12 | 52 | 45.75 | 8.84 | 103.95 | 8 | 0.157 | 5.45 |
| 102 | MHWVRQAPGQGLEWM | 14 | 15 | 56 | 54.50 | 10.54 | 130.27 | 4 | −0.106 | 0.04 |
| 103 | VSWYQQHPGKAPKL | 15 | 13 | 52 | 50.45 | 9.59 | 113.81 | 8 | 0.110 | 4.76 |
| 104 | GDRVTITCRASQGIS | 14 | 15 | 58 | 51.80 | 8.68 | 134.40 | 0 | 0.007 | −2.06 |
| 105 | GASVKVSCKASGYTF | 15 | 13 | 53 | 44.05 | 8.86 | 116.11 | 0 | 0.115 | −2.48 |
| 106 | EEQYNSTYRVVSVLTVLHQDW | 21 | 24 | 89 | 82.80 | 12.81 | 182.90 | 0 | −0.081 | −5.22 |
| 107 | LMIYE | 5 | 6 | 23 | 18.25 | 4.47 | 38.13 | 0 | −0.135 | 0.19 |
| 108 | NTLYL | 5 | 5 | 21 | 17.95 | 3.86 | 39.21 | 0 | −0.023 | 0.40 |
| 109 | NSTYR | 5 | 5 | 24 | 20.65 | 1.38 | 38.43 | 0 | 0.062 | 3.08 |
| 110 | PAVLQ | 4 | 4 | 13 | 15.00 | 4.85 | 38.40 | 4 | −0.005 | 2.02 |
| 111 | VDTSK | 6 | 6 | 22 | 18.40 | 2.43 | 48.87 | 0 | 0.029 | 2.06 |
| 112 | KVYAC | 5 | 4 | 19 | 13.25 | 3.09 | 28.56 | 0 | 0.024 | 0.87 |
| 113 | MHWVR | 5 | 5 | 23 | 19.50 | 3.89 | 45.42 | 0 | −0.124 | 0.43 |
| 114 | LNNFY | 5 | 5 | 21 | 18.75 | 3.34 | 39.18 | 0 | 0.051 | 0.92 |
| 115 | VHWYQ | 5 | 5 | 21 | 20.25 | 3.16 | 37.21 | 0 | −0.058 | 1.12 |
| 116 | GTDFT | 5 | 6 | 18 | 17.40 | 3.04 | 48.96 | 0 | −0.054 | 1.12 |
| 117 | PGLVR | 5 | 5 | 17 | 17.50 | 5.49 | 47.26 | 4 | −0.048 | 2.26 |
| 118 | GGLVQ | 5 | 5 | 16 | 17.50 | 4.23 | 48.42 | 0 | 0.043 | −0.63 |
| 119 | WSWIR | 5 | 5 | 23 | 17.70 | 3.43 | 45.05 | 0 | −0.140 | 0.89 |
| 120 | WSWVR | 5 | 5 | 22 | 17.70 | 3.44 | 45.08 | 0 | −0.123 | 0.81 |
| 121 | GGLVQ | 5 | 5 | 16 | 17.50 | 4.23 | 48.42 | 0 | 0.043 | −0.63 |
| 122 | LAWYQ | 4 | 4 | 18 | 15.25 | 2.69 | 27.72 | 0 | −0.064 | 1.19 |
| 123 | MHWVR | 5 | 5 | 23 | 19.50 | 3.89 | 45.42 | 0 | −0.124 | 0.43 |
| 124 | GDRVT | 5 | 6 | 20 | 18.70 | 2.69 | 48.02 | 0 | −0.035 | 1.66 |
| 125 | GASVK | 5 | 4 | 16 | 13.70 | 2.53 | 38.72 | 0 | 0.081 | 0.88 |
| 126 | EEQYN | 5 | 7 | 24 | 23.75 | 0.57 | 39.01 | 0 | 0.107 | 4.32 |
| 127 | VSNRP | 5 | 5 | 19 | 19.70 | 3.83 | 47.65 | 4 | 0.061 | 4.13 |
| 128 | QMNSL | 5 | 5 | 22 | 20.20 | 3.04 | 48.18 | 0 | 0.084 | 1.16 |
| 129 | VVSVL | 5 | 5 | 17 | 15.70 | 6.18 | 48.63 | 0 | −0.075 | 0.00 |
| 130 | SSGLY | 5 | 5 | 19 | 17.15 | 3.22 | 39.12 | 0 | −0.018 | 0.89 |
| 131 | NQFSL | 5 | 5 | 21 | 19.70 | 3.28 | 48.73 | 0 | 0.079 | 0.84 |
| 132 | EVTHQ | 5 | 6 | 21 | 22.20 | 2.49 | 48.40 | 0 | 0.031 | 1.92 |
| 133 | QAPGK | 5 | 4 | 15 | 16.00 | 2.66 | 38.18 | 4 | 0.134 | 4.63 |
| 134 | PREAK | 5 | 5 | 20 | 17.50 | 2.05 | 37.37 | 4 | 0.054 | 6.16 |
| 135 | QKPGQ | 6 | 5 | 20 | 21.50 | 2.70 | 47.81 | 4 | 0.195 | 5.36 |
| 136 | LTISS | 5 | 5 | 20 | 17.10 | 4.11 | 48.93 | 0 | −0.041 | −0.08 |
| 137 | PSQTL | 5 | 5 | 18 | 19.40 | 4.25 | 48.41 | 4 | 0.029 | 3.59 |
| 138 | PGGSL | 5 | 5 | 13 | 16.20 | 4.98 | 48.32 | 4 | 0.004 | 2.10 |
| 139 | QPPGK | 6 | 5 | 16 | 19.50 | 4.55 | 47.54 | 8 | 0.122 | 7.10 |
| 140 | QLPGT | 5 | 5 | 16 | 18.70 | 4.53 | 48.25 | 4 | 0.032 | 3.08 |
| 141 | QPPGR | 5 | 5 | 16 | 20.50 | 4.53 | 46.86 | 8 | 0.054 | 7.11 |
| 142 | PGRSL | 5 | 5 | 18 | 18.20 | 4.39 | 47.43 | 4 | −0.015 | 3.47 |
| 143 | QKPGK | 7 | 5 | 22 | 20.00 | 2.73 | 47.66 | 4 | 0.201 | 5.51 |
| 144 | QAPGQ | 4 | 4 | 13 | 17.50 | 2.63 | 38.32 | 4 | 0.128 | 4.24 |
| 145 | QHPGK | 6 | 5 | 19 | 21.00 | 3.11 | 47.70 | 4 | 0.122 | 4.86 |
| 146 | ITCRA | 4 | 4 | 19 | 14.20 | 3.14 | 37.69 | 0 | −0.031 | 0.60 |
| 147 | VSCKA | 5 | 4 | 18 | 13.20 | 3.04 | 38.41 | 0 | 0.069 | 0.62 |
| 148 | STYRV | 5 | 5 | 23 | 19.15 | 2.79 | 38.18 | 0 | −0.034 | 1.62 |
Sequence sequence named according to IUPAC recommendations (IUPAC 1983), AM number of non-conjugated amine groups, AC number of carboxylic acid groups, ROT number of non-trivial (not CX3), non-hindered (not alkene, amide, small ring) rotatable bonds, HBA hydrogen bond acceptors, QPCaco predicted apparent Caco-2 cell permeability, IP ionization potential, eV electronovolts, NON number of ring atoms not able to form conjugated aromatic systems, QPlogS solubility, AEx arithmetic expression value Ln(AM − IP + AC × ROT) − (QPCaco − NON)
An arithmetic mean and standard deviations of physicochemical parameters used for arithmetic expression (AEx; Ln(AM − IP + AC × ROT) − (QPCaco − NON)) determination
| Analyzed sequences | Arithmetic mean; standard deviation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| AM | AC | ROT | HBA | QPCaco [nm/s] | IP [eV] | NON | QPlogS | AEx | |
| Aggregation prone regions | 6.0; 2.0 | 6.0; 2.0 | 23.0; 7.0 | 19.64; 6.01 | 5.84; 2.17 | 49.70; 13.96 | 0.0; 1.0 | −0.095; 0.095 | −1.21; 2.02 |
| Tregitopes | 17.0; 3.0 | 17.0; 4.0 | 63.0; 14.0 | 58.98; 12.61 | 11.83; 2.88 | 145.77; 32.73 | 4.0; 2.0 | −0.003; 0.113 | −0.68; 3.40 |
| Control sequences | 5.0; 1.0 | 5.0; 1.0 | 19.0; 3.0 | 18.17; 2.30 | 3.44; 1.04 | 43.27; 5.65 | 2.0; 2.0 | 0.015; 0.084 | 2.23; 2.03 |
AM number of non-conjugated amine groups, AC number of carboxylic acid groups, ROT number of non-trivial (not CX3), non-hindered (not alkene, amide, small ring) rotatable bonds, HBA hydrogen bond acceptors, QPCaco predicted apparent Caco-2 cell permeability, IP ionization potential, eV electronovolt, NON number of ring atoms not able to form conjugated aromatic systems, QPlogS solubility
Fig. 1A relationship between hydrogen bond acceptors (HBA) and ionization potential (IP-[eV – electronovolts]). APR aggregation prone regions (○; n = 84); tregitope sequences (□; n = 22), CS control set extracted from tregitopes (●; n = 42)
Fig. 2A separation of 4 classes (a, b, c, d) of dependencies and the relationship between aqueous solubility (QPlogS) and arithmetic expression value (AEx), Ln(AM − IP + AC × ROT) − (QPCaco − NON). APR aggregation prone regions (○; n = 84), CS control set extracted from tregitopes (●; n = 42), AC number of carboxylic acid groups, AM number of non-conjugated amine groups, IP ionization potential, ROT number of non-trivial (not CX3), non-hindered (not alkene, amide, small ring) rotatable bonds, QPCaco predicted apparent Caco-2 cell permeability, NON number of ring atoms not able to form conjugated aromatic systems
Summary of statistics for arithmetic expression value (AEx) Ln(AM − IP + AC × ROT) − (QPCaco − NON) and solubility (APlogS) of each sequence (aggregation prone regions and control set) in classes A, B, C, D
| Class | ||||||||
|---|---|---|---|---|---|---|---|---|
| A | B | C | D | |||||
| Aggregation prone regions | ||||||||
| Parameters | AEx | QPlogS | AEx | QPlogS | AEx | QPlogS | AEx | QPlogS |
| M | na. | na. | 0.313 | 0.015 | 0.103 | −0.012 | −1.629 | −0.098 |
| SD | na. | na. | 0.420 | 0.082 | 0.990 | 0.041 | 1.167 | 0.044 |
| CI low | na. | na. | 0.039 | −0.039 | −0.583 | −0.041 | −1.909 | −0.108 |
| CI high | na. | na. | 0.587 | 0.068 | 0.789 | 0.017 | −1.350 | −0.087 |
| n | 0 | 9 | 8 | 67 | ||||
| SE | na. | na. | 0.140 | 0.027 | 0.350 | 0.015 | 0.143 | 0.005 |
| Control set sequences | ||||||||
| M | −0.029 | 0.002 | 1.733 | 0.041 | 0.472 | −0.025 | −0.002 | −0.001 |
| SD | 0.000 | 0.000 | 2.060 | 0.052 | 0.801 | 0.046 | 0.000 | 0.000 |
| CI low | na. | na. | 0.873 | 0.019 | 0.091 | −0.046 | na. | na. |
| CI high | na. | na. | 2.594 | 0.062 | 0.853 | −0.003 | na. | na. |
| n | 2 | 22 | 17 | 1 | ||||
| SE | na. | na. | 0.439 | 0.011 | 0.194 | 0.011 | na. | na. |
Groups groups of parameters depicted in Fig. 2, M arithmetic mean, SD standard deviation, CI low lower 95 % confidence interval for M, CI high higher 95 % confidence interval for M, n number of sequences observed in specific class (A, B, C, D), SE standard error, na. not applicable, QPlogs >0 and AEx <0 (class A), QPlogs ≥0 and AEx >0 (class B), QPlogs <0 and AEx ≥0 (class C), QPlogs ≤0 and AEx ≤0 (class D)