| Literature DB >> 15804363 |
Markus Koschorreck1, Markus Fischer, Sandra Barth, Jürgen Pleiss.
Abstract
BACKGROUND: In screening of libraries derived by expression cloning, expression of active proteins in E. coli can be limited by formation of inclusion bodies. In these cases it would be desirable to enrich gene libraries for coding sequences with soluble gene products in E. coli and thus to improve the efficiency of screening. Previously Wilkinson and Harrison showed that solubility can be predicted from amino acid composition (Biotechnology 1991, 9(5):443-448). We have applied this analysis to members of the alpha/beta hydrolase fold family to predict their solubility in E. coli. alpha/beta hydrolases are a highly diverse family with more than 1800 proteins which have been grouped into homologous families and superfamilies.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15804363 PMCID: PMC1079826 DOI: 10.1186/1471-2164-6-49
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of CV-CV' of bacterial (blue), eukaryotic (green), and all (black) α/β-hydrolases.
Figure 2Distribution of sequence length of bacterial (blue), eukaryotic (green), and all (black) α/β-hydrolases.
Figure 3Correlation between CV-CV' and the length of protein sequences in bacterial (blue), eukaryotic (green), and archaean (red) α/β-hydrolases.
Hydrolases from 171 to 379 amino acids sorted by taxa of their host n is the number of hydrolases in the group, charge the charge per residue, turn the relative number of turn- forming residues. Only groups with at least eight hydrolases were included.
| d-proteobacteria | 8 | 0.02 | -0.87 | -0.062 | 0.230 | 267 |
| enterobacteria | 29 | 0.02 | -0.28 | -0.061 | 0.229 | 287 |
| a-proteobacteria | 66 | 0.14 | -0.26 | -0.054 | 0.223 | 300 |
| actinobacteria | 127 | 0.23 | -0.49 | -0.060 | 0.240 | 291 |
| proteobacteria | 302 | 0.28 | -0.23 | -0.052 | 0.230 | 293 |
| eubacteria | 558 | 0.29 | -0.28 | -0.050 | 0.229 | 288 |
| archaea | 22 | 0.30 | -0.43 | -0.052 | 0.229 | 272 |
| g-proteobacteria | 149 | 0.31 | -0.27 | -0.052 | 0.232 | 291 |
| b-proteobacteria | 46 | 0.55 | -0.04 | -0.044 | 0.236 | 298 |
| plants | 52 | 0.57 | -0.23 | -0.048 | 0.239 | 301 |
| ascomycetes | 40 | 0.89 | 0.32 | -0.046 | 0.257 | 267 |
| fungi | 43 | 0.91 | 0.37 | -0.046 | 0.258 | 268 |
| eukaryota | 235 | 0.94 | 0.36 | -0.040 | 0.250 | 284 |
| chordata | 72 | 0.98 | 0.85 | -0.036 | 0.247 | 269 |
| mammalia | 65 | 1.00 | 0.88 | -0.035 | 0.245 | 267 |
| metazoa | 140 | 1.10 | 0.78 | -0.035 | 0.252 | 283 |
| arthropoda | 48 | 1.38 | 0.86 | -0.031 | 0.263 | 300 |
Hydrolases from 380 to 519 amino acids sorted by taxa of their host n is the number of hydrolases in the group, charge the charge per residue, turn the relative number of turn- forming residues. Only groups with at least eight hydrolases were included.
| actinobacteria | 15 | 0.30 | -0.33 | -0.060 | 0.246 | 439 |
| ascomycetes | 15 | 0.52 | 0.09 | -0.047 | 0.235 | 447 |
| fungi | 21 | 0.67 | 0.25 | -0.046 | 0.242 | 435 |
| eubacteria | 79 | 0.84 | 0.30 | -0.044 | 0.251 | 437 |
| proteobacteria | 38 | 0.89 | 0.44 | -0.042 | 0.250 | 436 |
| g-proteobacteria | 11 | 0.96 | 0.59 | -0.055 | 0.279 | 454 |
| arthropoda | 40 | 1.04 | 0.50 | -0.034 | 0.246 | 441 |
| eukaryota | 165 | 1.04 | 0.72 | -0.035 | 0.246 | 443 |
| a-proteobacteria | 19 | 1.08 | 0.56 | -0.030 | 0.239 | 439 |
| metazoa | 136 | 1.08 | 0.76 | -0.033 | 0.245 | 444 |
| chordata | 76 | 1.25 | 0.99 | -0.030 | 0.249 | 448 |
| mammalia | 71 | 1.25 | 0.98 | -0.031 | 0.251 | 446 |
| plants | 8 | 1.42 | 1.10 | -0.034 | 0.268 | 452 |
Hydrolases larger than 519 amino acids sorted by taxa of their host n is the number of hydrolases in the group, charge the charge per residue, turn the relative number of turn- forming residues. Only groups with at least eight hydrolases were included.
| arthropoda | 98 | 0.75 | 0.51 | -0.048 | 0.251 | 606 |
| metazoa | 262 | 0.85 | 0.56 | -0.046 | 0.254 | 649 |
| eukaryota | 279 | 0.88 | 0.58 | -0.045 | 0.255 | 644 |
| mammalia | 95 | 0.88 | 0.56 | -0.046 | 0.256 | 720 |
| chordata | 108 | 0.90 | 0.58 | -0.046 | 0.257 | 705 |
| enterobacteria | 8 | 0.99 | 0.74 | -0.054 | 0.279 | 633 |
| eubacteria | 49 | 1.12 | 0.76 | -0.047 | 0.276 | 654 |
| g-proteobacteria | 32 | 1.17 | 0.79 | -0.049 | 0.285 | 643 |
| ascomycetes | 13 | 1.21 | 0.88 | -0.046 | 0.278 | 552 |
| proteobacteria | 35 | 1.22 | 0.79 | -0.048 | 0.286 | 638 |
| fungi | 14 | 1.25 | 0.88 | -0.045 | 0.279 | 553 |