| Literature DB >> 19534748 |
Giovanni Minervini1, Giuseppe Evangelista, Laura Villanova, Debora Slanzi, Davide De Lucrezia, Irene Poli, Pier Luigi Luisi, Fabio Polticelli.
Abstract
BACKGROUND: The number of natural proteins represents a small fraction of all the possible protein sequences and there is an enormous number of proteins never sampled by nature, the so called "never born proteins" (NBPs). A fundamental question in this regard is if the ensemble of natural proteins possesses peculiar chemical and physical properties or if it is just the product of contingency coupled to functional selection. A key feature of natural proteins is their ability to form a well defined three-dimensional structure. Thus, the structural study of NBPs can help to understand if natural protein sequences were selected for their peculiar properties or if they are just one of the possible stable and functional ensembles.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19534748 PMCID: PMC2697646 DOI: 10.1186/1471-2105-10-S6-S22
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Amino acid composition of NBPs and natural proteins. A) Comparison between the amino acid composition of NBPs (blue bars) and that of selected natural proteins of comparable length (purple bars; see Methods). B) Comparison between the amino acid composition of the natural proteins subset used in this work (purple bars) and that of the whole UniProt dataset [2] (green bars).
Mean and Standard deviation values of amino acids relative content in the NBPs dataset.
| 4.90 | 2.57 | |
| 4.90 | 2.53 | |
| 4.97 | 2.63 | |
| 4.92 | 2.62 | |
| 5.02 | 2.62 | |
| 4.90 | 2.57 | |
| 4.99 | 2.63 | |
| 4.89 | 2.57 | |
| 4.89 | 2.59 | |
| 4.96 | 2.59 | |
| 4.90 | 2.53 | |
| 4.92 | 2.57 | |
| 4.93 | 2.59 | |
| 4.99 | 2.63 | |
| 4.92 | 2.59 | |
| 4.89 | 2.54 | |
| 4.96 | 2.62 | |
| 4.92 | 2.59 | |
| 4.93 | 2.62 | |
| 4.96 | 2.57 |
Hydrophobic amino acids relative content of natural proteins and NBPs.
| 33.35 | 25.25 | |
| 7.49 | 14.91 | |
| 4.45 | 1.69 |
Figure 2Genius grid services. Screenshots of the GENIUS grid portal [13] showing services for the specification of the number of structure predictions to run (A), of the input and output files (B) and for the inspection of the parametric JDL file (C).
Figure 3Sample NBPs predicted three-dimensional structures. Schematic representation of the three-dimensional structure of randomly chosen NBPs. α helices are coloured in magenta, β strands in yellow.
Figure 4Sample natural proteins predicted three-dimensional structures. Schematic representation of the experimental (left panel) and predicted (right panel) three-dimensional structures of randomly chosen natural proteins. α helices are coloured in magenta, β strands in yellow. Protein Data Bank [11] identification codes are indicated on the left for reference.
Average valuesa of the structure-related parameters calculated for natural proteins and NBPs.
| 3920.0 | (625.0) | 3955.6 | (239.8) | |
| 8629.0 | (1191.6) | 9294.2 | (359.2) | |
| 0.46 | (0.07) | 0.43 | (0.03) | |
| 21.4 | (17.9) | 31.0 | (8.8) | |
| 14.4 | (12.2) | 7.3 | (4.3) | |
| 25.6 | (13.1) | 24.7 | (9.9) | |
| 0.36 | (0.07) | 0.38 | (0.05) | |
aStandard deviation values are given in parenthesis.
Figure 5Structural properties of NBPs and natural proteins. A) Surface, volume and surface/volume ratio distribution for NBPs and natural proteins; B) Secondary structure content of NBPs and natural proteins. In this and in the following figure, boxplots are shown in the right panels
Figure 6Surface properties of NBPs and natural proteins. A) Surface hydrophobicity of NBPs and natural proteins; B) Amino acids solvent accessibility for NBPs (blue) and natural proteins (purple).