| Literature DB >> 30667382 |
Viki Kumar Prasad1, Alberto Otero-de-la-Roza2, Gino A DiLabio1,3.
Abstract
We present an extensive and diverse database of peptide conformational energies. Our database contains five different classes of model geometries: dipeptides, tripeptides, and disulfide-bridged, bioactive, and cyclic peptides. In total, the database consists of 3775 conformational energy data points and 4530 conformer geometries. All the reference energies have been calculated at the LC-ωPBE-XDM/aug-cc-pVTZ level of theory, which is shown to yield conformational energies with an accuracy in the order of tenths of a kcal/mol when compared to complete-basis-set coupled-cluster reference data. The peptide conformational data set (PEPCONF) is presented as a high-quality reference set for the development and benchmarking of molecular-mechanics and semi-empirical electronic structure methods, which are the most commonly used techniques in the modeling of medium to large proteins.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30667382 PMCID: PMC6343515 DOI: 10.1038/sdata.2018.310
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Molecular structure of the amino acids and representative peptide model systems considered in this work.
(a) The classification of the twenty standard proteinogenic amino acids by the nature of their side-chains. The N-terminal and C-terminal are capped with acetyl and primary amide group, respectively. The single- and three-letter codes for each amino acid are also provided. (b) A representative candidate from each of the five different classes of peptide model systems considered in the PEPCONF data set.
A description of the DB-format file or the database-file format (.db) for a peptide system containing N number of atoms.
| Line | Column | Content |
|---|---|---|
| 1 | 1 | ‘ref’ string specifying the reference energy |
| 1 | 2 | reference conformational energy (peptide A - peptide B) (in kcal/mol) |
| 2 | 1 | ‘molc’ string specifying start of the first molecular block |
| 2 | 2 | unique integer identifier, 1 indicating peptide conformer A |
| 2 | 3 | charge of the conformer A |
| 2 | 4 | multiplicity of the conformer A |
| 3, …, N + 2 | 1 | element type |
| 3, …, N + 2 | 2 | X coordinates (in Å) |
| 3, …, N + 2 | 3 | Y coordinates (in Å) |
| 3, …, N + 2 | 4 | Z coordinates (in Å) |
| N + 3 | 1 | ‘end’ string specifying end of the first molecular block |
| N + 4 | 1 | ‘molc’ string specifying start of the second molecular block |
| N + 4 | 2 | unique integer identifier, −1 indicating peptide conformer B (with lower energy than A) |
| N + 4 | 3 | charge of the peptide conformer B |
| N + 4 | 4 | multiplicity of the peptide conformer B |
| N + 4, …, 2 N + 4 | 1 | element type |
| N + 4, …, 2 N + 4 | 2 | X coordinates (in Å) |
| N + 4, …, 2 N + 4 | 3 | Y coordinates (in Å) |
| N + 4, …, 2 N + 4 | 4 | Z coordinates (in Å) |
| 2 N + 5 | 1 | ‘end’ string specifying end of the second molecular block |