| Literature DB >> 27330544 |
Jianbo Lu1, Guoliang Xu2, Shihua Zhang2, Benzhuo Lu2.
Abstract
BACKGROUND: Superpositioning is an important problem in structural biology. Determining an optimal superposition requires a one-to-one correspondence between the atoms of two proteins structures. However, in practice, some atoms are missing from their original structures. Current superposition implementations address the missing data crudely by ignoring such atoms from their structures.Entities:
Keywords: Iterative closest point; Protein structure alignment; Superposition
Year: 2016 PMID: 27330544 PMCID: PMC4915111 DOI: 10.1186/s13015-016-0079-3
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Fig. 1The flow chart of our pairwise protein structure superposition algorithm for missing data (PSSM)
Fig. 2Illustration of the principal axes transform for aligning two two-dimensional shapes
The superposition results of PSSM for two identical protein structures with one randomly generated by a rotation from another one
| Structure data | Time (s) | RMSD (Å) |
|---|---|---|
|
| 317.8 |
|
|
| 1161.4 |
|
|
| 27.3 |
|
|
| 2.3 |
|
Comparison between PSSM and LS
| Structure name | Time (s) | RMSD (Å) | ||
|---|---|---|---|---|
| id1(size)–id2(size) | LS | PSSM | LS | LS |
| d1cih (108)–d1lfma (103) | 0.002 | 0.331 | 0.6 | 0.6 |
| d1cih (108)–d2pcbb (104) | 0.007 | 9.418 | 0.7 | 0.7 |
| d1cih (108)–d1m60a (104) | 0.006 | 21.484 | 1.2 | 1.2 |
| d2pcbb (104)–d1m60a (104) | 0.007 | 7.394 | 1.3 | 1.3 |
| d1cih (108)–d1kyow (108) | 0.002 | 3.768 | 0.7 | 0.7 |
Comparison of PSSM with -match and CPSARST
| PDB/SCOP entries | PSSM |
| CPSARST | |||
|---|---|---|---|---|---|---|
| id1(size)–id2(size) | Aligned | RMSD (Å) | Aligned | RMSD (Å) | Aligned | RMSD (Å) |
| 1nls (237)–2bqpA (228) | 228 | 1.4 | 214 | 1.3 | 218 | 1.4 |
| 1glh (214)–1cpn (208) | 208 | 0.7 | 206 | 0.5 | 206 | 0.5 |
| 1yadA (190)–2duaA (283) | 190 | 2.6 | 130 | 1.7 | 151 | 2.4 |
| 1zbdA (177)–1pujA (261) | 177 | 3.2 | 113 | 1.5 | 130 | 3.2 |
| d1nkla (78)–d1qdma1 (77) | 77 | 2.6 | 49 | 1.4 | 70 | 2.4 |
“Aligned” means how many residues were aligned
Comparison of PSSM with CCP4 and SuperPose
| PDB/SCOP entries | PSSM | CCP4 | SuperPose | |||
|---|---|---|---|---|---|---|
| id1(size)–id2(size) | Aligned | RMSD (Å) | Aligned | RMSD (Å) | Aligned | RMSD (Å) |
| 1nls_ (237)–2bqpA (228) | 228 | 1.4 | 114 | 1.0 | 205 | 18.1 |
| 1glh_ (214)–1cpn_ (208) | 208 | 0.7 | 156 | 0.4 | 156 | 0.4 |
| 1yadA (190)–2duaA (283) | 190 | 2.6 | 157 | 2.4 | 183 | 10.6 |
| 1zbdA (177)–1pujA (261) | 177 | 3.2 | 97 | 2.0 | 177 | 20.0 |
“Aligned” means how many residues were aligned
Performance comparison on Fischer’s dataset
| Fischer’s dataset (67 of 68 pairs) | DALI | MATT | PSSM |
|---|---|---|---|
| aveAligned | 155 | 152 | 186 |
| aveRMSD (Å) | 2.77 | 2.87 | 2.90 |
Fig. 3Superposition RMSD of d1cih and one of its rotated configuration with a number of deleted (a) atoms or (b) main chain atoms, respectively. The atoms are deleted in a random (blue color) or an ordered manner in the first 100 atoms (red color), respectively
The RMSD of pairwise superposition between d1cih and others with PSSM for cytochrome C
| PDB-id1 (size)–PDB-id2 (size) | Time (s) | RMSD (Å) |
|---|---|---|
| d1cih (835)–d1crj (847) | 2.301 | 0.3829 |
| d1cih (835)–d1csu (846) | 2.685 | 0.3881 |
| d1cih (835)–d1csx (846) | 2.674 | 0.4852 |
| d1cih (835)–d1yeb (847) | 3.108 | 0.7979 |
| d1cih (835)–d1kyow (850) | 48.480 | 0.9363 |
| d1cih (835)–d1lfma (800) | 6.399 | 1.0420 |
| d1cih (835)–d2pcbb (823) | 424.890 | 1.1760 |
| d1cih (835)–d1u74d (847) | 1196.996 | 0.8338 |
| d1cih (835)–d1m60a (819) | 754.727 | 1.4786 |
The RMSD of pairwise superposition between 2pka and others with PSSM for serine proteinases data set
| PDB-id1 (size)–PDB-id2 (size) | Time (s) | RMSD (Å) |
|---|---|---|
| 2pka (232)–3est (240) | 397.0237 | 1.5222 |
| 2pka (232)–1ton (227) | 300.4844 | 1.3310 |
| 2pka (232)–3rp2 (224) | 460.3419 | 1.5825 |
| 2pka (232)–4ptp (223) | 236.3811 | 1.1994 |
| 2pka (232)–5cha (236) | 454.4466 | 1.7583 |
| 2pka (232)–1ppb (295) | 542.5367 | 2.9835 |
The RMSD of pairwise superposition between 2dhbb and others with PSSM for Globins data set
| PDB-id1 (size)–PDB-id2 (size) | Time (s) | RMSD (Å) |
|---|---|---|
| 2dhbb (146)–1hhoa (141) | 26.9685 | 1.4944 |
| 2dhbb (146)–1hhob (146) | 0.2768 | 1.0898 |
| 2dhbb (146)–2dhba (141) | 43.1975 | 1.4393 |
| 2dhbb (146)–1mbd (153) | 15.4869 | 1.4735 |
Comparison of PSSM with MUSTANG using the Globins and Serine Proteinases data sets
| Data sets | PDB codes | PSSM | MUSTANG | ||
|---|---|---|---|---|---|
| RMSD (Å) | Aligned | RMSD (Å) | Aligned | ||
| Globins (5) | 1hhoa, 2dhba, 1hhob, 2dhbb, 1mbd | 1.37 | 141 | 1.41 | 139 |
| Serine Proteinases (7) | 3est, 2pka, 1ton, 3rp2, 4ptp, 5cha, 1ppb | 1.72 | 223 | 1.56 | 205 |