| Literature DB >> 23039758 |
Noah M Daniels1, Shilpa Nadimpalli, Lenore J Cowen.
Abstract
BACKGROUND: The quality of multiple protein structure alignments are usually computed and assessed based on geometric functions of the coordinates of the backbone atoms from the protein chains. These purely geometric methods do not utilize directly protein sequence similarity, and in fact, determining the proper way to incorporate sequence similarity measures into the construction and assessment of protein multiple structure alignments has proved surprisingly difficult.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23039758 PMCID: PMC3585936 DOI: 10.1186/1471-2105-13-259
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Formatt frame-offset repair example. Example of Formatt’s frame-offset repair on a subset (residues 37-50 of chain A of PDB ID 1c9f, and residues 64-76 of chain A of PDB ID 1d4b) of the HOMSTRAD “CIDE-N” group. In both sequence and structural alignments, difference between Matt and Formatt are shown in orange and green; red and blue regions are αand βstructures aligned identically by Matt and Formatt. Note that the Formatt alignment has fewer non-core residues (three) than Matt (five).
HOMSTRAD multiple alignments (all values are averages)
| HOMSTRAD | 126.8 | 2.71 | 1.37 | 1.82 | 1.60 | 136.2 | 2.13 | (100%) |
| Mustang | 152.8 | 3.60 | 1.54 | 1.86 | 1.70 | 165.3 | 2.26 | |
| Matt | 1.72 | 1.54 | 1.55 | 1.55 | 189.0 | 2.34 | 73.4% | |
| SAlign | 172.6 | 2.29 | 1.79 | 2.86 | 2.32 | 2.45 | 78.1% | |
| (Promals3D) | - | 1.55 | - | - | 43% | |||
| Formatt (mafft) | 148.2 | 155.6 | 2.18 | 78.7% | ||||
| Formatt (clustalw) | 169.3 | 1.72 | 1.47 | 1.55 | 1.51 | 182.0 | 72.9% | |
| Formatt (muscle) | 169.6 | 1.72 | 1.48 | 1.55 | 1.51 | 179.1 | 2.15 | 72.4% |
| Formatt (probcons) | 168.9 | 1.73 | 1.50 | 1.55 | 1.53 | 178.8 | 2.18 | 73.6% |
SABMark Twilight Zone multiple alignments (all values are averages)
| Mustang | 63.4 | 4.12 | 3.92 | 3.05 | 3.49 | 97.6 | 4.46 |
| Matt | 2.64 | 4.15 | 2.29 | 3.35 | 84.0 | 4.58 | |
| SAlign | 59.6 | 3.51 | 3.88 | 2.83 | 3.36 | 4.34 | |
| (Promals3D) | - | 3.89 | - | - | 4.39 | ||
| Formatt (mafft) | 45.01 | 54.0 | |||||
| Formatt (clustalw) | 64.05 | 2.75 | 3.98 | 2.32 | 3.15 | 76.2 | 4.49 |
| Formatt (muscle) | 64.37 | 2.77 | 3.99 | 2.33 | 3.16 | 76.0 | 4.55 |
| Formatt (probcons) | 64.5 | 2.97 | 4.00 | 2.44 | 3.22 | 75.6 | 4.54 |
Figure 2Staccato conservation score vs. alignment length. Separation of 1,000 domain pairs where both domains are in the same SCOP family, and 1,000 domain pairs where both domains are in different SCOP families, along the dimensions of Staccato conservation score and core length of the pairwise alignment.