| Literature DB >> 23157214 |
Constantin Vogel1, Michael Widmann, Martina Pohl, Jürgen Pleiss.
Abstract
BACKGROUND: Standard numbering schemes for families of homologous proteins allow for the unambiguous identification of functionally and structurally relevant residues, to communicate results on mutations, and to systematically analyse sequence-function relationships in protein families. Standard numbering schemes have been successfully implemented for several protein families, including lactamases and antibodies, whereas a numbering scheme for the structural family of thiamine-diphosphate (ThDP) -dependent decarboxylases, a large subfamily of the class of ThDP-dependent enzymes encompassing pyruvate-, benzoylformate-, 2-oxo acid-, indolpyruvate- and phenylpyruvate decarboxylases, benzaldehyde lyase, acetohydroxyacid synthases and 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexadiene-1-carboxylate synthase (MenD) is still missing.Despite a high structural similarity between the members of the ThDP-dependent decarboxylases, their sequences are diverse and make a pairwise sequence comparison of protein family members difficult.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23157214 PMCID: PMC3534367 DOI: 10.1186/1471-2091-13-24
Source DB: PubMed Journal: BMC Biochem ISSN: 1471-2091 Impact factor: 4.059
The set of 16 representative proteins used for establishing a standard numbering scheme
| pyruvate decarboxylase | 2VK8 | |
| 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cycloheaxdiene-1-carboxylate synthase | 2JLC | |
| pyruvate decarboxylase | 1ZPD | |
| branched-chain keto acid decarboxylase | 2VBF | |
| benzoylformate decarboxylase | 1BFD | |
| carboxyethylarginine synthase | 2IHT | |
| cyclohexene-1,2-dione hydrolase | 2PGN | |
| oxalyl-CoA decarboxylase | 2C31 | |
| pyruvate oxidase | 1V5F | |
| pyruvate dehydrogenase | 3EYA | |
| indolepyruvate decarboxylase | 1OVM | |
| acetohydroxyacid synthase | 1JSC | |
| acetohydroxyacid synthase | 1YBH | |
| acetohydroxyacid synthase | 1OZF | |
| benzaldehyde lyase | 2AG0 | |
| glyoxylate carboligase | 2PAN |
Of each PDB entry, chain A was used for alignment. It was verified that for all proteins chain A corresponds to the catalytic subunit.
Figure 1Alignment of a query sequence and the reference sequence from the web interface of the numbering method. Alignment of a query sequence (here: branched-chain alpha-ketoacid decarboxylase from L. lactis, genbank: 75369656) to the reference sequence (ScPDC, swissprot: P06169, PDB: 2VK8). By positioning the cursor on an amino acid (here: proline), the standard numbering (here: 168) as derived from the reference sequence and the absolute numbering of the respective query sequence (here: 165) as well as annotation information (here: end of the PYR domain) are displayed. All residues are highlighted for which annotation information is available in the TEED [1].
Figure 2Analysis of accordance of two multisequence alignments. The comparison of columns of two multisequence alignments of 15 sequences using the numbering method and T-Coffee revealed five types of column similarity. 48% of the investigated columns were identical in both alignments, 25% of the columns were “highly similar” (up to 3 mismatches out of 15 sequences), 5% were “similar” (4 – 8 mismatches), 12% of the columns had 9 to 12 mismatches and are therefore called “dissimilar” and 10% of the columns showed more than 12 mismatches (“divergent”).