| Literature DB >> 19758436 |
Neil D Rawlings1, Alex Bateman.
Abstract
BACKGROUND: Peptidase family A1, to which pepsin belongs, had been assumed to be restricted to eukaryotes. The tertiary structure of pepsin shows two lobes with similar folds and it has been suggested that the gene has arisen from an ancient duplication and fusion event. The only sequence similarity between the lobes is restricted to the motif around the active site aspartate and a hydrophobic-hydrophobic-Gly motif. Together, these contribute to an essential structural feature known as a psi-loop. There is one such psi-loop in each lobe, and so each lobe presents an active Asp. The human immunodeficiency virus peptidase, retropepsin, from peptidase family A2 also has a similar fold but consists of one lobe only and has to dimerize to be active. All known members of family A1 show the bilobed structure, but it is unclear if the ancestor of family A1 was similar to an A2 peptidase, or if the ancestral retropepsin was derived from a half-pepsin gene. The presence of a pepsin homologue in a prokaryote might give insights into the evolution of the pepsin family.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19758436 PMCID: PMC2761423 DOI: 10.1186/1471-2164-10-437
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Homologues of pepsin from completely sequenced bacterial genomes
| CP000083 | AAZ24813 | CPS_0549 | c(546351..547676) | |
| NC_009654 | YP_001340503 | Mmwyl1_1642 | 1847648..1848868 | |
| CP000507 | ABL98994 | Sama_0787 | c(969066..970403) | |
| CP000302 | ABE54094 | Sden_0804 | 923752..925041 | |
| CP000606 | ABO24862 | Shew_2996 | 3556266..3557951 | |
| CP000821 | ABV38171 | Ssed_3567 | 4347419..4348963 | |
| CP000738 | ABR61223 | Smed_3492 | 2467002..2468108 |
Members of peptidase family A1 found in the proteomes derived from completely sequenced bacterial genomes. Base pairs in parentheses preceded by the letter "c" are on the complimentary strand with respect to the database entry.
Predicted characteristics of bacterial homologues of pepsin
| 441 | D36, Y68, D300 | 6.975 (14.690) | |
| 406 | D49, F95, D274 | 13.692 (13.678) | |
| 445 | D37, Y69, D282 | 4.976 (13.455) | |
| 429 | D37, Y69, D287 | 8.983 (10.423) | |
| 561 | D41, Y73, D399 | 3.441 (15.338) | |
| 514 | D41, Y88, D364 | 12.942 (15.906) | |
| 368 | D53,F97,D243 | 5.649 |
Predicted active site residues are numbered according to the translation of the respective coding sequence. The threshold scores for Alien Hunter analysis are given in parenthesis after the raw score; a different threshold exists for each genome analysis. The pepsin homologue in Sinorhizobium medica is derived from a gene that is split between two 5000-residue windows in the Alien Hunter analysis, hence there are two scores.
Bacterial pepsin homologues in environmental samples
| AACY023084992 | Eastern Pacific ocean surface water | 38% | |
| AACY023056044 | Eastern Pacific ocean surface water | 31% | |
| AACY024067159 | Eastern Pacific ocean surface water | 43% | |
| AAFZ01015571 | microbial mat from gray whale carcass in the Santa Cruz Basin, Pacific Ocean (depth 1674 metres) | 37% |
Figure 1Alignment of bacterial pepsin homologues with human pepsin A and memapsins 1 and 2. Residues are numbered according to mature human pepsin A. Inserts relative to pepsin A are indicated by letters. Each active site residue (Asp32, Tyr75 and Asp215) is indicated by an asterisk. The hydrophobic-hydrophobic-Gly motifs in the psi-loops are indicated by ampersands. A disulphide bridge is indicated by a slash over one cysteine followed by a number and a backslash preceded by the same number over the second cysteine. The Chroma software [37] has been used to highlight residues according to amino acid properties and to generate the consensus line below each alignment block showing 80% conservation or more. Inserts found in only one sequence have been removed and are indicated by the number of amino acids excised in parentheses.
Figure 2Phylogenetic tree derived from members of peptidase family A1. The tree was generated for all peptidase unit sequences of family A1 holotypes, plus those from the sequences of the bacteria listed in Table 1. The tree is unrooted, but the sequence of plasmepsin-5, which is very divergent, was chosen as the outgroup. Homologues from bacteria are highlighted with a blue background. Branches that were present in 90% of the bootstrap trees are shown in red. Key: [see Additional file 1].
Comparison of replacements among bacteria with pepsin homologues
| A1 (pepsin) | 55.5 | 42.4 | 43.6 | 41.3 | 17.1 | 15.5 |
| A8 (signal peptidase 2) | 70.8 | 72.4 | 67.6 | 48.3 | 35.8 | 46.4 |
| S16 (endpeptidase Clp) | 89.0 | 90.9 | 89.9 | 78.1 | 61.6 | 67.2 |
The protein sequences from Shewanella denitrificans were compared with orthologous protein sequences from other bacteria. Sequences were aligned with Muscle [30] and percentage identities calculated using Alistat .