| Literature DB >> 30065210 |
Athena Andreou1, Petros Giastas2, Elias Christoforides3, Elias E Eliopoulos4.
Abstract
Functional and folding constraints impose interdependence between interacting sites along the protein chain that are envisaged through protein sequence evolution. Studying the influence of structure in phylogenetic models requires detailed and reliable structural models. Polysaccharide deacetylases (PDAs), members of the carbohydrate esterase family 4, perform mainly metal-dependent deacetylation of O- or N-acetylated polysaccharides such as peptidoglycan, chitin and acetylxylan through a conserved catalytic core termed the NodB homology domain. Genomes of Bacillus anthracis and its relative Bacillus cereus contain multiple genes of putative or known PDAs. A comparison of the functional domains of the recently determined PDAs from B. anthracis and B. cereus and multiple amino acid and nucleotide sequence alignments and phylogenetic analysis performed on these closely related species showed that there were distinct differences in binding site formation, despite the high conservation on the protein sequence, the folding level and the active site assembly. This may indicate that, subject to biochemical verification, the binding site-forming sequence fragments are under functionally driven evolutionary pressure to accommodate and recognize distinct polysaccharide residues according to cell location, use, or environment. Finally, we discuss the suggestion of the paralogous nature of at least two genes of B. anthracis, ba0330 and ba0331, via specific differences in gene sequence, protein structure, selection pressure and available localization patterns. This study may contribute to understanding the mechanisms under which sequences evolve in their structures and how evolutionary processes enable structural variations.Entities:
Keywords: Bacillus anthracis; Bacillus cereus; functional divergence; polysaccharide deacetylase; structural evolution
Year: 2018 PMID: 30065210 PMCID: PMC6115787 DOI: 10.3390/genes9080386
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Ribbon diagram of the Ba0331 polysaccharide deacetylase (PDA) protein architecture showing the two distinct domains. Helices are represented as cyan ribbons, β-strands are in magenta and loops are shown as orange strings. The Zn ion is shown as a grey sphere with the coordinating amino acid residues (Asp213, His271 and His275) in stick representation.
PDAs of the Bacillus anthracis str. Ames studied as well as designation, accession numbers, putative function, the localization prediction [51,52], the protein data base (PDB) ID and their homolog from Bacillus cereus ATCC14579.
| NCBI RefSeq: NC_003997.3 | NCBI RefSeq Protein (aa) | Possible Function | LocateP Data Base Prediction [ | PDB ID of Corresponding X-ray Crystal Structure | Homolog | |
|---|---|---|---|---|---|---|
| BA1961 | 1087084, (1847060, 1847887) | NP_844369 (275) | Peptidoglycan GlcNAcdeacetylase | Cytoplasmic/Intracellular (No CS) | – | BC1960, NP_831730 |
| BA3679 | 1089257, (3383280, 3383921, complement) | NP_845942 (213) | Peptidoglycan GlcNAcdeacetylase | Cytoplasmic/Intracellular (No CS) | – | BC3618, |
| BA3480 | 1083768, (3198882, 3201665, complement) | NP_845761 (927) | glycosyltransferase group two family protein/polysaccharide deacetylase | Membrane/multi-transmembrane (No CS) | – | – |
| BA1977 | 1086062, (1861493, 1862314) | NP_844383 (273) | Peptidoglycan GlcNAcdeacetylase | Membrane/N-terminally anchored (No CS) | – | BC1974, NP_831744 (273), 97.15 |
| BA2944 | 1085595, (2708639, 2709466) | NP_845280 (275) | Peptidoglycan GlcNAcdeacetylase | Extracellular/Secretory (released) (with CS) | – | BC2929, NP_832677 (275), 94.9 |
| BA5436 | 1085036, (4923836, 4924573) | NP_847604 (245) | Peptidoglycan GlcNAc deacetylase | Extracellular/Secretory (released) (with CS) | – | BC5204, NP_834868 (245), 93.5 |
| BA0424 | 1087807, (444231, 445013) | NP_842967 (260) | Peptidoglycan MurNAc deacetylase | Extracellular/Secretory (released) (with CS) | 2J13 | BC0467, NP_830306 (260), 98.5 |
| BA0150 | 1086778, (144110, 144874, complement) | NP_842717 (254) | Polysaccharide deacetylase | Membrane/N-terminally anchored (No CS) | 4M1B | BC0171, NP_830050 (254), 95.3 |
| BA1836 | 1086408, (1722907, 1723611, complement) | NP_844255 (234) | Polysaccharide deacetylase | Extracellular/Secretory (released) (with CS) | – | BC1768, NP_831543 (234), 91.9 |
| BA3943 | 1086849, (3618407, 3619306, complement) | NP_846187 (299) | Polysaccharide deacetylase | Extracellular/Secretory (released) (with CS) | – | BC3804, NP_833526 (299), 95.7 |
| BA0330 | 1085388, (338279, 339361, complement) | NP_842877 (360) | Polysaccharide deacetylase | Extracellular/Lipid anchored (with CS) | 4V33 | BC0361, NP_830200 (360),90.6 |
| BA0331 | 1085987, (339513, 340616, complement) | NP_842878 (367) | Polysaccharide deacetylase | Extracellular/Lipid anchored (with CS) | 6GO1 | _ |
CDS: coding sequence, CS: Cleavage Site, aa: the number of amino acid residues.
Figure 2Molecular phylogenetic analysis of the PDA NodB domain. (a). The unrooted tree of 23 protein sequences of the PDA NodB domain from B. anthracis strain Ames (Ba) and B. cereus strain ATCC14579 (Bc) by the Maximum Likelihood method and with the highest log likelihood (−4084.19) is shown. The sequences are clustered in three groups represented with different colours (Group 1 in black, Group 2 in red and Group 3 in blue). (b). The unrooted tree of the NodB domain of 80 PDA amino acid sequences (homologues of the B. anthracis strain Ames (Ba) and B. cereus strain ATCC14579 (Bc) given in Figure 1a), selected from six different Bacilii species by the Maximum Likelihood method and with the highest log likelihood (−5578.53) is shown. The percentage of trees where the associated taxa clustered together is shown next to the branches. Grouping of the orthologues with respect to the B. anthracis/B. cereus species is shown in different colours.
Figure 3Molecular Phylogenetic analysis of the PDA Fn3 domain. The unrooted tree of 117 amino acid sequences of the PDA Fn3 domain by the Maximum Likelihood method is shown. The tree with the highest log likelihood (−10,732.65) is shown. The sequences are clustered in two distinct groups (Ba0330 like (blue) and Ba0331 like (red). Ba0330, Bc0361 and Ba0331 are highlighted in yellow. Sequences are from the Bacilii species that have been found to contain PDAs with the Fn3 domain.
Figure 4Sequence alignment diagram of the NodB domain of 23 B. anthracis str. Ames and B. cereus str. ATCC14579 PDAs. Numbering is according to Ba0331. Proteins with known tertiary structures have their name coloured purple. Highly conserved residues between the sequences are given in bold letters. Coloured bars show the conserved motifs. The amino acids found in the binding site-forming are boxed in blue, cyan, magenta, yellow, orange, or red colours, respectively. The secondary structure of Ba0331 and Bc1974 is shown on the top and bottom (α-helices as spirals, β-strands as bold arrows, turns as bold T). Diagram using the program ESPRIPT [60].
Figure 5Location of conserved motifs MT1-MT5(′) on the NodB domain of Ba0331 and Bc1974 structures. The elements shown in Figure 4 sequence alignment with orange (MT1, MT2, MT3, MT4 and MT5) and green bars (MT4′ and MT5′) are shown on the 3D Cα representation of representative Group 1 and Group 2 structures. (a) Ba0331 and (b) Bc1974.
Figure 6(a) Ribbon diagram of the superimposed Fn3 domain structures from Ba0330, Bc0361 (in shades of purple) and Ba0331 (in cyan) with the totally conserved hydrophobic residues in pink and the conserved interacting loops RTAD (res. 86–89 in blue) and RGE (res. 119–121 in green). (b) The Fn3 surface association with the NodB domain. (c) The Fn3-NodB contact in ribbon representation with amino acid residue details. Conserved loop RGE between Fn3 β5–β6 β-strands in green and loop RTAD between β3–β4 strands in blue, NodB interacting loop in yellow (res. 201–204). Conserved surface residues between Ba0330 and Ba0331 are coloured in red. Diagrams created using the program PyMOL.
Figure 7Conservation of NodB binding sites. (a). Surface representation of the NodB PDA binding domain facing the active site with the Group 1 amino acid residue identity (in red) for Ba0331, Ba0330 and Bc0361. The metal (Zn) ion position in the active site is indicated with a grey sphere. (b). Surface representation of the NodB PDA binding site from selective B. anthracis and B. cereus structurally determined NodB domains superimposed (Bc1974 (yellow), Ba0331 (blue), Ba0330 (green), Bc1960 (beige), Ba0424 (white)) in two orthogonal views to highlight the similarities (shape) and detailed differences between the PDAs. A trisaccharide GlcNAc is modelled in Bc1974 (in orange sticks) [35]. The formation of the binding crevice (running horizontally) is shown as well as the detailed differences between the PDAs on the surface.
Figure 8Cavity volumes of the substrate binding grooves of experimentally determined PDAs. The volume of the cavity extending on each side of the zinc ion is represented with grey spheres. The domains that set the outer boundaries of the binding groove are coloured in blue, cyan, magenta, yellow, orange, or red in accordance with Figure 4 and their corresponding amino acid residues are shown in sticks. Where applicable, the zinc ion is represented as red balls. Despite the overall similar fold of all NodB domains, the relative analysis revealed profound variation between the Fn3-containing PDAs Ba0331 (a) and Ba0330 (b) as well as between the highly homologous Bc1974 (c) and Bc1960 (d). The non-metal containing Ba0150 (e) has a rather small and deep crevice, while the cavity of the putative MurNAc deacetylase Ba0424 (f) contains a restriction ring at the post-MT1 region (coloured orange).
Figure 9Superposition of the NodB domain containing the binding sites of Ba0331 and Ba0330. Ba0331 is shown in green and Ba0330 is shown in magenta (acquired from PDBIDs: 4V33, 6GO1). The figure is centred on the zinc ion and therefore only the binding site grooves are shown. Τhe residues that either belong directly to an MT1-5 domain or lie within a 5 Å distance from one of these motifs are represented as sticks. The acetate ions were removed for clarity reasons. A black sphere represents the zinc ion in the active site for Ba0331 and a grey sphere shows the zinc ion for Ba0330. The zinc coordination sphere or the interacting network unique in Ba0331 is shown in black dashed lines (distances in angstroms). Only the different residues between Ba0331 and Ba0330 are labelled. A strong interacting network (on the left side of the figure), which corresponds to the upper rim of the binding groove, was observed in Ba0331. In contrast to the Ba0330/Bc0361 deacetylases where the particular side is occupied by hydrophobic or small amino acids, in Ba0331, a number of charged or hydrophilic residues were located. Strong interactions of D165 with positively charged residues stabilize the β9–β10 loop to a closed conformation, thus reducing the available volume of the binding groove. On the right side of the figure, which corresponds to the lower side of the binding groove, the presence of M153, L170 and F352 characterized the boundaries as hydrophobic.