| Literature DB >> 27110766 |
Hilde S Vollan1,2, Tone Tannæs3, Gert Vriend4, Geir Bukholm5,6.
Abstract
Diffusion channels are involved in the selective uptake of nutrients and form the largest outer membrane protein (OMP) family in Gram-negative bacteria. Differences in pore size and amino acid composition contribute to the specificity. Structure-based multiple sequence alignments shed light on the structure-function relations for all eight subclasses. Entropy-variability analysis results are correlated to known structural and functional aspects, such as structural integrity, multimericity, specificity and biological niche adaptation. The high mutation rate in their surface-exposed loops is likely an important mechanism for host immune system evasion. Multiple sequence alignments for each subclass revealed conserved residue positions that are involved in substrate recognition and specificity. An analysis of monomeric protein channels revealed particular sequence patterns of amino acids that were observed in other classes at multimeric interfaces. This adds to the emerging evidence that all members of the family exist in a multimeric state. Our findings are important for understanding the role of members of this family in a wide range of bacterial processes, including bacterial food uptake, survival and adaptation mechanisms.Entities:
Keywords: bacteria; entropy-variability analysis; multiple sequence analysis; porins; specific diffusion channels; structure analysis; β-barrel membrane proteins
Mesh:
Substances:
Year: 2016 PMID: 27110766 PMCID: PMC4849052 DOI: 10.3390/ijms17040599
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Outer membrane protein (OMP) structure. OMP structures are composed of a transmembrane motif that consists of β-strands that form a transmembrane β-barrel. These strands are connected by short loops (also called turns) in the periplasm and longer extracellular loops. The number of β-strands, the length of the β-strands and the loop lengths vary widely among OMPs. Here, the trimeric general diffusion porin (GDP) from E. coli (PDB (Protein Data Bank) ID: 2J1N, [4]) is shown as an example from (A) the side-view with the ß-barrel core embedded in the membrane and (B) the top-view, where the pores are visible viewed from the extracellular side. The β-barrel with loops and turns removed (blue) is referred to as the core.
Figure 2Porin structure. The constriction loop L3 of sugar-specific Salmonella Typhimurium ScrY porin is highlighted in green (PDB ID: 1OH2, [12]). The barrel core is colored blue; periplasmic turns and extracellular loops are red; the substrate is orange; and calcium ions are gray. (A) Side-view of the porin where strands 14, 15 and 16 of molecule P were removed to better visualize the constriction loop L3; (B) top-view of the porin.
Protein structure information of bacterial porins and specific diffusion channels. This table lists protein names and biological and structural information of the structures used in our analyses. The size column lists the number of β-strands.
| Groups | Name | Bacteriae | Size | PDB ID | Resolution (Å) |
|---|---|---|---|---|---|
| Monomeric proteins | |||||
| NMP | EcOmpA | 8 | 1QJP | 1.7 | |
| NMP | KpOmpA | 8 | 2K0L | – | |
| NMP | OmpG | 14 | 2IWV | 2.3 | |
| SMDC | NanC | 12 | 2WJR | 1.8 | |
| SMDC | KdgM | 12 | 4FQE | 1.9 | |
| SMDC | OprB | 16 | 4GEY | 2.7 | |
| SMDC | OccD | 18 | 3SYS, 3SZD, 3SZV, 3T0S, 3T20, 3T24, 4FRT, 4FRX, 4FT6, 4FSO, 4FSP, 3JTY, 3SY7, 3SY9, 3SYB | Average: 2.4 (1.5–3.2) | |
| Trimeric proteins | |||||
| NTP | EcOmpC | 16 | 2J1N | 2.0 | |
| NTP | StOmpC | 16 | 3UPG | 3.2 | |
| NTP | OmpK36 | 16 | 1OSM | 3.2 | |
| NTP | StOmpF | 16 | 3NSG | 2.8 | |
| NTP | EcOmpF | 16 | 4GCS | 1.9 | |
| NTP | PhoE | 16 | 1PHO | 3.0 | |
| NTP | RcGDP | 16 | 2POR | 1.8 | |
| NTP | RbGDP | 16 | 1PRN | 2.0 | |
| NTP | Omp32 | 16 | 2FGQ | 1.5 | |
| STDC | EcMaltoporin | 18 | 1AF6 | 2.4 | |
| STDC | StMaltoporin | 18 | 2MPR | 2.4 | |
| STDC | ScrY | 18 | 1OH2 | 2.4 | |
| STDC | OprP | 16 | 2O4V | 1.9 |
EcMaltoporin (Escherichia coli maltoporin); EcOmpA (E. coli outer membrane protein A); EcOmpC (E. coli outer membrane protein C); EcOmpF (E. coli outer membrane protein F); KdgM (oligogalacturonate-specific channel); KpOmpA (K. pneumonia outer membrane protein A); NanC (N-acetylneuraminic acid-inducible outer-membrane channel); NMP (non-specific monomeric porins); NTP (non-specific trimeric porin); OccD (outer membrane carboxylate channel); Omp32 (outer membrane protein 32); OmpG (outer membrane protein G); OmpK36 (outer membrane porins of K. pneumoniae); OprB (outer membrane porin B); OprP (outer membrane porin P); PDB ID (Protein Data Bank Identifier); PhoE (phosphoporin); RbGDP (R. blastica general diffusion porin); RcGDP (R. capsulatus general diffusion porin); ScrY (Sugar specific porin); SMDC (specific monomeric diffusion channel); STDC (specific trimeric diffusion channel); StMaltoporin (S. Typhium maltoporin); StOmpC (S. Typhimurium outer membrane protein C); StOmpF (S. Typhimurium outer membrane protein F); Å (Angstrom).
Secondary structure composition. Secondary structure composition of the average non-specific porin and specific diffusion channel composition, which lists the average length (number of amino acids), percentages of amino acids located in β-strands, periplasmic loops and extracellular loops.
| Porin Group | Average Length | Barrel (%) | Periplasmic Loop (%) | Extracellular Loop (%) |
|---|---|---|---|---|
| NMP | 219 | 57 | 12 | 30 |
| SMDC | 385 | 56 | 14 | 28 |
| NTP | 325 | 57 | 11 | 30 |
| STDC | 417 | 54 | 11 | 34 |
NMP (non-specific monomeric porins); NTP (non-specific trimeric porin); SMDC (specific monomeric diffusion channel); STDC (specific trimeric diffusion channel).
Figure 3Phylogenetic tree of bacterial porins. Colors represent the four groups and the multimeric state of the porin based on the old classification system.
The non-specific porin and specific diffusion channel family distributed among six classes and further divided into eight subclasses. Corresponding OMPdb [35] classifications are found in Table S1. The empty classes (and subclasses) are reserved for future structures. “Size” refers to the number of strands.
| Class | Subclass | Protein Structures | |||
|---|---|---|---|---|---|
| Number | Size | Number | Name | Protein Name | PDB ID Template (Other) |
| 1 | 8 | 1A | Non-specific, petite porin | OmpA | 2K0L (1QJP) |
| 2 | 10 | 2A | Non-specific, mini porin | – | – |
| 3 | 12 | 3A | Non-specific, small porin | – | – |
| 3B | Oligogalacturonate-specific, small channel | KdgM and NanC | 4FQE (2WJR) | ||
| 4 | 14 | 4A | Non-specific, intermediate porin | OmpG | 2IWV |
| 5 | 16 | 5A | Non-specific, medium porin | OmpC, OmpK36, OmpF, PhoE, Omp32, RcGDP and RbGDP | 2J1N (1PRN, 3UPG, 1OSM, 3NSG, 4GCS, 2POR, 2FGQ, and 1PHO) |
| 5B | Sugar-specific, medium channel | OprB | 4GEY | ||
| 5C | Phosphate-specific, medium channel | OprP | 2O4V | ||
| 6 | 18 | 6A | Non-specific, large porin | – | – |
| 6B | Sugar-specific, large channel | Maltoporin and ScrY | 2MPR (1AF6 and 1OH2) | ||
| 6C | Carboxyl-specific, large channel | Occ Channels | 3SZV (3SYS, 3SZD, 3T0S, 3T20, 3T24, 4FRT, 4FRX, 4FT6, 4FSO, 4FSP. 3JTY, 3SY7, 3SY9, and 3SYB) | ||
EcMaltoporin (Escherichia coli maltoporin); EcOmpA (E. coli outer membrane protein A); EcOmpC (E. coli outer membrane protein C); EcOmpF (E. coli outer membrane protein F); KdgM (oligogalacturonate-specific channel); KpOmpA (K. pneumonia outer membrane protein A); NanC (N-acetylneuraminic acid-inducible outer-membrane channel); OccD (outer membrane carboxylate channel); Omp32 (outer membrane protein 32); OmpG (outer membrane protein G); OmpK36 (outer membrane porins of K. pneumoniae); OprB (outer membrane porin B); OprP (outer membrane porin P); PDB ID (Protein Data Bank Identifier); PhoE (phosphoporin); RbGDP (R. blastica general diffusion porin); RcGDP (R. capsulatus general diffusion porin); ScrY (Sugar specific porin); StMaltoporin (S. Typhium maltoporin); StOmpC (S. Typhimurium outer membrane protein C); StOmpF (S. Typhimurium outer membrane protein F).
Structure alignment statistics of non-specific porins and specific diffusion channels. Core structures are analyzed after the loops were removed (so only the barrel was used). The percentages of aligned residues are averages for each pair-wise alignment possible in the subclass. Empty cells represent classes with only one structure. Minimum pore radii are determined using HOLE [37] for the whole monomeric protein and its β-barrel core, respectively.
| Subclasses | Resolution | % Residues Superposed | % Sequence Identity | Pore Size | |||
|---|---|---|---|---|---|---|---|
| Number | Name | RMSD (Å) | Mustang | Mustang | Clustal Ω | Core | Whole |
| 1A | Non-specific, petite porin | 1.6 | 98.7 | 96.1 | 93.6 | ~ 0 | 0.1 |
| 3B | Oligogalacturonate-specific, small channel | 1.4 | 95.0 | 25.6 | 28.1 | 2.9 | 2.9 |
| 4A | Non-specific, intermediate porin | – | – | – | – | 3.7 | 4.0 |
| 5A | Non-specific, medium porin | 1.4 | 90.7 | 40.2 | 41.3 | 2.8 | 6.9 |
| 5B | Sugar-specific, medium channel | – | – | – | – | 2.2 | 7.0 |
| 5C | Phosphate-specific, medium channel | – | – | – | – | 1.6 | 6.4 |
| 6B | Sugar-specific, large channel | 0.9 | 95.6 | 50.3 | 49.5 | 2.1 | 6.9 |
| 6C | Carboxyl-specific, large channel | 0.9 | 96.2 | 46.4 | 45.7 | 1.8 | 7.8 |
| Average | 1.2 | 95.2 | 52.0 | 51.6 | 2.4 | 7.5 | |
Figure 4Estimated minimum Angstrom pore size radius for each subclass using the HOLE software [37]. Each colored diamonds represent one monomer structure. (A) Whole structure; (B) core structure where loops and turns were removed. The barrel size number of strands in each β-barrel) is listed in brackets next to the class name labels.
Number of sequences used in MSA (multiple sequence alignment analyses). The number of sequences used in the final MSA that was generated for each class is listed together with the class number and name. The lacking classes (and subclasses) are reserved for future structures and highlighted with a question mark.
| Subclass Number | Subclass Name | Number of Sequences Used in the MSA |
|---|---|---|
| 1A | Non-specific, petite porin | 389 |
| 3B | Oligogalacturonate-specific, small channel | 246 |
| 4A | Non-specific, intermediate porin | 50 |
| 5A | Non-specific, medium porin | 725 |
| 5B | Sugar-specific, medium channel | 319 |
| 5C | Phosphate-specific, medium channel | 180 |
| 6B | Sugar-specific, large channel | 663 |
| 6C | Carboxyl-specific, large channel | 1394 |
Figure 5Entropy-variability (EV) plots of the eight subclasses. Red is for those residues likely to be in the main active site; orange is the main active site; green is the regulatory site; yellow communicates; blue is those with unknown function (see the Methods Section for more details).
Figure 6Entropy-variability (EV) results of the eight subclass from (A) side-view and (B) top-view. The structures are visualized as molecular surfaces and are colored according to residue conservation as described in the Methods Section. This figure illustrates differences in barrel size, pore size and loop variability. All structures are visualized in the monomeric form. These results can be downloaded and viewed in 3D from the associated websites (YASARA (Yet Another Scientific Artificial Reality Application) scene files; see the Supplementary Materials).
Figure 7Conserved non-specific porin and specific diffusion channel residues. (A) Side view and (B) top view of conserved residues (Boxes 11, 12, and 22). Only the monomeric state is shown for all subclasses. Loops and turns are deleted to visualize the protein core. These results can be downloaded and viewed in 3D from the associated websites (YASARA scene files; see the Supplementary Materials).
Figure 8Non-specific porin and specific diffusion channel conservation. This figure illustrates differences in barrel size, pore size and loop variability. All structures are visualized in the trimeric form. Only the structures of Classes 5A (A); 5C (B); and 6B (C) proteins have been solved in an oligomeric conformation (all of them being homotrimeric). Class 1A (D); Class 3B (E); Class 4A (F); Class 5B (G) and Class 6C (H) are monomeric structures modeled in a trimeric conformation by hand. Same remark as the previous figures.
Figure 9Porin variability. Side and top views displaying the loop variability of all protein subclasses analyzed (including the modeled trimeric interactions shown in Figure 8). All structures are visualized in the predicted trimeric form for Class 1A (2, 3, 5) and eight porins. Class 1A porins are represented in the monomeric state, and only conserved residues are highlighted (see the Discussion for more details). Red, orange and yellow represent conserved residues; green shows partially conserved residues involved in regulation; blue represents highly variable sites with unknown functions. Same as previous figures.
Non-specific porin and specific diffusion channel oligomerization state. This table lists articles discussing the multimeric state of non-specific porins and specific diffusion channels, the protein name and information regarding the presumed multimeric state (monomeric, dimeric or oligomerization).
| Class Number | Class Name | Function | Monomer | Dimer | Oligomer |
|---|---|---|---|---|---|
| 1A | Non-specific, petite porin | Abundant multifunctional porin; host evasion [ | EcOmpA [ | EcOmpA [ | FopA [ |
| 3B | Oligogalacturonate-specific, small channel | Oligogalacturonate-specific channel [ | KdgM [ | NanC [ | – |
| 4A | Non-specific, intermediate porin | pH-dependent rare rescue porin [ | EcOmpG [ | EcOmpG [ | – |
| 5A | Non-specific, medium porin | Classical porins or general diffusion porin (GDP) [ | – | – | GDP [ |
| 5B | Sugar-specific, medium channel | Oligosaccharide specific channel [ | OprB [ | – | OprB [ |
| 5C | Phosphate-specific, medium channel | Phosphate specific channel [ | – | – | OprP [ |
| 6B | Sugar-specific, large channel | Sugar specific channel [ | – | – | EcMaltoporin [ |
| 6C | Carboxyl-specific, large channel | Small water-soluble specific channel (Occ channels) [ | Occ [ | – | OccK1 (OpdK) [ |
AbOprB (Acinetobacter baumannii outer membrane porin B); EcMaltoporin (Escherichia coli maltoporin); EcOmpA (E. coli outer membrane protein A); EcOmpC (E. coli outer membrane protein C); EcOmpF (E. coli outer membrane protein F); KdgM (oligogalacturonate-specific channel); KpOmpA (K. pneumonia outer membrane protein A); NanC (N-acetylneuraminic acid-inducible outer-membrane channel); OccD (outer membrane carboxylate channel); Omp32 (outer membrane protein 32); OmpG (outer membrane protein G); OmpK36 (outer membrane porins of K. pneumoniae); OprB (outer membrane porin B); OprP (outer membrane porin P); PDB ID (Protein Data Bank Identifier); PhoE (phosphoporin); RbGDP (R. blastica general diffusion porin); RcGDP (R. capsulatus general diffusion porin); ScrY (Sugar specific porin); StMaltoporin (S. Typhium maltoporin); StOmpC (S. Typhimurium outer membrane protein C); StOmpF (S. Typhimurium outer membrane protein F).
Figure 10The sequence analysis workflow [102] (copied with permission from Bio-Prodict) shows the process of structure superposition guided by a profile alignment of the sequences. Step a1 shows the superposition of two structures (Omp32, colored blue, and PhoE in yellow; PDB IDs: 2FGQ and 1PHO, respectively); Step a2 shows the core alignment of the barrel structures in which loops have been removed; Step a3 is core alignment of similar structure; Step b is a sequence alignment against the nearest structure. All sequences and structures from one subclass are aligned together in step c. Both structures used in this MSA (multiple sequence alignment) belong to the general diffusion protein subclass. The WHAT IF module of the YASARA/WHAT IF twinset was used for these analyses [39].