| Literature DB >> 35053207 |
Karla N Valenzuela-Valderas1, Gabriel Moreno-Hagelsieb2, John R Rohde1, Rafael A Garduño1.
Abstract
Group I chaperonins are a highly conserved family of essential proteins that self-assemble into molecular nanoboxes that mediate the folding of cytoplasmic proteins in bacteria and organelles. GroEL, the chaperonin of Escherichia coli, is the archetype of the family. Protein folding-independent functions have been described for numerous chaperonins, including HtpB, the chaperonin of the bacterial pathogen Legionella pneumophila. Several protein folding-independent functions attributed to HtpB are not shared by GroEL, suggesting that differences in the amino acid (aa) sequence between these two proteins could correlate with functional differences. GroEL and HtpB differ in 137 scattered aa positions. Using the Evolutionary Trace (ET) bioinformatics method, site-directed mutagenesis, and a functional reporter test based upon a yeast-two-hybrid interaction with the eukaryotic protein ECM29, it was determined that out of those 137 aa, ten (M68, M212, S236, K298, N507 and the cluster AEHKD in positions 471-475) were involved in the interaction of HtpB with ECM29. GroEL was completely unable to interact with ECM29, but when GroEL was modified at those 10 aa positions, to display the HtpB aa, it acquired a weak ability to interact with ECM29. This constitutes proof of concept that the unique functional abilities of HtpB can be mapped to specific aa positions.Entities:
Keywords: ECM29; GroEL; HtpB; Legionella; chaperonin; evolutionary trace; site-directed mutagenesis; yeast-two-hybrid
Mesh:
Substances:
Year: 2021 PMID: 35053207 PMCID: PMC8774168 DOI: 10.3390/biom12010059
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Primers used. The primer’s name in alphabetical/numerical order, nucleotide sequence, and application are given. For those primers used in multi-site directed mutagenesis, their application is designated as “Mutagenesis-multi”. Nucleotides in boldface show the sequence of the restriction site given in parenthesis, and the boldface-underlined nucleotides are those mutated from the wild type/parental sequence, to achieve the desired amino acid substitutions.
| Primer’s Name | 5′ → 3′ Sequence | Application |
|---|---|---|
| BamHI-htpB_R | ATA | PCR and sequencing |
| D475A_F | GGTAGCTGAGCACAAAG | Mutagenesis |
| D475A_R | CGTTGAAACCGTAGTTG | Mutagenesis |
| EcoRI-htpB_F | CCG | PCR and sequencing |
| EcoRI-groEL_F | CGG | PCR and sequencing |
| E67M_R | CACCATCTGCGCACCCATATT | Mutagenesis-multi |
| E472A_F | TAGTAAACAAGGTAGCTG | Mutagenesis |
| E472A_R | CCGTAGTTGTCTTTGTGC | Mutagenesis |
| GroEL461_F | CCGACGAAACCGTAGGTAAA | Sequencing |
| GroEL470-474_F | CTGTTGTTGCTAACACCGTT | Mutagenesis-multi |
| GroEL470-474_R | CTGCGTTGTAACCGTAGTTGGCG | Mutagenesis-multi |
| GroEL1154_R | TAGCAGCACCCACTTTGATAA | Sequencing |
| G211M_R | TTCCAGTTCTACTGC | Mutagenesis-multi |
| G297K_R | GATCTCTTCAGAGATCACGGT | Mutagenesis-multi |
| HtpB419_F | AAGACAGCAAAGCCATTG | Sequencing |
| HtpB1200_R | AGCATCTTCAACACGAGC | Sequencing |
| H473A_F | GTAGTAAACAAGGTAGCTGAG | Mutagenesis |
| H473A_R | TTGAAACCGTAGTTGTCTTTG | Mutagenesis |
| K298A_F | AGCGATGTTGCAAGACATTGCTATTTTGACT | Mutagenesis |
| K298A_R | AGAAATAACTTGACCC | Mutagenesis |
| K298G_R | AGAAATAACTTGACCC | Mutagenesis-multi |
| K474A_F | TAAACAAGGTAGCTGAGCAC | Mutagenesis |
| K474A_R | CGTTGAAACCGTAGTTGTCT | Mutagenesis |
| MMBD_F | TCATCGGAAGAGAGTAGTAAC | Sequencing |
| MMBD_R | CCTAAGAGTCACTTTAAAATTTGTATAC | Sequencing |
| M68A_F | TGAGTTTGAGCATCGTTTC | Mutagenesis |
| M68A_R | CATTTGAGCGCCCATGTTC | Mutagenesis |
| M68E_R | CATTTGAGCGCCCATGTTC | Mutagenesis-multi |
| M212A_F | TTTATCAACAACCAGCAAAAC | Mutagenesis |
| M212A_R | GGATGCTCAAGTTCACAGCTC | Mutagenesis |
| M212G | GGATGCTCAAGTTCACAGCTC | Mutagenesis-multi |
| N507A_F | CAAAGTAACCCGTATGGCTCTGCAA | Mutagenesis |
| N507A_R | TACAGAAGCTGCA | Mutagenesis |
| N507Y_R | CTACAGAAGCTGCAT | Mutagenesis-multi |
| P235S_R | AGCTTCCAGAACCG | Mutagenesis-multi |
| SalI-groEL_R | AGTC | PCR and sequencing |
| S236A_F | CAGTATTCGTGAAATGTTG | Mutagenesis |
| S236A_R | GCAACACCTTCCAATACGG | Mutagenesis |
| S236P | GCAACACCTTCCAATACGG | Mutagenesis-multi |
| Y506N_R | CACAGAAGCTGCGT | Mutagenesis-multi |
| 1411-12-13-15_F | ATGAAGCTTCTGTTGTAGTAAACAAGGTA | Mutagenesis-multi |
| 1411-12-13-15_R | GTTGAAACCGTAGTTGTCTTTGTGC | Mutagenesis-multi |
| 1415-17-18-20-21-24_F | CTTCTGTTGTAGTAAACAAGGTAGCTG | Mutagenesis-multi |
| 1415-17-18-20-21-24_R | CCAGTTGCAGCGTTGAAACCGTAGTTG | Mutagenesis multi |
| 1417-18-20-22-24_F | CTGTTGTAGTAAACAAGGTAAAAGGG | Mutagenesis-multi |
| 1417-18-20-22-24_R | CAGTTGCAGCGTTGAAACCGTAGTTG | Mutagenesis-multi |
Figure 1Histogram showing the distribution of ET rank values for the 550 amino acid positions of HtpB, in intervals of 25 arbitrary units (horizontal axis). The number of amino acids in each interval is represented by the bars’ heights, against the scale of the vertical axis. The mean value of the ET ranks (55.80) is marked by the dotted vertical line.
Figure 2Surface representations of the 3-D predicted structure of HtpB, showing 180° rotation images created in the ModWeb server. The scale at the top shows the color code of ET ranks, from the lowest value (1 = deep red) to the highest (234.92 = bright purple) and applies to the three panels. (A) ET ranks of all 550 amino acids (aa) in HtpB. The three structural domains of chaperonins are indicated by braces, and the red arc shows the HtpB concave face. N and C indicate the N-terminus (first aa) and C-terminus (last aa), respectively. (B) ET ranks of the 137 aa positions where HtpB and GroEL differ. The locations of aa important for protein folding functions have been manually delineated with dotted outlines as follows: Blue shadow = Substrate polypeptide recognition, Red shadow = ATP-binding pocket, Green shadow = Intra-ring contacts, and Yellow shadow = Inter-ring contacts. (C) ET ranks of the 41 less-likely substitutions between HtpB and GroEL. Aa positions selected for functional testing are specifically labeled and correspond to the less likely substitutions either with a low (green) ET rank or present in a cluster.
HtpB amino acids likely involved in protein folding-related functions. Amino acids are given in single letter code, and the numeral that follows the letter indicates the position in the 550 amino acid sequence. The ET rank for each position is given in parentheses. For those positions in which the HtpB and GroEL amino acids are different, the GroEL amino acid is given in square brackets.
| Protein Folding-Related Function | |||
|---|---|---|---|
| Intra-Ring Contacts (Formation of Heptameric Rings) | ATP-Binding | Polypeptide Substrate Recognition | Inter-Ring Contacts (Formation of 14-mer Barrel) |
| L7(47.14) [V] | R14(6.33) | Y200(10.29) | D12(133.91) |
| A23(125.52) [V] | T31(1) | S202(6.66) | L15(113.12) [V] |
| R37(9.63) | M32(16.1) [L] | Y204(1.65) | K106(36) |
| N38(9.8) | G33(1) | F205(14.17) | A109(48.43) |
| V39(5.59) | P34(1) | R232(32.97) | A110(24.42) |
| V40(44.13) | K52(6.27) | L235(31.35) | G111(2.21) |
| L41(39.88) | D53(1) | L238(3.42) | M112(49.9) |
| E42(62.78) [D] | G54(1) | E239(9.22) | D435(146.41) [E] |
| K81(21.07) | D88(1) | A242(46.79) | R446(79.93) |
| D84(9.44) | T92(1.64) | L260(7.37) | R453(48.94) |
| N113(22.45) | I151(29.03) | T262(14.39) | E462(32.73) |
| M115(68.31) | S152(4.14) | V264(19.47) | S464(50.97) |
| N182(43.17) [T] | A153(23.45) | V265(32.63) | V465(42.35) |
| L184(66.68) | A384(8.56) | N266(1.49) | N468(101.41) |
| R198(10.2) | D399(1.72) | R269(10.8) | |
| N208(47.69) [K] | A407(3.87) | I271(54.8) | |
| E217(117.15) | G416(1) | ||
| K227(3.43) | I455(8.15) | ||
| R232(32.97) | N480(41.39) | ||
| R246(79.05) [K] | A481(13.62) | ||
| E253(14.91) | A482(88.43) | ||
| E256(41.75) | I494(30.94) | ||
| E258(12.87) | D496(1.4) | ||
| K273(74.67) | |||
| F282(7.64) | |||
| D284(16.16) | |||
| R285(3.19) | |||
| R286(2) | |||
| Y361(9.78) | |||
| A385(35.43) | |||
| E387(4.19) | |||
| M390(22.23) | |||
| A459(37.54) [C] | |||
| T517(4.29) | |||
| E519(25.31) | |||
| C520(50.52) | |||
| M521(64.93) | |||
| V522(68.71) | |||
| A523(69.18) [T] | |||
Positions of the multiple sequence alignment (MSA), for which HtpB and GroEL amino acids (aa) are different and represent less likely substitutions predicted by a negative BLOSUM score. Of the 137 positions in which HtpB and GroEL aa differ, only the 41 with a negative BLOSUM 69 score are shown here. The five positions with low ET ranks (real value evolutionary trace rank, or rvET rank) are highlighted.
| Aligment Position | HtpB (aa) | GroEL (aa) | Variability | Variability (aa) b | rvET Rank | Blosum Score |
|---|---|---|---|---|---|---|
| 3 | M | A | 10 | A | 46.31 | −1 |
| 19 | A | R | 15 | RKE | 130.61 | −1 |
| 65 | H | D | 10 | DCN | 46.08 | −1 |
| 68 | M | E | 13 | ERKAQ | 29.84 | −2 |
| 105 | H | L | 14 | LINA | 66.77 | −3 |
| 126 | L | T | 15 | TEAKINVDQ | 210.52 | −1 |
| 137 | K | V | 13 | V | 93.13 | −2 |
| 161 | A | K | 15 | KDNERQSLT | 201.68 | −1 |
| 209 | Q | P | 14 | PSANTR | 120.55 | −1 |
| 212 | M | G | 7 | GQ | 21.73 | −3 |
| 214 | C | V | 7 | VA | 76.14 | −1 |
| 218 | H | S | 11 | SDKENRQT | 115.53 | −1 |
| 236 | S | P | 8 | PH | 22.91 | −1 |
| 295 | I | T | 7 | T | 113.27 | −1 |
| 298 | K | G | 10 | GA | 38.33 | −2 |
| 300 | Q | T | 14 | T | 112.40 | −1 |
| 308 | K | M | 17 | MLRISFY | 85.41 | −1 |
| 312 | G | K | 12 | KNTDSAQ | 116.02 | −2 |
| 337 | E | V | 20 | VALDNGMSKI. | 157.89 | −2 |
| 340 | A | E | 17 | ETSKP | 169.34 | −1 |
| 342 | E | A | 18 | AQDSN | 187.52 | −1 |
| 352 | A | Q | 14 | Q | 151.18 | −1 |
| 424 | Q | A | 17 | AYISTVLK | 149.47 | −1 |
| 426 | A | K | 19 | KPT | 173.35 | −1 |
| 428 | D | A | 18 | ALESTKH | 191.20 | −2 |
| 444 | L | A | 7 | AVI | 64.13 | −1 |
| 445 | R | L | 16 | LKIF | 122.42 | −2 |
| 457 | T | L | 18 | LFHAVEIK | 194.97 | −1 |
| 461 | Y | E | 19 | ELKF | 142.12 | −2 |
| 463 | A | P | 10 | PG | 46.27 | −1 |
| 469 | K | T | 14 | T | 126.50 | −1 |
| 471 | A | K | 14 | KRLM | 115.70 | −1 |
| 472 | E | G | 17 | GNHSA | 185.86 | −2 |
| 473 | H | G | 17 | G.RAKSNTEV | 109.10 | −2 |
| 474 | K | D | 15 | DPASE | 183.88 | −1 |
| 475 | D | G | 20 | GAVSLPKERYIT | 184.40 | −1 |
| 484 | G | E | 12 | E | 82.24 | −2 |
| 503 | M | S | 14 | SCTIV | 65.83 | −1 |
| 507 | N | Y | 11 | Y | 29.53 | −2 |
| 530 | E | A | 15 | AKGS.P | 219.66 | −1 |
| 536 | D | G | 15 | G.A | 118.38 | −1 |
a Number of possible aa in the entire MSA that can be found at the corresponding position. b List of the “X” possible aa that can be found at the corresponding position, where the value of X is given in the Variability (No.) column. Amino acids are ordered by frequency of occurrence. The GroEL aa were the most frequently found, and the HtpB aa are shown in boldface to highlight their commonness or rarity. A period in the list means that a sequence gap is found in some chaperonins in the corresponding position.
Figure 3Analysis of the secondary structure of the HtpB amino acid (aa) cluster in positions 471-475. Secondary structure was predicted using VADAR and visualized using PyMol. (A) Table of parameters associated with the secondary structure of the HtpB polypeptide between aa positions 469-477. The positions corresponding to the AEHKD cluster are highlighted in red. Residue names (RES. NAME) are shown as three-letter codes. The secondary structure (SCND STRUC) of which each aa is part of, is denoted as follows: H = alpha helix, C = random coil and B=beta sheet; the first three letters representing the predictions by three independent methods, and the fourth letter (separated by a space) being the consensus secondary structure. The residue accessible surface area (RES. ASA) for each aa is given in square angstroms. The higher the ASA value, the more accessible the corresponding aa would be to interact with a water molecule. (B) Diagrammatic close-up of the ribbon structure of HtpB, showing the predicted conformation of the random coil formed by the amino acids in positions 471–475 (blue section). The blue area that is part of the alpha helix seen in the back, corresponds to Alanine 471.
Figure 4Confirmation of the physical interaction between HtpB and hECM29. (A) Immunoblot of the immunoprecipitation (IP) done with the HA-specific antibody (anti-HA). Total proteins from S. cerevisiae co-expressing c-Myc-HtpB and HA-hECM29 fusions were immuno-precipitated with anti-HA (IP-HA), separated by SDS-PAGE, transferred to a membrane, and immuno-stained with c-Myc-specific antibody (Anti-cMyc). An ~85-kDa band corresponding to the HtpB+Gal4 DNA binding domain+c-Myc tag fusion protein is seen in the immunoprecipitate (IP-HA), but not in the IP supernatant (Unbound). (B) Immunoblot of the IP done with anti-c-Myc, where total proteins from S. cerevisiae co-expressing c-Myc-HtpB and HA-hECM29 fusions were immuno-precipitated with anti-cMyc (IP-cMyc), and immuno-stained with anti-HA. An ~57-kDa band corresponding to the hECM29+Gal4 activating domain+SV40 nuclear-localization signal+HA tag fusion protein is seen in the immuno-precipitate (IP-cMyc), but not in the IP supernatant (Unbound). (C) Diagrams to aid in the interpretation of the immunoblots. The diagram on the left shows the HtpB-hECM29 complex captured with anti-HA immobilized on a protein A/G agarose bead (Prot. A/G); as in the assay corresponding to Panel A. The diagram on the right shows the HtpB-hECM29 complex captured with immobilized anti-c-Myc; as in the assay corresponding to Panel B. The mass of the protein standards shown at the left of panels A and B is given in kDa.
Figure 5Effect of amino acid substitutions upon the HtpB-hECM29 interaction, evaluated in Y2H Plate assays. Images show the 5-day growth of yeast spotted on QDO/X/A plates, as well as the positive growth control spotted on DDO plates (far right column). The baits used to interact with hECM29 are listed in the first column. The positive interaction control is the wild-type HtpB bait (WT-HtpB), and the negative interaction control is the wild-type GroEL bait (WT-GroEL). Notice that for WT-GroEL, there is a shadow lawn in the 100 dilution, but not in the 10−1 dilution. A shadow lawn is produced by yeast cells that, upon incubation, dry up on the agar plate surface, in the absence of defined colony growth. The interaction score for each bait is given as: (+++) = Positive, (++) = Impaired, and (–) = negative. The positive growth control was spotted with the highest dilution used (10−3), to ensure that any growth defect observed on QDO/X/A plates was not due to faulty inoculation. The images shown are representative of at least three independent experiments (all giving the same results).
Figure 6Bar graphs showing quantitative measurements of the HtpB-hECM29 interaction, evaluated in Y2H Broth assays. All quantitative measurements were taken after 6 days of incubation in QDO/A broth at 30 °C. Height of the bars represents the mean, and the error bars represent ± one standard deviation, from three independent measurements (n = 3). Asterisks indicate statistical significance in relation to the quantitative measurements of wild type HtpB (WT HtpB), with the following p-values: * <0.05, ** <0.01, *** <0.001. Statistical significance was calculated by the Student-t test. (A) Growth of yeast cells carrying the different HtpB baits indicated in the horizontal axis. Growth was measured as OD600 of the broth culture and is indicated in the Y axis. (B) Alpha-galactosidase activity of yeast cells carrying the different HtpB baits (indicated in the horizontal axis), is given in the Y axis as milli units per mL, per cell.
Figure 7Effect of amino acid substitutions upon the GroEL-hECM29 interaction, evaluated in Y2H plate assays. Images show the 5-day growth of yeast spotted on QDO/X/A plates, as well as the growth of a positive control spotted on DDO plates (far right column). The baits used to interact with hECM29 are listed in the first column. The interaction score for each bait is given in parenthesis as follows: (+) = weak, and (–) = negative. The positive growth control was spotted with 10 μL of the highest dilution used (10−3), to ensure proper inoculation. The images shown are representative of at least three independent experiments (all giving the same results).