| Literature DB >> 36032666 |
Deepa Paliwal1, Michelle Thom2, Areej Hussein1, Divyashree Ravishankar1, Alex Wilkes1, Bryan Charleston2, Ian M Jones1.
Abstract
Bovine tuberculosis caused by Mycobacterium bovis, is a significant global pathogen causing economic loss in livestock and zoonotic TB in man. Several vaccine approaches are in development including reverse vaccinology which uses an unbiased approach to select open reading frames (ORF) of potential vaccine candidates, produce them as recombinant proteins and assesses their immunogenicity by direct immunization. To provide feasibility data for this approach we have cloned and expressed 123 ORFs from the M. bovis genome, using a mixture of E. coli and insect cell expression. We used a concatenated open reading frames design to reduce the number of clones required and single chain fusion proteins for protein pairs known to interact, such as the members of the PPE-PE family. Over 60% of clones showed soluble expression in one or the other host and most allowed rapid purification of the tagged bTB protein from the host cell background. The catalogue of recombinant proteins represents a resource that may be suitable for test immunisations in the development of an effective bTB vaccine.Entities:
Keywords: Mycobacterium bovis; PPE; bovine tuberculosis; expression; genome; open reading frame; protein purification; vaccine
Year: 2022 PMID: 36032666 PMCID: PMC9402895 DOI: 10.3389/fmolb.2022.889667
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
The bTB ORFs selected for recombinant expression tests.
| Expression host | Soluble expression | Purification yield | ||
|---|---|---|---|---|
|
| ||||
| Mb0923 | Outer membrane protein Omp A |
| +++ | +++ |
| Mb0485 | Iron-regulated heparin binding hemagglutinin hbha |
| +++ | +++ |
| Mb0419c | Probable glutamine-binding lipoprotein glnh (glnbp) |
| ++ | + |
| Mb3840 | Exported repetitive protein precursor PirG |
| +++ | + |
| Mb0348 | Isoniazid inductible gene protein iniB |
| +++ | +++ |
| Mb2293 | Probable lipoprotein lppN |
| +++ | +++ |
| Mb2807c | Probable lipoprotein lppU | Insect | +++ | +++ |
| Mb0598c | Probable lipoprotein lpqN |
| +++ | +++ |
| Mb1260 | Probable lipoprotein lpqX | Insect | +++ | +++ |
| Mb1568c | Probable lipoprotein lprI |
| +++ | +++ |
| Mb1891c | Alanine and proline rich secreted protein APA |
| +++ | +++ |
| Mb3905 | 6 kda early secretory antigenic target esxa (Esat-6) |
| +++ | +++ |
| Mb3904 | 10 kda culture filtrate antigen esxb (lhp) (cfp10) |
| +++ | +++ |
| Mb0296 | Low molecular weight antigen 7 esxh (10 kda antigen) (cfp-7) | Insect | +++ | +++ |
| Mb0295 | Esat-6 like protein esxg | Insect | +++ | +++ |
| Mb1229 | Esat-6 like protein esxk (Esat-6 like protein 3) |
| +++ | +++ |
| Mb1230 | Putative Esat-6 like protein esxi (Esat-6 like protein 4) | Insect | +++ | +++ |
| Mb1820 | Esat-6 like protein esxm |
| +++ | +++ |
| Mb1821 | Putative Esat-6 like protein esxn (Esat-6 like protein 5) | Insect | +++ | ++ |
| Mb3475c | Esat-6 like protein esxu |
| +++ | +++ |
| Mb3911c | Proteolytic substrate protein espb |
| ++ | ++ |
| Mb0674 | Possible ribonucleotide-transport ATP-binding protein |
| +++ | ++ |
| Mb1036 | Probable resuscitation-promoting factor rpfb |
| ++ | - |
| Mb1445 | Aminoglycosides/tetracycline-transport integral membrane protein |
| ++ | - |
| Mb3070 | Probable FeIII-dicitrate-binding periplasmic lipoprotein fecb |
| ++ | ++ |
| Mb3338 | Acid phosphatase sapm |
| ++ | ++ |
| Mb2653c | Probable conserved transmembrane protein |
| - | - |
| Mb3646c | Esx-1 secretion-associated protein, espa |
| +++ | + |
| Mb3645c | Esx-1 secretion-associated protein espc |
| +++ | +++ |
| Mb3644c | ESX-1 secretion-associated protein espd |
| - | - |
| Mb2002c | Immunogenic protein mpt64 (antigen mpt64/mpb64) |
| +++ | +++ |
| Mb1762 | Probable conserved transmembrane protein |
| 0 | ++ |
|
| ||||
| Mb0055 | Single-strand binding protein Ssb |
| + | + |
| Mb0671 | 50s ribosomal protein l7/l12 rplL (sa1) |
| +++ | +++ |
| Mb0704 | Probable iron-regulated elongation factor tu tuf |
| ++ | + |
| Mb0670 | 50s ribosomal protein l10 rplJ |
| +++ | ++ |
| Mb1656 | 30s ribosomal protein s1 rpsA |
| +++ | +++ |
|
| ||||
| Mb3441 | Possible antitoxin VapB47 | Insect | +++ | +++ |
| Mb2891 | Toxin RelG |
| +++ | - |
|
| ||||
| Mb2548 | PE family PE26 | Insect | + | + |
| Rv3622c | PE family protein PE32 |
| +++ | +++ |
| Mb0293 | PE family protein PE5 |
| +++ | +++ |
| Mb0294 | PE family protein PE4 |
| +++ | +++ |
| Mb3902 | PE family-related protein PE35 |
| +++ | +++ |
| Mb3922c | PE family protein PE36 |
| +++ | +++ |
| Mb3504 | PE family protein PE31 | Insect | +++ | +++ |
| Mb0940c | PE family protein PE7 | Insect | +++ | +++ |
| Mb1835 | PE family protein PE20 | Insect | +++ | +++ |
| Mb1421 | PE family protein PE15 |
| +++ | +++ |
| Mb3505 | PE family protein PPE 60 |
| +++ | +++ |
| Mb1069c | PE family protein PE 8 |
| +++ | +++ |
| Mb2457c | PE family protein PE 25 |
| +++ | +++ |
| Mb1202c | PE family protein |
| +++ | +++ |
| Mb3903 | PPE family protein PPE68 |
| +++ | +++ |
| Mb3921c | PPE family-related protein PPE69 |
| +++ | +++ |
| Mb0939c | PPE family protein PPE 14 |
| +++ | +++ |
| Mb1836 | PPE family protein PPE 31 |
| +++ | +++ |
| Mb1068c | PPE family protein PPE15 |
| +++ | +++ |
| Mb1422 | PPE family protein PPE 20 | Insect | +++ | +++ |
| Mb2456c | PPE family protein PE41 |
| +++ | +++ |
| Mb1228 | PPE family protein PPE 18 |
| +++ | +++ |
| Mb1200c | PPE17 (part) |
| +++ | +++ |
| Mb1201c | PPE 17 (part) |
| +++ | +++ |
|
| ||||
| Mb3871 | Bacterioferritin BfrB |
| +++ | +++ |
| Mb2981 | Possible glycosyl transferase |
| ++ | + |
| Mb0130 | probable serine protease pepA |
| ++ | ++ |
| Mb1272 | Probable malate dehydrogenase mdh |
| ++ | - |
| Mb0977 | Probable succinyl-CoA synthetase (a chain) sucD |
| ++ | ++ |
| Mb3652 | Inorganic pyrophosphatase PPA |
| +++ | |
| Mb1129c | Fructose 1,6-bisphosphatase glpX |
| ++ | - |
| Mb3412 | Diterpene synthase |
| ++ | ++ |
|
| ||||
| Mb2315c | Hypothetical protein | Insect | + | - |
| Mb3935c | Putative Esat-6 like protein esxf (Esat-6 like protein 13) |
| +++ | +++ |
| Mb3920c | Possible Esat-6 like protein esxd |
| +++ | +++ |
| Mb3934c | Putative Esat-6 like protein esxe (Esat-6 like protein 12) |
| +++ | +++ |
| Mb1066c | Putative Esat-6 like protein esxI (Esat-6 like protein 1) | Insect | +++ | +++ |
| Mb1067c | Esat-6 like protein esxj |
| +++ | +++ |
| Mb2375c | Putative Esat-6 like protein esx0 (Esat-6 like protein 6) | Insect | + | + |
| Mb3042c | Esat-6 like protein esxq (tb12.9) (Esat-6 like protein 8) | Insect | + | + |
| Mb3045c | Secreted Esat-6 like protein esxr (Esat-6 like protein 9) |
| +++ | +++ |
| Mb3046c | Esat-6 like protein esxs |
| +++ | +++ |
| Mb3474c | Putative Esat-6 like protein esxt |
| +++ | +++ |
| Mb3919c | Esat-6 like protein esxc (Esat-6 like protein 11) | Insect | + | - |
| Mb0959 | Periplasmic phosphate-binding lipoprotein PstS1 |
| +++ | +++ |
| Mb1858 | Conserved protein with fha domain, gara |
| ++ | ++ |
| Mb1868c | Malate synthase G GlcB |
| ++ | ++ |
| Mb1943c | Catalase-peroxidase-peroxynitritase T KatG |
| ++ | ++ |
| Mb2006c | Probable cutinase precursor CFP21 |
| ++ | +++ |
| Mb2057c | Stress protein induced by anoxia |
| ++ | +++ |
| Mb2244 | Glutamine synthetase GlnA1 |
| ++ | +++ |
| Mb2898 | Cell surface lipoprotein Mpt83 (lipoprotein P23) |
| ++ | +++ |
| Mb2900 | Major secreted immunogenic protein Mpt70 |
| ++ | ++ |
| Mb0169 | Conserved protein TB18.5 |
| + | + |
| Mb0418c | Serine/threonine-protein kinase PknG |
| +++ | +++ |
| Mb2477c | Probable resuscitation-promoting factor RpfE |
| ++ | - |
| Mb1319 | Conserved protein |
| ++ | ++ |
| Mb1596 | Involved in biotin biosynthesis |
| ++ | ++ |
| Mb2656 | Universal stress protein family protein TB31.7 |
| ++ | ++ |
| Mb3046c | Esat-6 like protein EsxS |
| ++ | ++ |
| Mb2982c | Possible glycosyl transferase |
| ++ | + |
| Mb0455c | Cyclopropane fatty acid synthase |
| +++ | ++ |
| Mb2054c | pfkb |
| +++ | +++ |
| Mb3157c | devR |
| +++ | +++ |
| Mb2970c | Probable conserved lipoprotein LppX |
| ++ | |
| Mb0891 | Possible resuscitation-promoting factor rpfA |
| +++ | ++ |
| Mb1916 | Probable resuscitation-promoting factor rpfC |
| ++ | ++ |
| Mb2410 | Probable resuscitation-promoting factor rpfD |
| ++ | ++ |
| Mb3274c | Two component sensory transduction transcriptional regulatory protein mtrA |
| +++ | +++ |
| Mb0463c | Conserved protein |
| +++ | +++ |
| Mb3641 | Hypothetical arginine and proline rich protein |
| ++ | - |
| Mb0062 | Hypothetical protein |
| ++ | ++ |
| Mb1843 | Conserved protein |
| ++ | ++ |
| Mb3743c | Conserved protein |
| +++ | +++ |
| Mb1833c | Conserved protein | Insect | +++ | +++ |
| Mb0854c | Conserved protein |
| +++ | +++ |
| Mb2058 | Conserved protein Acg |
| ++ | - |
| Mb0337c | hypothetical protein |
| ++ | ++ |
| Mb2660c | Conserved protein |
| ++ | - |
| Mb 2659 | hypoxic response protein 1 hrp1 |
| +++ | +++ |
| Mb3707 | Probable bifunctional membrane-associated penicillin-binding protein 1a/1b pona2 |
| ++ | + |
| Mb0014c | Transmembrane Serine/therorine protein Kinase-B (pknb) |
| ++ | +++ |
| Mb0979 | Probable conserved Transmembrane protein |
| ++ | +++ |
| Mb0448 | GROEL protein-2 |
| ++ | +++ |
Soluble expression levels: +++ - strongest band on SDS-PAGE, ++ - among the stronger bands, + - visible band, − no visible band. Purification yields: +++ ∼1 mg/L, ++ ∼0.1 mg/L, + <0.1 mg/L, - not purified. Greyed boxes required wash and elution buffers with 0.1% sodium sarkosyl.
FIGURE 1The cloning and selection strategies for the generation of the M. bovis recombinant protein atlas. Panel (A). Generic construct design showing the use of either complete or concatenated ORFs tagged at the C-terminus with polyhistidine as present in vector pTriEx1.1. In the event of poor expression the same ORFs were re-cloned with an N-terminal His tag as shown. The sequence of the flexible linker is indicated. Asterisk–stop codon. Panel (B). The protein expression and purification regimen showing the iterative nature of the process. The varied constructs are those with alternate His tags shown in (A). Initial screening was by Western blot for the His tag in all cases.
FIGURE 2Example Western blot detection of recombinant M. bovis proteins with anti-His antibody. Panel (A). Detection of M. bovis ORFs expressed in induced E.coli. The lane identity and predicted molecular mass are: 1-Mb1228 (39 kDa), 2-Mb1916 (18 kDa), 3-Mb2410 (15 kDa), 4-Mb3441 (11 kDa), 5-Mb3412c (34 kDa), 6-EsxRST (32 kDa), 7-EsxNOQ (30 kDa), 8-EsxU (24 kDa), 9-EsxFDE (27 kDa), 10-EsxAB (21.2 kDa). Panel (B). Detection of M. bovis ORFs expressed in baculovirus infected insect cells. The lane identity and predicted molecular mass are: 1-PPE15_PE8 (66 kDa), 2-PPE20_PE15 (67 kDa), 3-PPE31_PE20 (51 kDa), 4-PPE41_PE25 (34 kDa), 5-PPE60_PE31 (53 kDa), 6-MTRA (28 kDa), 7-PPE14_PE7 (53 kDa), 8- Mb0463_Mb0674 (59 kDa), 9-Mb0854_Mb0977 (63 kDa), 10-Mb1036_Mb1129c (74 kDa), 11-Mb1272_Mb2477c (54 kDa), 12-Mb0891c (34 kDa), 13- Mb3070_Mb3641 (54 kDa), 14-Mb3338_Mb3644c (54 kDa). Open square (□) symbols indicate protein which show some breakdown as indicated by at least 2 His antibody reactive bands. Asterisk (*) indicates all Esx related proteins, most as concatenates. Diamonds (♦) indicate concatenated PPE-PE pairs. Circles (◘) other protein concatenates. M indicates the marker track, the molecular masses of which are given on the left of panel (A) in kilodaltons.
FIGURE 3Example histidine-tagged M. bovis proteins purified by immobilised metal affinity chromatography pull down with magnetic beads from E.coli (A) and insect cells (B). The lane identity and predicted molecular masses are: Panel A, 1-Mb0704 (44 kDa), 2-Mb3070 (37 kDa), 3-Mb2054 (36 kDa), 4-Mb2002_Mb0062 (35 kDa), 5-Mb0671 (17 kDa), 6-Mb0854 (31 kDa), 7- Mb3157c_Mb3743c (37 kDa), 8-Mb0337 (26 kDa). Panel B, 1-PE5_PE32 (19.5 kDa), 2-LpqN (25 kDa), 3-LpqX (22 kDa), 4-EsxAB (21.2 kDa), 5-EsxFDE (27.3 kDa), 6-EsxNOQ (30 kDa), 7- EsxRST (32 kDa), 8-EsxU (24 kDa), 9- Mb0485 (21 kDa), 10-PPE41_PE25 (34 kDa), 11-Mb1228 (39 kDa), 12-PPE69_PE36 (46.6 kDa), 13- PPE68_PE35 (47.5 kDa), 14-PPE14_PE7 (51 kDa), 15-PPE4_PE5 (67 kDa), 16- PPE15_PE8 (66 kDa), 17- PPE20_PE15 (67 kDa), 18-PPE31_PE20 (51 kDa). Asterisk (*) indicates all Esx related proteins, most as concatenates. Diamonds (♦) indicate concatenated PPE-PE pairs. M indicates the marker track, the molecular masses of which are given on the left of panel (A) in kilodaltons.
FIGURE 4Rescue of soluble purified recombinant bTB proteins that were originally poorly expressed by alternate positioning of the His tag. Panel (A). Examples of N-terminal tagged proteins purified from E.coli. 1- Mb3338 (31 kDa), 2-Mb2656 (31 kDa), 3-Mb2898 (22 kDa), 4-Mb2900 (19 kDa), 5-Mb0130 (34 kDa), 6-Mb0854 (31 kDa), 7- Mb1656 (53 kDa), 8-Mb3652 (18 kDa), 9-Mb3871 (20 kDa). Panel (B). Examples of N-terminal tagged proteins purified from insect cells. 1- Mb0977_Mb3046c (43 kDa), 2-Mb0959 (39 kDa), 3-Mb1319 (44 kDa), 4-Mb3070 (37 kDa), 5-Mb2244 (53 kDa), 6-Mb1228 (39 kDa). Molecular weight markers (M) are labelled to the left of each panel and are in kilodaltons.
FIGURE 5Rescued expression of problematic bTB proteins in E.coli by non-His fusion tags. Panel (A). The secretion accessory proteins EspC and EspA. 1- C-terminally His tagged EspC, 2- N-terminally GFP tagged EspA. Panel (B). 1-GB_Mb1762 (29 kDa), 2-GB_Mb1916 (25 kDa), 3-GB_Mb2410 (22 kDa) all expressed and purified as N-terminal fusions with the B1 domain of streptococcal protein G. Molecular weight markers (M) are labelled to left of each panel and are in kilodaltons.
Known and predicted structural features of difficult to express proteins.
|
|
| Structure yes (Y), No (N), homology based (H) | Swiss-model | Alphafold Feature | LLPS Probability |
|---|---|---|---|---|---|
| Mb0130 | Rv1003 | H (2–230 of 285) | 3Kwp.1.B | None | 0.19 |
| Mb0854 | Rv0831c | N | Unstructured N-term | 0.2 | |
| Mb1228 | Rv1196 | H (2–175 of 391) | 5xfs.1.B | Unstructured C-term | 0.944 |
| Mb1319 | Rv1288 | H (164–447 of 456) | 6sx4.1.A | Burried N-term | 0.19 |
| Mb1445 | Rv1410c | N | Unstructured C-term | 0.136 | |
| Mb1656 | Rv1630 | Y (283–438 of 481) | 4NNI | Unstructured N-term; highly exteded structure | 0.225 |
| Mb1762 | Rv1733c | N | Unstructured N-term | 0.5 | |
| Mb1916c | Rv1884c | Y (68–153 of 176) | 4OW1 | Unstructured N-term | 0.31 |
| Mb2244 | Rv2220 | Y | 1HTQ | None | 0.19 |
| Mb2410 | Rv2389c | Y (50–127 of 154) | 4ow1.1.A | Unstructured N-term | 0.54 |
| Mb2653 | Rv2620c | N | None | 0.22 | |
| Mb2656 | Rv2623 | Y | 3CIS | None | 0.19 |
| Mb2898 | Rv2873 | Y (58–219 of 220) | 1nyo.1.A | Unstructured N-term | 0.63 |
| Mb2900 | Rv2875 | Y (31–193 of 193) | 1NYO | Unstructured N-term | 0.17 |
| Mb3070 | Rv3044 | H (68–352 of 359) | 3tny.1.A | Unstructured N-term | 0.76 |
| Mb3338 | Rv3310 | H (5–284 of 299) | 1e3c.1.B | Unstructured N-term | 0.2 |
| Mb3644 | Rv3614c | N | Unstructured N-term | 0.46 | |
| Mb3646c | Rv3616c | N | Unstructured C-term | 0.57 | |
| Mb3652 | Rv3628 | Y | 1.WCF | None | 0.18 |
| Mb3871 | Rv3841 | Y | 7O6E | None | 0.13 |