| Literature DB >> 28839189 |
Caroline Tawk1, Malvika Sharan1,2, Ana Eulalio1, Jörg Vogel3,4.
Abstract
Many pathogenic bacteria utilize specialized secretion systems to deliver proteins called effectors into eukaryotic cells for manipulation of host pathways. The vast majority of known effector targets are host proteins, whereas a potential targeting of host nucleic acids remains little explored. There is only one family of effectors known to target DNA directly, and effectors binding host RNA are unknown. Here, we take a two-pronged approach to search for RNA-binding effectors, combining biocomputational prediction of RNA-binding domains (RBDs) in a newly assembled comprehensive dataset of bacterial secreted proteins, and experimental screening for RNA binding in mammalian cells. Only a small subset of effectors were predicted to carry an RBD, indicating that if RNA targeting was common, it would likely involve new types of RBDs. Our experimental evaluation of effectors with predicted RBDs further argues for a general paucity of RNA binding activities amongst bacterial effectors. We obtained evidence that PipB2 and Lpg2844, effector proteins of Salmonella and Legionella species, respectively, may harbor novel biochemical activities. Our study presenting the first systematic evaluation of the RNA-targeting potential of bacterial effectors offers a basis for discussion of whether or not host RNA is a prominent target of secreted bacterial proteins.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28839189 PMCID: PMC5570926 DOI: 10.1038/s41598-017-09527-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic representation of the combined computational and CLIP-screening approach to identify RNA-binding effectors (A). The main steps involving the computational analysis by APRICOT pipeline. A comprehensive list of all the available secreted effector proteins was assembled for the identification of RBDs. RNA-binding domains were identified using various domain databases and the associated domain search algorithms assembled. The predictions were ranked using feature-based scoring (See materials and methods). (B) Selected candidates were cloned into a bidirectional plasmid with an N-terminal Strep-FLAG tag and expressed in HEK293T cells. Proteins were analyzed using the CLIP-screening approach. The main steps of the CLIP-screening workflow are indicated. Cells expressing the effectors were UV-crosslinked in vivo, the protein-RNA complexes purified and subjected to enzymatic treatments, then visualized after separation on a gel.
Prediction of characterized eukaryotic domains
| Organism | DomainID | FullName | Evalue | % Coverage | % Similarity |
|---|---|---|---|---|---|
|
| NleH1 | Kinase domain, and C-terminal PDZ-binding motif[ | |||
| SSF56112 | Protein kinase | 1,5E-05 | — | — | |
|
|
|
| |||
| cd00180 | Catalytic domain of Protein Kinases | 0,07 | 54,4 | 22,3 | |
| PF00069 | Protein kinase domain | 0,04 | 45,4 | 21,5 | |
|
|
|
| |||
| cd09562 | SAM domain of liprin-alpha1,2,3,4 proteins repeat 1. Liprins were originally identified as LAR (leukocyte common antigen-related) transmembrane protein-tyrosine phosphatase-interacting proteins. | 1,5 | 45,1 | 29,6 | |
| PF13350 | Tyrosine phosphatase family | 2,5 | 15,2 | 9,1 | |
|
|
|
| |||
| PF00102 | Protein-tyrosine phosphatase. | 7E-39 | 100,0 | 43,5 | |
| cd00047 | Protein tyrosine phosphatases (PTP) | 1E-45 | 101,3 | 43,7 | |
|
|
|
| |||
| PF03496 | This entry represents an ADP-ribosyltransferase domain found in various proteins | 2,9E-14 | 82,3 | — | |
|
|
|
| |||
| None found | |||||
|
|
|
| |||
| None found | |||||
|
|
|
| |||
| cd07660 | BAR domain of Arfaptin. Arfaptins are ubiquitously expressed proteins implicated in mediating cross-talk between Rac, a member of the Rho family GTPases, and Arf (ADP-ribosylation factor) small GTPases. | 4,8 | 41,8 | 19,9 | |
|
|
|
| |||
| PF04488 | Glycosyltransferase sugar-binding region containing DXD motif | 6E-06 | 100,0 | 34,4 | |
|
|
|
| |||
| PLN03076 | ARF guanine nucleotide exchange factor (ARF-GEF) | 0 | 10,3 | 6,3 | |
| cd00171 | The Sec. 7 domain is the central domain of the guanine-nucleotide-exchange factors (GEFs) of the ADP-ribosylation factor family of small GTPases (ARFs) | 8E-60 | 100,5 | 63,2 | |
|
|
|
| |||
| COG4886 | Leucine-rich repeat (LRR) protein [Transcription]. | 4E-18 | 76,4 | 25,1 | |
| sd00033 | leucine-rich repeats, ribonuclease inhibitor (RI)-like subfamily | 2E-13 | 71,0 | 40,3 | |
|
|
|
| |||
| None found | |||||
Twelve effector proteins with characterized eukaryotic-like domains were selected and submitted for domain prediction with APRICOT. The organisms, effector protein names, the characterized domains, and the reference studies are highlighted in grey and bold. The results and scores from the automated prediction are below each effector in white. ‘−’Indicates value not available.
Selected effector candidates
| Protein | Predicted domain name | Domain category | Domain type | E-value | Similarity % | Coverage % | Species |
|---|---|---|---|---|---|---|---|
| legAS4 /Lpg1718 | RNA recognition motif-like Smg4_UPF3 | RRM | Classical | 0.15 | 29.55 | 61.36 |
|
| legC2 | Ribonuclease Y | RNAse Y | Non-Classical | 0.00 | 15.18 | 33.85 |
|
| legC8 | Putative RNA-associated protein | RNA_bind | Non-Classical | 1.40 | 12.55 | 27.71 |
|
| legL1 | Leucine rich repeat, ribonuclease inhibitor type | RNAse | Non-classical | 8.20 | — | — |
|
| lepA | Predicted RNA-binding protein | RNA_bind | Non-Classical | 0.13 | 8.87 | 26.24 |
|
| lepB | Ribonuclease Y | RNAse Y | Non-Classical | 0.32 | 16.15 | 37.94 |
|
| lpg0191 | La RNA-binding domain of La-related protein 4 | La | Classical | 0.68 | 22.67 | 41.33 |
|
| lpg1290 | RNA recognition motif in U2 small nuclear | LSM | Classical | 0.47 | 22.86 | 46.67 |
|
| lpg1489 | Superfamily II RNA helicase | DEAD | Classical | 0.52 | 4.42 | 12.10 |
|
| lpg1751 | RNA polymerase sigma factor | possible RBD | Non-Classical | 7.10 | 20.25 | 45.57 |
|
| lpg1962 | Cyclophilin_RRM: cyclophilin-type | RRM | Classical | 0.00 | 48.19 | 101.20 |
|
| lpg1962 | Cyclophilin_WD40: cyclophilin-type | WD40 | Non-Classical | 0.00 | — | — |
|
| lpg2327 | RRP7 domain ribosomal RNA-processing protein 7 | Ribosomal | Non-Classical | 4.70 | 22.66 | 46.88 |
|
| lpg2844 | Ebola nucleoprotein | possible RBD | Non-Classical | 0.62 | 8.65 | 31.52 |
|
| lpg2847 | Leucyl/phenylalanyl-tRNA protein | tRNA | Non-Classical | 7.10 | 9.25 | 20.81 |
|
| lpg2936 | RNA methyltransferase | RNA methyl- transferase | Classical | 0.00 | — | — |
|
| lpg2936 | RNA methyltransferase | RNA methyl- transferase | Classical | 0.00 | 60.00 | 99.58 |
|
| lubX | Zinc finger (Znf) domains | ZnFC2HC, zf-CCHC | Non-Classical | 0,00 | — | — |
|
| lubX | Zinc finger (Znf) domains | ZnFC2HC, zf-CCHC | Non-Classical | 5.10 | 21.21 | 29.29 |
|
| pipB | mRNA capping enzyme | mRNA capping | Classical | 2.80 | 1.34 | 3.47 |
|
| pipB2 | Sm protein G | LSM | Classical | 1.70 | 15.71 | 41.43 |
|
| sipB | Alanyl-tRNA synthetase | tRNA_synth | Non-Classical | 2.00 | 3.63 | 8.01 |
|
| slrP | Leucine Rich Repeat | possible RBD | Non-Classical | 0.42 | — | — |
|
| sseK3 | tRNA pseudouridine synthase B | PseudoU_synth | Non-Classical | 6.30 | 10.26 | 35.90 |
|
| sspH2 | Leucine rich repeat | possible RBD | Non-Classical | 0.19 | — | — |
|
| sspH1 | Leucine Rich Repeat | possible RBD | Non-Classical | 0.02 | — | — |
|
| vrgS | The phosphoinositide binding Phox Homology | Non-RBD | 0.51 | 16.54 | 62.99 |
| |
| ipaC | Seryl-tRNA synthetase | Ribosomal | Non-Classical | 0.01 | 6.76 | 20.75 |
|
| ipaH9.8 | RNA recognition motif 2 of RNA-binding protein | RRM | Classical | 0.28 | 46.15 | 101.28 |
|
| ospG | Poly(A) polymerase | PAP_assoc | Non-Classical | 0.86 | 5.85 | 12.20 |
|
| IpaH3 | Leucine Rich Repeat | possible RBD | Non-Classical | 0.12 | — | — |
|
| ORF169b | Ribosomal protein L25 | Ribosomal | Non-Classical | 2.80 | 14.77 | 32.39 |
|
| yopB | NOP domain | Nop | Non-Classical | 0.70 | 27.66 | 60.64 |
|
| yopD | NR_LBD_ER_like | Non-RBD | Non-RBD | 2.30 | 9.95 | 15.38 |
|
| yopE | SUD-M, Single-stranded poly(A) binding domain. | PAM2 | Classical | 5.20 | 10.26 | 35.90 |
|
| yopM | Leucine rich repeat | possible RBD | Non-Classical | 0.25 | — | — |
|
Thirty-three candidate RNA-binding effector proteins were selected for screening. The table comprises the protein name, the predicted RNA-binding domain, domain category and domain type according to the selected 112 known RBDs, the prediction scores, and the bacterial species. ‘—’ indicates value not available, the corresponding candidates were selected individually (see main text).
Figure 2Development of the CLIP-screening workflow using an RBP test-set (A). Detection on Western blot of Strep-FLAG tagged proteins, using an anti-FLAG antibody. Proteins are detected in the lysate (input) and after purification (eluate); all proteins are expressed and purified. The protein levels are compared to the endogenous β-actin levels. (B) Upper panel, autoradiograph showing the labelled RNA in association with the purified proteins. Arrowheads show the approximate molecular weight of the purified protein. Middle panel, an increased exposure of ENO-1 and SHMT2 signals. Lower panel, Western blot analysis detecting the tagged proteins in the purified complexes using an anti-FLAG antibody.
Figure 3Positive candidates obtained from CLIP-screening. Nine screened effector candidates showed a positive signal on the autoradiograph. (A–F) Effectors showing a crosslink signal in the absence and the presence of UV-irradiation. (G–I) Effectors showing a crosslink signal only in the presence of UV-irradiation.
Figure 4ATP-interaction and phosphorylation as major sources of CLIP false-positives (A). Analysis of YopM autoradiograph signals in the presence and absence of T4 PNK in the labeling reaction. The labeling is independent of PNK activity. TIAR was used as a positive control, the signal with TIAR is lost in the absence of the PNK. (B,C) Same as in (A). Performed on (B). SspH2. and (C). SlrP. (D) YopM was purified from the cytosolic and the nuclear fraction, and analyzed separately for PNK-dependent labeling. Cells expressing the tag alone were used as negative control.
Figure 5Analysis of effectors showing a UV-dependent crosslink product (A). Using different concentrations of RNase I to test the sensitivity of PipB2 substrates to RNAse digestion. TIAR was used as a positive control. (B) DNase I sensitivity assays and PNK-dependent labeling performed on PipB2. (C) Analysis of the tagged full-length or truncated domains of PipB2 by CLIP assays. The previously identified pentapeptide motif in the C-terminus is not required for the PipB2 crosslink product. (B) Immuno-staining using anti-FLAG (red), showing the sub-cellular localization of PipB2 domains. PipB2 N- and C-terminus are unable to localize correctly. The 1–320 truncated fragment of PipB2 localized comparably to the full-length protein. Cell nuclei are stained with Hoechst (blue).