Literature DB >> 29165646

Development and application of a recombination-based library versus library high- throughput yeast two-hybrid (RLL-Y2H) screening system.

Fang Yang1,2, Yingying Lei3,4, Meiling Zhou3,4, Qili Yao3,4, Yichao Han5, Xiang Wu3,4, Wanshun Zhong1,2, Chenghang Zhu3,4, Weize Xu3,4, Ran Tao3,4, Xi Chen3,4, Da Lin3,4, Khaista Rahman3,4, Rohit Tyagi3,4, Zeshan Habib3,4, Shaobo Xiao3,4, Dang Wang3,4, Yang Yu6, Huanchun Chen3,4, Zhenfang Fu3,4,7, Gang Cao3,4,5,8,9.   

Abstract

Protein-protein interaction (PPI) network maintains proper function of all organisms. Simple high-throughput technologies are desperately needed to delineate the landscape of PPI networks. While recent state-of-the-art yeast two-hybrid (Y2H) systems improved screening efficiency, either individual colony isolation, library preparation arrays, gene barcoding or massive sequencing are still required. Here, we developed a recombination-based 'library vs library' Y2H system (RLL-Y2H), by which multi-library screening can be accomplished in a single pool without any individual treatment. This system is based on the phiC31 integrase-mediated integration between bait and prey plasmids. The integrated fragments were digested by MmeI and subjected to deep sequencing to decode the interaction matrix. We applied this system to decipher the trans-kingdom interactome between Mycobacterium tuberculosis and host cells and further identified Rv2427c interfering with the phagosome-lysosome fusion. This concept can also be applied to other systems to screen protein-RNA and protein-DNA interactions and delineate signaling landscape in cells.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29165646      PMCID: PMC5815087          DOI: 10.1093/nar/gkx1173

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

As a fundamental milestone in the advancement of science, the human genome project has deciphered almost the entire gene sequences in our genetic instruction set and greatly promoted the progress of the current biomedical research. As the final product of most genes, proteins interact with each other and compose complicated and elegant networks to maintain the proper biological activities in all organisms. While the next generation sequencing technologies further facilitate genomic sequencing, the lack of high-throughput proteomic approaches for decoding proteomic networks hampers the systematical understanding of the molecular proteomic machinery of life. Generally, two approaches have been applied to investigate these intriguing protein networks. The proteomics-based approaches usually take advantage of mass spectrometry technologies to decipher the protein complex precipitated by a specific protein. Although this type of approach is straight forward, it is expensive and very challenging to detect low abundant proteins. Also, it is hard to distinguish direct and indirect interactions by this method (1). Protein array is another proteomic approach to identify protein interactions (2). Although it can be high-throughput, this method requires expensive and laborious protein purification processes. The genomic approaches, such as yeast two-hybrid (Y2H) system and phage display system, utilize cDNA to encode protein library and screen the library by various reporter systems. Y2H is a powerful tool, which has been extensively applied to decipher gene function and signaling pathways in most organisms. During the last two decades, a lot of progress has been achieved for Y2H system to reduce the cost and labor for large-scale library screening (3–6). However, for most Y2H systems, it still needs to pick up individual colony or prepare arrays for specific yeast clones. Recently, several high-throughput Y2H systems such as eY2H, Smart-pooling Y2H, Y2H-Seq, Stitch-Seq Y2H, BFG-Y2H have been developed to improve the library screening efficiency (2–4,7–10). While these systems substantially increase the Y2H screening efficiency, the time-consuming and costly individual treatment, such as plasmid isolation and PCR amplification of individual positive colonies, individual arrays for library preparation, or individual gene barcoding, are still required. For example, the recently reported BFG-Y2H system utilizes Cre recombinase to tag each bait and prey genes with a specific barcode and identifies all interacting proteins simultaneously by high-throughput sequencing (9). However, the barcode fusion mediated by Cre recombinase can be reversible, it may influence the efficiency of the system. Nevertheless, BFG-Y2H system can easily decode the interaction network via reading these chimeric protein-pair barcodes from pooled Y2H-positive colonies. Since barcoding each bait and prey is required before screening, it is still laborious, expensive, and even not applicable when a large number of proteins need to be tested for interactions. A very recent study developed a CrY2H system for all-by-all high-throughput PPI screening. Instead of fusing barcodes for bait and prey genes, this system could directly link the interacting bait and prey pairs after screening (11). All of the PCR products of fused bait and prey genes were then randomly sheared and subjected to sequencing. Although this system does not require individual gene barcoding, it needs massively deep sequencing for big PPI network, as only a small fraction (2.4%) of the CrY2H reads can be mapped to both prey and bait genes (11). Thus, a simple, high-throughput and cost-effective Y2H library vs library system without any individual treatment is highly desired to explore the intricate and intriguing protein interaction landscape. The microbes parasitized in host cells equip with two PPI networks. One constitutes of the proteins from the microbes themselves, and the other includes proteins from the microbes and host cells. The later trans-kingdom protein interaction network plays an important role in perturbing and modulating the host protein network, so that the microbes can escape the immunity and colonize host cell. Mycobacterium tuberculosis (M.tb) is an intracellular pathogen causing the most devastating zoonotic disease in the world (12). In 2015, 10.4 million new TB cases have been reported and 1.8 million died from this disease worldwide, which demonstrated TB became the leading infectious disease, even ranking above HIV/AIDS. During the process of infection and parasitization, the membrane and secretory proteins of M.tb extensively and specifically interact with host proteins to enter the cells, inhibit cellular anti-pathogen proteins, reprogram the host gene expression, and finally colonize the cells. One striking phenomenon is that M.tb could arrest phagosome maturation and escape into the cytosol for survival (13,14). In addition, the cytosolic M.tb perturbs host autophagy pathway by their secretory proteins, such as ESAT6, PknG (15,16). Deciphering the interactome between M.tb and host is pivotal to understand the molecular pathological basis of tuberculosis and facilitate the identification of anti-M.tb drug targets. The aim of this study is therefore to develop a simple, low-cost and high-throughput library vs library Y2H screening system without any individual treatment and apply it to delineate the global trans-kingdom protein interaction network between the M.tb proteins (all the secretory and membrane genes) and the host proteins (TB susceptibility gene, immunity and autophagy-related genes).

MATERIALS AND METHODS

Plasmid construction

The bacterial attachment site ATTB and phage attachment site ATTP were inserted behind the BamHI site of Clontech GAL4 Y2H vector pGBKT7 and pGADT7, respectively, by recombination. The PhiC31 expression cassette Pol III:PhiC31:Cyc1 terminator was inserted into pGBKT7-ATTB and pGADT7-ATTP vector backbones at the site of AvrII and NheI, respectively, to generate mBD and mAD vectors for the following library construction. All insertions were confirmed by Sanger sequencing. The expression efficiency of the modified vectors was tested by comparing the transformation efficiency and interacting strength to the conventional ones using the known positive interacting pair, murine p53 and SV40 T-antigen. The PCR primer synthesis and DNA sequencing were performed by Shanghai Sangon Biotech, China.

Library construction, screening and data Analysis

See Supplemental Experimental Protocol for RLL-Y2H.

GFPph-mCherry-LC3 stable cell line construction and autophagy assay

Raw264.7 cells were maintained in a humidified incubator at 37°C with 5% CO2 and grow in Dulbecco's modified Eagle's medium (DMEM) containing 10% fetal bovine serum (FBS), 100 U/ml penicillin, and 100 mg/ml streptomycin. The H11 site homolog arms were amplified from the genomic DNA of Raw264.7 cells by PCR. The eGFP gene was mutated to a pH super-sensitive version using NEB site-direct mutagenesis kit according to the manufacturer's protocol. The mutated eGFP was then fused with mCherry-LC3 and homology arm by overlapping PCR (The sequence was shown in Supplemental Figure S6). The gRNA (5′-CACCTTTAGTCATAGTGTTG-3′) targeted to the H11 site in the Raw264.7 cell was designed and cloned in CRISPR/Cas system vector px335 according to the protocol from http://www.genome-engineering.org/crispr. GFPph-mCherry-LC3 with H11 site homology arms and px335-gRNA plasmid were co-transfected into Raw264.7 cells using Lipofectamine 2000 (Invitrogen) according to the manufacturer's instructions. The cells with GFPph-mCherry-LC3 reporter were selected by FACS (BD, CA, USA). For autophagy assay, the aggregation and intensity of GFP/mCherry fluorescence signal were measured by fluorescence microscopy (Olympus, Japan) and analyzed using ImageJ, as previously described (17).

Statistical analysis

A Two-way t-test was used to analyze the significant differences of all the experiments by GraphPad Prism (version 5.01) software (San Diego, CA, USA). Differences were considered statistically significant when P < 0.05.

Supplemental experimental protocol for RLL-Y2H

Bait and prey library construction

A total of 542 M.tb (H37Rv) membrane and secretory protein ORFs (a kind gift from Dr Xiao in Huazhong Agricultural university, based on the previous studies and prediction by M.tb genome website: http://tuberculist.epfl.chandhttp://www.tbdb.org) were amplified and cloned into mBD vector by recombination. Basically, PCR products with homologous arms and linearized mBD vector were together transformed into yeast strain Y2Hgold, and plated on SD/-T selective plates. The growing colonies were then amplified to confirm that a PCR fragment was integrated into mBD vector properly via Sanger sequencing or plasmid digestion. The positive colonies for 542 ORFs were then separately mated with Y187 strain harboring empty mAD vector and were plated onto SD/-LT and SD/-AHLT selective plates to test their self-activation. Those not showing any self-activation were then pooled and used as the bait library. A total of 316 host protein ORFs were amplified from BALB/c mice lung and brain cDNA.The ORFs include Autophagy-related genes (www.mblintl.com), SNARE genes (Cell signaling technology, www.cellsignal.com), TB susceptible genes, Jak/Stat signaling, IL 6 receptor family, NF-kB signaling, T cell receptor signaling, as well as TLR signaling pathway (18–21). These ORFs were cloned into the mAD vector individually, and the subsequent plasmids were transformed into yeast strain Y187. Mixtures of transformants of every 20 plasmids were mated with Y2Hgold harboring murine p53 to rule out those showing sticky interaction or self-activation. The rest were pooled and used as the prey library.

Y2H library screen

Mate the bait and prey libraries at 50 rpm in 4 × 50 ml of 2xYPDA medium for 24 h. Next day, collect the cells and resuspend in 15 ml 0.9% NaCl, and plate onto 50 SD/-AHLT selective plates and grow for ∼7 days, meanwhile plating several dilutions on SD/-L, SD/-T and SD/-LT selective plates to calculate the mating efficiency. Pool all positive colonies from SD/-AHLT to isolate the integrated plasmids using the Yeast Plasmid Kit (OMEGA, D3376-01) according to the user manual.

High-throughput DNA library construction

Amplify the recombinated bait and prey fragments using the isolated integrated plasmids from the pooled positive colonies and amplify for 18–20 cycles. A smear band should be visible on agarose gel. Digest the purified PCR products with MmeI (NEB, CA) which yields a ∼110 bp target band, followed by low-melting gel purification. Prepare Illumina P5 and P7 adapters by annealing overhang NN-nucleotides P5: 5′-ACACTCTTTCCCTACACGACGCTCTTCCGATCTNN-3′ and 5′-AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT-3′; P7: 5′-GATCGGAAGAGCACACGTCTGAACTCCAGTCAC-3′ and 5′-GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCNN-3′, respectively. Add P5 and P7 adapters to purified 110bp fragments by T4 ligation at 16°C for more than 2 h. Amplify the ligation products using primers (5′-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′ and 5′-CAAGCAGAAGACGGCATACGAGAT(index)GTGACTGGAGTTCAGACGTGTGCTC TTCCGATC-3′) to construct high-throughput sequencing library. The deep sequencing was performed on HiSeq2000 sequencer (Illumina, CA, USA).

Data processing

Scan all the reads for ATTL site ‘TAGCGTGCGGGTGCCAGGGCGTGCCCTTGAGT TCTCTCAGTTGGGGGCGTTGAC’ and MmeI recognition sites ‘TCCGAC’. Retain the reads containing about 20bp DNA flanking both sides of MmeI-ATTL-MmeI structure for subsequent analysis. Trimmed the sequences to 13 bp from the ends of raw reads. Built the reference libraries based on the ORFs in bait and prey libraries (There are 542 bait and 316 prey ORFs in the case of this study). Align the trimmed sequencing pairs against the reference libraries using BLAST+. Only unique mapping reads with no mismatch and the sequence pairs in which both of reads could be aligned unambiguously with no mismatch are retained. Interaction frequency of an interacting protein pair can be calculated by counting the sequence pairs that could be aligned to them.

RESULTS

Design of the technology

To establish a high-throughput system which allows simultaneous PPI screening between a bait library and a prey library, the bait and prey vectors of the conventional Y2H system from Clontech were modified to develop a novel recombination-based ‘library versus library’ Y2H system, designated RLL-Y2H (Figure 1). The design of this system is based on the high efficient DNA integration ability of PhiC31 as well as its specific recognition sites, ATTB and ATTP (22,23). Basically, ATTB and ATTP sites were fused right behind the bait and prey genes, respectively. In the presence of PhiC31 integrase, the bait and prey plasmids will be irreversibly integrated into a single plasmid, in which the bait and prey genes are fused at tail-to-tail orientation with an ATTL linker in between (designated bait-ATTL-prey) (Figure 1A–C; Supplemental Figure S1A–C). In order to highly express PhiC31 and avoid competition with the ADH1 promoter which drives the bait and prey genes, we screened several Saccharomyces cerevisiae promoters and selected Pol III promoter to drive PhiC31 expression for its high activities. As illustrated in Figure 1B, PhiC31 together with its recognition sites ATTB and ATTP were inserted into pGBKT7 and pGADT7 vectors of the Matchmaker GAL4-based two-hybrid assay system to generate new bait and prey vectors named mBD and mAD. The derived plasmids containing bait and prey genes were introduced into diploid yeast cells via mating and selected on dropout plates lacking Adenine/Histidine/Leucine/Tryptophan (SD/-AHLT). The integration of bait and prey plasmids occurred in individual yeast cells in vivo and only the cells containing positive interacting protein pair could grow on the selective SD/-AHLT plates. The survived cells containing interacting bait-and-prey pairs were pooled for DNA extraction and PCR amplification. This approach avoids individual yeast isolation, DNA preparation and individual gene barcoding, therefore greatly reduces the laborious workload. Moreover, the sequences of the fused bait and prey amplicons can be used to unambiguously decode each interacting bait-and-prey pair and allow high throughput ‘library vs library’ PPI screening. The PCR amplicons of fused bait and prey pairs were mostly more than 1 kb with variable sizes, which are very challenging for high-throughput sequencing. To this end, type II restriction enzyme MmeI (which cuts DNA ∼20 bp downstream of its recognition site) recognition sequences were inserted right after bait and prey genes, respectively, to generate a∼110 bp DNA fragments containing a pair of 20 bp interacting proteins cDNA strands and ATTL linker in between (Figure 1C). These fragments were then ligated with Illumina P5 and P7 adapters and subjected to high-throughput sequencing.
Figure 1.

Flowchart of Recombination-based ‘Library vs Library’ Y2H system (RLL-Y2H) screening. (A) The yeasts with bait and prey plasmids are mated and only the yeasts containing interacting bait-prey pairs can grow on SD/-AHLT selection plates. The bait and prey plasmids are integrated into a single plasmid so that the interacting bait and prey genes are paired in a single DNA fragment. (B) The integrase PhiC31 recognition sites ATTB and ATTP are inserted right behind the bait and prey genes, respectively. PhiC31 is inserted into the backbone and is driven by the Pol III promoter. The integrase PhiC31 recognizes ATTB and ATTP sites and integrates the bait and prey plasmids into a big plasmid in vivo which unambiguously couples the interacting pair of specific bait and prey genes. The ADH1 promoters and polyadenylation sites, as well as their positions after the recombination were indicated by the green bars with letter ‘P’ and ‘T’, respectively. The stop codons are indicated by the red lines right behind Bait and Prey. (C) The recombinated bait and prey fragments are amplified and digested by Type II restriction enzyme MmeI to generate ∼110 bp fragments containing bait tail-ATTL-prey tail, followed by DNA sequencing library construction and high-throughput sequencing.

Flowchart of Recombination-based ‘Library vs Library’ Y2H system (RLL-Y2H) screening. (A) The yeasts with bait and prey plasmids are mated and only the yeasts containing interacting bait-prey pairs can grow on SD/-AHLT selection plates. The bait and prey plasmids are integrated into a single plasmid so that the interacting bait and prey genes are paired in a single DNA fragment. (B) The integrase PhiC31 recognition sites ATTB and ATTP are inserted right behind the bait and prey genes, respectively. PhiC31 is inserted into the backbone and is driven by the Pol III promoter. The integrase PhiC31 recognizes ATTB and ATTP sites and integrates the bait and prey plasmids into a big plasmid in vivo which unambiguously couples the interacting pair of specific bait and prey genes. The ADH1 promoters and polyadenylation sites, as well as their positions after the recombination were indicated by the green bars with letter ‘P’ and ‘T’, respectively. The stop codons are indicated by the red lines right behind Bait and Prey. (C) The recombinated bait and prey fragments are amplified and digested by Type II restriction enzyme MmeI to generate ∼110 bp fragments containing bait tail-ATTL-prey tail, followed by DNA sequencing library construction and high-throughput sequencing.

Development of the technology

To test the efficiency of this RLL-Y2H system, positive controls consisting of a well-known interacting protein pair, murine p53 and SV40 T-antigen (24), were inserted into the mBD and mAD of RLL-Y2H system, respectively. The interaction was clearly detected (Figure 2A), and the transformation, mating efficiency as well as the interacting efficiency were compatible with the conventional Y2H system (data not shown). To investigate the integration efficiency, the survived colonies containing p53 bait plasmid and T-antigen prey plasmid on the SD/-AHLT selective plates were used to amplify the p53-ATTL-T-antigen integration fragment. As shown in Figure 2B, amplification of each colony yielded the expected PCR products, and the precise integration was further confirmed by Sanger sequencing (Figure 2C). Next, this p53-ATTL-T-antigen integration fragment was digested by MmeI, which generated an expected ∼110bp fragment comprising of p53 MmeI end, ATTL linker and T-antigen MmeI end (Figure 2D). Together, these data provide proof of the concept for our novel high-throughput RLL-Y2H system.
Figure 2.

Proof-of-principle experiment of RLL-Y2H system. (A) Positive control interacting protein pair, murine p53 and SV40 T-antigen were inserted into mBD and mAD of RLL-Y2H system and were transformed into yeast respectively. Only the yeasts containing both p53 and SV40 T-antigen but not the controls can grow on SD/-AHL T selection plates. (B) After plasmid extraction from yeast, p53, SV40 T-antigen and their recombinated fragments were amplified by PCR. (C) Sanger sequencing of the recombinated p53 and SV40 T-antigen fragments. The linker ATTL and MmeI sites were highlighted. (D) The PCR products of the recombinated p53 and SV40 T-antigen fragments before (left panel) and after (right panel) MmeI digestion.

Proof-of-principle experiment of RLL-Y2H system. (A) Positive control interacting protein pair, murine p53 and SV40 T-antigen were inserted into mBD and mAD of RLL-Y2H system and were transformed into yeast respectively. Only the yeasts containing both p53 and SV40 T-antigen but not the controls can grow on SD/-AHL T selection plates. (B) After plasmid extraction from yeast, p53, SV40 T-antigen and their recombinated fragments were amplified by PCR. (C) Sanger sequencing of the recombinated p53 and SV40 T-antigen fragments. The linker ATTL and MmeI sites were highlighted. (D) The PCR products of the recombinated p53 and SV40 T-antigen fragments before (left panel) and after (right panel) MmeI digestion.

Construction and screening of M.tb bait library and host prey library

Next, we applied this RLL-Y2H system to investigate the interactome between M.tb and host cells, which occurs mostly between the membrane and secretory proteins of M.tb and immunity-related proteins, the autophagy pathway proteins and M.tb susceptibility genes of host cells. To this end, 542 M.tb (H37Rv) membrane and secretory protein ORFs (predicted by http://tuberculist.epfl.ch and http://www.tbdb.org) and 316 mouse immunity-related proteins, M.tb susceptibility genes, autophagy pathway protein ORFs (18–21) were cloned to construct the bait and prey libraries, respectively (Figure 3A, Supplemental Tables S1 and S2). Each bait and prey was tested for self-activation before screening. Twenty out of 542 baits were detected as self-activators, and were therefore excluded for the subsequent screenings (red marked in Supplemental Table S1). The bait library was mated with the prey library in 4 × 50 ml mating culture with∼17% mating efficiency, yielded ∼8800 colonies on SD/-AHLT selective plates. All the colonies were pooled for plasmid extraction and PCR amplification. A smear band representing integrated bait-ATTL-prey fragments were observed (Figure 3B) and then harvested for MmeI digestion, which generated a ∼110bp band comprising of the bait MmeI end, the ATTL linker and the prey MmeI end (Figure 3C). These digested products were ligated with Illumina P5 and P7 adapters to generate a high-throughput sequencing library (Figure 3D). To evaluate the reproducibility of the RLL-Y2H system, we performed two independent parallel screening from a small bait library containing 150 M.tb genes and a small prey library containing 200 host cell genes. To decode the protein interaction network between M.tb and host cell from the high-throughput sequencing data, a custom script was used to filter raw data, align reads and construct PPI network (Figure 3E). As shown in Figure 3F, 190 out of 227 of the paired interacting bait and prey sequences from screening 1# can be re-captured by an independent screening 2# (with 273 pairs).
Figure 3.

RLL-Y2H M.tb and host libraries construction and screening. (A) Flowchart of M.tb bait library and host prey library construction and screening. A total of 542 M.tb ORFs and 316 host protein ORFs were cloned and transformed into yeast Y2HGold and Y1871 strains, respectively. The bait library and prey library were mated for screening in a single pool. (B) After plasmid isolation from the positive colony pool, the recombinated prey and bait fragments were amplified and (C) digested by MmeI. (D) Electrophoresis of the RLL-Y2H sequencing library. (E) Flowchart of sequencing data analysis and PPI network construction. (F) Venn diagram of the bait-prey pair library sequencing data from two independent screening.

RLL-Y2H M.tb and host libraries construction and screening. (A) Flowchart of M.tb bait library and host prey library construction and screening. A total of 542 M.tb ORFs and 316 host protein ORFs were cloned and transformed into yeast Y2HGold and Y1871 strains, respectively. The bait library and prey library were mated for screening in a single pool. (B) After plasmid isolation from the positive colony pool, the recombinated prey and bait fragments were amplified and (C) digested by MmeI. (D) Electrophoresis of the RLL-Y2H sequencing library. (E) Flowchart of sequencing data analysis and PPI network construction. (F) Venn diagram of the bait-prey pair library sequencing data from two independent screening.

Delineation and confirmation of PPI network between M.tb and the host

Next, we performed the full screening with 542 ORFs of M.tb and 316 ORFs of host cells. Among 46 million raw reads, 14 million reads could be mapped as M.tb-host interacting protein pairs. We identified a total of 1,438 PPIs between 82 M.tb proteins and 128 host proteins (Figure 4A; Supplemental Table S3). The interaction frequencies of M.tb-host protein pairs were calculated based on the number of corresponding aligned reads. Of note, the number of the interaction pairs with low sequencing counts (<5) is dramatically higher than that of the others (Figure 4B). To obtain a high-quality protein interaction network, the interaction pairs with low counts (<5) were excluded from PPIs. Meanwhile, the distribution of host and TB proteins according to the number of their interactors showed that there were long tails in both graphs indicating a small amount of proteins with a large number of interactors (more than 45, Figure 4C and D).Thus, the proteins with more than 45 interactors were also removed from the network, as they are most likely sticky proteins with high chance to physically but not physiologically interact with many proteins. Finally, we delineated a PPI network between M.tb proteins and host proteins containing 441 PPIs (Figure 4E; Supplemental Table S4). The host proteins were divided into several groups according to their physiological functions such as toll-like receptor signaling, T cell receptor signaling, Autophagy, M.tb susceptible genes, Jak/Stat signaling, NF-kB signaling and B cell receptor signaling. The PPI sub-networks belong to specific signaling pathways were then constructed to show their potential physiological function (Figure 5A and Supplemental Figures S2 and S3).
Figure 4.

Delineation the protein interaction network between M.tb and host cells. (A) The whole genome PPI network between M.tb bait and host cell. Blue nodes represent TB proteins and yellow nodes represent host proteins. (B) The distribution of TB–host protein interaction pairs according to their sequencing counts. 5 was chosen as cut-off as there was a dramatic increase of interaction pairs below 5 counts. (C and D) The distribution of host and TB proteins is according to the numbers of their interactors. The sticky proteins (>45 interactors) have higher degree than others and therefore were cut-off for future analysis. (E) The filtered PPI network between M.tb bait and host cell.

Figure 5.

Identification of the M.tb proteins interfering autophagy in host cells. (A) The PPI network between M.tb and autophagy pathway proteins of host cells. (B) Raw264.7 cells expressing GFPph-mCherry-LC3 for autophagy assay was generated using CRISPR/Cas9 system. The stable cell line transfected either with an empty vector or with 38 M.tb genes in the M.tb-autophagy PPI network individually, were treated with rapamycin to induce autophagy. (C) Rv2427c and control plasmid were transfected to GFPph-mCherry-LC3 Raw264.7 stable cell line and induce autophagy by rapamycin. Transfection of Rv2427c significantly rescued the rapamycin-induced decrease of GFP/mCherry fluorescence ratio of the aggregation dots. Arrows indicate transfection positive cells. Arrows indicate transfection positive cells. (D) Quantification of GFP/mCherry fluorescence ratio of the aggregation dots (n = 48). (E) Verification of the protein-protein interaction by BiFC assay. Rv2427c were co-transfected with WASP and the negative controls into HEK 293T cells, respectively. Cells were harvested 36h after transfection for confocal fluorescence microscopy-based image analysis. The interactions between Rv2427c and prey proteins allow re-formation of a bimolecular fluorescent complex. (F) Quantification of the fluorescence.

Delineation the protein interaction network between M.tb and host cells. (A) The whole genome PPI network between M.tb bait and host cell. Blue nodes represent TB proteins and yellow nodes represent host proteins. (B) The distribution of TB–host protein interaction pairs according to their sequencing counts. 5 was chosen as cut-off as there was a dramatic increase of interaction pairs below 5 counts. (C and D) The distribution of host and TB proteins is according to the numbers of their interactors. The sticky proteins (>45 interactors) have higher degree than others and therefore were cut-off for future analysis. (E) The filtered PPI network between M.tb bait and host cell. Identification of the M.tb proteins interfering autophagy in host cells. (A) The PPI network between M.tb and autophagy pathway proteins of host cells. (B) Raw264.7 cells expressing GFPph-mCherry-LC3 for autophagy assay was generated using CRISPR/Cas9 system. The stable cell line transfected either with an empty vector or with 38 M.tb genes in the M.tb-autophagy PPI network individually, were treated with rapamycin to induce autophagy. (C) Rv2427c and control plasmid were transfected to GFPph-mCherry-LC3 Raw264.7 stable cell line and induce autophagy by rapamycin. Transfection of Rv2427c significantly rescued the rapamycin-induced decrease of GFP/mCherry fluorescence ratio of the aggregation dots. Arrows indicate transfection positive cells. Arrows indicate transfection positive cells. (D) Quantification of GFP/mCherry fluorescence ratio of the aggregation dots (n = 48). (E) Verification of the protein-protein interaction by BiFC assay. Rv2427c were co-transfected with WASP and the negative controls into HEK 293T cells, respectively. Cells were harvested 36h after transfection for confocal fluorescence microscopy-based image analysis. The interactions between Rv2427c and prey proteins allow re-formation of a bimolecular fluorescent complex. (F) Quantification of the fluorescence. As each screening system can generate false positive data, the identified PPIs were then subjected to interaction retest. To systematically evaluate the false positive rate, 269 randomly selected protein interaction pairs were transformed into Y187 and Y2HGold yeast strains, respectively, and were mated to confirm their interaction individually. As shown in Supplemental Figure S4 and Table S5, 189 out of 269 (70.3%) protein interaction pairs can be confirmed. To verify the cutoff value (five counts), we re-tested 40 interacting pairs with less than 5 counts. As shown in Supplemental Figure S5 and Table S6, only 6 out 40 (15%) could be confirmed, which was much less than the average positive rate (70.3%). Meanwhile, the positive rate of re-test data from the randomly selected 269 interacting pairs were plotted against their sequencing counts. As shown in Supplemental Figure S5, the positive rate and the sequencing counts are positively correlated, suggesting that sequencing count 5 is a rational cutoff.

Identification of the M.tb proteins interfering autophagy in host cells

Next, we aimed to elucidate the potential physiological function of the PPI network we identified between M.tb and the host cells. As an intracellular pathogen, M.tb interferes with host cell defense machinery, such as autophagy pathway (25). Thus, the sub-set network between M.tb and autophagy pathway was extracted from the entire PPI network (Figure 5A). To test whether the M.tb proteins in this network could potentially interfere with host cell autophagy, we generated a stable cell line expressing GFPph-mCherry-LC3 for autophagy assay. More specifically, eGFP were mutated to a pH super-sensitive version (26) and then fused with mCherry and LC3. This autophagy reporter was inserted into the H11site (27) of Raw264.7 cell genome using CRISPR/Cas system (Figure 5B; Supplemental Figure S6A and B). In this cell line, rapamycin could dramatically induce autophagy as indicated by the aggregation of GFP/mCherry dots and their decrease of GFP/mCherry fluorescence ratio which changed in response to the pH gradient due to phagosome-lysosome fusion (28). Next, 38 M.tb genes in this M.tb-autophagy PPI network and a control (empty vector) were individually transfected into this stable cell line (Figure 5B). Autophagy was induced by rapamycin and then subjected to florescence assay. Of note, transfection of Rv2427c could significantly rescue the rapamycin-induced decrease of GFP/mCherry fluorescence ratio of aggregation dots, suggesting this protein can interfere with autolysosome formation (Figure 5C and D; Supplemental Figure S7). We also performed immunofluorescence experiment to confirm the expression of Rv2427c in GFPph-mCherry-LC3 Raw cell line as shown in Supplemental Figure S7. According to our PPI network, four host autophagy pathway related proteins (PERK, Rubicon,TSC2 and WASP) have been confirmed to interact with Rv2427c. As the interaction frequency of PERK and TSC2 were dramatically lower than that of Rubicon and WASP, we further verified the interaction between Rv2427c and Rubicon and WASP by bimolecular fluorescence complementation (BiFC) assay in eukaryotic system. As shown in Figure 5E and F, the interaction between Rv2427c and WASP can be validated by the BiFC assay. However, the interaction between Rv2427c and Rubicon could not be confirmed by this approach and needs to be further investigated.

DISCUSSION

Y2H is an ideal system to screen direct PPI but suffers a lot for its intensive labor. While recent-state-of-the-art high-throughput Y2H technologies, such as Smart-pooling Y2H, Y2H-Seq and Stitch-Seq Y2H systems reduced the Y2H screening efforts and cost (3,4,7–9). These systems require either individual arrays to prepare gene library, yeast transformation or individual treatment for plasmid extraction and PCR amplification which are still expensive and time-consuming. A recently reported BFG-Y2H system ingeniously tags each pair of bait and prey genes with a specific barcode and utilize Cre-lox system to fuse barcodes from distinct plasmids (9). As Cre-lox system mediated barcode fusion can be reversible, it may influence the efficiency of the system. Nevertheless, this system can easily decode the interaction network via reading these chimeric protein-pair barcodes. However, it is still laborious and expensive when a large library with unique barcodes needs to be prepared. Recently, a CrY2H system was developed for all-by-all high-throughput PPI screening, in which the PCR products of the fused bait and prey genes were randomly sheared for sequencing (11). Although this system doesn’t need barcoding individual gene, only 2.4% CrY2H reads contain sequence of both prey and bait genes. Thus, to delineate a big PPI network, a massive deep sequencing is required. In this study, the RLL-Y2H system took advantage of a highly efficient recombination activity of Phic31 integrase to efficiently and irreversibly combine a bait and a prey into a single plasmid in vivo (22,23). As the length of fused DNA fragments of interacting bait and prey genes are long and diverse, they are not suitable for high-throughput sequencing. To this end, we inserted MmeI restriction sites on both ends of bait and prey genes, which could release a 110bp fragment after digestion. This fragment is suitable for high-throughput sequencing, of which the sequences containing matrix information of interacting bait and prey pair can be directly used to decode PPI networks without any barcoding procedure. Together, our simple RLL-Y2H system could substantially reduce screening time, cost and labor work, and has theoretically the capacity for genome-widely massive library vs library screening. Compared with the approaches mentioned above, our system has several advantages: 1) No requirement for individual treatment such as plasmid isolation and PCR amplification; 2) No requirement for gene barcoding which allows big library vs library screening at once for huge PPI network; 3) The application of MmeI digestion can efficiently remove the massive deep sequencing noise. As an intracellular pathogen, M.tb has equipped with various machinery to evade host immune system in order to replicate and persist within host cells. A trans-kingdom interactome is hence evolved when M.tb proteins encounter host proteins. The center of this interactome is the interaction network between M.tb secretory and membrane proteins and the host immunity- and autophagy-related proteins. Here, we applied the RLL-Y2H system to decipher the central PPI network between M.tb and host cells and to investigate the physiological role of the subset interaction network between M.tb and host autophagy pathway. Interestingly, Rv2427c was identified to interfere with phagosome-lysosome fusion, which may play an important role in perturbing host autophagy pathway. It has been shown that WASP is crucial for lysosome motility (29), and can directly bind to ezrin which is required for the efficient fusion of lysosome and phagosome (30). Therefore, it could be possible that Rv2427c may interfere the physiological function of WASP by direct protein-protein interaction and thus inhibit autolysosome formation. It would be of great importance to further investigate the physiological role of these M.tb proteins in counteracting autophagy pathway, such as initiation, elongation, maturation and recycling. These will shed a light on the pathogenesis of M.tb and may facilitate the identification of anti-M.tb drug targets.

Limitation

One general shortcoming of high throughput screening is the high false positive rate. In our study, 70.3% of PPI were confirmed by independent retransformation of the 269 randomly chosen pairs of the identified interacting proteins. In two previous large-scale human interactome screening studies using traditional Y2H system, the authors could verify 65% and 78% of the identified interactions, respectively (6). A recent study developing CrY2H-seq system for PPI screening demonstrated a positive rate of 73% (11). Our positive rate is comparable with previous studies, suggesting that the ∼29.7% false positive rate is unlikely due to the modification of the conventional Y2H system or interference by the PhiC31. It is rather a common feature of Y2H screening which usually yields a certain amount of false positive interacting proteins (31–33). In this scenario, we showed that 190 out of 227 pairs of the interacting protein pairs can be re-captured by independent parallel screening. It would be ideal if the PPI screening system could also be applied to quantify the protein interaction strength. Theoretically, this can be semi-quantified by the frequency of sequencing reads, as the strong interactors generally grow faster than those weak interactors. However, it is challenging at this stage, mostly due to the fact that the amplification of the integrated plasmids could be biased. Since the length of the fused bait and prey fragments before MmeI digestion is very diverse, ranging from ∼600 bp to 5 kb. Compared to the long fragments of the fused bait-prey, the short fragments might be most likely over-amplified due to the bias of PCR amplification. This problem might be solved by using emulsion PCR which could independently amplify the individual DNA on beads (34). It is thus possible to update RLL-Y2H as a semi-quantitative screening system to map the PPI strength by the frequency of sequencing reads. Every technology has its own most suitable application. In this regards, RLL-Y2H is more suitable for large-scale library vs library interaction screening. For medium scale PPI network screening, array-based Y2H such as eY2H, Mini-pool Y2H or Smart-pooling Y2H might be a better option. In our system, although the gene coding region will not be affected after recombination, the lack of polyadenylation sites may affect the expression of bait and prey proteins. Nevertheless, in our positive control experiments using P53-T pair, the yeast could grow properly on selective plates, suggesting that the expression of bait and prey genes were not significantly affected. This could be attributed to several reasons: (i) the bait, prey and fusion plasmids are all present simultaneously in a zygote yeast; (ii) the fusion of bait and prey plasmids happens after the expression of phic31, during this time the bait and prey genes are independently expressed and can thus initiate the expression of the selective marker genes if they bind to each other; (iii) even there are no polyadenylation sites after the bait and prey genes in fusion plasmids, these genes can still be expressed at a certain level. In future studies, this system could be updated by moving the polyadenylation sites to the fronts of ATTB and ATTP sites (Supplemental Figure S8), so that the expression of all bait and prey genes would not be affected at all after recombination. In our study, we plotted the numbers of the host or TB proteins against the number of their interactors and found long tails in both graphs. These proteins located in the end of tails are most likely sticky proteins with high chance to physically but not physiologically interact with many proteins, and were therefore removed from PPI network. It would be ideal to experimentally verify this cutoff value. However, while we can design a rational experiment to verify the lowest cutoff value, there is no perfect experiment to validate the highest cutoff value. The cutoff value needs to be carefully selected for different experiments depending on the sequencing depth, overall positive rate and some other factors.

Perspective

Other than Y2H for PPI screening, this recombination-based ‘library vs library’ screening concept can also be applied to other yeast screening systems such as Y1H (35,36) and Y3H (37,38). These approaches may open new avenues for delineating the global PPI, protein–DNA and protein–RNA interaction landscape and might pave the way to understand the details of the signaling processes in cells.

AVAILABILITY

The accession number for the sequencing data reported in this paper is GEO: GSE93036. Click here for additional data file.
  36 in total

1.  Increasing specificity in high-throughput yeast two-hybrid experiments.

Authors:  Pierre-Olivier Vidalain; Mike Boxem; Hui Ge; Siming Li; Marc Vidal
Journal:  Methods       Date:  2004-04       Impact factor: 3.608

2.  Ipr1 gene mediates innate immunity to tuberculosis.

Authors:  Hui Pan; Bo-Shiun Yan; Mauricio Rojas; Yuriy V Shebzukhov; Hongwei Zhou; Lester Kobzik; Darren E Higgins; Mark J Daly; Barry R Bloom; Igor Kramnik
Journal:  Nature       Date:  2005-04-07       Impact factor: 49.962

3.  A three-hybrid system to detect RNA-protein interactions in vivo.

Authors:  D J SenGupta; B Zhang; B Kraemer; P Pochart; S Fields; M Wickens
Journal:  Proc Natl Acad Sci U S A       Date:  1996-08-06       Impact factor: 11.205

4.  Gene-centered yeast one-hybrid assays.

Authors:  John S Reece-Hoyes; Albertha J M Walhout
Journal:  Methods Mol Biol       Date:  2012

5.  Variations in the NRAMP1 gene and susceptibility to tuberculosis in West Africans.

Authors:  R Bellamy; C Ruwende; T Corrah; K P McAdam; H C Whittle; A V Hill
Journal:  N Engl J Med       Date:  1998-03-05       Impact factor: 91.245

6.  Identification of mutations in p53 that affect its binding to SV40 large T antigen by using the yeast two-hybrid system.

Authors:  B Li; S Fields
Journal:  FASEB J       Date:  1993-07       Impact factor: 5.191

7.  A proteome-scale map of the human interactome network.

Authors:  Thomas Rolland; Murat Taşan; Benoit Charloteaux; Samuel J Pevzner; Quan Zhong; Nidhi Sahni; Song Yi; Irma Lemmens; Celia Fontanillo; Roberto Mosca; Atanas Kamburov; Susan D Ghiassian; Xinping Yang; Lila Ghamsari; Dawit Balcha; Bridget E Begg; Pascal Braun; Marc Brehme; Martin P Broly; Anne-Ruxandra Carvunis; Dan Convery-Zupan; Roser Corominas; Jasmin Coulombe-Huntington; Elizabeth Dann; Matija Dreze; Amélie Dricot; Changyu Fan; Eric Franzosa; Fana Gebreab; Bryan J Gutierrez; Madeleine F Hardy; Mike Jin; Shuli Kang; Ruth Kiros; Guan Ning Lin; Katja Luck; Andrew MacWilliams; Jörg Menche; Ryan R Murray; Alexandre Palagi; Matthew M Poulin; Xavier Rambout; John Rasla; Patrick Reichert; Viviana Romero; Elien Ruyssinck; Julie M Sahalie; Annemarie Scholz; Akash A Shah; Amitabh Sharma; Yun Shen; Kerstin Spirohn; Stanley Tam; Alexander O Tejeda; Shelly A Trigg; Jean-Claude Twizere; Kerwin Vega; Jennifer Walsh; Michael E Cusick; Yu Xia; Albert-László Barabási; Lilia M Iakoucheva; Patrick Aloy; Javier De Las Rivas; Jan Tavernier; Michael A Calderwood; David E Hill; Tong Hao; Frederick P Roth; Marc Vidal
Journal:  Cell       Date:  2014-11-20       Impact factor: 41.582

8.  Next-generation sequencing to generate interactome datasets.

Authors:  Haiyuan Yu; Leah Tardivo; Stanley Tam; Evan Weiner; Fana Gebreab; Changyu Fan; Nenad Svrzikapa; Tomoko Hirozane-Kishikawa; Edward Rietman; Xinping Yang; Julie Sahalie; Kourosh Salehi-Ashtiani; Tong Hao; Michael E Cusick; David E Hill; Frederick P Roth; Pascal Braun; Marc Vidal
Journal:  Nat Methods       Date:  2011-04-24       Impact factor: 28.547

9.  NOD2 and toll-like receptors are nonredundant recognition systems of Mycobacterium tuberculosis.

Authors:  Gerben Ferwerda; Stephen E Girardin; Bart-Jan Kullberg; Lionel Le Bourhis; Dirk J de Jong; Dennis M L Langenberg; Reinout van Crevel; Gosse J Adema; Tom H M Ottenhoff; Jos W M Van der Meer; Mihai G Netea
Journal:  PLoS Pathog       Date:  2005-11-25       Impact factor: 6.823

10.  Pooled-matrix protein interaction screens using Barcode Fusion Genetics.

Authors:  Nozomu Yachie; Evangelia Petsalaki; Joseph C Mellor; Jochen Weile; Yves Jacob; Marta Verby; Sedide B Ozturk; Siyang Li; Atina G Cote; Roberto Mosca; Jennifer J Knapp; Minjeong Ko; Analyn Yu; Marinella Gebbia; Nidhi Sahni; Song Yi; Tanya Tyagi; Dayag Sheykhkarimli; Jonathan F Roth; Cassandra Wong; Louai Musa; Jamie Snider; Yi-Chun Liu; Haiyuan Yu; Pascal Braun; Igor Stagljar; Tong Hao; Michael A Calderwood; Laurence Pelletier; Patrick Aloy; David E Hill; Marc Vidal; Frederick P Roth
Journal:  Mol Syst Biol       Date:  2016-04-22       Impact factor: 11.429

View more
  10 in total

1.  Barcode fusion genetics-protein-fragment complementation assay (BFG-PCA): tools and resources that expand the potential for binary protein interaction discovery.

Authors:  Daniel Evans-Yamamoto; François D Rouleau; Piyush Nanda; Koji Makanae; Yin Liu; Philippe C Després; Hitoshi Matsuo; Motoaki Seki; Alexandre K Dubé; Diana Ascencio; Nozomu Yachie; Christian R Landry
Journal:  Nucleic Acids Res       Date:  2022-05-20       Impact factor: 19.160

Review 2.  In vivo interactome profiling by enzyme-catalyzed proximity labeling.

Authors:  Yangfan Xu; Xianqun Fan; Yang Hu
Journal:  Cell Biosci       Date:  2021-01-29       Impact factor: 7.133

3.  Rv3722c Promotes Mycobacterium tuberculosis Survival in Macrophages by Interacting With TRAF3.

Authors:  Yingying Lei; Xiaojian Cao; Weize Xu; Bing Yang; Yangyang Xu; Wei Zhou; Shuang Dong; Qijun Wu; Khaista Rahman; Rohit Tyagi; Shuhong Zhao; Xi Chen; Gang Cao
Journal:  Front Cell Infect Microbiol       Date:  2021-02-25       Impact factor: 5.293

4.  Next-generation yeast-two-hybrid analysis with Y2H-SCORES identifies novel interactors of the MLA immune receptor.

Authors:  Valeria Velásquez-Zapata; J Mitch Elmore; Sagnik Banerjee; Karin S Dorman; Roger P Wise
Journal:  PLoS Comput Biol       Date:  2021-04-02       Impact factor: 4.475

5.  Unraveling Protein Interactions between the Temperate Virus Bam35 and Its Bacillus Host Using an Integrative Yeast Two Hybrid-High Throughput Sequencing Approach.

Authors:  Ana Lechuga; Cédric Lood; Mónica Berjón-Otero; Alicia Del Prado; Jeroen Wagemans; Vera van Noort; Rob Lavigne; Margarita Salas; Modesto Redrejo-Rodríguez
Journal:  Int J Mol Sci       Date:  2021-10-14       Impact factor: 5.923

6.  Distinct Persistence Fate of Mycobacterium tuberculosis in Various Types of Cells.

Authors:  Xi Chen; Xiaojian Cao; Yingying Lei; Aikebaier Reheman; Wei Zhou; Bing Yang; Weipan Zhang; Weize Xu; Shuang Dong; Rohit Tyagi; Zhen F Fu; Gang Cao
Journal:  mSystems       Date:  2021-08-17       Impact factor: 6.496

7.  Clp protease and antisense RNA jointly regulate the global regulator CarD to mediate mycobacterial starvation response.

Authors:  Xinfeng Li; Fang Chen; Xiaoyu Liu; Jinfeng Xiao; Binda T Andongma; Qing Tang; Xiaojian Cao; Shan-Ho Chou; Michael Y Galperin; Jin He
Journal:  Elife       Date:  2022-01-26       Impact factor: 8.140

8.  Visualizing protein-protein interactions in plants by rapamycin-dependent delocalization.

Authors:  Joanna Winkler; Evelien Mylle; Andreas De Meyer; Benjamin Pavie; Julie Merchie; Peter Grones; Dani L Van Damme
Journal:  Plant Cell       Date:  2021-05-31       Impact factor: 11.277

9.  Revealing protein-protein interactions at the transcriptome scale by sequencing.

Authors:  Kara L Johnson; Zhijie Qi; Zhangming Yan; Xingzhao Wen; Tri C Nguyen; Kathia Zaleta-Rivera; Chien-Ju Chen; Xiaochen Fan; Kiran Sriram; Xueyi Wan; Zhen Bouman Chen; Sheng Zhong
Journal:  Mol Cell       Date:  2021-08-03       Impact factor: 19.328

10.  Construction and characterization of a high-quality cDNA library of Cymbidium faberi suitable for yeast one- and two-hybrid assays.

Authors:  Yanqin Xu; Junjiang Zhou; Qingqing Liu; Kunpeng Li; Yin Zhou
Journal:  BMC Biotechnol       Date:  2020-01-16       Impact factor: 2.563

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.