| Literature DB >> 20144949 |
Roberto Di Niro1, Ana-Marija Sulic, Flavio Mignone, Sara D'Angelo, Roberta Bordoni, Michele Iacono, Roberto Marzari, Tiziano Gaiotto, Miha Lavric, Andrew R M Bradbury, Luigi Biancone, Dina Zevin-Sonkin, Gianluca De Bellis, Claudio Santoro, Daniele Sblattero.
Abstract
We have developed a high-throughput protein expression and interaction analysis platform that combines cDNA phage display library selection and massive gene sequencing using the 454 platform. A phage display library of open reading frame (ORF) fragments was created from mRNA derived from different tissues. This was used to study the interaction network of the enzyme transglutaminase 2 (TG2), a multifunctional enzyme involved in the regulation of cell growth, differentiation and apoptosis, associated with many different pathologies. After two rounds of panning with TG2 we assayed the frequency of ORFs within the selected phage population using 454 sequencing. Ranking and analysis of more than 120,000 sequences allowed us to identify several potential interactors, which were subsequently confirmed in functional assays. Within the identified clones, three had been previously described as interacting proteins (fibronectin, SMOC1 and GSTO2), while all the others were new. When compared with standard systems, such as microtiter enzyme-linked immunosorbant assay, the method described here is dramatically faster and yields far more information about the interaction under study, allowing better characterization of complex systems. For example, in the case of fibronectin, it was possible to identify the specific domains involved in the interaction.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20144949 PMCID: PMC2875021 DOI: 10.1093/nar/gkq052
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Interactome discovery pipeline. Poly A+ RNA is fragmented and retro-transcribed to cDNA by random priming. After normalization and linker ligation cDNAs are cloned into pPAO10 ORF-filtering vector and the library is filtered on chloramphenicol/ampicillin containing plates. After CRE-mediated recombination, ORF-displaying phages are challenged with TG2 throughout two cycles of selection and amplification. The ORF inserts of selected phages are obtained by digestion with restriction fragments and massively sequenced. Reads are mapped to the human genome and identified genomic regions are ranked on the basis of read coverage. Clones containing candidate interacting ORFs are rescued from selected libraries by inverse PCR using primers matching the contig sequence core. Recovered clones are expressed and validated by ELISA.
Summary of reads obtained from 454 sequencing
| NS | II-BIO | II-SP | |
|---|---|---|---|
| Total numer of sequences | 67 587 | 60 275 | 61 863 |
| Total with 5′ | 45 127 | 43 097 | 37 340 |
| Total with 3′ | 40 663 | 39 985 | 35 224 |
| Total with 5′ and 3′ | 27 946 | 30 541 | 24 049 |
| Average length | 245 | 246 | 211 |
| Reads mapping | 51 071 | 47 393 | 43 623 |
| Genes | 7576 | 7091 | 7524 |
Summary of reads obtained from 454 sequencing of not selected (NS) and selected (BIO, SP) libraries. Number of total sequences, and of sequences with vector encoded restriction site at one or both ends are reported. Reads were defined as mapping if aligned to the genome with at least 95% identity and 90% overlap. Genes identified by at least one read are reported.
Figure 2.(A) Rank abundance curve obtained by plotting the total number of mapped informative reads (presence of both 454 primers) versus the total number of identified genes. (B) Chart shows how many genes are supported by different number of reads. As a result of cDNA normalization the vast majority of genes are represented by up to 10 reads while only very few genes show more reads (with a maximum of 680). X-axis has been limited to 50.
Figure 3.(A) Venn diagram of the first 50 genes on the ranking list of non selected (NS) and selected (BIO, SP) libraries. (B) Probability of identifying all top five or top 10 genes by random selecting sample of clones of increasing size. Probability (pTop) has been approximated by a 10 000 cycle simulation for each sample size (picked clones). Simulation was performed using a custom developed PERL script. (C) Focus indexes for the first 50 genes of each library ranked on the basis of the number of supporting reads (coverage). The horizontal bars represent the mean value.
Description of the ELISA-validated top five ranking clones from the selected libraries
| Code | Gene name and frame | Hugo | Accession number | First base | Length (bp) |
|---|---|---|---|---|---|
| SP selected clones | |||||
| SP-1 | Leucine aminopeptidase 3, frame 1 | LAP3 | NM_015907.2 | 1078 | 186 |
| SP-2 | Glutathione S-transferase omega 2, frame 1 | GSTO2 | NM_183239.1 | 623 | 177 |
| SP-3 | Fibronectin 1, transcript variant 1, frame 1 | FN1 | NM_212482.1 | 1581 | 390 |
| SP-4 | H19, imprinted maternally expressed transcript (non-protein coding) | H19 | NR_002196.1 | 50 | 114 |
| SP-5 | Heat shock 70 kDa protein 1 A, frame 2 | HSPA1A | NM_005345.4 | 1343 | 396 |
| BIO selected clones | |||||
| BIO-1 | Aldolase B, fructose-bisphosphate, frame 1 | ALDOB | NM_000035.2 | 126 | 354 |
| BIO-2 | Tetratricopeptide repeat domain 31, frame 1 | TTC31 | NM_022492.4 | 311 | 330 |
| BIO-3 | SPARC related modular calcium binding 1, frame 1 | SMOC1 | NM_022137.4 | 1139 | 408 |
| BIO-4 | Zinc finger protein 23 (KOX 16), frame 3 | ZNF23 | NM_145911.1 | 1728 | 498 |
| BIO-5 | Myosin XVIIIA, frame 1 | MYO18A | NM_078471.3 | 4956 | 291 |
Top five ranking clones from each selection. Ranking is based on the number of reads. Name, frame, abbreviation, accession number, first base of the gene fragment, as well as length of the clone is reported.
Figure 4.ELISA reactivity of the top five clones from BIO and SP selection. Reactivity was tested on recombinant human and mouse TG2 proteins and on negative control proteins (BSA and WNT4) coated on solid surface (A) and as soluble biotinylated proteins (B).
Validation of selected clones by PCA assay
| TG2 | Negative Control | ||||
|---|---|---|---|---|---|
| Cam/ Kan | 15 Amp | 20 Amp | 30 Amp | 15 Amp | |
| BIO-1 | + | – | – | – | – |
| BIO-2 | + | +++ | +++ | +++ | – |
| BIO-3 | + | ++ | + | + | – |
| BIO-4 | + | +/– | – | – | – |
| BIO-5 | + | +++ | +++ | +++ | – |
| SP-1 | + | +++ | +++ | +++ | – |
| SP-2 | + | +++ | +++ | +++ | – |
| SP-3 | + | +++ | +++ | +++ | – |
| SP-4 | + | ++ | + | + | – |
| SP-5 | + | +++ | +++ | +++ | –– |
| Positive control | + | +++ | +++ | +++ | + |
| Negative control | + | – | – | – | – |
In vivo validation of selected interactors. E. coli cells containing the pα-TG2 vector were transformed with the pω vector carrying the cloned interactors. Bacteria were plated on plates supplemented with kanamicin/chloramphenicol to check the presence of both vectors, and on plates supplemented with increasing ampicillin concentration (15 to 20 to 30 μg/ml) and IPTG 1 mM to assay interaction. Positive and negative controls are described in the ‘Materials and Methods’ section.