| Literature DB >> 21629697 |
Dubravka Škalamera1, Max V Ranall, Benjamin M Wilson, Paul Leo, Amy S Purdon, Carolyn Hyde, Ehsan Nourbakhsh, Sean M Grimmond, Simon C Barry, Brian Gabrielli, Thomas J Gonda.
Abstract
In response to the growing need for functional analysis of the human genome, we have developed a platform for high-throughput functional screening of genes overexpressed from lentiviral vectors. Protein-coding human open reading frames (ORFs) from the Mammalian Gene Collection were transferred into lentiviral expression vector using the highly efficient Gateway recombination cloning. Target ORFs were inserted into the vector downstream of a constitutive promoter and upstream of an IRES controlled GFP reporter, so that their transfection, transduction and expression could be monitored by fluorescence. The expression plasmids and viral packaging plasmids were combined and transfected into 293T cells to produce virus, which was then used to transduce the screening cell line. We have optimised the transfection and transduction procedures so that they can be performed using robotic liquid handling systems in arrayed 96-well microplate, one-gene-per-well format, without the need to concentrate the viral supernatant. Since lentiviruses can infect both dividing and non-dividing cells, this system can be used to overexpress human ORFs in a broad spectrum of experimental contexts. We tested the platform in a 1990 gene pilot screen for genes that can increase proliferation of the non-tumorigenic mammary epithelial cell line MCF-10A after removal of growth factors. Transduced cells were labelled with the nucleoside analogue 5-ethynyl-2'-deoxyuridine (EdU) to detect cells progressing through S phase. Hits were identified using high-content imaging and statistical analysis and confirmed with vectors using two different promoters (CMV and EF1α). The screen demonstrates the reliability, versatility and utility of our screening platform, and identifies novel cell cycle/proliferative activities for a number of genes.Entities:
Mesh:
Year: 2011 PMID: 21629697 PMCID: PMC3101218 DOI: 10.1371/journal.pone.0020057
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Lentiviral expression library generation.
Step 1 shows the structures of the plasmids involved in the Gateway LR recombination reaction between the ORFeome entry vector and the lentiviral expression vector. attL1, attL2, attR1, attR2 are the recombination sites,: ORF - open reading frame; LTR - long terminal repeats (“S” in the 3′ LTR indicates that it harbours a deletion rendering it self-inactivating); CMV/EF1α – promoter; IRES – internal ribosome entry site sequence; GFP – green fluorescent protein, ccdB – gene encoding a bacterial toxin (which is replaced by the ORF and is used to select against non-recombinant plasmids). Steps 2, 3 and 4 are viral packaging in 293T cells, collection of lentivirus-containing supernatant and transduction of target cells, respectively. For details see Materials and Methods.
Figure 2Workflow for the screening of the MCF-10A cells.
The left-hand panel shows plate manipulations and cell treatments for each day of the screen. The right-hand panel shows a representative image of a scanned field as a pseudo-colour overlay (top) of individual scanning channels (bottom) shown individually for the boxed area (DAPI – blue, GFP – green, EdU – red, scale bar = 50 µM). Blue lines in the enlargements encircle the object (nucleus) area selected by the scanning algorithm. See Materials and Methods for further details.
Figure 3Data analysis steps used to identify hits.
Image analysis data was exported from the Cellomics ArrayScan instrument and further analysed using the R programming platform. The first step in analysis is assignment of Edu and GFP status to each cell, (See also Figure 2 and Figure S3). Following further data processing (see Materials and Methods), hits were selected based on Q-Q plots for z-scores. The expected z-score distribution (x axis) is plotted against the observed scores (y axis), and a hit threshold (blue solid line) set where observed values are outside the 95% confidence interval (red dashed line) from the expected y = x diagonal (red solid line). Empty vector negative controls (light blue circles) lie on the diagonal, while the positive control wells, containing cells overexpressing CCNE1 (red circles), are in the predicted hit region.
Identity and characteristics of confirmed EdU incorporation hits: function criteria, cell-cycle expression pattern and activity in related functional screens.
| Symbol | Description | Function criteria | Cell cycle phase | Functional screens |
| APEX1 | APEX nuclease (multifunctional DNA repair enzyme) 1 | ML | 1−,2− | |
| ATOX1 | ATX1 antioxidant protein 1 homolog | U | 6 | |
| BUB3 | budding uninhibited by benzimidazoles 3 homolog (yeast) | C | Ma,G2b | 4b |
| C2orf83 | chromosome 2 open reading frame 83 | U | ||
| CAPN3 | calpain 3, (p94) | ML | ||
| CDC20 | cell division cycle 20 homolog | C | Ma, G2c | 4a,4b |
| CDK2 | cyclin-dependent kinase 2 | MR, P | 1−,5 | |
| CDK9 | cyclin-dependent kinase 9 | C, P | G2a | 1+,2+,3+ |
| CELA2B | chymotrypsin-like elastase family, member 2B | U | ||
| CLDN1 | claudin 1 | ML | ||
| CLK2 | CDC-like kinase 2 | P | 1+ | |
| CRK | v-crk sarcoma virus CT10 oncogene homolog | C | M/G1b | 1+,2+ |
| CYTH2 | cytohesin 2 | C | G2ab | 1− |
| DCAF7 | DDB1 and CUL4 associated factor 7 | C | Ma,G2b | 2+,3 |
| DPYSL3 | dihydropyrimidinase-like 3 | C | G2a | 2− |
| EXOC8 | exocyst complex component 8 | C | Sa | |
| GAPDH | glyceraldehyde-3-phosphate dehydrogenase | C | G2a | |
| GNA15 | guanine nucleotide binding protein (G protein), alpha 15 (Gq class) | C | Ma | 3 |
| GNB1 | guanine nucleotide binding protein (G protein), beta polypeptide 1 | C | Sa,M/G1b | 2+,3 |
| H2BFWT | H2B histone family, member W, testis-specific | U | ||
| HNRNPH2 | heterogeneous nuclear ribonucleoprotein H2 (H′) | U | ||
| HOXB5 | homeobox B5 | U | ||
| ILKAP | integrin-linked kinase-associated serine/threonine phosphatase 2C | P | ||
| IRAK3 | interleukin-1 receptor-associated kinase 3 | U | ||
| KRT19 | keratin 19 | C | G2a | 1−,2− |
| KRT40 | keratin 40 | U | ||
| MAK | male germ cell-associated kinase | P | ||
| NAB2 | NGFI-A binding protein 2 (EGR1 binding protein 2) | C | Sa | |
| NDUFS3 | NADH dehydrogenase Fe-S protein 3, 30 kDa | P | ||
| NEK6 | NIMA (never in mitosis gene a)-related kinase 6 | P | 2+ | |
| PACSIN1 | protein kinase C and casein kinase substrate in neurons 1 | U | ||
| PDK2 | pyruvate dehydrogenase kinase, isozyme 2 | P | 4a | |
| PPM1G | protein phosphatase, Mg2+/Mn2+ dependent, 1G | MR, P | 5 | |
| PSTPIP2 | proline-serine-threonine phosphatase interacting protein 2 | ML, | 5, 6 | |
| RBP1 | retinol binding protein 1, cellular | U | ||
| RGS20 | regulator of G-protein signaling 20 | C | Ma | |
| RIT1 | Ras-like without CAAX 1 | U | ||
| RPS6KA4 | ribosomal protein S6 kinase, 90 kDa, polypeptide 4 | MR, P | 5 | |
| SETD2 | SET domain containing 2 | P | ||
| SH3GL2 | SH3-domain GRB2-like 2 | C | Sb | |
| SOD1 | superoxide dismutase 1, soluble | ML | 4a,4b | |
| SPOP | speckle-type POZ protein | C | lateG1a | 2+,3,4a |
| SYT5 | synaptotagmin V | MR | 6, 5 | |
| TK1 | thymidine kinase 1, soluble | MR | 5 | |
| TMEM55A | transmembrane protein 55A | C | Sa | |
| UBE2S | ubiquitin-conjugating enzyme E2S | C | Ma,M/G1b,G2c | |
| WWP2 | WW domain containing E3 ubiquitin protein ligase 2 | C | Sa | 2+,4a |
| ZWINT | ZW10 interactor | C | Sb | 4b |
*Function criteria used for inclusion in the lentiviral ORF library, based on published microarray expression pattern or functional screening result: C- cell cycle phase-specific expression; ML-Myeloid cell proliferation [26]; P- phosphoregulation [27], [28]; MR- mitotic regulation [24]; U- unspecified i.e. randomly picked clones.
Cell cycle phase-specific expression determined by microarray analysis in: a, primary human foreskin fibroblasts [22] or HeLa cells b, [25] and c, [23].
Positive (+), negative (−) or neutral (*) effect on cell proliferation detected in either cDNA overexpression screens in: 1, SMMC7721-human hepatoma cells [11]; 2, NIH 3T3 mouse fibroblasts [11]; 3, U2OS-human osteosarcoma cells [9]; or in RNAi-mediated knock-down screens for effects on mitosis detected in 4a- HeLa [29], 4b- Hela [32], 5- HT29-human colon carcinoma [24], and 6- U2OS cells [31].
Figure 4Summary measurements for hits showing increased total cell proliferation.
CMV, EF1α indicate values for ORFs expressed under control of the CMV and EF1α promoters, respectively. A – ratio of total objects counted on day 4 over day 2 of the assay (Figure 2); B – ratio of the proportion of EdU positive cells in GFP positive over that in GFP negative objects on day 4; C-percentage of GFP positive objects on day4. All bars represent a mean of three wells, error bars are standard deviation. Genes represented here were significantly different (p≤0.05) in A and B from the control (in this case vectors encoding a truncated peptide originating from CPNE3, which had no effect on proliferation in primary and validation screens). Data for all genes tested is in Table S3.
Figure 5Effect of transgene overexpression on 2-day growth rate of FACS-purified cell populations.
MCF-10A cells were transduced with control (truncated CPNE3), CCNE1, CAPN3 or NEK6 vectors and sorted by flow cytometry for GFP expression (minimum 98% GFP positive cells). A- Cells were seeded in complete medium (5%HS and EGF), and the next day media were replaced with either reduced (1% HS no EGF) or complete medium. Cells were counted 1 and 3 days later and ratios calculated. Bars represent mean of 4 wells and error bars show the standard deviation. B – Western blots of whole cell protein extracts obtained from sorted cell lines after one freeze-thaw cycle and 8 subsequent passages. Targeted antigen is indicated on the left of the image while the introduced ORF is indicated on top of each lane. Except for the no virus control, cells used were confirmed to be >98% GFP positive prior to harvesting.
Figure 6Cell cycle analysis of proliferation-inducing hits.
A- DAPI intensity (x-axis) histograms (y-axis = number of objects) obtained by CellCycle v3 application in the Cellomics ArrayScan Software, representing typical profiles observed on Day 0, Day 2 and Day 4 of the assay (Figure 2), depending on growth conditions and viral transduction. Profiles and data shown are derived from a representative single well from each of the categories: not transduced (no virus), transduced with empty plv101 vector, or with the vector expressing NEK6, analysed using cells grown in complete or restrictive medium as indicated. The proportion of cells in each cell cycle phase is indicated (based on DNA content: 2N (%G1), between 2N and 4N (%S), and 4N (%G2)). B, C – Graphs representing the proportion of cells in G2 on day 2 (B) and day 4 (C) for proliferation-inducing hits (Bars = mean of 3 wells; error bars = standard deviation). CMV, EF1α - values for clones expressed under control of the CMV and EF1α promoters respectively. N.T-not transduced, control – truncated CPNE3. Values for all analysed genes are in Table S4, for all transduced wells, values represent GFP positive cell population only.