| Literature DB >> 21548937 |
Troy Ketela1, Lawrence E Heisler, Kevin R Brown, Ron Ammar, Dahlia Kasimer, Anuradha Surendra, Elke Ericson, Kim Blakely, Dina Karamboulas, Andrew M Smith, Tanja Durbic, Anthony Arnoldo, Kahlin Cheung-Ong, Judice L Y Koh, Shuba Gopal, Glenn S Cowley, Xiaoping Yang, Jennifer K Grenier, Guri Giaever, David E Root, Jason Moffat, Corey Nislow.
Abstract
BACKGROUND: Genome-wide screening in human and mouse cells using RNA interference and open reading frame over-expression libraries is rapidly becoming a viable experimental approach for many research labs. There are a variety of gene expression modulation libraries commercially available, however, detailed and validated protocols as well as the reagents necessary for deconvolving genome-scale gene screens using these libraries are lacking. As a solution, we designed a comprehensive platform for highly multiplexed functional genetic screens in human, mouse and yeast cells using popular, commercially available gene modulation libraries. The Gene Modulation Array Platform (GMAP) is a single microarray-based detection solution for deconvolution of loss and gain-of-function pooled screens.Entities:
Mesh:
Year: 2011 PMID: 21548937 PMCID: PMC3115879 DOI: 10.1186/1471-2164-12-213
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of the features on the UT GMAP 1.0
| Probe ID | Description | Unique probes | Replicates | Total number of probes | Probe length |
|---|---|---|---|---|---|
| hORF | Human ORFeome | 134901 | 1 | 134901 | 25 |
| huORF | HuGene ORFs | 58087 | 1 | 58087 | 25 |
| HP | shRNA sequences (mouse and human) | 248049 | 3 | 744147 | 22 |
| HPC | shRNA negative controls | 138 | 33 | 4554 | 22 |
| HSPI | Hybridization spike-in probes | 200 | 25 | 5000 | 22 |
| HPTMM | shRNA mismatch control probes | 8097 | 3 | 24291 | 22 |
| TAG | Yeast barcode probes | 26801 | 3 | 80403 | 20 |
| yORF | Yeast open reading frames | 11421 | 1 | 11421 | 25 |
Figure 1Readout of high complexity shRNA pools. (a) Surface plot of signal intensities for all 248,049 shRNA features on the GMAP microarray following hybridization of probe generated from the human 78 k plasmid shRNA pool or the mouse 78 k plasmid shRNA pool. The surface plot shows the signal intensity distributions of the human and mouse 78 k features plotted against each other and the remaining shRNA probe features not corresponding to either pool. The signal peaks are labelled according to which set of features they represent. (b) Distribution plots of GMAP features data for a dilution series of shRNAs in separate pools. 4x, 16x, and 64x curves are plotted as the distribution of log2 difference between the array signal for shRNAs in the dilution series and their signal in the reference (Even) pool. The 1x curve is plotted as the log2 difference between a group of undiluted shRNAs in the 4x pool and the same shRNAs in the Even pool. (c) Distribution plots of GMAP array data for a dilution series of shRNAs within the same pool. 2x, 5x, 10x, and 20x curves are plotted as the distribution of log2 difference between the array signal for groups of diluted shRNAs and their signal in the reference (Even) pool. The 1x curve is plotted as the log2 difference between a group of undiluted shRNAs in the 2x-20x pool and the same shRNAs in the Even pool. All microarrays were run in triplicate, and the replicate signal intensities were averaged.
Summary of shRNA pool deconvolution by next-generation sequencing
| Statistic | Even | 4x | 16x | 64x | 2-20x |
|---|---|---|---|---|---|
| Reads passed filter | 22407757 | 24596943 | 24793754 | 24304519 | 25684516 |
| Mapped reads | 22296083 | 24504006 | 24687271 | 24235485 | 25605983 |
| Unmapped reads | 111674 | 94937 | 106483 | 69034 | 78533 |
| Unmapped reads % | 0.5 | 0.39 | 0.43 | 0.28 | 0.31 |
| 1st quartile | 40 | 34 | 26 | 35 | 21 |
| Median | 153 | 149 | 140 | 162 | 107 |
| Mean | 315 | 352.4 | 351.6 | 350.7 | 370.2 |
| 3rd quartile | 375 | 408 | 404 | 417 | 385 |
| Max | 17960 | 19360 | 21750 | 20000 | 29460 |
| Uncounted | 9784 | 10748 | 10045 | 10983 | 10739 |
Figure 2Comparison of sequencing and GMAP performance with high complexity shRNA pools. (a) Scatter plot of GMAP array signal intensity (X-axis) versus sequencing read number (Y-axis) for shRNA clones from the human Even shRNA pool. (b) Distribution plots of Illumina sequencing data for a dilution series of shRNAs in separate pools. 4x, 16x, and 64x curves are plotted as the distribution of log2 difference between the number of sequencing reads for shRNAs in the dilution series and their read count in the reference (Even) pool. The 1x curve is plotted as the log2 difference between a group of undiluted shRNAs in the 4x pool and the same shRNAs in the Even pool. (c) Distribution plots of Illumina sequencing data for a dilution series of shRNAs within the same pool. 2x, 5x, 10x, and 20x curves are plotted as the distribution of log2 difference between the number of sequencing reads for groups of diluted shRNAs and their read count in the reference (Even) pool. The 1x curve is plotted as the log2 difference between a group of undiluted shRNAs in the 2x-20x pool and the same shRNAs in the Even pool. (d) Distribution plots of GMAP features data for a dilution series of shRNAs contained within sub fractions of a ~90,000 shRNA pool where the probe was amplified from shRNA plasmid template DNA. (e) Distribution plots of GMAP features data for a dilution series of shRNAs contained within sub fractions of a ~90,000 shRNA pool where the probe was amplified from genomic DNA of A549 cells infected with lentiviral pools. 2x, 5x, 10x, and 20x curves are plotted as the distribution of log2 difference between the array signal for groups of diluted shRNAs in the 90 k Dilution pool and their signal in the 90 k Reference pool. The 1x curve is plotted as the log2 difference between a group of undiluted shRNAs in the 90 k Dilution pool and the same shRNAs in the 90 k Reference pool.
Figure 3Quality of human ORF features on GMAP chip. (a) Intensity signals from the GMAP array (X-axis) plotted against signals from the Human Gene 1.0 ST array (Y-axis) following amplification of human ORFeome v5.1 pools and hybridization of the resulting probe to each of these arrays. Common features between the two arrays were used to calculate the Pearson correlation coefficient. (b) Distribution of signal intensities from replicate GMAP or Human 1.0 ST arrays as described in (a) for shared features. Hybridization behaviour for the huORF features compared to the (c) huGene features or the (d) 22-mer hairpin features for corresponding genes.