| Literature DB >> 20056656 |
Wei Shi1, Carolyn A de Graaf, Sarah A Kinkel, Ariel H Achtman, Tracey Baldwin, Louis Schofield, Hamish S Scott, Douglas J Hilton, Gordon K Smyth.
Abstract
A fundamental question in microarray analysis is the estimation of the number of expressed probes in different RNA samples. Negative control probes available in the latest microarray platforms, such as Illumina whole genome expression BeadChips, provide a unique opportunity to estimate the number of expressed probes without setting a threshold. A novel algorithm was proposed in this study to estimate the number of expressed probes in an RNA sample by utilizing these negative controls to measure background noise. The performance of the algorithm was demonstrated by comparing different generations of Illumina BeadChips, comparing the set of probes targeting well-characterized RefSeq NM transcripts with other probes on the array and comparing pure samples with heterogenous samples. Furthermore, hematopoietic stem cells were found to have a larger transcriptome than progenitor cells. Aire knockout medullary thymic epithelial cells were shown to have significantly less expressed probes than matched wild-type cells.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20056656 PMCID: PMC2853118 DOI: 10.1093/nar/gkp1204
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data sets used in this study
| ID | Platform | Number regular probes | Number negative controls | Experiment description |
|---|---|---|---|---|
| 1 | MouseWG-6 V1.1 | 46 657 | 1603 | Six cell types: hematopoietic stem cells, CMPs, GMPs, pro DC precursors, neutrophils and macrophages. Number of arrays per cell type: 4, 2, 1, 3, 1 and 3, respectively. |
| 2 | MouseWG-6 V1.1 | 46 657 | 1603 | Two cell types: wild type and Aire knockout MHCII |
| 3 | MouseWG-6 V2 | 45 281 | 936 | Three cell types: pro DC precursors, neutrophils and macrophages. Number of arrays per cell type: 9, 3 and 3, respectively. |
| 4 | MouseWG-6 V2 | 45 281 | 936 | Four cell types: hematopoietic stem cells, CMPs, GMPs and MEPs. Number of arrays per cell type: 3. |
| 5 | HumanWG-6 V1 | 47 312 | 1517 | Six conditions: MCF7 and Jurkat samples were mixed at six different proportions (see |
| 6 | HumanWG-6 V1 | 47 312 | 1517 | Four conditions: Universal Human Reference RNA(UHRR) and Human Brain Reference RNA(HBRR) were mixed at four different proportions (see |
| 7 | HumanWG-6 V2 | 48 687 | 1374 | Six conditions: three subtypes of T lymphocytes taken from two patients infected with hepatitis C virus. Number of arrays per condition: 1. Published in ref. ( |
| 8 | HumanHT-12 | 48 799 | 759 | Twelve samples. Bone marrow from seven malaria-infected and five uninfected donors. Number of arrays per sample: 1. |
| 9 | HumanWG-6 V3 | 48 803 | 759 | Four cell types: Lin−CD49f |
Figure 1.Intensity distributions for regular probes and negative control probes. Data from 15 arrays using Illumina MouseWG-6 version 2 was used for this plot. Numbers of regular probes and negative control probes are 45 281 and 936, respectively on each array. Intensities are on log2 scale.
Figure 2.Proportions of expressed probes estimated for different BeadChip types. All data described in Table 1 are included except for datasets 2 and 6 and the erythrocyte progenitors from data set 4. For each BeadChip type, the boxes show the minimum, first quartile, median, second quartile and maximum of estimated expression proportions across all its arrays.
Figure 3.Proportions of expressed probes by RefSeq annotation and BeadChip type. Data are as for Figure 2.
Figure 4.Expression proportion estimation for samples from two mixture experiments. (a) Estimated expression proportions for samples from the in-house mixture experiment. Jurkat and MCF7 samples were mixed at the proportions of 100:0, 94:6, 88:12, 76:24, 50:50 and 0:100. Error bars are standard errors (n = 2). (b) Estimated expression proportions for samples used in the MAQC project. Universal Human Reference RNA(UHRR) and Human Brain Reference RNA(HBRR) samples were mixed at the proportions of 100:0, 75:25, 25:75 and 0:100. Error bars are standard errors (n = 5). (c) RefSeq NM probes commonly and exclusively expressed in MCF7 and Jurkat samples. (d) RefSeq NM probes commonly and exclusively expressed in UHRR and HBRR samples.
Figure 5.Correspondence between hematopoietic stem cell differentiation tree and estimated expression proportions for different cell types. (a) Hematopoietic stem cells differentiate into different progenitor cells. (b) Estimated proportions of expressed probes for four different cell types. Error bars are standard errors (n = 3).