| Literature DB >> 19187545 |
Vlad Popovici1, Darlene R Goldstein, Janine Antonov, Rolf Jaggi, Mauro Delorenzi, Pratyaksha Wirapati.
Abstract
BACKGROUND: Gene expression analysis has emerged as a major biological research area, with real-time quantitative reverse transcription PCR (RT-QPCR) being one of the most accurate and widely used techniques for expression profiling of selected genes. In order to obtain results that are comparable across assays, a stable normalization strategy is required. In general, the normalization of PCR measurements between different samples uses one to several control genes (e.g. housekeeping genes), from which a baseline reference level is constructed. Thus, the choice of the control genes is of utmost importance, yet there is not a generally accepted standard technique for screening a large number of candidates and identifying the best ones.Entities:
Mesh:
Year: 2009 PMID: 19187545 PMCID: PMC2640357 DOI: 10.1186/1471-2105-10-42
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The ten public microarray data sets used (n = number of samples).
| Data set ID and reference | Platform | |
| BWH [ | 47 | Affymetrix U133v2 |
| EMC [ | 286 | Affymetrix U133A |
| EXPO [ | 1375 | Affymetrix U133Plus2 |
| JRH2 [ | 61 | Affymetrix U133A |
| MGH [ | 60 | Agilent |
| NKI [ | 337 | Agilent (custom) |
| STOCK [ | 159 | Affymetrix U133A, B |
| TGIF1 [ | 49 | Affymetrix U133A |
| UNC [ | 153 | Agilent HuA1 |
| UPP [ | 249 | Affymetrix U133A, B |
Figure 1Example of variance stabilization by LOESS correction. LOESS correction applied to three data sets: BWH, NKI and EXPO-breast, respectively. The first row shows the original data with the fitted first order LOESS curve, while the second row shows the variance-stabilized data.
Figure 2Scatter plots of standard deviation versus mean log-intensity for BWH, NKI and EXPO-breast data sets, respectively. The shading codes the gene stability scores, with darker colors indicating higher scores. These three data sets are from different microarray platforms. The light gray points indicate the discarded genes (those with mean expression level below the β value – see Eq. 1). The curves correspond to equal score levels and are one score unit apart.
Top 20 control genes from the ten breast cancer data sets and top 20 genes from the aggregated list (Meta column)
| BWH | EMC | JRH2 | MGH | NKI | STOCK | TGIF1 | UNC | UPP | EXPO-breast | Meta |
| RPL37A | PPIA | RPL41 | ZNF557 | UBC | RPS11 | RPL41 | RPS10 | RPL9 | CALM2 | RPL37A |
| RPL41 | CALM2 | RPL39 | CDR1 | UBB | RPS24 | RPL37A | RPS18 | RPL37A | HNRPA1 | RPL27A |
| RPS18 | SRP14 | RPL23A | PPP1R2 | OAZ1 | RPL9 | EEF1A1 | RPLP1 | ACTG1 | NACA | RPS18 |
| RPL39 | RPL37A | RPL37A | TCN2 | DYNLL1 | RPL37A | RPL30 | RPS11 | RPL27A | UBA52 | RPL30 |
| RPL23A | RPS18 | EEF1A1 | SSBP1 | RAPSN | RPL41 | RPL39 | RPS23 | CFL1 | LAPTM4A | RPL41 |
| RPL9 | RPL30 | RPS23 | RPL27A | PCBP1 | RPL27A | PPIA | RPL37A | RPS11 | RPL27A | CALM2 |
| RPLP1 | RPL27A | RPS27 | RPS3 | KCNH3 | RPL39 | ACTG1 | RPL11 | RPS13 | RPL30 | RPL27 |
| RPS27 | RPS11 | CALM2 | BRCC3 | RPL3 | RPLP1 | CFL1 | RPS15 | RPL27 | RPL9 | K-ALPHA-1 |
| RPL27A | RPL39 | RPS18 | PTMA | RPL8 | UBB | RPS23 | RPL14 | RPL41 | RPL31 | RPS11 |
| RPL30 | RPS15 | ACTG1 | ABCF2 | MYL6 | RPS15A | RPL10 | NACA | RPS18 | RPL37 | RPL39 |
| RPS29 | RPS24 | RPL10 | PCDH18 | RPL14 | CALM2 | CALM2 | RPL36AL | RPS15 | RPS11 | RPS13 |
| ACTG1 | RPL32 | RPS24 | LAX1 | RPL7A | NACA | RPS11 | UBA52 | RPL6 | RPS29 | NACA |
| CALM2 | RPS15A | RPS15A | TPMT | FAU | RPL30 | HNRPA1 | NEDD8 | RPLP1 | RPS24 | RPL23A |
| RPS13 | RPLP1 | RPL32 | GALE | ARF1 | CFL1 | RPL6 | PCBP1 | RPL32 | RPS13 | RPS24 |
| HNRPA1 | RPL9 | RPL27A | MTCH1 | CCT3 | RPS13 | RPL23A | NDUFB2 | RPL31 | RPS21 | HNRPA1 |
| RPS24 | UBB | UBB | ATP5G2 | PSAP | RPS3A | K-ALPHA-1 | HNRPM | RPL39 | UBB | RPL9 |
| RPL31 | K-ALPHA-1 | RPS29 | SF3B2 | CD81 | RPL37 | RPS18 | HNRPC | UBB | RPS27A | RPLP1 |
| RPL34 | RPS13 | RPL30 | SND1 | SQSTM1 | RPS18 | EEF1G | NDUFB8 | RPS24 | RPS15 | RPL32 |
| RPS15A | RPL27 | PPIA | RPL5 | K-ALPHA-1 | RPL27 | TUBA6 | ATP5J2 | RPS27 | RPL32 | LAPTM4A |
| RPS21 | FAU | CFL1 | SKAP2 | CALR | RPL24 | RPS3A | TARDBP | DDX5 | RPL24 | RPS15A |
Figure 3Scatter plots of standard deviation versus mean log-intensity for two validation data sets (from left to right: NKI and UPP). The top 100 breast cancer control genes resulting from aggregating eight data sets are plotted as circles. Triangles correspond to the five control genes used in [17] (NKI does not contain the ACTB gene).
Top 50 control genes as resulting from aggregating the ten breast cancer data sets. Two genes – RPS11 and UBB – were selected as control genes and validated by RT-PCR
| Rank | Gene symbol | Gene ID | Description |
| 1 | RPL37A | 6168 | ribosomal protein L37a |
| 2 | RPL27A | 6157 | ribosomal protein L27a |
| 3 | RPS18 | 6222 | ribosomal protein S18 |
| 4 | RPL30 | 6156 | ribosomal protein L30 |
| 5 | RPL41 | 6171 | ribosomal protein L41 |
| 6 | CALM2 | 805 | calmodulin 2 (phosphorylase kinase, delta) |
| 7 | RPL27 | 6155 | ribosomal protein L27 |
| 8 | K-ALPHA-1 | 10376 | alpha tubulin |
| 9 | 6205 | ribosomal protein S11 | |
| 10 | RPL39 | 6170 | ribosomal protein L39 |
| 11 | RPS13 | 6207 | ribosomal protein S13 |
| 12 | NACA | 4666 | nascent-polypeptide-associated complex alpha polypeptide |
| 12 | RPL23A | 6147 | ribosomal protein L23a |
| 14 | RPS24 | 6229 | ribosomal protein S24 |
| 15 | HNRPA1 | 3178 | heterogeneous nuclear ribonucleoprotein A1 |
| 16 | RPL9 | 6133 | ribosomal protein L9 |
| 17 | RPLP1 | 6176 | ribosomal protein, large, P1 |
| 18 | RPL32 | 6161 | ribosomal protein L32 |
| 19 | LAPTM4A | 9741 | lysosomal-associated protein transmembrane 4 alpha |
| 20 | RPS15A | 6210 | ribosomal protein S15a |
| 21 | DYNLL1 | 8655 | dynein, light chain, LC8-type 1 |
| 22 | ACTG1 | 71 | actin, gamma 1 |
| 23 | TUBA6 | 84790 | tubulin, alpha 6 |
| 24 | SRP14 | 6727 | signal recognition particle 14kDa (homologous Alu RNA binding protein) |
| 25 | MYL6 | 4637 | myosin, light chain 6, alkali, smooth muscle and non-muscle |
| 26 | RPL24 | 6152 | ribosomal protein L24 |
| 27 | FAU | 2197 | Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV) ubiquitously expressed (fox derived); ribosomal protein S30 |
| 28 | RPL31 | 6160 | ribosomal protein L31 |
| 29 | RPS15 | 6209 | ribosomal protein S15 |
| 30 | MTCH1 | 23787 | mitochondrial carrier homolog 1 (C. elegans) |
| 31 | 7314 | ubiquitin B | |
| 32 | RPL37 | 6167 | ribosomal protein L37 |
| 33 | HMGN2 | 3151 | high-mobility group nucleosomal binding domain 2 |
| 34 | RPS27 | 6232 | ribosomal protein S27 (metallopanstimulin 1) |
| 35 | GDF8 | 2660 | growth differentiation factor 8 |
| 36 | RPL38 | 6169 | ribosomal protein L38 |
| 37 | RPS29 | 6235 | ribosomal protein S29 |
| 38 | SULT1C2 | 27233 | sulfotransferase family, cytosolic, 1C, member 2 |
| 39 | RPL6 | 6128 | ribosomal protein L6 |
| 40 | UBC | 7316 | ubiquitin C |
| 41 | UBA52 | 7311 | ubiquitin A-52 residue ribosomal protein fusion product 1 |
| 42 | MRFAP1 | 93621 | Mof4 family associated protein 1 |
| 43 | HNRPK | 3190 | heterogeneous nuclear ribonucleoprotein K |
| 44 | PARK7 | 11315 | Parkinson disease (autosomal recessive, early onset) 7 |
| 45 | PSMC1 | 5700 | proteasome (prosome, macropain) 26S subunit, ATPase, 1 |
| 46 | LOC158572 | 158572 | hypothetical protein LOC158572 |
| 47 | RPS8 | 6202 | ribosomal protein S8 |
| 48 | ATP5A1 | 498 | ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle |
| 49 | EIF4H | 7458 | eukaryotic translation initiation factor 4H |
| 50 | CD63 | 967 | CD63 molecule |
Figure 4RT-QPCR experiment. Standard deviation as a function of the mean expression level (expressed as raw Ct values) of 47 genes in a RT-QPCR experiment. Higher expression levels correspond to smaller raw Ct values. Control genes are represented by triangles, test genes by circles. The new control genes RPS11 and UBB are in the lower left corner.