| Literature DB >> 15272935 |
Bin Yao1, Sanjay N Rakhade, Qunfang Li, Sharlin Ahmed, Raul Krauss, Sorin Draghici, Jeffrey A Loeb.
Abstract
BACKGROUND: cDNA microarrays are a powerful means to screen for biologically relevant gene expression changes, but are often limited by their ability to detect small changes accurately due to "noise" from random and systematic errors. While experimental designs and statistical analysis methods have been proposed to reduce these errors, few studies have tested their accuracy and ability to identify small, but biologically important, changes. Here, we have compared two cDNA microarray experimental design methods with northern blot confirmation to reveal changes in gene expression that could contribute to the early antiproliferative effects of neuregulin on MCF10AT human breast epithelial cells.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15272935 PMCID: PMC509237 DOI: 10.1186/1471-2105-5-99
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Anti-proliferative effects of neuregulin on MCF10AT cells. Quadruplicate cultures of MCF10AT cells were treated with and without 1 nM neuregulin 3 days after plating and cell counts were performed demonstrating a significant decrease in their growth 24 and 48 hours after treatment. The p value for 24 hr was 0.0011, and for 48 hr was 1.46E-05.
Figure 2(A) Highly reproducible systematic errors from gene-dye interactions. The arrows demonstrate intense red dye labeling for a given gene spotted in duplicate both for the T/C slide where the treated sample is labeled with red dye (Cy5) and control sample is labeled with green dye (Cy3), as well as in the C/C slide where the same sample is labeled both red and green. (B) Experimental designs. Two different experimental methods were compared: A dye-swap approach, where the dye color is reversed for T/C hybridizations, and a "control correction" design, where T/C and C/C hybridizations are performed without reversing the dyes. T denotes the neuregulin treated cells, while C denotes the untreated, control cells. Each arrows represent a replicate and the tails of the arrows indicate cy5 labeling and the heads indicate cy3 labeling. (C) Data processing flow chart for the control correction method.
Figure 3Array and intensity-dependent variation can be corrected by normalization based on intensity. (A) This is an MA-plot before normalization for one of T/C slides that plots the log intensity ratios against the averaged intensities at both wavelengths: M = log (T/C) and A = 1/2log(T*C). The majority of the data is less than zero in a "banana" or "comma" shaped distribution. This demonstrates a systematic, intensity-dependent dye effect, prominent at lower intensities. (B) After normalization using the lowess function, the MA-plot shows a more even distribution at all intensities.
Figure 4Control correction of each spot markedly improves the distribution of log ratios. (A) Histograms show that T/C and C/C log ratio distributions after lowess normalization still have a marked asymmetry with a larger tail towards the left (increased down-regulated genes). The distribution becomes symmetric after subtracting the log (C/C) from the log (T/C). (B) Quantile-quantile plots similarly show that the log ratio distribution becomes more normal after correction of each spot with the control ratio.
Figure 5The control correction method identifies gene expression changes from spots with variable C/C ratios. The log ratios for each gene are plotted both for the T/C (□) and the corresponding C/C (▲) hybridizations for the 46 genes selected from control correction method. The error bars represent one standard deviation in each direction.
Figure 6Both control correction and dye-swap methods reveal statistically significant changes in gene expression. Volcano plots of the control correction method (A) and the dye-swap method (B) reveal a small proportion of genes that met our arbitrary criteria of having >1.7 fold changes with p values <0.05, determined individually for each gene. The horizontal lines on each graph represent p = 0.05. The vertical lines represent 1.7 fold changes, both up- and down-regulated. Genes shown in blue in upper left and right areas were selected for northern blot confirmation.
List of identified genes. Gene accession numbers, gene descriptions, fold-changes, and p-values for genes identified by the dye-swap method with ANOVA and regularized t-test analysis and the control correction method (CCM). Genes are broadly classified into three groups: proliferation-related, differentiation-related and unclassified.
| NM_006597 | heat shock 70 kDa protein 8 | -2.8 | 9.5E-6 | -2.9 | 4.2E-7 | -2.9 | 8.4E-13 | |
| M34664 | heat shock 60 kDa protein 1 (chaperonin) | -1.9 | 3.9E-6 | -1.9 | 4.8E-10 | |||
| L15189 | heat shock 70 kDa protein 9B (mortalin-2) | -2.2 | 1.4E-3 | -1.8 | 1.8E-6 | -1.8 | 6.5E-6 | |
| M22382 | Human mitochondrial matrix protein P1 | -1.8 | 1.5E-5 | -1.7 | 2.9E-9 | |||
| M94859 | Human calnexin mRNA, complete cds | -1.8 | 4.1E-3 | |||||
| L27706 | Human chaperonin protein (Tcp20) gene complete cds | -2.1 | 3.8E-4 | |||||
| D29677 | helicase with zinc finger domain | -1.9 | 3.1E-5 | -1.9 | 2.8E-10 | |||
| X91257 | seryl-tRNA synthetase | -1.8 | 1.2E-3 | -1.8 | 4.5E-8 | -1.9 | 2.0E-11 | |
| D30655 | eukaryotic translation initiation factor 4A, isoform 2 | -1.7 | 5.5E-5 | |||||
| U76111 | Human translation repressor NAT1 mRNA, complete cds | -1.7 | 8.9E-3 | |||||
| M74719 | transcription factor 4 | -1.9 | 2.0E-3 | |||||
| X13293 | Human mRNA for B-myb gene | 1.7 | 4.2E-9 | |||||
| U00968 | Human SREBP-1 mRNA | -1.7 | 8.4E-9 | |||||
| L41490 | eukaryotic translation elongation factor 1 alpha 1-like 14 | -1.8 | 1.4E-5 | -1.7 | 5.3E-5 | -1.8 | 5.0E-12 | |
| D16481 | Homo sapiens mRNA for mitochondrial 3-ketoacyl-CoA thiolase beta-subunit of trifunctional protein, complete cds | -1.8 | 5.6E-5 | |||||
| Y00711 | Human mRNA for lactate dehydrogenase B (LDH-B) | -1.8 | 6.9E-5 | -1.7 | 2.4E-8 | |||
| D78130 | Homo sapiens mRNA for squalene epoxidase, complete cds | -1.9 | 1.2E-5 | -1.8 | 9.5E-7 | |||
| M37154 | Human glutamate dehydrogenase (GDH) mRNA, complete cds | -1.8 | 4.0E-3 | -1.7 | 8.0E-4 | |||
| U62961 | 3-oxoacid CoA transferase | -2.0 | 1.2E-4 | |||||
| Y13647 | Homo sapiens mRNA for stearoyl-CoA desaturase | -1.9 | 1.5E-6 | |||||
| Y13286 | Homo sapiens mRNA for GDP dissociation inhibitor beta | -1.8 | 1.8E-5 | -1.7 | 1.3E-6 | |||
| X51408 | Human mRNA for n-chimaerin | 2.1 | 1.8E-2 | |||||
| U47413 | Human cyclin G1 mRNA, complete cds | -1.7 | 9.6E-7 | |||||
| AF139897 | Homo sapiens BASS1 (BASS1) mRNA, partial cds | -1.8 | 1.8E-3 | |||||
| D00265 | Homo sapiens mRNA for cytochrome c, partial cds | -2.0 | 2.0E-2 | |||||
| X68836 | H. sapiens mRNA for S-adenosylmethionine synthetase | -2.2 | 8.3E-6 | |||||
| U19251 | Homo sapiens neuronal apoptosis inhibitory protein mRNA, complete cds | 1.8 | 1.1E-2 | |||||
| S45630 | alpha B-crystallin=Rosenthal fiber component [human, glioma cell line, mRNA, 691 nt] | 2.1 | 2.7E-3 | |||||
| X86779 | H. sapiens mRNA for FAST kinase | 1.7 | 1.9E-2 | |||||
| X03541 | Human mRNA of trk oncogene | -1.8 | 3.1E-3 | |||||
| M19722 | Human fgr proto-oncogene encoded p55-c-fgr protein, complete cds | -1.9 | 5.0E-3 | |||||
| X77548 | H. sapiens cDNA for RFG | -2.1 | 1.5E-3 | |||||
| D63390 | platelet-activating factor acetylhydrolase, isoform Ib, beta subunit 30 kDa | -1.8 | 4.4E-4 | |||||
| U62739 | Human branched-chain amino acid aminotransferase (ECA40) mRNA, complete cds | 1.7 | 1.2E-2 | |||||
| L12350 | Human thrombospondin 2 (THBS2) mRNA, complete cds | 1.9 | 2.2E-3 | |||||
| X06614 | Human mRNA for receptor of retinoic acid | 1.7 | 2.8E-2 | |||||
| Y07921 | Human mRNA for serine protease | -1.7 | 1.1E-6 | |||||
| AF042857 | Homo sapiens lung cancer antigen NY-LU-12 variant A mRNA, complete cds | 2.1 | 4.4E-3 | |||||
| M21300 | Human small proline rich protein (sprI) mRNA, clone 15B | 1.7 | 2.0E-4 | |||||
| M15395 | Human leukocyte adhesion protein (LFA-1/Mac-1/p150,95 family) beta subunit mRNA | 2.0 | 2.8E-2 | |||||
| S72869 | H4(D10S170) = putative cytoskeletal protein [human, thyroid, mRNA, 3011 nt] | -1.9 | 3.2E-2 | |||||
| D21267 | SNAP25 synaptosomal-associated protein, 25 kDa | 1.9 | 1.7E-2 | |||||
| U95735 | Human thrombospondin 2 (THBS2) mRNA, complete cds | -1.8 | 1.5E-5 | |||||
| X68194 | H. sapiens h-Sp1 mRNA | -1.7 | 4.2E-2 | |||||
| Z74615 | collagen, type I, alpha 1 | 2.5 | 5.3E-3 | |||||
| U68030 | chemokine (C-C motif) receptor 6 | 1.8 | 5.8E-3 | |||||
| X04701 | Human mRNA for complement component C1r | -2.4 | 3.3E-2 | |||||
| D79983 | ring finger protein 144 | 1.7 | 2.3E-2 | |||||
| D29013 | Homo sapiens mRNA for DNA polymerase beta, complete cds | 1.8 | 7.3E-3 | |||||
| J02854 | myosin, light polypeptide 9, regulatory | 2.6 | 4.5E-2 | |||||
| X58141 | Human mRNA for erythrocyte adducin alpha subunit | 2.1 | 3.2E-2 | |||||
| AJ224442 | Homo sapiens mRNA for putative methyltransferase | 1.8 | 3.4E-3 | |||||
| X06661 | Human mRNA for 27-kDa calbindin | 2.1 | 9.0E-3 | |||||
| U24266 | aldehyde dehydrogenase 4 family, member A1 | 1.7 | 2.5E-3 | |||||
| U62432 | Human nicotinic acetylcholine receptor alpha3 subunit precursor, mRNA, complete cds | 1.8 | 7.0E-3 | |||||
| U79259 | Human clone 23945 mRNA, complete cds | -1.7 | 4.2E-3 | |||||
| M18533 | Homo sapiens dystrophin (DMD) mRNA, complete cds | -2.5 | 1.1E-2 | |||||
| J05401 | Human sarcomeric mitochondrial creatine kinase(MtCK) gene, complete cds | 2.2 | 1.5E-2 | |||||
| D13315 | Human mRNA for lactoyl glutathione lyase | -1.7 | 4.6E-7 | |||||
| AB001740 | Homo sapiens mRNA for p27 | 1.8 | 8.6E-4 | |||||
| M15661 | Human ribosomal protein mRNA | -1.7 | 1.3E-4 | |||||
Figure 7Northern blots confirm a majority of gene expression changes for both methods. The far right-hand column shows northern blot results performed in triplicate for genes identified by each microarray design method. For each gene, the fold-change from the microarray together with the average fold-change quantified from the northern blots is shown. To correct for loading differences, in the northern blots each measurement was normalized to the corresponding amount of 18S rRNA measured on each gel. A representative example of the 18S rRNA is shown on the bottom of the figure. Two of the Z74615 northern blot bands were discarded due to contamination. Down-regulated, up-regulated, and false positive genes that were not confirmed by northern blots are indicated.
Figure 8Summary of confirmation rates for the two methods. (A) A Venn diagram summarizes the number of genes identified by each experimental method using 1.7-fold and p < 0.05 cut-offs and the verification rate by northern blot. While all 5 genes common to both methods were confirmed, 7 out of 11 genes from control correction method were confirmed, and 6 out of 7 genes from the dye-swap method using the ANOVA were confirmed. 8 out of 9 genes identified with the regularized t-test were confirmed. (B) A Venn diagram summarizes the number of genes identified by each experimental method of p < 0.05 without a fold change restriction.