Literature DB >> 17479391

Systematic approaches for incorporating control spots and data quality information to improve normalization of cDNA microarray data.

D Wang1, C-H Zhang, M B Soares, J Huang.   

Abstract

BACKGROUND: Normalization and data quality control are two important aspects in microarray data analysis. Proper normalization and data quality control ensure that intensity ratios provide meaningful and accurate measurement of relative gene expression values. Control spots such as spikes and housekeeping genes with known concentrations in two channels are often used for calibrating experimental parameters. They provide valuable information about experimental variation which can be utilized for better normalization. They are also needed for proper normalization in cases that the most of the spots tend to change in one direction. In addition, it is desirable to include information on spot quality. Such information is available in a typical microarray data set, but is not fully utilized by existing normalization methods.
RESULTS: We propose two extensions of the two-way semi-linear model (TW-SLM) for appropriately combining control genes and spot quality information in normalization. The first extension (TW-SLMC) is designed to systematically incorporate control spots in a semi-parametric model to calibrate estimated normalization curves so that the relative fold changes of gene expressions are accurately estimated. Extrapolation is not required in this approach. The second extension (TW-SLMQ) is proposed to incorporate spot quality measure into normalization. This approach down-weights spots with lower quality scores in normalization. These two extensions can be used simultaneously for normalizing a data set. Two microarray data sets are used to demonstrate the proposed methods. AVAILABILITY: An R based computing package is developed for the proposed methods and available from the corresponding authors.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17479391     DOI: 10.1080/10543400701199544

Source DB:  PubMed          Journal:  J Biopharm Stat        ISSN: 1054-3406            Impact factor:   1.051


  2 in total

1.  Error, reproducibility and sensitivity: a pipeline for data processing of Agilent oligonucleotide expression arrays.

Authors:  Benjamin Chain; Helen Bowen; John Hammond; Wilfried Posch; Jane Rasaiyaah; Jhen Tsang; Mahdad Noursadeghi
Journal:  BMC Bioinformatics       Date:  2010-06-24       Impact factor: 3.169

2.  Advanced spot quality analysis in two-colour microarray experiments.

Authors:  Mikalai Yatskou; Eugene Novikov; Guillaume Vetter; Arnaud Muller; Emmanuel Barillot; Laurent Vallar; Evelyne Friederich
Journal:  BMC Res Notes       Date:  2008-09-17
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.