BACKGROUND: Raw data normalization is a critical step in microarray data analysis because it directly affects data interpretation. Most of the normalization methods currently used are included in the R/BioConductor packages but it is often difficult to identify the most appropriate method. Furthermore, the use of R commands for functions and graphics can introduce mistakes that are difficult to trace. We present here a script written in R that provides a flexible means of access to and monitoring of data normalization for two-color microarrays. This script combines the power of BioConductor and R analysis functions and reduces the amount of R programming required. RESULTS: Goulphar was developed in and runs using the R language and environment. It combines and extends functions found in BioConductor packages (limma and marray) to correct for dye biases and spatial artifacts. Goulphar provides a wide range of optional and customizable filters for excluding incorrect signals during the pre-processing step. It displays informative output plots, enabling the user to monitor the normalization process, and helps adapt the normalization method appropriately to the data. All these analyses and graphical outputs are presented in a single PDF report. CONCLUSION: Goulphar provides simple, rapid access to the power of the R/BioConductor statistical analysis packages, with precise control and visualization of the results obtained. Complete documentation, examples and online forms for setting script parameters are available from http://transcriptome.ens.fr/goulphar/.
BACKGROUND: Raw data normalization is a critical step in microarray data analysis because it directly affects data interpretation. Most of the normalization methods currently used are included in the R/BioConductor packages but it is often difficult to identify the most appropriate method. Furthermore, the use of R commands for functions and graphics can introduce mistakes that are difficult to trace. We present here a script written in R that provides a flexible means of access to and monitoring of data normalization for two-color microarrays. This script combines the power of BioConductor and R analysis functions and reduces the amount of R programming required. RESULTS:Goulphar was developed in and runs using the R language and environment. It combines and extends functions found in BioConductor packages (limma and marray) to correct for dye biases and spatial artifacts. Goulphar provides a wide range of optional and customizable filters for excluding incorrect signals during the pre-processing step. It displays informative output plots, enabling the user to monitor the normalization process, and helps adapt the normalization method appropriately to the data. All these analyses and graphical outputs are presented in a single PDF report. CONCLUSION:Goulphar provides simple, rapid access to the power of the R/BioConductor statistical analysis packages, with precise control and visualization of the results obtained. Complete documentation, examples and online forms for setting script parameters are available from http://transcriptome.ens.fr/goulphar/.
Authors: Karsten Hokamp; Fiona M Roche; Michael Acab; Marc-Etienne Rousseau; Byron Kuo; David Goode; Dana Aeschliman; Jenny Bryan; Lorne A Babiuk; Robert E W Hancock; Fiona S L Brinkman Journal: Nucleic Acids Res Date: 2004-07-01 Impact factor: 16.971
Authors: Rafael A Irizarry; Daniel Warren; Forrest Spencer; Irene F Kim; Shyam Biswal; Bryan C Frank; Edward Gabrielson; Joe G N Garcia; Joel Geoghegan; Gregory Germino; Constance Griffin; Sara C Hilmer; Eric Hoffman; Anne E Jedlicka; Ernest Kawasaki; Francisco Martínez-Murillo; Laura Morsberger; Hannah Lee; David Petersen; John Quackenbush; Alan Scott; Michael Wilson; Yanqin Yang; Shui Qing Ye; Wayne Yu Journal: Nat Methods Date: 2005-04-21 Impact factor: 28.547
Authors: Theodore Bammler; Richard P Beyer; Sanchita Bhattacharya; Gary A Boorman; Abee Boyles; Blair U Bradford; Roger E Bumgarner; Pierre R Bushel; Kabir Chaturvedi; Dongseok Choi; Michael L Cunningham; Shibing Deng; Holly K Dressman; Rickie D Fannin; Fredrico M Farin; Jonathan H Freedman; Rebecca C Fry; Angel Harper; Michael C Humble; Patrick Hurban; Terrance J Kavanagh; William K Kaufmann; Kathleen F Kerr; Li Jing; Jodi A Lapidus; Michael R Lasarev; Jianying Li; Yi-Ju Li; Edward K Lobenhofer; Xinfang Lu; Renae L Malek; Sean Milton; Srinivasa R Nagalla; Jean P O'malley; Valerie S Palmer; Patrick Pattee; Richard S Paules; Charles M Perou; Ken Phillips; Li-Xuan Qin; Yang Qiu; Sean D Quigley; Matthew Rodland; Ivan Rusyn; Leona D Samson; David A Schwartz; Yan Shi; Jung-Lim Shin; Stella O Sieber; Susan Slifer; Marcy C Speer; Peter S Spencer; Dean I Sproles; James A Swenberg; William A Suk; Robert C Sullivan; Ru Tian; Raymond W Tennant; Signe A Todd; Charles J Tucker; Bennett Van Houten; Brenda K Weis; Shirley Xuan; Helmut Zarbl Journal: Nat Methods Date: 2005-04-21 Impact factor: 28.547
Authors: Barry R Zeeberg; Joseph Riss; David W Kane; Kimberly J Bussey; Edward Uchio; W Marston Linehan; J Carl Barrett; John N Weinstein Journal: BMC Bioinformatics Date: 2004-06-23 Impact factor: 3.169
Authors: Svetlana Khakhina; Soraya S Johnson; Raman Manoharlal; Sarah B Russo; Corinne Blugeon; Sophie Lemoine; Anna B Sunshine; Maitreya J Dunham; L Ashley Cowart; Frédéric Devaux; W Scott Moye-Rowley Journal: Eukaryot Cell Date: 2015-02-27
Authors: Matthew B Lohse; Aaron D Hernday; Polly M Fordyce; Liron Noiman; Trevor R Sorrells; Victor Hanson-Smith; Clarissa J Nobile; Joseph L DeRisi; Alexander D Johnson Journal: Proc Natl Acad Sci U S A Date: 2013-04-22 Impact factor: 11.205
Authors: Laurent Jourdren; Aurélie Duclos; Christian Brion; Thomas Portnoy; Hugues Mathis; Antoine Margeot; Stéphane Le Crom Journal: Nucleic Acids Res Date: 2010-02-21 Impact factor: 16.971