Assaf P Oron1, Zhen Jiang, Robert Gentleman. 1. Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA 98109-1024, USA. assaf.oron@gmail.com
Abstract
MOTIVATION: Gene-set enrichment analysis (GSEA) can be greatly enhanced by linear model (regression) diagnostic techniques. Diagnostics can be used to identify outlying or influential samples, and also to evaluate model fit and explore model expansion. RESULTS: We demonstrate this methodology on an adult acute lymphoblastic leukemia (ALL) dataset, using GSEA based on chromosome-band mapping of genes. Individual residuals, grouped or aggregated by chromosomal loci, indicate problematic samples and potential data-entry errors, and help identify hyperdiploidy as a factor playing a key role in expression for this dataset. Subsequent analysis pinpoints suspected DNA copy number abnormalities of specific samples and chromosomes (most prevalent are chromosomes X, 21 and 14), and also reveals significant expression differences between the hyperdiploid and diploid groups on other chromosomes (most prominently 19, 22, 3 and 13)--differences which are apparently not associated with copy number. AVAILABILITY: Software for the statistical tools demonstrated in this article is available as Bioconductor package GSEAlm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Gene-set enrichment analysis (GSEA) can be greatly enhanced by linear model (regression) diagnostic techniques. Diagnostics can be used to identify outlying or influential samples, and also to evaluate model fit and explore model expansion. RESULTS: We demonstrate this methodology on an adult acute lymphoblastic leukemia (ALL) dataset, using GSEA based on chromosome-band mapping of genes. Individual residuals, grouped or aggregated by chromosomal loci, indicate problematic samples and potential data-entry errors, and help identify hyperdiploidy as a factor playing a key role in expression for this dataset. Subsequent analysis pinpoints suspected DNA copy number abnormalities of specific samples and chromosomes (most prevalent are chromosomes X, 21 and 14), and also reveals significant expression differences between the hyperdiploid and diploid groups on other chromosomes (most prominently 19, 22, 3 and 13)--differences which are apparently not associated with copy number. AVAILABILITY: Software for the statistical tools demonstrated in this article is available as Bioconductor package GSEAlm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Lu Tian; Steven A Greenberg; Sek Won Kong; Josiah Altschuler; Isaac S Kohane; Peter J Park Journal: Proc Natl Acad Sci U S A Date: 2005-09-08 Impact factor: 11.205
Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205
Authors: H Caron; B van Schaik ; M van der Mee ; F Baas; G Riggins; P van Sluis ; M C Hermus; R van Asperen ; K Boon; P A Voûte; S Heisterkamp; A van Kampen ; R Versteeg Journal: Science Date: 2001-02-16 Impact factor: 47.728
Authors: Mary E Ross; Xiaodong Zhou; Guangchun Song; Sheila A Shurtleff; Kevin Girtman; W Kent Williams; Hsi-Che Liu; Rami Mahfouz; Susana C Raimondi; Noel Lenny; Anami Patel; James R Downing Journal: Blood Date: 2003-05-01 Impact factor: 22.113
Authors: Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop Journal: Nat Genet Date: 2003-07 Impact factor: 38.330
Authors: Melanie A Huntley; Jessica L Larson; Christina Chaivorapol; Gabriel Becker; Michael Lawrence; Jason A Hackney; Joshua S Kaminker Journal: Bioinformatics Date: 2013-09-29 Impact factor: 6.937
Authors: William J McBride; Mark W Kimpel; Jeanette N McClintick; Zheng-Ming Ding; Petri Hyytia; Giancarlo Colombo; Tiebing Liang; Howard J Edenberg; Lawrence Lumeng; Richard L Bell Journal: Alcohol Date: 2013-10-01 Impact factor: 2.405
Authors: William J McBride; Mark W Kimpel; Jeanette N McClintick; Zheng-Ming Ding; Petri Hyytia; Giancarlo Colombo; Howard J Edenberg; Lawrence Lumeng; Richard L Bell Journal: Pharmacol Biochem Behav Date: 2012-05-10 Impact factor: 3.533
Authors: Aedín C Culhane; Thomas Schwarzl; Razvan Sultana; Kermshlise C Picard; Shaita C Picard; Tim H Lu; Katherine R Franklin; Simon J French; Gerald Papenhausen; Mick Correll; John Quackenbush Journal: Nucleic Acids Res Date: 2009-11-24 Impact factor: 16.971