Literature DB >> 16600052

Mining gene expression data by interpreting principal components.

Joseph C Roden1, Brandon W King, Diane Trout, Ali Mortazavi, Barbara J Wold, Christopher E Hart.   

Abstract

BACKGROUND: There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis.
RESULTS: We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset). We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.). Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation.
CONCLUSION: We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It has proven especially valuable in instances where there are many diverse conditions (10's to hundreds of different tissues or cell types), a situation in which many clustering and ordering algorithms become problematic. This approach also shows promise in other topic domains such as multi-spectral imaging datasets.

Entities:  

Mesh:

Year:  2006        PMID: 16600052      PMCID: PMC1501050          DOI: 10.1186/1471-2105-7-194

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  28 in total

1.  Nonparametric methods for identifying differentially expressed genes in microarray data.

Authors:  Olga G Troyanskaya; Mitchell E Garber; Patrick O Brown; David Botstein; Russ B Altman
Journal:  Bioinformatics       Date:  2002-11       Impact factor: 6.937

2.  The transcriptional program of sporulation in budding yeast.

Authors:  S Chu; J DeRisi; M Eisen; J Mulholland; D Botstein; P O Brown; I Herskowitz
Journal:  Science       Date:  1998-10-23       Impact factor: 47.728

3.  Large-scale temporal gene expression mapping of central nervous system development.

Authors:  X Wen; S Fuhrman; G S Michaels; D B Carr; S Smith; J L Barker; R Somogyi
Journal:  Proc Natl Acad Sci U S A       Date:  1998-01-06       Impact factor: 11.205

4.  Iterative signature algorithm for the analysis of large-scale gene expression data.

Authors:  Sven Bergmann; Jan Ihmels; Naama Barkai
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2003-03-11

5.  Classification-algorithm evaluation: five performance measures based on confusion matrices.

Authors:  A D Forbes
Journal:  J Clin Monit       Date:  1995-05

6.  A gene atlas of the mouse and human protein-encoding transcriptomes.

Authors:  Andrew I Su; Tim Wiltshire; Serge Batalov; Hilmar Lapp; Keith A Ching; David Block; Jie Zhang; Richard Soden; Mimi Hayakawa; Gabriel Kreiman; Michael P Cooke; John R Walker; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-09       Impact factor: 11.205

7.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

8.  Revealing modular organization in the yeast transcriptional network.

Authors:  Jan Ihmels; Gilgi Friedlander; Sven Bergmann; Ofer Sarig; Yaniv Ziv; Naama Barkai
Journal:  Nat Genet       Date:  2002-07-22       Impact factor: 38.330

9.  An unsupervised approach to identify molecular phenotypic components influencing breast cancer features.

Authors:  Florin M Selaru; Jing Yin; Andreea Olaru; Yuriko Mori; Yan Xu; Steven H Epstein; Fumiaki Sato; Elena Deacu; Suna Wang; Anca Sterian; Amy Fulton; John M Abraham; David Shibata; Claudia Baquet; Sanford A Stass; Stephen J Meltzer
Journal:  Cancer Res       Date:  2004-03-01       Impact factor: 12.701

10.  PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.

Authors:  Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop
Journal:  Nat Genet       Date:  2003-07       Impact factor: 38.330

View more
  23 in total

1.  Independent component analysis: mining microarray data for fundamental human gene expression modules.

Authors:  Jesse M Engreitz; Bernie J Daigle; Jonathan J Marshall; Russ B Altman
Journal:  J Biomed Inform       Date:  2010-07-07       Impact factor: 6.317

2.  Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.

Authors:  Jie Tan; Georgia Doing; Kimberley A Lewis; Courtney E Price; Kathleen M Chen; Kyle C Cady; Barret Perchuk; Michael T Laub; Deborah A Hogan; Casey S Greene
Journal:  Cell Syst       Date:  2017-07-12       Impact factor: 10.304

3.  Characterization of pandemic influenza immune memory signature after vaccination or infection.

Authors:  Olivia Bonduelle; Fabrice Carrat; Charles-Edouard Luyt; Catherine Leport; Anne Mosnier; Nora Benhabiles; Anne Krivine; Flore Rozenberg; Nora Yahia; Assia Samri; Dominique Rousset; Sylvie van der Werf; Brigitte Autran; Behazine Combadiere
Journal:  J Clin Invest       Date:  2014-06-09       Impact factor: 14.808

4.  Systems genetics of the nuclear factor-κB signal transduction network. I. Detection of several quantitative trait loci potentially relevant to aging.

Authors:  Vincent P Diego; Joanne E Curran; Jac Charlesworth; Juan M Peralta; V Saroja Voruganti; Shelley A Cole; Thomas D Dyer; Matthew P Johnson; Eric K Moses; Harald H H Göring; Jeff T Williams; Anthony G Comuzzie; Laura Almasy; John Blangero; Sarah Williams-Blangero
Journal:  Mech Ageing Dev       Date:  2011-12-01       Impact factor: 5.432

5.  A multivariate statistical test for differential expression analysis.

Authors:  Michele Tumminello; Giorgio Bertolazzi; Gianluca Sottile; Nicolina Sciaraffa; Walter Arancio; Claudia Coronnello
Journal:  Sci Rep       Date:  2022-05-18       Impact factor: 4.996

6.  Temporal transcriptional response during infection of type II alveolar epithelial cells with Francisella tularensis live vaccine strain (LVS) supports a general host suppression and bacterial uptake by macropinocytosis.

Authors:  Christopher E Bradburne; Anne B Verhoeven; Ganiraju C Manyam; Saira A Chaudhry; Eddie L Chang; Dzung C Thach; Charles L Bailey; Monique L van Hoek
Journal:  J Biol Chem       Date:  2013-01-15       Impact factor: 5.157

7.  Deciphering transcriptional networks that govern Coffea arabica seed development using combined cDNA array and real-time RT-PCR approaches.

Authors:  Jordi Salmona; Stéphane Dussert; Frédéric Descroix; Alexandre de Kochko; Benoît Bertrand; Thierry Joët
Journal:  Plant Mol Biol       Date:  2007-11-15       Impact factor: 4.076

8.  Islet sympathetic innervation and islet neuropathology in patients with type 1 diabetes.

Authors:  Martha Campbell-Thompson; Elizabeth A Butterworth; J Lucas Boatwright; Malavika A Nair; Lith H Nasif; Kamal Nasif; Andy Y Revell; Alberto Riva; Clayton E Mathews; Ivan C Gerling; Desmond A Schatz; Mark A Atkinson
Journal:  Sci Rep       Date:  2021-03-22       Impact factor: 4.379

9.  Spectral gene set enrichment (SGSE).

Authors:  H Robert Frost; Zhigang Li; Jason H Moore
Journal:  BMC Bioinformatics       Date:  2015-03-03       Impact factor: 3.169

10.  Mitochondrial network genes in the skeletal muscle of amyotrophic lateral sclerosis patients.

Authors:  Camilla Bernardini; Federica Censi; Wanda Lattanzi; Marta Barba; Giovanni Calcagnini; Alessandro Giuliani; Giorgio Tasca; Mario Sabatelli; Enzo Ricci; Fabrizio Michetti
Journal:  PLoS One       Date:  2013-02-28       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.