Literature DB >> 12537556

Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data.

Céline Becquet1, Sylvain Blachon, Baptiste Jeudy, Jean-Francois Boulicaut, Olivier Gandrillon.   

Abstract

BACKGROUND: The association-rules discovery (ARD) technique has yet to be applied to gene-expression data analysis. Even in the absence of previous biological knowledge, it should identify sets of genes whose expression is correlated. The first association-rule miners appeared six years ago and proved efficient at dealing with sparse and weakly correlated data. A huge international research effort has led to new algorithms for tackling difficult contexts and these are particularly suited to analysis of large gene-expression matrices. To validate the ARD technique we have applied it to freely available human serial analysis of gene expression (SAGE) data.
RESULTS: The approach described here enables us to designate sets of strong association rules. We normalized the SAGE data before applying our association rule miner. Depending on the discretization algorithm used, different properties of the data were highlighted. Both common and specific interpretations could be made from the extracted rules. In each and every case the extracted collections of rules indicated that a very strong co-regulation of mRNA encoding ribosomal proteins occurs in the dataset. Several rules associating proteins involved in signal transduction were obtained and analyzed, some pointing to yet-unexplored directions. Furthermore, by examining a subset of these rules, we were able both to reassign a wrongly labeled tag, and to propose a function for an expressed sequence tag encoding a protein of unknown function.
CONCLUSIONS: We show that ARD is a promising technique that turns out to be complementary to existing gene-expression clustering techniques.

Entities:  

Mesh:

Year:  2002        PMID: 12537556      PMCID: PMC151169          DOI: 10.1186/gb-2002-3-12-research0067

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  23 in total

1.  Analysis of human transcriptomes.

Authors:  V E Velculescu; S L Madden; L Zhang; A E Lash; J Yu; C Rago; A Lal; C J Wang; G A Beaudry; K M Ciriello; B P Cook; M R Dufault; A T Ferguson; Y Gao; T C He; H Hermeking; S K Hiraldo; P M Hwang; M A Lopez; H F Luderer; B Mathews; J M Petroziello; K Polyak; L Zawel; K W Kinzler
Journal:  Nat Genet       Date:  1999-12       Impact factor: 38.330

Review 2.  Autonomous regulation in mammalian mitochondrial DNA transcription.

Authors:  J A Enríquez; P Fernández-Sílva; J Montoya
Journal:  Biol Chem       Date:  1999 Jul-Aug       Impact factor: 3.915

3.  A hierarchical unsupervised growing neural network for clustering gene expression patterns.

Authors:  J Herrero; A Valencia; J Dopazo
Journal:  Bioinformatics       Date:  2001-02       Impact factor: 6.937

4.  SAGEmap: a public gene expression resource.

Authors:  A E Lash; C M Tolstoshev; L Wagner; G D Schuler; R L Strausberg; G J Riggins; S F Altschul
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

5.  Functional discovery via a compendium of expression profiles.

Authors:  T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal:  Cell       Date:  2000-07-07       Impact factor: 41.582

6.  Transient activation of the c-Jun N-terminal kinase (JNK) activity by ligation of the tetraspan CD53 antigen in different cell types.

Authors:  Mónica Yunta; José L Oliva; Ramiro Barcia; Vaclav Horejsi; Paula Angelisova; Pedro A Lazo
Journal:  Eur J Biochem       Date:  2002-02

7.  N-myc enhances the expression of a large set of genes functioning in ribosome biogenesis and protein synthesis.

Authors:  K Boon; H N Caron; R van Asperen; L Valentijn; M C Hermus; P van Sluis; I Roobeek; I Weis; P A Voûte; M Schwab; R Versteeg
Journal:  EMBO J       Date:  2001-03-15       Impact factor: 11.598

Review 8.  Molecular profiling of human cancer.

Authors:  L Liotta; E Petricoin
Journal:  Nat Rev Genet       Date:  2000-10       Impact factor: 53.242

9.  Peptidylprolyl isomerase A (PPIA) as a preferred internal control over GAPDH and beta-actin in quantitative RNA analyses.

Authors:  F Feroze-Merzoug; I M Berquin; J Dey; Y Q Chen
Journal:  Biotechniques       Date:  2002-04       Impact factor: 1.993

10.  Nerve growth factor selectively regulates expression of transcripts encoding ribosomal proteins.

Authors:  James M Angelastro; Béata Töröcsik; Lloyd A Greene
Journal:  BMC Neurosci       Date:  2002-02-28       Impact factor: 3.288

View more
  23 in total

1.  Discovery of error-tolerant biclusters from noisy gene expression data.

Authors:  Rohit Gupta; Navneet Rao; Vipin Kumar
Journal:  BMC Bioinformatics       Date:  2011-11-24       Impact factor: 3.169

2.  Translational bioinformatics and healthcare informatics: computational and ethical challenges.

Authors:  Prerna Sethi; Kimberly Theodos
Journal:  Perspect Health Inf Manag       Date:  2009-09-16

3.  Association Rule Discovery Has the Ability to Model Complex Genetic Effects.

Authors:  William S Bush; Tricia A Thornton-Wells; Marylyn D Ritchie
Journal:  IEEE Symp Comput Intell Data Min       Date:  2007-03-01

4.  Expression Data Analysis for the Identification of Potential Biomarker of Pregnancy Associated Breast Cancer.

Authors:  Raja Rajeswary Thanmalagan; Leimarembi Devi Naorem; Amouda Venkatesan
Journal:  Pathol Oncol Res       Date:  2016-11-10       Impact factor: 3.201

5.  Stem cell antigen 2: a new gene involved in the self-renewal of erythroid progenitors.

Authors:  C Bresson-Mazet; O Gandrillon; S Gonin-Giraud
Journal:  Cell Prolif       Date:  2008-10       Impact factor: 6.831

6.  Association rule based similarity measures for the clustering of gene expression data.

Authors:  Prerna Sethi; Sathya Alagiriswamy
Journal:  Open Med Inform J       Date:  2010-05-28

7.  Identification of temporal association rules from time-series microarray data sets.

Authors:  Hojung Nam; KiYoung Lee; Doheon Lee
Journal:  BMC Bioinformatics       Date:  2009-03-19       Impact factor: 3.169

8.  Discovering associations in biomedical datasets by link-based associative classifier (LAC).

Authors:  Pulan Yu; David J Wild
Journal:  PLoS One       Date:  2012-12-05       Impact factor: 3.240

9.  Fast rule-based bioactivity prediction using associative classification mining.

Authors:  Pulan Yu; David J Wild
Journal:  J Cheminform       Date:  2012-11-23       Impact factor: 5.514

10.  MIDClass: microarray data classification by association rules and gene expression intervals.

Authors:  Rosalba Giugno; Alfredo Pulvirenti; Luciano Cascione; Giuseppe Pigola; Alfredo Ferro
Journal:  PLoS One       Date:  2013-08-06       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.