Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data.

Literature DB >> 12537556

Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data.

Céline Becquet¹, Sylvain Blachon, Baptiste Jeudy, Jean-Francois Boulicaut, Olivier Gandrillon.

Abstract

BACKGROUND: The association-rules discovery (ARD) technique has yet to be applied to gene-expression data analysis. Even in the absence of previous biological knowledge, it should identify sets of genes whose expression is correlated. The first association-rule miners appeared six years ago and proved efficient at dealing with sparse and weakly correlated data. A huge international research effort has led to new algorithms for tackling difficult contexts and these are particularly suited to analysis of large gene-expression matrices. To validate the ARD technique we have applied it to freely available human serial analysis of gene expression (SAGE) data.
RESULTS: The approach described here enables us to designate sets of strong association rules. We normalized the SAGE data before applying our association rule miner. Depending on the discretization algorithm used, different properties of the data were highlighted. Both common and specific interpretations could be made from the extracted rules. In each and every case the extracted collections of rules indicated that a very strong co-regulation of mRNA encoding ribosomal proteins occurs in the dataset. Several rules associating proteins involved in signal transduction were obtained and analyzed, some pointing to yet-unexplored directions. Furthermore, by examining a subset of these rules, we were able both to reassign a wrongly labeled tag, and to propose a function for an expressed sequence tag encoding a protein of unknown function.
CONCLUSIONS: We show that ARD is a promising technique that turns out to be complementary to existing gene-expression clustering techniques.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Year: 2002 PMID： 12537556 PMCID： PMC151169 DOI： 10.1186/gb-2002-3-12-research0067

Source DB: PubMed Journal: Genome Biol ISSN： 1474-7596 Impact factor: 13.583

23 in total

1. Analysis of human transcriptomes.

Authors: V E Velculescu; S L Madden; L Zhang; A E Lash; J Yu; C Rago; A Lal; C J Wang; G A Beaudry; K M Ciriello; B P Cook; M R Dufault; A T Ferguson; Y Gao; T C He; H Hermeking; S K Hiraldo; P M Hwang; M A Lopez; H F Luderer; B Mathews; J M Petroziello; K Polyak; L Zawel; K W Kinzler
Journal: Nat Genet Date: 1999-12 Impact factor: 38.330

Review 2. Autonomous regulation in mammalian mitochondrial DNA transcription.

Authors: J A Enríquez; P Fernández-Sílva; J Montoya
Journal: Biol Chem Date: 1999 Jul-Aug Impact factor: 3.915

3. A hierarchical unsupervised growing neural network for clustering gene expression patterns.

Authors: J Herrero; A Valencia; J Dopazo
Journal: Bioinformatics Date: 2001-02 Impact factor: 6.937

4. SAGEmap: a public gene expression resource.

Authors: A E Lash; C M Tolstoshev; L Wagner; G D Schuler; R L Strausberg; G J Riggins; S F Altschul
Journal: Genome Res Date: 2000-07 Impact factor: 9.043

5. Functional discovery via a compendium of expression profiles.

Authors: T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal: Cell Date: 2000-07-07 Impact factor: 41.582

6. Transient activation of the c-Jun N-terminal kinase (JNK) activity by ligation of the tetraspan CD53 antigen in different cell types.

Authors: Mónica Yunta; José L Oliva; Ramiro Barcia; Vaclav Horejsi; Paula Angelisova; Pedro A Lazo
Journal: Eur J Biochem Date: 2002-02

7. N-myc enhances the expression of a large set of genes functioning in ribosome biogenesis and protein synthesis.

Authors: K Boon; H N Caron; R van Asperen; L Valentijn; M C Hermus; P van Sluis; I Roobeek; I Weis; P A Voûte; M Schwab; R Versteeg
Journal: EMBO J Date: 2001-03-15 Impact factor: 11.598

Review 8. Molecular profiling of human cancer.

Authors: L Liotta; E Petricoin
Journal: Nat Rev Genet Date: 2000-10 Impact factor: 53.242

9. Peptidylprolyl isomerase A (PPIA) as a preferred internal control over GAPDH and beta-actin in quantitative RNA analyses.

Authors: F Feroze-Merzoug; I M Berquin; J Dey; Y Q Chen
Journal: Biotechniques Date: 2002-04 Impact factor: 1.993

10. Nerve growth factor selectively regulates expression of transcripts encoding ribosomal proteins.

Authors: James M Angelastro; Béata Töröcsik; Lloyd A Greene
Journal: BMC Neurosci Date: 2002-02-28 Impact factor: 3.288

23 in total

1. Discovery of error-tolerant biclusters from noisy gene expression data.

Authors: Rohit Gupta; Navneet Rao; Vipin Kumar
Journal: BMC Bioinformatics Date: 2011-11-24 Impact factor: 3.169

2. Translational bioinformatics and healthcare informatics: computational and ethical challenges.

Authors: Prerna Sethi; Kimberly Theodos
Journal: Perspect Health Inf Manag Date: 2009-09-16

3. Association Rule Discovery Has the Ability to Model Complex Genetic Effects.

Authors: William S Bush; Tricia A Thornton-Wells; Marylyn D Ritchie
Journal: IEEE Symp Comput Intell Data Min Date: 2007-03-01

4. Expression Data Analysis for the Identification of Potential Biomarker of Pregnancy Associated Breast Cancer.

Authors: Raja Rajeswary Thanmalagan; Leimarembi Devi Naorem; Amouda Venkatesan
Journal: Pathol Oncol Res Date: 2016-11-10 Impact factor: 3.201

5. Stem cell antigen 2: a new gene involved in the self-renewal of erythroid progenitors.

Authors: C Bresson-Mazet; O Gandrillon; S Gonin-Giraud
Journal: Cell Prolif Date: 2008-10 Impact factor: 6.831

6. Association rule based similarity measures for the clustering of gene expression data.

Authors: Prerna Sethi; Sathya Alagiriswamy
Journal: Open Med Inform J Date: 2010-05-28

7. Identification of temporal association rules from time-series microarray data sets.

Authors: Hojung Nam; KiYoung Lee; Doheon Lee
Journal: BMC Bioinformatics Date: 2009-03-19 Impact factor: 3.169

8. Discovering associations in biomedical datasets by link-based associative classifier (LAC).

Authors: Pulan Yu; David J Wild
Journal: PLoS One Date: 2012-12-05 Impact factor: 3.240

9. Fast rule-based bioactivity prediction using associative classification mining.

Authors: Pulan Yu; David J Wild
Journal: J Cheminform Date: 2012-11-23 Impact factor: 5.514

10. MIDClass: microarray data classification by association rules and gene expression intervals.

Authors: Rosalba Giugno; Alfredo Pulvirenti; Luciano Cascione; Giuseppe Pigola; Alfredo Ferro
Journal: PLoS One Date: 2013-08-06 Impact factor: 3.240