Literature DB >> 18194724

Data mining in genomics.

Jae K Lee1, Paul D Williams, Sooyoung Cheon.   

Abstract

This article reviews important emerging statistical concepts, data mining techniques, and applications that have been recently developed and used for genomic data analysis. First, general background and some critical issues in genomic data mining are summarized. A novel concept of statistical significance is described, the so-called "false discovery rate"-the rate of false-positives among all positive findings-which has been suggested to control the error rate of numerous false-positives in large screening biological data analysis. Two recent statistical testing methods are then introduced: significance analysis of microarray and local pooled error tests. Statistical modeling in genomic data analysis is then presented, such as analysis of variance and heterogeneous error modeling approaches that have been suggested for analyzing microarray data obtained from multiple experimental or biological conditions. Two sections then describe data exploration and discovery tools largely termed as supervised learning and unsupervised learning. The former approaches include several multivariate statistical methods to investigate coexpression patterns of multiple genes, and the latter are the classification methods to discover genomic biomarker signatures for predicting important subclasses of human diseases. The last section briefly summarizes various genomic data mining approaches in biomedical pathway analysis and patient outcome or chemotherapeutic response prediction.

Entities:  

Mesh:

Year:  2008        PMID: 18194724      PMCID: PMC2253491          DOI: 10.1016/j.cll.2007.10.010

Source DB:  PubMed          Journal:  Clin Lab Med        ISSN: 0272-2712            Impact factor:   1.935


  45 in total

1.  Bayesian hierarchical error model for analysis of gene expression data.

Authors:  HyungJun Cho; Jae K Lee
Journal:  Bioinformatics       Date:  2004-03-25       Impact factor: 6.937

2.  Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

Authors:  George C Tseng; Wing H Wong
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

3.  Robust classification modeling on microarray data using misclassification penalized posterior.

Authors:  Mat Soukup; HyungJun Cho; Jae K Lee
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

4.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer.

Authors:  Soonmyung Paik; Steven Shak; Gong Tang; Chungyeul Kim; Joffre Baker; Maureen Cronin; Frederick L Baehner; Michael G Walker; Drew Watson; Taesung Park; William Hiller; Edwin R Fisher; D Lawrence Wickerham; John Bryant; Norman Wolmark
Journal:  N Engl J Med       Date:  2004-12-10       Impact factor: 91.245

5.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies.

Authors:  Andrea H Bild; Guang Yao; Jeffrey T Chang; Quanli Wang; Anil Potti; Dawn Chasse; Mary-Beth Joshi; David Harpole; Johnathan M Lancaster; Andrew Berchuck; John A Olson; Jeffrey R Marks; Holly K Dressman; Mike West; Joseph R Nevins
Journal:  Nature       Date:  2005-11-06       Impact factor: 49.962

6.  An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival.

Authors:  Lance D Miller; Johanna Smeds; Joshy George; Vinsensius B Vega; Liza Vergara; Alexander Ploner; Yudi Pawitan; Per Hall; Sigrid Klaar; Edison T Liu; Jonas Bergh
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-02       Impact factor: 11.205

7.  Gene profiling identifies genes specific for well-differentiated epithelial thyroid tumors.

Authors:  L G Puskas; F Juhasz; A Zarva; L Hackler; N R Farid
Journal:  Cell Mol Biol (Noisy-le-grand)       Date:  2005-09-05       Impact factor: 1.770

8.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

9.  Developing optimal prediction models for cancer classification using gene expression data.

Authors:  Mat Soukup; Jae K Lee
Journal:  J Bioinform Comput Biol       Date:  2004-01       Impact factor: 1.122

10.  Rank-invariant resampling based estimation of false discovery rate for analysis of small sample microarray data.

Authors:  Nitin Jain; HyungJun Cho; Michael O'Connell; Jae K Lee
Journal:  BMC Bioinformatics       Date:  2005-07-22       Impact factor: 3.169

View more
  8 in total

Review 1.  Clinical applications of metabolomics in oncology: a review.

Authors:  Jennifer L Spratlin; Natalie J Serkova; S Gail Eckhardt
Journal:  Clin Cancer Res       Date:  2009-01-15       Impact factor: 12.531

2.  Data mining of mental health issues of non-bone marrow donor siblings.

Authors:  Morihito Takita; Yuji Tanaka; Yuko Kodama; Naoko Murashige; Nobuyo Hatanaka; Yukiko Kishi; Tomoko Matsumura; Yukio Ohsawa; Masahiro Kami
Journal:  J Clin Bioinforma       Date:  2011-07-20

3.  Bioinformatic-driven search for metabolic biomarkers in disease.

Authors:  Christian Baumgartner; Melanie Osl; Michael Netzer; Daniela Baumgartner
Journal:  J Clin Bioinforma       Date:  2011-01-20

Review 4.  Clinical and diagnostic utility of saliva as a non-invasive diagnostic fluid: 
a systematic review.

Authors:  Lazaro Alessandro Soares Nunes; Sayeeda Mussavira; Omana Sukumaran Bindhu
Journal:  Biochem Med (Zagreb)       Date:  2015-06-05       Impact factor: 2.313

5.  Prediction of breast cancer survival through knowledge discovery in databases.

Authors:  Hadi Lotfnezhad Afshar; Maryam Ahmadi; Masoud Roudbari; Farahnaz Sadoughi
Journal:  Glob J Health Sci       Date:  2015-01-26

Review 6.  Insights into Systemic Disease through Retinal Imaging-Based Oculomics.

Authors:  Siegfried K Wagner; Dun Jack Fu; Livia Faes; Xiaoxuan Liu; Josef Huemer; Hagar Khalid; Daniel Ferraz; Edward Korot; Christopher Kelly; Konstantinos Balaskas; Alastair K Denniston; Pearse A Keane
Journal:  Transl Vis Sci Technol       Date:  2020-02-12       Impact factor: 3.283

7.  Challenges of the information age: the impact of false discovery on pathway identification.

Authors:  Colin J Rog; Srinivasa C Chekuri; Mary E Edgerton
Journal:  BMC Res Notes       Date:  2012-11-21

8.  Molecular and genetic markers in hepatocellular carcinoma: In silico analysis to clinical validation (current limitations and future promises).

Authors:  Sarah El-Nakeep
Journal:  World J Gastrointest Pathophysiol       Date:  2022-01-22
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.