Literature DB >> 19854753

Knowledge-based data analysis comes of age.

Michael F Ochs1.   

Abstract

The emergence of high-throughput technologies for measuring biological systems has introduced problems for data interpretation that must be addressed for proper inference. First, analysis techniques need to be matched to the biological system, reflecting in their mathematical structure the underlying behavior being studied. When this is not done, mathematical techniques will generate answers, but the values and reliability estimates may not accurately reflect the biology. Second, analysis approaches must address the vast excess in variables measured (e.g. transcript levels of genes) over the number of samples (e.g. tumors, time points), known as the 'large-p, small-n' problem. In large-p, small-n paradigms, standard statistical techniques generally fail, and computational learning algorithms are prone to overfit the data. Here we review the emergence of techniques that match mathematical structure to the biology, the use of integrated data and prior knowledge to guide statistical analysis, and the recent emergence of analysis approaches utilizing simple biological models. We show that novel biological insights have been gained using these techniques.

Entities:  

Mesh:

Year:  2009        PMID: 19854753      PMCID: PMC3700349          DOI: 10.1093/bib/bbp044

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  67 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Functional discovery via a compendium of expression profiles.

Authors:  T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal:  Cell       Date:  2000-07-07       Impact factor: 41.582

3.  Identifying functional modules using expression profiles and confidence-scored protein interactions.

Authors:  Igor Ulitsky; Ron Shamir
Journal:  Bioinformatics       Date:  2009-03-17       Impact factor: 6.937

4.  Quantitative monitoring of gene expression patterns with a complementary DNA microarray.

Authors:  M Schena; D Shalon; R W Davis; P O Brown
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

5.  Metabolomic profiles delineate potential role for sarcosine in prostate cancer progression.

Authors:  Arun Sreekumar; Laila M Poisson; Thekkelnaycke M Rajendiran; Amjad P Khan; Qi Cao; Jindan Yu; Bharathi Laxman; Rohit Mehra; Robert J Lonigro; Yong Li; Mukesh K Nyati; Aarif Ahsan; Shanker Kalyana-Sundaram; Bo Han; Xuhong Cao; Jaeman Byun; Gilbert S Omenn; Debashis Ghosh; Subramaniam Pennathur; Danny C Alexander; Alvin Berger; Jeffrey R Shuster; John T Wei; Sooryanarayana Varambally; Christopher Beecher; Arul M Chinnaiyan
Journal:  Nature       Date:  2009-02-12       Impact factor: 49.962

6.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

7.  Genetic variants associated with carboplatin-induced cytotoxicity in cell lines derived from Africans.

Authors:  R Stephanie Huang; Shiwei Duan; Emily O Kistner; Christine M Hartford; M Eileen Dolan
Journal:  Mol Cancer Ther       Date:  2008-09-02       Impact factor: 6.261

8.  The caBIG terminology review process.

Authors:  James J Cimino; Terry F Hayamizu; Olivier Bodenreider; Brian Davis; Grace A Stafford; Martin Ringwald
Journal:  J Biomed Inform       Date:  2008-12-25       Impact factor: 6.317

9.  Investigating the correspondence between transcriptomic and proteomic expression profiles using coupled cluster models.

Authors:  Simon Rogers; Mark Girolami; Walter Kolch; Katrina M Waters; Tao Liu; Brian Thrall; H Steven Wiley
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

10.  Network-based analysis of affected biological processes in type 2 diabetes models.

Authors:  Manway Liu; Arthur Liberzon; Sek Won Kong; Weil R Lai; Peter J Park; Isaac S Kohane; Simon Kasif
Journal:  PLoS Genet       Date:  2007-06       Impact factor: 5.917

View more
  7 in total

Review 1.  Phenomics: the next challenge.

Authors:  David Houle; Diddahally R Govindaraju; Stig Omholt
Journal:  Nat Rev Genet       Date:  2010-12       Impact factor: 53.242

2.  OnionTree XML: a format to exchange gene-related probabilities.

Authors:  Alexander Favorov; Dmitrijs Lvovs; William Speier; Giovanni Parmigiani; Michael F Ochs
Journal:  J Biomol Struct Dyn       Date:  2011-10

3.  ConReg-R: Extrapolative recalibration of the empirical distribution of p-values to improve false discovery rate estimates.

Authors:  Juntao Li; Puteri Paramita; Kwok Pui Choi; R Krishna Murthy Karuturi
Journal:  Biol Direct       Date:  2011-05-20       Impact factor: 4.540

4.  Leveraging domain information to restructure biological prediction.

Authors:  Xiaofei Nan; Gang Fu; Zhengdong Zhao; Sheng Liu; Ronak Y Patel; Haining Liu; Pankaj R Daga; Robert J Doerksen; Xin Dang; Yixin Chen; Dawn Wilkins
Journal:  BMC Bioinformatics       Date:  2011-10-18       Impact factor: 3.169

5.  UGT2B17 and miR-224 contribute to hormone dependency trends in adenocarcinoma and squamous cell carcinoma of esophagus.

Authors:  Xiangyao Lian; Ancha Baranova; Jimmy Ngo; Guiping Yu; Hongbao Cao
Journal:  Biosci Rep       Date:  2019-07-05       Impact factor: 3.840

6.  Knowledge-based identification of soluble biomarkers: hepatic fibrosis in NAFLD as an example.

Authors:  Sandra Page; Aybike Birerdinc; Michael Estep; Maria Stepanova; Arian Afendy; Emanuel Petricoin; Zobair Younossi; Vikas Chandhoke; Ancha Baranova
Journal:  PLoS One       Date:  2013-02-06       Impact factor: 3.240

7.  Knowledge-based compact disease models identify new molecular players contributing to early-stage Alzheimer's disease.

Authors:  Anatoly Mayburd; Ancha Baranova
Journal:  BMC Syst Biol       Date:  2013-11-07
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.