Literature DB >> 25285052

Algorithms for Discovery of Multiple Markov Boundaries.

Alexander Statnikov1, Nikita I Lytkin1, Jan Lemeire2, Constantin F Aliferis3.   

Abstract

Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.

Entities:  

Keywords:  Markov boundary discovery; information equivalence; variable/feature selection; violations of faithfulness

Year:  2013        PMID: 25285052      PMCID: PMC4184048     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  19 in total

1.  Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions.

Authors:  R L Somorjai; B Dolenko; R Baumgartner
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

2.  High-resolution serum proteomic features for ovarian cancer detection.

Authors:  T P Conrads; V A Fusaro; S Ross; D Johann; V Rajapakse; B A Hitt; S M Steinberg; E C Kohn; D A Fishman; G Whitely; J C Barrett; L A Liotta; E F Petricoin; T D Veenstra
Journal:  Endocr Relat Cancer       Date:  2004-06       Impact factor: 5.678

3.  Outcome signature genes in breast cancer: is there a unique set?

Authors:  Liat Ein-Dor; Itai Kela; Gad Getz; David Givol; Eytan Domany
Journal:  Bioinformatics       Date:  2004-08-12       Impact factor: 6.937

4.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis.

Authors:  Alexander Statnikov; Constantin F Aliferis; Ioannis Tsamardinos; Douglas Hardin; Shawn Levy
Journal:  Bioinformatics       Date:  2004-09-16       Impact factor: 6.937

5.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

6.  Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer.

Authors:  Liat Ein-Dor; Or Zuk; Eytan Domany
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-03       Impact factor: 11.205

7.  A study in causal discovery from population-based infant birth and death records.

Authors:  S Mani; G F Cooper
Journal:  Proc AMIA Symp       Date:  1999

8.  Analysis and computational dissection of molecular signature multiplicity.

Authors:  Alexander Statnikov; Constantin F Aliferis
Journal:  PLoS Comput Biol       Date:  2010-05-20       Impact factor: 4.475

9.  Multiple robust signatures for detecting lymph node metastasis in head and neck cancer.

Authors:  Paul Roepman; Patrick Kemmeren; Lodewijk F A Wessels; Piet J Slootweg; Frank C P Holstege
Journal:  Cancer Res       Date:  2006-02-15       Impact factor: 12.701

10.  On the number of close-to-optimal feature sets.

Authors:  Edward R Dougherty; Marcel Brun
Journal:  Cancer Inform       Date:  2007-02-16
View more
  9 in total

1.  Multiple predictively equivalent risk models for handling missing data at time of prediction: With an application in severe hypoglycemia risk prediction for type 2 diabetes.

Authors:  Sisi Ma; Pamela J Schreiner; Elizabeth R Seaquist; Mehmet Ugurbil; Rachel Zmora; Lisa S Chow
Journal:  J Biomed Inform       Date:  2020-01-28       Impact factor: 6.317

2.  Markov Boundary Discovery with Ridge Regularized Linear Models.

Authors:  Eric V Strobl; Shyam Visweswaran
Journal:  J Causal Inference       Date:  2015-11-03

3.  Just Add Data: automated predictive modeling for knowledge discovery and feature selection.

Authors:  Ioannis Tsamardinos; Paulos Charonyktakis; Georgios Papoutsoglou; Giorgos Borboudakis; Kleanthi Lakiotaki; Jean Claude Zenklusen; Hartmut Juhl; Ekaterini Chatzaki; Vincenzo Lagani
Journal:  NPJ Precis Oncol       Date:  2022-06-16

4.  Extending greedy feature selection algorithms to multiple solutions.

Authors:  Giorgos Borboudakis; Ioannis Tsamardinos
Journal:  Data Min Knowl Discov       Date:  2021-05-01       Impact factor: 3.670

5.  All Models are Wrong, but Many are Useful: Learning a Variable's Importance by Studying an Entire Class of Prediction Models Simultaneously.

Authors:  Aaron Fisher; Cynthia Rudin; Francesca Dominici
Journal:  J Mach Learn Res       Date:  2019       Impact factor: 5.177

6.  Bridging a translational gap: using machine learning to improve the prediction of PTSD.

Authors:  Karen-Inge Karstoft; Isaac R Galatzer-Levy; Alexander Statnikov; Zhiguo Li; Arieh Y Shalev
Journal:  BMC Psychiatry       Date:  2015-03-16       Impact factor: 3.630

Review 7.  A survey about methods dedicated to epistasis detection.

Authors:  Clément Niel; Christine Sinoquet; Christian Dina; Ghislain Rocheleau
Journal:  Front Genet       Date:  2015-09-10       Impact factor: 4.599

8.  An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation.

Authors:  Sisi Ma; Patrick Kemmeren; Constantin F Aliferis; Alexander Statnikov
Journal:  Sci Rep       Date:  2016-03-04       Impact factor: 4.379

9.  Computational causal discovery for post-traumatic stress in police officers.

Authors:  Glenn N Saxe; Sisi Ma; Leah J Morales; Isaac R Galatzer-Levy; Constantin Aliferis; Charles R Marmar
Journal:  Transl Psychiatry       Date:  2020-08-11       Impact factor: 6.222

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.