Literature DB >> 16999578

On the consistency of Bayesian variable selection for high dimensional binary regression and classification.

Wenxin Jiang1.   

Abstract

Modern data mining and bioinformatics have presented an important playground for statistical learning techniques, where the number of input variables is possibly much larger than the sample size of the training data. In supervised learning, logistic regression or probit regression can be used to model a binary output and form perceptron classification rules based on Bayesian inference. We use a prior to select a limited number of candidate variables to enter the model, applying a popular method with selection indicators. We show that this approach can induce posterior estimates of the regression functions that are consistently estimating the truth, if the true regression model is sparse in the sense that the aggregated size of the regression coefficients are bounded. The estimated regression functions therefore can also produce consistent classifiers that are asymptotically optimal for predicting future binary outputs. These provide theoretical justifications for some recent empirical successes in microarray data analysis.

Mesh:

Year:  2006        PMID: 16999578     DOI: 10.1162/neco.2006.18.11.2762

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  2 in total

1.  Bayesian Methods for High Dimensional Linear Models.

Authors:  Himel Mallick; Nengjun Yi
Journal:  J Biom Biostat       Date:  2013-06-01

2.  Bayesian detection of causal rare variants under posterior consistency.

Authors:  Faming Liang; Momiao Xiong
Journal:  PLoS One       Date:  2013-07-26       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.