Literature DB >> 35910400

Prediction and outlier detection in classification problems.

Leying Guan1, Robert Tibshirani2.   

Abstract

We consider the multi-class classification problem when the training data and the out-of-sample test data may have different distributions and propose a method called BCOPS (balanced and conformal optimized prediction sets). BCOPS constructs a prediction set C(x) as a subset of class labels, possibly empty. It tries to optimize the out-of-sample performance, aiming to include the correct class and to detect outliers x as often as possible. BCOPS returns no prediction (corresponding to C(x) equal to the empty set) if it infers x to be an outlier. The proposed method combines supervised learning algorithms with conformal prediction to minimize a misclassification loss averaged over the out-of-sample distribution. The constructed prediction sets have a finite sample coverage guarantee without distributional assumptions. We also propose a method to estimate the outlier detection rate of a given procedure. We prove asymptotic consistency and optimality of our proposals under suitable assumptions and illustrate our methods on real data examples.
© 2022 The Authors. Journal of the Royal Statistical Society: Series B (Statistical Methodology) published by John Wiley & Sons Ltd on behalf of Royal Statistical Society.

Entities:  

Keywords:  BCOPS; conformal inference; distributional change; label shift; set‐valued prediction

Year:  2022        PMID: 35910400      PMCID: PMC9305480          DOI: 10.1111/rssb.12443

Source DB:  PubMed          Journal:  J R Stat Soc Series B Stat Methodol        ISSN: 1369-7412            Impact factor:   4.933


  1 in total

1.  Distribution Free Prediction Sets.

Authors:  Jing Lei; James Robins; Larry Wasserman
Journal:  J Am Stat Assoc       Date:  2013       Impact factor: 5.033

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.