
Unobserved classes and extra variables in high-dimensional discriminant analysis.

Michael Fop, Pierre-Alexandre Mattei, Charles Bouveyron, Thomas Brendan Murphy.

Abstract

In supervised classification problems, the test set may contain data points belonging to classes not observed in the learning phase. Moreover, the units in the test data may be measured on a set of additional variables recorded after the learning sample was collected. In this situation, the classifier built in the learning phase needs to adapt to handle potential unknown classes and the extra dimensions. We introduce a model-based discriminant approach, Dimension-Adaptive Mixture Discriminant Analysis (D-AMDA), which can detect unobserved classes and adapt to the increasing dimensionality. Model estimation is carried out via a fully inductive approach based on an EM algorithm. The method is then embedded in a more general framework for adaptive variable selection and classification suitable for high-dimensional data. A simulation study and an artificial experiment on the classification of adulterated honey samples are used to validate the ability of the proposed framework to deal with complex situations.
© The Author(s) 2021.
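
The inductive idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' D-AMDA implementation: it ignores the extra variables, variable selection and model selection entirely, and only shows one plausible reading of an inductive EM step, in which Gaussian parameters learned from labelled data are held fixed while the mixing proportions and a single additional component are estimated on unlabelled test data. All function names, the initialisation rule and the synthetic data are assumptions made for illustration.

```python
import numpy as np

def log_gauss(X, mean, cov):
    """Log-density of a multivariate Gaussian at each row of X."""
    d = X.shape[1]
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, (X - mean).T)      # shape (d, n)
    maha = np.sum(sol ** 2, axis=0)                # squared Mahalanobis distances
    logdet = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (d * np.log(2 * np.pi) + logdet + maha)

def fit_known_classes(X, y):
    """Learning phase: mean and covariance of each observed class."""
    return {c: (X[y == c].mean(axis=0), np.cov(X[y == c], rowvar=False))
            for c in np.unique(y)}

def discover_extra_class(X_test, known, n_iter=50):
    """Discovery phase (hypothetical helper): EM on the test data with the
    known-class parameters frozen; only the proportions and one new
    Gaussian component are updated."""
    labels = list(known)
    n, d = X_test.shape
    # Known-class log-densities never change, so compute them once.
    logp_known = np.column_stack([log_gauss(X_test, *known[c]) for c in labels])
    # Assumed initialisation: start the new component at the test point
    # that the known classes explain worst.
    new_mean = X_test[np.argmin(logp_known.max(axis=1))].copy()
    new_cov = np.cov(X_test, rowvar=False)
    props = np.full(len(labels) + 1, 1.0 / (len(labels) + 1))
    for _ in range(n_iter):
        # E-step: posterior membership probabilities.
        logp = np.column_stack([logp_known, log_gauss(X_test, new_mean, new_cov)])
        logp = logp + np.log(np.clip(props, 1e-12, None))
        logp -= logp.max(axis=1, keepdims=True)
        resp = np.exp(logp)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update proportions and the new component only.
        props = resp.mean(axis=0)
        w = resp[:, -1]
        w_sum = w.sum() + 1e-12
        new_mean = (w[:, None] * X_test).sum(axis=0) / w_sum
        diff = X_test - new_mean
        new_cov = (w[:, None] * diff).T @ diff / w_sum + 1e-6 * np.eye(d)
    return props, new_mean, new_cov, resp.argmax(axis=1)

# Synthetic illustration: two labelled classes, one class seen only at test time.
rng = np.random.default_rng(0)
centres = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 6.0]])
X_train = np.vstack([rng.normal(centres[k], 1.0, size=(100, 2)) for k in (0, 1)])
y_train = np.repeat([0, 1], 100)
X_test = np.vstack([rng.normal(centres[k], 1.0, size=(60, 2)) for k in (0, 1, 2)])

known = fit_known_classes(X_train, y_train)
props, new_mean, new_cov, assigned = discover_extra_class(X_test, known)
print("estimated proportions:", np.round(props, 2))
print("estimated mean of the new component:", np.round(new_mean, 2))
```

In this sketch the last entry of `props` and the label `2` in `assigned` correspond to the candidate unobserved class. A fuller treatment would also compare models with and without the extra component before accepting it, which is omitted here.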


Keywords:  Adaptive supervised classification; Conditional estimation; Model-based discriminant analysis; Unobserved classes; Variable selection

Year: 2022; PMID: 35308632; PMCID: PMC8924148; DOI: 10.1007/s11634-021-00474-3

Source DB: PubMed; Journal: Adv Data Anal Classif; ISSN: 1862-5355


