| Literature DB >> 26473165 |
Elena Vildjiounaite1, Georgy Gimel'farb2, Vesa Kyllönen1, Johannes Peltola1.
Abstract
Intelligent computer applications need to adapt their behaviour to contexts and users, but conventional classifier adaptation methods require long data collection and/or training times. Therefore classifier adaptation is often performed as follows: at design time application developers define typical usage contexts and provide reasoning models for each of these contexts, and then at runtime an appropriate model is selected from available ones. Typically, definition of usage contexts and reasoning models heavily relies on domain knowledge. However, in practice many applications are used in so diverse situations that no developer can predict them all and collect for each situation adequate training and test databases. Such applications have to adapt to a new user or unknown context at runtime just from interaction with the user, preferably in fairly lightweight ways, that is, requiring limited user effort to collect training data and limited time of performing the adaptation. This paper analyses adaptation trends in several emerging domains and outlines promising ideas, proposed for making multimodal classifiers user-specific and context-specific without significant user efforts, detailed domain knowledge, and/or complete retraining of the classifiers. Based on this analysis, this paper identifies important application characteristics and presents guidelines to consider these characteristics in adaptation design.Entities:
Year: 2015 PMID: 26473165 PMCID: PMC4581555 DOI: 10.1155/2015/434826
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Modelling situation-dependency scenario; dashed lines denote optional data.
Figure 2Lightweight adaptation approaches.
Characteristics of the reviewed adaptation approaches. Notations: X: yes; A: depending on a chosen algorithm; for example, training data can be used instead of assumptions about new context and unlabelled data are used only as negative examples; MK: multiclass classification problems only; U: data are vectors of user preferences; base classifier: either ensemble member or the only context-specific model.
| Method name | Applicability | Training | Additional data | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Suits dissimilar contexts | Suits large varieties of problems/algorithms | Most lightweight | Requires little assumptions about new contexts | For training of base classifier(s) | For classifier selection/combination | Uses contextual data | Uses raw primary data of other contexts | Uses models for other contexts | Uses unlabelled data for the target context | |
| Model selection | ||||||||||
| Contextual weighting | X | X | A | A | X | X | A | |||
| Optimising utility function | X | A | A | A | A | |||||
| Tuning classifiers for small datasets | X | X | X | A | ||||||
| Cascaded training | X | X | X | X | X | |||||
| Learning context-specific relations between classifier outputs | X | MK | X | X | X | A | ||||
| Optimising model parameters with evolutionary algorithms | X | X | X | X | X | A | X | |||
| Optimising model parameters with gradient descent | A | X | A | X | A | X | ||||
| Algorithm-specific methods to shift a decision boundary | A | X | X | A | X | A | ||||
| Adapting only selected parameters | X | X | X | X | X | A | X | |||
| Error weighting | X | X | X | X | A | |||||
| The use of model parameters as training data | A | X | X | |||||||
| Vector modification | X | U | X | X | X | X | A | |||
| Modifying a similarity measure | X | U | A | A | X | A | A | A | ||
| Target context-specific combinations of cues, obtained in other contexts | X | U | A | X | X | A | A | A | ||
|
| ||||||||||
| Ensembles | ||||||||||
| Factor ensembles | X | X | X | X | A | |||||
| Diversity-based ensembles | X | X | X | X | A | |||||
| Optimising a pool of classifiers, trained on data for several contexts | X | X | X | X | X | |||||
| Ensemble of generalisers | X | X | X | X | ||||||
| Knowledge transfer ensembles | X | X | X | X | X | A | X | A | A | |
| Stacked ensembles | X | X | X | X | ||||||
| Dynamic selection of base classifiers | X | X | A | X | X | A | A | |||
| Sample-selecting ensembles | X | X | A | X | X | A | A | |||
|
| ||||||||||
| Context as a feature | ||||||||||
| Embedding contextual parameters as additional nodes into graphical models | X | X | X | X | A | |||||
| Using historical contexts as nodes in graphical models | X | X | A | X | A | |||||
| Using contextual parameters as input features | X | X | X | X | X | X | X | A | ||
| Including contextual similarity into a distance measure | X | X | X | X | X | A | ||||
Figure 3Lightweight adaptation summary (Section 4).
Figure 4Recommended adaptation and data usage types.