| Literature DB >> 33260851 |
Giulio Gabrieli1, Jan Paolo Macapinlac Balagtas1, Gianluca Esposito1,2,3, Peipei Setoh1.
Abstract
Fixation time measures have been widely adopted in studies with infants and young children because they can successfully tap on their meaningful nonverbal behaviors. While recording preverbal children's behavior is relatively simple, analysis of collected signals requires extensive manual preprocessing. In this paper, we investigate the possibility of using different Machine Learning (ML)-a Linear SVC, a Non-Linear SVC, and K-Neighbors-classifiers to automatically discriminate between Usable and Unusable eye fixation recordings. Results of our models show an accuracy of up to the 80%, suggesting that ML tools can help human researchers during the preprocessing and labelling phase of collected data.Entities:
Keywords: eye-tracking; machine learning; signal quality
Year: 2020 PMID: 33260851 PMCID: PMC7731361 DOI: 10.3390/s20236775
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Summary of the steps employed for data augmentation, training, and hypertuning of the models.
Participants’ demographic information and number of usable and unusable samples per Study included in this work.
| Study | Participants | Age (Months) | Total Number of Samples | ||
|---|---|---|---|---|---|
| Study 1 | 48 | 24.6 ± 4.5 | 71 | 26 | 97 |
| Study 2 | 32 | 26.9 ± 4.9 | 60 | 17 | 77 |
| Study 3 | 32 | 26.7 ± 6,2 | 69 | 8 | 77 |
Usable and Unusable Samples in Training and Test Score by Experiment.
| Study | Training Set | Test Set | |||
|---|---|---|---|---|---|
| Usable | Unusable | Unusable after AGWN | Usable | Unusable | |
| Study 1 | 57 | 21 | 84 | 14 | 5 |
| Study 2 | 47 | 15 | 13 | 13 | 2 |
| Study 3 | 52 | 8 | 32 | 17 | 0 |
Model’s scores (Accuracy, Precision, Recall, F1, and MCC) by type of classifier and test set.
| Classifier | Train_MCC | Train_Acc. | Accuracy | Precision | Recall | F1 | MCC |
|---|---|---|---|---|---|---|---|
| Linear SVC | 0.207 | 0.602 | 0.576 | 0.958 | 0.523 | 0.676 | 0.262 |
| Non-Linear SVC | 0.491 | 0.810 | 0.706 | 1.00 | 0.659 | 0.795 | 0.492 |
| K-neighbors | 0.533 | 0.810 | 0.803 | 0.972 | 0.795 | 0.875 | 0.493 |
Best hyper-parameters for the trained models.
| Classifier | Best Parameters |
|---|---|
| Linear SVC | ‘C’: 30, ‘random_state’: 1, ‘tol’: 0.1 |
| Non-Linear SVC | ‘C’: 40, ‘decision_function_shape’: ‘ovo’, ‘gamma’: ’scale’, ‘kernel’: ‘rbf’, ‘random_state’: 0, ‘tol’: 0.1 |
| K-Neighbors | ‘leaf_size’: 1, ‘n_neighbors’: 8 |
kNN features predictive weight (Importance Mean) obtained through permutation feature importance.
| Repetition 1 | Repetition 2 | Repetition 3 |
|---|---|---|
| −0.0151 | 0 | 0.1265 |
SVM features’ predictive weights.
| Repetition 1 | Repetition 2 | Repetition 3 |
|---|---|---|
| −0.05269684 | 0.00185775 | 0.26624044 |