| Literature DB >> 35205498 |
Joseph Naiman1, Peter Xuekun Song1.
Abstract
Motivated by mobile devices that record data at a high frequency, we propose a new methodological framework for analyzing a semi-parametric regression model that allow us to study a nonlinear relationship between a scalar response and multiple functional predictors in the presence of scalar covariates. Utilizing functional principal component analysis (FPCA) and the least-squares kernel machine method (LSKM), we are able to substantially extend the framework of semi-parametric regression models of scalar responses on scalar predictors by allowing multiple functional predictors to enter the nonlinear model. Regularization is established for feature selection in the setting of reproducing kernel Hilbert spaces. Our method performs simultaneously model fitting and variable selection on functional features. For the implementation, we propose an effective algorithm to solve related optimization problems in that iterations take place between both linear mixed-effects models and a variable selection method (e.g., sparse group lasso). We show algorithmic convergence results and theoretical guarantees for the proposed methodology. We illustrate its performance through simulation experiments and an analysis of accelerometer data.Entities:
Keywords: functional predictor; functional principal component analysis; linear mixed-effects model; mobile device; sparse group regularization; wearable device data
Year: 2022 PMID: 35205498 PMCID: PMC8871497 DOI: 10.3390/e24020203
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Activity counts over 7 d from a tri-axis (X-, Y- and Z-axis) accelerometer of a subject.
Goodness-of-fit and the concordance regression for Scenario 2.
| Model |
|
| Reg of | ||
|---|---|---|---|---|---|
| Intercept | Slope |
| |||
|
| 0.830 | 2.00 | −0.062 | 1.01 | 0.848 |
|
| 0.937 | 1.99 | −0.055 | 1.01 | 0.972 |
|
| 0.928 | 2.00 | −0.051 | 1.01 | 0.955 |
|
| 0.835 | 2.01 | −0.062 | 1.01 | 0.856 |
|
| 0.935 | 1.99 | −0.056 | 1.01 | 0.970 |
|
| 0.911 | 1.99 | −0.049 | 1.01 | 0.937 |
| COSSO | 0.832 | – | – | – | – |
| LM + Lasso | 0.453 | – | – | – | – |
| LM + GLasso | 0.324 | – | – | – | – |
| LM + SGL | 0.450 | – | – | – | – |
| LM + MCP | 0.513 | – | – | – | – |
| LM + GMCP | 0.307 | – | – | – | – |
Sensitivity and specificity of functional selection for Scenario 2.
| Model | Selection Frequency | |||
|---|---|---|---|---|
|
|
|
|
| |
|
| 100 | 100 | 0 | 0 |
|
| 100 | 100 | 4 | 4 |
|
| 100 | 100 | 0 | 0 |
|
| 100 | 100 | 0 | 0 |
|
| 100 | 100 | 3 | 4 |
| COSSO | 100 | 100 | 5 | 6 |
| LM + Lasso | 100 | 100 | 19 | 21 |
| LM + GLasso | 94 | 99 | 7 | 8 |
| LM + SGL | 100 | 100 | 19 | 18 |
| LM + MCP | 100 | 100 | 20 | 19 |
| LM + GMCP | 93 | 99 | 7 | 8 |
FPC feature selection for signal functional in Scenario 2.
| Model | Selection Frequency | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
| 100 | 1 | 97 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
|
| 100 | 21 | 100 | 71 | 26 | 20 | 17 | 16 | 15 |
|
| 100 | 1 | 99 | 1 | 0 | 0 | 0 | 0 | 0 |
|
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| COSSO | 100 | 2 | 100 | 93 | 1 | 0 | 0 | 1 | 0 |
| LM + Lasso | 100 | 10 | 100 | 100 | 10 | 8 | 7 | 10 | 5 |
| LM + GLasso | 94 | 94 | 94 | 94 | 94 | 94 | 94 | 94 | 94 |
| LM + SGL | 100 | 12 | 100 | 100 | 10 | 8 | 8 | 11 | 5 |
| LM + MCP | 100 | 10 | 100 | 100 | 9 | 8 | 9 | 7 | 5 |
| LM + GMCP | 93 | 93 | 93 | 93 | 93 | 93 | 93 | 93 | 93 |
FPC feature selection for signal functional in Scenario 2.
| Model | Selection Frequency | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
| 0 | 3 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
|
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
|
| 16 | 100 | 14 | 7 | 16 | 23 | 100 | 15 | 7 |
|
| 0 | 11 | 0 | 0 | 0 | 1 | 100 | 0 | 0 |
|
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| COSSO | 8 | 97 | 5 | 5 | 5 | 15 | 100 | 3 | 3 |
| LM + Lasso | 17 | 100 | 14 | 7 | 16 | 23 | 100 | 15 | 6 |
| LM + GLasso | 99 | 99 | 99 | 99 | 99 | 99 | 99 | 99 | 99 |
| LM + SGL | 17 | 100 | 14 | 7 | 16 | 23 | 100 | 15 | 7 |
| LM + MCP | 17 | 100 | 13 | 6 | 16 | 23 | 100 | 15 | 8 |
| LM + GMCP | 99 | 99 | 99 | 99 | 99 | 99 | 99 | 99 | 99 |
Figure 2Five marginal estimates of important feature functions with 95% shaded confidence bands evaluated at 100 grid points while holding all other components equal to in Scenario 2.
Figure 3The 24 h minute-by-minute medians of 7 d ACs for one subject.
Goodness-of-fit for the five models used in the data analysis.
| Model | Adjusted |
|---|---|
| M0: LM | 0.07 |
| M1: LM + SGL | 0.13 |
| M2: LSKM | 0.18 |
| M3: | 0.30 |
| M4: COSSO | 0.14 |
Axis-specific FPC feature selection.
| Model | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||||||
| COSSO | ✓ | ✓ | ✓ | ||||||||||||
| LM + SGL | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||