| Literature DB >> 36159919 |
Daniel Gallucci1, Ernest C Y Ho1, Joseph Geraci1,2,3, Joseph Loren1, Luca Pani4,5.
Abstract
Nurosene's NURO app (nurosene.com) is an innovative smartphone application that gathers and analyzes active self-report metrics from users, empowering them with data-driven health machine intelligence. We present the data collected and analyzed from the initial round of participants who responded to a 12-question survey on their life-style and health status. Exploratory results using a variational autoencoder (VAE) suggested that much of the variability of the 12 dimensional data could be accounted for by two approximately uncorrelated latent variables: one pertaining to stress and sleep, and the other pertaining to exercise and diet. Subsequent modeling of the data using exploratory and confirmatory factor analyses (EFAs and CFAs) found that optimal data fits consisted of four factors, namely exercise, diet, stress, and sleep. Covariance values were high between exercise and diet, and between stress and sleep, but much lower between other pairings of non-identical factors. Both EFAs and CFAs provided extra contexts to and quantified the more preliminary VAE observations. Overall, our results significantly reduce the apparent complexity of the response data. This reduction allows for more efficient future stratification and analyses of participants based on simpler latent variables. Our discovery of novel relationships between stress and sleep, and between exercise and diet suggests the possibility of applying predictive analytics in future efforts.Entities:
Keywords: confirmatory factor analyses; digital health technology; exploratory factor analyses; machine learning; mental and physical wellbeing; smartphone application; survey questionnaire; variational auto encoder
Year: 2022 PMID: 36159919 PMCID: PMC9494943 DOI: 10.3389/fpsyt.2022.945780
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 5.435
The 12 questions on the questionnaire.
|
|
|
|
|---|---|---|
| 1 | Exercise | How much cardiovascular and/or aerobic exercise do you get on a weekly basis? (e.g., running, bike riding, swimming) |
| 2 | Exercise | How much anaerobic exercise do you get on a weekly basis? (e.g., weight training, circuit training) |
| 3 | Exercise | How often would you consider yourself to be cognitively engaged and/or stimulated? |
| 4 | Stress | How often are you experiencing physical stress? (e.g., injury, chronic pain, gut distress) |
| 5 | Stress | How often are you experiencing psychological stress? (e.g., loneliness, trauma, conflicts) |
| 6 | Stress | How often are you experiencing social stress? (e.g., stress with others and life events in general) |
| 7 | Diet | How often are you eating what you consider to be a “healthy” diet? |
| 8 | Diet | How often do you consume nutritional supplements and/or compounds? |
| 9 | Diet | How often do you experience gastrointestinal issues after eating? (e.g., bloating, gas, indigestion) |
| 10 | Sleep | How often do you have difficulty falling asleep? |
| 11 | Sleep | How often do you have difficulty staying asleep? |
| 12 | Sleep | How often do you have difficulty waking up in the morning? |
Scoring scheme.
|
|
|
|
|
|
|---|---|---|---|---|
| 1 | 5+ days | 3–4 days | 1–2 days | None |
| 2 | 5+ days | 3–4 days | 1–2 days | None |
| 3 | Always | Often | Occasionally | Rarely/never |
| 4 | Rarely/Never | Occasionally | Often | Always |
| 5 | Rarely/Never | Occasionally | Often | Always |
| 6 | Rarely/Never | Occasionally | Often | Always |
| 7 | Daily | Often | Occasionally | Rarely/never |
| 8 | Daily | Often | Occasionally | Rarely/never |
| 9 | Rarely/Never | Occasionally | Often | Daily |
| 10 | Rarely/Never | Occasionally | Often | Daily |
| 11 | Rarely/Never | Occasionally | Often | Daily |
| 12 | Rarely/Never | Occasionally | Often | Daily |
Except the first column, the name of each column (“4”, “3”, “2”, “1”) denotes the numeric score used for each question for analyses. Each entry of a respective column is the actual response choice of the question corresponding to the converted numeric score. The scheme is so designed that a higher score indicates superior performance of respondents on the particular survey question.
Figure 1An instance of latent variables of a random subset of data on the trained VAE. The VAE was trained with this very subset of data. Each of the 12 sub-figures is color coded with the responses of one of 12 questions (Q1-Q12).
Figure 2Path diagram of the two-factor CFA model. *Besides a value means statistical significance. 1-star denotes p < 0.05, 2-star denotes p < 0.01, 3-star denotes p < 0.001.
Basic statistical quantities of the responses for each question item.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean | 1.95 | 1.76 | 2.26 | 2.81 | 2.66 | 2.78 | 2.28 | 2.17 | 3.06 | 2.87 | 3.02 | 2.87 |
| SD | 0.96 | 0.93 | 0.91 | 0.97 | 0.97 | 0.94 | 0.96 | 1.12 | 0.92 | 1.00 | 0.97 | 1.07 |
| Skewness | 0.71 | 0.96 | 0.26 | −0.32 | −0.16 | −0.35 | 0.25 | 0.46 | −0.69 | −0.43 | −0.60 | −0.45 |
| Kurtosis | 2.49 | 2.80 | 2.25 | 2.08 | 2.02 | 2.24 | 2.10 | 1.82 | 2.59 | 2.07 | 2.27 | 1.92 |
SD stands for standard deviation.
Results of exploratory factor analysis on the same randomly selected subset of data used for VAE.
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
| |
| 1 | 0.04 |
|
| −0.07 | 0.11 | −0.03 | −0.01 | 0.01 |
|
| 2 | 0.11 |
|
| 0.07 | 0.04 | 0.14 | −0.03 | 0.30 |
|
| 3 | −0.02 |
|
| 0.09 | −0.12 | 0.12 | −0.14 |
| 0.25 |
| 4 |
| 0.03 | 0.02 |
| 0.22 |
| 0.19 | −0.02 | 0.02 |
| 5 |
| 0.04 | 0.00 |
| 0.04 |
| −0.04 | −0.00 | −0.02 |
| 6 |
| 0.02 | −0.01 |
| 0.03 |
| 0.02 | −0.02 | −0.02 |
| 7 | −0.08 |
|
| −0.05 | − 0.05 | −0.03 | 0.05 |
| −0.01 |
| 8 | −0.14 |
|
| 0.01 | − 0.18 | 0.03 | −0.13 |
| 0.01 |
| 9 |
| -0.09 | −0.07 | 0.30 |
| 0.31 |
| −0.06 | −0.03 |
| 10 |
| −0.04 | 0.02 | − 0.01 |
| −0.01 |
| 0.05 | −0.02 |
| 11 |
| −0.12 | −0.08 | 0.01 |
| 0.05 |
| −0.16 | 0.07 |
| 12 |
| 0.06 | 0.08 | 0.25 |
| 0.28 |
| 0.00 | 0.08 |
| Var. exp. | 23% | 16% | 16% | 15% | 12% | 16% | 12% | 11% | 11% |
| BIC | −83 | −104 | −92 | ||||||
| TLI | 0.77 | 0.88 | 0.95 | ||||||
| RMSEA | 0.105 | 0.075 | 0.051 | ||||||
Var. exp. denotes the percentage of variance explained by individual factors. TLI stands for Tucker-Lewis index. RMSEA is root mean square error of approximation. BIC stands for Schwarz-Bayes Information Criterion (25). Bold values indicate the largest factor loading in absolute magnitude of the item represented by the row in each EFA model.
Figure 3Path diagram of the three-factor CFA model. Star conventions as in Figure 2.
Figure 4Path diagram of the first four-factor CFA model (Model A). Star conventions as in Figure 2.
Figure 5Path diagram of the second four-factor CFA model (Model B). Star conventions as in Figure 2.
Summary of CFA model fitting statistics.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Two-factor ( | 53 | 40,671 | 40,800 | 0.83 | 0.098 (0.092, 0.105) | 0.061 | 36% |
| Three-factor ( | 51 | 40,431 | 40,571 | 0.89 | 0.080 (0.074, 0.087) | 0.054 | 41% |
| Four-factor A ( | 48 | 40,252 | 40,408 | 0.94 | 0.063 (0.056, 0.070) | 0.047 | 45% |
| Four-factor B ( | 44 | 40,133 | 40,309 | 0.97 | 0.047 (0.039, 0.054) | 0.034 | 47% |
DoF stands for degrees of freedom, AIC (BIC) stands for Akaike (Schwarz-Bayes) Information Criterion (25, 33), CFI stands for comparative fit index, CI stands for confidence interval, RMSEA is the root mean square error of approximation and SRMR is the standard root mean square residual.