| Literature DB >> 34203674 |
Abstract
While smartphone addiction is becoming a recent concern with the exponential increase in the number of smartphone users, it is difficult to predict problematic smartphone users based on the usage characteristics of individual smartphone users. This study aimed to explore the possibility of predicting smartphone addiction level with mobile phone log data. By Korea Internet and Security Agency (KISA), 29,712 respondents completed the Smartphone Addiction Scale developed in 2017. Integrating basic personal characteristics and smartphone usage information, the data were analyzed using machine learning techniques (decision tree, random forest, and Xgboost) in addition to hypothesis tests. In total, 27 variables were employed to predict smartphone addiction and the accuracy rate was the highest for the random forest (82.59%) model and the lowest for the decision tree model (74.56%). The results showed that users' general information, such as age group, job classification, and sex did not contribute much to predicting their smartphone addiction level. The study can provide directions for future work on the detection of smartphone addiction with log-data, which suggests that more detailed smartphone's log-data will enable more accurate results.Entities:
Keywords: machine learning; predictor; problematic smartphone use; smartphone addiction
Year: 2021 PMID: 34203674 PMCID: PMC8296286 DOI: 10.3390/ijerph18126458
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The decision Tree example.
Variables.
| Variables | Contents | |
|---|---|---|
| Independent variables | ● Age | |
| ● Sex | ||
| ● Occupation | ||
| ● Smartphone usage level by content | News | |
| Business | ||
| Education | ||
| Product/service Traffic | ||
| Web document | ||
| Game | ||
| Adult content | ||
| Movie/TV/video Music | ||
| Web novel | ||
| Sports betting | ||
| Messenger | ||
| SNS | ||
| Product purchase | ||
| Product selling | ||
| Finance | ||
| Life management | ||
| Game | ||
| Movie/TV/video | ||
| e-book, web-toon, web-fiction | ||
| ● Monthly expenditure on each content | ||
| ● Number of times of use on weekdays | ||
| ● Number of times of use on weekends | ||
| Target variables | Addiction type (1: High risk, 2: Normal) | |
The number of participants for each addiction type.
| Population | ||
|---|---|---|
| High Risk | Normal | |
| 10s | 183 | 4961 |
| 20s | 136 | 3548 |
| 30s | 139 | 5225 |
Grid Search Parameters.
| Method | Hyper-Parameters | ||
|---|---|---|---|
| Decision tree | Gini depth | Entropy depth | |
| 2, 3, 4, 5 | 2, 3, 4, 5 | ||
| Random forest | Sample split | Estimators | |
| 4, 5 | 100, 500 | ||
| Xgboost | Base score | Depth | Estimators |
| 0.5, 0.55, 0.6, 0.65, 0.7 | 2, 3, 4 | 10, 50, 70, 100 | |
Figure 2Users’ General Characteristics.
Smartphone Usage Level for Each Usage Content.
| 19 Contents, 7-Point Likert Scale | Normal Group | Risk Group | t-Test | |||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Normal vs. Risk Groups | ||
| t-Value | ||||||
| News | 3.34 | 2.480 | 4.18 | 1.917 | −11.880 | |
| Office search | 2.44 | 2.385 | 2.25 | 2.555 | 2.096 | |
| Education | 1.47 | 2.116 | 2.29 | 2.549 | −8.828 | |
| Product/service search | 2.44 | 2.366 | 3.58 | 1.919 | −15.977 | |
| Transportation | 2.30 | 2.390 | 3.24 | 2.416 | −10.545 | |
| Web document | 3.08 | 2.208 | 3.07 | 2.608 | 0.153 | |
| Game | 3.22 | 2.176 | 4.59 | 1.979 | −18.714 | |
| Adult content | 0.28 | 1.018 | 0.53 | 1.354 | −5.072 | |
| Movie/TV/video | 2.78 | 2.358 | 3.49 | 2.517 | −7.748 | |
| Music | 2.99 | 2.467 | 4.20 | 2.306 | −14.277 | |
| Web novel | 1.45 | 2.124 | 3.10 | 2.161 | −20.819 | |
| Sports betting | 0.13 | 0.691 | 0.19 | 0.866 | −1.819 | |
| 1.61 | 2.157 | 2.82 | 2.366 | −13.904 | ||
| Messenger | 5.00 | 1.798 | 5.43 | 1.523 | −7.650 | |
| SNS | 3.31 | 2.429 | 4.19 | 2.389 | −10.076 | |
| Product/service purchase | 2.00 | 2.319 | 2.67 | 2.485 | −7.312 | |
| Product/service selling | 0.56 | 1.431 | 1.08 | 1.959 | −7.206 | |
| Finance | 1.74 | 2.290 | 2.20 | 2.495 | −5.071 | |
| Daily management | 1.21 | 1.973 | 1.61 | 2.177 | −5.108 | |
Smartphone Usage Characteristics.
| Normal Group | Risk Group ( | t-Test | ||||
|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Normal vs. Risk | ||
| t-Value | ||||||
|
| ||||||
| Game | 1299.94 | 3815.277 | 2586.30 | 5090.486 | −6.911 | |
| Movie/TV/video | 1032.54 | 3239.524 | 1694.33 | 4120.399 | −4.390 | |
| e-books/cartoons | 487.93 | 1858.009 | 1181.82 | 2740.036 | −6.935 | |
|
| ||||||
| Number of uses on weekdays | 23.41 | 27.739 | 56.74 | 55.516 | −16.491 | |
| Number of uses on weekends | 35.60 | 44.275 | 85.44 | 81.874 | −16.707 | |
Accuracy and Recall Value for Each Model.
| Accuracy (%) | |||
|---|---|---|---|
| Decision tree | Random forest | Xgboost | |
| 10s | 75.45 | 86.36 | 84.45 |
| 20s | 76.82 | 80.48 | 80.48 |
| 30s | 71.42 | 80.95 | 77.38 |
| Average | 74.56 | 82.59 | 80.77 |
| Recall value | |||
| High recall | Normal recall | ||
| Decision tree | |||
| 10s | 0.80 | 0.71 | |
| 20s | 0.84 | 0.68 | |
| 30s | 0.79 | 0.64 | |
| Average | 0.81 | 0.67 | |
| Random forest | |||
| 10s | 0.83 | 0.89 | |
| 20s | 0.77 | 0.84 | |
| 30s | 0.81 | 0.81 | |
| Average | 0.80 | 0.85 | |
| Xgboost | |||
| 10s | 0.83 | 0.86 | |
| 20s | 0.77 | 0.84 | |
| 30s | 0.83 | 0.71 | |
| Average | 0.81 | 0.80 | |
Accuracy Difference.
| Accuracy (%) | |||
|---|---|---|---|
| Decision tree | |||
| Usage and personal information | Usage only | Difference | |
| 10s | 75.45 | 68.18 | −7.27 |
| 20s | 76.82 | 71.95 | −4.87 |
| 30s | 71.42 | 71.42 | 0.00 |
| Average | 74.56 | 70.51 | −4.05 |
| Random forest | |||
| Usage and personal information | Usage only | Difference | |
| 10s | 86.36 | 84.45 | −1.91 |
| 20s | 80.48 | 79.26 | −1.22 |
| 30s | 80.95 | 77.28 | −3.67 |
| Average | 82.59 | 80.33 | −2.26 |
| Xgboost | |||
| Usage and personal information | Usage only | Difference | |
| 10s | 84.45 | 82.72 | −1.73 |
| 20s | 80.48 | 80.48 | 0.00 |
| 30s | 77.38 | 75.00 | −2.38 |
| Average | 80.77 | 79.40 | −1.37 |
Prediction based on Users’ General Characteristics.
| Prediction Based on Users’ General Characteristics | Accuracy (%) |
|---|---|
| Based on age group | |
| 10s | 91.10 |
| 20s | 56.48 |
| 30s | 48.21 |
| Average | 65.26 |
| Based on employment status (whether employed) | |
| No | 72.87 |
| Yes | 93.75 |
| Average | 85.08 |
| Based on sex | |
| Male | 57.34 |
| Female | 56.12 |
| Average | 56.73 |