Polina Panicheva, Larisa Mararitsa, Semen Sorokin, Olessia Koltsova, Paolo Rosso.
Abstract
Despite recent achievements in predicting personality traits and some other human psychological features from digital traces, prediction of subjective well-being (SWB) appears to be a relatively new task with few solutions. The COVID-19 pandemic has added both a stronger need for rapid SWB screening and new opportunities for it, with online mental health applications gaining popularity and accumulating large and diverse user data. Nevertheless, only a few existing works have aimed at predicting SWB, and they have done so only in terms of Diener's Satisfaction with Life Scale (SWLS). None of them analyzes the scale developed by the World Health Organization, known as WHO-5, a widely accepted tool for screening mental well-being and, specifically, for depression risk detection. Moreover, existing research is limited to English-speaking populations and tends to use text, network and app usage data separately. In the current work, we close these gaps by predicting both SWB scales on a sample of Russian mental health app users, who represent a population at high risk of mental health problems. In doing so, we employ a unique combination of phone application usage data with private messaging and networking digital traces from VKontakte, the most popular social media platform in Russia. As a result, we predict Diener's SWB scale with state-of-the-art quality, introduce the first predictive models for WHO-5 with similar quality, and reach high accuracy in the prediction of clinically meaningful classes of the latter scale. Moreover, our feature analysis sheds light on the interrelated nature of the two studied scales: both are characterized by negative sentiment expressed in text messages and by phone application usage in the morning hours, confirming some previous findings on subjective well-being manifestations.
At the same time, SWB measured by Diener's scale is reflected mostly in lexical features referring to social and affective interactions, while mental well-being is characterized by objective features that reflect physiological functioning, circadian rhythms and somatic conditions, thus saliently demonstrating the underlying theoretical differences between the two scales.
Keywords: Digital traces; Mental health prediction; Subjective well-being
Year: 2022 PMID: 35402139 PMCID: PMC8978494 DOI: 10.1140/epjds/s13688-022-00333-x
Source DB: PubMed Journal: EPJ Data Sci ISSN: 2193-1127 Impact factor: 3.184
Specificity and sensitivity of the selected WHO-5 cutoff values in the mental health dataset
| Condition | N | Metric | Binary cutoff (0.51) | Lower trinary cutoff (0.35) | Upper trinary cutoff (0.59) |
|---|---|---|---|---|---|
| Depression | 344 | Sensitivity | 0.80 | 0.49 | 0.90 |
| Depression | 344 | Specificity | 0.58 | 0.87 | 0.45 |
| Anxiety | 309 | Sensitivity | 0.82 | 0.53 | 0.92 |
| Anxiety | 309 | Specificity | 0.54 | 0.83 | 0.41 |
| Stress | 323 | Sensitivity | 0.85 | 0.47 | 0.93 |
| Stress | 323 | Specificity | 0.66 | 0.88 | 0.50 |
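The sensitivity and specificity columns can be illustrated with a minimal sketch. It assumes that a respondent screens positive when the normalized WHO-5 score falls below the cutoff, which is consistent with sensitivity rising as the cutoff rises in the table but is not stated in this excerpt. The function name and the toy data are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) for a 'score < cutoff' screening rule.

    Assumption: a normalized WHO-5 score below the cutoff flags the condition.
    """
    tp = fn = tn = fp = 0
    for score, positive in zip(scores, has_condition):
        flagged = score < cutoff
        if positive and flagged:
            tp += 1
        elif positive:
            fn += 1
        elif flagged:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    specificity = tn / (tn + fp) if tn + fp else float("nan")
    return sensitivity, specificity


# Toy data for illustration only (not from the study).
toy_scores = [0.20, 0.30, 0.40, 0.55, 0.60, 0.70, 0.45, 0.80]
toy_condition = [True, True, True, True, False, False, False, False]

for cutoff in (0.35, 0.51, 0.59):
    sens, spec = sensitivity_specificity(toy_scores, toy_condition, cutoff)
    print(f"cutoff={cutoff:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

As in the table, raising the cutoff in this toy example trades specificity for sensitivity.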
Descriptive statistics for subjective well-being, age and gender in the final dataset
| Variable | N | Range | Mean | Std | Mean (norm) | Std (norm) | Cronbach’s α |
|---|---|---|---|---|---|---|---|
| SWLS | 372 | 5–35 | 18.30 | 6.73 | 0.4433 | 0.2243 | 0.8365 |
| WHO-5 | 372 | 5–30 | 16.51 | 4.66 | 0.4604 | 0.1865 | 0.8205 |
| Age | 372 | 18–53 | 23.06 | 5.06 | | | |
| Gender | 372 | Male, Female | 298 (80%) Female | | | | |
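The normalized columns appear to follow min-max scaling over each scale's range. This is an inference from the reported numbers, not a formula given in this excerpt. The sketch below, with hypothetical helper names, recomputes the normalized means and standard deviations from the raw values in the table.

```python
def min_max_normalize(value, low, high):
    """Map a raw scale value onto [0, 1] using the scale's range."""
    return (value - low) / (high - low)


def normalize_spread(std, low, high):
    """A standard deviation is only rescaled, not shifted."""
    return std / (high - low)


# Raw values as reported in the table: (mean, std, range minimum, range maximum).
scales = {
    "SWLS": (18.30, 6.73, 5, 35),
    "WHO-5": (16.51, 4.66, 5, 30),
}

for name, (mean, std, low, high) in scales.items():
    norm_mean = min_max_normalize(mean, low, high)
    norm_std = normalize_spread(std, low, high)
    print(f"{name}: mean(norm)={norm_mean:.4f} std(norm)={norm_std:.4f}")
```

Under this assumption the normalized means match the table (0.4433 and 0.4604). The WHO-5 standard deviation comes out at 0.1864 rather than the reported 0.1865, presumably because the table was computed from unrounded raw values.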
Descriptive statistics for the textual and phone app usage features in the final dataset
| Data | Sum | Mean | Median | Min | Max |
|---|---|---|---|---|---|
| Messages | 6739K | 18,115 | 10,948.5 | 52 | 131,368 |
| Message alters | 53K | 143 | 107.5 | 2 | 1029 |
| Message volume (chars) | 160,707K | 432,009 | 240,831 | 671 | 2,983,231 |
| Posts | 7K | 19 | 4 | 0 | 1880 |
| Post volume (chars) | 857K | 2303 | 84 | 0 | 87,708 |
| App Usage (seconds) | 1573K | 4231 | 3715.5 | 24 | 16,329 |
Figure 1. Distribution of SWLS values
Figure 4. Distribution of Gender values
User metadata and overall activity features
| Feature name | Description | Number |
|---|---|---|
| Age | – | 1 |
| Gender | – | 1 |
| NVkFriends | N | 1 |
| AllAlters | N | 1 |
| Subscriptions | N | 1 |
| Mess_ 1 | Total number of messages written in the last 30 days | 1 |
| MessChars_ 1 | Total size (in characters) of messages written in the last 30 days | 1 |
| growth-2to-1weighted | Weighted difference between total size of messages written in the months −1 and −2 | 1 |
| altersdiff | Weighted difference between numbers of alters in the months −1 and −2 | 1 |
| AppUsage1Week | Number of active app usage instances in the period of app data sharing time (one week) | 1 |
| AllAppTime1Week | Total time of phone app usage in the period of app data sharing time (in seconds) | 1 |
| RatioAppTime1Week | Ratio of phone app usage time in the week of app data sharing time | 1 |
| AppUsage 0–3, 3–6, 6–9, 9–12, 12–15, 15–18, 18–21, 21–24 | Time of phone app usage in 3-hour time periods – each out of the 8 features represents a 3-hour time period | 8 |
| AppUsage 0–3, 3–6, 6–9, 9–12, 12–15, 15–18, 18–21, 21–24 Ratio | Time of phone app usage in 3-hour time periods normalized by total app usage time – each out of the 8 features represents a 3-hour time period | 8 |
| Alters −1–−12 | Numbers of alters in every month (30 days) before the DigitalFreud install time, for months between −1 and −12 | 12 |
| Total | | 40 |
Best word cluster features
| Scale | Regularization weight | Consensus clustering threshold | Infrequent words | No. of clusters | MAE |
|---|---|---|---|---|---|
| SWLS | 500 | 0.45 | − | 28 | 0.1704 |
| WHO-5 | 0 | 0.45 | + | 19 | 0.1525 |
Phone app category features
| Feature type | N | Example feature name | Description |
|---|---|---|---|
| Total time logged in category by a user | 9 | GAME | Total time logged in Game apps by a user |
| Total time logged in category in time period by a user | 72 | GAME_21-24 | Total time logged in Game apps between 21 and 24 h by a user |
| Total time logged in category in time period/total time logged in category by a user | 72 | PHOTOGRAPHY_0-3/PHOTOGRAPHY | Ratio of time logged in Photography apps between 0 and 3 AM to total time logged in Photography apps by a user |
| Total time logged in category in time period/total time logged in time period by a user | 72 | EDUCATION + PRODUCTIVITY_15-18/15-18 | Ratio of time logged in Education + Productivity apps between 15 and 18 h to total time logged in apps between 15 and 18 h by a user |
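The four feature types in this table can be sketched as follows. The feature names follow the examples in the table, and the counts are consistent with 9 categories and 8 three-hour periods (9 × 8 = 72). The input format of `(category, hour, seconds)` records and the function names are assumptions made for illustration.

```python
from collections import defaultdict

# The eight 3-hour periods used in the feature names.
PERIODS = ["0-3", "3-6", "6-9", "9-12", "12-15", "15-18", "18-21", "21-24"]


def period_of(hour):
    """Map an hour of day (0-23) to its 3-hour period label."""
    return PERIODS[hour // 3]


def category_features(usage_records):
    """Build category features from (category, hour, seconds) records.

    The record format is hypothetical; only observed category/period
    pairs get a feature in this sketch.
    """
    by_category = defaultdict(float)
    by_period = defaultdict(float)
    by_both = defaultdict(float)
    for category, hour, seconds in usage_records:
        period = period_of(hour)
        by_category[category] += seconds
        by_period[period] += seconds
        by_both[(category, period)] += seconds

    features = dict(by_category)  # total time per category
    for (category, period), seconds in by_both.items():
        features[f"{category}_{period}"] = seconds
        if by_category[category] > 0:
            features[f"{category}_{period}/{category}"] = seconds / by_category[category]
        if by_period[period] > 0:
            features[f"{category}_{period}/{period}"] = seconds / by_period[period]
    return features


# Toy usage log for one user (illustrative values only).
toy_records = [
    ("GAME", 22, 600),
    ("GAME", 10, 200),
    ("PHOTOGRAPHY", 1, 100),
    ("PHOTOGRAPHY", 22, 200),
]
toy_features = category_features(toy_records)
for name in sorted(toy_features):
    print(name, round(toy_features[name], 3))
```

The two ratio types answer different questions: `GAME_21-24/GAME` is the share of gaming that happens late in the evening, while `GAME_21-24/21-24` is the share of late-evening phone time spent on games.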
SWLS value prediction results
| Features | Best model | MAE | Pearson R | R² |
|---|---|---|---|---|
| Mean baseline | – | 0.1853 | – | – |
| Median baseline | – | 0.185 | – | – |
| Words | ElasticNet | 0.1744 | 0.3402 | 0.1022 |
| RuLIWC | DecisionTree | 0.182 | 0.2168 | 0.0142 |
| AppCats | ElasticNet | 0.1762 | 0.2737 | 0.0172 |
| Behavior | DecisionTree | 0.1785 | 0.191 | 0.0195 |
| Clusters | RandomForest | 0.1814 | 0.1709 | 0.026 |
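The three reported metrics can be computed as in the sketch below, written in plain Python with toy values that are not taken from the study. It also shows why a constant baseline has no Pearson correlation: constant predictions have zero variance, so the coefficient is undefined. How the baselines were fitted within cross-validation is not described in this excerpt.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def pearson_r(x, y):
    """Pearson correlation; NaN when either input is constant."""
    if max(x) == min(x) or max(y) == min(y):
        return float("nan")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def r_squared(y_true, y_pred):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Toy normalized scores (illustrative only).
y_true = [0.2, 0.4, 0.5, 0.7]
y_model = [0.3, 0.4, 0.45, 0.6]
y_mean_baseline = [sum(y_true) / len(y_true)] * len(y_true)

for name, pred in (("model", y_model), ("mean baseline", y_mean_baseline)):
    print(name, round(mae(y_true, pred), 4), pearson_r(y_true, pred),
          round(r_squared(y_true, pred), 4))
```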
WHO-5 value prediction results
| Features | Best model | MAE | Pearson R | R² |
|---|---|---|---|---|
| Mean baseline | – | 0.1542 | – | – |
| Median baseline | – | 0.1533 | – | – |
| Words | Lasso | 0.1441 | 0.3179 | 0.0817 |
| RuLIWC | Lasso | 0.1529 | 0.1276 | 0.0197 |
| AppCats | ElasticNet | 0.1511 | 0.2172 | 0.0329 |
| Behavior | DecisionTree | 0.1497 | 0.2463 | 0.0096 |
| Clusters | Lasso | 0.1516 | 0.1533 | 0.0241 |
| | AdaBoost | | | |
Best WHO-5 classification results
| Classification | Threshold | N (Classes) | Best model | Best features | F1-macro | F1-weighted | F1-low | F1-high | True Positive Rate (low) | False Positive Rate (low) |
|---|---|---|---|---|---|---|---|---|---|---|
| Binary | 0.51 | 221/151 | AdaBoost | Words + RuLIWC + AppCats | 0.692 | 0.706 | 0.768 | 0.616 | 0.792 | 0.404 |
| Binary majority baseline | | | | | 0.378 | 0.456 | 0.373 | 0 | 1.0 | 1.0 |
| Trinary | 0.35/0.59 | 111/158/103 | AdaBoost | Clusters + RuLIWC + Words | 0.483 | 0.493 | 0.502 | 0.433 | 0.450 | 0.161 |
| Trinary majority baseline | | | | | 0.199 | 0.253 | – | – | 0.0 | 0.0 |
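The class construction and the two averaged F1 scores can be sketched as follows. The cutoffs are those in the table; whether a score exactly on a cutoff belongs to the upper class is an assumption, and the function names and toy data are hypothetical. The sketch is not meant to reproduce the reported values.

```python
from collections import Counter

BINARY_CUTOFFS = (0.51,)
TRINARY_CUTOFFS = (0.35, 0.59)


def to_class(norm_score, cutoffs):
    """Index of the class for a normalized score (0 = lowest well-being).

    Assumption: a score equal to a cutoff falls into the upper class.
    """
    return sum(norm_score >= cutoff for cutoff in cutoffs)


def f1_by_class(y_true, y_pred):
    """Per-class F1 = 2TP / (2TP + FP + FN); 0.0 if the class is never hit."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def f1_macro(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    per_class = f1_by_class(y_true, y_pred)
    return sum(per_class.values()) / len(per_class)


def f1_weighted(y_true, y_pred):
    """Mean of the per-class F1 scores weighted by class support."""
    per_class = f1_by_class(y_true, y_pred)
    support = Counter(y_true)
    return sum(per_class[c] * support[c] for c in per_class) / len(y_true)


# Toy normalized scores and predictions (illustrative only).
toy_scores = [0.10, 0.30, 0.40, 0.50, 0.55, 0.60, 0.70, 0.90]
y_true = [to_class(s, TRINARY_CUTOFFS) for s in toy_scores]
y_pred = [0, 1, 1, 1, 2, 2, 2, 0]
majority = Counter(y_true).most_common(1)[0][0]
y_majority = [majority] * len(y_true)

print("model", round(f1_macro(y_true, y_pred), 3), round(f1_weighted(y_true, y_pred), 3))
print("majority", round(f1_macro(y_true, y_majority), 3), round(f1_weighted(y_true, y_majority), 3))
```

A majority-class predictor scores zero F1 on every class it never predicts, which is why macro-averaged F1 penalizes it heavily.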
Predictive features in SWLS scale. Slang, misspellings and unconventional word forms are shown with an asterisk (*). Errors in lemmatization are enclosed in brackets
| Feature type | Feature | Translation/Description | Coefficient |
|---|---|---|---|
| Words | спать_[NOUN] | sleep_VERB | 41,086 |
| Words | интим_NOUN | intimacy_NOUN (suggestive of ‘intercourse’) | −44,937 |
| Words | орг_NOUN* | org(aniser)_NOUN | 23,978 |
| Words | дропнуть_VERB* | quit_VERB | −64,677 |
| Words | тратиться_VERB | spend_VERB | −24,593 |
| Words | отл_UNKN* | fine_UNKN | 34,184 |
| Words | пояснение_NOUN | explanation_NOUN | −22,499 |
| Words | стебать_VERB* | bully_VERB (rude) | −28,898 |
| Words | [вифя]_NOUN* | wifi_NOUN | −48,114 |
| Words | спойлерить_VERB* | spoil_VERB | −48,530 |
| Words | ооохнуть_VERB* | gasp_VERB | −44,864 |
| Words | милый_COMP | nice_COMPARATIVE | 56,128 |
| Words | [пиздёжа]_NOUN* | lie_NOUN (rude) | −22,727 |
| Words | обжечь_VERB | burn_VERB | −40,019 |
| Sentiment | Negative_month | Negative sentiment in the last month | −29 |
| Activity | AppUsage9-12Ratio | Ratio of phone app usage time between 9 and 12 AM normalized by total app usage time | 10 |
| Activity | AppUsage0-3Ratio | Ratio of phone app usage time between 0 and 3 AM normalized by total app usage time | −8 |
| AppCats | SOCIAL + COMMUNICATION + DATING_0-3/SOCIAL + COMMUNICATION + DATING | Ratio of time logged in Social + Communication + Dating apps between 0 and 3 AM to total time logged in Social + Communication + Dating apps | 11 |
| AppCats | PHOTOGRAPHY_18-21/18-21 | Ratio of time logged in Photography apps between 18 and 21 h PM to total time logged in apps between 18 and 21 h PM | 8 |
Predictive features in WHO-5 scale
| Feature type | Feature | Translation/Description | Coefficient |
|---|---|---|---|
| AppCats | GAME_3-6/GAME | Ratio of time logged in Game apps between 3 and 6 h AM to total time logged in Game apps | −5 |
| AppCats | ENTERTAINMENT_3-6/ENTERTAINMENT | Ratio of time logged in Entertainment apps between 3 and 6 h AM to total time logged in Entertainment apps | 4 |
| AppCats | HEALTH+MEDICAL_3-6/HEALTH+MEDICAL | Ratio of time logged in Health + Medical apps between 3 and 6 h AM to total time logged in Health + Medical apps | 3 |
| AppCats | PERSONALIZATION_0-3/0-3 | Ratio of time logged in Personalization apps between 0 and 3 h AM to total time logged in apps between 0 and 3 h AM | −4 |
| AppCats | EDUCATION + PRODUCTIVITY_9-12/EDUCATION + PRODUCTIVITY | Ratio of time logged in Education + Productivity apps between 9 and 12 h AM to total time logged in Education + Productivity apps | −3 |
| AppCats | TOOLS_18-21/18-21 | Ratio of time logged in Tools apps between 18 and 21 h PM to total time logged in apps between 18 and 21 h PM | −3 |
| AppCats | SOCIAL + COMMUNICATION + DATING_3-6/SOCIAL + COMMUNICATION + DATING | Ratio of time logged in Social + Communication + Dating apps between 3 and 6 AM to total time logged in Social + Communication + Dating apps | 7 |
| AppCats | GAME_9-12/GAME | Ratio of time logged in Game apps between 9 and 12 h AM to total time logged in Game apps | 2 |
| AppCats | OTHER_3-6/OTHER | Ratio of time logged in Other apps between 3 and 6 h AM to total time logged in Other apps | −2 |
| AppCats | ENTERTAINMENT_9-12/ENTERTAINMENT | Ratio of time logged in Entertainment apps between 9 and 12 h AM to total time logged in Entertainment apps | 2 |
| AppCats | PHOTOGRAPHY_0-3/PHOTOGRAPHY | Ratio of time logged in Photography apps between 0 and 3 h AM to total time logged in Photography apps | −2 |
| AppCats | EDUCATION + PRODUCTIVITY_21-24/EDUCATION + PRODUCTIVITY | Ratio of time logged in Education + Productivity apps between 21 and 24 h PM to total time logged in Education + Productivity apps | −2 |
| RuLIWC | Bio_RuLIWC | Words related to Biological processes in RuLIWC | −20 |
| Words | (face-blowing-a-kiss_emoji)_UNKN | (face-blowing-a-kiss_emoji) | 35 |
| Words | но_CONJ | but_CONJ | −16 |
| Activity | AppUsage9-12Ratio | Ratio of phone app usage time between 9 and 12 AM normalized by total app usage time | 7 |
| Sentiment | Negative_month | Negative sentiment in the last month | −33 |
| Sentiment | Negative_year | Negative sentiment in the last year | −29 |
| Sentiment | Negative_all | Negative sentiment in overall messages | −23 |
Figure 2. Distribution of WHO-5 values
Distribution of the most common cities identified in the overall data sample
| City | Percents |
|---|---|
| Moscow | 47.4 |
| St. Petersburg | 36.9 |
| Yekaterinburg | 8 |
| Kazan | 6.2 |
| Minsk | 5.7 |
| Chelyabinsk | 5.7 |
| Novosibirsk | 5.7 |
| Nizhny Novgorod | 5 |
| Krasnodar | 4.7 |
| Rostov-on-Don | 4.2 |
Distribution of the most common cities identified in the final dataset
| City | Percents |
|---|---|
| Moscow | 41.6 |
| St. Petersburg | 31.9 |
| Yekaterinburg | 8 |
| Nizhny Novgorod | 5.3 |
| Voronezh | 4.4 |
| Chelyabinsk | 4.4 |
| Vladivostok | 4.4 |
| Tyumen | 3.5 |
| Kirov | 3.5 |
| Yaroslavl | 3.5 |
Total list of words used as features for the SWLS and WHO-5 prediction
| SWLS | WHO-5 |
|---|---|
| 1000_NUMB | !_PNCT |
| 22_NUMB | 2000_NUMB |
| 2500_NUMB | |
| t_LATN | r_LATN |
| адрес_NOUN | аааа_NOUN |
| апрель_NOUN | ааааа_NOUN |
| ахуесть_VERB | ааааааааааааааааа_NOUN |
| ахуеть_VERB | адрес_NOUN |
| ахуй_NOUN | анимешник_NOUN |
| бабочка_NOUN | арми_NOUN |
| баня_NOUN | ахахахи_NOUN |
| бгод_NOUN | ахахахха_NOUN |
| бесить_VERB | байка_NOUN |
| бланк_NOUN | бантан_NOUN |
| бля_INTJ | блестеть_VERB |
| блядь_INTJ | блч_UNKN |
| блятба_NOUN | блять_NOUN |
| блять_NOUN | бляяяяять_VERB |
| бляять_GRND | бляяяяяять_GRND |
| большой_ADJ | борис_NOUN |
| борис_NOUN | будто_CONJ |
| будто_CONJ | валя_NOUN |
| бухать_GRND | вежливый_ADJ |
| василий_NOUN | вифя_NOUN |
| ващий_ADJ | вообще_ADV |
| вечно_ADV | воооот_NOUN |
| водный_ADJ | воскресение_NOUN |
| воскресение_NOUN | впервые_ADV |
| впустить_VERB | впустить_VERB |
| выглянуть_VERB | вскрыться_VERB |
| графика_NOUN | выпилиться_VERB |
| грубый_ADJ | выставить_VERB |
| даж_UNKN | выступить_VERB |
| делаться_VERB | глупенький_ADJ |
| день_NOUN | горе_NOUN |
| добрый_ADJ | гуглить_VERB |
| договориться_VERB | даун_NOUN |
| долбиться_VERB | дельфин_NOUN |
| е_NOUN | демон_NOUN |
| ебал_NOUN | дерьмо_NOUN |
| ебануться_VERB | джон_NOUN |
| ебать_VERB | джуна_NOUN |
| еби_UNKN | дилемма_NOUN |
| еблана_NOUN | добровольно_ADV |
| ебу_UNKN | добрый_ADJ |
| ет_UNKN | доказательство_NOUN |
| жарко_ADV | дразнить_VERB |
| жить_VERB | дропнуть_VERB |
| заебал_NOUN | ебаный_ADJ |
| заебок_NOUN | ебу_UNKN |
| закрыться_VERB | ет_UNKN |
| замечание_NOUN | ж_CONJ |
| запасный_ADJ | жестокий_ADJ |
| запрещать_VERB | животный_ADJ |
| знач_NOUN | загуглила_NOUN |
| именно_PRCL | заезжать_VERB |
| комиссия_NOUN | заехать_VERB |
| корея_NOUN | замуж_ADV |
| кофеёк_NOUN | запереть_VERB |
| критерий_NOUN | заржать_VERB |
| лана_NOUN | засиживаться_VERB |
| лариса_NOUN | звонить_VERB |
| лень_NOUN | звёздочка_NOUN |
| ложь_NOUN | инглихой_COMP |
| лях_NOUN | интим_NOUN |
| маман_NOUN | истинный_ADJ |
| мамаша_NOUN | как_CONJ |
| маркетинг_NOUN | кальян_NOUN |
| маркус_NOUN | камбэк_NOUN |
| милах_NOUN | капёс_NOUN |
| мразь_NOUN | кб_NOUN |
| мудак_NOUN | коль_CONJ |
| мёд_NOUN | комикс_NOUN |
| набрать_VERB | кореец_NOUN |
| научный_ADJ | корея_NOUN |
| нах_UNKN | косплей_NOUN |
| нахуй_NOUN | кпоп_NOUN |
| нееет_UNKN | ладить_VERB |
| ненавидеть_VERB | листочек_NOUN |
| несмотря_PREP | лосиный_ADJ |
| неудобный_ADJ | магнитный_ADJ |
| никто_NPRO | милах_NOUN |
| нихуй_NOUN | милый_COMP |
| обжечь_VERB | монст_NOUN |
| окончание_NOUN | мразь_NOUN |
| орало_NOUN | мррррра_NOUN |
| орг_NOUN | мутный_ADJ |
| организация_NOUN | мфц_UNKN |
| отвлечь_VERB | мэн_NOUN |
| отвратительный_ADJ | набрать_VERB |
| отл_UNKN | наверна_NOUN |
| отлично_ADV | наехать_VERB |
| отсталый_ADJ | намджуна_NOUN |
| передать_VERB | наорать_VERB |
| петух_NOUN | настолько_ADV |
| пизда_NOUN | неинтересно_ADV |
| пиздец_NOUN | неловко_ADV |
| пиздуть_VERB | ненавидеть_VERB |
| пиздёжа_NOUN | несчастный_ADJ |
| подробный_ADJ | нету_PRED |
| поебать_VERB | неудобно_ADV |
| пока_ADV | никогда_ADV |
| показатель_NOUN | но_CONJ |
| получить_VERB | ноооо_NOUN |
| пользователь_NOUN | ноут_NOUN |
| помереть_VERB | обидный_ADJ |
| помеха_NOUN | облизывать_VERB |
| потерять_PRTS | объяснять_VERB |
| похуй_NOUN | объёмный_ADJ |
| пояснение_NOUN | он_NPRO |
| предать_VERB | оооо_NOUN |
| предсказуемый_ADJ | ооооооо_NOUN |
| признак_NOUN | оооохнуть_VERB |
| приобрести_VERB | ооохнуть_VERB |
| припереться_VERB | орало_NOUN |
| прогуливать_VERB | останавливать_VERB |
| прогулять_VERB | отбирать_ VERB |
| равно_CONJ | отвлечься_VERB |
| разом_ADV | отвратительный_ADJ |
| разреветься_VERB | отвратный_ADJ |
| разрывать_VERB | отлично_ADV |
| рамка_NOUN | офф_UNKN |
| растеряться_VERB | ох_INTJ |
| результат_NOUN | паника_NOUN |
| рил_NOUN | педик_NOUN |
| руководитель_NOUN | переключить_VERB |
| рушить_VERB | переписывать_VERB |
| рэп_NOUN | пересматривать_VERB |
| свалить_VERB | пиздец_NOUN |
| скот_NOUN | пират_NOUN |
| скучно_ADV | писаться_VERB |
| смеяться_VERB | подъехать_VERB |
| сосуд_NOUN | поебать_VERB |
| спока_NOUN | пожениться_VERB |
| спорый_ADJ | покинуть_VERB |
| ссылка_NOUN | помнить_VERB |
| стебать_VERB | поплакать_VERB |
| сук_NOUN | порешать_VERB |
| съебывать_VERB | поступок_NOUN |
| тиндёр_NOUN | потерянный_ADJ |
| тратиться_VERB | потерять_PRTS |
| труп_NOUN | поттер_NOUN |
| трус_NOUN | пошло_ADV |
| тэхен_NOUN | ппц_UNKN |
| ущербный_ADJ | предатель_NOUN |
| факультет_NOUN | предать_VERB |
| херить_VERB | привет_NOUN |
| хит_NOUN | пригонять_VERB |
| хм_INTJ | приобнять_VERB |
| хрень_NOUN | продумать_VERB |
| хуй_NOUN | прописать_VERB |
| хуйня_NOUN | псих_NOUN |
| хула_NOUN | психануть_VERB |
| хы_UNKN | пытаться_VERB |
| цель_NOUN | пялить_VERB |
| через_PREP | работа_NOUN |
| шава_NOUN | разреветься_VERB |
| шеф_NOUN | разрывать_VERB |
| шлюшка_NOUN | расплатиться_VERB |
| шуга_NOUN | расстроить_PRTF |
| эт_UNKN | растягивать_VERB |
| эх_INTJ | реветь_VERB |
| я_NPRO | репер_NOUN |
| (glowing-star_emoji)_UNKN | репетиция_NOUN |
| (thinking-face_emoji)_UNKN | риал_NOUN |
| | рил_NOUN |
| | рушить_VERB |
| | саба_NOUN |
| | сам_ADJ |
| | свалить_VERB |
| | серега_NOUN |
| | серия_NOUN |
| | слеза_NOUN |
| | слишком_ADV |
| | смеяться_VERB |
| | спасать_VERB |
| | спать_NOUN |
| | спойлерить_VERB |
| | спорый_ADJ |
| | ссора_NOUN |
| | старший_NOUN |
| | стебать_VERB |
| | страдать_VERB |
| | страшно_ADV |
| | стремный_ADJ |
| | съездить_VERB |
| | таак_NOUN |
| | тони_NOUN |
| | тренировка_NOUN |
| | труп_NOUN |
| | тц_UNKN |
| | тэхен_NOUN |
| | убивать_VERB |
| | удовлетворение_NOUN |
| | умирать_VERB |
| | умыться_VERB |
| | упад_NOUN |
| | фандом_NOUN |
| | ханна_NOUN |
| | хардкор_NOUN |
| | хдд_UNKN |
| | хл_UNKN |
| | хм_INTJ |
| | хорошо_ADV |
| | хотя_CONJ |
| | худой_COMP |
| | червь_NOUN |
| | через_PREP |
| | чертовый_ADJ |
| | чонгук_NOUN |
| | чувство_NOUN |
| | чудом_ADV |
| | чуть_ADV |
| | шлюшка_NOUN |
| | шов_NOUN |
| | шуга_NOUN |
| | ь_UNKN |
| | это_NPRO |
| | этот_ADJ |
| | юнга_NOUN |
| | я_NPRO |
| | (medium-light-skin-tone_emoji)_UNKN |
| | (face-blowing-a-kiss_emoji)_UNKN |
| | (drooling-face_emoji)_UNKN |
Correlation between sentiment class and WHO score
| Sentiment class | Correlation with WHO score |
|---|---|
| negative | −0.14921 |
| positive | 0.024321 |
| neutral | 0.09399 |
| speech | |
| skip | −0.114221 |
Results for linear regression model with sentiment class frequency features. Mean absolute error and Pearson correlation
| Sentiment classes combinations | Mean absolute error | Pearson correlation |
|---|---|---|
| negative, positive | 0.1434 | 0.1243 |
| negative, neutral, positive | 0.1265 | |
| negative, neutral, positive, skip, speech | 0.1447 |
Models and hyperparameters used for SWLS and WHO-5 regression
| Model | Hyperparameters |
|---|---|
| AdaBoostRegressor | 'loss': ['linear', 'square', 'exponential'], 'n_estimators': [10, 100] |
| DecisionTreeRegressor | 'criterion': ['mae'], 'max_depth': [2, 3], 'min_samples_leaf': [2], 'max_leaf_nodes': [3], 'splitter': ['best'], 'min_samples_split': [2], 'max_features': ['auto'] |
| ElasticNet | 'alpha': [100, 10, 1, 0.1, 0.01, 0.001, 0.0001], 'normalize': [False, True], 'selection': ['cyclic', 'random'], 'max_iter': [500, 1000], 'l1_ratio': [0.25, 0.5, 0.75] |
| Lasso | 'alpha': [100, 10, 1, 0.1, 0.01, 0.001, 0.0001], 'normalize': [False, True], 'selection': ['cyclic', 'random'], 'max_iter': [500, 1000, 2000] |
| LinearRegression | 'normalize': [False, True] |
| RandomForestRegressor | 'n_estimators': [2, 5, 10, 20], 'max_depth': [2, 3], 'min_samples_split': [2], 'min_samples_leaf': [1], 'max_features': ['auto'] |
| Ridge | 'alpha': [100, 10, 1, 0.1, 0.01, 0.001, 0.0001], 'normalize': [False, True] |
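The size of a search over one of these grids can be checked with a short sketch that expands the ElasticNet row into individual candidate configurations. The model and parameter names match scikit-learn estimators, but that the authors used scikit-learn is an assumption, and the helper below is hypothetical. Newer scikit-learn releases have dropped or renamed some of these options (for example `normalize`), so the grids may need adjusting before reuse.

```python
from itertools import product

# ElasticNet grid as listed in the table above.
ELASTICNET_GRID = {
    "alpha": [100, 10, 1, 0.1, 0.01, 0.001, 0.0001],
    "normalize": [False, True],
    "selection": ["cyclic", "random"],
    "max_iter": [500, 1000],
    "l1_ratio": [0.25, 0.5, 0.75],
}


def expand_grid(grid):
    """Yield every combination of hyperparameter values as a dict."""
    names = list(grid)
    for values in product(*(grid[name] for name in names)):
        yield dict(zip(names, values))


candidates = list(expand_grid(ELASTICNET_GRID))
print(len(candidates))  # 7 * 2 * 2 * 2 * 3 combinations
print(candidates[0])
```

An exhaustive search evaluates every one of these candidates in each cross-validation fold, so the small tree and forest grids in the table are far cheaper to search than the linear-model grids.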
Models and hyperparameters used for WHO-5 classification
| Model | Hyperparameters |
|---|---|
| AdaBoostClassifier | "algorithm": ["SAMME.R"] |
| DecisionTreeClassifier | "criterion": ["gini", "entropy"], "max_depth": [None, 10, 50, 100] |
| RandomForestClassifier | "n_estimators": [10, 50, 100], "max_depth": [None, 10, 50, 100] |
SWLS regression results for all feature sets
| Features | Best model | MAE | Pearson R | R² |
|---|---|---|---|---|
| Mean baseline | – | 0.1853 | – | – |
| Median baseline | – | 0.185 | – | – |
| Words | ElasticNet | 0.1744 | 0.3402 | 0.1022 |
| RuLIWC | DecisionTree | 0.182 | 0.2168 | 0.0142 |
| AppCats | ElasticNet | 0.1762 | 0.2737 | 0.0172 |
| Behavior | DecisionTree | 0.1785 | 0.191 | 0.0195 |
| Clusters | RandomForest | 0.1814 | 0.1709 | 0.026 |
| AppCats + RuLIWC | ElasticNet | 0.1776 | 0.2478 | 0.0296 |
| AppCats + Behavior | ElasticNet | 0.1784 | 0.2227 | 0.0248 |
| AppCats + Words | Ridge | 0.1756 | 0.2992 | 0.0864 |
| RuLIWC + Behavior | DecisionTree | 0.1818 | 0.1949 | 0.0133 |
| RuLIWC + Words | ElasticNet | 0.1722 | 0.352 | 0.0988 |
| Behavior + Words | ElasticNet | 0.1754 | 0.314 | 0.0752 |
| clusters + AppCats | ElasticNet | 0.1786 | 0.2545 | 0.0129 |
| clusters + RuLIWC | DecisionTree | 0.1769 | 0.2769 | 0.0507 |
| clusters + Behavior | DecisionTree | 0.1765 | 0.2243 | 0.0368 |
| clusters + Words | Lasso | 0.1715 | 0.3435 | 0.112 |
| AppCats + RuLIWC + Behavior | ElasticNet | 0.1761 | 0.3093 | 0.0704 |
| AppCats + RuLIWC + Words | Lasso | 0.1753 | 0.2913 | 0.0711 |
| AppCats + Behavior + Words | ElasticNet | 0.1735 | 0.3004 | 0.0724 |
| RuLIWC + Behavior + Words | ElasticNet | 0.1752 | 0.3506 | 0.0934 |
| clusters + AppCats + RuLIWC | ElasticNet | 0.1778 | 0.2636 | 0.0314 |
| clusters + AppCats + Behavior | ElasticNet | 0.1756 | 0.2341 | 0.0528 |
| clusters + AppCats + Words | Lasso | 0.1712 | 0.2958 | 0.0932 |
| clusters + RuLIWC + Behavior | DecisionTree | 0.1765 | 0.2275 | 0.038 |
| clusters + RuLIWC + Words | ElasticNet | 0.1712 | 0.3673 | 0.1192 |
| clusters + Behavior + Words | ElasticNet | 0.1712 | 0.3459 | 0.1228 |
| clusters + AppCats + RuLIWC + Behavior | ElasticNet | 0.1748 | 0.2962 | 0.0048 |
| clusters + AppCats + RuLIWC + Words | Ridge | 0.1751 | 0.2882 | 0.0811 |
| clusters + RuLIWC + Behavior + Words | Lasso | 0.1776 | 0.294 | 0.0616 |
| AppCats + RuLIWC + Behavior + Words | ElasticNet | 0.1719 | 0.3255 | 0.096 |
WHO-5 regression results for all feature sets
| Features | Best model | MAE | Pearson R | R² |
|---|---|---|---|---|
| Mean baseline | – | 0.1542 | – | – |
| Median baseline | – | 0.1533 | – | – |
| Words | Lasso | 0.1441 | 0.3179 | 0.0817 |
| RuLIWC | Lasso | 0.1529 | 0.1276 | 0.0197 |
| AppCats | ElasticNet | 0.1511 | 0.2172 | 0.0329 |
| Behavior | DecisionTree | 0.1497 | 0.2463 | 0.0096 |
| Clusters | Lasso | 0.1516 | 0.1533 | 0.0241 |
| AppCats + RuLIWC | Ridge | 0.1505 | 0.2578 | 0.0371 |
| AppCats + Behavior | Lasso | 0.1458 | 0.2934 | 0.0678 |
| AppCats + Words | ElasticNet | 0.1458 | 0.3228 | 0.0772 |
| RuLIWC + Behavior | DecisionTree | 0.1505 | 0.2399 | 0.0032 |
| RuLIWC + Words | Ridge | 0.1445 | 0.3242 | 0.0964 |
| Behavior + Words | AdaBoost | 0.1473 | 0.2813 | 0.0476 |
| clusters + AppCats | ElasticNet | 0.1502 | 0.2537 | 0.0492 |
| clusters + RuLIWC | AdaBoost | 0.1527 | 0.1822 | -0.007 |
| clusters + Behavior | DecisionTree | 0.15 | 0.2343 | -0.0026 |
| clusters + Words | ElasticNet | 0.1449 | 0.2628 | 0.0975 |
| clusters + AppCats + RuLIWC | ElasticNet | 0.1493 | 0.2807 | 0.0786 |
| clusters + AppCats + Behavior | ElasticNet | 0.1469 | 0.3013 | 0.0739 |
| clusters + AppCats + Words | Ridge | 0.1444 | 0.338 | 0.0894 |
| clusters + RuLIWC + Behavior | DecisionTree | 0.1505 | 0.2399 | 0.0032 |
| clusters + Behavior + Words | Ridge | 0.1462 | 0.2389 | 0.0653 |
| AppCats + RuLIWC + Behavior | ElasticNet | 0.145 | 0.3363 | 0.0835 |
| AppCats + RuLIWC + Words | Ridge | 0.146 | 0.3222 | 0.0817 |
| RuLIWC + Behavior + Words | ElasticNet | 0.1479 | 0.2531 | 0.0531 |
| AppCats + Behavior + Words | ElasticNet | 0.1452 | 0.3152 | 0.0975 |
| clusters + AppCats + RuLIWC + Behavior | ElasticNet | 0.1456 | 0.3394 | 0.0938 |
| clusters + AppCats + RuLIWC + Words | ElasticNet | 0.1472 | 0.3088 | 0.0716 |
| clusters + AppCats + Behavior + Words | Lasso | 0.1457 | 0.3339 | 0.0701 |
| clusters + RuLIWC + Behavior + Words | ElasticNet | 0.1478 | 0.2961 | 0.072 |
| clusters + AppCats + RuLIWC + Behavior + Words | ElasticNet | 0.148 | 0.2952 | 0.0544 |
WHO-5 classification results
| Classification | Threshold | N (Classes) | Features | Best model | F1-macro | F1-weighted | F1-low | F1-high | True Positive Rate (low) | False Positive Rate (low) |
|---|---|---|---|---|---|---|---|---|---|---|
| binary | 0.51 | 221/151 | Words | AdaBoost | 0.56 | 0.581 | 0.669 | 0.452 | 0.697 | 0.57 |
| binary | 0.51 | 221/151 | RuLIWC | DecisionTree | 0.571 | 0.582 | 0.631 | 0.512 | 0.611 | 0.457 |
| binary | 0.51 | 221/151 | AppCats | AdaBoost | 0.58 | 0.602 | 0.694 | 0.466 | 0.738 | 0.57 |
| binary | 0.51 | 221/151 | Behavior | DecisionTree | 0.543 | 0.559 | 0.63 | 0.456 | 0.638 | 0.55 |
| binary | 0.51 | 221/151 | Clusters | RandomForest | 0.539 | 0.571 | 0.714 | 0.363 | 0.832 | 0.715 |
| binary majority baseline | | | | | 0.378 | 0.456 | 0.373 | 0 | 1 | 1 |
| trinary | 0.35/0.59 | 111/158/103 | Words | AdaBoost | 0.44 | 0.447 | 0.407 | 0.43 | 0.378 | 0.195 |
| trinary | 0.35/0.59 | 111/158/103 | RuLIWC | AdaBoost | 0.381 | 0.399 | 0.413 | 0.238 | 0.405 | 0.241 |
| trinary | 0.35/0.59 | 111/158/103 | AppCats | AdaBoost | 0.422 | 0.443 | 0.402 | 0.294 | 0.396 | 0.241 |
| trinary | 0.35/0.59 | 111/158/103 | Behavior | AdaBoost | 0.425 | 0.438 | 0.427 | 0.329 | 0.414 | 0.23 |
| trinary | 0.35/0.59 | 111/158/103 | Clusters | DecisionTree | 0.358 | 0.364 | 0.338 | 0.339 | 0.351 | 0.295 |
| trinary | 0.35/0.59 | 111/158/103 | clusters + RuLIWC + Words | AdaBoost | 0.483 | 0.493 | 0.502 | 0.433 | 0.45 | 0.161 |
| trinary majority baseline | | | | | 0.199 | 0.253 | – | – | 0 | 0 |
Features significant in SWLS regression
| Feature | Mean importance | Count in 10-CV |
|---|---|---|
| спать_NOUN | 41,086.4144049898 | 5 |
| интим_NOUN | −44,937.4613019008 | 5 |
| орг_NOUN | 23,978.9614411828 | 5 |
| дропнуть_VERB | −64,677.1586467715 | 5 |
| тратиться_VERB | −24,593.5714641034 | 5 |
| отл_UNKN | 34,184.2112504721 | 5 |
| пояснение_NOUN | −22,499.9757533852 | 5 |
| стебать_VERB | −28,898.951393906 | 5 |
| вифя_NOUN | −48,114.1470241285 | 5 |
| спойлерить_VERB | −48,530.1211086886 | 5 |
| ооохнуть_VERB | −44,864.4233831708 | 5 |
| милый_COMP | 56,128.262155605 | 5 |
| пиздёжа_NOUN | −22,727.1849476408 | 5 |
| Negative_month | −29.2652084171193 | 5 |
| AppUsage9-12Ratio | 10.3365760075427 | 5 |
| SOCIAL+COMMUNICATION+DATING_0/SOCIAL+COMMUNICATION+DATING | 11.9200141620517 | 5 |
| AppUsage0-3Ratio | −8.02782185058373 | 5 |
| обжечь_VERB | −40,019.2136897226 | 5 |
| PHOTOGRAPHY_6/6 | 8.00565760998601 | 5 |
| объёмный_ADJ | −22,927.1436115299 | 4 |
| разрывать_VERB | −30,217.4675429819 | 4 |
| AppUsage6-9Ratio | 6.14938364734453 | 4 |
| Negative_year | −42.3845120787015 | 4 |
| Negative_all | −31.9683341076574 | 4 |
| (face-blowing-a-kiss_emoji)_UNKN | 30.6632155496334 | 4 |
| упад_NOUN | −18,580.4570156265 | 4 |
| чонгук_NOUN | −17,463.4313634737 | 4 |
| дельфин_NOUN | 21,536.5345583292 | 4 |
| пиздуть_VERB | −14,962.8346296741 | 4 |
| продумать_VERB | 17,494.2319623544 | 4 |
| PERSONALIZATION_3/3 | 7.00814836414381 | 4 |
| 385 | 24,539.3867498811 | 4 |
| хл_UNKN | −16,147.5539040561 | 4 |
| TOOLS_2/2 | 6.32964889581622 | 4 |
| блч_UNKN | −14,422.917489824 | 4 |
| мразь_NOUN | −18,116.8461473664 | 4 |
| ENTERTAINMENT_0/0 | 5.31480531809703 | 4 |
| Percept_RuLIWC | −40.0983869406501 | 3 |
| камбэк_NOUN | −11,161.9164871315 | 3 |
| помеха_NOUN | 16,006.6097736205 | 3 |
| неудобный_ADJ | 14,581.2712382097 | 3 |
| байка_NOUN | −13,460.0791647949 | 3 |
| но_CONJ | −33.8820301427059 | 3 |
| бляяяяять_VERB | 16,679.7792973482 | 3 |
| OTHER_6/6 | 6.70178805009534 | 3 |
| OTHER_6/OTHER | −5.92779708025781 | 3 |
| OTHER_5/5 | −6.64941501653413 | 3 |
| OTHER_5/OTHER | 7.9587777321838 | 3 |
| пожениться_VERB | 7984.73733958357 | 3 |
| джуна_NOUN | −15,756.4913230161 | 3 |
| хорошо_ADV | 27.4832618347005 | 3 |
| расстроить_PRTF | 10,055.3069444193 | 3 |
| предать_VERB | 9610.44789534047 | 3 |
| критерий_NOUN | 13,168.233814062 | 3 |
| офф_UNKN | −16,763.6305231621 | 3 |
| грубый_ADJ | −9967.34916619578 | 3 |
| съебывать_VERB | −14,161.996571157 | 3 |
| фандом_NOUN | −7058.65855912861 | 3 |
| бляяяяяять_GRND | −8683.89310820064 | 3 |
| PHOTOGRAPHY_4/4 | −11.4270318169998 | 3 |
| кореец_NOUN | −8536.07198679033 | 3 |
| бантан_NOUN | 11,240.4372555059 | 3 |
| разреветься_VERB | 9104.89644806333 | 3 |
| GAME_0/GAME | 3.63645809844946 | 3 |
| EDUCATION+PRODUCTIVITY_1/1 | −3.06771772638087 | 3 |
| PERSONALIZATION_5/PERSONALIZATION | 4.18099320731029 | 3 |
| HEALTH+MEDICAL_7/HEALTH+MEDICAL | 3.38255612173981 | 3 |
| EDUCATION+PRODUCTIVITY_6/EDUCATION+PRODUCTIVITY | 3.6103446132533 | 3 |
| GAME_5/GAME | −3.60075795654176 | 3 |
| SOCIAL+COMMUNICATION+DATING_1/SOCIAL+COMMUNICATION+DATING | 6.3430323785416 | 3 |
| HEALTH+MEDICAL_1/HEALTH+MEDICAL | 3.65748802084215 | 3 |
| GAME_4/GAME | 6.1492563234405 | 3 |
| PERSONALIZATION_6/6 | −7.74392289361535 | 3 |
| HEALTH+MEDICAL_4/4 | −10.3948608039209 | 3 |
| PERSONALIZATION_5/5 | −4.53626915989602 | 3 |
| PERSONALIZATION_6/PERSONALIZATION | 5.56692243721861 | 3 |
| EDUCATION+PRODUCTIVITY_3/EDUCATION+PRODUCTIVITY | −3.23127315840786 | 3 |
| SOCIAL+COMMUNICATION+DATING_6/SOCIAL+COMMUNICATION+DATING | 4.04158603764638 | 3 |
| GAME_1/GAME | −7.22900142005906 | 3 |
| PERSONALIZATION_3/PERSONALIZATION | −5.34430996987665 | 3 |
| ENTERTAINMENT_1/ENTERTAINMENT | 4.14113541946902 | 3 |
| ENTERTAINMENT_2/2 | 4.31642268738505 | 3 |
| PERSONALIZATION_0/PERSONALIZATION | −4.4149112724685 | 3 |
| привет_NOUN | 23.4902884073796 | 2 |
| EDUCATION+PRODUCTIVITY_4/EDUCATION+PRODUCTIVITY | 0.992124886908841 | 2 |
| Positive_month | 25.7634204715238 | 2 |
| EDUCATION+PRODUCTIVITY_5/EDUCATION+PRODUCTIVITY | 1.05862884395041 | 2 |
| TOOLS_5/TOOLS | −2.00490447501796 | 2 |
| EDUCATION+PRODUCTIVITY_7/EDUCATION+PRODUCTIVITY | 1.11603306580073 | 2 |
| SOCIAL+COMMUNICATION+DATING_7/7 | 3.0908805432526 | 2 |
| TOOLS_2/TOOLS | −6.49804253661374 | 2 |
| TOOLS_4/TOOLS | −3.31388672845536 | 2 |
| ENTERTAINMENT_2/ENTERTAINMENT | −2.54736564980811 | 2 |
| SOCIAL+COMMUNICATION+DATING_7/SOCIAL+COMMUNICATION+DATING | −6.83408164115133 | 2 |
| EDUCATION+PRODUCTIVITY_2/EDUCATION+PRODUCTIVITY | 2.43891633479369 | 2 |
| выпилиться_VERB | −3338.05050867729 | 2 |
| EDUCATION+PRODUCTIVITY_2/2 | 2.59731699527922 | 2 |
| еби_UNKN | 19,201.5250764129 | 2 |
| выглянуть_VERB | −7762.75345745476 | 2 |
| гуглить_VERB | −1079.55071853441 | 2 |
| растягивать_VERB | −5127.61602039587 | 2 |
| жестокий_ADJ | −6724.2195734053 | 2 |
| GAME_2/2 | −2.50496667340079 | 2 |
| заржать_VERB | −9032.15201413262 | 2 |
| мэн_NOUN | 18,667.8692410825 | 2 |
| ENTERTAINMENT_4/4 | −0.6931462276039 | 2 |
| долбиться_VERB | −14,770.1618041756 | 2 |
| петух_NOUN | −7131.85541414396 | 2 |
| подробный_ADJ | 6083.97042642484 | 2 |
| оооохнуть_VERB | −12,538.581928229 | 2 |
| загуглила_NOUN | −8903.85747472549 | 2 |
| ущербный_ADJ | −10,188.6678026704 | 2 |
| GAME_6/GAME | −0.557326373511503 | 2 |
| EDUCATION+PRODUCTIVITY_0/EDUCATION+PRODUCTIVITY | 1.32014414545248 | 2 |
| See_RuLIWC | −44.9066092959057 | 2 |
| TOOLS_1/TOOLS | 0.246966591171979 | 2 |
| SOCIAL+COMMUNICATION+DATING_0/0 | 0.145830646062444 | 2 |
| HEALTH+MEDICAL_2/HEALTH+MEDICAL | −0.660450103454573 | 2 |
| PHOTOGRAPHY_3/PHOTOGRAPHY | 1.22028192527858 | 2 |
| PHOTOGRAPHY_2/PHOTOGRAPHY | −3.17674625105626 | 2 |
| PHOTOGRAPHY_7/PHOTOGRAPHY | −1.47432690185411 | 2 |
| PHOTOGRAPHY_1/PHOTOGRAPHY | 3.16110100642227 | 2 |
| PHOTOGRAPHY_0/PHOTOGRAPHY | −2.35307771691311 | 2 |
| OTHER_1/1 | −1.69717416262808 | 2 |
| OTHER_2/2 | −1.36082437833807 | 2 |
| OTHER_3/OTHER | 4.37604917806843 | 2 |
| SOCIAL+COMMUNICATION+DATING_4/4 | −3.52025229911742 | 2 |
| gender_merged | 0.843233408502276 | 2 |
| PHOTOGRAPHY_4/PHOTOGRAPHY | −1.33192379668633 | 2 |
| HEALTH+MEDICAL_7/7 | 9.42472510509919 | 2 |
| HEALTH+MEDICAL_6/HEALTH+MEDICAL | 2.38143334031195 | 2 |
| HEALTH+MEDICAL_4/HEALTH+MEDICAL | 0.0719895107619945 | 2 |
| SOCIAL+COMMUNICATION+DATING_6/6 | 3.03202297494442 | 2 |
| HEALTH+MEDICAL_1/1 | −12.9847665999778 | 2 |
| GAME_0/0 | −0.867851376244603 | 2 |
| HEALTH+MEDICAL_0/HEALTH+MEDICAL | −1.41730497804968 | 2 |
| PERSONALIZATION_4/PERSONALIZATION | −3.12105445284181 | 2 |
| PERSONALIZATION_2/PERSONALIZATION | −1.24127259627768 | 2 |
| PERSONALIZATION_7/PERSONALIZATION | 1.51585802743145 | 2 |
| TOOLS_4/4 | −9.92596640160934 | 2 |
| PERSONALIZATION_0/0 | −3.19660012339845 | 2 |
| ENTERTAINMENT_7/ENTERTAINMENT | −0.84084761762985 | 2 |
| HEALTH+MEDICAL_5/HEALTH+MEDICAL | −0.129810361433915 | 1 |
| шава_NOUN | −8941.24555908169 | 1 |
| AppUsage15-18Ratio | −2.07642255991409 | 1 |
| AppUsage21-24Ratio | −0.676863148867948 | 1 |
| маркус_NOUN | 61,863.8448291371 | 1 |
| ENTERTAINMENT_6/6 | −13.5275610189105 | 1 |
| научный_ADJ | 4729.48292716799 | 1 |
| ноооо_NOUN | 5259.91641964248 | 1 |
| намджуна_NOUN | −510.55327160631 | 1 |
| AppUsage12-15Ratio | −10.37076321492 | 1 |
| HEALTH+MEDICAL_3/3 | 26.6170643844024 | 1 |
| ENTERTAINMENT_7/7 | 17.4976861479316 | 1 |
| GAME_1/1 | −0.0205129700590116 | 1 |
| Alters_-9 | −0.129041531649635 | 1 |
| GAME_2/GAME | −2.71200941173272 | 1 |
| EDUCATION+PRODUCTIVITY_4/4 | 0.177772668811767 | 1 |
| TOOLS_0/0 | −2.14285870492742 | 1 |
| Alters_-7 | 0.153125018883583 | 1 |
| TOOLS_6/TOOLS | 2.69914996314137 | 1 |
| OTHER_1/OTHER | −2.35449675924834 | 1 |
| ENTERTAINMENT_3/3 | 1.1328494815993 | 1 |
| PHOTOGRAPHY_1/1 | 0 | 1 |
| ENTERTAINMENT_5/ENTERTAINMENT | 0.149165800159887 | 1 |
| ENTERTAINMENT_6/ENTERTAINMENT | −0.41391451812445 | 1 |
| PERSONALIZATION_1/PERSONALIZATION | −0.330707466660351 | 1 |
| HEALTH+MEDICAL_2/2 | 0.121502401677361 | 1 |
| шов_NOUN | −4721.2176187302 | 1 |
| бланк_NOUN | −4764.15988799968 | 1 |
| GAME_4/4 | −3.65092628597215 | 1 |
| EDUCATION+PRODUCTIVITY_3/3 | −6.80750290356738 | 1 |
| ENTERTAINMENT_4/ENTERTAINMENT | −3.30081880604151 | 1 |
| TOOLS_7/TOOLS | −4.04245464338458 | 1 |
| TOOLS_5/5 | −4.14302286116742 | 1 |
| PERSONALIZATION_4/4 | 10.7567609647953 | 1 |
| TOOLS_3/TOOLS | −4.12230296427837 | 1 |
| TOOLS_0/TOOLS | −5.53385037327718 | 1 |
| HEALTH+MEDICAL_3/HEALTH+MEDICAL | 2.05274970971747 | 1 |
| altersdiff | −0.844560300538125 | 1 |
| HEALTH+MEDICAL_5/5 | 15.552182342051 | 1 |
| EDUCATION+PRODUCTIVITY_0/0 | −1.51789074810355 | 1 |
| GAME_6/6 | 9.22791978620134 | 1 |
| OTHER_7/OTHER | 1.97270978786112 | 1 |
| SOCIAL+COMMUNICATION+DATING_1/1 | −0.103568092714721 | 1 |
| потерянный_ADJ | −11,345.6959433921 | 1 |
| саба_NOUN | 2077.71458808275 | 1 |
| SOCIAL+COMMUNICATION+DATING_5/5 | −1.05176654037101 | 1 |
| припереться_VERB | −5406.87753364421 | 1 |
| OTHER_4/4 | −5.5431060969613 | 1 |
| OTHER_4/OTHER | 3.93086530165725 | 1 |
| OTHER_0/0 | 3.27602573972759 | 1 |
| HEALTH+MEDICAL_6/6 | 15.2161412609318 | 1 |
| писаться_VERB | −5296.54331114917 | 1 |
| OTHER_0/OTHER | −2.08967236336632 | 1 |
| поплакать_VERB | −318.617179611988 | 1 |
| рэп_NOUN | −4852.08132918677 | 1 |
| ложь_NOUN | 6888.93905838351 | 1 |
| PHOTOGRAPHY_5/PHOTOGRAPHY | 0.837922272744407 | 1 |
| growth-2to-1weighted | 0.0682381400607952 | 1 |

Features significant in WHO-5 regression

| Feature | Mean importance | Count in 10-CV |
|---|---|---|
| GAME_1/GAME | −5.30288559374647 | 7 |
| ENTERTAINMENT_1/ENTERTAINMENT | 4.48794365614162 | 7 |
| HEALTH+MEDICAL_1/HEALTH+MEDICAL | 2.6216421331719 | 6 |
| AppUsage9-12Ratio | 7.2634466399016 | 6 |
| PERSONALIZATION_0/0 | −3.93650446203669 | 6 |
| EDUCATION+PRODUCTIVITY_3/EDUCATION+PRODUCTIVITY | −2.75547290725553 | 6 |
| TOOLS_6/6 | −3.38562106644281 | 5 |
| SOCIAL+COMMUNICATION+DATING_1/SOCIAL+COMMUNICATION+DATING | 7.08554306182447 | 5 |
| GAME_3/GAME | 2.11983623880978 | 5 |
| OTHER_1/OTHER | −1.6572596556467 | 5 |
| Bio_RuLIWC | −20.8118206754822 | 5 |
| (face-blowing-a-kiss_emoji)_UNKN | 35.1292524225535 | 5 |
| EDUCATION+PRODUCTIVITY_7/EDUCATION+PRODUCTIVITY | −1.52932660473865 | 5 |
| Negative_month | −32.9859591887424 | 5 |
| Negative_year | −28.7441191861823 | 5 |
| Negative_all | −22.8213190036261 | 5 |
| но_CONJ | −16.0358199801479 | 5 |
| ENTERTAINMENT_3/ENTERTAINMENT | 1.89327664411053 | 5 |
| PHOTOGRAPHY_0/PHOTOGRAPHY | −1.86907348608951 | 5 |
| AppUsage6-9Ratio | 3.86122248368424 | 4 |
| See_RuLIWC | −17.771085104379 | 4 |
| Percept_RuLIWC | −16.0075235125978 | 4 |
| PHOTOGRAPHY_4/4 | −11.8301205279096 | 4 |
| SOCIAL+COMMUNICATION+DATING_7/SOCIAL+COMMUNICATION+DATING | 5.36396284427798 | 4 |
| OTHER_6/OTHER | −2.72825023219845 | 4 |
| PERSONALIZATION_2/PERSONALIZATION | −2.65288208258266 | 4 |
| хорошо_ADV | 11.8899128397086 | 4 |
| HEALTH+MEDICAL_1/1 | −12.7633565212712 | 4 |
| PERSONALIZATION_0/PERSONALIZATION | −1.96354072113084 | 4 |
| EDUCATION+PRODUCTIVITY_5/5 | −8.71922494241247 | 4 |
| gender_merged | 1.72537700855949 | 4 |
| PHOTOGRAPHY_4/PHOTOGRAPHY | −1.737331956631 | 4 |
| EDUCATION+PRODUCTIVITY_4/EDUCATION+PRODUCTIVITY | 1.25522498498395 | 4 |
| ENTERTAINMENT_0/ENTERTAINMENT | −1.36074073571704 | 4 |
| SOCIAL+COMMUNICATION+DATING_6/6 | 2.22405894249721 | 4 |
| ENTERTAINMENT_6/ENTERTAINMENT | 0.966349839431597 | 3 |
| PHOTOGRAPHY_1/1 | −4.02406844554479 | 3 |
| OTHER_4/4 | −2.7677868583523 | 3 |
| OTHER_5/OTHER | 3.85343503542729 | 3 |
| PHOTOGRAPHY_1/PHOTOGRAPHY | 3.71328277476559 | 3 |
| PERSONALIZATION_1/PERSONALIZATION | −3.35252125337103 | 3 |
| AppUsage15-18Ratio | −3.26882386884001 | 3 |
| SOCIAL+COMMUNICATION+DATING_4/4 | −3.31997843801445 | 3 |
| HEALTH+MEDICAL_4/HEALTH+MEDICAL | −0.9428147681181 | 3 |
| PERSONALIZATION_3/3 | −3.84520578986224 | 3 |
| HEALTH+MEDICAL_2/2 | 3.92987294169931 | 3 |
| PHOTOGRAPHY_3/PHOTOGRAPHY | 0.40923079097447 | 3 |
| PHOTOGRAPHY_6/PHOTOGRAPHY | −1.29996830780878 | 3 |
| OTHER_1/1 | −0.784546722768581 | 3 |
| altersdiff | −0.92538030647002 | 3 |
| OTHER_0/OTHER | 2.25557936917862 | 3 |
| PHOTOGRAPHY_6/6 | 9.01538392975499 | 2 |
| SOCIAL+COMMUNICATION+DATING_4/SOCIAL+COMMUNICATION+DATING | 5.87503025553596 | 2 |
| обжечь_VERB | −48,599.526427015 | 2 |
| офф_UNKN | −28,194.9413170442 | 2 |
| PHOTOGRAPHY_0/0 | 3.92198345485137 | 2 |
| SOCIAL+COMMUNICATION+DATING_5/SOCIAL+COMMUNICATION+DATING | −2.50586082868358 | 2 |
| жестокий_ADJ | −21,716.6197777305 | 2 |
| потерянный_ADJ | −20,316.8257664484 | 2 |
| тратиться_VERB | −17,634.6227724749 | 2 |
| PHOTOGRAPHY_7/PHOTOGRAPHY | −1.20678360573034 | 2 |
| OTHER_0/0 | −1.44570452070656 | 2 |
| OTHER_3/OTHER | 1.37984322453403 | 2 |
| пригонять_VERB | 21,225.458567208 | 2 |
| дропнуть_VERB | −53,030.1050709822 | 2 |
| OTHER_6/6 | 4.06504182009785 | 2 |
| предать_VERB | 36,199.9239128262 | 2 |
| AppUsage12-15Ratio | −4.38636786177102 | 2 |
| червь_NOUN | 55,826.7353210136 | 2 |
| ущербный_ADJ | −26,488.8457950415 | 2 |
| ооохнуть_VERB | −20,570.0983686645 | 2 |
| магнитный_ADJ | 19,900.8586557732 | 2 |
| оооохнуть_VERB | −34,380.0974521155 | 2 |
| блч_UNKN | −19,552.7345641462 | 2 |
| приобнять_VERB | 49,086.8536049488 | 2 |
| SOCIAL+COMMUNICATION+DATING_0/0 | 0.00300824222269163 | 2 |
| вифя_NOUN | −22,048.0034645223 | 2 |
| TOOLS_2/TOOLS | −1.55091403718346 | 2 |
| growth-2to-1weighted | −0.506104865550831 | 2 |
| вообще_ADV | −5.14238891529362 | 2 |
| привет_NOUN | 12.8226744206666 | 2 |
| он_NPRO | −18.0820059246943 | 2 |
| GAME_0/GAME | 1.55931076688544 | 2 |
| GAME_0/0 | −1.05587959441984 | 2 |
| GAME_3/3 | −2.654246487825 | 2 |
| PHOTOGRAPHY_5/5 | 17.2783798216844 | 2 |
| HEALTH+MEDICAL_7/7 | 2.71003871145643 | 2 |
| EDUCATION+PRODUCTIVITY_1/1 | −1.52282930897709 | 2 |
| EDUCATION+PRODUCTIVITY_2/EDUCATION+PRODUCTIVITY | −1.07500201275638 | 2 |
| HEALTH+MEDICAL_0/0 | 3.6972143104661 | 2 |
| Social_RuLIWC | 21.8729634558609 | 2 |
| TOOLS_3/TOOLS | 0.807683500445982 | 2 |
| TOOLS_6/TOOLS | 1.50491110974576 | 2 |
| ENTERTAINMENT_4/ENTERTAINMENT | −0.841292236238171 | 2 |
| ENTERTAINMENT_7/ENTERTAINMENT | −0.53781828777184 | 2 |
| PERSONALIZATION_1/1 | 2.05013057484557 | 2 |
| PERSONALIZATION_2/2 | 2.41275356218853 | 2 |
| PERSONALIZATION_4/PERSONALIZATION | −0.277615145486821 | 2 |
| TOOLS_4/TOOLS | −0.332621565714857 | 2 |
| PERSONALIZATION_6/PERSONALIZATION | 1.98331184812083 | 2 |
| ханна_NOUN | −20,365.7841289252 | 1 |
| отбирать_VERB | −16,422.664060777 | 1 |
| шлюшка_NOUN | 7462.2668479906 | 1 |
| интим_NOUN | −8415.75871614007 | 1 |
| отл_UNKN | 18,240.2790750433 | 1 |
| бабочка_NOUN | 22,242.8428202378 | 1 |
| кпоп_NOUN | −22,706.902332252 | 1 |
| объёмный_ADJ | −30,296.798373289 | 1 |
| упад_NOUN | −17,378.1196878735 | 1 |
| анимешник_NOUN | −8288.74761112379 | 1 |
| хотя_CONJ | −2.32750500704937 | 1 |
| критерий_NOUN | 37,329.1994647902 | 1 |
| слишком_ADV | −2.67847601625444 | 1 |
| AppUsage18-21Ratio | 3.02994634652915 | 1 |
| EDUCATION+PRODUCTIVITY_7/7 | 3.4977040679328 | 1 |
| выглянуть_VERB | −19,143.9898454013 | 1 |
| хдд_UNKN | −2.41764183408381 | 1 |
| PERSONALIZATION_3/PERSONALIZATION | −1.45062334734687 | 1 |
| загуглила_NOUN | 38,940.8898266739 | 1 |
| HEALTH+MEDICAL_2/HEALTH+MEDICAL | −1.70766506251123 | 1 |
| SOCIAL+COMMUNICATION+DATING_1/1 | 0.214043849547984 | 1 |
| SOCIAL+COMMUNICATION+DATING_7/7 | 0.118134328089427 | 1 |
| GAME_6/GAME | 0.334957523880728 | 1 |
| GAME_7/7 | 2.06442883253146 | 1 |
| EDUCATION+PRODUCTIVITY_3/3 | 1.17072928106647 | 1 |
| TOOLS_0/0 | 0.690890866736 | 1 |
| TOOLS_5/5 | 1.77062991115663 | 1 |
| TOOLS_7/TOOLS | 1.2692618820954 | 1 |
| ENTERTAINMENT_3/3 | 4.91803252713477 | 1 |
| PERSONALIZATION_5/PERSONALIZATION | −0.015250988462515 | 1 |
| HEALTH+MEDICAL_5/HEALTH+MEDICAL | −0.949406780362511 | 1 |
| Alters_-7 | 0.0266628614988393 | 1 |
| SOCIAL+COMMUNICATION+DATING_2/SOCIAL+COMMUNICATION+DATING | −1.16690389645055 | 1 |
| SOCIAL+COMMUNICATION+DATING_6/SOCIAL+COMMUNICATION+DATING | −3.05422385174122 | 1 |
| PHOTOGRAPHY_2/2 | 0.335187657879573 | 1 |
| PHOTOGRAPHY_5/PHOTOGRAPHY | 1.93680325337435 | 1 |
| OTHER_4/OTHER | 2.1244521858398 | 1 |
| OTHER_5/5 | −2.29647607260118 | 1 |
| OTHER_7/7 | 2.51738861629993 | 1 |
| AppUsage0-3Ratio | −2.10969417137118 | 1 |
| GAME_1/1 | −1.06574716160273 | 1 |
| PERSONALIZATION_7/7 | 0 | 1 |
| джуна_NOUN | −45,463.0625569559 | 1 |
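
The two numeric columns in the tables above ("Mean importance" and "Count in 10-CV") can be read as a per-feature summary over the cross-validation folds. The following is a minimal sketch of one way such a summary could be computed. It is not the authors' code. It assumes that each fold yields a mapping from selected feature names to importances, that a feature missing from a fold was not selected there, and that the mean is taken only over the folds in which the feature was selected. The function name `summarize_folds` and the toy numbers are hypothetical.

```python
from collections import defaultdict


def summarize_folds(fold_importances):
    """Aggregate per-fold importances into (feature, mean, count) rows.

    fold_importances: list of dicts, one per CV fold, mapping a feature
    name to its importance in that fold. Features that were not selected
    in a fold are simply absent from that fold's dict (an assumption).
    """
    values = defaultdict(list)
    for fold in fold_importances:
        for feature, importance in fold.items():
            values[feature].append(importance)
    # Mean over the folds where the feature was selected, plus the fold count.
    rows = [(name, sum(v) / len(v), len(v)) for name, v in values.items()]
    # Sort by count, descending; the stable sort keeps first-seen order on ties.
    rows.sort(key=lambda row: row[2], reverse=True)
    return rows


# Toy input with made-up numbers and only three folds, for illustration.
folds = [
    {"GAME_1/GAME": -5.0, "AppUsage9-12Ratio": 7.0},
    {"GAME_1/GAME": -6.0},
    {"GAME_1/GAME": -4.0, "AppUsage9-12Ratio": 8.0, "gender_merged": 1.5},
]
table = summarize_folds(folds)
for name, mean, count in table:
    print(f"| {name} | {mean} | {count} |")
```

Under this reading, a feature with a count of 1 has a "mean" that is a single fold's value, which may explain why the rarely selected lexical features show very large magnitudes.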