| Literature DB >> 35125931 |
Guilherme F Marchezini1, Anisio M Lacerda1, Gisele L Pappa1, Wagner Meira1, Debora Miranda1, Marco A Romano-Silva1, Danielle S Costa1, Leandro Malloy Diniz1.
Abstract
This paper deals with the problem of modeling counterfactual reasoning in scenarios where, apart from the observed endogenous variables, we have a latent variable that affects the outcomes and, consequently, the results of counterfactuals queries. This is a common setup in healthcare problems, including mental health. We propose a new framework where the aforementioned problem is modeled as a multivariate regression and the counterfactual model accounts for both observed and a latent variable, where the latter represents what we call the patient individuality factor ( φ ). In mental health, focusing on individuals is paramount, as past experiences can change how people see or deal with situations, but individuality cannot be directly measured. To the best of our knowledge, this is the first counterfactual approach that considers both observational and latent variables to provide deterministic answers to counterfactual queries, such as: what if I change the social support of a patient, to what extent can I change his/her anxiety? The framework combines concepts from deep representation learning and causal inference to infer the value of φ and capture both non-linear and multiplicative effects of causal variables. Experiments are performed with both synthetic and real-world datasets, where we predict how changes in people's actions may lead to different outcomes in terms of symptoms of mental illness and quality of life. Results show the model learns the individually factor with errors lower than 0.05 and answers counterfactual queries that are supported by the medical literature. The model has the potential to recommend small changes in people's lives that may completely change their relationship with mental illness.Entities:
Keywords: Counterfactual inference; Mental health; Multivariate regression
Year: 2022 PMID: 35125931 PMCID: PMC8801560 DOI: 10.1007/s10618-021-00818-9
Source DB: PubMed Journal: Data Min Knowl Discov ISSN: 1384-5810 Impact factor: 5.406
Fig. 2Causal DAG with multiple outcome , features x, and latent individuality factor . Calculated values appear in black boxes, observed variables in black circles, and unobserved variables in white
Comparison of your work with other from the literature
| Three levels of causation | Tabular data | Latent vars. | Non-linear | |
|---|---|---|---|---|
|
Bica et al. ( | ||||
|
Graham et al. ( | ||||
|
Pawlowski et al. ( | ||||
|
Zhang and Bareinboim ( | ||||
|
Zhang et al. ( | ||||
| Ours |
Fig. 1A view frequently adopted in regular regression models (a) assumes variables are independently manipulated inputs to a given fixed and deterministic regressor h. In the causal approach to counterfactual inference taken in this work, we rather view variables as causally related to each other by a structural causal model (SCM) (b) with associated causal graph (c)
Fig. 3An illustration of the proposed counterfactual model, highlighting the factual (i.e. ) and counterfactual inputs (i.e. ) to compute the factual (i.e. ) and counterfactual outcome (i.e. )
Fig. 5Double descent behavior of the base model in the test loss
Fig. 4Comparison of train and test normalized association errors for different combinations of and
Fig. 6Comparison of normalized counterfactual errors for simulated data with different combinations of and : the left column shows the values for and the right column for
Number of input and output variables and MAE measured for association relations with BSI and WHOQOL
| BSI | WHOQOL | Train | Test | ||||
|---|---|---|---|---|---|---|---|
| # | |||||||
| 8 | 9 | 0.0179 | 0.0079 | 0.0439 | 0.0081 | ||
| 8 | 12 | 0.0007 | 0.0007 | 0.0165 | 0.0217 | ||
| 8 | 10 | 0.0004 | 0.0005 | 0.0070 | 0.0094 | ||
Individual-level counterfactual analysis
| ID | Score | Feature | F Value | F Score | CF Value | CF Score |
|---|---|---|---|---|---|---|
| 1 | BSI_Psy | BSI_45 | 0 | 0.0712 | 4 | 0.0713 |
| 2 | 0 | 0.0355 | 4 | 0.0712 | ||
| 1 | Psychological | WHOQOL_19 | 3 | 0.6291 | 5 | 0.7355 |
| 2 | 3 | 0.7294 | 5 | 0.7838 | ||
| 3 | Physical | WHOQOL_2 | 3 | 0.7399 | 5 | 0.7903 |
| 4 | 3 | 0.6716 | 5 | 0.6996 | ||
| 5 | Environment | WHOQOL_5 | 4 | 0.7418 | 2 | 0.7218 |
| 6 | 4 | 0.8246 | 2 | 0.8044 | ||
| 7 | 4 | 0.7971 | 2 | 0.7785 | ||
| 8 | 4 | 0.7137 | 2 | 0.6921 |
Fig. 7Heatmaps showing populational changes when counterfactualy changing values of variables. First and third columns show the percentage of the population that has the value of the score (indicated in the caption) decreased when the variable in the line receives minimum and maximum values, respectively. Second and fourth columns show the same percentage of the population that have their score values increased with these changes
Percentage of variation in the BSI and WHOQOL scores as counterfactual interventions are made in the variables indicated in the Id column
| Physical intervention | Psychological intervention | Environmental intervention | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Id | Min | Max | Id | Min | Max | Id | Min | Max | ||||||||||||
| > | < | > | < | > | < | > | < | > | < | > | < | |||||||||
| 1 | 0.1 | 0.37 | 99.53 | 1.08 | 0.01 | 98.91 | 1 | 0.1 | 8.37 | 91.53 | 17.41 | 0.01 | 82.58 | 1 | 21.6 | 32.96 | 45.44 | 33.83 | 19.82 | 46.35 |
| 9 | 0.46 | 0.15 | 99.39 | 0.82 | 0.15 | 99.03 | 9 | 0.08 | 13.53 | 86.39 | 73.98 | 0.01 | 26.01 | 9 | 34.44 | 45.67 | 19.9 | 62 | 13.42 | 24.57 |
| 12 | 0.04 | 0.26 | 99.7 | 2.39 | 0.09 | 97.51 | 12 | 0.06 | 13.38 | 86.56 | 37.13 | 0.02 | 62.85 | 12 | 28.96 | 43.69 | 27.35 | 57.25 | 18.26 | 24.49 |
| 23 | 0 | 99.98 | 0.01 | 100 | 0 | 0 | 23 | 0.02 | 76.38 | 23.61 | 99.54 | 0.01 | 0.45 | 23 | 6.69 | 88.88 | 4.43 | 94.91 | 3.43 | 1.66 |
| 37 | 0 | 99.99 | 0.01 | 100 | 0 | 0 | 37 | 0.04 | 80.44 | 19.51 | 99.86 | 0.01 | 0.13 | 37 | 6.11 | 89.9 | 3.99 | 96.5 | 2.35 | 1.15 |
| 38 | 0.28 | 0.09 | 99.63 | 0.51 | 0.02 | 99.47 | 38 | 0.08 | 4.29 | 95.63 | 12.22 | 0.01 | 87.78 | 38 | 24.91 | 31.92 | 43.16 | 44.19 | 17.98 | 37.82 |
| 45 | 0.3 | 1.66 | 98.04 | 6.86 | 0.08 | 93.06 | 45 | 0.01 | 31.37 | 68.62 | 82.68 | 0.01 | 17.31 | 45 | 27.6 | 55.62 | 16.78 | 49.69 | 17.65 | 32.67 |
| 49 | 0.13 | 0.19 | 99.68 | 1.52 | 0.07 | 98.41 | 49 | 0.07 | 15.78 | 84.15 | 50.08 | 0.02 | 49.9 | 49 | 29.07 | 43.55 | 27.39 | 53.68 | 16.66 | 29.66 |
| 2 | 4.58 | 88.62 | 6.8 | 78.42 | 3.05 | 18.52 | *6 | 0.31 | 99.2 | 0.5 | 93.14 | 0.1 | 6.76 | 1 | 0.79 | 1.74 | 97.47 | 1.4 | 0.96 | 97.64 |
| 6 | 37.51 | 8.74 | 53.75 | 10.19 | 9.58 | 80.23 | 10 | 1.21 | 96.21 | 2.58 | 81.48 | 0.89 | 17.63 | 2 | 0.36 | 77.9 | 21.74 | 35.78 | 0.38 | 63.84 |
| 7 | 6.93 | 82.06 | 11.01 | 79.73 | 7.1 | 13.16 | 14 | 0.95 | 80.57 | 18.47 | 64.56 | 1.2 | 34.24 | 5 | 0.15 | 82.2 | 17.65 | 60.54 | 0.04 | 39.42 |
| 9 | 8.17 | 10.18 | 81.65 | 9.36 | 6.87 | 83.78 | 16 | 4.44 | 88.88 | 6.68 | 76.91 | 5.04 | 18.06 | 6 | 0.11 | 94.65 | 5.24 | 37.82 | 0.16 | 62.02 |
| 10 | 1.4 | 96.64 | 1.96 | 98.98 | 0.4 | 0.62 | 18 | 2.73 | 61.63 | 35.64 | 14.13 | 4.15 | 81.72 | 7 | 0.17 | 90.24 | 9.59 | 74.68 | 0.06 | 25.26 |
| 11 | 7.88 | 81.41 | 10.71 | 75.26 | 7.15 | 17.58 | *19 | 0.13 | 99.54 | 0.33 | 99.66 | 0.08 | 0.26 | *9 | 0.01 | 99.25 | 0.74 | 99.76 | 0 | 0.23 |
| 12 | 4.41 | 21.66 | 73.93 | 30.51 | 4.27 | 65.22 | 20 | 6.34 | 88.19 | 5.46 | 72.22 | 4.7 | 23.08 | 15 | 0.06 | 98.36 | 1.58 | 88.07 | 0.05 | 11.88 |
| 17 | 0.65 | 98.43 | 0.92 | 99.63 | 0.15 | 0.21 | 22 | 4.19 | 90.94 | 4.87 | 68.32 | 2.9 | 28.79 | 22 | 0.15 | 79.6 | 20.25 | 34.86 | 0.16 | 64.98 |
| 19 | 4.69 | 82.58 | 12.73 | 48.13 | 4.65 | 47.22 | 25 | 5.77 | 64.77 | 29.46 | 11.93 | 3.63 | 84.43 | *23 | 0.01 | 99.77 | 0.23 | 99.88 | 0 | 0.12 |
| 23 | 8.56 | 81.66 | 9.78 | 55.61 | 5.58 | 38.81 | *26 | 0.22 | 99.28 | 0.5 | 99.59 | 0.13 | 0.27 | |||||||
| 24 | 5.25 | 66.7 | 28.05 | 45.84 | 5.62 | 48.54 | ||||||||||||||
| 26 | 57.26 | 9.72 | 33.02 | 13.43 | 16.48 | 70.09 | ||||||||||||||
Ids proceed by an (*) indicate the variables directly used to generate the score
WHOQOL questions considered by the model. All questions are answered following a range from 1 to 5, and scores in positive direction, i.e., higher scores denote higher quality of life
| ID | WHOQOL question |
|---|---|
| 1 | How would you rate your quality of life? |
| 2 | How satisfied are you with your health? |
| 5 | How much do you enjoy life? |
| 6 | To what extent do you feel your life to be meaningful? |
| 7 | How well are you able to concentrate? |
| 9 | How healthy is your physical environment? |
| 10 | Do you have enough energy for everyday life? |
| 11 | Are you able to accept your bodily appearance? |
| 12 | Have you enough money to meet your needs? |
| 14 | To what extent do you have the opportunity for leisure activities? |
| 15 | How well are you able to get around? |
| 16 | How satisfied are you with your sleep? |
| 17 | How satisfied are you with your ability to perform your daily living activities? |
| 18 | How satisfied are you with your capacity for work? |
| 19 | How satisfied are you with yourself? |
| 20 | How satisfied are you with your personal relationships? |
| 22 | How satisfied are you with the support you get from your friends? |
| 23 | How satisfied are you with the conditions of your living place? |
| 24 | How satisfied are you with your access to health services? |
| 25 | How satisfied are you with your transport? |
| 26 | How often do you have negative feelings such as blue mood, despair, anxiety, depression? |
BSI topics considered by the model. All questions are answered following a 5-point scale of distress, ranging from “not-at-all” to “extremely”
| ID | BSI factor |
|---|---|
| 1 | Nervousness or shakiness |
| 9 | Thoughts of ending your life |
| 12 | Suddenly scared for no reason |
| 23 | Nausea or upset stomach |
| 37 | Feeling weak in parts of your body |
| 38 | Feeling tense or keyed up |
| 45 | Spells of terror or panic |
| 49 | Feeling so restless you could not sit still |