| Literature DB >> 33172939 |
Yi-Sheng Chao1, Kuan-Fu Lin2, Chao-Jung Wu3, Hsing-Chien Wu4, Hui-Ting Hsu5, Lien-Cheng Tsao6, Yen-Po Cheng6, Yi-Chun Lai7, Wei-Chih Chen8,9,10.
Abstract
OBJECTIVES: Composite diagnostic criteria alone are likely to create and introduce biases into diagnoses that subsequently have poor relationships with input symptoms. This study aims to understand the relationships between the diagnoses and the input symptoms, as well as the magnitudes of biases created by diagnostic criteria and introduced into the diagnoses of mental illnesses with large disease burdens (major depressive episodes, dysthymic disorder, and manic episodes). SETTINGS: General psychiatric care. PARTICIPANTS: Without real-world data available to the public, 100 000 subjects were simulated and the input symptoms were assigned based on the assumed prevalence rates (0.05, 0.1, 0.3, 0.5 and 0.7) and correlations between symptoms (0, 0.1, 0.4, 0.7 and 0.9). The input symptoms were extracted from the diagnostic criteria. The diagnostic criteria were transformed into mathematical equations to demonstrate the sources of biases and convert the input symptoms into diagnoses. PRIMARY AND SECONDARY OUTCOMES: The relationships between the input symptoms and diagnoses were interpreted using forward stepwise linear regressions. Biases due to data censoring or categorisation introduced into the intermediate variables, and the three diagnoses were measured.Entities:
Keywords: bias; forward-stepwise regression; frailty; index mining; the health and retirement study
Mesh:
Year: 2020 PMID: 33172939 PMCID: PMC7656951 DOI: 10.1136/bmjopen-2020-037022
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
The assumptions and parameters in the simulations
| Assumptions | ||
| Equal prevalence rates for the input symptoms of the same diagnosis; presence of input symptoms assigned randomly | ||
| Same correlations between the input symptoms of the diagnoses of major depressive episodes and dysthymic disorder; same correlations between the input symptoms of manic episodes | ||
| The input symptoms of manic episodes created independent of those of major depressive episodes and dysthymic disorder | ||
| Diagnoses made accurately based on the diagnostic criteria and symptoms reported precisely by patients | ||
| Population sizes | 100 000 | |
| Prevalence rates (uniform for all input symptoms in a simulation) | 0.05, 0.1, 0.3, 0.5 and 0.7 | |
| Correlations (uniform between all input symptoms of the same diagnosis in a simulation) | 0, 0.1, 0.4, 0.7 and 0.9 | |
| Number of simulations for each combination of the assumed prevalence rates and between-variable correlations of the input symptoms | 100 |
Figure 1The prevalence rates of an intermediate variable for the diagnosis of major depressive episodes by assumed input symptom prevalence and correlations. The intermediate variable is ‘significant unintentional weight loss or gain’ and the input symptoms are ‘significant unintentional weight loss’ and ‘significant unintentional weight gain.’ The black line represents the situation where the prevalence rates of the input symptoms are the same as that of the intermediate variable. Lines above the black lines have the prevalence rates of the intermediate variable larger than those of the input symptoms. CI, confidence interval.
Figure 2The prevalence rates of dysthymic disorder by assumed input symptom prevalence and correlations. Dysthymic disorder is diagnosed when both the major (depressed mood most of the day for more days than not, for at least 2 years) and minor criteria (at least two of the six items) are met. The black line represents the situation where the prevalence rates of the input symptoms are the same as those of dysthymic disorder. Lines below the black lines have dysthymic disorder prevalence rates lower than those of the input symptoms. CI, confidence interval.
The derived prevalence rates of the diagnoses of major depressive episodes, dysthymic disorder, and manic episodes based on the assumed prevalence rates and between-variable correlations of the input symptoms
| Assumed correlations between input symptoms | Assumed prevalence of input symptoms | Major depressive episodes | Dysthymic disorder | Manic episodes |
| 0.05 | 0 (95% CI 0 to 0) | 0.004 (95% CI 0.004 to 0.004) | 0 (95% CI 0 to 0) | |
| 0.1 | 0.001 (95% CI 0.001 to 0.001) | 0.025 (95% CI 0.025 to 0.025) | 0.002 (95% CI 0.002 to 0.002) | |
| 0.3 | 0.067 (95% CI 0.067 to 0.067) | 0.249 (95% CI 0.249 to 0.249) | 0.136 (95% CI 0.135 to 0.136) | |
| 0.5 | 0.245 (95% CI 0.244 to 0.245) | 0.493 (95% CI 0.493 to 0.493) | 0.436 (95% CI 0.436 to 0.436) | |
| 0.7 | 0.49 (95% CI 0.49 to 0.49) | 0.7 (95% CI 0.7 to 0.7) | 0.692 (95% CI 0.692 to 0.693) | |
| 0.05 | 0.004 (95% CI 0.004 to 0.004) | 0.018 (95% CI 0.018 to 0.018) | 0.007 (95% CI 0.007 to 0.007) | |
| 0.1 | 0.011 (95% CI 0.011 to 0.011) | 0.049 (95% CI 0.049 to 0.049) | 0.022 (95% CI 0.021 to 0.022) | |
| 0.3 | 0.094 (95% CI 0.094 to 0.094) | 0.25 (95% CI 0.25 to 0.25) | 0.172 (95% CI 0.171 to 0.172) | |
| 0.5 | 0.267 (95% CI 0.267 to 0.268) | 0.482 (95% CI 0.482 to 0.482) | 0.425 (95% CI 0.425 to 0.425) | |
| 0.7 | 0.51 (95% CI 0.509 to 0.51) | 0.697 (95% CI 0.697 to 0.697) | 0.679 (95% CI 0.679 to 0.679) | |
| 0.05 | 0.019 (95% CI 0.019 to 0.019) | 0.037 (95% CI 0.037 to 0.037) | 0.029 (95% CI 0.029 to 0.029) | |
| 0.1 | 0.042 (95% CI 0.042 to 0.042) | 0.078 (95% CI 0.078 to 0.078) | 0.062 (95% CI 0.062 to 0.062) | |
| 0.3 | 0.166 (95% CI 0.166 to 0.167) | 0.267 (95% CI 0.267 to 0.267) | 0.231 (95% CI 0.231 to 0.231) | |
| 0.5 | 0.344 (95% CI 0.344 to 0.344) | 0.476 (95% CI 0.476 to 0.476) | 0.44 (95% CI 0.44 to 0.441) | |
| 0.7 | 0.57 (95% CI 0.57 to 0.57) | 0.689 (95% CI 0.688 to 0.689) | 0.666 (95% CI 0.666 to 0.666) | |
| 0.05 | 0.035 (95% CI 0.035 to 0.035) | 0.046 (95% CI 0.046 to 0.046) | 0.042 (95% CI 0.042 to 0.042) | |
| 0.1 | 0.071 (95% CI 0.071 to 0.071) | 0.092 (95% CI 0.092 to 0.092) | 0.085 (95% CI 0.085 to 0.085) | |
| 0.3 | 0.233 (95% CI 0.233 to 0.234) | 0.285 (95% CI 0.285 to 0.285) | 0.27 (95% CI 0.27 to 0.27) | |
| 0.5 | 0.422 (95% CI 0.421 to 0.422) | 0.486 (95% CI 0.485 to 0.486) | 0.469 (95% CI 0.468 to 0.469) | |
| 0.7 | 0.635 (95% CI 0.635 to 0.635) | 0.69 (95% CI 0.69 to 0.691) | 0.678 (95% CI 0.677 to 0.678) | |
| 0.05 | 0.042 (95% CI 0.042 to 0.042) | 0.048 (95% CI 0.048 to 0.048) | 0.046 (95% CI 0.046 to 0.046) | |
| 0.1 | 0.085 (95% CI 0.085 to 0.085) | 0.096 (95% CI 0.096 to 0.097) | 0.093 (95% CI 0.093 to 0.093) | |
| 0.3 | 0.268 (95% CI 0.268 to 0.268) | 0.293 (95% CI 0.293 to 0.293) | 0.286 (95% CI 0.286 to 0.287) | |
| 0.5 | 0.463 (95% CI 0.463 to 0.463) | 0.493 (95% CI 0.492 to 0.493) | 0.485 (95% CI 0.485 to 0.486) | |
| 0.7 | 0.669 (95% CI 0.669 to 0.669) | 0.695 (95% CI 0.694 to 0.695) | 0.688 (95% CI 0.688 to 0.688) |
CI, confidence interval.
Figure 5The approximation of the diagnosis of dysthymic disorder by input symptoms, bias variables and both, measured by R-squared. The diagnosis of dysthymic disorder is approximated by all variable, including input symptoms and bias variables, using forward-stepwise regression. The selection of the variables was determined by adjusted R-squared. Circles are the maximal adjusted R-squared achieved by the regression with input symptoms, bias variables, or both of them. See table 4 for the details in the input symptoms and the bias variables. The assumed correlations between the input symptoms are 0.4 and the assumed prevalence rates of the input symptoms are 0.7 in this figure.
The input symptoms, intermediate variables and bias variables for the diagnosis of manic episodes based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision
| Classification of symptoms | Criterion variable | Major or minor criteria (domains) | Domain variables | Symptoms | Symptom variables | Equations | Approximation | Mechanisms related to introducing biases |
| manic = (1- man_ma1 x | manic=intercept + coef1 x man_ma1+coef2 x man_ma2+coef3 x man_ma3+coef4 x man_mi1+coef5 x man_mi2+coef6 x man_mi3+coef7 x man_mi4+coef8 x man_mi5+coef9 x man_mi6+coef10 x man_mi7+coef11 x man_bias | Multiplication to create the situations where one of the symptom in the major criteria met (union of three binomial variables, such as man_ma1+man_ma2 and man_ma1 x man_ma2), multiplication for the condition of presenting irritable mood (… x man_ma3), and the bias variable (man_bias) equivalent to the residual of the diagnosis not explained by the input symptoms and the bias variables due to censoring; the bias variables introduced by categorising the number of input symptoms confirmed in the minor criteria (man_bias1 and man_bias2) | ||||||
| A distinct period of abnormally and persistently elevated, expansive or irritable mood, lasting at least 1 week (or any duration if hospitalisation is necessary) | ||||||||
| Elevated mood, lasting at least 1 week | man_ma1 | |||||||
| Expansive mood, lasting at least 1 week | man_ma2 | |||||||
| Irritable mood, lasting at least 1 week | man_ma3 | |||||||
| Increased self-esteem or grandiosity | man_mi1 | man_mi1=man_mi1_1+ | Censoring of the sum of multiple input variables | |||||
| Increased self-esteem | man_mi1_1 | |||||||
| Grandiosity | man_mi1_2 | |||||||
| Information of the domain not explained by the input variables | man_mi1_bias | |||||||
| Decreased need for sleep (eg, feels rested after only 3 hours of sleep) | man_mi2 | |||||||
| More talkative than usual or pressure to keep talking | man_mi3 | man_mi3=man_mi3_1+ | Censoring of the sum of multiple input variables | |||||
| More talkative than usual | man_mi3_1 | |||||||
| Pressure to keep talking | man_mi3_2 | |||||||
| Information of the domain not explained by the input variables | man_mi3_bias | |||||||
| Flight of ideas or subjective experience that thoughts are racing | man_mi4 | man_mi4=man_mi4_1+ | Censoring of the sum of multiple input variables | |||||
| Flight of ideas | man_mi4_1 | |||||||
| Subjective experience that thoughts are racing | man_mi4_2 | |||||||
| Information of the domain not explained by the input variables | man_mi4_bias | |||||||
| Distractibility (ie, attention too easily drawn to unimportant or irrelevant external stimuli) | man_mi5 | |||||||
| Increase in goal-directed activity (either socially, at work or school, or sexually) or psychomotor agitation | man_mi6 | man_mi6=man_mi6_1+ | Censoring of the sum of multiple input variables | |||||
| Increase in goal-directed activity | man_mi6_1 | |||||||
| Psychomotor agitation | man_mi6_2 | |||||||
| Information of the domain not explained by the input variables | man_mi6_bias | |||||||
| Excessive involvement in pleasurable activities that have a high potential for painful consequences (eg, engaging in unrestrained buying sprees, sexual indiscretions, or foolish business investments) | man_mi7 | |||||||
| man_bias1 | Bias introduced by categorising the number of input symptoms confirmed in the minor criteria | |||||||
| man_bias2 | Bias introduced by categorising the number of input symptoms confirmed in the minor criteria | |||||||
| man_bias | Information of the diagnosis not explained by the input symptoms and the bias variables generated due to data categorisation, man_bias1 and man_bias2 |
The input symptoms, intermediate variables and bias variables for the diagnosis of major depressive episodes based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision
| Classification of symptoms | Criterion variable | Domains in the major or minor criteria | Domain variables | Symptoms | Symptom variables | Equations to derive diagnosis or domain variables | Approximation by linear regression | Mechanisms related to introducing biases |
| mde=mde_ma1 x | mde=intercept + coef1 x mde_ma1+coef2 x mde_ma2+coef3 x mde_mi3+coef4 x mde_mi4+coef5 x mde_mi5+coef6 x mde_mi6+coef7 x mde_mi7+coef8 x mde_mi8+coef9 x mde_mi9+coef10 x mde_bias | Multiplication to create the situations when one or two symptoms in the major criteria confirmed and the bias (mde_bias) calculated by extracting the information of the diagnosis not explained by the input symptoms and two bias variables generated by censoring (mde_bias1 and mde_bias2) Categorising of the sum of the input symptoms in the minor criteria at the threshold of three or four (mde_bias1 and mde_bias2) | ||||||
| Depressed mood or a loss of interest or pleasure in daily activities for more than 2 weeks. | ||||||||
| Depressed mood for more than 2 weeks. | mde_ma1 | |||||||
| Loss of interest or pleasure in daily activities for more than 2 weeks. | mde_ma2 | |||||||
| mde_mi | ||||||||
| Significant unintentional weight loss or gain | mde_mi3 | mde_mi3=mde_mi3_1+ | Censoring of the sum of multiple input variables | |||||
| Significant unintentional weight gain | mde_mi3_1 | |||||||
| Significant unintentional weight loss | mde_mi3_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi3_bias | |||||||
| Insomnia or sleeping too much | mde_mi4 | mde_mi4=mde_mi4_1+ | Censoring of the sum of multiple input variables | |||||
| Insomnia | mde_mi4_1 | |||||||
| Sleeping too much | mde_mi4_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi4_bias | |||||||
| Agitation or psychomotor retardation noticed by others | mde_mi5 | mde_mi5=mde_mi5_1+ | Censoring of the sum of multiple input variables | |||||
| Agitation | mde_mi5_1 | |||||||
| Psychomotor retardation noticed by others | mde_mi5_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi5_bias | |||||||
| Fatigue or loss of energy | mde_mi6 | mde_mi6=mde_mi6_1+ | Censoring of the sum of multiple input variables | |||||
| Fatigue | mde_mi6_1 | |||||||
| Loss of energy | mde_mi6_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi6_bias | |||||||
| Feelings of worthlessness or excessive guilt | mde_mi7 | mde_mi7=mde_mi7_1+ | Censoring of the sum of multiple input variables | |||||
| Feelings of worthlessness | mde_mi7_1 | |||||||
| Feelings of excessive guilt | mde_mi7_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi7_bias | |||||||
| Diminished ability to think or concentrate, or indecisiveness | mde_mi8 | mde_mi8=mde_mi8_1+ | Censoring of the sum of multiple input variables | |||||
| Diminished ability to think or concentrate | mde_mi8_1 | |||||||
| Indecisiveness | mde_mi8_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi8_bias | |||||||
| Recurrent thoughts of death | mde_mi9 | |||||||
| mde_bias1 | Bias introduced to categorise the sum of the number of confirmed symptoms in the minor criteria | |||||||
| mde_bias2 | Bias introduced to categorise the sum of the number of confirmed symptoms in the minor criteria | |||||||
| mde_bias | Information of the diagnosis not explained by the input variables and two bias variables generated due to data categorisation |
The individual input symptoms that best explained the diagnoses based on adjusted R-squared: major depressive episodes, dysthymic disorder and manic episodes by assumed input symptom prevalence and correlations
| Assumed correlations between input symptoms | Assumed prevalence of input symptoms | Major depressive episodes | Dysthymic disorder | Manic episodes |
| 0.05 | mde_ma1 | dys_ma | man_ma3 | |
| 0.05 | 0.001 (95% CI 0.001 to 0.001) | 0.076 (95% CI 0.075 to 0.077) | 0.002 (95% CI 0.002 to 0.002) | |
| 0.1 | mde_ma1 | dys_ma | man_ma3 | |
| 0.1 | 0.01 (95% CI 0.01 to 0.01) | 0.228 (95% CI 0.227 to 0.229) | 0.021 (95% CI 0.02 to 0.021) | |
| 0.3 | mde_ma1 | dys_ma | man_ma3 | |
| 0.3 | 0.167 (95% CI 0.167 to 0.167) | 0.774 (95% CI 0.773 to 0.774) | 0.366 (95% CI 0.366 to 0.367) | |
| 0.5 | mde_ma2 | dys_ma | man_ma3 | |
| 0.5 | 0.324 (95% CI 0.324 to 0.325) | 0.971 (95% CI 0.971 to 0.971) | 0.773 (95% CI 0.772 to 0.773) | |
| 0.7 | mde_ma2 | dys_ma | man_ma3 | |
| 0.7 | 0.412 (95% CI 0.412 to 0.412) | 0.999 (95% CI 0.999 to 0.999) | 0.964 (95% CI 0.964 to 0.964) | |
| 0.05 | mde_ma2 | dys_ma | man_ma3 | |
| 0.05 | 0.07 (95% CI 0.07 to 0.071) | 0.353 (95% CI 0.352 to 0.355) | 0.136 (95% CI 0.135 to 0.137) | |
| 0.1 | mde_ma1 | dys_ma | man_ma3 | |
| 0.1 | 0.101 (95% CI 0.1 to 0.101) | 0.462 (95% CI 0.461 to 0.463) | 0.199 (95% CI 0.198 to 0.199) | |
| 0.3 | mde_ma2 | dys_ma | man_ma3 | |
| 0.3 | 0.242 (95% CI 0.242 to 0.243) | 0.777 (95% CI 0.777 to 0.778) | 0.483 (95% CI 0.483 to 0.484) | |
| 0.5 | mde_ma2 | dys_ma | man_ma3 | |
| 0.5 | 0.365 (95% CI 0.365 to 0.366) | 0.932 (95% CI 0.931 to 0.932) | 0.74 (95% CI 0.74 to 0.741) | |
| 0.7 | mde_ma2 | dys_ma | man_ma3 | |
| 0.7 | 0.445 (95% CI 0.445 to 0.446) | 0.986 (95% CI 0.986 to 0.986) | 0.906 (95% CI 0.906 to 0.907) | |
| 0.05 | mde_ma1 | dys_ma | man_ma3 | |
| 0.05 | 0.375 (95% CI 0.373 to 0.376) | 0.731 (95% CI 0.729 to 0.732) | 0.561 (95% CI 0.559 to 0.562) | |
| 0.1 | mde_ma1 | dys_ma | man_ma3 | |
| 0.1 | 0.395 (95% CI 0.394 to 0.396) | 0.763 (95% CI 0.762 to 0.764) | 0.595 (95% CI 0.594 to 0.596) | |
| 0.3 | mde_ma1 | dys_ma | man_ma3 | |
| 0.3 | 0.465 (95% CI 0.465 to 0.466) | 0.851 (95% CI 0.85 to 0.851) | 0.701 (95% CI 0.701 to 0.702) | |
| 0.5 | mde_ma2 | dys_ma | man_ma3 | |
| 0.5 | 0.525 (95% CI 0.524 to 0.525) | 0.908 (95% CI 0.908 to 0.908) | 0.787 (95% CI 0.786 to 0.787) | |
| 0.7 | mde_ma2 | dys_ma | man_ma3 | |
| 0.7 | 0.568 (95% CI 0.568 to 0.569) | 0.946 (95% CI 0.946 to 0.947) | 0.855 (95% CI 0.854 to 0.855) | |
| 0.05 | mde_ma2 | dys_ma | man_ma3 | |
| 0.05 | 0.688 (95% CI 0.687 to 0.69) | 0.909 (95% CI 0.908 to 0.909) | 0.831 (95% CI 0.83 to 0.832) | |
| 0.1 | mde_ma1 | dys_ma | man_ma3 | |
| 0.1 | 0.688 (95% CI 0.687 to 0.689) | 0.912 (95% CI 0.911 to 0.913) | 0.836 (95% CI 0.835 to 0.836) | |
| 0.3 | mde_ma2 | dys_ma | man_ma3 | |
| 0.3 | 0.71 (95% CI 0.709 to 0.711) | 0.93 (95% CI 0.93 to 0.93) | 0.862 (95% CI 0.861 to 0.862) | |
| 0.5 | mde_ma2 | dys_ma | man_ma3 | |
| 0.5 | 0.729 (95% CI 0.728 to 0.729) | 0.944 (95% CI 0.943 to 0.944) | 0.882 (95% CI 0.882 to 0.883) | |
| 0.7 | mde_ma1 | dys_ma | man_ma3 | |
| 0.7 | 0.745 (95% CI 0.744 to 0.745) | 0.954 (95% CI 0.954 to 0.955) | 0.9 (95% CI 0.9 to 0.9) | |
| 0.05 | mde_ma1 | dys_ma | man_ma3 | |
| 0.05 | 0.828 (95% CI 0.827 to 0.829) | 0.958 (95% CI 0.957 to 0.958) | 0.918 (95% CI 0.917 to 0.919) | |
| 0.1 | mde_ma2 | dys_ma | man_ma3 | |
| 0.1 | 0.838 (95% CI 0.838 to 0.839) | 0.961 (95% CI 0.961 to 0.961) | 0.925 (95% CI 0.924 to 0.925) | |
| 0.3 | mde_ma2 | dys_ma | man_ma3 | |
| 0.3 | 0.856 (95% CI 0.856 to 0.857) | 0.969 (95% CI 0.968 to 0.969) | 0.937 (95% CI 0.936 to 0.937) | |
| 0.5 | mde_ma2 | dys_ma | man_ma3 | |
| 0.5 | 0.862 (95% CI 0.862 to 0.863) | 0.972 (95% CI 0.972 to 0.972) | 0.942 (95% CI 0.942 to 0.943) | |
| 0.7 | mde_ma2 | dys_ma | man_ma3 | |
| 0.7 | 0.865 (95% CI 0.865 to 0.866) | 0.974 (95% CI 0.974 to 0.974) | 0.946 (95% CI 0.946 to 0.946) |
See table 2 to 4 for variable definitions. Adjusted R-squared is derived from linear regressions using individual input symptoms as predictor with 95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and correlations.
CI, confidence interval.
The input symptoms, intermediate variables and bias variables for the diagnosis of dysthymic disorder based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision
| Classification of symptoms | Criterion variable | Major or minor criteria (domains) | Intermediate variables | Symptoms | Symptom variables | Equations to generate diagnosis or domain variables | Approximation | Mechanisms related to introducing biases |
| dys=dys_ma x dys_mi | dys=intercept + coef1 x dys_ma+coef2 x dys_mi+coef3 x dys_bias | Multiplication to create the situations where both the major and minor criteria met (union of two binomial variables, mde_ma x mde_mi) and the bias variable (dys_bias) equivalent to the residual of the diagnosis not explained by the input symptoms and the bias variables due to censoring and categorisation | ||||||
| Depressed mood most of the day for more days than not, for at least 2 years | dys_ma | |||||||
| dys_mi | dys_mi=dys_mi1+dys_mi2+ | Categorising of the sum of multiple input variables | ||||||
| Poor appetite or overeating | dys_mi1 | dys_mi1=dys_mi1_1+ | Censoring of the sum of multiple input variables | |||||
| Poor appetite | dys_mi1_1 | |||||||
| Overeating | dys_mi1_2 | |||||||
| Information of the domain not explained by the input variables | dys_mi1_bias | |||||||
| Insomnia or sleeping too much* | dys_mi2/mde_mi4 | dys_mi2=mde_mi4= | Censoring of the sum of multiple input variables | |||||
| Insomnia | mde_mi4_1 | |||||||
| Sleeping too much | mde_mi4_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi4_bias | |||||||
| Low energy or fatigue* | dys_mi3/mde_mi6 | dys_mi3=mde_mi6= | Censoring of the sum of multiple input variables | |||||
| Fatigue | mde_mi6_1 | |||||||
| Loss of energy (low energy) | mde_mi6_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi6_bias | |||||||
| Low self-esteem | dys_mi4 | |||||||
| Poor concentration or difficulty making decisions* | dys_mi5/mde_mi8 | dys_mi5=mde_mi8= | Censoring of the sum of multiple input variables | |||||
| Diminished ability to think or concentrate (Poor concentration) | mde_mi8_1 | |||||||
| difficulty making decisions (indecisiveness) | mde_mi8_2 | |||||||
| Information of the domain not explained by the input variables | mde_mi8_bias | |||||||
| Feelings of hopelessness | dys_mi6 | |||||||
| Information of minor criteria not explained by input variables | dys_mi_bias | Bias introduced by categorising the number of input symptoms confirmed in the minor criteria | ||||||
| dys_bias | Information of the diagnosis not explained by the input symptoms and the bias variables generated due to data categorisation (dys_mi_bias) |
*The input symptoms used for the diagnosis of both major depressive episodes and dysthymic disorder.
The individual bias variables that best explained the diagnoses based on adjusted R-squared: major depressive episodes, dysthymic disorder and manic episodes by assumed input symptom prevalence and correlations
| Assumed correlations between input symptoms | Assumed prevalence of input symptoms | Major depressive episodes | Dysthymic disorder | Manic episodes |
| 0.05 | mde_bias2 | dys_bias | man_bias2 | |
| 0.05 | 0 (95% CI 0 to 0) | 0.028 (95% CI 0.028 to 0.028) | 0.001 (95% CI 0.001 to 0.001) | |
| 0.1 | mde_bias2 | dys_bias | man_bias2 | |
| 0.1 | 0.004 (95% CI 0.004 to 0.004) | 0.053 (95% CI 0.053 to 0.054) | 0.011 (95% CI 0.011 to 0.011) | |
| 0.3 | mde_bias2 | dys_bias | man_bias1 | |
| 0.3 | 0.015 (95% CI 0.015 to 0.015) | 0.045 (95% CI 0.045 to 0.045) | 0.089 (95% CI 0.089 to 0.09) | |
| 0.5 | mde_bias | dys_bias | man_bias1 | |
| 0.5 | 0.013 (95% CI 0.013 to 0.014) | 0.007 (95% CI 0.007 to 0.007) | 0.035 (95% CI 0.034 to 0.035) | |
| 0.7 | mde_bias | dys_bias | man_bias1 | |
| 0.7 | 0.01 (95% CI 0.01 to 0.01) | 0 (95% CI 0 to 0) | 0.002 (95% CI 0.002 to 0.002) | |
| 0.05 | mde_bias2 | dys_bias | man_bias1 | |
| 0.05 | 0.037 (95% CI 0.037 to 0.037) | 0.113 (95% CI 0.113 to 0.114) | 0.083 (95% CI 0.083 to 0.084) | |
| 0.1 | mde_bias2 | dys_bias | man_bias1 | |
| 0.1 | 0.047 (95% CI 0.047 to 0.048) | 0.122 (95% CI 0.121 to 0.122) | 0.116 (95% CI 0.115 to 0.116) | |
| 0.3 | mde_bias2 | dys_mi_bias | man_bias1 | |
| 0.3 | 0.077 (95% CI 0.077 to 0.077) | 0.105 (95% CI 0.105 to 0.106) | 0.198 (95% CI 0.197 to 0.198) | |
| 0.5 | mde_bias2 | dys_mi_bias | man_bias1 | |
| 0.5 | 0.079 (95% CI 0.079 to 0.08) | 0.073 (95% CI 0.073 to 0.073) | 0.166 (95% CI 0.166 to 0.167) | |
| 0.7 | mde_bias2 | dys_mi_bias | man_bias1 | |
| 0.7 | 0.065 (95% CI 0.065 to 0.065) | 0.047 (95% CI 0.046 to 0.047) | 0.094 (95% CI 0.093 to 0.094) | |
| 0.05 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.05 | 0.294 (95% CI 0.293 to 0.295) | 0.415 (95% CI 0.413 to 0.416) | 0.432 (95% CI 0.431 to 0.433) | |
| 0.1 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.1 | 0.304 (95% CI 0.303 to 0.304) | 0.419 (95% CI 0.418 to 0.42) | 0.445 (95% CI 0.444 to 0.445) | |
| 0.3 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.3 | 0.335 (95% CI 0.334 to 0.335) | 0.411 (95% CI 0.411 to 0.412) | 0.473 (95% CI 0.472 to 0.473) | |
| 0.5 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.5 | 0.354 (95% CI 0.354 to 0.355) | 0.395 (95% CI 0.395 to 0.396) | 0.475 (95% CI 0.474 to 0.475) | |
| 0.7 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.7 | 0.356 (95% CI 0.355 to 0.356) | 0.367 (95% CI 0.366 to 0.367) | 0.451 (95% CI 0.45 to 0.451) | |
| 0.05 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.05 | 0.616 (95% CI 0.615 to 0.617) | 0.705 (95% CI 0.704 to 0.706) | 0.723 (95% CI 0.722 to 0.724) | |
| 0.1 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.1 | 0.611 (95% CI 0.611 to 0.612) | 0.699 (95% CI 0.698 to 0.699) | 0.72 (95% CI 0.72 to 0.721) | |
| 0.3 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.3 | 0.623 (95% CI 0.623 to 0.624) | 0.699 (95% CI 0.699 to 0.7) | 0.728 (95% CI 0.728 to 0.729) | |
| 0.5 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.5 | 0.632 (95% CI 0.632 to 0.633) | 0.696 (95% CI 0.696 to 0.697) | 0.731 (95% CI 0.731 to 0.732) | |
| 0.7 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.7 | 0.639 (95% CI 0.638 to 0.639) | 0.693 (95% CI 0.692 to 0.693) | 0.732 (95% CI 0.731 to 0.732) | |
| 0.05 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.05 | 0.777 (95% CI 0.776 to 0.778) | 0.835 (95% CI 0.834 to 0.835) | 0.847 (95% CI 0.847 to 0.848) | |
| 0.1 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.1 | 0.788 (95% CI 0.788 to 0.789) | 0.842 (95% CI 0.841 to 0.843) | 0.855 (95% CI 0.854 to 0.855) | |
| 0.3 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.3 | 0.807 (95% CI 0.806 to 0.807) | 0.854 (95% CI 0.853 to 0.854) | 0.867 (95% CI 0.867 to 0.868) | |
| 0.5 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.5 | 0.811 (95% CI 0.811 to 0.811) | 0.855 (95% CI 0.855 to 0.856) | 0.87 (95% CI 0.87 to 0.871) | |
| 0.7 | mde_bias1 | dys_mi_bias | man_bias1 | |
| 0.7 | 0.812 (95% CI 0.811 to 0.812) | 0.853 (95% CI 0.853 to 0.853) | 0.869 (95% CI 0.869 to 0.87) |
See table 2 to 4 for variable definitions. Adjusted R-squared is derived from linear regressions using individual bias variables as predictor with 95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and correlations.
CI, confidence interval.
Approximating the diagnoses using input symptoms and derived adjusted R-squared
| Assumed correlations between input symptoms | Assumed prevalence of input symptoms | Major depressive episodes | Dysthymic disorder | Manic episodes |
| 0.05 | 0.003 (95% CI 0.002 to 0.003) | 0.122 (95% CI 0.121 to 0.123) | 0.004 (95% CI 0.004 to 0.005) | |
| 0.1 | 0.024 (95% CI 0.023 to 0.024) | 0.305 (95% CI 0.304 to 0.306) | 0.039 (95% CI 0.038 to 0.039) | |
| 0.3 | 0.348 (95% CI 0.348 to 0.349) | 0.842 (95% CI 0.841 to 0.842) | 0.483 (95% CI 0.482 to 0.483) | |
| 0.5 | 0.649 (95% CI 0.649 to 0.649) | 0.986 (95% CI 0.986 to 0.986) | 0.817 (95% CI 0.817 to 0.817) | |
| 0.7 | 0.823 (95% CI 0.823 to 0.823) | 1 (95% CI 1 to 1) | 0.967 (95% CI 0.967 to 0.967) | |
| 0.05 | 0.143 (95% CI 0.141 to 0.144) | 0.435 (95% CI 0.433 to 0.436) | 0.212 (95% CI 0.211 to 0.213) | |
| 0.1 | 0.198 (95% CI 0.197 to 0.199) | 0.539 (95% CI 0.538 to 0.54) | 0.29 (95% CI 0.289 to 0.291) | |
| 0.3 | 0.45 (95% CI 0.45 to 0.451) | 0.826 (95% CI 0.826 to 0.827) | 0.588 (95% CI 0.588 to 0.589) | |
| 0.5 | 0.663 (95% CI 0.663 to 0.664) | 0.952 (95% CI 0.952 to 0.952) | 0.799 (95% CI 0.799 to 0.799) | |
| 0.7 | 0.809 (95% CI 0.809 to 0.809) | 0.991 (95% CI 0.991 to 0.991) | 0.922 (95% CI 0.922 to 0.922) | |
| 0.05 | 0.587 (95% CI 0.585 to 0.588) | 0.782 (95% CI 0.781 to 0.783) | 0.675 (95% CI 0.674 to 0.676) | |
| 0.1 | 0.607 (95% CI 0.606 to 0.608) | 0.807 (95% CI 0.807 to 0.808) | 0.698 (95% CI 0.697 to 0.698) | |
| 0.3 | 0.688 (95% CI 0.688 to 0.689) | 0.878 (95% CI 0.877 to 0.878) | 0.775 (95% CI 0.774 to 0.775) | |
| 0.5 | 0.761 (95% CI 0.761 to 0.762) | 0.925 (95% CI 0.924 to 0.925) | 0.838 (95% CI 0.838 to 0.838) | |
| 0.7 | 0.821 (95% CI 0.821 to 0.822) | 0.956 (95% CI 0.956 to 0.956) | 0.887 (95% CI 0.887 to 0.888) | |
| 0.05 | 0.813 (95% CI 0.812 to 0.814) | 0.925 (95% CI 0.925 to 0.926) | 0.877 (95% CI 0.877 to 0.878) | |
| 0.1 | 0.826 (95% CI 0.826 to 0.827) | 0.928 (95% CI 0.927 to 0.928) | 0.881 (95% CI 0.881 to 0.882) | |
| 0.3 | 0.86 (95% CI 0.86 to 0.86) | 0.942 (95% CI 0.942 to 0.942) | 0.9 (95% CI 0.9 to 0.9) | |
| 0.5 | 0.88 (95% CI 0.88 to 0.88) | 0.953 (95% CI 0.953 to 0.953) | 0.913 (95% CI 0.913 to 0.913) | |
| 0.7 | 0.895 (95% CI 0.895 to 0.895) | 0.962 (95% CI 0.962 to 0.962) | 0.925 (95% CI 0.925 to 0.925) | |
| 0.05 | 0.903 (95% CI 0.903 to 0.904) | 0.965 (95% CI 0.965 to 0.966) | 0.941 (95% CI 0.94 to 0.941) | |
| 0.1 | 0.91 (95% CI 0.91 to 0.911) | 0.968 (95% CI 0.968 to 0.968) | 0.945 (95% CI 0.945 to 0.945) | |
| 0.3 | 0.923 (95% CI 0.923 to 0.923) | 0.974 (95% CI 0.974 to 0.974) | 0.954 (95% CI 0.953 to 0.954) | |
| 0.5 | 0.928 (95% CI 0.928 to 0.928) | 0.976 (95% CI 0.976 to 0.977) | 0.958 (95% CI 0.957 to 0.958) | |
| 0.7 | 0.932 (95% CI 0.932 to 0.932) | 0.978 (95% CI 0.978 to 0.978) | 0.96 (95% CI 0.96 to 0.96) |
Adjusted R-squared is the maximal values from the forward-stepwise linear regressions using all input symptoms as candidate predictors with 95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and correlations.
CI, confidence interval.
Approximating the diagnoses using bias variables and derived R-squared
| Assumed correlations between input symptoms | Assumed prevalence of input symptoms | Major depressive episodes | Dysthymic disorder | Manic episodes |
| 0.05 | 0.003 (95% CI 0.002 to 0.003) | 0.029 (95% CI 0.029 to 0.03) | 0.004 (95% CI 0.004 to 0.004) | |
| 0.1 | 0.013 (95% CI 0.012 to 0.013) | 0.056 (95% CI 0.056 to 0.056) | 0.017 (95% CI 0.017 to 0.017) | |
| 0.3 | 0.083 (95% CI 0.083 to 0.083) | 0.047 (95% CI 0.047 to 0.047) | 0.098 (95% CI 0.098 to 0.099) | |
| 0.5 | 0.111 (95% CI 0.111 to 0.112) | 0.007 (95% CI 0.007 to 0.007) | 0.039 (95% CI 0.038 to 0.039) | |
| 0.7 | 0.095 (95% CI 0.095 to 0.095) | 0 (95% CI 0 to 0) | 0.012 (95% CI 0.012 to 0.013) | |
| 0.05 | 0.083 (95% CI 0.082 to 0.084) | 0.145 (95% CI 0.144 to 0.146) | 0.126 (95% CI 0.125 to 0.127) | |
| 0.1 | 0.096 (95% CI 0.095 to 0.097) | 0.156 (95% CI 0.155 to 0.156) | 0.154 (95% CI 0.153 to 0.154) | |
| 0.3 | 0.145 (95% CI 0.144 to 0.145) | 0.139 (95% CI 0.138 to 0.139) | 0.216 (95% CI 0.216 to 0.216) | |
| 0.5 | 0.172 (95% CI 0.172 to 0.173) | 0.097 (95% CI 0.097 to 0.097) | 0.182 (95% CI 0.181 to 0.182) | |
| 0.7 | 0.175 (95% CI 0.175 to 0.175) | 0.065 (95% CI 0.064 to 0.065) | 0.115 (95% CI 0.115 to 0.116) | |
| 0.05 | 0.421 (95% CI 0.419 to 0.423) | 0.455 (95% CI 0.453 to 0.456) | 0.505 (95% CI 0.504 to 0.506) | |
| 0.1 | 0.422 (95% CI 0.421 to 0.423) | 0.454 (95% CI 0.453 to 0.455) | 0.507 (95% CI 0.506 to 0.508) | |
| 0.3 | 0.435 (95% CI 0.434 to 0.435) | 0.442 (95% CI 0.442 to 0.443) | 0.512 (95% CI 0.512 to 0.513) | |
| 0.5 | 0.452 (95% CI 0.452 to 0.453) | 0.427 (95% CI 0.427 to 0.427) | 0.506 (95% CI 0.505 to 0.506) | |
| 0.7 | 0.46 (95% CI 0.459 to 0.46) | 0.403 (95% CI 0.402 to 0.403) | 0.481 (95% CI 0.481 to 0.482) | |
| 0.05 | 0.728 (95% CI 0.727 to 0.729) | 0.729 (95% CI 0.728 to 0.731) | 0.764 (95% CI 0.763 to 0.765) | |
| 0.1 | 0.722 (95% CI 0.721 to 0.723) | 0.723 (95% CI 0.722 to 0.724) | 0.76 (95% CI 0.759 to 0.761) | |
| 0.3 | 0.726 (95% CI 0.726 to 0.727) | 0.722 (95% CI 0.722 to 0.723) | 0.761 (95% CI 0.761 to 0.762) | |
| 0.5 | 0.732 (95% CI 0.731 to 0.732) | 0.72 (95% CI 0.719 to 0.72) | 0.76 (95% CI 0.76 to 0.761) | |
| 0.7 | 0.737 (95% CI 0.736 to 0.737) | 0.717 (95% CI 0.716 to 0.717) | 0.758 (95% CI 0.758 to 0.759) | |
| 0.05 | 0.852 (95% CI 0.851 to 0.853) | 0.85 (95% CI 0.849 to 0.851) | 0.871 (95% CI 0.871 to 0.872) | |
| 0.1 | 0.86 (95% CI 0.859 to 0.861) | 0.857 (95% CI 0.856 to 0.857) | 0.876 (95% CI 0.876 to 0.877) | |
| 0.3 | 0.872 (95% CI 0.871 to 0.872) | 0.867 (95% CI 0.867 to 0.868) | 0.886 (95% CI 0.886 to 0.886) | |
| 0.5 | 0.874 (95% CI 0.874 to 0.875) | 0.869 (95% CI 0.868 to 0.869) | 0.888 (95% CI 0.887 to 0.888) | |
| 0.7 | 0.874 (95% CI 0.874 to 0.875) | 0.867 (95% CI 0.866 to 0.867) | 0.886 (95% CI 0.886 to 0.886) |
Adjusted R-squared is the maximal values from the forward-stepwise linear regressions using all bias variables as candidate predictors with 95% confidence intervals (CIs) derived from 100 simulations for each combination of assumed input symptom prevalence and correlations.
CI, confidence interval.