| Literature DB >> 26496707 |
Alexander J Millner1, Michael D Lee2, Matthew K Nock1.
Abstract
Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single-items. We found that 11.3% of participants that endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.Entities:
Mesh:
Year: 2015 PMID: 26496707 PMCID: PMC4619664 DOI: 10.1371/journal.pone.0141606
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic information.
| Age |
|
|
|---|---|---|
| 33.9 | 12.8 | |
|
|
| |
| 30.0 | 18–79 | |
|
|
|
|
| Female | 1203 | 74.4 |
| Male | 398 | 24.6 |
| Other | 17 | 1.1 |
|
|
|
|
| Caucasian | 1214 | 75.0 |
| Other or Mixed Race | 134 | 8.3 |
| Race No Response | 126 | 7.8 |
| African American | 107 | 6.6 |
| Native American/Alaska Native | 26 | 1.6 |
| Native Hawaiian/Pacific Islander | 11 | 0.7 |
|
|
|
|
| Not Latino | 1404 | 86.8 |
| Latino | 180 | 11.1 |
| Ethnicity No Response | 34 | 2.1 |
|
|
|
|
| Northeast (10 States) | 444 | 27.4 |
| Midwest (11 States) | 341 | 21.1 |
| West (10 States) | 336 | 20.8 |
| Southeast (13 States) | 261 | 16.1 |
| Southwest (4 States) | 221 | 13.7 |
| Location No Response | 9 | 0.6 |
| Not from the U.S. | 6 | 0.4 |
Fig 1A flow chart of the online survey.
Gate questions are single-item questions for three suicidal outcomes. Suicide ideation follow-up questions include eight different thoughts and suicide action follow-up questions include five different actions. Participants that denied any suicidal thoughts or behaviors in the first three sections skipped to the end of the survey. Participants that endorsed any suicidal thought or behavior completed several follow-up questions regarding planning behaviors and narrative descriptions of their behavior.
Fig 2A visual example of DV-misclassification relationships and the results of the statistical simulations.
(A) Model data of a “true effect" (B) Model data of a “true null effect.” The red and green arrows in (A) and (B) represent data points that will be misclassified due to participants inaccurately categorizing their suicidal behavior. (A) and (C) show an example of when the misclassification is random (i.e., dependent variable (DV)-misclassification correlation is zero). Because true attempters tend to be higher on the DV, true attempters randomly misclassified into the non-attempters group will tend to increase the group mean of non-attempters (e.g., from 39.57 in (A) to 42.17 in (C)). Likewise, in the example, true non-attempters tend to be lower on the DV and therefore, true non-attempters randomly misclassified into the attempters group will reduce the group mean of attempters (e.g., from 51.63 (A) to 44.03 in (C)). Thus, when there is a true difference between the groups at the population level, random misclassification will cause the two group means to move closer together reducing the power to detect the true effect. (B) and (D) shows an example of when there is a pattern to the misclassification (the DV-misclassification correlation is approximately r = 0.20). In this example, attempters higher on the DV tend to misclassify their behavior, causing an increase in the mean of non-attempters (e.g., from 43.66 in (B) to 52.56 in (D)). Non-attempters that are lower on the DV tend to misclassify their behavior, causing a decrease in the mean of attempters (e.g., from 43.76 in (B) to 34.83 in (D)). Thus, during a “true null effect” when the DV-misclassification correlation is present, misclassification causes the group means to shift further apart, increasing the probability of finding a false significant difference. (E) The outcome of the statistical simulation for data with a “true effect” across different effect sizes (varying sample sizes to maintain statistical power at 0.95). Without a DV-misclassification correlation, power to detect the true effect is reduced by approximately 15% across all effect sizes. Power decreases as the DV-misclassification correlation strengthens in the opposite direction of the true effect, increasing the chance of a false negative result (i.e., Type II error). This reduction in power is greater when the effect size is small and the sample size is large compared to when there are larger effect sizes and smaller sample sizes. (F) The “true null effect” across the same sample sizes as in the true effect simulation (alpha = 0.05). When misclassification is random for a “true null effect,” it does not affect the false positive error rate. As the DV-misclassification correlation strengthens in either direction, chances of a false positive increase (i.e., Type I error). Compared to larger sample sizes, chance of a false positive is greatest when there is a smaller sample size.
Responses to gate and follow-up questions for suicide ideation.
| A. Gate Question: Suicide Ideation | YES | NO | ||
|---|---|---|---|---|
| "Have you ever seriously thought about killing yourself?" (n = 1570) | % | Num | % | Num |
| 78.85 | 1238 | 21.15 | 332 | |
|
| Among those who said YES to Gate | Among those who said NO to Gate | ||
| "I should kill myself" | 74.64 | 924 | 3.31 | 11 |
| "Maybe I should kill myself" | 16.56 | 205 | 10.54 | 35 |
| "I wish I was dead" | 5.57 | 69 | 16.87 | 56 |
| "My life is not worth living" | 0.97 | 12 | 8.43 | 28 |
| "I wish I was never born" | 0.24 | 3 | 3.01 | 10 |
| "I wish I could disappear or not exist" | 0.81 | 10 | 9.94 | 33 |
| "What happens to people when they die?" | 0.89 | 11 | 34.64 | 115 |
| "What will it be like when I die?" | 0.08 | 1 | 6.33 | 21 |
| "No to all follow-up questions" | 0.24 | 3 | 6.93 | 23 |
Responses to gate and follow-up questions for suicide plans.
| B. Gate Question: Suicide Plan | YES | NO | ||
|---|---|---|---|---|
| "Have you ever made a plan to kill yourself?" (n = 263) | % | Ratio | % | Ratio |
| 33.08 | 87/263 | 66.92 | 176/263 | |
|
| Among ideators who said YES to Gate | Among ideators who said NO to Gate | ||
| Thought of a method | 93.10 | 81/87 | 64.77 | 114/176 |
| Thought of a place | 75.86 | 66/87 | 38.07 | 67/176 |
| Preparatory actions | 48.75 | 39/80 | 29.19 | 47/161 |
| Sure of method | 64.20 | 52/81 | 37.72 | 43/114 |
| Sure of place | 61.54 | 40/65 | 40.91 | 27/66 |
| Four or five planning components | 51.72 | 30/58 | 39.50 | 18/48 |
a Participants who responded “No response/Not applicable” or left the item blank were excluded resulting in changing totals (i.e., denominators).
b Only participants that endorsed or denied all 5 planning components were included. Using the full sample in the denominator (i.e., 176), 10.80% of people who denied a plan, endorsed four or five planning components
Fig 3Characteristics of suicide planning among participants who attempted suicide and either endorsed a single-item plan or did not.
For those that endorsed a single-item lifetime suicide plan (blue) and for those that denied a single-item lifetime suicide plan (red), the distributions of (A) the amount of time between thinking of the method and attempting suicide, (B) the decision to attempt and attempting suicide and (C) thinking of place to attempt suicide and attempting suicide and (D) the percentage of participants that made any preparatory actions and the mean number of preparatory actions prior to a suicide attempt. If participants interpreted a “suicide plan” similarly, one would expect minimal overlap between distributions of planning characteristics for single-item “planners” and “non-planners.” Instead, each of the distributions overlap substantially (71–92%) using the bins in the current plot, suggesting participants do not reliably interpret the single-item suicide planning question.
Fig 4Responses to single-item suicide attempt question and the results of coded narrative responses.
Single-item suicide attempt questions result in a 10.7% false positive rate and 6.39% false negate rate for the sample (9.91% false negative rate among ideators).
Classification statistics for the suicide attempt gate and follow-up questions.
| Gate Question | Follow-Up Question | Gate QuestionAmong Ideators | Follow-up Question Among Ideators | |||||
|---|---|---|---|---|---|---|---|---|
|
| 0.89 | (401/451) | 0.93 | (418/451) | 0.90 | (393/439) | 0.94 | (371/395) |
|
| 0.94 | (732/780) | 0.97 | (759/780) | 0.91 | (418/461) | 0.95 | (278/293) |
|
| 0.89 | (401/449) | 0.95 | (418/439) | 0.90 | (393/436) | 0.96 | (371/386) |
|
| 0.94 | (732/782) | 0.96 | (759/792) | 0.90 | (418/464) | 0.92 | (278/302) |
a PPV = Positive Predictive Value
b NPV = Negative Predictive Value
Note: Sensitivity is the proportion of actions coded as a true suicide attempt among people that endorsed a suicide attempt on the gate and follow-up questions. Specificity is the proportion of behaviors coded as not being a suicide attempt among people that denied a suicide attempt on the gate and follow-up questions. Positive predictive value is the proportion of people that endorsed a suicide attempt on the gate and follow-up questions among those that were coded as having attempted suicide. Negative predictive value is the proportion of people that denied a suicide attempt on the gate and follow-up questions among those that were coded as not having attempted suicide. Raw values for the proportions are given in parentheses.