Literature DB >> 26551904

Commentary on: Are we overpathologizing everyday life? A tenable blueprint for behavioral addiction research. The diagnostic pitfalls of surveys: If you score positive on a test of addiction, you still have a good chance not to be addicted.

Aniko Maraz^1,2, Orsolya Király², Zsolt Demetrovics².

Abstract

BACKGROUND AND AIMS: Survey-based studies often fail to take into account the predictive value of a test, in other words, the probability of a person having (or not having) the disease when scoring positive (or negative) on the given screening test.
METHODS: We re-visited the theory and basic calculations of diagnostic accuracy.
RESULTS: In general, the lower the prevalence the worse the predictive value is. When the disorder is relatively rare, a positive test finding is typically not useful in confirming its presence given the high proportion of false positive cases. For example, using the Compulsive Buying Scale (Faber & O'Guinn, 1992) three in four people classified as having compulsive buying disorder will in fact not have the disorder.
CONCLUSIONS: Screening tests are limited to serve as an early detection "gate" and only clinical (interview-based) studies are suitable to claim that a certain behaviour is truly "pathological".

Entities: Disease Gene Species

Keywords: accuracy; assessment; behavioural addiction; diagnosis; negative predictive value; positive predictive value; sensitivity; severity; specificity

Mesh：

Year: 2015 PMID： 26551904 PMCID： PMC4627675 DOI： 10.1556/2006.4.2015.026

Source DB: PubMed Journal: J Behav Addict ISSN： 2062-5871 Impact factor: 6.756

Introduction

We welcome the initiative of Billieux, Schimmenti, Khazaal, Maurage and Heeren (2015) in which they question the clinical validity of certain behaviours that are considered addictions. Hereby, we would like to contribute to this discussion by pointing out an important although often ignored statistical phenomenon closely related to the overpathologising of everyday behaviours: the predictive value of screening tests. Similar to the one carried out by Targhetta, Nalpas and Perney (2013) many studies struggle with the issue of separating “asymptomatic” and “symptomatic” (addicted or disordered) individuals performing a given behaviour. Although inventories are generally developed to provide a close estimate to a clinical test based on self-report, a screening instrument will never have diagnostic validity. But how precise can a screening instrument be compared to a clinical diagnosis?

Specificity, Sensitivity, Positive and Negative Predictive Value

Diagnostic accuracy, originally developed for the evaluation of laboratory screening instruments, is an indicator of the utility of a test (Glaros & Kline, 1988). It is measured by its agreement with a reference or “gold” standard that is the best available indicator of the presence or absence of the condition (Bossuyt et al., 2003). Accuracy is based on four concepts (see Table 1). Sensitivity and specificity provide information about the ability of the test to detect diseased and non-diseased persons correctly. For example, if sensitivity equals 80, it means that out of 100 diseased cases, the test will identify 80 as diseased. A specificity of 80, on the other hand, would mean that out of 100 non-diseased cases the test will identify 80 as negative and 20 as positive (diseased). Sensitivity and specificity are “fixed values” of the test (Streiner, 2003), which means that as long as the test is used in similar samples, these attributes remain the same. Positive and negative predictive value, on the other hand, provide information about the probability of a person having (or not having) the disease when scoring positive (or negative) on the screening test. A positive predictive value (PPV) of 80 means that out of 100 individuals scoring positive on the test, 80 are truly diseased, and 20 are not. A negative predictive value (NPV) of 80 would mean that 80 of 100 will be correctly classified as non-diseased, but 20 diseased individuals will score negative on the screening test. PPV and NPV are not “fixed values” but dependent on the prevalence of the disease in the sample where the screening test is administered (Streiner, 2003).

Table 1.

Calculation of accuracy

	Diseased	Non-diseased
Screened +	True positive (TP)	False positive (FP)	All positive (AP)
Screened −	False negative (FN)	True negative (TN)	All negative (AN)
	All diseased (AD)	All non-diseased (AnD)

Ideally, the number of true positive (truly diseased cases scoring positive on the screening test) and true negative cases (non-diseased cases scoring negative on the test) are both high and the number of false positive (cases that score positive although truly non-diseased) and false negative cases (who score negative although truly diseased) are both kept to minimum. This yields the best accuracy of the screening test. When the prevalence is kept constant, then sensitivity, specificity, PPV and NPV values are interdependent. In general, the lower the cut-off value on a given instrument, the higher the number of true positive cases, and the higher the number of false positive cases as well. This leads to higher sensitivity but lower specificity and PPV. Another general tendency is that when the prevalence is high then the proportion of false negatives may also be high, and when the prevalence of the disease is low then the proportion of false positives tends to be high (Streiner, 2003), which is generally the case with behavioural addictions. Thus, in order to calculate the probability of the disease given a positive test result one has to consider the a priori (antecedent) probability which is the prevalence rate (for the Bayesian approach of the calculations see: Meehl & Rosen, 1955). Calculation of accuracy Note: Sensitivity = TP/AD, Specificity = TN/AnD, Positive Predictive Value = TP/AP, Negative Predictive Value = TN/AN, Accuracy (or Efficiency) = (TP + TN)/total

Examples

The question arises: given a positive test, what is the probability that the individual truly has the given disorder? A few examples are shown in Figure 1. Note that as the prevalence drops, so does the PPV (whereas the proportion of false positives increases).

Figure 1.

Positive Predictive Value of actual and hypothetical instruments depending on prevalence

Notes: Sens = sensitivity, Spec = specificity. Positive Predictive Value = the probability of a person having the disease when scoring positive on the screening test.

As it appears in Figure 1, even when specificity and sensitivity are both at 99%, given a prevalence of 1%, the individual has a 50% chance of not having the disease when the screening is positive. But screening instruments usually have much lower sensitivity and specificity values than 99%. Positive Predictive Value of actual and hypothetical instruments depending on prevalence Notes: Sens = sensitivity, Spec = specificity. Positive Predictive Value = the probability of a person having the disease when scoring positive on the screening test. One of the most widely used tests to measure compulsive buying behaviour is the Compulsive Buying Scale (CBS) by Faber and O’Guinn (1992). Using a group of self-identified compulsive buyers as the criterion group, the authors reported a sensitivity of 89.8% and specificity of 85.3% for the CBS. According to a recent meta-analysis (Maraz, Griffiths & Demetrovics, 2015) the pooled prevalence of compulsive buying is 4.9%. This means that out of those scoring negative, 99% are probably non-diseased, but of those that score positive for compulsive buying, only 24% would probably be truly diseased. Although the test is unlikely to miss a pathological case, three in four people classified as having compulsive buying disorder will in fact not have the disorder. Other instruments have an even lower predictive value. For example, one of the few clinically validated Internet addiction measures is the Scale for the Assessment of Internet (and Computer Game) Addiction by Müller, Beutel and Wölfling (2014). This instrument was validated on a sample of 221 treatment seeking, clinically diagnosed problematic Internet users for which the authors reported a test sensitivity of 80.5% and a specificity of 82.4%. Using the same instrument, the authors conducted a population-based survey and reported a prevalence rate of 2.1% for Internet addiction (Müller, Glaesmer, Brähler, Woelfling & Beutel, 2014). Based on this prevalence rate, NPV is nearly perfect (99%), however, PPV is only 8.9% (for the exact calculations see the Appendix). This means that out of those scoring positive on the test, only 8.9% has the correct classification. Thus out of a 100 individuals screened positive for Internet addiction, only 9 will truly have the disease, and 91 will be misclassified.

Further Challenges

A critical point in the test accuracy is the criteria or “gold standard” that the inventory is assessed against. Technically, if the individual scores positive on the compulsive buying scale, then he or she has 24% chance of being a self-identified compulsive buyer, because this was the “gold standard” against which specificity and sensitivity were tested. Thus it is paramount to test inventories against clinical criteria to provide a sensible estimate of the extent of the given behavioural addiction. Establishing an “external criteria” for addiction is another challenge. Unlike substance-related disorders, complete abstinence is often impossible and indicators of pathology are difficult to define. This is especially the case with the “innovative yet absurd addictive disorders” – as Billieux et al. (2015) state – such as tango addiction (Targhetta et al., 2013), tanning addiction (Kourosh, Harrington & Adinoff, 2010), study addiction (Atroszko, Andreassen, Griffiths & Pallesen, 2015) or “research addiction” from Billieux et al. (2015). From a statistical point of view, an instrument that has not been tested against a clinically valid (diagnosed) group is unsuitable to assess the disorder.

Conclusions and Future Recommendations

The accuracy model was initially developed for medical purposes where (1) there is usually a clear criteria of what constitutes problematic and (2) the cost of misclassification is relatively low. Classifying 100 individuals as “positive” and referring them to further tests is more reasonable than missing one person who might suffer from serious consequences if the early signs of the disease are missed. But is the same logic true for behavioural “addicts”? Even if the cost of missing a case is the same, the cost of misdiagnosing is certainly higher compared to medical conditions given the scaremongering of the media that often exaggerates the impact of high prevalence estimates by presenting certain behaviours – such as using the Internet – as inherently dangerous. As a consequence, the moral panic may create unnecessary conflicts in families. Low PPVs contribute to overpathologising everyday behaviours because the proportion of truly diseased people is much lower than the proportion of those scoring positive on a screening test. When the disorder is relatively rare, a positive test finding is typically not useful in confirming its presence given the high proportion of false positive cases. When the prevalence is low, a test is best used to rule out a condition but not to rule it in (Streiner, 2003). At the same time the low predictive value of a test does not imply that behavioural addictions are non-existing or that they are not pathological. It only means that the use of surveys and screening tests is limited to serve as an early detection “gate”. One must always keep in mind that only clinical (interview-based) studies are suitable to claim that a certain behaviour for a given individual is truly “pathological”.

Authors’ contribution

AM designed, AM and OK wrote the manuscript and DZ revised the text. Each author has read and agrees with the information contained in the current article.

Conflict of interest

ZD is the Editor-in-Chief of the Journal of Behavioral Addictions and AM is Associate Editor of the Journal of Behavioral Addictions. OK has no conflict of interest to report.

9 in total

1. Diagnosing tests: using and misusing diagnostic and screening tests.

Authors: David L Streiner
Journal: J Pers Assess Date: 2003-12

2. Antecedent probability and the efficiency of psychometric signs, patterns, or cutting scores.

Authors: P E MEEHL; A ROSEN
Journal: Psychol Bull Date: 1955-05 Impact factor: 17.737

3. A contribution to the clinical characterization of Internet addiction in a sample of treatment seekers: validity of assessment, severity of psychopathology and type of co-morbidity.

Authors: K W Müller; M E Beutel; K Wölfling
Journal: Compr Psychiatry Date: 2014-01-28 Impact factor: 3.735

4. Understanding the accuracy of tests with cutting scores: the sensitivity, specificity, and predictive value model.

Authors: A G Glaros; R B Kline
Journal: J Clin Psychol Date: 1988-11

Review 5. Tanning as a behavioral addiction.

Authors: Arianne S Kourosh; Cynthia R Harrington; Bryon Adinoff
Journal: Am J Drug Alcohol Abuse Date: 2010-09 Impact factor: 3.829

6. Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative.

Authors: Patrick M Bossuyt; Johannes B Reitsma; David E Bruns; Constantine A Gatsonis; Paul P Glasziou; Les M Irwig; Jeroen G Lijmer; David Moher; Drummond Rennie; Henrica C W de Vet
Journal: Clin Chem Lab Med Date: 2003-01 Impact factor: 3.694

7. Study addiction--a new area of psychological study: conceptualization, assessment, and preliminary empirical findings.

Authors: Paweł A Atroszko; Cecilie Schou Andreassen; Mark D Griffiths; Ståle Pallesen
Journal: J Behav Addict Date: 2015-05-27 Impact factor: 6.756

8. Are we overpathologizing everyday life? A tenable blueprint for behavioral addiction research.

Authors: Joël Billieux; Adriano Schimmenti; Yasser Khazaal; Pierre Maurage; Alexandre Heeren
Journal: J Behav Addict Date: 2015-05-27 Impact factor: 6.756

9. Argentine tango: Another behavioral addiction?

Authors: Remi Targhetta; Bertrand Nalpas; Perney Pascal
Journal: J Behav Addict Date: 2013-06-14 Impact factor: 6.756

9 in total

25 in total

1. Development of a short form of the compulsive internet use scale in Switzerland.

Authors: Gerhard Gmel; Yasser Khazaal; Joseph Studer; Stéphanie Baggio; Simon Marmet
Journal: Int J Methods Psychiatr Res Date: 2019-01-16 Impact factor: 4.035

2. Treatments of internet gaming disorder: a systematic review of the evidence.

Authors: Kristyn Zajac; Meredith K Ginley; Rocio Chang
Journal: Expert Rev Neurother Date: 2019-09-26 Impact factor: 4.618

Review 3. Prevalence of problematic internet use in Slovenia.

Authors: Mirna Macur; Orsolya Király; Aniko Maraz; Katalin Nagygyörgy; Zsolt Demetrovics
Journal: Zdr Varst Date: 2016-05-10

4. Problematic Social Media Use: Results from a Large-Scale Nationally Representative Adolescent Sample.

Authors: Fanni Bányai; Ágnes Zsila; Orsolya Király; Aniko Maraz; Zsuzsanna Elekes; Mark D Griffiths; Cecilie Schou Andreassen; Zsolt Demetrovics
Journal: PLoS One Date: 2017-01-09 Impact factor: 3.240

5. Problematic gaming exists and is an example of disordered gaming.

Authors: Mark D Griffiths; Daria J Kuss; Olatz Lopez-Fernandez; Halley M Pontes
Journal: J Behav Addict Date: 2017-08-17 Impact factor: 6.756

6. Lost in the chaos: Flawed literature should not generate new disorders.

Authors: Antonius J Van Rooij; Daniel Kardefelt-Winther
Journal: J Behav Addict Date: 2017-03-17 Impact factor: 6.756

7. Validation of the Internet Gaming Disorder Scale - Short-Form (IGDS9-SF) in an Italian-speaking sample.

Authors: Lucia Monacis; Valeria de Palo; Mark D Griffiths; Maria Sinatra
Journal: J Behav Addict Date: 2016-11-23 Impact factor: 6.756

8. The development of the Problematic Series WatchingScale (PSWS).

Authors: Gábor Orosz; Beáta Bőthe; István Tóth-Király
Journal: J Behav Addict Date: 2016-03 Impact factor: 6.756

9. The relationship between study addiction and work addiction: A cross-cultural longitudinal study.

Authors: Paweł A Atroszko; Cecilie Schou Andreassen; Mark D Griffiths; Ståle Pallesen
Journal: J Behav Addict Date: 2016-11-15 Impact factor: 6.756

10. Psychometric Properties of the Problematic Internet Use Questionnaire Short-Form (PIUQ-SF-6) in a Nationally Representative Sample of Adolescents.

Authors: Zsolt Demetrovics; Orsolya Király; Beatrix Koronczai; Mark D Griffiths; Katalin Nagygyörgy; Zsuzsanna Elekes; Domokos Tamás; Bernadette Kun; Gyöngyi Kökönyei; Róbert Urbán
Journal: PLoS One Date: 2016-08-09 Impact factor: 3.240