| Literature DB >> 24494152 |
Azam Rastegari1, Ali Akbar Haghdoost2, Mohammad Reza Baneshi3.
Abstract
BACKGROUND: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predicting drug injection among prisoners.Entities:
Keywords: Classification and regression trees; Drug abuse; History of drug injection; Logistic regression model
Year: 2013 PMID: 24494152 PMCID: PMC3905563
Source DB: PubMed Journal: Addict Health ISSN: 2008-4633
Frequency and percentage of prisoners with history of drug injection
| Data | History of drug injection | |
|---|---|---|
| Yes | No | |
| n (%) | n (%) | |
| Training data group | 458 (22.4) | 1591 (77.6) |
| Testing data group | 160 (23.8) | 511 (76.2) |
| Total data | 618 (22.7) | 2101 (77.3) |
Figure 1The classification and regression tree on training data (Code 1: prisoners with the history of drug injection. At each node, the group with the highest percentage was considered as the predicting group)
The results of the logistic regression model and the classification and regression tree in training data group
| Variable | Odds ratio | 95% Confidence interval | Sensitivity (%) | Specificity (%) | Accuracy (%) | |
|---|---|---|---|---|---|---|
| Logistic regression model | ||||||
| First step | Heroin use | 6.42 | 4.95-8.32 | 0.0 | 100 | 77.6 |
| Second step | Heroin use | 7.00 | 5.39-9.20 | 8.3 | 97.6 | 77.6 |
| History of arrest | 1.17 | 1.12-1.23 | ||||
| Third step | Heroin | 6.18 | 4.7-8.12 | 15.3 | 96.8 | 78.6 |
| History of arrest | 1.17 | 1.14-1.25 | ||||
| Age at first drug use | 0.92 | 0.9-0.94 | ||||
| Fourth step | Heroin use | 5.84 | 4.4-7.70 | 20.1 | 96.2 | 79.2 |
| History of arrest | 1.18 | 1.13 -1.24 | ||||
| Age at first drug use | 0.92 | 0.89-0.94 | ||||
| Marital status | 1.16 | 0.90-1.50 | ||||
| Other | 2.18 | 1.57-3.028 | ||||
| Fifth step | Heroin | 4.30 | 3.2-5.90 | 27.1 | 95.2 | 79.0 |
| History of arrest | 1.15 | 1.10-1.22 | ||||
| Age at first drug use | 0.90 | 0.88-0.92 | ||||
| Marital status | 1.53 | 1.14-2.05 | ||||
| Other | 2.16 | 1.55-3.02 | ||||
| Age | 1.05 | 1.03-1.07 | ||||
| Opium use | 0.60 | 0.38-0.70 | ||||
| Methadone use | 2.21 | 1.37-3.50 | ||||
| Ecstasy use | 9.50 | 1.47-62.54 | ||||
| Tree model | ||||||
| First step | Heroin use | - | - | 0.0 | 100 | 77.0 |
| Second step | Heroin use | - | - | 26.4 | 93.0 | 78.0 |
| History of arrest | - | - | - | - | - | |
| Third step | Heroin use | - | - | - | - | - |
| History of arrest | - | - | 8.1 | 99.2 | 79.0 | |
| Age at first drug use | - | - | - | - | - | |
| Forth step | Heroin use | - | - | - | - | - |
| History of arrest | - | - | - | - | - | |
| Age at first drug use | - | - | 14.2 | 96.7 | 78.0 | |
| Marital status | - | - | - | - | - | |
Comparing the results of the logistic regression model and the classification and regression tree (CART) on total data and training and testing data
| Model | Sensitivity (%) | Specificity (%) | Accuracy (%) | |
|---|---|---|---|---|
| Training data | Logistic regression | 27.1 | 95.2 | 79.0 |
| CART | 14.2 | 96.7 | 78.0 | |
| Testing data | Logistic regression | 30.0 | 94.0 | 78.0 |
| CART | 25.0 | 96.7 | 80.0 | |
| Total data | Logistic regression | 31.9 | 94.0 | 79.9 |
| CART | 24.9 | 95.5 | 79.5 |
CART: Classification and regression tree