| Literature DB >> 26925900 |
Zahra Shayan1, Naser Mohammad Gholi Mezerji, Leila Shayan, Parisa Naseri.
Abstract
BACKGROUND: Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices.Entities:
Mesh:
Year: 2015 PMID: 26925900 PMCID: PMC4965639 DOI: 10.5539/gjhs.v8n7p41
Source DB: PubMed Journal: Glob J Health Sci ISSN: 1916-9736
Comparison of logistic and linear discriminant analysis based on classification indices with different sample sizes (the first random sampling)
| n | Q | B | % Classification Error | |||
|---|---|---|---|---|---|---|
| LR | LDA | LR | LDA | LR | LDA | |
| 50 | 0.13907 | 0.13419 | 0.79099 | 0.79064 | 32 | 32 |
| 100 | 0.24627 | 0.24133 | 0.82432 | 0.82327 | 25 | 25 |
| 150 | 0.18272 | 0.16065 | 0.8024 | 0.79913 | 32.7 | 30.7 |
| 200 | 0.17096 | 0.15044 | 0.80163 | 0.79786 | 31 | 32.5 |
| 220 | 0.18768 | 0.16764 | 0.8056 | 0.80086 | 32.7 | 32.3 |
| Total sample | 0.17325 | 0.16053 | 0.80157 | 0.7988 | 31.7 | 32.9 |
Comparison of logistic and linear discriminant analysis based on classification indices with different sample size (the second random sampling)
| n | Q | B | % Classification Error | |||
|---|---|---|---|---|---|---|
| LR | LDA | LR | LDA | LR | LDA | |
| 50 | 0.09043 | 0.08933 | 0.78025 | 0.77949 | 36 | 34 |
| 100 | 0.20682 | 0.19196 | 0.81148 | 0.80863 | 27 | 28 |
| 150 | 0.19205 | 0.18664 | 0.80634 | 0.80602 | 30 | 30 |
| 200 | 0.17271 | 0.16254 | 0.80082 | 0.79909 | 32.5 | 34 |
| 220 | 0.17195 | 0.16205 | 0.80098 | 0.79913 | 31.4 | 32.7 |
| Total sample | 0.17325 | 0.16053 | 0.80157 | 0.7988 | 31.7 | 32.9 |
Effect of categorization of variables on classification indices with different sample sizes (based on the first random sampling)
| n | Q | B | % Classification Error | |||
|---|---|---|---|---|---|---|
| LR | LDA | LR | LDA | LR | LDA | |
| 50 | 0.14323 | 0.14228 | 0.79442 | 0.79424 | 34 | 34 |
| 100 | 0.22965 | 0.22651 | 0.81936 | 0.81871 | 25 | 25 |
| 150 | 0.14863 | 0.14278 | 0.7939 | 0.79359 | 31.3 | 31.3 |
| 200 | 0.16303 | 0.16015 | 0.79902 | 0.79866 | 32 | 32 |
| 220 | 0.17487 | 0.16748 | 0.80141 | 0.8005 | 31.8 | 31.8 |
| Total sample | 0.16162 | 0.15824 | 0.79809 | 0.79781 | 32.5 | 32.5 |