Mohamad Adam Bujang, Nadiah Sa'at, Tg Mohd Ikhwan Tg Abu Bakar Sidik, Lim Chien Joo.
Abstract
BACKGROUND: Different study designs and population sizes may require different sample sizes for logistic regression. This study aims to propose sample size guidelines for logistic regression based on observational studies with large populations.
Keywords: logistic regression; observational studies; sample size
Year: 2018 PMID: 30914854 PMCID: PMC6422534 DOI: 10.21315/mjms2018.25.4.12
Source DB: PubMed Journal: Malays J Med Sci ISSN: 1394-195X
Information on the audit data: variable names and their codes
| Variables | Code for variable |
|---|---|
| HbA1c | |
| Poor | Reference group |
| Good | 1 |
| Gender | |
| Male | 1 |
| Female | Reference group |
| BMI Category | |
| Normal | 2 |
| Underweight | 3 |
| Overweight | 4 |
| Obese | Reference group |
| Duration of diabetes | |
| < 5 years | 5 |
| 5–10 years | 6 |
| > 10 years | Reference group |
| Treatment | |
| Diet only | 7 |
| Oral ADA only | 8 |
| Insulin only | 9 |
| Both oral and insulin | Reference group |
| Co-morbidity | |
| No | Reference group |
| Hypertension only | 10 |
| Dyslipidemia only | 11 |
| Hypertension and Dyslipidemia | 12 |
| Age | 13 |
| Low-density lipoprotein | 14 |
| Blood pressure (systolic) | 15 |
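The table assigns one code to each non-reference level, which is consistent with indicator (dummy) coding: a categorical predictor with k levels contributes k − 1 indicator variables, and the reference group is represented by zeros on all of them. A minimal Python sketch for the BMI category, assuming codes 2 to 4 simply label the three BMI indicators; the names `BMI_CODES`, `BMI_REFERENCE`, and `encode_bmi` are hypothetical and do not come from the paper:

```python
# Hypothetical illustration of reference-group (dummy) coding for BMI category.
# Code numbers follow the table above; "Obese" is the reference group.
BMI_CODES = {"Normal": 2, "Underweight": 3, "Overweight": 4}
BMI_REFERENCE = "Obese"


def encode_bmi(category: str) -> dict:
    """Return indicator values, keyed by variable code, for one patient."""
    if category != BMI_REFERENCE and category not in BMI_CODES:
        raise ValueError(f"unknown BMI category: {category!r}")
    # The reference group receives 0 on every indicator.
    return {code: int(category == level) for level, code in BMI_CODES.items()}


if __name__ == "__main__":
    print(encode_bmi("Normal"))
    print(encode_bmi("Obese"))
```

Under this coding, each fitted coefficient for a BMI indicator would be read as a contrast against the obese reference group.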
Figure 1. The comparison of differences of coefficients between results derived from parameters and statistics based on various sample sizes
Figure 2. The comparison of differences of Nagelkerke r-squared between results derived from parameters and statistics based on various sample sizes
Comparison of minimum sample sizes based on two rules of thumb: EPV (prevalence of poor control = 80.0% and number of independent variables = 8) and the formula n = 100 + xi (where x is an integer and i is the number of independent variables)
| Guideline | Minimum sample in poor control | Minimum sample size based on EPV | Number of independent variables | Minimum sample size based on formula |
|---|---|---|---|---|
| EPV of 10 | 80 | 100 (80 in poor outcome category) | | |
| EPV of 20 | 160 | 200 (160 in poor outcome category) | | |
| EPV of 30 | 240 | 300 (240 in poor outcome category) | | |
| EPV of 40 | 320 | 400 (320 in poor outcome category) | | |
| EPV of 50 | 400 | 500 (400 in poor outcome category) | | |
| 100 + 10i | | | 8 | 180 |
| 100 + 20i | | | 8 | 260 |
| 100 + 30i | | | 8 | 340 |
| 100 + 40i | | | 8 | 420 |
| 100 + 50i | | | 8 | 500 |
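The arithmetic behind both rules of thumb can be sketched in a few lines of Python. This is an illustrative reading of the table, not the authors' code: it assumes the EPV rule requires EPV × (number of independent variables) subjects in the poor-control category and scales the total by the stated 80% prevalence. The function names `epv_sample_size` and `formula_sample_size` are hypothetical.

```python
def epv_sample_size(epv: int, n_predictors: int, event_percent: int):
    """Return (minimum events, minimum total sample) under an EPV rule.

    event_percent is the prevalence of the outcome category as a whole
    percentage (80 for the 80.0% poor-control prevalence in the table).
    """
    min_events = epv * n_predictors
    # Integer ceiling division avoids floating-point rounding issues.
    min_total = -(-min_events * 100 // event_percent)
    return min_events, min_total


def formula_sample_size(x: int, n_predictors: int) -> int:
    """Return n = 100 + x * i, where i is the number of independent variables."""
    return 100 + x * n_predictors


if __name__ == "__main__":
    for x in (10, 20, 30, 40, 50):
        events, total = epv_sample_size(x, n_predictors=8, event_percent=80)
        print(f"EPV of {x}: {events} events, n = {total}; "
              f"100 + {x}i: n = {formula_sample_size(x, 8)}")
```

With 8 independent variables and 80% prevalence, the two rules coincide only at x = 50 (n = 500); for smaller x the formula gives the larger minimum sample size.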
Figure 3. The comparison of differences of coefficients between results derived from parameters and statistics based on various sample sizes tested with larger sample