| Literature DB >> 32685710 |
Bambang Widjanarko Otok1, Marsuddin Musa1, Septia Devi Prihastuti Yasmirullah1.
Abstract
INTRODUCTION: Observational research in the field of health often does not conduct randomized controlled trials on research subjects. A non-random selection process on research subjects can result in a biased treatment effect due to an imbalance between the treatment and control groups.Entities:
Keywords: Bootstrap aggregating; Classification trees analysis; Mathematics; Opportunistic infection; Propensity score stratification; Statistics
Year: 2020 PMID: 32685710 PMCID: PMC7355728 DOI: 10.1016/j.heliyon.2020.e04288
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Research variables.
| No | Variables | Operational Definition | Scale |
|---|---|---|---|
| 1 | Opportunistic Infection ( | Infectious diseases that accompany HIV/AIDS sufferers, such as pneumonia, tuberculosis, hepatitis, etc. Data is categorized by: | Nominal |
| 2 | Age ( | The length of life someone has lived and is calculated based on the last birthday. The data unit is in the form of years. | Ratio |
| 3 | Knowlegde ( | Something that is known by patients about HIV/AIDS includes understanding, signs, symptoms, treatment, and the ways to prevent transmission of HIV. Data is categorized by: | Nominal |
| 4 | Self Conpect ( | Attitudes or acceptance towards oneself (individuals with HIV/AIDS). Data is categorized by 0 = Negative | Nominal |
| 5 | Attitudes towards HIV/AIDS ( | Patient's perception of the quality of life, which includes social relationships, physical well-being, psychological, and spiritual. Data is categorized by: | Nominal |
| 6 | Family Support ( | Patient's perceptions of family support include emotional support, moral, information, and social. Data is categorized by: | Nominal |
| Time suffering from HIV/AIDS ( | The duration of suffering from HIV/AIDS starts from the first diagnosis until the study time. The data unit is in the form of months. | Ratio | |
| 8 | Treatments ( | Giving different treatments for patients. Data is categorized by: | Nominal |
Patient Characteristics based on Opportunistic Infections Status.
| Variable | Measure | Opportunistic Infections (OIs) Status | |
|---|---|---|---|
| There is OIs | There is not OIs | ||
| Age | Mean | 34.82 | 33.22 |
| StDev | 8.57 | 7.89 | |
| Min | 23.00 | 23.00 | |
| Max | 61.00 | 56.00 | |
| Time suffering from HIV/AIDS | Mean | 37.95 | 40.16 |
| StDev | 26.21 | 27.43 | |
| Min | 11.00 | 3.00 | |
| Max | 147.00 | 146.00 | |
Classification results of testing data based on the learning data.
| Proportion of Learning Data | Classification Results | |
|---|---|---|
| Accuracy (%) | AUC (%) | |
| 75% | 55.26 | 49.36 |
| 80% | 54.84 | 50.68 |
| 85% | 52.17 | 45.53 |
| 90% | 50.00 | 28.57 |
The highest accuracy of the classification results is 63.04%, so the proportion of learning data is 70%. The number of research data points is 150, which is the learning data was used 104 data points, while the testing data was used 46 data points.
Figure 1Optimal Classification Tree.
Accuracy of optimal classification tree.
| Actual | Prediction | Accuracy (%) | AUC (%) | ||
|---|---|---|---|---|---|
| 0 | 1 | ||||
| Learning Data | 0 | 63 | 22 | 74.04 | 73.90 |
| 1 | 5 | 14 | |||
| Testing Data | 0 | 24 | 12 | 60.87 | 53.33 |
| 1 | 6 | 4 | |||
Accuracy of classification.
| Number of Replication | Accuracy (%) | |
|---|---|---|
| Training Data | Testing Data | |
| 25 | 96.15 | 60.87 |
| 50 | 97.11 | 56.52 |
| 75 | 97.11 | 60.87 |
| 100 | 97.11 | 58.70 |
| 150 | 97.11 | 60.87 |
| 175 | 97.11 | 60.87 |
Accuracy of bagging CART.
| Actual | Prediction | Accuracy (%) | AUC (%) | ||
|---|---|---|---|---|---|
| 0 | 1 | ||||
| Learning Data | 0 | 66 | 20 | 78.85 | 82.81 |
| 1 | 2 | 16 | |||
| Testing Data | 0 | 26 | 13 | 63.04 | 54.76 |
| 1 | 4 | 3 | |||
Propensity score estimation results.
| No | |||||||
|---|---|---|---|---|---|---|---|
| 1–7 | 0.552 | 0.992 | 0.808 | 0.768 | 0.920 | 0.864 | 0.632 |
| 8–14 | 0.952 | 0.968 | 0.960 | 1.000 | 0.808 | 0.952 | 0.976 |
| 15–21 | 0.976 | 0.984 | 0.984 | 0.944 | 0.984 | 0.760 | 0.984 |
| 22–28 | 0.992 | 0.896 | 0.720 | 0.792 | 0.968 | 0.496 | 0.816 |
| 29–35 | 1.000 | 0.752 | 0.976 | 0.816 | 0.992 | 0.944 | 0.992 |
| 36–42 | 0.808 | 0.936 | 0.352 | 0.936 | 0.800 | 0.984 | 0.752 |
| 43–49 | 0.976 | 0.816 | 0.960 | 0.976 | 0.784 | 0.928 | 0.968 |
| 50–56 | 0.928 | 0.968 | 0.968 | 0.752 | 0.720 | 0.672 | 0.736 |
| 57–63 | 0.504 | 0.616 | 0.728 | 0.984 | 0.632 | 0.936 | 0.984 |
| 64–70 | 0.992 | 0.984 | 0.912 | 0.992 | 0.968 | 0.120 | 0.448 |
| 71–77 | 0.880 | 0.096 | 0.520 | 0.472 | 0.512 | 0.192 | 0.120 |
| 78–84 | 0.312 | 0.232 | 0.504 | 0.672 | 0.384 | 0.304 | 0.720 |
| 85–91 | 0.504 | 0.680 | 0.184 | 0.440 | 0.056 | 0.648 | 0.632 |
| 92–98 | 0.520 | 0.112 | 0.424 | 0.328 | 0.688 | 0.672 | 0.880 |
| 99–104 | 0.080 | 0.728 | 0.160 | 0.600 | 0.296 | 0.024 | |
Propensity score stratification results.
| Strata | ||||||||
|---|---|---|---|---|---|---|---|---|
| Strata 1 | 0.024 | 0.056 | 0.080 | 0.096 | 0.112 | 0.120 | 0.120 | 0.160 |
| 0.184 | 0.192 | 0.232 | 0.296 | 0.304 | 0.312 | 0.328 | 0.352 | |
| 0.384 | 0.424 | 0.440 | 0.448 | |||||
| Strata 2 | 0.472 | 0.496 | 0.504 | 0.504 | 0.504 | 0.512 | 0.520 | 0.520 |
| 0.552 | 0.600 | 0.616 | 0.632 | 0.632 | 0.632 | 0.648 | 0.672 | |
| 0.672 | 0.672 | 0.680 | 0.688 | |||||
| Strata 3 | 0.720 | 0.720 | 0.720 | 0.728 | 0.728 | 0.736 | 0.752 | 0.752 |
| 0.752 | 0.760 | 0.768 | 0.784 | 0.792 | 0.800 | 0.808 | 0.808 | |
| 0.808 | 0.816 | 0.816 | 0.816 | |||||
| Strata 4 | 0.864 | 0.880 | 0.880 | 0.896 | 0.912 | 0.920 | 0.928 | 0.928 |
| 0.936 | 0.936 | 0.936 | 0.944 | 0.944 | 0.952 | 0.952 | 0.960 | |
| 0.960 | 0.968 | 0.968 | 0.968 | 0.968 | 0.968 | |||
| Strata 5 | 0.968 | 0.976 | 0.976 | 0.976 | 0.976 | 0.976 | 0.984 | 0.984 |
| 0.984 | 0.984 | 0.984 | 0.984 | 0.984 | 0.984 | 0.992 | 0.992 | |
| 0.992 | 0.992 | 0.992 | 0.992 | 1.000 | 1.000 | |||
Results of covariate balance testing.
| Covariate | Before PSS ( | After PSS ( | ||||
|---|---|---|---|---|---|---|
| Strata 1 | Strata 2 | Strata 3 | Strata 4 | Strata 5 | ||
| Age (X1) | 0.552 | 0.129 | 0.883 | 0.877 | 0.992 | 0.992 |
| Knowledge (X2) | 0.460 | 0.798 | 0.067 | 0.588 | 0.650 | 0.684 |
| Attitude (X3) | 0.046 | 0.717 | 0.456 | 0.154 | 0.427 | 0.629 |
| Self Concept (X4) | 0.090 | 0.717 | 0.199 | 0.639 | 0.427 | 0.175 |
| Family Support (X5) | 0.002 | NaN | 0.402 | 0.127 | 0.158 | NaN |
| Time Infected (X6) | 0.514 | 0.878 | 0.220 | 0.843 | 0.121 | 0.121 |
declare the covariate is not balanced.
ATE estimation results.
| ATE | SE | ||
|---|---|---|---|
| 0.222 | 0.038 | 5.815 | 0.000 |
PBR using PSS Bagging CTA.
| Bias Before PSS | Bias After PSS | PBR (%) |
|---|---|---|
| 0.445 | 0.046 | 89.54 |