| Literature DB >> 35790774 |
Ying Li1, Fanggen Lu2, Yani Yin3,4,5.
Abstract
In countries with a high incidence of tuberculosis, the typical clinical features of Crohn's disease (CD) may be covered up after tuberculosis infection, and the identification of atypical Crohn's disease and intestinal tuberculosis (ITB) is still a dilemma for clinicians. Least absolute shrinkage and selection operator (LASSO) regression has been applied to select variables in disease diagnosis. However, its value in discriminating ITB and atypical Crohn's disease remains unknown. A total of 400 patients were enrolled from January 2014 to January 2019 in second Xiangya hospital Central South University.Among them, 57 indicators including clinical manifestations, laboratory results, endoscopic findings, computed tomography enterography features were collected for further analysis. R software version 3.6.1 (glmnet package) was used to perform the LASSO logistic regression analysis. SPSS 20.0 was used to perform Pearson chi-square test and binary logistic regression analysis. In the variable selection step, LASSO regression and Pearson chi-square test were applied to select the most valuable variables as candidates for further logistic regression analysis. Secondly, variables identified from step 1 were applied to construct binary logistic regression analysis. Receiver operating characteristic (ROC) curve analysis was performed on these models to assess the ability and the optimal cutoff value for diagnosis. The area under the ROC curve (AUC), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy rate, together with their 95% confidence and intervals (CIs) were calculated. MedCalc software (Version 16.8) was applied to analyze the ROC curves of models. 332 patients were eventually enrolled to build a binary logistic regression model to discriminate CD (including comprehensive CD and tuberculosis infected CD) and ITB. However, we did not get a satisfactory diagnostic value via applying the binary logistic regression model of comprehensive CD and ITB to predict tuberculosis infected CD and ITB (accuracy rate:79.2%VS 65.1%). Therefore, we further established a binary logistic regression model to discriminate atypical CD from ITB, based on Pearsonchi-square test (model1) and LASSO regression (model 2). Model 1 showed 89.9% specificity, 65.9% sensitivity, 88.5% PPV, 68.9% NPV, 76.9% diagnostic accuracy, and an AUC value of 0.811, and model 2 showed 80.6% specificity, 84.4% sensitivity, 82.3% PPV, 82.9% NPV, 82.6% diagnostic accuracy, and an AUC value of 0.887. The comparison of AUCs between model1 and model2 was statistically different (P < 0.05). Tuberculosis infection increases the difficulty of discriminating CD from ITB. LASSO regression showed a more efficient ability than Pearson chi-square test based logistic regression on differential diagnosing atypical CD and ITB.Entities:
Mesh:
Year: 2022 PMID: 35790774 PMCID: PMC9256608 DOI: 10.1038/s41598-022-15609-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Different endoscopic appearances of CD and ITB.(A)typical cobblestone appearance in patients with CD. (B).transverse ulcer in patients with ITB. (C)CD patient associated with TB infection. (D) CD like patient with lymph node liquefaction,finally diagnosed ITB.
Figure 2Flowchart for patients enrollment.
Clinical features of patients with UCD and ITB.
| Variables | UCD (n = 109) | ITB (n = 105) | |
|---|---|---|---|
| Abdominal pain (%) | 95.4 | 81.0 | 0.001 |
| Diarrhea (%) | 42.2 | 28.6 | 0.037 |
| Perianal abscess (%) | 11.9 | 2.90 | 0.012 |
| Perianal fistula (%) | 13.8 | 1.00 | 0.000 |
| Ileus (%) | 26.6 | 11.4 | 0.005 |
| Bowl resection history (%) | 15.6 | 6.67 | 0.038 |
| Platelets↑ (%) | 42.2 | 27.6 | 0.025 |
| Albumin↓ (%) | 83.5 | 70.5 | 0.024 |
| CRP↑(%) | 75.2 | 54.3 | 0.001 |
| PPD skin test positive (%) | 36.7 | 56.2 | 0.004 |
| T-SPOT positive (%) | 49.5 | 81.0 | 0.000 |
| FOBT positive (%) | 67.9 | 54.3 | 0.041 |
| Comb sign (%) | 13.8 | 0.00 | 0.000 |
| Segmental distribution of lesion (%) | 26.6 | 12.4 | 0.009 |
| Cobblestone appearance (%) | 14.7 | 0.90 | 0.000 |
| Longitudinal ulcers (%) | 24.8 | 8.60 | 0.002 |
| Jejunal involvement | 11.0 | 3.80 | 0.045 |
| Rectal involvement (%) | 31.2 | 19.0 | 0.041 |
Figure 3(A,B) LASSO regression showed log(λ) = − 3.662 when the error of the model is minimized, and 26 variables were selected for further logistic regression analysis.
Validation value of predicting model in differentiation between UCD and ITB.
| Data set | Sensitivity (%) | Specificity (%) | NPV (%) | PPV (%) | Accuracy (%) |
|---|---|---|---|---|---|
| Development set | 78.4 | 89.9 | 63.2 | 94.9 | 81.8 |
| Validation set | 77.8 | 55.6 | 56.8 | 76.9 | 65.1 |
Cutoff point for predictable diagnosed as ITB < 0.738.
Development set was randomly selected from 70% samples of comprehensive CD and ITB.
Validation set consists of 27 UCD and 36 ITB patients.
Diagnostic equations of prediction models in differentiation between ITB and UCD.
| Differential diagnosis | Equations |
|---|---|
| Model 1 | P = 1/[1 + e−(−1.806+1.882X1+3.037X2+1.290X3+0.973X4–1.319X5+2.374X6+2.047X7)] X1, abdominal pain; X2, perianal fistula; X3, illeus; X4, elevated CRP; X5, T-SPOT positive; X6, cobblestone appearance; X7, involvement of jejunum |
| Mode2 | P = 1/[1 + e−(−1.488+1.524X1+2.408X2+1.250X3–1.633X4–1.305X5+1.306X6–1.692X7+1.332X8+1.513X9+2.071X10–2.027X11+1.457X12+2.647X13)] X1, abdominal pain; X2, perianal abscess; X3, illeus; X4, hepatobiliary disease; X5, tuberculosis history; X6, eleveted CRP; X7, T-SPOT; X8, segmental lesions; X9, longitudinal ulcer; X10, jejunum involvement; X11, ascending colon involvement; X12, rectum involvement; X13, perianal fistula |
Model 1: Pearson chi-square test based logistic regression; Model 2: LASSO regression based logistic regression.
Comparison between model 1 and model 2.
| Model | Sensitivity (%) | Specificity (%) | NPV (%) | PPV (%) | Accuracy (%) | AUCa | 95%CI lower | 95%CI upper | P value |
|---|---|---|---|---|---|---|---|---|---|
| Model1 | 65.9 | 89.9 | 68.9 | 88.5 | 76.9 | 0.811 | 0.743 | 0.879 | 0.000 |
| Model2 | 84.4 | 80.6 | 82.9 | 82.3 | 82.6 | 0.887 | 0.835 | 0.939 | 0.000 |
Model 1: Pearson chi-square test based logistic regression; Model 2: LASSO regression based logistic regression.
aAUCs between model 1 and model 2 were statistically different (P < 0.05).