| Literature DB >> 33623892 |
Liyan Pan1, Guangjian Liu1, Xiaojian Mao2, Huiying Liang1.
Abstract
OBJECTIVE: The study aimed to develop simplified diagnostic models for identifying girls with central precocious puberty (CPP), without the expensive and cumbersome gonadotropin-releasing hormone (GnRH) stimulation test, which is the gold standard for CPP diagnosis.Entities:
Keywords: GnRH stimulation test; central precocious puberty; machine learning; multisource data
Year: 2020 PMID: 33623892 PMCID: PMC7886559 DOI: 10.1093/jamiaopen/ooaa063
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Current overall diagnosis flow for CPP. Different colors represent different data sources. EMR: electronic medical record. LAB: laboratory. US: ultrasound. BA: bone age.
Figure 2.Flow chart for CPP diagnostic model development. Different colors represent different data sources.
Characteristics of patients enrolled in this study
| Characteristics | Non-CPP | CPP |
|
|---|---|---|---|
| Laboratory parameters ( | 1370 | 1153 | |
| LH (IU/L) | 0.11 (0.17) | 0.78 (1.12) | <.001 |
| FSH (IU/L) | 1.83 (1.23) | 2.94 (1.64) | <.001 |
| GH (ng/mL) | 3.18 (2.96) | 4.13 (4.01) | <.001 |
| IGF-1 (ng/mL) | 232.25 (65.73) | 295.19 (90.74) | <.001 |
| IGFBP-3 (μg/mL) | 4.69 (0.69) | 4.85 (0.64) | <.001 |
| E2 (pmol/L) | 106.03 (59.17) | 120.79 (56.03) | <.001 |
| PRL (ng/mL) | 9.42 (7.05) | 8.765 (5.02) | .007 |
| TTE (nmol/L) | 0.80 (0.38) | 0.90 (0.47) | <.001 |
| Clinical parameters ( | 881 | 794 | |
| Age (years) | 7.05 (1.13) | 7.47 (1.09) | <.001 |
| Duration (months) | 8.15 (10.86) | 9.95 (10.44) | <.001 |
| Height (cm) | 126.11 (8.60) | 129.95 (8.83) | <.001 |
| Weight (kg) | 26.04 (5.35) | 28.33 (5.38) | <.001 |
| BMI (kg/m2) | 16.22 (2.01) | 16.66 (1.96) | <.001 |
| Breast, tanner stage | <.001 | ||
| 1 | 72 (60.50) | 47 (39.50) | |
| 2 | 418 (67.97) | 197 (32.03) | |
| 3 | 347 (46.33) | 402 (53.67) | |
| 4 | 43 (23.12) | 143 (76.88) | |
| 5 | 1 (16.67) | 5 (83.33) | |
| Core | .027 | ||
| Yes | 724 (51.42) | 684 (48.58) | |
| No | 157 (58.80) | 110 (41.20) | |
| Vulva, tanner stage | .009 | ||
| 1 | 833 (53.57) | 722 (46.43) | |
| 2 | 44 (41.90) | 61 (58.10) | |
| 3 | 4 (26.67) | 11 (73.33) | |
| Pubes, tanner stage | .002 | ||
| 1 | 837 (51.99) | 773 (48.01) | |
| 2 | 43 (71.67) | 17 (28.33) | |
| 3 | 1 (20.00) | 4 (80.00) | |
| Pigmentation | .137 | ||
| Yes | 57 (60.00) | 38 (40.00) | |
| No | 824 (52.15) | 756 (47.85) | |
| Bone age information ( | 897 | 715 | |
| BA value (years) | 8.70 (1.61) | 9.72 (1.49) | <.001 |
| Age ratio | 1.25 (0.20) | 1.32 (0.29) | <.001 |
| Pelvic ultrasonography ( | 1048 | 923 | |
| Left ovarian volume (mL) | 2.04 (1.34) | 2.46 (1.56) | <.001 |
| Right ovarian volume (mL) | 1.94 (1.32) | 2.26 (1.17) | <.001 |
| Uterine volume (mL) | 1.57 (1.33) | 2.69 (1.90) | <.001 |
LH: luteinizing hormone; IGF-1: insulin-like growth factor-1; FSH: follicle-stimulation hormone; PRL: prolactin; GH: growth hormone; E2: estradiol; BMI: body mass index; TTE: testosterone; BA: bone age.
Model performance of models using different data source combinations
| Data sources | Sensitivity | Specificity | AUC | Youden index | ||||
|---|---|---|---|---|---|---|---|---|
| LAB | EMR | BA | US | |||||
| BA ratio | BA image | |||||||
| ● | 76.62 | 84.62 | 0.85 | 0.61 | ||||
| ● | 53.25 | 79.81 | 0.72 | 0.33 | ||||
| ● | 81.82 | 38.46 | 0.62 | 0.20 | ||||
| ● | 88.31 | 47.12 | 0.71 | 0.35 | ||||
| ● | 66.23 | 75.96 | 0.76 | 0.42 | ||||
| ● | ● | ● | 67.53 | 82.69 | 0.80 | 0.50 | ||
| ● | ● | 81.82 | 82.69 | 0.86 | 0.65 | |||
| ● | ● | ● | ● | 85.71 | 77.88 | 0.88 | 0.64 | |
The first five lines show performance of models with single-source data. The last three lines show performance of selected models.
LAB: laboratory; EMR: electronic medical records; BA: bone age; US: ultrasonography.
Figure 3.AUC and Youden index ranking among 23 models. The left panel represents the AUC of the models based on different data source combinations, and the right panel represents the Youden index. LAB: laboratory; EMR: electronic medical records; BA: bone age; US: ultrasonography.
Figure 4.Learning curves for models based on different single-source data. The black dot represents the performance at the maximum size (n = 720) of the training set with comprehensive data. The horizontal axis represents the sample size, and the vertical axis represents the model performance that varied with the sample size. LAB: laboratory; EMR: electronic medical records; BA: bone age; US: ultrasonography.
Figure 5.Feature importance of each variable (top 20). LH: luteinizing hormone; FSH: follicle-stimulation hormone; IGF-1: insulin-like growth factor-1.