| Literature DB >> 34852805 |
Haowen Deng1, Youyou Zhou2, Lin Wang3, Cheng Zhang4.
Abstract
BACKGROUND: Neonatal jaundice may cause severe neurological damage if poorly evaluated and diagnosed when high bilirubin occurs. The study explored how to effectively integrate high-dimensional genetic features into predicting neonatal jaundice.Entities:
Keywords: Genetic variants; Hyperbilirubinemia; Machine learning; Transcutaneous bilirubin
Mesh:
Substances:
Year: 2021 PMID: 34852805 PMCID: PMC8638201 DOI: 10.1186/s12911-021-01701-9
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The methodological workflow of our study
Descriptive summary of daily TcB levels (μmol/L)
| Age (day) | Min | 25% | Mean | Mode | 75% | Max | Std | n | Ratio (%) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 0 | 0 | 1.2 | 0 | 2.1 | 3.9 | 1.2 | 128 | 13 |
| 2 | 0 | 44.5 | 65.2 | 54.7 | 85.5 | 186.4 | 30.7 | 941 | 95.6 |
| 3 | 0 | 102.6 | 128.1 | 119.7 | 153.9 | 270.2 | 37.4 | 973 | 98.9 |
| 4 | 20.5 | 141.9 | 168.3 | 205.2 | 194.9 | 307.8 | 42.7 | 964 | 98 |
| 5 | 0 | 157.7 | 181.7 | 205.2 | 206.9 | 302.7 | 44.6 | 730 | 74.2 |
| 6 | 0 | 145.4 | 172.8 | 205.2 | 205.2 | 290.7 | 48.6 | 297 | 30.2 |
| 7 | 0 | 141.9 | 166.7 | 196.6 | 201.8 | 256.5 | 54.1 | 105 | 10.7 |
Ratio denotes the fraction of samples
Thresholds to start phototherapy and the number of neonates exceeds the threshold (n +) according to different guidelines
| Age (day) | CN220 | NICE | P95 | Sample size | |||
|---|---|---|---|---|---|---|---|
| Thresholds | n+ | Thresholds | n+ | Thresholds | n+ | ||
| 2 | 220 | 0 | 100 | 110 | 119.7 | 52 | |
| 3 | 220 | 4 | 150 | 282 | 186.4 | 50 | |
| 4 | 220 | 65 | 200 | 212 | 239.4 | 51 | |
| 5 | 220 | 107 | 200 | 289 | 256.5 | 45 | |
| 6 | 220 | 35 | 200 | 105 | 248.6 | 16 | |
| 7 | 220 | 10 | 200 | 30 | 186.4 | 6 | |
The best method is marked in bold with respect to each metric
Fig. 2The architecture of Gradient Boosting Decision Tree
Discrimination results of predicting neonatal jaundice with CRF and GV under CN220 guideline
| Variables | Method | AUC | F1-score | Precision |
|---|---|---|---|---|
| CRF | Lightgbm | 0.136 (0.109–0.161) | ||
| Cart | 0.553 (0.509–0.592) | 0.150 (0.074–0.211) | ||
| Logistic | 0.785 (0.753–0.821) | 0.210 (0.178–0.240) | 0.122 (0.103–0.141) | |
| Naive Bayes | 0.735 (0.673–0.782) | 0.165 (0.129–0.188) | 0.091 (0.069–0.104) | |
| rf | 0.766 (0.711–0.806) | 0.206 (0.177–0.245) | 0.123 (0.106–0.147) | |
| GV36 | Lightgbm | 0.105 (0.074–0.131) | ||
| Cart | 0.558 (0.522–0.598) | |||
| Logistic | 0.569 (0.519–0.614) | 0.118 (0.093–0.141) | 0.068 (0.053–0.081) | |
| Naive bays | 0.562 (0.509–0.622) | 0.112 (0.106–0.116) | 0.059 (0.057–0.062) | |
| rf | 0.587 (0.522–0.652) | 0.148 (0.104–0.197) | 0.103 (0.074–0.136) | |
| CRF_GV36 | Lightgbm | |||
| Cart | 0.569 (0.517–0.621) | 0.184 (0.103–0.269) | 0.175 (0.095–0.250) | |
| Logistic | 0.781 (0.730–0.816) | 0.218 (0.185–0.251) | 0.129 (0.110–0.150) | |
| Naive Bayes | 0.642 (0.563–0.707) | 0.114 (0.105–0.124) | 0.061 (0.056–0.067) | |
| rf | 0.792 (0.753–0.833) | 0.228 (0.193–0.259) | 0.139 (0.118–0.158) |
The best performance by algorithms with CRF, GV36 and CRF_GV36 variables are marked in bold
95% CI is shown in parentheses
Fig. 3Relative feature importance from ensemble nethod in predicting neonatal jaundice under CN220 guideline
Fig. 4Calibration curves on external validation sets
Calibration results of predicting neonatal jaundice with CRF and GV. 95%
| Recali-brated | Guideline | Variables | AUC | Brier | Event rate | Average risk |
|---|---|---|---|---|---|---|
| No | CN220 | CRF | 0.792 (0.757–0.828) | 0.054 (0.05–0.058) | 0.055 | 0.047 |
| CRF_GV36 | 0.82 (0.785–0.857) | 0.053 (0.05–0.057) | 0.055 | 0.038 | ||
| NICE_R1 | CRF | 0.72 (0.695–0.744) | 0.172 (0.164–0.179) | 0.254 | 0.250 | |
| CRF_GV36 | 0.756 (0.736–0.78) | 0.165 (0.155–0.175) | 0.254 | 0.244 | ||
| P95 | CRF | 0.68 (0.623–0.737) | 0.053 (0.05–0.056) | 0.054 | 0.048 | |
| CRF_GV36 | 0.709 (0.657–0.773) | 0.054 (0.049–0.06) | 0.054 | 0.043 | ||
| Yes | CN220 | CRF | 0.795 (0.761–0.83) | 0.051 (0.049–0.052) | 0.055 | 0.055 |
| CRF_GV36 | 0.83 (0.802–0.862) | 0.049 (0.048–0.051) | 0.055 | 0.055 | ||
| NICE_R1 | CRF | 0.724 (0.702–0.752) | 0.168 (0.163–0.173) | 0.254 | 0.254 | |
| CRF_GV36 | 0.762 (0.739–0.787) | 0.158 (0.152–0.164) | 0.254 | 0.255 | ||
| P95 | CRF | 0.683 (0.622–0.733) | 0.05 (0.049–0.052) | 0.054 | 0.055 | |
| CRF_GV36 | 0.717 (0.669–0.772) | 0.049 (0.047–0.05) | 0.054 | 0.055 |
95% CI is shown in parentheses
Prediction performance of recalibrated lightgbm under CN220 guideline with different combinations of GV
| Variables | AUC | F1-score | Precision | Specific GV |
|---|---|---|---|---|
| CRF | 0.795 (0.761–830) | 0.217 (0.171–0.261) | 0.143 (0.113–0.171) | None |
| CRF + GV4 | 0.807 (0.779–841) | 0.242 (0.195–0.286) | 0.165 (0.132–0.194) | rs2071749 rs4148323, rs6717546, rs6719561 |
| CRF + GV11 | 0.813 (0.781–0.847) | 0.251 (0.198–0.298) | 0.176 (0.141–0.207) | (GT)n, rs9607267, rs2071749 rs887829, (TA)n, rs4148323, rs1018124, rs6717546, rs11563250, rs6719561, rs4663972 |
| CRF + GV36 | 0.830 (0.802–0.826) | 0.285 (0.229–0.333) | 0.217 (0.173–0.252) | rs2071746, (GT)n, rs9607267, rs2071749 rs4399719, rs887829, (TA)n, rs4148323, rs1018124, rs6717546, rs11563250, rs6719561, rs4663972 rs1181601, rs1181574, rs10486752, rs699512, rs17246016, rs589570 rs4149013, rs10743408, rs3899743, rs981262, rs7138177, rs4149026, rs976754, rs4149034, rs12313639, rs2306283, rs4149044, rs4149056, rs4149057, rs4363657, rs4149076, rs12578392, rs4149085 |
95% CI is shown in parentheses
Fig. 5ROC curve of neonatal jaundice prediction with CRF and GV by ensemble learning
Fig. 6Comparison of ROC curve of neonatal jaundice prediction after introducing genetic variants (GV36)