| Literature DB >> 34730546 |
Zhaohua Lu1, Jin-Ah Sim2,3, Jade X Wang1, Christopher B Forrest4, Kevin R Krull2, Deokumar Srivastava1, Melissa M Hudson5, Leslie L Robison2, Justin N Baker5, I-Chan Huang2.
Abstract
BACKGROUND: Assessing patient-reported outcomes (PROs) through interviews or conversations during clinical encounters provides insightful information about survivorship.Entities:
Keywords: PROs; machine learning; natural language processing; pediatric oncology
Mesh:
Year: 2021 PMID: 34730546 PMCID: PMC8600437 DOI: 10.2196/26777
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Figure 1The natural language processing and machine learning pipeline to analyze unstructured patient-reported outcomes data. BERT: bidirectional encoder representations from transformers; PROs: patient-reported outcomes; SVM: support vector machine; XGBoost: extreme gradient boosting.
Characteristics of study participants (N=87).
| Characteristics | Survivors (n=52) | Caregivers (n=35) | |||
| Age at evaluation (years), mean (SD) | 13.8 (2.8) | 39.6 (7.0) | |||
|
| |||||
|
| Female | 31 (61) | 32 (91) | ||
|
| Male | 20 (39) | 3 (9) | ||
|
| |||||
| White, non-Hispanic | 30 (59) | 24 (69) | |||
| Black, non-Hispanic | 14 (28) | 10 (29) | |||
| Other | 7 (14) | 1 (3.0) | |||
|
| |||||
|
| Non-CNSa solid tumor | 22 (42) | N/Ab | ||
|
| Leukemia | 17 (33) | N/A | ||
|
| CNS malignancy | 9 (17) | N/A | ||
|
| Lymphoma | 4 (8.0) | N/A | ||
aCNS: central nervous system.
bN/A: not applicable.
Performance of natural language processing/machine learning models for pain interference domain by 3 symptom attributes.
| Attributes and models | Precision (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | Accuracy (95% CI) | F1 (95% CI) | AUROCCa (95% CI) | AUPRCb (95% CI) | |
|
| ||||||||
|
| BERTc | 0.692 (0.555-0.811) | 0.507 (0.387-0.618) | 0.950 (0.924-0.972) | 0.870 (0.836-0.903) | 0.585 (0.467-0.683) | 0.875 (0.824-0.948) | 0.677 (0.568-0.770) |
|
| Word2vec/SVMd | 0.722 (0.562-0.867) | 0.366 (0.262-0.479) | 0.969 (0.948-0.987) | 0.859 (0.824-0.893) | 0.486 (0.362-0.594) | 0.868 (0.826-0.922) | 0.623 (0.5090.743) |
|
| Word2vec/XGBooste | 0.697 (0.528-0.857) | 0.324 (0.221-0.435) | 0.969 (0.949-0.987) | 0.852 (0.813-0.887) | 0.442 (0.318-0.551) | 0.830 (0.769-0.888) | 0.553 (0.437-0.659) |
|
| ||||||||
|
| BERT | 0.800 (0.657-0.935) | 0.583 (0.432-0.735) | 0.980 (0.964-0.994) | 0.931 (0.905-0.957) | 0.675 (0.543-0.779) | 0.923 (0.879-0.997) | 0.818 (0.735-0.917) |
|
| Word2vec/SVM | 0.760 (0.583-0.920) | 0.396 (0.254-0.533) | 0.983 (0.967-0.994) | 0.910 (0.882-0.939) | 0.521 (0.361-0.648) | 0.900 (0.863-0.957) | 0.609 (0.434-0.761) |
|
| Word2vec/XGBoost | 0.769 (0.500-1.000) | 0.208 (0.104-0.333) | 0.991 (0.980-1.000) | 0.895 (0.867-0.926) | 0.328 (0.178-0.474) | 0.828 (0.748-0.905) | 0.474 (0.321-0.630) |
|
| ||||||||
|
| BERT | 0.636 (0.461-0.800) | 0.500 (0.349-0.652) | 0.966 (0.946-0.983) | 0.916 (0.887-0.941) | 0.560 (0.410-0.690) | 0.857 (0.786-0.918) | 0.566 (0.402-0.750) |
|
| Word2vec/SVM | 0.286 (0-0.668) | 0.048 (0-0.118) | 0.986 (0.973-0.997) | 0.885 (0.854-0.916) | 0.082 (0.035-0.200) | 0.804 (0.742-0.878) | 0.309 (0.173-0.426) |
|
| Word2vec/XGBoost | 0.556 (0.222-0.875) | 0.119 (0.029-0.229) | 0.989 (0.977-0.997) | 0.895 (0.864-0.923) | 0.196 (0.072-0.343) | 0.786 (0.728-0.850) | 0.304 (0.148-0.420) |
aAUROCC: area under the receiver operating characteristic curve.
bAUPRC: area under precision-recall curve.
cBERT: bidirectional encoder representations from transformers.
dSVM: support vector machine.
eXGBoost: extreme gradient boosting.
Performance of natural language processing/machine learning models for fatigue domain by 3 symptom attributes.
| Attributes and models | Precision (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | Accuracy (95% CI) | F1 (95% CI) | AUROCCa (95% CI) | AUPRCb (95% CI) | |
|
| ||||||||
|
| BERTc | 0.593 (0.468-0.717) | 0.427 (0.315-0.538) | 0.929 (0.901-0.956) | 0.832 (0.794-0.867) | 0.496 (0.384-0.593) | 0.775 (0.723-0.848) | 0.537 (0.443-0.634) |
|
| Word2vec/SVMd | 0.600 (0.286-0.900) | 0.073 (0.026-0.136) | 0.988 (0.974-0.997) | 0.810 (0.770-0.848) | 0.130 (0.048-0.227) | 0.726 (0.670-0.780) | 0.375 (0.224-0.474) |
|
| Word2vec/XGBooste | 0.595 (0.432-0.773) | 0.268 (0.169-0.364) | 0.956 (0.934-0.977) | 0.822 (0.784-0.858) | 0.370 (0.250-0.474) | 0.726 (0.665-0.798) | 0.461 (0.338-0.575) |
|
| ||||||||
|
| BERT | 0.803 (0.696-0.895) | 0.757 (0.652-0.854) | 0.963 (0.941-0.981) | 0.929 (0.903-0.953) | 0.779 (0.697-0.855) | 0.948 (0.922-0.979) | 0.855 (0.791-0.930) |
|
| Word2vec/SVM | 0.829 (0.690-0.946) | 0.414 (0.292-0.535) | 0.983 (0.968-0.994) | 0.889 (0.861-0.917) | 0.552 (0.418-0.657) | 0.917 (0.886-0.951) | 0.730 (0.632-0.855) |
|
| Word2vec/XGBoost | 0.767 (0.625-0.884) | 0.471 (0.359-0.586) | 0.972 (0.953-0.988) | 0.889 (0.858-0.917) | 0.584 (0.468-0.684) | 0.860 (0.817-0.924) | 0.659 (0.550-0.782) |
|
| ||||||||
|
| BERT | 0.679 (0.500-0.848) | 0.422 (0.289-0.568) | 0.976 (0.960-0.990) | 0.917 (0.891-0.943) | 0.521 (0.379-0.658) | 0.796 (0.704-0.912) | 0.561 (0.434-0.741) |
|
| Word2vec/SVM | 0.778 (0.429-1.000) | 0.156 (0.057-0.267) | 0.995 (0.987-1.000) | 0.905 (0.877-0.929) | 0.259 (0.102-0.406) | 0.817 (0.756-0.881) | 0.393 (0.203-0.534) |
|
| Word2vec/XGBoost | 0.571 (0.286-0.833) | 0.178 (0.068-0.300) | 0.984 (0.971-0.995) | 0.898 (0.868-0.924) | 0.271 (0.118-0.415) | 0.780 (0.706-0.850) | 0.330 (0.154-0.436) |
aAUROCC: area under the receiver operating characteristic curve.
bAUPRC: area under precision-recall curve.
cBERT: bidirectional encoder representations from transformers.
dSVM: support vector machine.
eXGBoost: extreme gradient boosting.
Figure 2Area under the receiver operating characteristic curves and precision-recall curves for the best models of pain interference domain (left column) and fatigue domain (right column) by 3 symptom attributes. BERT: bidirectional encoder representations from transformers; PR: precision recall; ROC: receiver operating characteristic; SVM: support vector machine.