| Literature DB >> 34814905 |
Tannaz Khaleghi1, Alper Murat2, Suzan Arslanturk3.
Abstract
BACKGROUND: In surgical department, CPT code assignment has been a complicated manual human effort, that entails significant related knowledge and experience. While there are several studies using CPTs to make predictions in surgical services, literature on predicting CPTs in surgical and other services using text features is very sparse. This study improves the prediction of CPTs by the means of informative features and a novel re-prioritization algorithm.Entities:
Keywords: Current procedure terminology (CPT) code; Ensemble learning; Importance weight; Machine learning; Multi-class classification; Random Forest; Surgery code
Mesh:
Year: 2021 PMID: 34814905 PMCID: PMC8612004 DOI: 10.1186/s12911-021-01665-w
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Random Forest classification algorithm
Fig. 2Algorithm for calculating class weight when word w is in both dictionaries, and (named as and ). C(w) and I(w) refer to the count and importance measure of the word w
Fig. 3Algorithm for calculating class weight when word w exists in but not in (named as and ). C(w) and I(w) refer to the count and importance measure of the word w
Fig. 4The class weight recalculation procedure. Algorithms 2 and 3 are calculating new weights based on the observed CPT instances
Fig. 5a The top figure shows CPT labels distribution for CPT codes with frequency greater than 20 in specialty data. b The bottom figure represents the average scheduled duration of CPTs with frequency more than 20 in each specialty dataset
CPT prediction accuracy measures under different filter combinations and accuracy calculation approaches for each specialty
| Algorithm: | Random Forest | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Compare to: | CPT_RVUmax | CPT_set | ||||||||||||||
| Compare with predicted: | 1 CPT | 2 CPTs | 1 CPT | 2 CPTs | ||||||||||||
| Filters: | C | F1 | F2 | F1F2 | C | F1 | F2 | F1F2 | C | F1 | F2 | F1F2 | C | F1 | F2 | F1F2 |
| Cardio | 45 | 53 | 57 | 62 | 49 | 59 | 58 | 71 | 50 | 54 | 60 | 67 | 54 | 62 | 60 | 72 |
| General | 68 | 78 | 77 | 87 | 71 | 73 | 76 | 81 | 69 | 78 | 79 | 88 | 71 | 73 | 76 | 81 |
| Urology | 52 | 62 | 56 | 71 | 55 | 67 | 63 | 72 | 54 | 63 | 56 | 71 | 57 | 69 | 65 | 74 |
| OBGYN | 57 | 74 | 64 | 81 | 59 | 74 | 65 | 82 | 58 | 74 | 67 | 82 | 61 | 74 | 68 | 84 |
| Other | 36 | 51 | 54 | 68 | 37 | 53 | 56 | 69 | 38 | 52 | 56 | 69 | 40 | 54 | 58 | 70 |
| Average | 52 | 64 | 62 | 74 | 54 | 65 | 64 | 75 | 54 | 64 | 64 | 75 | 57 | 66 | 65 | 76 |
Different data filtering methods for exploratory result analysis
| Filter label | Description |
|---|---|
| C | Complete dataset |
| Overlook small CPT difference | |
| Eliminate rare CPT occurrences | |
Fig. 6Plot of average CPT prediction accuracy under optimal settings