| Literature DB >> 34037526 |
Hyeon Joo1,2, Michael Burns2, Sai Saradha Kalidaikurichi Lakshmanan2, Yaokun Hu2, V G Vinod Vydiswaran1,3.
Abstract
BACKGROUND: Administrative costs for billing and insurance-related activities in the United States are substantial. One critical cause of the high overhead of administrative costs is medical billing errors. With advanced deep learning techniques, developing advanced models to predict hospital and professional billing codes has become feasible. These models can be used for administrative cost reduction and billing process improvements.Entities:
Keywords: CPT classification; machine learning; natural language processing; neural machine translation
Year: 2021 PMID: 34037526 PMCID: PMC8190648 DOI: 10.2196/22461
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
Figure 1The architecture of an automated current procedural terminology coding system based on the Transformer model. CPT: current procedural terminology.
Figure 2The flowchart of data selection and rules to split the training, validation, and holdout sets. CPT: current procedural terminology.
The prevalence of anesthesiology procedures and services categorized by area of the body in the training, validation, and holdout data sets (N=187,927).
| Body parta | CPTb codes | Data setc, n (%) | ||
|
|
| Training setd (n=117,373) | Validation setd (n=29,340) | Holdout sete (n=41,214) |
| Head | 00100-00222 | 27,863 (23.74) | 6964 (23.73) | 9110 (22.10) |
| Neck | 00300-00352 | 9403 (8.01) | 2352 (8.01) | 2822 (6.84) |
| Thorax (chest and shoulder) | 00400-00474 | 5554 (4.73) | 1388 (4.73) | 1710 (4.14) |
| Intrathoracic | 00500-00580 | 8781 (7.48) | 2194 (7.47) | 2903 (7.04) |
| Spine and spinal cord | 00600-00670 | 2606 (2.22) | 651 (2.21) | 883 (2.14) |
| Upper abdomen | 00700-00797 | 12,083 (10.29) | 3022 (10.29) | 5518 (13.38) |
| Lower abdomen | 00800-00882 | 13,338 (11.36) | 3334 (11.36) | 6547 (15.88) |
| Perineum | 00902-00952 | 9853 (8.39) | 2464 (8.39) | 3001 (7.28) |
| Pelvis (except hip) | 01112-01173 | 551 (0.46) | 137 (0.46) | 194 (0.47) |
| Upper leg (except knee) | 01200-01274 | 2390 (2.03) | 595 (2.02) | 674 (1.63) |
| Knee and popliteal area | 01320-01444 | 2629 (2.24) | 658 (2.24) | 792 (1.92) |
| Lower leg (below knee) | 01462-01522 | 2117 (1.80) | 530 (1.80) | 606 (1.47) |
| Shoulder and axilla | 01610-01680 | 2084 (1.77) | 520 (1.77) | 668 (1.62) |
| Upper arm and elbow | 01710-01782 | 785 (0.66) | 196 (0.66) | 192 (0.46) |
| Forearm, wrist, and hand | 01810-01860 | 3035 (2.58) | 757 (2.58) | 870 (2.11) |
| Radiological procedure | 01916-01936 | 8709 (7.42) | 2179 (7.42) | 2829 (6.86) |
| Burn excisions or debridement | 01951-01953 | 505 (0.43) | 126 (0.42) | 105 (0.25) |
| Obstetric | 01958-01969 | 4969 (4.23) | 1243 (4.23) | 1722 (4.17) |
| Other procedure | 01990-01999 | 118 (0.10) | 30 (0.10) | 68 (0.16) |
aAnesthesiology current procedural terminology codes are categorized based on the area of the body part.
bCPT: current procedural terminology.
cThe percentage may not sum up to 100 because of rounding.
dThe training and validation sets were stratified and split to maintain the same prevalence of procedures.
eThe holdout set is new data collected for 6 months to prevent data leakage.
Descriptive statistics of operative procedure text, preoperative diagnosis, and preferred terms.
| Data set | Number of unique CPTa codes | Number of unique procedure texts | Number of tokens in procedure textb | Number of tokens in preoperative diagnosisb | Number of tokens in preferred termsb | ||||||
|
|
|
| Mean (SD) | Range | Mean (SD) | Range | Mean (SD) | Range | |||
| Training | 252 | 13,847 | 5.12 (3.57) | 1-60 | 4.12 (2.5) | 0-15 | 13.23 (6.44) | 3-46 | |||
| Validation | 231 | 6012 | 5.15 (3.64) | 1-51 | 4.11 (2.5) | 0-13 | 13.23 (4.45) | 3-41 | |||
| Holdout | 224 | 6731 | 4.98 (3.52) | 1-60 | 4.01 (2.52) | 0-13 | 13.24 (6.18) | 3-41 | |||
aCPT: current procedural terminology.
bThe unit of descriptive statistics is token (word).
Figure 3The distribution of current procedural terminology codes in the training, testing, and holdout sets, sorted by most to least frequent codes.
Performance comparison of support vector machine, long short-term memory, and neural machine translation models with raw, curated procedure text and combined, curated procedure text and preoperative diagnosis.
| Model | Top-1 accuracya (95% CI) | Top-3 accuracyb (95% CI) | |||
|
| Validation set | Holdout set | Validation set | Holdout set | |
|
| |||||
|
| SVMd | 83.61 (83.07-84.16) | 81.19 (80.63-81.75) | 97.56 (97.36-97.76) | 95.64 (95.35-95.93) |
|
| LSTMe | 81.86 (81.33-82.40) | 80.94 (80.42-81.46) | 95.38 (95.06-95.71) | 95.72 (95.44-95.99) |
|
| NMTf | 81.68 (81.14-82.21) | 81.64 (81.11-82.18) | 95.27 (94.96-95.58) | 95.60 (95.30-95.89) |
|
| |||||
|
| SVM | 83.38 (82.85-83.90) | 81.27 (80.72-81.82) | 97.45 (97.23-97.67) | 95.75 (95.47-96.04) |
|
| LSTM | 81.81 (81.28-82.34) | 81.07 (80.54-81.59) | 95.32 (95.00-95.64) | 95.67 (95.40-95.95) |
|
| NMT | 81.72 (81.18-82.26) | 81.71 (81.18-82.24) | 95.41 (95.11-95.71) | 95.69 (95.40-95.98) |
|
| |||||
|
| SVM | 87.62 (87.15-88.09) | 84.30 (83.81-84.79) | 99.16 (99.03-99.29) | 95.88 (95.60-96.15) |
|
| LSTM | 83.52 (83.00-84.04) | 83.70 (83.20-84.20) | 95.82 (95.53-96.12) | 95.93 (95.65-96.20) |
|
| NMT | 82.43 (81.90-82.96) | 82.80 (82.31-83.29) | 94.75 (94.44-95.06) | 95.06 (94.77-95.35) |
aTop-1 accuracy is the accuracy of models on the best current procedural terminology code predicted, equivalent to the F1-score micro.
bTop-3 accuracy is the accuracy of models if the true current procedural terminology is within the top three best codes predicted.
cRaw procedure text is manually entered by physicians without text preprocessing.
dSVM: support vector machine.
eLSTM: long short-term memory.
fNMT: neural machine translation.
gCurated procedure text is cleaned text with preprocessing techniques.
hThe curated procedure text is concatenated with preoperative diagnosis to training models for current procedural terminology prediction.
Figure 4The top-1 and top-3 accuracy comparison based on the training sample size. LSTM: long short-term memory; NMT: neural machine translation; SVM: support vector machine.
Figure 5Bilingual Evaluation Understudy scores of imbalanced labels for translating manually entered procedure text into preferred terms in step 1 of the neural machine translation–based model. BLEU: Bilingual Evaluation Understudy.
The neural machine translation–based model’s source (input) and target (output) translation examples from the holdout data set. The translation task is to convert manually entered procedure text in electronic health records to preferred terms in the Unified Medical Language System.
| Current procedural terminology code | Current procedural terminology descriptiona | Example source textb | Target textc | |
|
| ||||
|
| 00868 (group 7) | Anesthesia for extraperitoneal procedures in lower abdomen, including urinary tract; renal transplant (recipient) | Right kidney transplant cadaveric donor | Anesthesia for extraperitoneal procedures in lower abdomen including urinary tract renal transplant recipient |
|
| 01610 (group 9) | Anesthesia for all procedures on nerves, muscles, tendons, fascia, and bursae of shoulder and axilla | Midline wide local excision melanoma iolm sln axillary inguinal | Anesthesia for all procedures on nerves muscles tendons fascia and bursae of shoulder and axilla |
|
| 00142 (group 10) | Anesthesia for procedures on eye; lens surgery | Left phacoemulsification intraocular lens | Anesthesia for procedures on eye lens surgery |
|
| 00104 (group 10) | Anesthesia for electroconvulsive therapy | Electroconvulsive therapy | Anesthesia for electroconvulsive therapy |
|
| 01220 (group 5) | Anesthesia for all closed procedures involving upper two thirds of femur | Left closed reduction cast application | Anesthesia for all closed procedures involving hip joint |
|
| ||||
|
| 01402 (group 8) | Anesthesia for open or surgical arthroscopic procedures on knee joint; total knee arthroplasty | Right total knee arthroplasty knee arthritis | Anesthesia for open or surgical arthroscopic procedures on knee joint total knee arthroplasty |
|
| 01922 (group 10) | Anesthesia for noninvasive imaging or radiation therapy | Magnetic resonance imaging cardiac anesthesia hypoplastic left heart syndrome | Anesthesia for noninvasive imaging or radiation therapy |
|
| 00402 (group 9) | Anesthesia for procedures on the integumentary system on the extremities, anterior trunk, and perineum; reconstructive procedures on breast (eg, reduction or augmentation mammoplasty, muscle flaps) | Left breast tissue expander placement unilateral tissue expander infection | Anesthesia for procedures on the integumentary system on the extremities anterior trunk and perineum reconstructive procedures on breast eg reduction or augmentation mammoplasty muscle flaps |
|
| 00812 (group 10) | Anesthesia for lower intestinal endoscopic procedures, endoscope introduced distal to duodenum; screening colonoscopy | Colonoscopy anesthesia encounter screening malignant neoplasm colon | Anesthesia for lower intestinal endoscopic procedures endoscope introduced distal to duodenum screening colonoscopy |
|
| 01200 (group 5) | Anesthesia for all closed procedures involving hip joint | Left closed reduction cast application left prosthetic hip dislocation | Anesthesia for all closed procedures involving hip joint |
aThe current procedural terminology description is the gold standard translation.
bSource text is an example of the preprocessed input of the neural machine translation model.
cThe target text is the translated sentence from the neural machine translation model.
dThe preoperative diagnosis in the combined text is underlined to distinguish it from the procedure text.