| Literature DB >> 31021325 |
Liyuan Zhou1,2, Hanna Suominen1,2,3,4, Tom Gedeon1.
Abstract
BACKGROUND: Deep learning (DL) has been widely used to solve problems with success in speech recognition, visual object recognition, and object detection for drug discovery and genomics. Natural language processing has achieved noticeable progress in artificial intelligence. This gives an opportunity to improve on the accuracy and human-computer interaction of clinical informatics. However, due to difference of vocabularies and context between a clinical environment and generic English, transplanting language models directly from up-to-date methods to real-world health care settings is not always satisfactory. Moreover, the legal restriction on using privacy-sensitive patient records hinders the progress in applying machine learning (ML) to clinical language processing.Entities:
Keywords: artificial intelligence; computer systems; deep learning; information storage and retrieval; medical informatics; nursing records; patient handoff
Year: 2019 PMID: 31021325 PMCID: PMC6658232 DOI: 10.2196/11499
Source DB: PubMed Journal: JMIR Med Inform
Figure 1A processing pipeline that transforms verbal clinical handover information into electronic structured records automatically.
Figure 2Descriptive statistics of text snippets highlighted in the training, validation, and test set.
Figure 3Transfer learning model structure.
Word embeddings training corpora.
| Corpus | Size | Source |
| English Wikipedia | 3.4 billion words | Wikimedia downloads [ |
| UMBCa | >3 billion words | UMBC WebBase corpus [ |
| One Billion | 0.8 billion words | One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling [ |
| I2B2 | 18,082 unique words | I2B2 NLPb research data sets [ |
| PubMed | 27 million records | PubMed resources [ |
| PubMed Central | 1 million articles | PubMed resources [ |
| National Information and Communications Technology Australia Train | 101 records | Hospital handover forms [ |
aUMBC: University of Maryland, Baltimore County.
bNLP: natural language processing.
Mapping of entity types between the source and target corpora.
| In-domain source: Bolt, Beranek and Newman | Out-domain source: I2B2 | Target: NICTAa Clinical Handover |
| PERSON | PATIENT | PATIENT_INTRODUCTION/Given_names |
| PERSON | PATIENT | PATIENT_INTRODUCTION/Last_name |
| PERSON | DOCTOR | PATIENT_INTRODUCTION/Under_Dr:Given_names |
| PERSON | DOCTOR | PATIENT_INTRODUCTION/Under_Dr:Last_name |
| PERSON | DOCTOR | APPOINTMENTS/Clinician:Last_name |
| PERSON | DOCTOR | APPOINTMENTS/Clinician: Given_names |
| ORGANIZATION:HOSPITAL | HOSPITAL | APPOINTMENTS/Hospital |
| DATE:AGE | —b | PATIENT_INTRODUCTION/Age_in_years |
| PER_DESC | — | PATIENT_INTRODUCTION/Gender |
| PER_DESC | — | APPOINTMENTS/Clinician Title |
| GPE:CITY | LOCATION | APPOINTMENTS/City |
| DATE:DATE | DATE | APPOINTMENTS/Day |
| TIME | — | APPOINTMENTS/Time |
| CARDINAL | ID | PATIENT_INTRODUCTION/Current_room |
| CARDINAL | ID | PATIENT_INTRODUCTION/Current_bed |
| PRODUCT:OTHER | — | Medication/Medicine |
| SUBSTANCE:DRUG | — | Medication/Medicine |
| QUANTITY:3D (volume) | — | Medication/Dosage |
| QUANTITY:OTHER | — | Medication/Dosage |
| QUANTITY:TEMPERATURE | — | My_shift/Status |
| QUANTITY:WEIGHT | — | My_shift/Status |
| SUBSTANCE:FOOD | — | My_shift/Input_diet |
| FACILITY | — | APPOINTMENTS/Ward |
| DISEASE | — | PATIENT_INTRODUCTION/Admission_reason/diagnosis |
| DISEASE | — | PATIENT_INTRODUCTION/Chronic_condition |
| DISEASE | — | PATIENT_INTRODUCTION/Disease/problem_history |
aNICTA: National Information and Communications Technology Australia.
bDoes not contain any matching label from the source domain.
Results of transfer learning compared with baseline systems.
| Method | MacPreca | MacRecb | MacF1c | MicPrecd | MicRece | MicF1f |
| Trans_BBN | 0.547 | 0.516 | ||||
| Trans_I2B2 | 0.481 | 0.390 | 0.392 | 0.565 | 0.471 | 0.514 |
| TUC-MI-B | 0.493 | 0.369 | 0.382 | 0.500 | 0.505 | 0.503 |
| ECNU_ICA-A | 0.493 | 0.406 | 0.374 | 0.510 | 0.522 | 0.516 |
| General+I2B2+train | 0.477 | 0.361 | 0.354 | 0.483 | ||
| I2B2+train | 0.443 | 0.367 | 0.354 | 0.604 | 0.484 | 0.537 |
| General | 0.429 | 0.356 | 0.345 | 0.606 | 0.478 | 0.535 |
| LQRZ-B | 0.425 | 0.383 | 0.345 | 0.490 | 0.517 | 0.503 |
| General+PubMed+PMC | 0.409 | 0.346 | 0.334 | 0.606 | 0.474 | 0.532 |
| Unigram | 0.393 | 0.292 | 0.311 | 0.574 | 0.448 | 0.503 |
| TUC-MI-A | 0.423 | 0.300 | 0.311 | 0.503 | 0.443 | 0.471 |
| LQRZ-A | 0.411 | 0.307 | 0.308 | 0.563 | 0.472 | 0.514 |
| ECNU_ICA-B | 0.428 | 0.292 | 0.297 | 0.581 | 0.459 | 0.513 |
| National Information and Communications Technology Australia | 0.435 | 0.233 | 0.246 | 0.433 | 0.368 | 0.398 |
| Random | 0.018 | 0.028 | 0.019 | 0.018 | 0.030 | 0.022 |
| Majority | 0.000 | 0.029 | 0.001 | 0.016 | 0.027 | 0.020 |
aMacro averaged precision.
bMacro averaged recall.
cMacro averaged F1.
dMicro averaged precision.
eMicro averaged recall.
fMicro averaged F1.
gItalics indicate the best result over the column.
Results of subclasses when transfer learning improved in the baseline system (F1 score).
| Entity type | Instances (n) | General | Trans_I2B2 | Trans_BBN |
| PATIENT_INTRODUCTION: Age (years) | 246 | 0.948 | 0.879 | |
| PATIENT_INTRODUCTION: Gender | 88 | 0.826 | 0.896 | |
| PATIENT_INTRODUCTION: Admission reason | 412 | 0.214 | 0.311 | |
| PATIENT_INTRODUCTION: Chronic condition | 70 | 0.000 | 0.081 | |
| PATIENT_INTRODUCTION: Disease/problem history | 147 | 0.016 | 0.044 | |
| PATIENT_INTRODUCTION: Care plan | 36 | 0.069 | 0.129 | |
| PATIENT_INTRODUCTION: Allergy | 14 | 0.267 | 0.500 | |
| APPOINTMENTS: Time | 28 | 0.114 | 0.400 | |
| APPOINTMENTS: Place: Ward | 3 | 0.000 | 0.000 | |
| APPOINTMENTS: Status | 159 | 0.111 | 0.132 | |
| FUTURE_CARE: Alert/warning/abnormal result | 59 | 0.000 | 0.087 | |
| FUTURE_CARE: Goal/task to be completed/expected outcome | 496 | 0.000 | 0.068 | |
| FUTURE_CARE: Discharge/transfer place | 89 | 0.327 | 0.288 | |
| MY_SHIFT: Status | 481 | 0.570 | 0.638 | |
| MY_SHIFT: Input/diet | 101 | 0.413 | 0.783 | |
| MY_SHIFT: Output/diuresis/bowel movement | 52 | 0.286 | 0.396 | |
| MY_SHIFT: Wounds/skin | 55 | 0.444 | 0.357 | |
| MY_SHIFT: Activities of daily living | 245 | 0.579 | 0.748 | |
| MY_SHIFT: Other observation | 361 | 0.177 | 0.202 | |
| MEDICATION: Medicine | 156 | 0.450 | 0.495 | |
| MEDICATION: Status | 68 | 0.034 | 0.085 |
aItalics indicate the best result over the column.