Tian Bai, Ashis Kumar Chanda, Brian L Egleston, Slobodan Vucetic.
Abstract
BACKGROUND: There has been increasing interest in learning low-dimensional vector representations of medical concepts from Electronic Health Records (EHRs). Vector representations of medical concepts facilitate exploratory analysis and predictive modeling of EHR data to gain insights into patterns of care and health outcomes. EHRs contain structured data, such as diagnostic codes and laboratory tests, as well as unstructured free-text data in the form of clinical notes, which provide more detail about the condition and treatment of patients.
Keywords: Distributed representation; Electronic health records; Healthcare; Natural language processing
Year: 2018 PMID: 30537974 PMCID: PMC6290514 DOI: 10.1186/s12911-018-0672-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1 The framework of Skip-gram. Each word is used to predict its neighbours in a small context window. In this example, the size of the context window is 2.
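The pair generation described in the Fig. 1 caption can be sketched as follows. This is a minimal illustration only, assuming a tokenized sentence as input; the function name `skipgram_pairs` and the example sentence are hypothetical and not taken from the paper.

```python
def skipgram_pairs(tokens, window=2):
    """Return (target, context) pairs: each token paired with every
    neighbour at most `window` positions to its left or right."""
    pairs = []
    for i, target in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a token is not its own context
                pairs.append((target, tokens[j]))
    return pairs


# Hypothetical example sentence, context window of size 2 as in Fig. 1
sentence = ["patient", "admitted", "with", "liver", "failure"]
pairs = skipgram_pairs(sentence, window=2)
```

In a Skip-gram model, each such pair would serve as one training example in which the target word is used to predict the context word.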
Fig. 2 The framework of JointSkip-gram. (a) Each code is used to predict all other codes and words in the same visit. (b) Each word is used to predict all codes in the same visit and its neighbour words in a small context window, to keep its syntactic properties.
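The two prediction rules in the Fig. 2 caption can be illustrated with a small pair-generation sketch. It assumes that a visit is given as a list of codes plus a single token sequence for its clinical note; the function name `joint_skipgram_pairs`, the default window size, and the example visit are hypothetical and are not the authors' implementation.

```python
def joint_skipgram_pairs(codes, words, window=2):
    """Return (input, output) training pairs for one visit.

    Assumption: `codes` are the visit's medical codes and `words` is the
    token sequence of the visit's clinical note.
    """
    pairs = []
    # (a) each code predicts all other codes and all words in the visit
    for i, code in enumerate(codes):
        for j, other in enumerate(codes):
            if j != i:
                pairs.append((code, other))
        for word in words:
            pairs.append((code, word))
    # (b) each word predicts all codes in the visit and its neighbour
    #     words inside the context window
    for i, word in enumerate(words):
        for code in codes:
            pairs.append((word, code))
        lo = max(0, i - window)
        hi = min(len(words), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((word, words[j]))
    return pairs


# Hypothetical visit with two codes and a three-word note
visit_codes = ["570", "348"]
visit_words = ["liver", "failure", "shock"]
joint_pairs = joint_skipgram_pairs(visit_codes, visit_words, window=2)
```

Because codes and words appear as both inputs and outputs, pairs of this kind place both vocabularies in one shared vector space, while the windowed word-to-word pairs retain local word order information.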
Most important 15 words (ranked by importance) for ICD-9 codes “570”, “174”, “295”, “348”, “311”, “042”
| 570 | 570 | 174 | 174 |
|---|---|---|---|
| Liver | Arrest | Metastatic | Breast |
| Hepatic | Pea | Mets | Pres |
| Cirrhosis | Cooling | Cancer | Mastectomy |
| Rising | Sun | Breast | Flap |
| Markedly | Arctic | Metastases | Mets |
| Shock | Rewarmed | Malignant | Ca |
| Lactate | Cooled | Metastasis | Cancer |
| Encephalopathy | Atrophine | Oncologist | Metastatic |
| Amps | Dopamine | Oncology | Chemotherapy |
| Picture | Rewarming | Chemotherapy | Malignant |
| Rise | Cardiac | Infiltrating | Oncologist |
| Elevated | Coded | Palliative | Polumoprhic |
| Cirrhotic | Continue | Tumor | Reversible |
| Bicarb | Prognosis | Melanoma | Mastectomies |
| Alcoholic | Ems | Mastectomy | Crisis |

| 295 | 295 | 348 | 348 |
|---|---|---|---|
| Schizophrenia | Schizophrenia | Hemorrhagic | Arrest |
| Psych | Paranoid | Herniation | Herniation |
| Bipolar | Psych | Temporal | Unresponsive |
| Suicide | Psychiatric | Cerebral | Corneal |
| Psychiatry | Disorders | Brain | Pupils |
| Kill | Personality | Hemorrhage | Brain |
| Paranoid | Hiss | Parietal | Cooling |
| Ideation | Guardian | Ganglia | Posturing |
| Psychiatrist | Psychiatry | Occipital | Head |
| Hallucinations | Hypothyroidism | Extension | Hemorrhage |
| Psychosis | Home | Surrounding | Noxious |
| Personality | Aloe | Head | Family |
| Sitter | Arrest | Effacement | Prognosis |
| Disorder | Pt | Ataxia | Pea |
| Abuse | Unresponsive | Burr | Gag |

| 311 | 311 | 042 | 042 |
|---|---|---|---|
| Patient | Depression | Aids | Aids |
| Abuse | Tablet | Viral | Immunodeficiency |
| Hallucinations | Blood | Fungal | Virus |
| Withdrawal | Daily | Opportunistic | Human |
| Ingestion | Campus | Bacterial | Viral |
| Questionable | Mg | Disseminated | Load |
| Thiamine | Garage | Immuno-deficiency | Cooling |
| Remote | Capsule | Tuberculosis | Partner |
| Alcohol | Building | Organisms | Acyclovir |
| Significant | Parking | Herpes | Thrush |
| Overdose | One | Undetectable | Fevers |
| Prior | Discharge | Acyclovir | Induced |
| Apparent | Normal | Detectable | Antigen |
| Depression | East | Chlamydia | Pneumonia |
| Although | Coherent | Syphilis | Blanket |
Disease description and frequency are listed in the brackets
Evaluation results by clinical experts
| Model | 570 | 174 | 295 | 348 | 311 | 042 |
|---|---|---|---|---|---|---|
| JointSkip-gram | 4 | 2 | 3 | 4 | 4 | 2 |
| LLDA | 0 | 2 | 1 | 0 | 0 | 2 |

| Model | 570 | 174 | 295 | 348 | 311 | 042 |
|---|---|---|---|---|---|---|
| JointSkip-gram | 2.25 | 0.75 | 0.75 | 1.25 | 3.25 | 0.75 |
| LLDA | 9.25 | 1.75 | 3 | 3.75 | 6.5 | 2.75 |
Most important 15 words (including nonstandard English words), ranked by importance, for ICD-9 code “570”
| Word | Description |
|---|---|
| | An organ that produces biochemicals necessary for digestion |
| Renal | Relating to the kidneys |
| Hepatorenal | A life-threatening medical condition that consists of rapid deterioration in kidney function |
| Crrt | CRRT is a dialysis modality used to treat critically ill, hospitalized patients |
| Vasopressin | A hormone synthesized |
| | Shock liver is a condition defined as an acute liver injury |
| Failure | Liver failure can occur gradually |
| Levophed | Injection |
| Ascites | Ascites is the abnormal buildup of fluid in the abdomen |
| Oliguric | A urine output |
| Pigtail | Pigtail drainage is used for liver abscess |
| Transplant | Liver transplant is a surgical procedure |
| Rifaximin | Antibiotic |
| | Cirrhosis is a late stage of scarring (fibrosis) of the liver |
| | Relating to the liver |
Most important 15 words (including nonstandard English words), ranked by importance, for ICD-9 code “174”
| Word | Description |
|---|---|
| Xeloda | A prescription medicine used to treat people with cancer |
| Tamoxifen | A medication that is used to prevent breast cancer |
| | A pathogenic agent’s spread from a primary site to a different site |
| | A treatment by the use of chemical substances |
| | A disease in which abnormal cells divide uncontrollably and destroy body tissue |
| Carboplatin | It is used to treat ovarian cancer |
| Onc | Abbreviation of oncologist |
| | A doctor who treats cancer |
| Taxol | It belongs to a class of chemotherapy drugs |
| Chemo | Short form of chemotherapy |
| Gemcitabine | Gemcitabine is an anti-cancer drug |
| | Abbreviation of metastasis |
| Compazine | This medication is used to treat severe nausea |
| | A medical care for relieving pain |
| Metastases | The development of secondary malignant growths |
Performance of predicting medical codes of the next visit
| Model | Top-20 recall | Top-30 recall | Top-40 recall |
|---|---|---|---|
| Concatenation-One | 0.489 ±0.004 | 0.590 ±0.004 | 0.661 ±0.004 |
| SVD | 0.478 ±0.004 | 0.588 ±0.004 | 0.652 ±0.004 |
| LDA | 0.431 ±0.004 | 0.530 ±0.004 | 0.605 ±0.004 |
| Codes-JointSG | 0.499 ±0.003 | 0.592 ±0.003 | 0.662 ±0.003 |
| Words-JointSG | 0.437 ±0.004 | 0.536 ±0.004 | 0.609 ±0.004 |
| Concatenation-JointSG | | | |
The average and standard error of Top-k recall (k=20, 30, 40) are provided
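For readers unfamiliar with the metric, the following is a minimal sketch of how a Top-k recall and its standard error could be computed. It assumes the common definition (the fraction of a visit's true codes that appear among the k highest-scored predicted codes) and a sample standard deviation divided by the square root of the number of values; the function names, the scores, and the codes in the example are hypothetical, and the source does not state over which units the reported average and standard error were taken.

```python
import math


def top_k_recall(scores, true_codes, k):
    """Fraction of `true_codes` found among the k highest-scored codes.

    `scores` maps each candidate code to a predicted score.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)[:k]
    hits = len(set(ranked) & set(true_codes))
    return hits / len(true_codes)


def mean_and_standard_error(values):
    """Mean and standard error (sample std / sqrt(n)) of a list of values."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)


# Hypothetical predicted scores and true codes for one next visit
scores = {"570": 0.9, "174": 0.1, "295": 0.7, "348": 0.4}
true_codes = {"570", "348"}
recall_at_2 = top_k_recall(scores, true_codes, k=2)
recall_at_3 = top_k_recall(scores, true_codes, k=3)
mean, stderr = mean_and_standard_error([recall_at_2, recall_at_3])
```

In this toy case only one of the two true codes is in the top 2, while both are in the top 3, which mirrors why recall in the table rises as k grows from 20 to 40.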
Fig. 3 Top-k recall (k=20, 30 and 40) for JointSkip-gram and Skip-gram. The error bars indicate the standard error.