| Literature DB >> 32381026 |
Freja Karuna Hemmingsen Sørup1, Søren Brunak1, Robert Eriksson2.
Abstract
BACKGROUND: Most structured clinical data, such as diagnosis codes, are not sufficient to obtain precise phenotypes and assess disease burden. Text mining of clinical notes could provide a basis for detailed profiles of phenotypic traits. The objective of the current study was to determine whether drug dose, regardless of polypharmacy, is associated with the length of clinical notes, and to determine the frequency of adverse events per word in clinical notes.Entities:
Keywords: Adverse event; Antipsychotic drugs; Natural language processing; Text mining
Mesh:
Substances:
Year: 2020 PMID: 32381026 PMCID: PMC7204249 DOI: 10.1186/s12874-020-00993-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Initially dose interval groups were formed by binning equivalent doses. The three equivalents consisted of separate dosage intervals, all starting from 0. DDDs were binned in intervals of 0.5 DDD, chlorpromazine equivalents were binned in intervals of 100 mg, and olanzapine equivalents were binned in intervals of 5 mg. The length of the clinical notes was analyzed in each binned dosage interval. Three equally wide dose intervals (low, mid, high) were defined to investigate whether the number of potential adverse events per clinical word was associated with the total normalized dose. Intervals containing less than 10 patients were excluded from all analyses, intervals containing less than 100 patients were only excluded from the analysis comparing the drug dose with the length of clinical notes
Drugs covered by the conversion methods
| Drug | DDD | Olanzapine equivalents | Chlorpromazine equivalents |
|---|---|---|---|
| YES | YES | NO | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | NO | NO | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | NO | NO | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | YES | NO | |
| YES | YES | YES | |
| YES | YES | YES | |
| YES | YES | NO | |
| YES | YES | NO | |
| YES | YES | YES | |
| YES | YES | YES | |
| YES | YES | NO |
Patient characteristics for the cohorts covered by the three normalization methods. Diagnoses are coded in International Classification of Diseases version 10
| Normalization method | DDD n (%) | Olanzapine equivalents n (%) | Chlorpromazine equivalents n (%) |
|---|---|---|---|
| Number of patients | 1589 | 1539 | 438 |
| Male sex | 1043 (65.6) | 1010 (65.6) | 287 (65.5) |
| Mean age in years (SD) | 40.3 (6.3) | 40.1 (6.4) | 40.1 (6.3) |
| F10 Alcohol related disorders | 666 (41.9) | 645 (41.9) | 149 (34.0) |
| F11 Opioid related disorders | 266 (16.7) | 256 (16.6) | 66 (15.1) |
| F12 Cannabis related disorders | 476 (30.0) | 458 (29.8) | 130 (29.7) |
| F13 Sedative, hypnotic, or anxiolytic related disorders | 255 (16.0) | 247 (16.0) | 57 (13.0) |
| F14 Cocaine related disorders | 189 (11.9) | 182 (11.8) | 41 (9.4) |
| F15 Other stimulant related disorders | 126 (7.9) | 122 (7.9) | 28 (6.4) |
| F19 Other psychoactive substance related disorders | 225 (14.2) | 218 (14.2) | 60 (13.7) |
| F20 Schizophrenia | 740 (46.6) | 707 (45.9) | 197 (45.0) |
| F21 Schizotypal disorder | 105 (6.6) | 103 (6.7) | 28 (6.4) |
| F22 Delusional disorder | 56 (3.5) | 54 (4.9) | 20 (4.6) |
| F23 Brief psychotic disorder | 25 (1.6) | 25 (1.6) | 10 (2.3) |
| F25 Schizoaffective disorder | 50 (3.1) | 49 (4.9) | 13 (3.0) |
| F31 Bipolar disorder | 98 (6.2) | 97 (6.3) | 26 (5.9) |
| F32 Mayor depressive disorder, single episode | 120 (7.6) | 119 (7.7) | 18 (4.1) |
| F33 Mayor depressive disorder, recurrent | 197 (12.4) | 193 (12.5) | 40 (9.1) |
| F41 Other anxiety disorders | 162 (10.2) | 159 (10.3) | 33 (7.5) |
| F60 Specific personality disorders | 329 (20.7) | 325 (21.1) | 80 (18.3) |
| Z046 Encounter for general psychiatric examination, requested by authority | 266 (16.7) | 250 (16.2) | 66 (15.1) |
Fig. 2Violin plots of antipsychotic dose load and number of words in the clinical notes per day using the three equalization methods. The medians of the distributions are represented by black dots. The width of each area represents a dose interval of, respectively, 0.5 DDD, 5 mg olanzapine equivalents, or 100 mg chlorpromazine equivalents. The same intervals were used to bin data from notes recorded by all staff categories (physicians, nursing staff, physical therapists, occupational therapists, psychologists, social workers, and secretaries), notes by physicians, and notes by nursing staff. The daily note length by all staff, physicians and nursing staff are plotted individually. Each value of the note length originates from zero and the values are not additive. Intervals containing less than 100 patients are not plotted
Fig. 3Potential adverse drug events per word recorded in the clinical narratives. Three dose intervals were selected for each of the three normalization methods