| Literature DB >> 23847423 |
Sunghwan Sohn1, Cheryl Clark, Scott R Halgrim, Sean P Murphy, Siddhartha R Jonnalagadda, Kavishwar B Wagholikar, Stephen T Wu, Christopher G Chute, Hongfang Liu.
Abstract
A large amount of medication information resides in the unstructured text found in electronic medical records, which requires advanced techniques to be properly mined. In clinical notes, medication information follows certain semantic patterns (eg, medication, dosage, frequency, and mode). Some medication descriptions contain additional word(s) between medication attributes. Therefore, it is essential to understand the semantic patterns as well as the patterns of the context interspersed among them (ie, context patterns) to effectively extract comprehensive medication information. In this paper we examined both semantic and context patterns, and compared those found in Mayo Clinic and i2b2 challenge data. We found that some variations exist between the institutions but the dominant patterns are common.Entities:
Keywords: electronic medical record; medication extraction; natural language processing
Year: 2013 PMID: 23847423 PMCID: PMC3702197 DOI: 10.4137/BII.S11634
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Semantic type annotation of medication information in mayo and i2b2.
| i2b2 | Mayo | ||
|---|---|---|---|
|
|
| ||
| Semantic type | Definition | Semantic type | Definition |
| medication (m) | medication name | medication (m) | medication name |
| dosage (do) | the amount of a single medication used in each administration (e.g., | dosage (do) | how many of each medication the patient is taking (e.g., “ |
| strength number (sn) | e.g., “ | ||
| strength unit (su) | e.g., “ | ||
| frequency (f) | how often each dose of the medication should be taken (e.g., “ | freq number (fn) | e.g., “ |
| freq unit (fu) | e.g., “ | ||
| mode (mo) | the route for administering the medication (e.g., “ | route (mo) | same as the i2b2 |
| duration (du) | how long the medication is to be administered (e.g., “ | duration (du) | same as the i2b2 but not include preposition (e.g., “ |
| form | the physical appearance of the medication (e.g., | ||
Notes: Parenthesized strings in the semantic type column are abbreviations.
In i2b2, there is no form annotation but it may be part of the medication or dosage (eg, m: “Aspirin tablet”, do: “1 tab”).
Examples of medication semantic patterns and symbols.
| Symbol | Semantic pattern | Example |
|---|---|---|
| A | m | <m>Aspirin</m> |
| B | m|do|mo|f | <m>Zocor</m> <do>5 mg</do> <mo>p.o.</mo> <f>q.h.s.</f> |
| C | m|do|f | <m>Glipizide</m> <do>5 mg</do> <f>b.i.d</f>. |
| D | m|mo | <m>Lasix</m> <mo>drip</mo> |
| E | m|do | <m>Coumadin</m> <do>alternating doses of 4 mg</do> |
| F | mo|m | <mo>IV</mo> <m>ACE inhibitors</m> |
| G | do|m | <do>one unit</do> of <m> packed red cells</m> |
| H | m|du | <m>Dilantin</m> <du>for less than a year</du> |
| I | m|f | <m>Pepcid AC</m> <f>QHS</f> |
| J | m|do|mo|f|du | <m>KEFLEX (CEPHALEXIN)</m> <do>250 MG</do> <mo>PO</mo> |
| <f>QID</f> <du>X 12 doses</du> | ||
| K | m|do|mo | <m>acetylsalicylic acid</m> <do>325 mg</do> <mo>p.o.</m> |
| L | m|do|f|du | <m>Lasix</m> <do>40 mg</do> <f>QD</f> <du>x3 doses</du> |
| M | du|m | <du>two days</du> of <m>Indocin</m> |
| N | f|m | stable on <f>b.i.d.</f> <m>torsemide</m> |
| O | m|mo|f | <m>nitroglycerin</m> <mo>sublingual</m> <f>p.r.n.</f> |
| P | m|mo|do|f | <m>Lovenox</m> <mo>subcutaneously</mo> <do>90 mg</do> <f>daily</f> |
| Q | do|mo|m | <do>10 mg</do> of <mo>IV</mo> <m>Lopressor</m> |
| R | mo|m|du | <mo>IV</mo> <m>cefotaxime</m> <du>for the 7-day period</du> |
| S | do|m|f | <do>low dose</do> <m>dilaudid</m> <m>as needed</m> |
| T | m|do|du | <m>heparin</m> <do>500 units</do> <du>for 48 hours</du> |
Note: In example, semantic types are annotated as XML-style tags.
Section-level statistics for mayo medication semantic patterns.
| Section | # medications | # patterns | Patterns |
|---|---|---|---|
| Impression/report/plan | 227 | 53 | m (97), m|sn|su|fu (42), m|sn|su (8), m|fu (5), |
| Current medications | 183 | 38 | m (52), m|sn|su|fu (28), m|fu (11), m|sn|su|mo|fu (9), m|sn|su|do|fu (9), m|sn|su|do|fn (8), m|sn|su|fn (6), m|sn|su (6), m|sn|su|fn|fu (5) |
| History of present illness | 139 | 26 | m (82), m|sn|su|fu (14), m|sn|su|fn (8), mo|m (5) |
| Allergies | 39 | 1 | m (39) |
| Past medical/surgical history | 21 | 4 | m (15) |
| Problem oriented hosp. course | 19 | 8 | m (10) |
| Social history | 10 | 3 | m (8) |
| Chief complaint/reason for visit | 7 | 3 | m (5) |
| The other sections | 14 | 1 | m (14) |
Note:
Number within ( ) denotes # medications of a given pattern.
Figure 1i2b2 medication semantic patterns (top 95%).
Figure 2Mayo medication semantic patterns mapped to i2b2 (top 95%).
Figure 3Mayo vs. i2b2 medication semantic patterns in list (top 95%).
Figure 4Mayo vs. i2b2 medication semantic patterns in narrative (top 95%).
Figure 5i2b2 vs. Mayo vs. Mayo10K medication semantic patterns with moving average trend (top 95%).
Medication context patterns in mayo (number denotes frequency).
| Partial pattern | Status change pattern |
|---|---|
| 22 per|a <fu> | 6 <m> |
| 12 for (period of|an additional) <du> | 1 <m> |
| 7 <fu> times (a) <fu> | 3 <m> |
| 6 <do>|<mo> every <fn> | |
| 5 <m> dose|cycle (of|at) <sn> | |
| 5 <m> to <sn> | |
| 5 <su>|<du>|<fm> of <m> |
Note:
Lexical variations were normalized.
Medication context patterns in i2b2 (number denotes frequency).
| Exact pattern | Partial pattern | Status change pattern |
|---|---|---|
| 71 <do> of <m> | 141 <do>|<du> of <m> | 16 <m> INCREASE indication <do> |
| 28 <du> of <m> | 21 <m> at <do> | 2 <m> INCREASE indication <f> |
| 9 <do> of <m> <f> | 8 <m> (dose) from <do> | 1 <m><do> INCREASE indication <f> |
| 8 <du> of <mo> <m> | 5 <m> from <do> | 8 <m> DECREASE indication <do> |
| 6 <m> at <do> | 6 <m> to <do> | 1 <m> DECREASE indication <du> |
| 6 <do> of <mo> <m> | 4 <m> with <do> | 1 <m> DECREASE indication <f> |
| 5 <m> at <do> <mo> <f> | 8 <m> START indication <do> | |
| 5 <m> at <do> <f> | 3 <m> START indication <du> | |
| 5 <do> of <m> <du> | 1 <du> START indication <m> |
Note:
Lexical variations were normalized.