| Literature DB >> 35608886 |
Chengyi Zheng1, Jonathan Duffy2, In-Lu Amy Liu1, Lina S Sy1, Ronald A Navarro3, Sunhea S Kim1, Denison S Ryan1, Wansu Chen1, Lei Qian1, Cheryl Mercado1, Steven J Jacobsen1.
Abstract
BACKGROUND: Shoulder injury related to vaccine administration (SIRVA) accounts for more than half of all claims received by the National Vaccine Injury Compensation Program. However, due to the difficulty of finding SIRVA cases in large health care databases, population-based studies are scarce.Entities:
Keywords: EHR; NLP; SIRVA; artificial intelligence; big data; causal relation; electronic health records; health; informatics; natural language processing; pharmacovigilance; population health; real-world data; shoulder injury related to vaccine administration; temporal relation; vaccine safety; vaccines
Mesh:
Substances:
Year: 2022 PMID: 35608886 PMCID: PMC9175103 DOI: 10.2196/30426
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Figure 1Flowchart showing selection of eligible vaccinations with presumptive shoulder injuries, application of natural language processing algorithm, and shoulder injury related to vaccine administration (SIRVA) case confirmation results (index date is vaccination date). ICD: International Classification of Diseases, 10th Revision, Clinical Modification; NLP: natural language processing.
Figure 2Flowchart to create data set for training and validation data sampling (group A: shoulder disorder diagnoses reported in shoulder injury related to vaccine administration [SIRVA] literature; group B: shoulder disorder diagnoses not previously reported in SIRVA literature; group C: shoulder symptom codes; group D: shoulder injury codes [ICD-10-CM chapter 19: Injury, poisoning and certain other consequences of external causes]). NLP: natural language processing; ICD-10: International Classification of Diseases, 10th Revision, Clinical Modification.
Figure 3Cross-sentence search query example. This query searches over a span of 4 sentences (4s in diagram) with a maximum number of 50 words (≤50w in diagram) in between query items. There are 2 nested relationship queries inside the outermost relationship search. The first query searches for shoulder conditions, and the second query searches for causality statement. We removed other contextual query items from diagram due to space limitations. w: week; s: sentence.
Types of causes associated with shoulder injuries.
| Order | Type of cause | Description |
| 1 | Vaccination | Specific vaccine name or general vaccine terms |
| 2 | Accident | Accidents such as auto accident, fall, hit |
| 3 | Work | Work-related injury |
| 4 | Other medical conditions | Medical conditions that can cause shoulder injury such as arthritis or chest pain radiating to the shoulder |
| 5 | Exercise | Exercise or sports-related injury |
| 6 | Daily activity | Injuries occurred during other daily activities such as lifting groceries, overuse, or side sleeping |
| 7 | Unknown | Insidious or unknown cause |
Error analyses on the validation dataset.
| Clinical text examples and the causes of Natural Language Processing (NLP) errors | ||
|
| ||
|
| 1 | “She has chronic pain—neck, low back, B/La shoulders. She has fibromyalgia and also fell a few weeks ago which worsened her back pain.” |
|
| 2 | Prior condition reported on day 0 visit: “My left shoulder pain never went away despite still doing physical therapy and living on NSAIDsb. Now it is constant and much worse today.” |
|
| ||
|
| 3 | On day 136, “States in past pain would travel to left shoulder causing numbness to left arm and lasting a few days but today denies any numbness.” |
|
| ||
|
| 4 | “...with 1 day of pain in the left arm and shoulder. Denies any injury. Did some lifting yesterday.” |
|
| 5 | “She has been working on the computer a lot. Overhead movement exacerbates the pain... No injury or trauma.” |
|
| 6 | “...who complains of left shoulder pain that started 3 weeks ago after vacuuming.” |
|
| 7 | “...likely subdeltoid bursitis and supraspinatus tendinopathy in the setting of DMc likely from acute movement with pain when getting IVd placed.” |
|
| 8 | “Patient reports left shoulder pain with movement; no trauma. Patient worked for years caring for young children and had to carry and lift them.” |
aB/L: bilateral.
bNSAIDs: Nonsteroidal anti-inflammatory drugs.
cDM: diabetes mellitus.
dIV: intravenous.
Number of cases identified by natural language processing (NLP) in the base study population (n=53,585).
| Natural language processing–identified cases | n | %a (n=53,585) | %b (n=46,086) | ||||
| Shoulder injury identified | 46,086 | 86 | —c | ||||
|
| |||||||
|
| Laterality identified | 44,488 | 83 | 96.5 | |||
|
| Laterality mismatch | 1220 | 2.3 | 2.6 | |||
|
| |||||||
|
| Cause identifiedd | 25,325 | 47.3 | 55.0 | |||
|
| Cause identifiede | 19,039 | 35.5 | 41.3 | |||
|
| |||||||
|
| Onset identified | 45,252 | 84.4 | 98.2 | |||
|
| Symptom duration >30 days postvaccination | 35,135 | 65.6 | 76.2 | |||
| SIRVAf cases | 467 | 0.9 | 1 | ||||
Percentage of cases among the number of cases with shoulder injury diagnosis code (n=53,585).
bPercentage of cases among the number of natural language processing–identified shoulder injury cases (n=46,086).
cNot applicable.
dIncludes unknown cause stated in the clinical notes.
eExcludes unknown cause stated in the clinical notes.
fSIRVA: shoulder injury related to vaccine administration.
Number of natural language processing–identified cases and chart-confirmed cases.
| NLPa-identified group | NLP-identified | Chart confirmed | Confirmation rate (%) | |||
| Definite | 291 | 278 | 95.5 | |||
|
| 124 | 84 | 67.7 | |||
|
| Cross-sentence causality | 64 | 46 | 71.9 | ||
|
| Vaccination cause identified ≤30 days after vaccination | 41 | 26 | 63.4 | ||
|
| Vaccine mismatch | 19 | 12 | 63.2 | ||
| Possible | 52 | 9 | 17.3 | |||
| Total | 467 | 371 | 79.4 | |||
aNLP: natural language processing.