| Literature DB >> 21031012 |
Richard A Wilson1, Wendy W Chapman, Shawn J Defries, Michael J Becich, Brian E Chapman.
Abstract
BACKGROUND: Clinical records are often unstructured, free-text documents that create information extraction challenges and costs. Healthcare delivery and research organizations, such as the National Mesothelioma Virtual Bank, require the aggregation of both structured and unstructured data types. Natural language processing offers techniques for automatically extracting information from unstructured, free-text documents.Entities:
Keywords: Information extraction; cancer history classifcation; natural language processing
Year: 2010 PMID: 21031012 PMCID: PMC2956176 DOI: 10.4103/2153-3539.71065
Source DB: PubMed Journal: J Pathol Inform
Figure 1Django web annotation tool. Screen shots of the web annotation tool: main portal, annotation screen, progress report and administrative interface
Domain-specific lexicon
| Type | Terms | |
|---|---|---|
| Cancer identification | Noncancers | “adenoma,” “hematoma,” “adenomas,” “cystadenoma,” “hamartoma,” “hematomas,” “glaucoma,” “hemangioma,” “lipoma,” “hemartoma,” “coma,” “diploma,” “aroma” |
| Cancer acronyms | “ALL,” “ALCL,” “AMKL,” “ANLL,” “CTCL,” “CLL,” “AML,” “CML,” “HCC,” “HCL,” “LMM,” “TCC,” “T-PLL,” “PLL,” “SCC,” “SMLC,” “SCLC” | |
| Unique cancers | “hodgkin,” “leukemia,” “neoplasm,” “tumor” | |
| Modifiers | Descriptors | “acute,” “acute lymphoblastic,” “acute myelogenous,” “adrenocortical,” “aids-related,” “anal,” “basal cell,” “bile duct,” “bladder,” “bone,” “brain,” “brain stem,” “breast,” “bronchial,” “central nervous system,” “cerebellar,” “cervical,” “chronic lymphocytic,” “chronic myelogenous,” “colon,” “colorectal,” “cutaneous t-cell,” “endocrine,” “endometrial,” “esophageal,” “eye,” “gallbladder,” “gastric,” “gastrointestinal,” “gastrointestinal carcinoid,” “germ cell,” “hairy cell,” “head,” “hepatocellular,” “hodgkin,” “hodgkin's,” “hypopharyngeal,” “hypothalamic,” “intraocular,” “kaposi,” “kidney,” “laryngeal,” “lip,” “liver,” “lung,” “lymphoblastic,” “lymphocytic,” “malignant,” “melanoma”, “metastatic,” “merkel cell,” “mouth,” “myelogenous,” “myeloid,” “nasal cavity,” “nasopharyngeal,” “neck,” “non-hodgkin,” “non-small cell,” “oral,” “ovarian,” “pancreas,” “pancreatic,” “paranasal,” “parathyroid,” “penile,” “pharyngeal,” “pituitary,” “plasma cell,” “pleuropulmonary,” “prostate,” “rectal,” “renal,” “renal cell,” “salivary gland,” “sinus,” “skin,” “small cell,” “small intestine” “soft tissue,” “spinal cord,” “squamous cell,” “stomach,” “stromal,” “t-cell,” “t-cell prolymphocytic,” “testicular,” “throat,” “thymic,” “thymus,” “thymoma,” “thyroid,” “transitional cell,” “unknown,” “unknown primary site,” “uterine,” “vaginal,” “vulvar,” “wilms” |
| Nouns with cancer serving as an adjective | “conference,” “marker,” “markers,” “registry” | |
| Kinship terms | First-degree relatives (FDR) | “mother,” “father,” “sister,” “sisters,” “brother,” “brothers,” “half brother,” “half sister,” “son,” “sons,” “daughter,” “daughters,” “half-brother,” “half-sister,” “step brother,” “step sister,” “step-brother,” “step-sister,” “mom,” “dad,” (step- and half-siblings intentionally included) |
| Non-FDRs | “uncle,” “aunt,” “cousin,” “grandfather,” “grandmother,” “grandpa,” “grandma” | |
| Nonrelatives | “husband,” “wife,” “spouse” | |
| Section headings | Family history sections | “family history,” “family history:”, “familyhistory,” “family history,” “family history\r,” “FH:” |
| Nonsubjective headings | “Physical Exam,” “Physical Exam:”, “Physicalexam,” “Physicalexam:”, “PHYSICAL EXAMINATION,” “PHYSICAL EXAMINATION:”, “PHYSICALEXAMINATION,” “PHYSICALEXAMINATION:”, “Assessment,” “ASSESSMENT,” “ASSESSMENT:”, “ASSESSMENT/PLAN,” “ASSESSMENT/PLAN,” “ASSESSMENT/PLAN:”, “ASSESSMENT AND PLAN,” “ASSESSMENT AND PLAN:”, “ASSESSMENT PLAN,” “ASSESSMENT-PLAN,” “ASSESSMENT-PLAN:”, “IMPRESSION,” “IMPRESSION:” | |
| Negation phrases | Negation | “negative,” “negative for,” “denies,” “denies any history of,” “denies any family history of,” “negative history of,” “no family history of,” “no,” “unremarkable,” “unremarkable for,” “no history of,” “no history of other,” “no hx of,” “no family history of,” “no family hx of,” “without a history of,” “without of hx of,” “without” |
Figure 2Frame build module: Dynamic-Window method
Figure 3Frame build (step 2): Variable-size bi-directional window search (modifiers). Frame building step 2: variable-size window search for modifying terms from the initial “hot-spot”
Figure 4Frame build (step 3): Dynamic-Window search (negation). Frame building step 3: dynamically expand the “hot-spot” and window search for negation
Figure 5Frame evaluation module and reference resolution
Annotator agreement. Reference standard agreement among our human annotators
| Annotators | Observed agreement | Specific agreement | Kappa | ||
|---|---|---|---|---|---|
| Ppos | Pneg | ||||
| Q1: Personal history of ancillary cancer? | #1 vs. #2 | 0.963 | 0.909 | 0.977 | 0.886 |
| #1 vs. #3 | 0.963 | 0.908 | 0.977 | 0.885 | |
| #2 vs. #3 | 0.96 | 0.902 | 0.975 | 0.877 | |
| Average | 0.962 | 0.906 | 0.976 | 0.883 | |
| Q2: Family history of cancer? | #1 vs. #2 | 0.983 | 0.969 | 0.989 | 0.957 |
| #1 vs. #3 | 0.993 | 0.988 | 0.995 | 0.983 | |
| #2 vs. #3 | 0.99 | 0.981 | 0.993 | 0.975 | |
| Average | 0.989 | 0.979 | 0.992 | 0.972 | |
Overall algorithm performance with both frame methods. Averaged algorithm performance on both questions versus the reference standard
| Dynamic-Window | ConText | |
|---|---|---|
| Accuracy | 0.962 | 0.962 |
| Precision | 0.893 | 0.877 |
| Recall | 0.944 | 0.961 |
| F-measure | 0.918 | 0.916 |
| Avg kappa (ref std) | 0.928 | |
Specific frame method performance. Frame building performance on each question versus the reference standard
| Q1: Personal history of ancillary cancer? | Q2: Family history of cancer? | |||
|---|---|---|---|---|
| Dynamic-window | ConText | Dynamic-window | ConText | |
| Accuracy | 0.937 | 0.967 | ||
| (0.927, 0.975) | (0.903, 0.959) | (0.940, 0.982) | (0.966, 0.995) | |
| Precision | 0.789 | 0.900 | ||
| (0.782, 0.943) | (0.680, 0.868) | (0.821, 0.947) | (0.900, 0.988) | |
| Recall | 0.900 | |||
| (0.799, 0.953) | (0.841, 0.974) | (0.934, 0.998) | (0.934, 0.998) | |
| F-measure | 0.855 | 0.942 | ||
| Avg kappa (ref std) | 0.883 | 0.972 | ||
Error analysis. Errors for each question by algorithm and error type
| Dynamic-window | ConText | ||||
|---|---|---|---|---|---|
| Q1: Personal (out of 300) | False positves | 7 | (0.023) | 15 | (0.050) |
| False negatives | 6 | (0.020) | 4 | (0.013) | |
| Q2: Family (out of 300) | False positves | 9 | (0.030) | 3 | (0.010) |
| False negatives | 1 | (0.003) | 1 | (0.003) | |
| Total (out of 600) | 23 | (0.038) | 23 | (0.038) | |
Unique errors by frame method. Number of unique errors for each frame building method and errors common to both
| Unique to | Dynamic-window | Con Text | Error in both | |
|---|---|---|---|---|
| Q1: Personal (out of 300) | False positves | 1 | 9 | 6 |
| False negatives | 2 | 0 | 4 | |
| Q2: Family (out of 300) | False positves | 8 | 2 | 1 |
| False negatives | 0 | 0 | 1 | |
| Total (out of 600) | 11 | 11 | 12 |
Performance with improved lexicon. Posttest performance for each frame building method with lexicon improvement
| Dynamic-window | ConText | |||
|---|---|---|---|---|
| Original test | With improved lexicon | Original test | With improved lexicon | |
| Accuracy | 0.962 | 0.972 | 0.962 | 0.980 |
| Precision | 0.893 | 0.898 | 0.877 | 0.919 |
| Recall | 0.944 | 0.982 | 0.961 | 1.000 |
| F-measure | 0.918 | 0.938 | 0.916 | 0.957 |
| Avg kappa (ref std) | 0.928 | |||