Gondy Leroy, Yang Gu, Sydney Pettygrove, Maureen K Galindo, Ananyaa Arora, Margaret Kurzius-Spencer.
Abstract
BACKGROUND: Electronic health records (EHRs) bring many opportunities for information utilization. One such use is the surveillance conducted by the Centers for Disease Control and Prevention to track cases of autism spectrum disorder (ASD). This process currently comprises manual collection and review of the EHRs of 4- and 8-year-old children in 11 US states for the presence of ASD criteria. The work is time-consuming and expensive.
Keywords: Autism Spectrum Disorder; DSM; complex entity extraction; decision tree; electronic health records; machine learning; natural language processing; parser
Year: 2018 PMID: 30404767 PMCID: PMC6249505 DOI: 10.2196/10497
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Decision tree evaluation for sentence classification.
| Rule | Count of positive cases | % positive cases (of all sentences) | Precision | Recall | F-score | Specificity |
| A1a | 120 | 0.021 | 0.70 | 0.52 | 0.59 | 0.99 |
| A1b | 91 | 0.016 | 0.50 | 0.42 | 0.45 | 0.99 |
| A1c | 35 | 0.006 | 0.16 | 0.17 | 0.17 | 0.99 |
| A1d | 160 | 0.029 | 0.54 | 0.14 | 0.22 | 1.00 |
| A2a | 388 | 0.069 | 0.71 | 0.39 | 0.50 | 1.00 |
| A2b | 321 | 0.057 | 0.69 | 0.37 | 0.48 | 0.99 |
| A2c | 120 | 0.021 | 0.54 | 0.47 | 0.51 | 0.99 |
| A2d | 62 | 0.011 | 0.34 | 0.19 | 0.25 | 1.00 |
| A3a | 64 | 0.011 | 0.20 | 0.09 | 0.13 | 1.00 |
| A3b | 123 | 0.022 | 0.81 | 0.47 | 0.59 | 1.00 |
| A3c | 66 | 0.012 | 0.70 | 0.32 | 0.44 | 1.00 |
| A3d | 27 | 0.005 | 0.27 | 0.30 | 0.28 | 1.00 |
| Microaverage | 1577 | 0.024 | 0.60 | 0.35 | 0.45 | 0.99 |
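The F-score column can be read as the harmonic mean of precision and recall. The following is a minimal sketch, assuming the standard balanced F-score definition; the function name `f_score` is illustrative and does not come from the source. Because the table reports precision and recall rounded to two decimals, recomputed values may differ from the printed ones in the last digit.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# Two rows copied from the decision tree table: rule -> (precision, recall).
rows = {"A2a": (0.71, 0.39), "A3b": (0.81, 0.47)}

for rule, (p, r) in rows.items():
    print(rule, round(f_score(p, r), 2))
```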
Lexicon overview.
| Pattern use of lexicons | Lexicons | Number of terms | Example lexicon | Example terms |
| All rules | 11 | 345 | Body_parts | arm, eye, hair, teeth, toe, tongue, finger, fingers, nose |
| Group A1 | 7 | 105 | A1_interact | interact, interactions, communicate, relationship |
| Group A2 | 3 | 72 | A2_positive | severe, significant, pervasive, marked |
| Group A3 | 2 | 72 | A3_object | door, toys, vacuum, blocks, book, television, lights |
| A1a | 4 | 42 | A1a_nonVerbalBehavior | eye contact, eye-to-eye gaze, gestures, nonverbal cues |
| A1b | 2 | 11 | A1b_consistent | good, consistent, appropriately, satisfactory |
| A1c | 5 | 61 | A1c_affect | excitement, feelings, satisfaction, concerns |
| A1d | 12 | 159 | A1d_engage | recognize, recognizes, reacts, respond, regard, attend |
| A2a | 4 | 117 | A2a_gained | gained, used, had, obtained, said, spoke |
| A2b | 8 | 240 | A2b_recepLang | direction, instructions, questions, conversations |
| A2c | 7 | 145 | A2c_idiosyncratic | breathy, echolalia, jargon, neologism, reduced |
| A2d | 7 | 83 | A2d_actions | actions, routines, play, signs, gestures, movements |
| A3a | 7 | 106 | A3a_obsess | obsessed, obsessive, perseverates, preoccupation |
| A3b | 7 | 119 | A3b_nonFunctionalPlay | stack, stacks, lines, lined, nonfunctional, arrange |
| A3c | 3 | 67 | A3c_abnormal | grind, grinds, rocks, twirls, spin, tap, clap, flap |
| A3d | 3 | 43 | A3d_sensitive | defensiveness, sensitivity, hypersensitivities |
| Total | 92 | 1787 | N/A (a) | N/A |
(a) N/A: not applicable.
Figure 1. Visualization of 2 (of 7 existing) patterns for Diagnostic and Statistical Manual of Mental Disorders criterion A2c.
Gold standard overview.
| DSM rule | DSM theme | Gold standard: total in records | Gold standard: average per record |
| A1a | Nonverbal behaviors | 126 | 2.52 |
| A1b | Peer relationships | 91 | 1.82 |
| A1c | Seeking to share | 37 | 0.74 |
| A1d | Emotional reciprocity | 165 | 3.3 |
| A2a | Spoken language | 406 | 8.12 |
| A2b | Initiate or sustain conversation | 333 | 6.66 |
| A2c | Stereotyped or idiosyncratic language | 127 | 2.54 |
| A2d | Social imitative play | 66 | 1.32 |
| A3a | Restricted patterns of interest | 62 | 1.24 |
| A3b | Adherence to routines | 135 | 2.7 |
| A3c | Stereotyped motor mannerisms | 68 | 1.36 |
| A3d | Preoccupation with parts of objects | 28 | 0.56 |
| Total | N/A (a) | 1644 | 32.88 |
(a) N/A: not applicable.
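The "average per record" column is consistent with dividing each total by 50 records. That record count is an inference from the table's own ratios (for example, 126 / 2.52 = 50) and is not stated in this excerpt, so the sketch below treats it as an assumption.

```python
# Totals per criterion, copied from the gold standard table.
totals = {
    "A1a": 126, "A1b": 91, "A1c": 37, "A1d": 165,
    "A2a": 406, "A2b": 333, "A2c": 127, "A2d": 66,
    "A3a": 62, "A3b": 135, "A3c": 68, "A3d": 28,
}

# Assumption: 50 records, inferred from total / average in the table.
N_RECORDS = 50

# Average number of annotations per record for each criterion.
averages = {rule: count / N_RECORDS for rule, count in totals.items()}

# Overall average across all criteria.
overall_average = sum(totals.values()) / N_RECORDS

for rule, avg in averages.items():
    print(f"{rule}: {avg:.2f}")
print(f"Total: {sum(totals.values())}, average per record: {overall_average:.2f}")
```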
Annotation-level results.
| Annotations (a) | Total in gold standard (number of annotations) (b) | Precision | Recall | F-measure |
| A1a | 126 | 0.96 | 0.57 | 0.72 |
| A1b | 91 | 0.63 | 0.27 | 0.38 |
| A1c | 37 | 0.78 | 0.19 | 0.30 |
| A1d | 165 | 0.62 | 0.27 | 0.37 |
| A2a | 406 | 0.69 | 0.44 | 0.53 |
| A2b | 333 | 0.79 | 0.44 | 0.57 |
| A2c | 127 | 0.68 | 0.36 | 0.47 |
| A2d | 66 | 0.79 | 0.56 | 0.65 |
| A3a | 62 | 0.83 | 0.40 | 0.54 |
| A3b | 135 | 0.75 | 0.51 | 0.61 |
| A3c | 68 | 0.82 | 0.41 | 0.55 |
| A3d | 28 | 0.53 | 0.29 | 0.37 |
| Microaverage | N/A (c) | 0.74 | 0.42 | 0.53 |
(a) Based on 6634 sentences.
(b) Total annotations = 1644.
(c) N/A: not applicable.
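A microaverage pools counts across all rules, so frequent rules such as A2a weigh more than rare ones such as A3d. The sketch below illustrates this for recall only. The excerpt does not report true-positive counts, so they are approximated here as recall times the gold-standard count; that approximation, and the variable names, are assumptions for illustration rather than the authors' procedure.

```python
# Gold-standard annotation counts and reported recall per rule,
# copied from the annotation-level results table.
gold = {
    "A1a": 126, "A1b": 91, "A1c": 37, "A1d": 165,
    "A2a": 406, "A2b": 333, "A2c": 127, "A2d": 66,
    "A3a": 62, "A3b": 135, "A3c": 68, "A3d": 28,
}
recall = {
    "A1a": 0.57, "A1b": 0.27, "A1c": 0.19, "A1d": 0.27,
    "A2a": 0.44, "A2b": 0.44, "A2c": 0.36, "A2d": 0.56,
    "A3a": 0.40, "A3b": 0.51, "A3c": 0.41, "A3d": 0.29,
}

# Assumption: estimate true positives from the rounded recall values.
estimated_tp = {rule: recall[rule] * gold[rule] for rule in gold}

# Microaverage: pool the counts first, then divide.
micro_recall = sum(estimated_tp.values()) / sum(gold.values())

# Macroaverage, for contrast: plain mean of the per-rule recalls.
macro_recall = sum(recall.values()) / len(recall)

print(f"micro-averaged recall: {micro_recall:.2f}")
print(f"macro-averaged recall: {macro_recall:.2f}")
```

Under these assumptions the pooled estimate rounds to the 0.42 shown in the Microaverage row, while the unweighted mean is lower because the low-recall rules are also the less frequent ones.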
Sentence-level results.
| Sentences (a) | Total in gold standard (number of sentences) (b) | Precision | Recall | F-measure | Specificity |
| A1a | 120 | 0.97 | 0.59 | 0.74 | 1.00 |
| A1b | 90 | 0.68 | 0.30 | 0.42 | 1.00 |
| A1c | 35 | 0.78 | 0.20 | 0.32 | 1.00 |
| A1d | 158 | 0.63 | 0.28 | 0.39 | 1.00 |
| A2a | 391 | 0.71 | 0.45 | 0.55 | 0.99 |
| A2b | 329 | 0.83 | 0.47 | 0.60 | 1.00 |
| A2c | 121 | 0.67 | 0.37 | 0.48 | 1.00 |
| A2d | 65 | 0.83 | 0.58 | 0.68 | 1.00 |
| A3a | 61 | 0.73 | 0.36 | 0.48 | 1.00 |
| A3b | 123 | 0.74 | 0.52 | 0.61 | 1.00 |
| A3c | 64 | 0.82 | 0.42 | 0.56 | 1.00 |
| A3d | 28 | 0.53 | 0.29 | 0.37 | 1.00 |
| Microaverage | 1585 | 0.76 | 0.43 | 0.55 | 1.00 |
| Any Rule | 1357 | 0.82 | 0.46 | 0.59 | 0.97 |
(a) Based on 6634 sentences.
(b) Sentences with annotations = 1357.
Figure 2. Descriptive information on 4480 records available electronically from the Arizona Developmental Disabilities Surveillance Program.
Figure 3. Electronic health record word count for autism spectrum disorder (ASD) and non-ASD cases.
Figure 4. Average A1 criteria per record. ASD: autism spectrum disorder; EHR: electronic health record.
Figure 5. Average A2 criteria per record. ASD: autism spectrum disorder; EHR: electronic health record.
Figure 6. Average A3 criteria per record. ASD: autism spectrum disorder; EHR: electronic health record.