Tong Ruan, Yueqi Huang, Xuli Liu, Yuhang Xia, Ju Gao.
Abstract
BACKGROUND: Doctors need to analyze large amounts of electronic medical record (EMR) data to conduct clinical research, but the analysis requires information technology (IT) skills, which are difficult for most doctors in China to acquire.
Keywords: Context-free grammar; Electronic medical record; Graph database; Statistical question answering
Year: 2019 PMID: 30935389 PMCID: PMC6444506 DOI: 10.1186/s12911-019-0798-8
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1 The workflow of our tool
Fig. 2 Patient graph connected with terminology graph. Patient graph contains schema graph and data graph. Patient-schema graph consists of concepts and relationships between concepts, and patient-data graph is the instantiation of the patient-schema graph, which consists of specific patients, diseases, drugs, etc.
Fig. 3 An example of the tree-like knowledge representation of questions
Types of concepts used in the concept linking step
| Concept type | Example | Concept type | Example |
|---|---|---|---|
| Relation | 患有 (suffer from) | RangeSeparator | – |
| Instance | 心衰 (heart failure) | TimeOperator | 先 (before) |
| Class | 疾病 (disease) | LogicalOperator | 和 (and) |
| Property | 年龄 (age) | StatisticOperator | 比例 (ratio) |
| Number | 30.96 | ArithmeticOperator | 等于 (equal) |
| EnumValue | 女 (female) | Unit | 天 (day) |
| TimeValue | [20170909] | Not | 没有 (no) |
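As an illustration of how question fragments might be tagged with the concept types above, here is a minimal sketch. It is not the authors' linker: it assumes a plain dictionary lookup with greedy longest-match scanning, the `LEXICON` holds only the example terms from the table, and the name `link_concepts` is hypothetical.

```python
# Hypothetical lexicon built only from the example column of the table above.
LEXICON = {
    "患有": "Relation", "心衰": "Instance", "疾病": "Class", "年龄": "Property",
    "女": "EnumValue", "先": "TimeOperator", "和": "LogicalOperator",
    "比例": "StatisticOperator", "等于": "ArithmeticOperator", "天": "Unit",
    "没有": "Not",
}


def link_concepts(question):
    """Greedy longest-match tagging (an assumed strategy, not the paper's)."""
    longest = max(len(term) for term in LEXICON)
    linked, i = [], 0
    while i < len(question):
        # Try the longest candidate substring first, then shorter ones.
        for size in range(min(longest, len(question) - i), 0, -1):
            piece = question[i:i + size]
            if piece in LEXICON:
                linked.append((piece, LEXICON[piece]))
                i += size
                break
        else:
            i += 1  # character not in the lexicon: skip it
    return linked


# "no" + "suffer from" + "heart failure"
print(link_concepts("没有患有心衰"))
```

A real linker would also need to recognize open-ended types such as Number and TimeValue, which a fixed lexicon cannot enumerate.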
Grammar definition with production rules
| No. | Production rule |
|---|---|
| 1 | S -> QueryItem \| QueryItem S \| Condition S \| Condition ‘Class’ S |
| 2 | Condition -> RelationCondition \| PropertyCondition |
| 3 | Condition -> Condition ‘LogicalOperator’ Condition |
| 4 | Condition -> RelationCondition ‘TimeOperator’ Condition |
| 5 | RelationCondition -> ‘Relation’ ‘Instance’ \| ‘Not’ ‘Relation’ ‘Instance’ \| TimeBlock ‘Relation’ ‘Instance’ \| TimeBlock ‘Not’ ‘Relation’ ‘Instance’ |
| 6 | TimeBlock -> ‘TimeValue’ \| ‘TimeValue’ ‘LogicalOperator’ \| ‘TimeValue’ ‘RangeSeparator’ ‘TimeValue’ |
| 7 | PropertyCondition -> ‘Property’ PropertyValue \| ‘Instance’ ‘Property’ PropertyValue |
| 8 | PropertyValue -> ‘EnumValue’ \| NumericValue \| NumericRange \| TimeBlock |
| 9 | NumericValue -> ‘Number’ \| ‘Number’ ‘Unit’ |
| 10 | NumericRange -> NumericValue ‘RangeSeparator’ NumericValue \| ‘ArithmeticOperator’ NumericValue |
| 11 | QueryItem -> ‘Property’ ‘StatisticOperator’ \| ‘StatisticOperator’ ‘Property’ \| ‘Relation’ ‘StatisticOperator’ ‘Class’ \| ‘Instance’ ‘Property’ ‘StatisticOperator’ |
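To make the production rules concrete, the sketch below transcribes them into a Python dictionary and checks whether a sequence of concept types can be derived from S. This is only a recognizer written for illustration, not the parser used by the tool: the memoized span-splitting approach, the names `GRAMMAR` and `accepts`, and the sample type sequences are all assumptions. Quoted symbols in the table are treated as terminals.

```python
from functools import lru_cache

# Rules 1-11 from the table; names that are not keys here are terminals.
GRAMMAR = {
    "S": [["QueryItem"], ["QueryItem", "S"], ["Condition", "S"],
          ["Condition", "Class", "S"]],
    "Condition": [["RelationCondition"], ["PropertyCondition"],
                  ["Condition", "LogicalOperator", "Condition"],
                  ["RelationCondition", "TimeOperator", "Condition"]],
    "RelationCondition": [["Relation", "Instance"],
                          ["Not", "Relation", "Instance"],
                          ["TimeBlock", "Relation", "Instance"],
                          ["TimeBlock", "Not", "Relation", "Instance"]],
    "TimeBlock": [["TimeValue"], ["TimeValue", "LogicalOperator"],
                  ["TimeValue", "RangeSeparator", "TimeValue"]],
    "PropertyCondition": [["Property", "PropertyValue"],
                          ["Instance", "Property", "PropertyValue"]],
    "PropertyValue": [["EnumValue"], ["NumericValue"], ["NumericRange"],
                      ["TimeBlock"]],
    "NumericValue": [["Number"], ["Number", "Unit"]],
    "NumericRange": [["NumericValue", "RangeSeparator", "NumericValue"],
                     ["ArithmeticOperator", "NumericValue"]],
    "QueryItem": [["Property", "StatisticOperator"],
                  ["StatisticOperator", "Property"],
                  ["Relation", "StatisticOperator", "Class"],
                  ["Instance", "Property", "StatisticOperator"]],
}


def accepts(tokens, start="S"):
    """Return True if the concept-type sequence is derivable from `start`."""
    tokens = tuple(tokens)

    @lru_cache(maxsize=None)
    def derives(symbol, i, j):
        # Does `symbol` derive exactly tokens[i:j]?
        if symbol not in GRAMMAR:  # terminal: must match a single token
            return j == i + 1 and tokens[i] == symbol
        return any(matches(tuple(rhs), i, j) for rhs in GRAMMAR[symbol])

    @lru_cache(maxsize=None)
    def matches(rhs, i, j):
        # Does the symbol sequence `rhs` derive exactly tokens[i:j]?
        if len(rhs) == 1:
            return derives(rhs[0], i, j)
        # No rule derives the empty string, so every symbol needs >= 1 token.
        return any(derives(rhs[0], i, k) and matches(rhs[1:], k, j)
                   for k in range(i + 1, j - len(rhs) + 2))

    return len(tokens) > 0 and derives(start, 0, len(tokens))


# Hypothetical sequence: a relation condition, a class, then a query item.
print(accepts(["Relation", "Instance", "Class", "Property", "StatisticOperator"]))
```

Because every right-hand side consumes at least one token, the left-recursive rule 3 always recurses on a strictly shorter span, which is why this simple memoized search terminates.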
Fig. 4 Cypher query example
Fig. 5 User interface of our tool. The left panel shows a result of statistics type "list", and the right panel a result of statistics type "distribution"
Data size in performance evaluation
| Node | Quantity | Edge | Quantity |
|---|---|---|---|
| Hospitalization | 13,781 | InHospital | 13,781 |
| Test | 383 | DidTest | 112,838 |
| Drug | 1,156 | HadTaken | 324,818 |
| Disease | 324 | Diagnose | 16,241 |
| Procedure | 49 | DidOperate | 18,847 |
| Patient | 6,035 | | |
Coverage and precision results
| Number of nodes | Number of questions | Coverage (%) | Precision (%) |
|---|---|---|---|
| 1 | 20 | 95.00 | 100.00 |
| 2 | 112 | 75.00 | 98.81 |
| 3 | 62 | 74.19 | 95.65 |
| > 3 | 17 | 94.11 | 81.25 |
| total | 211 | 78.20 | 96.36 |
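The totals in this table can be cross-checked against the per-row figures. The sketch below does so under two assumptions that are not stated in this excerpt: coverage is the share of questions the tool could answer, and precision is measured only over those covered questions. The covered counts are back-computed by rounding, not reported values.

```python
# (node bucket, questions, coverage %, precision %) copied from the table.
rows = [
    ("1", 20, 95.00, 100.00),
    ("2", 112, 75.00, 98.81),
    ("3", 62, 74.19, 95.65),
    (">3", 17, 94.11, 81.25),
]

total_questions = sum(n for _, n, _, _ in rows)

# Assumption: covered questions per row = questions * coverage, rounded.
covered = [round(n * cov / 100) for _, n, cov, _ in rows]

# Overall coverage: covered questions over all questions.
total_coverage = 100 * sum(covered) / total_questions

# Overall precision: per-row precision weighted by covered questions.
total_precision = (
    sum(c * prec for c, (_, _, _, prec) in zip(covered, rows)) / sum(covered)
)

print(total_questions, covered)
print(round(total_coverage, 2), round(total_precision, 2))
```

Under these assumptions the recomputed totals agree with the "total" row (211 questions, 78.20% coverage, 96.36% precision).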
The results of tool performance evaluation
| Response time | Number of questions | Typical types | Example |
|---|---|---|---|
| t < 1 s | 39 | Fewer than three nodes and no logical operator OR | 2013年10月入院的患者 (Patients admitted to hospital in October 2013) |
| 1 s ≤ t ≤ 2 s | 5 | More than three nodes | 患有冠心病没吃中药的患者有哪些 (Which patients have coronary heart disease and took no Chinese medicine?) |
| 2 s < t ≤ 6 s | 1 | More than three nodes and several logical operators, especially OR and TimeOperator | 2013年之后吃了至灵胶囊或用了白玉膏或用了艾迪注射液的男性肺癌患者的年龄分布 (The age distribution of male patients with lung cancer who took the ZhiLing capsule, used BaiYu cream, or received Addie injection after 2013) |
| t > 6 s | 5 | Filtering and time-matching operations | 住院次数大于4次的肺癌患者做了哪些手术 (What procedures did lung cancer patients with more than 4 hospitalizations undergo?) |