| Literature DB >> 29472179 |
Henry W Chen1, Jingcheng Du2, Hsing-Yi Song2, Xiangyu Liu2, Guoqian Jiang3, Cui Tao2.
Abstract
BACKGROUND: Today, there is an increasing need to centralize and standardize electronic health data within clinical research as the volume of data continues to balloon. Domain-specific common data elements (CDEs) are emerging as a standard approach to clinical research data capturing and reporting. Recent efforts to standardize clinical study CDEs have been of great benefit in facilitating data integration and data sharing. The importance of the temporal dimension of clinical research studies has been well recognized; however, very few studies have focused on the formal representation of temporal constraints and temporal relationships within clinical research data in the biomedical research community. In particular, temporal information can be extremely powerful to enable high-quality cancer research.Entities:
Keywords: biomedical ontology; common data elements; database; database management systems; time
Year: 2018 PMID: 29472179 PMCID: PMC5843793 DOI: 10.2196/medinform.8175
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Resource Description Framework (RDF) triple example.
Figure 2Graphical representation of Time Event Oncology (TEO).
Keywords represented with regular expressions delimited by commas and their corresponding Time Event Ontology (TEO) class.
| Keyword regular expressions | TEOa class (building blocks) |
| Jan(uary)?,Feb(ruary)?,Mar(ch)?,Apr(il)?,May,June,July,Aug(ust)?,Sept(ember)?,Oct(ober)?,N | TimeInstant |
| TimeInterval | |
| Date | |
| seconds,minutes?,hours?,days?,weeks?,months?,years? | Granularity |
| Duration | |
| before,while,prior to, ago,previous(ly)?,post(- | TemporalRelation |
| TimeOffset | |
| recurrent,frequent,intermittent,periodic,repeat(ed)? | TimePhase |
| Interval | TimeInterval |
aTEO: Time Event Ontology.
Common data element (CDE) annotated with a Time Event Ontology (TEO) pattern. Bold font indicates the class, and italic font indicates the property.
| Representation type | Content | |
| CDEa | ||
| TEOb pattern | ||
| Extended TEO pattern | ||
| RDFc triple representation | <event1> | rdf:type |
| <event2> | ||
aCDE: common data element.
bTEO: Time Event Ontology.
cRDF: Resource Description Framework.
Sensitivity and specificity data of common data element (CDE) parser.
| Annotator | Test set | True positive | True negative | False positive | False negative | Sensitivity | Specificity |
| 1 | 1 | 398 | 408 | 27 | 17 | 0.959036 | 0.937931 |
| 2 | 404 | 415 | 21 | 10 | 0.975845 | 0.951835 | |
| 2 | 1 | 394 | 418 | 31 | 7 | 0.982544 | 0.930958 |
| 2 | 394 | 412 | 33 | 13 | 0.968059 | 0.925843 | |
| 3 | 1 | 391 | 414 | 34 | 11 | 0.972637 | 0.924107 |
| 2 | 397 | 418 | 28 | 7 | 0.982673 | 0.93722 |
Interannotator agreement data (N=425).
| Test set number | No difference, n (%) | One difference, n (%) | All different, n (%) |
| 1 | 279 (65.6) | 133 (31.2) | 13 (3.0) |
| 2 | 258 (60.7) | 146 (34.3) | 21 (4.9) |
Test set common data element (CDE) categorization (N=300).
| Category | n (%) | |
| Existing pattern | 263 (87.7) | |
| New pattern | 9 (2.9) | |
| Not time-related | 20 (6.8) | |
| TEOb cannot represent | 8 (2.6) | |
aCDE: common data element.
bTEO: Time Event Ontology.
Pilot set annotation results.
| Annotator | Test set number | Number of TPa,b | Number of FNc,d | Sensitivity | Margin of error |
| 1 | 1 | 85 | 2 | 0.977 | 0.032 |
| 2 | 82 | 5 | 0.943 | 0.050 | |
| 3 | 83 | 9 | 0.902 | 0.064 | |
| 2 | 1 | 86 | 5 | 0.945 | 0.048 |
| 2 | 82 | 7 | 0.921 | 0.058 | |
| 3 | 77 | 11 | 0.875 | 0.074 | |
| 3 | 1 | 89 | 3 | 0.967 | 0.037 |
| 2 | 84 | 8 | 0.913 | 0.060 | |
| 3 | 83 | 9 | 0.900 | 0.064 |
aTP: true positive.
bDenotes the number of true positive instances.
cFN: false negative.
dDenotes the number of false negative instances.
Statistically significant test set results.
| Annotator | Test set number | Coverage rate |
| 1 | 1 | 0.950 |
| 2 | 0.940 | |
| 2 | 1 | 0.949 |
| 2 | 0.913 | |
| 3 | 1 | 0.964 |
| 2 | 0.935 |
Most frequently used Time Event Ontology (TEO) patterns used in the observing set of N=600, averaged over three annotators.
| Rank | TEOa pattern | n (%) |
| 1 | [Event (hasValidTime=[TimeInstant (hasGranularity, hasOrigTime*)])] | 186 (31.0) |
| 2 | [Event* (hasValidTime=[TimeInterval (hasEndTime=[TimeInstant (hasOrigTime)], | 117 (19.5) |
| 3 | [Event (hasValidTime=[TimeInstant (hasNormalizedTime*)])] | 90 (15.0) |
| 4 | [Event*] [TemporalRelation] [Event] | 42 (7.0) |
| 5 | [Event (hasModality*)] [TemporalRelation] [Event] | 35 (5.9) |
| 6 | [Event (hasValidTime=[TimeInterval (hasEndTime=[Time | 32 (5.4) |
| 7 | [Event (hasValidTime=[TimeInterval (hasStartTime=[Time | 26 (4.4) |
| 8 | [Event* (hasModality*,hasValidTime=[TimeInterval(hasEndTime=[TimeInstant(hasOrigTime)], | 25 (4.2) |
| 9 | [Event (hasValidTime=[TimeInterval(hasDuration=[Duration(hasDurationPattern*)])])] | 17 (2.8) |
| 10 | [Event(hasValidTime=[TimeInterval(hasStartTime=[TimeInstant(hasOrigTime*)],hasEndTi | 11 (1.8) |
aTEO: Time Event Ontology.
Specific examples in Resource Description Framework (RDF) format of most frequently used Time Event Ontology (TEO) patterns. Bold font indicates the class, and italic font indicates the property.
| Rank | PublicID | CDEa LongName | RDFb representation | |
| 1 | 4614514 | Stage IV disease progression platinum-based | <event1> | rdf:type |
| rdfs:label “Stage IV Disease | ||||
| Progression Platinum-Based | ||||
| Chemotherapy”; | ||||
| <tInstant1> | rdf:type | |||
| rdf:label “Date” | ||||
| 2 | 3191975 | Patient reported outcome problem dysuria past week | <event1> | rdf:type |
| rdfs:label *; | ||||
| <tInterval1> | rdf:type | |||
| <tInstant1> | rdf:type | |||
| <durat1> | rdf:type | |||
| 3 | 3100972 | Customer request laboratory final approval date | <event1> | rdf:type |
| rdfs:label “Customer Request | ||||
| Laboratory Final Approval”; | ||||
| <tInstant1> | rdf:type | |||
| 4 | 2683245 | Breast conservation treatment post neoadjuvant | <event1> | rdfs:label *; |
| rdf:type | ||||
| <event2> | rdf:type | |||
| rdfs:label “Neoadjuvant Therapy”; | ||||
| 5 | 3387810 | Maintenance therapy prior recurrent disease | <event1> | rdf:type |
| rdfs:label “Maintenance Therapy | ||||
| Discontinue”; | ||||
| <event2> | rdf:type | |||
| rdfs:label “Recurrent Disease”; | ||||
| 6 | 2790 | Partial response observed end date | <event1> | rdf:type |
| rdfs:label “Partial Response | ||||
| Observed”; | ||||
| <tInterval1> | rdf:type | |||
| <tInstant1> | rdf:type | |||
| 7 | 1157 | Prior RT begin date | <event1> | rdf:type |
| rdfs:label “RT”; | ||||
| <tInterval1> | rdf:type | |||
| rdf:label “Prior”; | ||||
| <tInstant1> | rdf:type | |||
| 8 | 4609733 | FACT-Cog Questionnaire version 3 CogPM1 how | <event1> | rdf:type |
| rdfs:label | ||||
| <tInterval1> | rdf:type | |||
| <tInstant1> | df:type | |||
| r | ||||
| <durat1> | rdf:type | |||
| 9 | 3190457 | Person clinical study assignment follow-up month | <event1> | rdf:type |
| rdfs:label Personal Clinical Study | ||||
| Assignment Follow-up; | ||||
| <tInterval1> | rdf:type | |||
| <durat1> | rdf:type | |||
| 10 | 3177036 | Adverse event outcome assessment observation | <event1> | rdf:type |
| rdfs:label “Adverse Event Outcome | ||||
| Assessment Observation Performed | ||||
| Study Activity” | ||||
| <tInterval1> | rdf:type | |||
| <tInstant1> | rdf:type | |||
| <tInstant2> | rdf:type | |||
aCDE: common data element.
bRDF: Resource Description Framework.
Example standard representation of common data elements (CDEs) versus Time Event Ontology (TEO) patterns.
| LongName | PreferredDefinition | TEOa pattern |
| Off treatment date | OTX_DATE | [Event (hasValidTime=[TimeInstant |
| Pills quantity date | PILL_QUANT_DT | [Event (hasValidTime=[TimeInstant |
| Therapy prior carmustine administered end date | BNCU_ENDDT | [Event (hasValidTime=[TimeInterval |
| Laboratory data inclusion stop date | LAB_INCL_STOP_DT | [Event (hasValidTime=[TimeInterval |
| Breast conservation treatment post neoadjuvant therapy failed performed reason | BCT_P_NEO_FA_PER_RSN | [Event*] [TemporalRelation] [Event] |
| Lymph node post neoadjuvant therapy response code | LN_NEOADJ_RESP_CD | [Event*] [TemporalRelation] [Event] |
aTEO: Time Event Ontology.