| Literature DB >> 32620117 |
Hye Hyeon Kim1, Yu Rang Park2, Suehyun Lee3, Ju Han Kim4.
Abstract
BACKGROUND: Semantic interoperability is essential for improving data quality and sharing. The ISO/IEC 11179 Metadata Registry (MDR) standard has been highlighted as a solution for standardizing and registering clinical data elements (DEs). However, the standard model has both structural and semantic limitations, and the number of DEs continues to increase due to poor term reusability. Semantic types and constraints are lacking for comprehensively describing and evaluating DEs on real-world clinical documents.Entities:
Keywords: Common data elements; Metadata registry; Semantic interoperability; Semantic relationship
Mesh:
Year: 2020 PMID: 32620117 PMCID: PMC7333279 DOI: 10.1186/s12911-020-01168-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Overview of the formal relationship between aCDE and cCDEs with extended semantic types and CDE-type specific constraints
Fig. 2An example hybrid aCDE from a hemodialysis report. a The hemodialysis table of the DialysisNet Project has a tabular data-entry format, where Time (DE:47616) allows two different data types: time and an enumerated string. b The hybrid aCDE (DE:47616) contains two aCDEs (DE:43239 and DE:47614) in a hybrid relationship (http://chmr2.snubi.org:8083/chmr/data_element_view.jsp?id=28476)
Fig. 3Example of the composition of a repeated cCDE from a hemodialysis report form. A repeated cCDE, ‘DE:47575 Hemodialysis_Repeated_Componsite _DE,’ composed of eight aCDEs from a tabular data-entry format (Fig. 2a) for the DialysisNet hemodialysis project (http://chmr2.snubi.org:8083/chmr/data_element_view.jsp?id=28449)
Fig. 4Creation of a dictionary cCDE for a CRF. a The ‘Electrolyte Laboratory Tests’ table on a clinical document is provided as an example tabular data-entry document to capture laboratory test results for sodium (Na+) and potassium (K+) along with two clinical evaluation attributes. b We constructed the ‘Electrolyte Laboratory Tests Dictionary’ table by extracting the relevant attributes from the CDEs defined in the ‘Recommended Labs for Stroke’ from the NINDS CDE project. c The dictionary cCDE (DE:47571) consists of six aCDEs that include a variable aCDE (DE:43938) that relates the dictionary cCDE to the dictionary table in Fig. 4b. Two rules for clinical evaluation are presented (http://chmr2.snubi.org:8083/chmr/data_element_view.jsp?id=28445)
Encoding operated, required, dependent, and ordered constraints for CDEs with prefix notation. Examples of (A) an operated constraint for calculating BMI, (B) a required constraint for demography information, (C) a dependent constraint for smoking history, and (D) an ordered constraint
| Constraints | Example of Clinical Documents | Set of CDE IDs and Names |
|---|---|---|
| Prefix Notation for Formulating Constraints | ||
| A) Operated | Weight (kg): Height (cm): BMI (kg/m2): | |
(IF (= CDE31.unit_of_measure ‘m’) (/ CDE30 CDE31 CDE31) (/ CDE30 CDE31 CDE31 100,100)); (/ CDE30 CDE31 CDE31 100,100) | ||
| B) Required | 1) *Patient Age: 2) *Gender ☐ Female ☐ Male ☐ Unknown ☐ Unspecified ☐ Not reported 3) Ethnicity: ☐ Hispanic or Latino ☐ Unknown ☐ Not Hispanic or Latino ☐ Not reported | |
| (Required CDE40 CDE41) | ||
| C) Dependent | Smoking History 1) *Current tobacco use? ☐ Yes ☐ No ☐ Unknown 2) *Past tobacco use? ☐ Yes ☐ No ☐ Unknown 3) Age when tobacco use started (years)? (Skip if Q1 and Q2 are both No) | |
| (IF (or (!= CDE20 ‘Yes’) (!= CDE21 ‘Yes’)) CDE22 NULL) | ||
| D) Ordered | (Ordered CDE20 CDE21 CDE22) | |
Fig. 5Encoding Operated, Ordered, Required, and Dependent constraints for a repeated cCDE. a A ‘Medical History’ clinical document presented in a tabular format containing six attributes. b A repeated cCDE is created with the corresponding six aCDEs along with four constraint rules: (1) the start date (DE:47618) should be earlier than the end date (DE:47619): (< DE:47618 DE:47619); (2) all attributes are required to have values other than null, except for the end date (DE:47619): (Required DE:37059 DE:47621 DE:31106 DE:47618 DE:44078, 3) when a certain medical history is not ongoing (DE:44078), the end date (DE:47619) cannot be obtained, and vice versa: (IF (! = DE:44078 ‘Yes’) DE:47619 NULL); and (4) aCDEs can be ordered according to a constraint statement such as (Ordered DE:37059 DE:47621 DE:31106 DE:47618 DE:44078 DE:47619) (http://chmr2.snubi.org:8083/chmr/data_element_view.jsp?id=28477)
Numbers of aCDEs and cCDEs extracted from five clinical documents used at five teaching hospitals in Korea
| Hospital | Admission | Initial Medical | Discharge Summary | Emergency Not | Operation Note | Total No. of CDEs | f No. of Unique CDEs | g CDE Reuse Rate | |
|---|---|---|---|---|---|---|---|---|---|
| A | a CDE | 84 | 48 | 70 | 83 | 37 | 322 | 227 | 29.5% |
| bcCDE c (aCDE) | 10 (55) | 9 (40) | 6 (34) | 6 (45) | 2 (10) | 33 (184) | 16 (110) | ||
| d aCDE | 29 | 8 | 36 | 38 | 27 | 138 | 117 | ||
| e cCDE + aCDE | 39 | 17 | 42 | 44 | 29 | 171 | 133 | 24.5% | |
| C | CDE | 30 | 35 | 20 | 27 | 26 | 138 | 87 | 37.0% |
| cCDE (aCDE) | 2 (14) | 3 (20) | 2 (11) | 3 (15) | 1 (5) | 11 (65) | 5 (35) | ||
| aCDE | 16 | 15 | 9 | 12 | 21 | 73 | 52 | ||
| cCDE + aCDE | 18 | 18 | 11 | 15 | 22 | 84 | 57 | 33.3% | |
| G | CDE | 70 | 28 | 44 | 54 | 11 | 207 | 161 | 22.2% |
| cCDE (aCDE) | 4 (23) | 3 (17) | 2 (11) | 2 (17) | 1 (5) | 12 (73) | 7 (50) | ||
| aCDE | 47 | 11 | 33 | 37 | 6 | 134 | 111 | ||
| cCDE + aCDE | 51 | 14 | 35 | 39 | 7 | 146 | 118 | 18.8% | |
| P | CDE | 204 | 123 | 46 | 43 | 12 | 428 | 266 | 37.9% |
| cCDE (aCDE) | 7 (177) | 4 (99) | 3 (34) | 3 (39) | 0 (0) | 15 (349) | 7 (177) | ||
| aCDE | 27 | 24 | 12 | 4 | 12 | 79 | 89 | ||
| cCDE + aCDE | 34 | 28 | 15 | 7 | 12 | 94 | 96 | 36.2% | |
| S | CDE | 12 | 6 | 9 | 10 | 10 | 47 | 31 | 34.0% |
| cCDE (aCDE) | 1 (3) | 0 | 0 | 1 (4) | 0 | 2 (7) | 1 (4) | ||
| aCDE | 9 | 6 | 9 | 6 | 10 | 40 | 27 | ||
| cCDE + aCDE | 10 | 6 | 9 | 7 | 10 | 42 | 28 | 31.9% | |
| Total | CDE | 400 | 240 | 189 | 217 | 96 | 1142 | 606 | 53.1% |
| Unique CDE | 297 | 162 | 142 | 178 | 57 | 836 | 586 | 29.9% | |
| cCDE (aCDE) | 15 (224) | 14 (152) | 9 (71) | 9 (90) | 2 (10) | 49 (547) | 20 (327) | ||
| aCDE | 73 | 10 | 71 | 88 | 47 | 289 | 259 | ||
| cCDE + aCDE | 88 | 24 | 80 | 97 | 49 | 338 | 279 | 46.9% |
a No. of CDEs extracted from each clinical document from each hospital
b No. of cCDEs created for each clinical document
c No. of aCDEs contained in bcCDEs
d No. of remaining aCDEs that are not contained in any of the cCDEs in each clinical document
e Total no. of CDEs consisting of bcCDEs and daCDEs that are not contained in any of the cCDEs in each clinical document
f No. of unique CDEs across the five clinical documents
g Reuse ratio of CDEs across the five clinical documents
Numbers of aCDEs, cCDEs, and constraints at five teaching hospitals in Korea
| Hospital: CDE Semantic Type | A | C | G | P | S | |
|---|---|---|---|---|---|---|
| aCDE | Hybrid | 0 | 0 | 0 | 0 | 0 |
| Variable | 5 | 2 | 2 | 3 | 0 | |
| cCDE | Basic | 9 (20) | 2 (6) | 3 (8) | 2 (2) | 0 |
| Repeated | 2 (5) | 1 (2) | 2 (2) | 2 (6) | 1 (2) | |
| Dictionary | 5 (10) | 2 (3) | 2 (2) | 3 (8) | 0 | |
| Constraints | Operated | 4 (9) | 1 (5) | 2 (5) | 1 (1) | 0 |
| Required | 10 (25) | 3 (8) | 5 (11) | 3 (11) | 0 | |
| Dependent | 15 (26) | 0 | 3 (8) | 3 (10) | 1 (2) | |
| Ordered | 11 (29) | 4 (10) | 5 (11) | 3 (12) | 1 (2) | |
The numbers before the parentheses represent unique counts
Numbers of atomic and composite CDEs extracted from 14 FHIR resources of FHIR bulk sample data
| # | FHIR Resource | aCDE | bcCDE c (aCDE) | d aCDE | e cCDE + aCDE |
|---|---|---|---|---|---|
| 1 | AllergyIntolerance | 13 | 2 (13) | 0 | 2 |
| 2 | CarePlan | 18 | 4 (15) | 3 | 7 |
| 3 | Claim | 21 | 5 (13) | 6 | 11 |
| 4 | Condition | 13 | 2 (13) | 0 | 2 |
| 5 | DiagnosticReport | 13 | 3 (9) | 4 | 7 |
| 6 | Encounter | 15 | 4 (15) | 0 | 4 |
| 7 | Goal | 4 | 1 (4) | 0 | 1 |
| 8 | ImagingStudy | 23 | 3 (14) | 11 | 14 |
| 9 | Immunization | 12 | 1 (4) | 8 | 9 |
| 10 | MedicationRequest | 14 | 3 (14) | 0 | 3 |
| 11 | Observation | 22 | 5 (18) | 4 | 9 |
| 12 | Organization | 15 | 4 (15) | 0 | 4 |
| 13 | Patient | 42 | 8 (29) | 8 | 16 |
| 14 | Procedure | 13 | 3 (13) | 0 | 3 |
| f Total No. of CDEs | 238 | 48 (194) | 44 | 92 | |
| g No. of unique CDEs | 96 | 28 (75) | 21 | 49 | |
a No. of CDEs extracted from each FHIR resource sample data
b No. of cCDEs created for each FHIR resource sample data
c No. of aCDEs contained in bcCDEs
d No. of remaining aCDEs that are not contained in any of the cCDEs in each FHIR resource sample data
e Total no. of CDEs consisting of bcCDEs and daCDEs that are not contained in any of the cCDEs in each FHIR resource sample data
f Total no. of CDEs across 14 FHIR resources
g Total no. of unique CDEs across 14 FHIR resources
Numbers of atomic and composite CDEs and constraints in FHIR bulk data and MIMIC-III demo data
| Data Source: CDE Semantic Type | FHIR | MIMIC-III | |
|---|---|---|---|
| aCDE | Hybrid | N/A | 4 |
| Variable | 3 | 4 | |
| cCDE | General | 18 (64) | 4 (12) |
| Repeated | 7 (87) | 26 (180) | |
| Dictionary | 3 (17) | 4 (17) | |
| Constraints | Operated | N/A | 2 |
| Required | 34 | 52 | |
| Dependent | N/A | N/A | |
| Ordered | 2 | N/A | |
Fig. 6Mapping result of the FHIR Questionnaire resource mapped to the proposed CDE relationships and constraints
Differences between DDE and our CDE semantic relationships
| CDE Semantic Type | Characteristic | Difference from a DDE | |
|---|---|---|---|
| aCDE | Hybrid | Allowing the entry of multiple data types in a hybrid aCDE requires aCDEs that support different data types for the same data item | A DDE does not support the entry of multiple types of data |
| Variable | Connecting to an outside dictionary database | No dictionary-associated constraint in a DDE | |
| cCDE | General | Containing a set of aCDEs | Do not have output DE(s), but a DDE can be a cCDE |
| Repeated | Allowing sequential data entry into a | No | |
| Dictionary | Bringing biomedical knowledge from an outside dictionary database to a | No dictionary connection allowed for a DDE | |
| Constraint | Operated | Allowing mathematical/algebraic expressions between related aCDEs | A DDE has this constraint with the a CALCULATION type |
| Required | Forcing aCDE to have a value other than null | No | |
| Dependent | Dynamic enabling and disabling of an aCDE via a predicate | No | |
| Ordered | Ordering a set of aCDEs | A DDE has this constraint by default | |
a CALCULATION type in DDE only covers arithmetic operators (i.e., +, −, *, /) but, the operated constraints include not only arithmetic operators but also logical operators (i.e., <, >)
Comparison of our proposed semantic types and constraints with the FHIR Questionnaire resource and the other FHIR resources
| CDE Semantic Type | FHIR Questionnaire | FHIR other resources | |
|---|---|---|---|
| aCDE | Hybrid | No, it does not support the entry of multiple types of data. | Not applicable, there is no restriction on the datatype as it is represented JSON, XML. |
| Variable | Yes, it is supported by “coding”. | Yes, it is supported by “coding”. | |
| cCDE | General | Yes, it is supported because the FHIR is following a structured model. | Yes, it is supported because the FHIR is following a structured model. |
| Repeated | Yes, it is supported by “repeats”. | Yes, it is supported because the FHIR is allowing repeated representation of the group of items. | |
| Dictionary | Not applicable, it does not support any value related rule. | Not applicable, it does not support any value related rule. | |
| Constraint | Operated | Allowing only logical operations. | Only resources that have the “operator” are supported (e.g., Observation Resouce). |
| Required | Yes, it is supported by “required”. | Yes, it is supported by “required”. | |
| Dependent | Not applicable, it does not support any value related rule. | Not applicable, it does not support any value related rule | |
| Ordered | Although not explicit, it is included in the structure. | Only resources that have “sequences” are supported (e.g., Claim Resouce) | |