| Literature DB >> 33548541 |
Miguel Pedrera-Jiménez1, Noelia García-Barrio2, Jaime Cruz-Rojo3, Ana Isabel Terriza-Torres4, Elena Ana López-Jiménez5, Fernando Calvo-Boyero6, María Jesús Jiménez-Cerezo7, Alvar Javier Blanco-Martínez8, Gustavo Roig-Domínguez9, Juan Luis Cruz-Bermúdez10, José Luis Bernal-Sobrino11, Pablo Serrano-Balazote12, Adolfo Muñoz-Carrero13.
Abstract
BACKGROUND: COVID-19 ranks as the single largest health incident worldwide in decades. In such a scenario, electronic health records (EHRs) should provide a timely response to healthcare needs and to data uses that go beyond direct medical care and are known as secondary uses, which include biomedical research. However, it is usual for each data analysis initiative to define its own information model in line with its requirements. These specifications share clinical concepts, but differ in format and recording criteria, something that creates data entry redundancy in multiple electronic data capture systems (EDCs) with the consequent investment of effort and time by the organization.Entities:
Keywords: COVID-19; Detailed clinical models; Electronic health records; Real world data; Semantics; Standards
Mesh:
Year: 2021 PMID: 33548541 PMCID: PMC7857038 DOI: 10.1016/j.jbi.2021.103697
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317
Fig. 1Stages of the methodology for obtaining EHR-derived data.
Fig. 2Mind map of the “Oxygen saturation” (“Saturación de oxígeno” in Spanish) archetype.
Fig. 3Code in ADL of the “Oxygen saturation” (“Saturación de oxígeno” in Spanish) archetype.
Fig. 4Iterative algorithm for generation of EHR-derived data extracts.
Fig. 5Extract of semantically interoperable EHR.
Fig. 6Code in R for generating data related to “Oxygen saturation” concept.
Fig. 7Overview of the methodology implementation process.
ISARIC-WHO OE dataset generated from healthcare data.
| EHR | ISARIC-WHO MODULE 1 | ISARIC-WHO MODULE 2 | |||||
|---|---|---|---|---|---|---|---|
| SARS-COV-2 | 9179 | 4286 | 4286 | 95.48 | – | – | – |
| Height | 6781 | 1060 | 1060 | 23.61 | – | – | – |
| Weight | 7596 | 1070 | 1070 | 23.84 | – | – | – |
| Temperature | 148,184 | 3926 | 3926 | 87.46 | 42,015 | 4405 | 98.13 |
| Heart rate | 131,251 | 3799 | 3799 | 84.63 | 39,849 | 4342 | 96.73 |
| Respiratory rate | 6456 | 364 | 364 | 8.11 | 3205 | 1142 | 25.44 |
| Systolic blood pressure | 107,477 | 3773 | 3773 | 84.05 | 39,430 | 4308 | 95.97 |
| Diastolic blood pressure | 107,388 | 3773 | 3773 | 84.05 | 39,425 | 4308 | 95.97 |
| Oxygen saturation | 132,486 | 2873 | 2873 | 64.00 | 36,506 | 4203 | 93.63 |
| Glasgow Coma score | 1012 | 478 | 478 | 10.65 | 737 | 677 | 15.08 |
| Hemoglobin | 37,683 | 4195 | 4195 | 93.45 | 21,971 | 4219 | 93.99 |
| Leukocytes | 37,326 | 4194 | 4194 | 93.43 | 21,965 | 4218 | 93.96 |
| Hematocrit | 37,318 | 4194 | 4194 | 93.43 | 21,965 | 4218 | 93.96 |
| Platelets | 37,322 | 4195 | 4195 | 93.45 | 21,967 | 4219 | 93.99 |
| aPTT | 21,978 | 4044 | 4044 | 90.09 | 13,766 | 4131 | 92.02 |
| Prothrombin time | 21,992 | 4044 | 4044 | 90.09 | 13,767 | 4130 | 92.00 |
| INR | 22,001 | 4044 | 4044 | 90.09 | 13,769 | 4130 | 92.00 |
| ALT/SGPT | 35,031 | 4109 | 4109 | 91.53 | 21,249 | 4193 | 93.41 |
| Bilirubin | 34,435 | 3974 | 3974 | 88.53 | 21,061 | 4192 | 93.38 |
| AST/SGOT | 34,302 | 3973 | 3973 | 88.51 | 20,964 | 4163 | 92.74 |
| Urea | 9896 | 1661 | 1661 | 37.00 | 6564 | 2363 | 52.64 |
| Lactate | 383 | 110 | 110 | 2.45 | 259 | 209 | 4.66 |
| Creatinine | 38,226 | 4168 | 4168 | 92.85 | 22,415 | 4208 | 93.74 |
| Sodium | 37,458 | 4161 | 4161 | 92.69 | 22,338 | 4207 | 93.72 |
| Potassium | 37,257 | 4130 | 4130 | 92.00 | 22,229 | 4204 | 93.65 |
| Procalcitonin | 3621 | 367 | 367 | 8.18 | 3133 | 1371 | 30.54 |
| C reactive protein | 29,695 | 4078 | 4078 | 90.84 | 20,372 | 4154 | 92.54 |
| LDH | 26,188 | 3934 | 3934 | 87.64 | 17,542 | 4104 | 91.42 |
| Creatine kinase | 14,965 | 1852 | 1852 | 41.26 | 11,538 | 3573 | 79.59 |
| Troponin T | 5091 | 751 | 751 | 16.73 | 3804 | 1714 | 38.18 |
| ESR | 286 | 14 | 14 | 0.31 | 64 | 47 | 1.05 |
| D-dimer | 7351 | 1864 | 1864 | 41.52 | 6238 | 2861 | 63.73 |
| Ferritin | 7613 | 615 | 615 | 13.70 | 4381 | 3160 | 70.39 |
| IL-6 | 1046 | 63 | 63 | 1.40 | 807 | 626 | 13.95 |
Standardized set of clinical observable entities.
| Concept | Data type | Values/Unit | SNOMED CT |
|---|---|---|---|
| Height | PQ | cm | 50373000 |Body height measure (observable entity)| |
| Weight | PQ | kg | 27113001 |Body weight (observable entity)| |
| Temperature | PQ | °C | 386725007 |Body temperature (observable entity)| |
| Heart rate | PQ | lat/min | 364075005 |Heart rate (observable entity)| |
| Respiratory rate | PQ | resp/min | 86290005 |Respiratory rate (observable entity)| |
| Systolic blood pressure | PQ | mmHg | 271649006 |Systolic blood pressure (observable entity)| |
| Diastolic blood pressure | PQ | mmHg | 271650006 |Diastolic blood pressure (observable entity)| |
| Oxygen saturation | PQ | % | 103228002 |Hemoglobin saturation with oxygen (observable entity)| |
| Oxygen concentration | PQ | % | 425608004 |Delivered oxygen concentration (observable entity)| |
| Oxygen flow rate | PQ | L/min | 427081008 |Delivered oxygen flow rate (observable entity)| |
| Mean blood pressure | PQ | mmHg | 6797001 |Mean blood pressure (observable entity)| |
| Defecation | INTEGER | 162098000 |Frequency of defecation (observable entity)| | |
| Urination | INTEGER | 364198000 |Frequency of urination (observable entity)| | |
| Vomit | INTEGER | 63361000122100 |Frequency of vomits (observable entity)| | |
| Smoking habit | CV | Non-smoker; | 266918002 |Tobacco smoking consumption (observable entity)| |
| Tobacco exposure | INTEGER | 782516008 |Number of calculated pack years for cumulative lifetime tobacco exposure (observable entity)| | |
| Date started smoking | DATE | 63371000122105 |Date started smoking (observable entity) | |
| Date ceased smoking | DATE | 160625004 |Date ceased smoking (observable entity)| | |
| Glasgow Coma score | INTEGER | 248241002 |Glasgow coma score (observable entity)| | |
| qSOFA score | INTEGER | 63451000122107 |qSOFA score (observable entity)| | |
| SOFA score | INTEGER | 63441000122105 |SOFA score (observable entity)| | |
| NEWS score | INTEGER | 63441000122102 |NEWS score (observable entity)| |
Standardized set of laboratory-related observable entities.
| Concept | Data type | Values/ Unit | LOINC |
|---|---|---|---|
| SARS-COV-2 | CV | Positive; | 94315-9 |SARS coronavirus 2 E gene [Presence] in Unspecified specimen by NAA with probe detection |
| Hemoglobin | PQ | g/dL | 718-7 Hemoglobin [Mass/volume] in Blood |
| Leukocytes | PQ | x1000/µL | 6690-2 Leukocytes [#/volume] in Blood by Automated count |
| Lymphocytes | PQ | x1000/µL | 731-0 Lymphocytes [#/volume] in Blood by Automated count |
| Platelets | PQ | x1000/µL | 777-3 Platelets [#/volume] in Blood by Automated count |
| Neutrophils | PQ | x1000/µL | 751-8 Neutrophils [#/volume] in Blood by Automated count |
| Eosinophils | PQ | x1000/µL | 711-2 Eosinophils [#/volume] in Blood by Automated count |
| Basophils | PQ | x1000/µL | 704-7 Basophils [#/volume] in Blood by Automated count |
| Hematocrit | PQ | % | 4544-3 Hematocrit [Volume Fraction] of Blood by Automated count |
| aPTT | PQ | Sec | 3173-2 aPTT in Blood by Coagulation assay |
| Prothrombin time | PQ | Sec | 5902-2 Prothrombin time (PT) |
| INR | PQ | {INR} | 6301-6 INR in Platelet poor plasma by Coagulation assay |
| Albumin | PQ | g/dL | 1751-7 Albumin [Mass/volume] in Serum or Plasma |
| ALT/SGPT | PQ | U/L | 1742-6 Alanine aminotransferase [Enzymatic activity/volume] in Serum or Plasma |
| Bilirubin | PQ | mg/dL | 1975-2 Bilirubin.total [Mass/volume] in Serum or Plasma |
| AST/SGOT | PQ | U/L | 1920-8 Aspartate aminotransferase [Enzymatic activity/volume] in Serum or Plasma |
| Urea | PQ | mg/dL | 3091-6 Urea [Mass/volume] in Serum or Plasma |
| Lactate | PQ | mmol/L | 2524-7 Lactate [Moles/volume] in Serum or Plasma |
| Creatinine | PQ | mg/dL | 2160-0 Creatinine [Mass/volume] in Serum or Plasma |
| Sodium | PQ | mEq/L | 2951-2 Sodium [Moles/volume] in Serum or Plasma |
| Potassium | PQ | mEq/L | 2823-3 Potassium [Moles/volume] in Serum or Plasma |
| Procalcitonin | PQ | ng/mL | 33959-8 |Procalcitonin [Mass/volume] in Serum or Plasma |
| C reactive protein | PQ | mg/dL | 1988-5 C reactive protein [Mass/volume] in Serum or Plasma |
| LDH | PQ | U/L | 2532-0 Lactate dehydrogenase [Enzymatic activity/volume] in Serum or Plasma |
| Creatine kinase | PQ | U/L | 2157-6 Creatine kinase [Enzymatic activity/volume] in Serum or Plasma |
| Troponin T | PQ | ng/L | 67151-1 Troponin T.cardiac [Mass/volume] in Serum or Plasma by High sensitivity method |
| ESR | PQ | mm/h | 30341-2 Erythrocyte sedimentation rate |
| Fibrinogen | PQ | mg/dL | 3255-7 Fibrinogen [Mass/volume] in Platelet poor plasma by Coagulation assay |
| D-dimer | PQ | ng/mL | 48067-3 Fibrin D-dimer FEU [Mass/volume] in Platelet poor plasma by Immunoassay |
| Triglyceride | PQ | mg/dL | 2571-8 Triglyceride [Mass/volume] in Serum or Plasma |
| Ferritin | PQ | ng/mL | 2276-4 Ferritin [Mass/volume] in Serum or Plasma |
| IL-6 | PQ | pg/mL | 26881-3 Interleukin 6 [Mass/volume] in Serum or Plasma |
| pO2 | PQ | mmHg | 2703-7 Oxygen [Partial pressure] in Arterial blood |
| pCO2 | PQ | mmHg | 2019-8 Carbon dioxide [Partial pressure] in Arterial blood |
| FiO2 | PQ | % | 3150-0 Inhaled oxygen concentration |
| SaO2 | PQ | % | 2708-6 Oxygen saturation in Arterial blood |