| Literature DB >> 32377810 |
Mark Bukowski1, Robert Farkas2, Oya Beyan3,4, Lorna Moll5, Horst Hahn6, Fabian Kiessling7,8,9, Thomas Schmitz-Rode10.
Abstract
Digitization of medicine requires systematic handling of the increasing amount of health data to improve medical diagnosis. In this context, the integration of the versatile diagnostic information, e.g., from anamnesis, imaging, histopathology, and clinical chemistry, and its comprehensive analysis by artificial intelligence (AI)-based tools is expected to improve diagnostic precision and the therapeutic conduct. However, the complex medical environment poses a major obstacle to the translation of integrated diagnostics into clinical research and routine. There is a high need to address aspects like data privacy, data integration, interoperability standards, appropriate IT infrastructure, and education of staff. Besides this, a plethora of technical, political, and ethical challenges exists. This is complicated by the high diversity of approaches across Europe. Thus, we here provide insights into current international activities on the way to digital comprehensive diagnostics. This includes a technical view on challenges and solutions for comprehensive diagnostics in terms of data integration and analysis. Current data communications standards and common IT solutions that are in place in hospitals are reported. Furthermore, the international hospital digitalization scoring and the European funding situation were analyzed. In addition, the regional activities in radiomics and the related publication trends are discussed. Our findings show that prerequisites for comprehensive diagnostics have not yet been sufficiently established throughout Europe. The manifold activities are characterized by a heterogeneous digitization progress and they are driven by national efforts. This emphasizes the importance of clear governance, concerted investments, and cooperation at various levels in the health systems.Key Points• Europe is characterized by heterogeneity in its digitization progress with predominantly national efforts. Infrastructural prerequisites for comprehensive diagnostics are not given and not sufficiently funded throughout Europe, which is particularly true for data integration.• The clinical establishment of comprehensive diagnostics demands for a clear governance, significant investments, and cooperation at various levels in the healthcare systems.• While comprehensive diagnostics is on its way, concerted efforts should be taken in Europe to get consensus concerning interoperability and standards, security, and privacy as well as ethical and legal concerns.Entities:
Keywords: Artificial intelligence; Diagnosis; Electronic health records; Europe; Information storage and retrieval
Year: 2020 PMID: 32377810 PMCID: PMC7476980 DOI: 10.1007/s00330-020-06874-x
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1Overview of different data solutions for comprehensive diagnostics (CD) infrastructure. CD requires solutions for data integration and data analysis. Data integration can be performed based on own data (internal) or data imports (external) or a mixture of both. Data integration can be performed locally or in the cloud to build data warehouses or data lakes. One can also build the data bases from individual cases or groups. Data warehouses store organized data, which requires efforts of structuring and cleaning. Data lakes store raw data. Subsequent efforts need to be taken for the specific selection and organization of the data for each need/analysis. Data analysis can be performed on the integrated data. It can be descriptive (e.g., graphical presentation of data), inferential (concluding from the sample case to the collective), and predictive (pattern found in historical data are used to foresee the fate of present cases). These analyses can be performed locally or by cloud computing. For this purpose, statistical methods, artificial intelligence, and data mining are applied
Comparing the data analytics solutions for integrated health data with a selection of exemplary activities. ETL stands for extract, transform, load
| Features | Cloud computing | Data warehouses or lakes | Distributed analytics |
|---|---|---|---|
| Data Integration and Transformation | Data to be transferred and transformed into a central repository. ETL cost/effort comparably high. | Data to be transferred and transformed into a central repository. ETL cost/effort comparably high. | FAIR data points. ETL cost/effort comparably less. |
| Security | Data need to be exported to an external network and platform owner by different firms or entities. Rules and regulations should be checked. | Data reside in owner site but need to be moved into central server. Some data sources may be privacy sensitive and require encryption or anonymization before moving into central repository. | Data reside in actual source and owner has the authority. Authentication and authorization mechanisms need to be established between parties. |
| Speed | Network latency may occur. | No network latency issues. | Network latency may occur. |
| Scalability | Easy due to dynamic scaling model of cloud. | Need to purchase new software or hardware to accommodate large-scale growth. | Execution power requirement is distributed through different sites. Scalability requirement and extension cost may be less compared to central systems. |
| Data Integration | Assuming that all data are kept in a centralized manner, data integration will be easy. | Assuming that all data are kept in a centralized manner, data integration will be easy. | Data integration is done through executing aggregations on results coming from different nodes. |
| Cost | In general, cloud software is priced under a monthly or annual subscription, with additional recurring fees for support, training and updates. Considered as operating expenditure (an additional overhead cost the organization will continue to pay). | On-premise software is generally priced under a one-time perpetual license fee (usually based on the size of the organization or the number of concurrent users). There are recurring fees for support, training and updates. A capital expenditure (one large investment upfront). | Network cost |
| Reliability | Uptime and reliability are guaranteed through provider’s service agreement. | Dependent on the human resources and equipment and acquired support services. | Dependent on the reliability of each node in the systems. Multiple control centers reduce the risk of a system breakdown. |
| Exemplary activities | End of 2019 the National Health Service (NHS) Shared Business Services (SBS) in UK launched a cloud solutions framework valued at up to £500 m [ “The framework provides access to 24 carefully selected suppliers and offers bespoke and off-the-shelf solutions from cloud solution design and as-is assessment to end-to-end cloud solutions.” [ | The Research IT at Stanford Medicine established a clinical data warehouse “STAnford Research Repository” (STARR) that integrates data of several clinics. 2018 STARR-Radio was introduced: a cloud scale radiology imaging repository that brings data from PACS into a research archive with different modalities, e.g., chest and breast X-ray, MRI, CT, and ultrasound. [ 2019 STARR-OMOP was launched that includes EHR data from ~ 2.67 M patients. This data infrastructure is connected to Stanford Nero, which allows Big Data Analytics. [ | Research partners from the Netherlands, Belgium, and Germany implemented an IT infrastructure “euroCAT” in five radiation clinics. They showed a proof-of-principle with the use-case of predicting severe dyspnea after radiotherapy for future “big data” infra-structures and distributed learning studies for personalized medicine. [ |
Selection of tools with capabilities in data integration (DI) and data analysis (DA). The tools are specified according to their open-source (OS) availability and their use in scientific publications together with the top 5 countries/regions according to their (co-)authorships. As an example: XNAT might be one suitable open-source tool for managing radiomics DICOM (Digital Imaging and Communications in Medicine) image data and clinical patient data supporting the HL7 (Health Level 7) FHIR protocol (Fast Healthcare Interoperability Resources) (see Supplement for further details on the methods)
Selection of commonly used international standards and profiles for medical data documentation and communication. In addition, the use of the standards in scientific publications together with the top 5 countries/regions according to their (co-)authorships is shown (see Supplement for further details on the methods)
| Acronym | Full name | Issued by | Purpose | Publications and top 5 countries by (co-)author affiliation |
|---|---|---|---|---|
| ATC [ | Anatomical Therapeutic Chemical Classification System | WHO | Classification system for the active ingredients of drugs | 5008: 811 US, 707 SE, 659 NL, 577 DK, 517 DE |
| DICOM [ | Digital Imaging and Communication in Medicine | NEMA | Broadly accepted, open communication standard for encoding and exchange of medical image data and associated meta-information | 12,149: 5012 US, 1245 CN, 913 DE, UK 860, 564 KR |
| EDIFACT [ | Electronic Data Interchange for Administration, Commerce and Transport | UN/CEFACT (ISO 9735) | Open communication standard for the exchange of administrative data also healthcare IT systems | 49: 8 UK, 7 DK, 5 FR/US, 3 NL/SE |
| HL7 V2 [ | Health Level 7 | HL7 | Dominant communication standard for event notification between hospital IT systems | 237: 89 US, 14 KR, 11 UK, 9 FR/DE |
| HL7 V3, CDA [ | Health Level 7, Clinical Document Architecture | HL7 | Open documentation standard regarding the structure and content of clinical documents based on XML | 626: 228 US, 38 DE, 32 KR, 20 UK, 18 SP |
| HL7 V4, FHIR [ | Fast Healthcare Interoperability Resources | HL7 | Fine granular communication standard for medical resources, data, and interfaces, including application programming interface (API) specification | 450: 193 US, 33 DE, 28 UK, 25 CA, 14 NL |
| ICD [ | International Classification of Diseases | WHO | Documentation standard for disease classification | 52,948: 23,504 US, 6107 UK, 3271 CA, 2925 CN, 2800 TW |
| IHE PIX [ | Patient Identifier Cross Referencing | IHE | Integration profile for patient ID management based on HL7v2.x | 30: 11 US, 6 DE, 4 CA, 2 CN, 1 BR/DK/SI/ KR/FR/PT/UK |
| IHE XDS [ | Cross-Enterprise Document Sharing | IHE | Dominant interoperability profile for patient electronic health records | 103: 28 US, 10 CA,7 AT, 6 DE/NL |
| ISO/IEEE 11073 [ | Medical/Health Device Communication Standards | ISO/IEEE | Family of ISO, IEEE, and CEN joint standards addressing the interoperability of medical devices | 45: 11 KR, 8 US, 7 SP, 5 UK, 3 IT |
| LOINC [ | Logical Observation Identifiers Names and Codes | Regenstrief Institute | Documentation standard and ontology for laboratory data | 1227: 630 US, 45 DE, 35 UK, 33 KR, 29 FR |
| MeSH [ | Medical Subject Headings | NLM | Nomenclature enabling the indexing of medical publications | 10,526: 3142 US, 1488 UK, 1304 CN, 1111 CA, 640 AU |
| RadLex [ | Radiology Lexicon | RSNA | Comprehensive lexicon for standardized indexing and retrieval of radiology information resources | 267: 166 US, 16 DE, 12 UK, 11 CN, 10 CA/FR |
| SNOMED CT [ | Systemized Nomenclature of Medicine Clinical Terms | SNOMED Int. | Documentation standard for comprehensive, unified medical nomenclature comprising English and other languages | 3717: 1529 US, 367 UK, 173 DE, 151 SW, 144 FR |
| UMLS [ | Unified Medical Language System | NLM | Metathesaurus aiming at integrating all important medical terms | 2829: 1408 US, 190 UK, 146 CN, 100 DE, 85 FR |
| XSPA [ | Cross-Enterprise Security and Privacy Authorization | OASIS | eHealth profiles for Security Assertion Markup Language (SAML) and eXtensible Access Control Markup Language (XACML) | 15: 4 DE/US, 3 SP, 1 AT/BE/CA/PL/TW |
Overview of radiomics generations from handcrafted features to end-to-end learning and delta radiomics
| Radiomics generation | Technical details | References |
|---|---|---|
| 1st | Few, well-understood handcrafted features | Kumar et al 2012 [ |
| 2nd | Large number of generic, deterministic features | Aerts et al 2014 [ |
| 3rd | Self-learning deep CNNs as feature extractor | Khalvati et al 2018 [ |
| 4th | End-to-end deep learning integrates both feature extraction and classification/prediction | Hosny et al 2018 [ |
| Delta | Based on temporal feature changes instead of single time points, can be combined with all radiomics generations | Fave et al 2017 [ |
Fig. 2Annual international publication activity in radiomics from 2011 to 2019 (total 3009) based on a Web of Science search. a Number of publications. b Top 5 countries ranked by their number of (co-)authorships in publications (e.g., in 2012 there were two publications, both NL and US were involved, so each of them has two co-authorships in the two publications of 2012). c Number of highly cited publications (top 1% of the citations). d Top 5 countries ranked by their number of highly cited (co-)authorship publications (in total 60 high cited publications with 167 citations on average) (the methods and table of highly cited publications are part of the Supplement)
Fig. 3Share of 3009 radiomics publications in the clinical and technical research areas assigned by the Web of Science for 2011 to 2019. Multiple assignments of research areas per publications are possible (see Supplement for further details on the methods)
Analysis of European funding in health-related topics for data integration, data analysis, and radiomics. In this context, budget, industrial participation, and geographical hotspots based on Horizon 2020 and FP7 projects (not finished before 2015) were considered. *Industrial participation includes public private partnerships (PPP) such as the German Research Center for Artificial Intelligence (DFKI) (see Supplement for further details on the methods)
| Data integration (DI) | Data analysis (DA) | Radiomics | Total | |
|---|---|---|---|---|
| By number of projects | ||||
| Total number of projects | 26 | 304 | 5 | 330 |
| Top 10 geographical hotspots | DE (7) ES (5) UK (4) IT (2) NL (2) CH (2) LU (1) PL (1) FI (1) IL (1) | UK (45) DE (35) NL (34) ES (33) IT (20) FR (17) EL (15) CH (13) DK (12) IL (12) | NL (4) UK (1) | UK DE ES NL IT FR CH EL IL DK |
| % of projects coordinated by industry* | 23% | 32% | 40% | 31% |
| By project budget (budget rounded in million €) | ||||
| Total budget of projects | 164 | 1006 | 13 | 1170 |
| Top 10 geographical hotspots | ES (72) DE (29) UK (26) IT (14) NL (8) IL (7) CH (5) FI (2) LU (0.4) PL (0.1) | NL (167) UK (140) ES (127) DE (106) EL (71) IT (67) CH (63) FR (49) DK (43) FI (37) | NL (9) UK (4) | ES NL UK DE IT EL CH FR DK FI |
| % of budget of projects coordinated by industry* | 29% | 18% | 20% | 19% |
Top 3 industrial partners* (DI and DA budget > €9 million; radiomics budget > €2 million) | GlaxoSmithKline Alacris Theranostics Sintea Plustek | F. Hoffmann-La Roche Philips German Research Center for Artificial Intelligence (DFKI) | ptTheragnostic | F. Hoffmann-La Roche GlaxoSmithKline Philips |
Fig. 4Degree of digitization of different countries’ hospitals based on the annually averaged EMRAM Score provided by HIMSS Analytics. Since in 2018, the criteria of the EMRAM stages were slightly modified and recent data are not yet available, we present data evaluated between 2011 and 2017. The eight-stage EMRAM Score ranges from 0 “paper-based” to 7 “paperless with data analytics” and it considers specific aspects such as closed-loop medication management. Besides single European nations, also United States (US), Middle East, Canada, and Asia-Pacific (APAC) are included. The numbers on the right represent the EMRAM Scores from 2017. In addition to the countries, the numbers of hospitals with EMRAM Score in 2017 are indicated. We would like to point out that due to the different number of hospitals assessed with the EMRAM Score, only a tendency can be evaluated (see Supplement for further details on the methods)
Fig. 5Challenges and implemented solutions for the digitalization of national healthcare systems including examples of countries at different stages of evolution