| Literature DB >> 35773546 |
Haridimos Kondylakis1, Esther Ciarrocchi2, Leonor Cerda-Alberich3, Ioanna Chouvarda4, Lauren A Fromont5, Jose Manuel Garcia-Aznar6, Varvara Kalokyri7, Alexandra Kosvyra4, Dawn Walker8, Guang Yang9, Emanuele Neri7.
Abstract
A huge amount of imaging data is becoming available worldwide and an incredible range of possible improvements can be provided by artificial intelligence algorithms in clinical care for diagnosis and decision support. In this context, it has become essential to properly manage and handle these medical images and to define which metadata have to be considered, in order for the images to provide their full potential. Metadata are additional data associated with the images, which provide a complete description of the image acquisition, curation, analysis, and of the relevant clinical variables associated with the images. Currently, several data models are available to describe one or more subcategories of metadata, but a unique, common, and standard data model capable of fully representing the heterogeneity of medical metadata has not been yet developed. This paper reports the state of the art on metadata models for medical imaging, the current limitations and further developments, and describes the strategy adopted by the Horizon 2020 "AI for Health Imaging" projects, which are all dedicated to the creation of imaging biobanks.Entities:
Keywords: Artificial intelligence; Diagnostic imaging; Metadata; Radiation therapy; Radiomics
Mesh:
Year: 2022 PMID: 35773546 PMCID: PMC9247122 DOI: 10.1186/s41747-022-00281-1
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Fig. 1An example of a Digital Imaging and Communications in Medicine (DICOM) contrast-enhanced magnetic resonance image (T1-weighted sequence) of the prostate and DICOM metadata about patient demographics, acquisition-related parameters and image-related parameters
Summary of the most relevant metadata models currently available, the type of metadata they represent, and the scope of the model
| Model | Type of metadata | Scope |
|---|---|---|
| DICOM extensions | Clinical variables | Extend DICOM metadata to other domains |
| SEDI | DICOM tags | Enable semantic search over DICOM tags |
| MIABIS | Biological samples and tissues | Standard for traditional biobanks and Biobanking and Biomolecular Resources Research Infrastructure–European Research Infrastructure Consortium Directory |
| OMOP CDM | Clinical variables | Standardise observational medical outcomes |
| FHIR | Clinical variables | Standard for health care data exchange |
| OMOP on FHIR | Clinical variables | Bidirectional mapping |
| ICGC-ARGO | Cancer-focused clinical variables | Standardise variables, attributes, and permissive values in the cancer domain |
DICOM Digital Imaging and Communications in Medicine, FHIR Fast Healthcare Interoperability Resources, ICGC-ARGO International Cancer Genome Consortium-Accelerating Research in Genomic Oncology, MIABIS Minimum Information About BIobank data Sharing, OMOP CDM Observational Medical Outcomes Partnership Common Data Model, SEDI Semantic DICOM
Summary of the AI4HI projects, listing their goals, use-cases, types of metadata identified so far
| Project | Goal | Considered use cases | Types of metadata | Adopted models |
|---|---|---|---|---|
| PRIMAGE | To build an imaging biobank for the training and validation of machine learning and multiscale simulation algorithms | Paediatric neuroblastoma and diffuse intrinsic pontine glioma | DICOM tags Image analysis metadata (registration, denoising, radiomics) Clinical variables | DICOM-MIABIS OMOP CDM |
| EuCanImage | To build a European cancer imaging platform for enhanced AI in oncology | Eight use cases regarding liver, breast, and colorectal cancer | Imaging data Clinical variables | DICOM-MIABIS ICGC-ARGO |
| INCISIVE | To improve cancer diagnosis and prediction with AI and big data | Lung, breast, colorectal, and prostate cancer | Imaging data Clinical and biological data | FHIR |
| CHAIMELEON | To develop a structured repository of health images and related clinical and molecular data | Lung, breast, prostate, and colorectal cancer | Imaging data Clinical variables | DICOM-MIABIS OMOP CDM |
| ProCancer-I | To develop an AI Platform integrating imaging data and models | Prostate cancer | Imaging data Clinical variables | DICOM-Radiation therapy OMOP CDM with Oncology Extension |
AI Artificial intelligence, AI4HI Artificial Intelligence for Health Imaging, DICOM Digital Imaging and Communications in Medicine, FHIR Fast Healthcare Interoperability Resources, ICGC-ARGO International Cancer Genome Consortium-Accelerating Research in Genomic Oncology, MIABIS Minimum Information About BIobank data Sharing, OMOP CDM Observational Medical Outcomes Partnership Common Data Model, SEDI Semantic DICOM
Metadata management approaches of the AI4HI projects
| Project | Metadata collection | Metadata types | Models used | Unique characteristics |
|---|---|---|---|---|
| PRIMAGE | Structured e-forms | Imaging, clinical, image radiomic analysis | DICOM for imaging metadata MIABIS for biological samples and tissue OMOP-CDM for clinical | Integration of the DICOM and MIABIS standards, and metadata model that captures the biomechanical/signalling behaviour of tumours |
| EuCanImage | Structured e-forms | Imaging, clinical | DICOM-MIABIS for imaging data Extension of ICGC-ARGO for clinical variables | Link between imaging and non-imaging data |
| INCISIVE | Structured e-forms | Clinical, biological, imaging | Multiple terminologies for clinical data ( FHIR for communication | Data Integration Quality Check Tool employed to identify whether data follow the harmonisation requirements defined |
| CHAIMELEON | Structured e-forms | Imaging, clinical | DICOM for imaging metadata MIABIS for biological samples and tissue OMOP-CDM for clinical | A multimodal analytical data engine will facilitate interpretation, extraction, data harmonisation, and exploitation of the stored information. The CHAIMELEON repository will ensure the usability and performance of the repository as a tool fostering AI experimentation |
| ProCancer-I | Data upload tool (e-forms) | Imaging, clinical | DICOM-Radiation therapy for imaging data OMOP-CDM for clinical data | Provides an extension to OMOP-CDM going beyond radiology/oncology extensions and introduces another model (AI passport) for modeling analysis workflows and AI development |
AI Artificial intelligence, AI4HI Artificial Intelligence for Health Imaging, ATC Anatomical Therapeutic Chemical Classification (World Health Organization), DICOM Digital Imaging and Communications in Medicine, FHIR Fast Healthcare Interoperability Resources, ICD 10 International Classification of Diseases 10, ICGC-ARGO International Cancer Genome Consortium-Accelerating Research in Genomic Oncology, MIABIS Minimum Information About BIobank data Sharing, OMOP CDM Observational Medical Outcomes Partnership Common Data Model, SEDI Semantic DICOM, SNOMED-CT Systematized Nomenclature of Medicine Clinical Terms
Fig. 2Level of EuCanImage variable mapping into the Accelerating Research in Genomic Oncology (ARGO) model based on the clinical use cases, at the time of assessment (July 2021). TBC To be confirmed
Fig. 3The design of the INCISIVE data model