| Literature DB >> 36195599 |
Sergios Gatidis1,2, Tobias Hepp3,4, Marcel Früh4, Christian La Fougère5,6,7, Konstantin Nikolaou4,6, Christina Pfannenberg4, Bernhard Schölkopf3, Thomas Küstner4, Clemens Cyran8, Daniel Rubin9.
Abstract
We describe a publicly available dataset of annotated Positron Emission Tomography/Computed Tomography (PET/CT) studies. 1014 whole body Fluorodeoxyglucose (FDG)-PET/CT datasets (501 studies of patients with malignant lymphoma, melanoma and non small cell lung cancer (NSCLC) and 513 studies without PET-positive malignant lesions (negative controls)) acquired between 2014 and 2018 were included. All examinations were acquired on a single, state-of-the-art PET/CT scanner. The imaging protocol consisted of a whole-body FDG-PET acquisition and a corresponding diagnostic CT scan. All FDG-avid lesions identified as malignant based on the clinical PET/CT report were manually segmented on PET images in a slice-per-slice (3D) manner. We provide the anonymized original DICOM files of all studies as well as the corresponding DICOM segmentation masks. In addition, we provide scripts for image processing and conversion to different file formats (NIfTI, mha, hdf5). Primary diagnosis, age and sex are provided as non-imaging information. We demonstrate how this dataset can be used for deep learning-based automated analysis of PET/CT data and provide the trained deep learning model.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36195599 PMCID: PMC9532417 DOI: 10.1038/s41597-022-01718-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Patient characteristics across the dataset subcategories.
| diagnosis | patient sex | n/o studies | age [mean SD] |
|---|---|---|---|
| Melanoma | female | 77 | 65.0 ± 12.8 |
| male | 111 | 65.7 ± 13.7 | |
| Lymphoma | female | 69 | 45.1 ± 19.7 |
| male | 76 | 47.3 ± 17.9 | |
| Lung Cancer | female | 65 | 64.2 ± 8.7 |
| male | 103 | 67.0 ± 9.0 | |
| Negative | female | 233 | 59.1 ± 14.7 |
| male | 280 | 58.7 ± 15.1 | |
| All | female | 444 | 58.5 ± 16.1 |
| male | 570 | 60.1 ± 15.9 |
Fig. 1Dataset properties. (a) Coronal views of CT (left) and FDG-PET (right) image volumes without pathologic findings. (b) Example of manual tumor segmentation (bottom image, green area) of a lung cancer mass; top: CT, middle: FDG-PET (c) Distribution of mean SUV, MTV and TLG of studies in patients with lung cancer (blue), lymphoma (red) and melanoma (yellow).
Fig. 2Dataset structure. Patients are identified by a unique, anonymized ID and all studies of a single patient are stored under the respective patient path. (a) DICOM data: Each study folder contains three subfolders with DICOM files of the PET volume, the CT volume and the segmentation mask. (b) NIfTI data: Using the provided conversion script, DICOM data can be converted to NIfTI files. In addition to NIfTI files of the PET volume (PET.nii.gz), the CT volume (CT.nii.gz) and the segmentation mask (SEG.nii.gz), this script generates NIfTI volumes of the PET image in SUV units (SUV.nii.gz) and a CT volume resample to the PET resolution and shape (CTres.nii.gz).
Fig. 3Training and evaluation. (a) Representative loss curve on training data (blue) and validation data (red) from one fold of a 5-fold cross validation. (b) Schematic visualization of the proposed evaluation metrics false positive and false negative volumes (in addition to the Dice score).
Fig. 4Quantitative evaluation of automated lesion segmentation. Top left: Correlation of automatically predicted tumor volume with ground truth tumor volumes from manual segmentation in positive studies. Top right: Distribution of Dice coefficients for automated versus manual tumor segmentation in positive studies. Bottom left: Distribution of false negative volumes over all positive studies. Bottom right: Distribution of false positive volumes over all studies.
Fig. 5Examples of automated lesion segmentation. (a) Example showing excellent agreement between manual (green) and automated (blue) tumor segmentation in a patient with lymphoma. Black arrows point to physiological FDG-uptake in the brain, heart, bowel and urinary bladder (from top to bottom) that was correctly not segmented. (b) Example of false positive segmentation of physiological structures with elevated FDG-uptake. Top: False positive partial segmentation of the left kidney. Bottom: False positive partial segmentation of back muscles.
| Measurement(s) | tumor lesions |
| Technology Type(s) | PET/CT |
| Sample Characteristic - Organism | Homo sapiens |