| Literature DB >> 25129841 |
Kai Yan Eugene Leung1, Fedde van der Lijn, Henri A Vrooman, Miriam C J M Sturkenboom, Wiro J Niessen.
Abstract
We propose an infrastructure for the automated anonymization, extraction and processing of image data stored in clinical data repositories to make routinely acquired imaging data available for research purposes. The automated system, which was tested in the context of analyzing routinely acquired MR brain imaging data, consists of four modules: subject selection using PACS query, anonymization of privacy sensitive information and removal of facial features, quality assurance on DICOM header and image information, and quantitative imaging biomarker extraction. In total, 1,616 examinations were selected based on the following MRI scanning protocols: dementia protocol (246), multiple sclerosis protocol (446) and open question protocol (924). We evaluated the effectiveness of the infrastructure in accessing and successfully extracting biomarkers from routinely acquired clinical imaging data. To examine the validity, we compared brain volumes between patient groups with positive and negative diagnosis, according to the patient reports. Overall, success rates of image data retrieval and automatic processing were 82.5 %, 82.3 % and 66.2 % for the three protocol groups respectively, indicating that a large percentage of routinely acquired clinical imaging data can be used for brain volumetry research, despite image heterogeneity. In line with the literature, brain volumes were found to be significantly smaller (p-value <0.001) in patients with a positive diagnosis of dementia (915 ml) compared to patients with a negative diagnosis (939 ml). This study demonstrates that quantitative image biomarkers such as intracranial and brain volume can be extracted from routinely acquired clinical imaging data. This enables secondary use of clinical images for research into quantitative biomarkers at a hitherto unprecedented scale.Entities:
Mesh:
Year: 2015 PMID: 25129841 PMCID: PMC4303741 DOI: 10.1007/s12021-014-9240-7
Source DB: PubMed Journal: Neuroinformatics ISSN: 1539-2791
Fig. 1Schematic overview of the modular design of the infrastructure. Details of each module are described in the Infrastructure section
Fig. 2An example output of the defacing algorithm, which removes facial features from images using image registration
Fig. 3Fig. 3 T1-weighted image (first row) and result after segmentation in brain tissue and CSF (second row). The brain is depicted in sagittal, coronal and axial orientation
Image acquisition parameters of the MR images used in the statistical analyses
| Scanner type | Image resolution | Image contrast | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Field strength | Slice thickness (mm) | Matrix | Pixel size (mm2) | Number of slices | Scanning sequence | Repetition time (s) | Echo time (ms) | Inversion time (s) | Flip angle (degrees) | Pixel bandwidth (Hz/pixel) | ||
Dementia (positive diagnosis)
| GE Discovery, 67 (81 %) GE Signa, 6 (7 %) Siemens Sonata, 10 (12 %) | 1.5 T 20 (24 %) 3.0 T 63 (76 %) | 1.0 (1.0-1.5) | 240 × 240, 60 (72 %) 320 × 191, 10 (12 %) 416 × 256, 6 (7 %) 256 × 256, 4 (5 %) … | 0.8 (0.3–0.8) | 176 (158–176) | GR 82 (99 %) | 7.9 (7.8–7.9) | 3.1 (3.1–3.1) | 450 (450–450) | 12 (12–12) | 244 (122–244) |
| SE 1 (1 %) | 437 | 10.1 | 0 | 90 | 122 | |||||||
Dementia (negative diagnosis)
| GE Discovery, 29 (37 %) GE Signa, 21 (27 %) Siemens Sonata, 28 (35 %) Philips NT Intera, 1 (1 %) | 1.5 T 58 (73 %) 3.0 T 21 (27 %) | 1.6 (1.0–5.0) | 240 × 240, 18 (23 %) 384 × 224, 13 (17 %) 256 × 132, 9 (11 %) 256 × 256, 7 (9 %) 320 × 191, 7 (9 %) … | 0.3 (0.3–0.8) | 150 (26–176) | GR 49 (54 %) | 7.9 (7.9–15.0) | 3.1 (3.0–4.8) | 450 (450–450) | 12 (12–23) | 170 (122–244) |
| SE 62 (38 %) | 565 (560–577) | 12.0 (11.2–14.0) | 0 (0–0) | 90 (90–90) | 94 (90–98) | |||||||
Multiple sclerosis (positive diagnosis)
| GE Discovery, 9 (12 %) GE Signa, 22 (29 %) Siemens Sonata, 14 (18 %) Philips NT Intera, 31 (41 %) | 1.5 T 68 (90 %) 3.0 T 8 (10 %) | 5.0 (5.0–5.0) | 256 × 192, 23 (30 %) 256 × 132, 6 (8 %) 416 × 384, 6 (8 %) 256 × 144, 3 (4 %) … | 0.8 (0.8–0.8) | 24 (23–26) | GR 4 (5 %) | 14.0 (9.6–15.8) | 4.8 (3.6–8.7) | 450 (450–450) | 23 (15–28) | 170 (122–170) |
| SE 72 (95 %) | 2500 (2,420–2,652) | 15.0 (8.0–20.0) | 0 (0–0) | 90 (90–90) | 122 (122–163) | |||||||
Multiple sclerosis (negative diagnosis)
| GE Discovery, 4 (6 %) GE Signa, 41 (59 %) Siemens Sonata, 12 (17 %) Philips NT Intera, 13 (19 %) | 1.5 T 65 (93 %) 3.0 T 5 (7 %) | 3.5 (2.0–5.0) | 256 × 224, 17 (24 %) 256 × 192, 12 (17 %) 256 × 132, 8 (11 %) 416 × 256, 8 (11 %) … | 0.8 (0.7–0.8) | 35 (23–74) | GR 28 (40 %) | 17.1 (13.4–17.1) | 7.6 (2.5–7.6) | 0 (0–400) | 30 (20–30) | 75 (70–75) |
| SE 42 (60 %) | 2,480 (2,440–2,652) | 15.0 (8.0–20.0) | 0 (0–0) | 90 (90–90) | 122 (122–195) | |||||||
Open question
| GE Discovery, 61 (18 %) GE Signa, 67 (20 %) Siemens Sonata, 58 (17 %) Philips NT Intera, 158 (46 %) | 1.5 T 295 (86 %) 3.0 T 49 (14 %) | 2.0 (1.5–2.0) | 256 × 224, 39 (11 %) 256 × 157, 34 (10 %) 256 × 256, 25 (7 %) 240 × 240, 20 (6 %) 416 × 256, 18 (5 %) … | 0.8 (0.24–0.8) | 70 (64–144) | GR 302 (86 %) | 16.3 (13.4–16.5) | 7.6 (3.7–10.0) | 400 (0–450) | 30 (20–30) | 170 (75–217) |
| SE 42 (14 %) | 513 (500–564) | 12.7 (10.0–14.0) | 0 (0–0) | 90 (90–90) | 98 (90–122) | |||||||
Values represent number (%) or median(lower quartile-upper quartile). Abbreviations: GR gradient echo, SE spin echo
Fig. 4Scatter plots of brain volumes versus age, segregated by gender (Men: open circles. Women: open triangles). For the dementia and multiple sclerosis protocol groups, blue denotes a positive diagnosis and red a negative diagnosis. Regression lines for linear fit are shown for men and women combined. Results are presented for the three protocol groups: dementia, multiple sclerosis and open question respectively (after age ≥45 year and IQR selection). The last plot combines all protocol groups. Volumes are expressed as percentage of intracranial volume
Fig. 5Success rates (%) at different steps in the processing pipeline shown for the three study populations. Starting number of subjects are n = 246, 446 and 924 for the dementia, multiple sclerosis and open question groups
Characteristics of the study population, stratified by gender and group. Note that these numbers are obtained after the age selection of 45 years and older and the automatic outlier exclusion criterion
| Total | Men | Women | ||
|---|---|---|---|---|
Dementia (positive diagnosis) | Number | 83 | 48 | 35 |
| Age (years) | 67.5 ± 8.4 | 68.5 ± 8.4 | 66.1 ± 8.5 | |
Dementia (negative diagnosis) | Number | 79 | 39 | 40 |
| Age (years) | 60.5 ± 8.3 | 60.2 ± 8.1 | 60.7 ± 8.6 | |
Multiple sclerosis (positive diagnosis) | Number | 76 | 27 | 49 |
| Age (years) | 54.4 ± 7.4 | 54.1 ± 7.8 | 54.5 ± 7.3 | |
Multiple sclerosis (negative diagnosis) | Number | 70 | 35 | 35 |
| Age (years) | 58.3 ± 10.4 | 59.9 ± 10.6 | 56.7 ± 10.0 | |
| Open question | Number | 344 | 190 | 154 |
| Age (years) | 60.7 ± 9.8 | 60.5 ± 9.2 | 61.1 ± 10.5 | |
Age distributions are described by mean and standard deviations in years
Intracranial and brain volume (corrected for ICV) stratified per group and gender
| ICV in ml (CI 95 %) | Brain volume in ml (CI 95 %) | |||||
|---|---|---|---|---|---|---|
| Men | Women | Total | Men | Women | Total | |
Dementia (positive diagnosis) | 1,198 (1,169;1,227) | 1,069 (1,040;1,099) | 1,134 (1,105;1,162) | 916 (907;926) | 915 (906;924) | 915 (907;924) |
Dementia (negative diagnosis) | 1,235 (1,208;1,262) | 1,107 (1,080;1,134) | 1,171 (1,145;1,197) | 930 (931;948) | 938 (929;947) | 939 (930;947) |
Multiple sclerosis (positive diagnosis) | 1,199 (1,171;1,228) | 1,071 (1,044;1,098) | 1,135 (1,108;1,162) | 947 (938;956) | 945 (937;955) | 947 (938;955) |
Multiple sclerosis (negative diagnosis) | 1,216 (1,189;1,244) | 1,088 (1,061;1,116) | 1,152 (1,126;1,179) | 948 (939;957) | 946 (938;955) | 947 (939;956) |
| Open question | 1,212 (1,192;1,232) | 1,084 (1,063;1,104) | 1,148 (1,129;1,166) | 933 (927;940) | 932 (926;939) | 933 (927;939) |
| Total pooled | 1,212 (1,196;1,228) | 1,084 (1,068;1,100) | 935 (930;941) | 937 (931;942) | ||
Multiple linear regression: all analyses are corrected for age and scanner effects, Brain volumes are corrected for ICV, the covariates are evaluated at age = 60.5, ICV = 1,150 ml, the Philips NT Intera scanner is taken as reference. Analyses on the total pooled group are corrected for the groups
Fig. 6Intracranial and brain volumes stratified by gender and group. Multiple linear regression: all analyses are corrected for age and scanner effects, Brain volumes are corrected for ICV, the covariates are evaluated at age = 60.5, ICV = 1,150 ml, and the Philips NT Intera is taken as reference. Analyses on the total pooled group are corrected for the individual groups
The relationship between intracranial volume (ICV) and brain volume (corrected for ICV) with age
| ICV | Brain volume | |
|---|---|---|
| Age predictor | Age predictor | |
| β in ml per year (CI 95 %) | β in ml per year (CI 95 %) | |
| Dementia (positive diagnosis) | 0.8 (−1.9;3.5) | −0.5 (−1.2;0.2) |
| Dementia (negative diagnosis) | 3.5 (0.7;6.2) | −0.9 (−1.9;0.2) |
| Multiple sclerosis (positive diagnosis) | −0.1 (−2.9;2.7) | 0.2 (−0.8;1.2) |
| Multiple sclerosis (negative diagnosis) | −1.3 (−3.8;1.2) | −1.1 (−2.1;−0.2) |
| Open question | 1.2 (0.1;2.4) | −1.0 (−1.4;−0.7) |
| Total pooled | 1.0 (0.2;1.8) | −0.9 (−1.2;−0.6) |
Multiple linear regression: all analyses are corrected for gender and scanner effects. Brain volumes are corrected for ICV. Analyses on the total pooled group are corrected for the groups
Fig. 7Intracranial and brain volumes versus age. Multiple linear regression: all analyses are corrected for scanner effects, Brain volumes are corrected for ICV, Analyses on the total pooled group are corrected for the individual groups