Literature DB >> 35154004

Gender-, Age- and Region-Specific Characterization of Vertebral Bone Microstructure Through Automated Segmentation and 3D Texture Analysis of Routine Abdominal CT.

Michael Dieckmeyer1, Nico Sollmann1,2,3, Malek El Husseini1, Anjany Sekuboyina1, Maximilian T Löffler1,4, Claus Zimmer1, Jan S Kirschke1,2, Karupppasamy Subburaj5,6, Thomas Baum1.   

Abstract

Purpose: To identify long-term reproducible texture features (TFs) of spinal computed tomography (CT), and characterize variations with regard to gender, age and vertebral level using our automated quantification framework.
Methods: We performed texture analysis (TA) on baseline and follow-up CT (follow-up duration: 30-90 days) of 21 subjects (8 females, 13 males, age at baseline 61.2 ± 9.2 years) to determine long-term reproducibility. TFs with a long-term reproducibility error Δrel<5% were further analyzed for an association with age and vertebral level in a cohort of 376 patients (129 females, 247 males, age 62.5 ± 9.2 years). Automated analysis comprised labeling and segmentation of vertebrae into subregions using a convolutional neural network, calculation of volumetric bone mineral density (vBMD) with asynchronous calibration and TF extraction. Varianceglobal measures the spread of the gray-level distribution in an image while Entropy reflects the uniformity of gray-levels. Short-run emphasis (SRE), Long-run emphasis (LRE), Run-length non-uniformity (RLN) and Run percentage (RP) contain information on consecutive voxels of a particular grey-level, or grey-level range, in a particular direction. Long runs (LRE) represent coarse texture while short runs (SRE) represent fine texture. RLN reflects similarities in the length of runs while RP reflects distribution and homogeneity of runs with a specific direction.
Results: Six of the 24 extracted TFs had Δrel<5% (Varianceglobal, Entropy, SRE, LRE, RLN, RP), and were analyzed further in 4716 thoracolumbar vertebrae. Five TFs (Varianceglobal,SRE,LRE, RLN,RP) showed a significant difference between genders (p<0.001), potentially being caused by a finer and more directional vertebral trabecular microstructure in females compared to males. Varianceglobal and Entropy showed a significant increase from the thoracic to the lumbar spine (p<0.001), indicating a higher degree and earlier initiation of trabecular microstructure deterioration at lower spinal levels. The four higher-order TFs showed significant variations between spine regions without a clear directional gradient (p ≤ 0.001-0.012). No TF showed a clear age dependence. vBMD differed significantly between genders, age groups and spine regions (p ≤ 0.001-0.002).
Conclusion: Long-term reproducible CT-based TFs of the thoracolumbar spine were established and characterized in a predominantly older adult study population. The gender-, age- and vertebral-level-specific values may serve as foundation for osteoporosis diagnostics and facilitate future studies investigating vertebral microstructure.
Copyright © 2022 Dieckmeyer, Sollmann, El Husseini, Sekuboyina, Löffler, Zimmer, Kirschke, Subburaj and Baum.

Entities:  

Keywords:  automated segmentation; bone microstructure; multi-detector computed tomography; osteoporosis; texture analysis

Mesh:

Year:  2022        PMID: 35154004      PMCID: PMC8828577          DOI: 10.3389/fendo.2021.792760

Source DB:  PubMed          Journal:  Front Endocrinol (Lausanne)        ISSN: 1664-2392            Impact factor:   5.555


Introduction

Texture analysis (TA) is an emerging subfield of radiomics, representing a non-invasive and objective method for the quantitative assessment of medical images. Texture features (TFs) can be used to quantitatively characterize image properties, such as uniformity, heterogeneity and randomness, as well as repetitive image patterns. TA has the potential to enable the extraction of additional diagnostic, predictive and prognostic information beyond what is visually perceptive (1). Traditionally, applications of TA include neuroimaging, musculoskeletal and oncological imaging, e.g. to assess tumor heterogeneity (2, 3). TA has been performed across different anatomical regions and modalities, with CT being the modality used most frequently, arguably due to its multitude of clinical applications, broad availability and data quality (4). Overall, musculoskeletal applications have been based on radiographic, magnetic resonance imaging (MRI) (5, 6) and CT data (7–9). Osteoporosis is a systemic disorder of bone metabolism characterized by reduced bone mineralization and microarchitectural deterioration of osseous tissue (10, 11). Bone health and the closely associated vertebral fracture (VF) risk are dependent on bone mineral density (BMD) and bone microstructure which is primarily defined by the three-dimensional (3D) trabecular bone architecture as suggested by high correlations with micro-CT as reference (12, 13). Volumetric BMD (vBMD) can be measured on a vertebra-specific level with high spatial resolution using quantitative CT (QCT), while CT-based TA enables the evaluation of trabecular bone microstructure. Therefore, osteoporosis represents one of the most promising clinical applications of TA. In a small study cohort, Mookiah et al. showed that TA is feasible for opportunistic osteoporosis screening and can discriminate subjects with and without VFs accurately while exhibiting acceptable long-term scan-rescan reproducibility. However, the size of the used study population does not enable the derivation of TF reference values (7). Muehlematter et al. used machine learning algorithms on CT-based TA to better predict incident VFs (8). Through the combination of 3D-TA and regional vBMD, Valentinitsch et al. could improve the classification of prevalent VFs. In this study, the eventually analyzed TFs were selected from a variety of features, using an exponential search based on the Gini Index and subsequently used for classification of VFs which represent the most relevant clinical outcome of osteoporosis. However, the actual TF values were not reported (9). In spine imaging, commonly used TFs include first-order, second-order and higher-order statistical features, and reliable extraction of specific TFs was derived from sagittal reformations of up to 3 mm slice thickness in routine abdominal multi-detector CT (MDCT) scans with administration of intravenous contrast medium (IVCM) (7). However, TA for the evaluation of the osseous microstructure of the spine and ensuing bone health has not been standardized, yet. One essential and time-consuming part of the TF extraction process is the segmentation to create individual regions of interest (ROIs) of the vertebral bodies, which has usually been performed manually on single slices. However, recent advancements in deep learning, in particular in convolutional neural networks, enable a standardized 3D segmentation by a fully automated pipeline (14–16). In order to discriminate TF values indicative of osteoporosis from normal values, and to enable longitudinal analyses of vertebral microstructure, the characterization of reproducible TF values is needed and can therefore be considered a prerequisite for diagnostic and therapeutic decisions in osteoporosis that use TFs. Therefore, purposes of our study were (i), to identify CT-based TFs of vertebral bone with high long-term reproducibility which would be particularly important for longitudinal studies in the context of intervention monitoring, and (ii), to generate an automated standardized pipeline for segmentation and TA of the spine to establish characteristic TF values and determine physiological variations of the identified highly reproducible TFs with regard to gender, age and vertebral level in a mainly older adult population.

Materials and Methods

Subjects

First, for the assessment of long-term reproducibility, a cohort of 21 Caucasian patients who received baseline (BL) and follow-up (FU) routine abdominal MDCT scans with IVCM and a FU duration between 30 and 90 days were identified. Clinical indications were oncologic staging for the BL scans, and ruling out postoperative complications (e.g. anastomotic insufficiencies, fistulas, infections) for the FU scans. For each TF and vertebra, the relative difference between BL and FU measurement Δrel was calculated as a measure of long-term reproducibility, with TF and TF denoting the measurements of a certain TF at FU and BL, respectively, and averaged across all vertebrae (T4 - L5) for each patient. All TFs showing a reproducibility error Δrel < 5% were investigated further with regard to gender, age and vertebral level in a larger patient cohort as described in the following. Second, 376 Caucasian patients who received routine abdominal MDCT scans with IVCM were analyzed. Clinical indication was oncologic follow-up to rule out tumor recurrence. For each of these patients, all vertebrae from T4 to L5, completely included in the field of view (FOV), were analyzed. All patients were retrospectively identified in our hospital’s digital picture archiving and communication system (PACS). Inclusion criteria were the availability of an MDCT scan including the spine at the same scanner with a specific protocol as outlined below. Exclusion criteria were VFs, osteoporosis, history of bisphosphonate or other bone metabolism-influencing therapy, osseous metastases as well as hematological or other metabolic bone disorders. Osteoporotic vertebrae were defined as vBMD < 80 mg/cm3 according to the cutoff values suggested for spine QCT measurements by the American College of Radiology (ACR) (17). The present study was approved by the local institutional review board. The requirement of the written consent was waived due to the retrospective nature of the study.

CT Image Acquisition

All CT data was acquired on the same 64-row MDCT scanner (Somatom Sensation Cardiac 64; Siemens Medical Solutions, Erlangen, Germany). A standardized protocol with the following dedicated scanning parameters was applied. Average tube voltage: 120 kVp, adapted tube load: 200 mAs (averaged), minimum collimation of 0.6 mm. An intravenous contrast agent (Imeron 400, Bracco, Konstanz, Germany) was administered for each examination following a standard protocol using a high-pressure injector (Fresenius Pilot C; Fresenius Kabi, Bad Homburg, Germany) with a delay of 70 s, a flow rate of 3 mL/s, and a body weight-dependent dose (80 mL for body weight ≤ 80 kg, 90 mL for body weight > 80 kg and ≤ 100 kg, and 100 mL for body weight > 100 kg). Oral contrast medium (1000 mL of Barilux Scan; Sanochemia Diagnostics, Neuss, Germany) was given before each CT scan. Sagittal reformations of the spine with a slice thickness of 3 mm and reconstructed with a standard bone kernel were used for TA.

Automated Image Segmentation

The vertebrae T4 to L5 were automatically segmented in the MDCT images using a deep learning-driven framework (https://anduin.bonescreen.de). The freely available tool identifies and labels each vertebra in a fully automated process and creates corresponding segmentation masks. Additionally, for each vertebra, a defined set of segmentation masks of subregions, including the trabecular compartment of the vertebral body among others, is created ( ) (14, 15). The fully automatically generated labels and segmentation masks of all vertebrae were checked visually by a radiologist with two years of experience in spine imaging, and manually corrected if necessary. In total, labels and segmentations in 53 vertebral bodies of the 376 analyzed patients were manually corrected. Causes of imperfect labeling and segmentation were Schmorl nodes (33 vertebrae), severe degenerative changes (18), partial block vertebrae (3), hemivertebrae (1), or thoracolumbar and lumbosacral transitional vertebrae (7).
Figure 1

Exemplary illustration of the automated labeling and segmentation. (A) individually labeled and segmented vertebrae from T4-L5. (B, C) vertebral subregions, including the trabecular (white) and cortical (green) compartment of the vertebral body.

Exemplary illustration of the automated labeling and segmentation. (A) individually labeled and segmented vertebrae from T4-L5. (B, C) vertebral subregions, including the trabecular (white) and cortical (green) compartment of the vertebral body.

Texture Analysis

All TFs were calculated for the ROI corresponding to the trabecular compartment of each segmented vertebral body. The extracted TFs included three global features, also referred to as first-order statistical moments, which are computed by gray-level histogram analysis, eight second-order features, based on gray-level co-occurrence matrix (GLCM) analysis, and 13 higher-order features, based on gray-level run-length matrix (GLRLM) analysis ( ). In total, 24 TFs were extracted which quantify textural patterns (e.g. fine, coarse, smooth, or irregular) in an image. Each GLCM entry represents the probability of gray-level co-occurrence between voxel pairs at a given vector direction and a fixed length of 1 between the voxel pair. Each GLRLM entry is the probability of voxel occurrences of a specific gray-level for a possible run-length. A gray-level run is a set of successive voxels with identical gray-level values arranged collinearly along a given vector direction, and the run length is defined as the number of voxels in it. GLRLM features are computed based on the occurrence and distribution of such runs in a given image and measure the directional changes in the GLCM (26). In total, there are 13 direction vectors with a displacement of (dx, dy, dz). The matrices of all 13 directions were averaged and normalized before calculating the second-order GLCM and higher-order GLRLM indices. GLCM is an (n × n) matrix where n is the number of gray-levels in each image. GLRLM is an (n × m) matrix, where n is the number of gray-levels in the scan, and m is the run length. gives an overview of the analyzed TFs together with descriptions of the quantified image properties.
Table 1

Global (histogram-based), gray-level co-occurrence matrix (GLCM)-based, and gray-level run-length matrix (GLRLM)-based texture features and descriptions.

CategoryTexture featureDescription (What is quantified)?Reference
GlobalVarianceSpread of gray-level distributionGaztañaga et al. (20)
(Histogram)SkewnessShape of gray-level distribution 
 KurtosisFlatness of gray-level distribution 
Second-orderEnergyUniformityHaralick et al. (21)
(GLCM)ContrastLocal intensity variation 
 EntropyRandomness 
 HomogeneityHomogeneous scene 
 CorrelationLinear spatial relationships between texture elements 
 Sum averageSpread of the mean voxel co-occurrence distribution 
 VarianceVoxel co-occurrence distribution 
 DissimilarityHeterogeneity 
Higher-orderSREShort run distributionGalloway et al. (22)
(GLRLM)LRELong run distribution 
 GLNSimilarities of gray-level 
 RLNSimilarities in length of runs 
  RPDistribution and homogeneity of runs with a specific direction 
  LGLREDistribution of low-gray-level valuesChu et al. (23)
  HGLREDistribution of high-gray-level values 
  SRLGEJoint distribution of short runs and low gray-level valuesDasarathy et al. (24)
  SRHGEJoint distribution of short runs and high gray-level values 
  LRLGEJoint distribution of long runs and low gray-level values 
  LRHGEJoint distribution of long runs and high gray-level values 
  GLVWeighted variances of gray-level valuesThibault et al. (25)
  RLVWeighted variances of gray-level runs 

SRE, short-run emphasis; LRE, long-run emphasis; GLN, gray-level non-uniformity; RLN, run-length non-uniformity; RP, run percentage; LGLRE, low gray-level run emphasis; HGLRE, high gray-level run emphasis; SRLGLE, short-run low gray-level emphasis; SRHGLE, short-run high gray-level emphasis; LRLGLE, long-run low gray-level emphasis; LRHGLE, long-run high gray-level emphasis; GLV, gray-level variance; RLV, and run-length variance; table adapted from (19).

Global (histogram-based), gray-level co-occurrence matrix (GLCM)-based, and gray-level run-length matrix (GLRLM)-based texture features and descriptions. SRE, short-run emphasis; LRE, long-run emphasis; GLN, gray-level non-uniformity; RLN, run-length non-uniformity; RP, run percentage; LGLRE, low gray-level run emphasis; HGLRE, high gray-level run emphasis; SRLGLE, short-run low gray-level emphasis; SRHGLE, short-run high gray-level emphasis; LRLGLE, long-run low gray-level emphasis; LRHGLE, long-run high gray-level emphasis; GLV, gray-level variance; RLV, and run-length variance; table adapted from (19). In order to generate isotropic volumes of the image datasets necessary for comparable TA results, cubic interpolation was used. To prevent sparseness, gray-level quantization was performed using the normalized gray-levels (scale 0 to 1) of the ROI corresponding to the trabecular compartment of each vertebral body. All steps of the TA were performed with MATLAB (version R2021a; MathWorks Inc., Natick, MA, USA) using a modified version of a publicly available radiomics toolbox (https://github.com/mvallieres/radiomics) (27).

vBMD Extraction

vBMD was calculated for the ROI corresponding to the trabecular compartment of each segmented vertebral body applying asynchronous calibration to convert CT attenuation, measured in Hounsfield units (HU), to vBMD. Asynchronous calibration uses bone-equivalent density phantoms to generate HU-to-BMD conversion equations for a specific CT scanner and acquisition protocol. Previously established HU-to-BMD conversion equations were applied in this study using a phantom with hydroxyl-apatite inserts of known density in mg/cm³ (Anthropomorphic Abdomen Phantom, QRM Quality Assurance in Radiology and Medicine) (14). Linear correction equations for the presence of IVCM in portal-venous phase were applied to avoid vBMD bias (28).

Statistical Analysis

Statistical analysis was performed with SPSS (SPSS Inc., Chicago, IL, USA) and MATLAB (version R2021a; The Mathworks, Natick, MA, USA) using a two-sided level of significance of 0.05 for all statistical tests. The Kolmogorov-Smirnov test indicated normally distributed data for the analyzed TFs and vBMD, irrespective of gender. The association between vBMD and the analyzed TFs was determined using Pearson correlation coefficient (r). Independent t-tests were used to test for significant gender-dependent differences of TFs and vBMD. Analysis of variance (ANOVA) was used to test for significant differences across age groups and vertebral levels, respectively. For all tests, Bonferroni correction was applied to adjust for multiple comparisons.

Results

Long-Term Reproducibility of TFs

For each of the 21 included patients (8 females, 13 males, mean age at BL = 61.2 ± 9.2 years, age range at BL = 43.7 – 71.6 years), T4 to L5 were analyzed, resulting in a total of 294 analyzed vertebrae. The mean follow-up duration, which is the time interval between the BL and FU scan, was 59 ± 11 days (range: 34 – 77 days). Of the 24 analyzed TFs, four features showed a relative difference between FU and BL measurements (Δrel) below 1%, and two features between 1% and 5% ( ). In total, six TFs showed a reproducibility error Δrel < 5% (Varianceglobal, Entropy, SRE, LRE, RLN, RP) and were further analyzed with regard to gender, age and vertebral level.
Table 2

Relative difference Δrel between baseline and follow-up measurement of the analyzed TFs.

Texture feature Δrel [%]
Varianceglobal*2.22
Skewnessglobal 248.63
Kurtosisglobal 76.45
Energy45.16
Contrast64.63
Entropy*3.94
Homogeneity45.34
Correlation52.21
SumAverage18.82
Variance33.20
Dissimilarity40.26
SRE**0.15
LRE**0.59
GLN27.66
RLN**0.38
RP**0.19
LGLRE9.38
HGLRE21.57
SRLGLE9.33
SRHGLE21.64
LRLGLE9.59
LRHGLE21.32
GLV34.64
RLV34.32

TF, texture feature; SRE, short-run emphasis; LRE, long-run emphasis; GLN, gray-level non-uniformity; RLN, run-length non-uniformity; RP, run percentage; LGLRE, low gray-level run emphasis; HGLRE, high gray-level run emphasis; SRLGLE, short-run low gray-level emphasis; SRHGLE, short-run high gray-level emphasis; LRLGLE, long-run low gray-level emphasis; LRHGLE, long-run high gray-level emphasis; GLV, gray-level variance; RLV, run-length variance.

TFs with Δrel < 1% and Δrel < 5% are marked with ** and *, respectively.

Relative difference Δrel between baseline and follow-up measurement of the analyzed TFs. TF, texture feature; SRE, short-run emphasis; LRE, long-run emphasis; GLN, gray-level non-uniformity; RLN, run-length non-uniformity; RP, run percentage; LGLRE, low gray-level run emphasis; HGLRE, high gray-level run emphasis; SRLGLE, short-run low gray-level emphasis; SRHGLE, short-run high gray-level emphasis; LRLGLE, long-run low gray-level emphasis; LRHGLE, long-run high gray-level emphasis; GLV, gray-level variance; RLV, run-length variance. TFs with Δrel < 1% and Δrel < 5% are marked with ** and *, respectively.

TF Variations With Regard to Gender, Age and Vertebral Level

In total, 376 patients (129 females, 247 males, mean age = 62.5 ± 9.2 years, age range = 39.0 – 88.0 years) were included, and stratified into four age groups for each gender, respectively: <50 years, 50-59 years, 60-69 years, and ≥70 years ( and ). In total, vertebral bodies from T4 to L5 were included. In total, labels and segmentations were manually corrected in 53 of the 4716 vertebral bodies, resulting in a correction ration of 1.1%. The analyzed vertebral bodies were stratified into three spine regions: mid thoracic spine (T4-T8), lower thoracic spine (T9-T12) and lumbar spine (L1-L5). TF values with regard to gender, age and vertebral level are summarized in – .
Figure 2

Patient age vs vBMD (averaged across all analyzed vertebrae for each patient) for females (red) and males (blue); vBMD, volumetric bone mineral density.

Table 3

Patient demographics stratified by age groups. Mean ± standard deviation (SD) for age (in years).

  MaleFemale
< 50 years n 2117
 Age45.9 ± 2.743.8 ± 2.4
50-59 years n 6522
 Age55.0 ± 2.956.3 ± 2.4
60-69 years n 11461
 Age64.4 ± 2.664.7 ± 2.9
> 69 years n 4729
 Age74.9 ± 4.573.4 ± 3.6
Total n 247129
 Age62.3 ± 8.962.5 ± 9.5
Table 4

Mean and standard deviation (SD) of the six analyzed texture features and vBMD of females, grouped by spine region.

Spine region (females) T4-T8T9-T12L1-L5
Varianceglobal Mean24.6039.2455.94
SD5.306.246.62
EntropyMean14.1614.4014.60
SD0.590.660.73
SREMean0.99220.99160.9916
SD0.00230.00250.0025
LREMean1.03221.03441.0344
SD0.00970.01050.0106
RLNMean0.97940.97800.9781
SD0.00600.00640.0065
RPMean0.98950.98880.9888
SD0.00310.00330.0034
vBMD [mg/cm³]Mean137.2131.5128.9
SD39.839.439.3

SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Table 7

Mean and standard deviation (SD) of the six analyzed texture features and vBMD of males, grouped by age group.

Age group (males) <5050-5960-69≥70
Varianceglobal Mean49.6550.1250.6951.26
SD17.0616.8616.8116.77
EntropyMean14.3114.3314.3914.36
SD0.490.650.680.66
SREMean0.99090.99080.99110.9907
SD0.00200.00250.00250.0026
LREMean1.03731.03791.03661.0383
SD0.00860.01090.01040.0112
RLNMean0.97620.97590.97670.9757
SD0.00520.00650.00640.0068
RPMean0.98790.98770.98810.9876
SD0.00270.00340.00330.0035
vBMD [mg/cm³]Mean154.1142.2131.1125.2
SD38.137.233.429.8

SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Patient demographics stratified by age groups. Mean ± standard deviation (SD) for age (in years). Mean and standard deviation (SD) of the six analyzed texture features and vBMD of females, grouped by spine region. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. Mean and standard deviation (SD) of the six analyzed texture features and vBMD of males, grouped by spine region. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. Mean and standard deviation (SD) of the six analyzed texture features and vBMD of females, grouped by age group. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. Mean and standard deviation (SD) of the six analyzed texture features and vBMD of males, grouped by age group. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. Patient age vs vBMD (averaged across all analyzed vertebrae for each patient) for females (red) and males (blue); vBMD, volumetric bone mineral density. All of the six analyzed TFs, except Entropy, showed a statistically significant difference between genders ( ).
Table 8

Texture features and vBMD, averaged from T4-L5 and grouped by gender.

 MaleFemalep-value
Varianceglobal 50.54 ± 16.8439.56 ± 14.48<0.001
Entropy14.36 ± 0.6514.38 ± 0.68n.s.
SRE0.9910 ± 0.00250.9918 ± 0.0024<0.001
LRE1.0373 ± 0.01061.0336 ± 0.0103<0.001
RLN0.9763 ± 0.00640.9785 ± 0.0063<0.001
RP0.9879 ± 0.00330.9891 ± 0.0033<0.001
vBMD [mg/cm³]135.2 ± 35.3132.7 ± 39.60.029

Bonferroni-corrected p-values indicate statistically significant differences between genders. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; n.s., not significant.

Texture features and vBMD, averaged from T4-L5 and grouped by gender. Bonferroni-corrected p-values indicate statistically significant differences between genders. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; n.s., not significant. Varianceglobal and Entropy showed a statistically significant increase in cranio-caudal direction along the spine across all subjects as well as for both genders separately ( and , ). All other TFs (SRE, LRE, RLN and RP) showed statistically significant differences between spine regions across all subjects ( ). SRE, LRE, RLN and RP showed also statistically significant differences between spine regions for both genders separately, except for LRE, RLN and RP between T9-T12 and L1-L5 in females ( and , ).
Figure 3

Texture features and vBMD grouped by spine region for females (red) and males (blue). Asterisks (*) mark pairwise significant differences between spine regions. Bonferroni correction was applied to adjust for multiple comparisons. Error bars indicate ± 2 standard deviations. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Table 5

Mean and standard deviation (SD) of the six analyzed texture features and vBMD of males, grouped by spine region.

Spine region (males) T4-T8T9-T12L1-L5
Varianceglobal Mean32.4251.2168.03
SD7.028.217.80
EntropyMean14.2714.3414.47
SD0.570.630.73
SREMean0.99140.99050.9909
SD0.00220.00250.0026
LREMean1.03541.03921.0377
SD0.00950.01090.0110
RLNMean0.97740.97510.9760
SD0.00580.00660.0067
RPMean0.98850.98730.9878
SD0.00300.00340.0035
vBMD [mg/cm³]Mean144.2133.3127.7
SD36.134.133.5

SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Table 9

Pairwise significant differences of texture features between spine regions across all subjects.

Texture feature T4-T8 vs T9-T12T4-T8 vs L1-L5T9-T12 vs L1-L5
Varianceglobal <0.001<0.001<0.001
Entropy<0.001<0.001<0.001
SRE<0.001<0.0010.036
LRE<0.001<0.0010.041
RLN<0.001<0.0010.036
RP<0.001<0.0010.038

Bonferroni-corrected p-values corresponding to a level of significance of 0.05 are given. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage.

Pairwise significant differences of texture features between spine regions across all subjects. Bonferroni-corrected p-values corresponding to a level of significance of 0.05 are given. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage. Texture features and vBMD grouped by spine region for females (red) and males (blue). Asterisks (*) mark pairwise significant differences between spine regions. Bonferroni correction was applied to adjust for multiple comparisons. Error bars indicate ± 2 standard deviations. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. For both males and females, certain TFs showed a statistically significant difference between age groups, however without a clear directional trend ( and , ).
Figure 4

Texture features and vBMD grouped by age group for females (red) and males (blue). Asterisks (*) mark pairwise significant differences between consecutive age groups. Bonferroni correction was applied to adjust for multiple comparisons. Error bars indicate ± 2 standard deviations. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Table 6

Mean and standard deviation (SD) of the six analyzed texture features and vBMD of females, grouped by age group.

Age group (females) <5050-5960-69≥70
Varianceglobal Mean38.0641.7939.6038.68
SD14.6615.0214.5913.42
EntropyMean14.5414.4214.2814.47
SD0.480.680.680.78
SREMean0.99230.99180.99150.9922
SD0.00190.00240.00240.0027
LREMean1.03171.03351.03491.0320
SD0.00800.01010.01040.0114
RLNMean0.97970.97860.97780.9795
SD0.00490.00620.00630.0070
RPMean0.98970.98910.98870.9896
SD0.00250.00320.00330.0036
vBMD [mg/cm³]Mean179.6134.0126.0113.7
SD43.733.533.726.4

SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density.

Texture features and vBMD grouped by age group for females (red) and males (blue). Asterisks (*) mark pairwise significant differences between consecutive age groups. Bonferroni correction was applied to adjust for multiple comparisons. Error bars indicate ± 2 standard deviations. SRE, short-run emphasis; LRE, long-run emphasis; RLN, run-length non-uniformity; RP, run percentage; vBMD, volumetric bone mineral density. vBMD showed a statistically significant difference between genders ( ), a statistically significant decrease with age as well as statistically significant differences between spine regions for both males and females ( , ). Linear correlation analysis revealed statistically significant (p < 0.05) low positive associations (r = 0.39, r = 0.31, r = 0.31, r = 0.31), and very low to low negative associations (r = -0.19, r = -0.31) between the analyzed TFs and vBMD, respectively.

Discussion

The present study performed automated spine segmentation and TF extraction of vertebral bodies in routine abdominal MDCT scans in a large-scale patient cohort. In a preceding analysis, long-term reproducibility of a set of commonly utilized TFs was evaluated. Six TFs with high reproducibility were identified and selected for further analysis. To the best of our knowledge, this is the first study combining fully automated segmentation, vBMD mapping and long-term reproducibility analysis to establish gender-, age-, and region-specific characteristic values and variations of vertebral TFs. This data could serve as foundation for a reliable differentiation of values indicative of osteoporosis from normal values across genders, age groups and spine regions. In particular, it could be a prerequisite for the longitudinal comparison of TFs. Varianceglobal and Entropy showed a significant increase in cranio-caudal direction along the spine for both males and females. The other four TFs, which are all GLRLM-derived features, also showed significant region-dependent differences, except for females between the lower thoracic and lumbar spine. Five of the six TFs showed a significant difference between genders. None of the TFs showed a significant age-dependence. vBMD showed a significant decrease with age as well as statistically significant differences between genders and spine regions, confirming results from previous studies (13, 18, 29). Dimension reduction and feature selection is a commonly performed step in TA studies to reduce redundancy, and to ensure that selected features fulfill certain conditions and have high relevance. However, the applied approaches are manifold and vary substantially between studies. Among many other methods, GLCM- and GLRLM-derived features can be restricted to or averaged across directions and distances (7, 30). Another popular approach is to only select reproducible features with high inter- and intra-reader agreement (8, 30). Mannil et al. additionally performed correlation analysis and excluded highly correlating TFs to reduce redundancy (30). The emergence of machine learning has further extended the repertoire of feature reduction methods. Using a random forest classifier to identify vertebral fractures, Valentinitsch et al. optimized the number of features based on the Gini importance and classification performance in an exponential search (9). We chose a more clinically driven approach and selected TFs based on a preceding long-term reproducibility analysis to identify features particularly suited for longitudinal comparisons. Our study determined characteristic TF values based on CT images of vertebral bodies. The found values for Varianceglobal show substantial differences to the values reported by Mannil et al. (30), who performed the only similar study we could find in previous literature. This discrepancy, however, is potentially due to the limited comparability of the two studies. While Mannil et al. used manually prescribed ROIs of mid-sagittal planes of the vertebral body for two-dimensional (2D) TA (30), we utilized 3D-ROIs of the entire volume of the trabecular compartment of the vertebral body with subsequent 3D-TA. As a result, number and intra-vertebral localization of the included voxels as well as their gray value distribution vary remarkably between the two studies. Differences in CT image acquisition and reconstruction as well as slice thickness (2 mm vs. 3 mm) may additionally contribute to the discrepancy of the reported TF values. Fully automated labeling and 3D-segmentation of the spine represent major improvements in terms of time efficiency, reproducibility and utilization of available CT data when compared to manually performed 2D-segmentations. Therefore, the segmentation approach used in the present study should be considered as the standard for TA of the spine. Characteristic values of TFs should be used and compared with great care and ideally only if acquisition and reconstruction parameters as well as postprocessing methods are identical. Therefore, we used images acquired at the same MDCT scanner with a standardized protocol, a fully automated and standardized segmentation method, as well as careful preparation of CT data prior to TF computation, including isotropic resampling and gray-level normalization. The TFs Varianceglobal and Entropy showed significant regional differences, increasing from the mid-thoracic to the lumbar spine, irrespective of gender. Along the spine, more inferior vertebral bodies have larger size as well as a higher prevalence and degree of degenerative changes (29, 31). This could be potential explanations for the cranio-caudal increase in Varianceglobal since it reflects the spread of gray-level distribution. The GLCM feature Entropy is a measure of randomness in an image. However, it is not trivial to define what image properties it actually represents. According to Haralick et al., Entropy is associated with gray-level number, complexity and degree of structure in an image (21, 32). Therefore, the found region-dependent variations could indicate a gradient in architectural complexity along the spine, potentially suggesting variability in trabecular microstructure and skeletal integrity (33). The TFs SRE, LRE, RP and RLN also showed gender-specific differences, as well as region-dependent differences, irrespective of gender for most features. These GLRLM features are highly associated with each other and, according to their defining equations, reflect the predominating length of gray-level runs, the distribution of gray-level run lengths, as well as the extent of linear structures (22). The variations we found may therefore indicate gender- and region-specific variability in trabecular microstructure, in line with previous ex vivo results (34). There are several limitations to our study. First, we performed a retrospective analysis with limited availability of weight and height as well as other bone health-related data of the study cohort. Therefore, we did not adjust our results for variations in body mass index (BMI) or other potential confounders, such as smoking status, which are known to be correlated with BMD and could also have an influence on TFs. Second, n = 376 is relatively small sample size for establishing reference values, and the age range of the study is limited, in particular the age group under 40 years is not represented in our cohort. However, primary osteoporosis, as the most important condition for clinical application of the herein used approach and derived characteristic TF values, usually does not start before the age of 40 years. In contrast to values derived from a population of young individuals with normal bone health, age-, gender- and region-specific may also represent a viable approach for the differentiation of physiological and pathological bone alterations of the spine. Nevertheless, extending the analysis to an increased study population with a broader age range should be an aim of future investigations, in order to firmly establish a reference database. Third, the initial set of TFs we considered in our analysis was limited, and additional TFs (e.g. based on local binary patterns, wavelets or other transforms) could have been used (35, 36). Fourth, to maximize comparability of the computed values, CT data acquisition and postprocessing as well as TA were performed with homogenous and standardized settings. The restriction to a specific CT scanner and acquisition protocol may reduce the utility of the findings to other investigations using different settings. However, TF values, in particular for GLCM- and GLRLM-derived features, can only be reliably used for comparisons in studies with identical settings.

Conclusion

In conclusion, we established highly reproducible TF values for CT-based 3D-TA of vertebral bodies in a predominantly older study population, using an automated segmentation and quantification pipeline. These characteristic TF values and the found gender-, age-, and vertebral-level-specific variations can be considered a foundation for the reliable differentiation of physiological and pathological alterations and may be particularly important for future longitudinal studies.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by Ethikkommission der Technischen Universität München. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

MD wrote the manuscript. MD organized the database. MD performed the formal and statistical analysis. MD contributed to the concept and design of the study. MD developed major parts of the used software routines. NS, MH, AS, ML, CZ, JK, KS, and TB critically revised the manuscript. MH, AS, and ML contributed to the development of the software used in the study. TB conceptualized and designed the study. CZ, JK, KS, and TB provided scientific supervision and resources. JK and TB provided funding. TB contributed to the generation and organization of the database. All authors contributed to the article and approved the submitted version.

Funding

The present work was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG, project 432290010, JK and TB), the German Society of Musculoskeletal Radiology (Deutsche Gesellschaft für Muskuloskelettale Radiologie, DGMSR, MD and NS), and the B. Braun Foundation (project BBST-D-19-00106, NS).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  24 in total

1.  Vertebral body insufficiency fractures: detection of vertebrae at risk on standard CT images using texture analysis and machine learning.

Authors:  Urs J Muehlematter; Manoj Mannil; Anton S Becker; Kerstin N Vokinger; Tim Finkenstaedt; Georg Osterhoff; Michael A Fischer; Roman Guggenberger
Journal:  Eur Radiol       Date:  2018-12-05       Impact factor: 5.315

2.  Regional analysis of age-related local bone loss in the spine of a healthy population using 3D voxel-based modeling.

Authors:  Alexander Valentinitsch; Stefano Trebeschi; Eva Alarcón; Thomas Baum; Johannes Kaesmacher; Claus Zimmer; Cristian Lorenz; Jan S Kirschke
Journal:  Bone       Date:  2017-07-14       Impact factor: 4.398

Review 3.  Axial QCT: clinical applications and new developments.

Authors:  Thomas M Link; Thomas F Lang
Journal:  J Clin Densitom       Date:  2014-05-28       Impact factor: 2.617

4.  Correlation of texture analysis of paraspinal musculature on MRI with different clinical endpoints: Lumbar Stenosis Outcome Study (LSOS).

Authors:  Manoj Mannil; Jakob M Burgstaller; Ulrike Held; Mazda Farshad; Roman Guggenberger
Journal:  Eur Radiol       Date:  2018-06-14       Impact factor: 5.315

Review 5.  Osteoporosis imaging: state of the art and advanced imaging.

Authors:  Thomas M Link
Journal:  Radiology       Date:  2012-04       Impact factor: 11.105

Review 6.  X-ray-based quantitative osteoporosis imaging at the spine.

Authors:  M T Löffler; N Sollmann; K Mei; A Valentinitsch; P B Noël; J S Kirschke; T Baum
Journal:  Osteoporos Int       Date:  2019-11-14       Impact factor: 4.507

Review 7.  CT Texture Analysis: Definitions, Applications, Biologic Correlates, and Challenges.

Authors:  Meghan G Lubner; Andrew D Smith; Kumar Sandrasegaran; Dushyant V Sahani; Perry J Pickhardt
Journal:  Radiographics       Date:  2017 Sep-Oct       Impact factor: 5.333

8.  Bone Mineral Density Estimations From Routine Multidetector Computed Tomography: A Comparative Study of Contrast and Calibration Effects.

Authors:  Johannes Kaesmacher; Hans Liebl; Thomas Baum; Jan Stefan Kirschke
Journal:  J Comput Assist Tomogr       Date:  2017 Mar/Apr       Impact factor: 1.826

9.  Assessment of tumor heterogeneity: an emerging imaging tool for clinical practice?

Authors:  Fergus Davnall; Connie S P Yip; Gunnar Ljungqvist; Mariyah Selmi; Francesca Ng; Bal Sanghera; Balaji Ganeshan; Kenneth A Miles; Gary J Cook; Vicky Goh
Journal:  Insights Imaging       Date:  2012-10-24

10.  A Vertebral Segmentation Dataset with Fracture Grading.

Authors:  Maximilian T Löffler; Anjany Sekuboyina; Alina Jacob; Anna-Lena Grau; Andreas Scharr; Malek El Husseini; Mareike Kallweit; Claus Zimmer; Thomas Baum; Jan S Kirschke
Journal:  Radiol Artif Intell       Date:  2020-07-29
View more
  1 in total

1.  Development and validation of multivariate models integrating preoperative clinicopathological and radiographic findings to predict HER2 status in gastric cancer.

Authors:  Mengying Xu; Song Liu; Lin Li; Xiangmei Qiao; Changfeng Ji; Lingyu Tan; Zhengyang Zhou
Journal:  Sci Rep       Date:  2022-08-19       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.