Jessica M Bugeja1,2, Ying Xia2, Shekhar S Chandra1, Nicholas J Murphy3,4, Jillian Eyles3,5, Libby Spiers6, Stuart Crozier1, David J Hunter3,5, Jurgen Fripp2, Craig Engstrom7. 1. School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia. 2. Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation, Health and Biosecurity, Herston, Australia. 3. Kolling Institute of Medical Research, Institute of Bone and Joint Research, University of Sydney, Sydney, Australia. 4. Department of Orthopaedic Surgery, John Hunter Hospital, Newcastle, Australia. 5. Department of Rheumatology, Royal North Shore Hospital, St Leonards, Australia. 6. Centre for Health, Exercise and Sports Medicine, Department of Physiotherapy, University of Melbourne, Melbourne, Australia. 7. School of Human Movement Studies, The University of Queensland, Brisbane, Australia.
Abstract
Background: Femoroacetabular impingement (FAI) cam morphology is routinely assessed using manual measurements of two-dimensional (2D) alpha angles which are prone to high rater variability and do not provide direct three-dimensional (3D) data on these osseous formations. We present CamMorph, a fully automated 3D pipeline for segmentation, statistical shape assessment and measurement of cam volume, surface area and height from clinical magnetic resonance (MR) images of the hip in FAI patients. Methods: The novel CamMorph pipeline involves two components: (I) accurate proximal femur segmentation generated by combining the 3D U-net to identify both global (region) and local (edge) features in clinical MR images and focused shape modelling to generate a 3D anatomical model for creating patient-specific proximal femur models; (II) patient-specific anatomical information from 3D focused shape modelling to simulate 'healthy' femoral bone models with cam-affected region constraints applied to the anterosuperior femoral head-neck region to quantify cam morphology in FAI patients. The CamMorph pipeline, which generates patient-specific data within 5 min, was used to analyse multi-site clinical MR images of the hip to measure and assess cam morphology in male (n=56) and female (n=41) FAI patients. Results: There was excellent agreement between manual and CamMorph segmentations of the proximal femur as demonstrated by the mean Dice similarity index (DSI; 0.964±0.006), 95% Hausdorff distance (HD; 2.123±0.876 mm) and average surface distance (ASD; 0.539±0.189 mm) values. Compared to female FAI patients, male patients had a significantly larger median cam volume (969.22 vs. 272.97 mm3, U=240.0, P<0.001), mean surface area [657.36 vs. 306.93 mm2, t(95)=8.79, P<0.001], median maximum-height (3.66 vs. 2.15 mm, U=407.0, P<0.001) and median average-height (1.70 vs. 0.86 mm, U=380.0, P<0.001). Conclusions: The fully automated 3D CamMorph pipeline developed in the present study successfully segmented and measured cam morphology from clinical MR images of the hip in male and female patients with differing FAI severity and pathoanatomical characteristics. 2022 Quantitative Imaging in Medicine and Surgery. All rights reserved.
Background: Femoroacetabular impingement (FAI) cam morphology is routinely assessed using manual measurements of two-dimensional (2D) alpha angles which are prone to high rater variability and do not provide direct three-dimensional (3D) data on these osseous formations. We present CamMorph, a fully automated 3D pipeline for segmentation, statistical shape assessment and measurement of cam volume, surface area and height from clinical magnetic resonance (MR) images of the hip in FAI patients. Methods: The novel CamMorph pipeline involves two components: (I) accurate proximal femur segmentation generated by combining the 3D U-net to identify both global (region) and local (edge) features in clinical MR images and focused shape modelling to generate a 3D anatomical model for creating patient-specific proximal femur models; (II) patient-specific anatomical information from 3D focused shape modelling to simulate 'healthy' femoral bone models with cam-affected region constraints applied to the anterosuperior femoral head-neck region to quantify cam morphology in FAI patients. The CamMorph pipeline, which generates patient-specific data within 5 min, was used to analyse multi-site clinical MR images of the hip to measure and assess cam morphology in male (n=56) and female (n=41) FAI patients. Results: There was excellent agreement between manual and CamMorph segmentations of the proximal femur as demonstrated by the mean Dice similarity index (DSI; 0.964±0.006), 95% Hausdorff distance (HD; 2.123±0.876 mm) and average surface distance (ASD; 0.539±0.189 mm) values. Compared to female FAI patients, male patients had a significantly larger median cam volume (969.22 vs. 272.97 mm3, U=240.0, P<0.001), mean surface area [657.36 vs. 306.93 mm2, t(95)=8.79, P<0.001], median maximum-height (3.66 vs. 2.15 mm, U=407.0, P<0.001) and median average-height (1.70 vs. 0.86 mm, U=380.0, P<0.001). Conclusions: The fully automated 3D CamMorph pipeline developed in the present study successfully segmented and measured cam morphology from clinical MR images of the hip in male and female patients with differing FAI severity and pathoanatomical characteristics. 2022 Quantitative Imaging in Medicine and Surgery. All rights reserved.
Entities:
Keywords:
Hip joint; cam-type femoroacetabular impingement syndrome (cam-type FAI syndrome); magnetic resonance imaging (MRI); statistical shape modelling
Cam-type femoroacetabular impingement (FAI) syndrome is characterised by an abnormal asphericity of the femoral head which predisposes to impingement at the anterosuperior acetabular rim and is frequently associated with pain during manoeuvres involving internal hip rotation (1,2). In clinical investigations, cam morphology is commonly assessed using techniques including computerised tomography (CT) or magnetic resonance imaging (MRI) whereby a planar projection, from anterosuperior and/or anterior slices of the femoral head-neck region, is used to calculate a two-dimensional (2D) alpha angle to evaluate femoral head asphericity in patients with suspected FAI (3,4). An alpha angle of greater than 55° is routinely considered indicative of a cam formation although higher thresholds have been proposed with respect to increased specificity of symptomatic FAI (5-7). Currently, the alpha angle is the most commonly used index of cam morphology in hip arthroscopic surgery (8), although it has moderate to poor inter- and intra-reader reliabilities (9) and these 2D measurements do not provide direct geometric data on cam dimensions (e.g., volume, surface area, height) for evaluating the specific morphometric characteristics of these osseous formations.The development of a fully automated three-dimensional (3D) approach for the assessment of cam morphology, particularly from clinical magnetic resonance (MR) images, to provide fast, robust analyses using deep learning methods has the potential to improve pre- and post-intervention quantitative assessments in patients with FAI (10). We present CamMorph, a fully automated 3D pipeline developed to extract cam morphology from clinical MR images of the hip in FAI patients. The CamMorph pipeline, which generates patient-specific data within 5 min, was used to analyse clinical MR images of the hip to examine cam morphology in FAI patients from the multi-site Australian FASHIoN trial (11).CamMorph addresses the challenging task of automated segmentation and measurement of cam morphology from clinical 3D MR images in FAI patients acquired on different MR systems and exhibiting varying imaging aspects such as proximal femur coverage, image artefacts and contrast characteristics. In the present study, CamMorph was used to extract cam morphology for volume, surface area and height calculations for a sexual dimorphism comparison of cam morphology between male and female patients with differing FAI severity and pathoanatomical characteristics. We present the following article in accordance with the MDAR reporting checklist (available at https://qims.amegroups.com/article/view/10.21037/qims-22-332/rc).
Methods
The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). Three datasets of hip MR images were used in the current study. Dataset A (12) and B (13) were used in model training and algorithm development for an automated 3D cam segmentation and assessment pipeline (CamMorph). Dataset C (11), consisting of a clinical MR dataset collected in the Australian FASHIoN trial, was used for dedicated measurement of cam volume, surface area and height in FAI patients using the developed automated 3D CamMorph pipeline. In the present study, the CamMorph measurements of cam volume, surface area, maximum- and average-height were compared between male and female patients.
Training and algorithm development in Datasets A and B
Dataset A, containing 3D true fast imaging with steady-state precession (true-FISP) unilateral hip images with manual bone labels of the proximal femur and acetabulum from 56 asymptomatic volunteers (12), was used for the training of 3D U-net networks to segment the proximal femur volume in the present work.Dataset B, containing bone surfaces of the proximal femur reconstructed from bilateral water-excited 3D double echo steady state (DESS) hip MR images (13), was used for focused shape model (FSM) (14) creation in the present work with a-priori focus on the femoral head-neck area to enable targeted shape assessment of cam morphology in this clinical region of interest (ROI) for proximal femur segmentation and simulation of a ‘healthy’ femur bone model through cam morphology relaxation (discussed in later sections).
CamMorph application in Dataset C (Australian FASHIoN MR Dataset)
Dataset C, consisting of baseline MR examinations of the affected hip joint in 97 patients with FAI (56 males, 41 females, aged 16–63 years) from the Australian FASHIoN trial (11), was used for the dedicated measurement of cam volume, surface area and height using the developed automated 3D CamMorph pipeline. All patients recruited for the FASHIoN trial presented with hip pain (assessed using the short form-12, international hip outcome tool and hip osteoarthritis outcome score), radiographic signs of cam and/or pincer morphology (alpha angle of >55° and/or lateral centre edge angle >40° or further radiographic pincer morphology signs such as a positive cross-over sign) and had a surgical opinion that the patient would benefit from arthroscopic hip surgery. For further information on patient recruitment, treatment groups, inclusion and exclusion criteria refer to Murphy et al. (11). The baseline MR images in the multi-site FASHIoN study were acquired on various clinical 3T systems (Siemens Prisma, Siemens Skyra, Phillips Ingenia) with a 3D T2-weighted true-FISP sequence [repetition time: 10.2 ms, echo time: 4.3 ms, weighted average image spacing (voxel size varied in this multi-site study): 0.644×0.644×0.668 mm3, field of view (FOV): 16×16 cm, matrix size: 256×256]. The true-FISP sequence, as used in the Australian FASHIoN trial, can be used for evaluations of hip joint cartilage health and comparisons between MR and X-ray features using the Hip Osteoarthritis MR imaging Scoring System and Hip2Norm software (11).
Proximal femur segmentation analyses
Manual segmentations of the proximal femur from MR examinations of randomly selected patients (n=44) in the FASHIoN study (Dataset C) were used to assess the automated segmentations obtained from a 3D U-net only and a 3D U-net + FSM approach as used in the final CamMorph pipeline. The manual segmentations of the proximal femur (femoral head, neck and proximal shaft) were performed by JMB under expert guidance from an analyst with 20 years of experience (CE). CE reviewed each manual segmentation for quality assurance purposes. These manual segmentations were compared to the automated segmentations using Dice similarity index (DSI), 95% Hausdorff distance (HD) and average surface distance (ASD) values. The equations for these comparative measures are listed below, where A denotes the set of automatically segmented voxels and M denotes the set of manually segmented voxels:Where h(A, M) is the directed HD and is given by:The 95% HD was used in this study, rather than the maximum HD {refer to Eq. [2]} based on the sensitivity of this latter measurement to outliers (15).Where d(A, M) is the directed average HD and is given by:
CamMorph pipeline
provides an overview of the two central components involved in the development of the automated CamMorph segmentation and volumetric analysis pipeline including: component 1 consisting of an initial proximal femur bone 3D U-net segmentation computed on unilateral 3D true-FISP MR images () and component 2 consisting of cam morphology segmentation for calculation of cam volume, surface area and height. The 3D U-net segmentation approach, used for automated bone segmentation of the proximal femur as per component 1 (), was modified from Çiçek et al. (16). The calculation of cam volume, surface area and height, as per component 2 (), starts with a femur bone fitting to the U-net segmentation using femoral bone initialization as per Nishii et al. (17), refined with FSM and atlas-based image registration (). Subsequently, a simulated ‘healthy’ bone model is generated for each individual patient using FSM methods focused on the femoral head-neck region (see the left image in where red indicates the ROI for femoral head-neck relaxation and blue is outside the ROI for relaxation). Here, ‘relaxation’ refers to the process by which the FSM reconstructs the cam-affected femur bone within the femoral head and neck region without any cam morphology knowledge. As a result, any existing cam morphology is relaxed to obtain a simulated ‘healthy’ femur bone (i.e., cam morphology is removed). After these steps, the position and size of the cam morphology is determined using the positive distance from the 3D simulated ‘healthy’ bone surface to the 3D cam-affected bone surface in the anterior and anterosuperior quadrant of the femoral head-neck region (). The right image in displays a 3D femoral bone model with signed distance map scalars (units in mm) representative of the distance between the cam-affected and simulated ‘healthy’ bone surface. In , the distance scalars outside the cam region were thresholded to a value of 0.0 mm for visualization purposes. The left image in shows the cam region defined within this study (where red indicates the ROI and blue is outside this ROI). shows a flow chart indicating that only vertices with a positive distance from the simulated ‘healthy’ bone (in the cam region) were included in cam morphology calculations.
Figure 1
The CamMorph pipeline involving segmentation of the proximal femur and cam morphology as used in the analysis of MR images from the Australian FASHIoN study (Dataset C). (A) Initial 3D true-FISP MR image. (B) U-net femur bone segmentation. (C-F) Cam morphology segmentation for calculation of cam volume, surface area and height. 3D, three-dimensional; true-FISP, true fast imaging with steady-state precession; MR, magnetic resonance; FSM, focused shape model; FHNJ, femoral head-neck junction.
The CamMorph pipeline involving segmentation of the proximal femur and cam morphology as used in the analysis of MR images from the Australian FASHIoN study (Dataset C). (A) Initial 3D true-FISP MR image. (B) U-net femur bone segmentation. (C-F) Cam morphology segmentation for calculation of cam volume, surface area and height. 3D, three-dimensional; true-FISP, true fast imaging with steady-state precession; MR, magnetic resonance; FSM, focused shape model; FHNJ, femoral head-neck junction.
Initial automated 3D U-net based segmentation of the proximal femur
The MR images from Dataset A (12) were pre-processed with BSpline image resampling (0.5 mm isotropic), joint locator-based ROI extraction, denoising and N4 bias field correction. An isotropic resampling of 0.5 mm was chosen based on the high-resolution (0.23×0.23×0.46 mm3) of the unilateral true-FISP images within Dataset A. Subsequently, the 3D U-net networks (see below) were trained on Dataset A for proximal femur volume segmentation.
3D U-net architecture
The network architecture used in the present work was based on a 3D U-net consisting of a down-sampling path and a symmetric up-sampling modified from its original form (16) for the hip joint (). The down-sampling path consists of 5 convolutional blocks, which increases the number of feature channels from 1 to 128. Each down-sampling block has two convolutional layers (kernel size of 3×3×3, stride of 1×1×1) and a Maxpooling layer (pool size of 2×2×2). The fifth block without the Maxpooling layer acts as a bridge to connect both paths. The up-sampling path consists of 4 convolutional blocks, each of which starts with a deconvolutional layer (kernel size of 3×3×3, stride of 2×2×2) and a concatenation with the feature maps corresponding to the down-sampling path. Two convolutional layers are used in each up-sampling block which halve the number of feature channels. All convolutional layers in both down-sampling and up-sampling paths use batch normalization, rectified linear unit activation and ‘same’ padding (to ensure the output image size is the same as the input image size). In the last layer of the up-sampling path, a 1×1×1 convolution with the Softmax activation function is used to map the multi-channel feature maps to the number of label classes N (N=3; background, femur, acetabulum).
Figure 2
U-net architecture. Each green box corresponds to a multi-channel feature map. The number of channels is denoted above or below the box (in blue text). The X-Y-Z-size is provided at the lower left or right edge of the box (in black text). White boxes represent copied feature maps. The grey arrows denote concatenate operations, orange boxes correspond to up-convolution operations, yellow boxes denote max pool operations and brown boxes represent a multiple operations step of convolution, batch normalization and a rectified linear unit. MRI, magnetic resonance image; N, number of output label classes (i.e., background, femur, acetabulum); up-conv, up-convolution operation; max pool, max pool operation; concatenate, concatenate operation; conv, convolution operation; BatchNorm, batch normalization operation; ReLU, rectified linear unit.
U-net architecture. Each green box corresponds to a multi-channel feature map. The number of channels is denoted above or below the box (in blue text). The X-Y-Z-size is provided at the lower left or right edge of the box (in black text). White boxes represent copied feature maps. The grey arrows denote concatenate operations, orange boxes correspond to up-convolution operations, yellow boxes denote max pool operations and brown boxes represent a multiple operations step of convolution, batch normalization and a rectified linear unit. MRI, magnetic resonance image; N, number of output label classes (i.e., background, femur, acetabulum); up-conv, up-convolution operation; max pool, max pool operation; concatenate, concatenate operation; conv, convolution operation; BatchNorm, batch normalization operation; ReLU, rectified linear unit.
Training and segmentation
All images with manual labels from Dataset A were pre-processed and cropped into an image size of 256×256×128. A 7-fold cross-validation procedure was used where the entire dataset was divided into 7 random subsets. In this procedure, 7 independent training iterations (with 3 repeats) were executed in parallel where each time a different subset of images was used for testing and the other 6 subsets of images for training. For each training iteration, 3D data augmentation (including affine transformation and elastic deformation) was applied to increase the diversity of the training data. Additionally, the Mixup technique was used to reduce the likelihood of overfitting and further improve the network performance (18). During the training process, we used the DSI as the loss function and the adaptive moment estimator (Adam) for parameter optimization (learning rate =0.0001). The entire training procedure resulted in an ensemble of 21 (7×3 repeats) 3D U-net networks for segmentation of the femur and acetabulum of the hip joint.
Segmentation and analysis of Dataset C
All the trained 3D U-net networks were used to automatically segment the bones of the hip joint from the clinical MR images in Dataset C (the FASHIoN dataset), which were independent from the MR data used in the U-net training phase (Dataset A) and FSM generation (Dataset B). Briefly, to segment a new hip MR image, trained 3D U-nets were used to generate a series of predictions for the femoral and acetabular bones from the pre-processed image. The final segmentation was obtained by averaging the multiple predicted labels and selecting the predicted label with a probability of more than 0.5. In the present study, only the femur was used in further processing. The subsequent application of the FSM, which simultaneously utilizes image features and priors (shape and appearance) with a distributed weighting identifying the most clinically important areas, improves the U-net segmentations within areas of interest, while conforming to anatomical constraints.
Cam segmentation and measurement pipeline
The following section describes the steps for FSM and atlas-based femur bone model initialization, the generation of a simulated ‘healthy’ bone model and the identification of cam morphology for calculating cam volume, surface area and height data from Dataset C.
FSM femur bone model initialization
Initially, a rough fitting of a 3D proximal femur bone model to the MR image was completed based on the centre of the femoral head (17), followed by FSM-based surface refinement (14) using the U-net segmentations (). The femoral head-neck ROI on the femur models was defined as shown by the red shading in . The DSI values of the U-net + FSM compared to manual segmentations were used to identify an atlas set to optimise the fitting performance of the U-net + FSM segmentation. For cases with DSI <0.95, the initial fitting models were improved using atlas-based registration with the atlas set consisting of cases with DSI ≥0.95 fitting accuracy.
Healthy femur bone generation
A simulated ‘healthy’ femur bone model was generated for each patient () using a FSM (14) focusing on the femoral head-neck region (see left image in ). This FSM was created using the training surfaces from Dataset B, as per Xia et al.’s work (13). A total of three eigenmodes accounting for 92% of shape variations were visually inspected using the simple medical imaging library interface (SMILI) (19) and used in an effort to exclude any eigenmodes associated with cam morphology. This visual inspection consisted of a review of all shape variations within the first three eigenmodes within the anterior and anterosuperior femoral head-neck region. It was confirmed that these eigenmodes did not contain any cam morphology knowledge. Therefore, a FSM reconstruction of the cam-affected femur bone model within the femoral head and neck region simulated a ‘healthy’ femur bone (i.e., cam morphology is removed). This visual inspection was completed during the algorithm development of CamMorph prior to the initialization of the automated CamMorph pipeline. The CamMorph pipeline does not require this visual inspection when applied to an FAI dataset.
Constrained cam extent and location determination for cam morphology analysis
Distance-based methods with regional constraints were used to identify the location and quantify the size characteristics of cam morphology. A signed distance map was calculated between the reconstructed ‘healthy’ proximal femur and the cam-affected femur surface, using the Visualization Toolkit (VTK) (20) (). The distance map scalars were overlayed onto the femoral surface using SMILI (19).The anterior and anterosuperior femoral head-neck region (typical location of cam formations) was manually contoured onto the femoral surface (see left image in ) using SMILI (19) prior to the initialization of the automatic CamMorph pipeline. All cases utilized the same contour within the presented workflow. If a vertex on the femoral model was identified to be within this contoured region and the respective femur surface vertex on the distance model had a value of more than 0, the vertex was included as part of a cam morphology formation (). The cam volume and surface area were calculated using VTK (20) and SMILI (19). The cam height data was obtained at each vertex on the femoral surface using the previously calculated distance map [using a signed distance map from VTK (20)], between the simulated ‘healthy’ femur bone surface and the cam-affected femur surface within the cam ROI. The cam maximum- and average-height were calculated from the set of cam height values obtained for each patient. The cam volume, surface area, maximum- and average-height were used for the primary data comparison between male and female FAI patients in the present study.
Femoral head volume analyses
In a secondary data analysis, values for femoral head volume (see below) were compared between the male and female FAI patients (Dataset C) to assess previously reported sexual dimorphism for this anatomical measure (21-23) and to determine cam volume as a percentage of femoral head volume. The calculation of femoral head volume involved initial resampling of the MR images to be isotropic. Estimations of the femoral head centre and radius were obtained in 3D using the circle Hough transform as per Nishii et al. (17). Subsequently, a refined segmentation of the femoral head used the U-net segmentation to only include voxels within both the U-net and circle Hough transform-based segmentation. This refined segmentation was required to define the boundary between the femoral head and neck within the U-net segmentation for subsequent femoral head volume calculation.
Statistical analyses
For both the primary data (cam volume, surface area, maximum- and average-height) and secondary data analyses (femoral head volume, cam volume: femoral head volume percentage), independent t-tests or Mann-Whitney U rank tests were used for comparisons between the male and female FAI patients. The Shapiro-Wilk test was performed to check the normality of the data and Mann-Whitney U rank tests were used if the data did not have a normal distribution. The Levene test was used to check for homogeneity of variances and Welch’s procedure was applied when the sample variances were not equal. The Pearson correlation coefficient was used to assess the association between cam volume and other cam metrics (surface area, maximum- and average-height) in the male and female patients. Statistical significance was accepted at P<0.05 and analyses were performed using a python package, SciPy (24).
Results
Manual and automated proximal femur segmentation
shows boxplots for the DSI (), 95% HD () and ASD values () for the manual and automated bone segmentations of the proximal femur volume obtained from Dataset C. Overall, the 3D U-net + FSM approach (as implemented in the CamMorph pipeline) had an enhanced performance compared to the U-net alone having better mean scores and decreased spread for the DSI (U-net =0.958±0.017; U-net + FSM =0.964±0.006), 95% HD (U-net =2.717±2.089 mm; U-net + FSM =2.123±0.876 mm) and ASD (U-net =0.577±0.289 mm; U-net + FSM =0.539±0.189 mm) values.
Figure 3
Boxplots of the evaluation metrics used in this study for the manual and automated segmentations of the proximal femur [U-net alone and U-net + FSM (CamMorph)]. (A) DSI values. (B) 95% HD values. (C) ASD values. The boxplot centreline marks the median value. The inset coronal magnetic resonance images show the relatively poorer image quality for outlier cases with the automated segmentation contour overlayed. Case 1 shows nonconforming image contrast characteristics, both Cases 1 and 2 have motion/image artefacts (Case 1 white arrow in sagittal image indicates artefacts), Cases 2 and 3 show comparatively poor bone cartilage interface contrast (white arrows) and Case 4 has, like all the outlier cases shown, incomplete coverage of the femoral neck-greater trochanter region, and a prominent foveal region with surrounding bright synovial fluid which caused difficulty during the FSM fitting. DSI, Dice similarity index; HD, Hausdorff distance; ASD, average surface distance; FSM, focused shape model.
Boxplots of the evaluation metrics used in this study for the manual and automated segmentations of the proximal femur [U-net alone and U-net + FSM (CamMorph)]. (A) DSI values. (B) 95% HD values. (C) ASD values. The boxplot centreline marks the median value. The inset coronal magnetic resonance images show the relatively poorer image quality for outlier cases with the automated segmentation contour overlayed. Case 1 shows nonconforming image contrast characteristics, both Cases 1 and 2 have motion/image artefacts (Case 1 white arrow in sagittal image indicates artefacts), Cases 2 and 3 show comparatively poor bone cartilage interface contrast (white arrows) and Case 4 has, like all the outlier cases shown, incomplete coverage of the femoral neck-greater trochanter region, and a prominent foveal region with surrounding bright synovial fluid which caused difficulty during the FSM fitting. DSI, Dice similarity index; HD, Hausdorff distance; ASD, average surface distance; FSM, focused shape model.Each proximal femur model from the 3D U-net + FSM segmentation was analysed separately using the 95% HD value. The mean and standard deviation HD between the U-net + FSM and manual segmentations are shown in , respectively. Overall, the U-net + FSM result within the femoral head-neck region of the femur, of primary interest in the current study, was in very strong agreement with the manual segmentation. Notably, the weighted average image spacing within this study was 0.644×0.644×0.668 mm, where on average the highest surface distance accuracy attainable within this study is 0.644 mm (the smallest average image spacing dimension). Observations from indicate that the 95% HD values within the femoral head-neck region were below 0.65 mm in both the mean and standard deviation visualizations, respectively. These results indicate the high surface-based accuracy of the CamMorph pipeline which was robust to the spectrum of pathology and image-based issues within Dataset C. In the current study, the greater trochanter and the included shaft region of the femur had larger HD values, however these are not clinically important areas for cam morphology assessment.
Figure 4
The 95% HD between the U-net + FSM and the manual segmentations of the proximal femur. (A) Mean 95% HD. (B) Standard deviation 95% HD. The HD scalar bar (rainbow colour map on right side of the figure) has units in mm. The greater trochanter region and femur shaft region were given a value of zero for visualization purposes. HD, Hausdorff distance; FSM, focused shape model.
The 95% HD between the U-net + FSM and the manual segmentations of the proximal femur. (A) Mean 95% HD. (B) Standard deviation 95% HD. The HD scalar bar (rainbow colour map on right side of the figure) has units in mm. The greater trochanter region and femur shaft region were given a value of zero for visualization purposes. HD, Hausdorff distance; FSM, focused shape model.
Cam volume, surface area and height
displays boxplots, and provides a summary of the mean and standard deviation data, for the cam volume, surface area, maximum- and average-height measurements for the male and female patients from Dataset C. Compared to females, males had a significantly larger median cam volume (969.22 vs. 272.97 mm3, P<0.001), mean surface area (657.36 vs. 306.93 mm2, P<0.001) median maximum-height (3.66 vs. 2.15 mm, P<0.001) and median average-height (1.70 vs. 0.86 mm, P<0.001). In the male and female FAI patients, cam volume was classified based on quartile data (), as negligible (≤Q1), mild (>Q1 and ≤Q2), moderate (>Q2 and ≤Q3), or major (>Q3).
Figure 5
Boxplots of cam size metrics for female and male patients. (A) Cam volume. (B) Cam surface area. (C) Cam maximum-height. (D) Cam average-height (D). The boxplot centreline marks the median value.
Table 1
Mean and standard deviation data for cam volume, surface area, maximum- and average-height in the male and female FAI patients
Measurements
Male patients
Female patients
P value
Volume (mm3)
1,136.87±659.83
337.86±279.95
<0.001
Surface area (mm2)
657.36±203.04
306.93±175.61
<0.001
Maximum-height (mm)
3.89±1.51
2.23±1.09
<0.001
Average-height (mm)
1.94±0.86
1.00±0.57
<0.001
FAI, femoroacetabular impingement.
Table 2
Cam volume quartiles used for classifying cam volume in the male and female FAI patients
Quartile
Male patients
Female patients
Q1
657.38 mm3
111.06 mm3
Q2
969.22 mm3
272.97 mm3
Q3
1,466.51 mm3
497.93 mm3
FAI, femoroacetabular impingement.
Boxplots of cam size metrics for female and male patients. (A) Cam volume. (B) Cam surface area. (C) Cam maximum-height. (D) Cam average-height (D). The boxplot centreline marks the median value.FAI, femoroacetabular impingement.FAI, femoroacetabular impingement.For the male and female FAI patients, there were strong postitive correlations betweeen cam volume and surface area (), cam volume and maximum-height () and cam volume and average-height ().
Figure 6
Scatterplots and regression line analysis of cam size metrics for male and female patients. (A,B) Cam volume vs. surface area. (C,D) Cam volume vs. maximum-height. (E,F) Cam volume vs. average-height. The left column displays male results, and the right column displays female results.
Scatterplots and regression line analysis of cam size metrics for male and female patients. (A,B) Cam volume vs. surface area. (C,D) Cam volume vs. maximum-height. (E,F) Cam volume vs. average-height. The left column displays male results, and the right column displays female results.shows 3D models of the proximal femur which were generated in SMILI (19) to show the cam profiles in individual female and male patients classified as having a negligible (), mild (), moderate () or major () cam volume. Inspection of the cam profiles allows for comparative visual analysis within and between the classification categories; for example, shows the cam profile in a female patient classified with a moderate cam volume where the cam surface area profile is smaller than that in a female patient classified with a mild cam volume ().
Figure 7
3D proximal femoral bone models for visualization of cam profiles in individual female and male patients with different cam volume classifications. (A,E) Negligible cam volumes. (B,F) Mild cam volumes. (C,G) Moderate cam volumes. (D,H) Major cam volumes. The top row displays female results, and the bottom row displays male results. A separate scalar bar (rainbow colour map) for females and males provides cam height in mm. 3D, three-dimensional.
3D proximal femoral bone models for visualization of cam profiles in individual female and male patients with different cam volume classifications. (A,E) Negligible cam volumes. (B,F) Mild cam volumes. (C,G) Moderate cam volumes. (D,H) Major cam volumes. The top row displays female results, and the bottom row displays male results. A separate scalar bar (rainbow colour map) for females and males provides cam height in mm. 3D, three-dimensional.
Femoral head volume
displays boxplots of the femoral head volumes for the male and female FAI patients from Dataset C. Compared to female patients, male patients had a significantly larger mean femoral head volume [66.12 vs. 46.02 cm3, t(95)=13.21, P<0.001]. shows boxplots of the cam volume to femoral head volume, in percentage terms, for the male and female FAI patients. The median cam volume to femoral head volume percentage for the male patients was significantly larger than the female patients (1.47 vs. 0.59, U=389.0, P<0.001).
Figure 8
Boxplots of femoral head volumes in female and male patients. The boxplot centreline marks the median value.
Figure 9
Boxplots of the cam volume to femoral head volume percentage in female and male patients. The boxplot centreline marks the median value.
Boxplots of femoral head volumes in female and male patients. The boxplot centreline marks the median value.Boxplots of the cam volume to femoral head volume percentage in female and male patients. The boxplot centreline marks the median value.
Discussion
The present study involved the successful development of a novel, fully automated 3D cam morphology extraction pipeline which provided patient-specific, direct measures of cam volume, surface area and height from multi-site clinical MR images in male and female patients with FAI. The data calculated from the CamMorph pipeline showed there was significantly greater cam volume, surface area, maximum- and average-height in male, compared to female, FAI patients. In conjunction with the quantitative analyses of cam morphology provided by the CamMorph pipeline, 3D bone models of the proximal femur were generated from the automated segmentations of the MR images from individual patients allowing visualization of differences in cam morphology within and between male and female FAI patients classified with a negligible, mild, moderate or major cam volume. Overall, the developed CamMorph pipeline combined the substantial abilities of the 3D U-net to localize both global (region) and local (edge) features in MR images and a FSM to generate an anatomical model for creating patient-specific models of the proximal femur to provide a robust automated approach for extracting cam morphology and measuring cam volume, surface area and height from clinical 3D MR images in patients with differing FAI severity and pathoanatomical characteristics. The calculation of cam volume, surface area and height data by the fully automated CamMorph pipeline was achieved in under 5 min which is well suited for potential application in clinical environments.
Proximal femur segmentation
The high degree of agreement between the manual and automated CamMorph segmentations of proximal femur volume (mean DSI of 0.964) in the present work compares very favourably with previous studies on automated segmentation of the femur from MR images (25-29) [see Table S1 for specific data (14,26,30-36)]. Overall, the combined 3D U-net + FSM approach used in the CamMorph pipeline resulted in a decreased spread of the DSI values (along with an overall increase in the mean value) compared with the U-net alone segmentations of the proximal femur volume. Enhanced performance of the CamMorph pipeline compared with the U-net alone approach was similarly found for the 95% HD and ASD data highlighting the increased efficacy of the combined U-net + FSM approach for the segmentation of the proximal femur in the clinical MR images obtained from FAI patients in the Australian FASHIoN study. Notably, the high surface-based accuracy for segmentation of the proximal femur in the clinical MR images by the automated CamMorph pipeline was achieved in the presence of variable motion artefacts, contrast differences and FOV issues (insufficiencies) within the multi-site FASHIoN dataset. For the femoral head-neck region specifically, the majority of the mean 95% HD values between the manual and automated CamMorph segmentations were between 0 to 0.65 mm highlighting the robustness of the combined U-net + FSM approach for targeted segmentation of bone in this clinical ROI in the MR images of the male and female FAI patients.The automated CamMorph pipeline presented in the current paper successfully integrated patient-specific healthy models derived from individualised cam-affected models of the proximal femur for the calculation of cam volume, surface area and height data. Previously, Kang et al. (10) proposed calculation of cam morphology with reference to healthy bone models although they used an average model derived from 4 separate femur models from asymptomatic subjects obtained through manual segmentation whereas we used patient-specific healthy models derived from the automated analyses of FAI patients’ 3D MR images. The use of the automated 3D CamMorph pipeline to measure cam morphology, as opposed to manual (10) and semi-automated (9,23,37-40) approaches, offers a fast (under 5 min), robust method which can be run in the background post-MR acquisition to obtain patient-specific data well suited for routine clinical examinations, as well as for larger-scale clinical trials such as the Australian FASHIoN study.In the present analysis of the FASHIoN MR dataset, the median cam volume in male patients (0.97 cm3) was significantly greater than in the female patients (0.27 cm3). Recently, Guidetti et al. performed arthroscopic removal of cam bone mass from the hemipelvis of 7 cadavers (sex not specified) and reported that the surgically excised cam volumes, as determined from semi-automated MR measurements, ranged between 0.25 to 1.75 cm3 (41) which is in good general agreement with our range of cam volumes determined in the male (0 to 3.0 cm3) and female patients (0 to 1.16 cm3) using the CamMorph pipeline (Table S2).Previous CT studies (9,23,37) have reported larger mean cam volumes (e.g., 4.3 cm3 in females, 6.7 cm3 in males, 4.6 to 7.96 cm3 in mixed sex cohorts) from segmentations using a semi-automated region-growing technique based on density values without any reference to the 3D shape profile of the femoral head-neck region. Whilst the substantial differences between the MR derived [current study, Guidetti et al. (41)] and CT derived cam volumes (9,23,37) may be due to real differences in the severity of the cam lesions in these study cohorts, it may relate to an over-estimation of cam volume in the CT studies due to the region-growing approach systematically over-segmenting areas within the lower bone density supero-lateral quadrant of the femoral head (42). An over-segmentation of cam volume in the above CT studies could account for scenarios such as the relative volume of a cam bone mass (11.1 cm3) being 29% of that compared to femoral head volume (39.1 cm3) as reported by Schauwecker et al. (37). In the most extreme cases in the present MR study, the relative cam to femoral head volume was below 4% for the male and female patients with FAI ( and Figure S1).The greater cam volume in males compared to females observed in the present study is consistent with the overall findings from previous CT studies (23,40). Currently, understanding of this sex-based difference in cam volume is incomplete but it may be associated with factors such as males generally having greater absolute body size, higher magnitude loading of the hip joint or specific sex-related dimorphism in the anatomical and alignment characteristics of the femur and acetabulum. Studies have also reported that cam morphology is typically more prevalent in males with FAI whereas pincer morphology of the acetabulum is observed relatively more in female patients although the presence of both cam and pincer morphology, the so called mixed-type FAI, is also common in both male and female patients (43). In the Australian FASHIoN study, patients were included on the basis of hip pain and plain radiographs considered to show signs of FAI related to either cam morphology (e.g., alpha angles >55°), pincer morphology (e.g., lateral centre edge angle >40°, positive cross-over sign) and/or mixed-type FAI rather than being solely restricted to those with cam morphology only; this assortment of presentations is consistent with the automated CamMorph measures of cam volumes ranging from negligible to major determined from the baseline MR images of the male and female FAI patients in the present analyses.Compared to female FAI patients, male patients had a significantly larger mean cam surface area (6.57 vs. 3.07 cm2), median maximum-height (3.66 vs. 2.15 mm) and median average-height (1.70 vs. 0.86 mm) based on the automated CamMorph analysis of the FASHIoN MR dataset. These values agree well with the mean cam surface area (6.158 cm2), maximum-height (3.7 mm) and average-height (1.6 mm) reported from MR measures obtained in the cadaver study of Guidetti et al. (41). Kang et al. reported, based on MR measurements, a considerably smaller mean cam surface area (0.52 cm2), which may be related to a suboptimal fitting of the average ‘healthy’ model to the pathological femur bone models (10). Moreover, their mean cam average-height (3.9 mm) was notably different to the median values obtained from the MR measures in our current work and mean values from Guidetti et al.’s study (41). The use of average ‘healthy’ models by Kang et al. (10), rather than an individual patient model as per the developed CamMorph approach, may account for these differing outcome data.Strong positive correlations were observed between cam volume and surface area, cam volume and maximum-height and cam volume and average-height in the male and female patients from the FASHIoN study (). There were, however, notable cam profile differences visualized in the 3D bone models of the proximal femur in individual patients classified as having negligible, mild, moderate or major cam volumes (). These differences suggest that cam morphology may present with a spectrum of topological forms in FAI patients although further systematic shape-based analyses are required to explore this aspect across factors such as sex, anthropometry and FAI severity.In the present study, analysis of the mean femoral head volume showed male patients (66.12 cm3) had significantly greater volumes than the female patients (46.02 cm3). The femoral head volumes derived from our automated segmentation approach are consistent with values reported in earlier CT and MRI studies (9,21-23,37,40) (Table S3) and studies reporting significantly greater femoral head volumes (21-23) and radii (40) in males compared to females. In previous MR arthrography (21), CT (9,23,37) and cadaveric studies (22), femoral head volume has been calculated using manual segmentation (21), region growing methods (based on CT density) (9,23,37) and direct anatomical measurements using callipers (22), whereas there was no manual (or semi-automated) processing required in the current analyses of the MR images in the FAI patients. Furthermore, our automated segmentation of femoral head volume was robust to varying MR image contrast characteristics and artefacts, differing imaging FOV and the presence of osteophytes and increased synovial fluid surrounding the bone-cartilage interface of the femoral head.There was a significantly greater median cam volume to femoral head volume percentage in male compared to female FAI patients (1.5% vs. 0.6%). A higher median percentage cam volume to femoral head volume in male FAI patients was also established from data obtained from male (10.4%) and female (9.2%) patients in a previous CT study (23). The higher percentage values calculated in the Zhang et al. study (23) were associated with substantially larger cam volumes compared to our values which may reflect an overall greater severity of cam mass in their FAI patients or alternatively, be related to the use of regional bone density values for determining cam morphology. Further studies on male and female FAI patients are required to better clarify the association between cam volume and femoral head volume stratified by sex.In the present study, the automated CamMorph pipeline was successfully developed (Datasets A and B) and implemented (Dataset C) using 3D true-FISP and DESS MR images from three separate databases. Future work assessing the ability of CamMorph to segment and analyze cam morphology from other 3D MRI sequences such as the volumetric interpolated breath hold examination (VIBE) and the sampling perfection with application optimized contrasts using different flip angle evolution (SPACE) sequence, which, along with 3D true-FISP, have been shown to provide bone volumes comparable to CT in a cadaveric study of the knee (44), would be beneficial for further evaluation of our U-net + FSM approach for analysing bone formations in symptomatic conditions such as FAI. Comparisons between MR and CT-based cam measures obtained with CamMorph could be conducted to assess the respective data obtained from these two modalities notwithstanding the clinical advantages of MRI for simultaneous evaluation of bony, cartilaginous and associated soft tissues of the hip joint without exposing patients to ionizing radiation.The cam volume, surface area and height measurements in our study were in good agreement with corresponding data reported in Guidetti et al.’s study based on MR analysis of arthroscopically excised cam bone mass from cadavers (41). In future MR (or CT) studies, the cam morphology measures obtained with CamMorph could be compared directly with “ground truth” data from arthroscopic cam excision although precise delineation of cam bone mass in both pre-operative (cam bone mass identified for removal) and post-operative (actual cam bone mass removed) analyses would be essential for the validity of any investigations.In conjunction with cam morphology measures, the developed 3D CamMorph pipeline offers the potential for future research involving comprehensive shape-based analyses of cam location and topology in MR images of the hip joint acquired from a range of individuals (e.g., asymptomatic/symptomatic, athlete/non-athlete, surgically/non-surgically managed cohorts). Clinically, shape based analyses could be used for detailed assessment of cam characteristics in cross-sectional (e.g., patients with non-dysplastic vs. dysplastic hips) or longitudinal studies (e.g., patients treated conservatively vs. surgically). In FAI patients, these analyses could be used to help inform clinical assessments in regards to patient symptoms, pre- and post-intervention treatment evaluations and patient reported outcome data such as that collected in the current FASHIoN trial. Shape analyses may also offer insights into cam bone morphology in relation to anthropometric characteristics or history of sporting (physical) activity levels.Further improvements to the 3D CamMorph pipeline could include an automated assessment of acetabular pincer-related morphology. This would involve generating a separate acetabulum FSM using data from individuals without pincer morphology. Subsequently, the central steps in the present work involving 3D U-net + FSM segmentation and simulated ‘healthy’ bone creation would be followed. In terms of clinical application, cam and pincer morphology data could be used for both pre- and post-arthroscopic analysis. The 3D proximal femur models generated from CamMorph could also be utilized to improve patient education.
Conclusions
The fully automated 3D CamMorph pipeline developed in the present study successfully segmented and measured cam morphology from clinical MR images of the hip in male and female patients with differing FAI severity and pathoanatomical characteristics.The article’s supplementary files as
Authors: Aleš Neubert; Katharine J Wilson; Craig Engstrom; Rachel K Surowiec; Anthony Paproki; Nicholas Johnson; Stuart Crozier; Jurgen Fripp; Charles P Ho Journal: Eur J Radiol Date: 2017-06-03 Impact factor: 3.528
Authors: Ying Xia; Shekhar S Chandra; Craig Engstrom; Mark W Strudwick; Stuart Crozier; Jurgen Fripp Journal: Phys Med Biol Date: 2014-11-10 Impact factor: 3.609
Authors: Pim van Klij; Michael P Reiman; Jan H Waarsing; Max Reijman; Wichor M Bramer; Jan A N Verhaar; Rintje Agricola Journal: Orthop J Sports Med Date: 2020-08-10
Authors: Adam B Yanke; M Michael Khair; Robert Stanley; David Walton; Simon Lee; Charles A Bush-Joseph; Alejandro Espinoza Orías; Alejandro A Espinosa Orias; Nozomu Inoue; Shane J Nho Journal: Arthroscopy Date: 2015-07-26 Impact factor: 4.772
Authors: R Agricola; J H Waarsing; G E Thomas; A J Carr; M Reijman; S M A Bierma-Zeinstra; S Glyn-Jones; H Weinans; N K Arden Journal: Osteoarthritis Cartilage Date: 2013-11-21 Impact factor: 6.576
Authors: Nicholas J Murphy; Jillian Eyles; Kim L Bennell; Megan Bohensky; Alexander Burns; Fraser M Callaghan; Edward Dickenson; Camdon Fary; Stuart M Grieve; Damian R Griffin; Michelle Hall; Rachel Hobson; Young Jo Kim; James M Linklater; David G Lloyd; Robert Molnar; Rachel L O'Connell; John O'Donnell; Michael O'Sullivan; Sunny Randhawa; Stephan Reichenbach; David J Saxby; Parminder Singh; Libby Spiers; Phong Tran; Tim V Wrigley; David J Hunter Journal: BMC Musculoskelet Disord Date: 2017-09-26 Impact factor: 2.362
Authors: Cem M Deniz; Siyuan Xiang; R Spencer Hallyburton; Arakua Welbeck; James S Babb; Stephen Honig; Kyunghyun Cho; Gregory Chang Journal: Sci Rep Date: 2018-11-07 Impact factor: 4.379
Authors: Pauli Virtanen; Ralf Gommers; Travis E Oliphant; Matt Haberland; Tyler Reddy; David Cournapeau; Evgeni Burovski; Pearu Peterson; Warren Weckesser; Jonathan Bright; Stéfan J van der Walt; Matthew Brett; Joshua Wilson; K Jarrod Millman; Nikolay Mayorov; Andrew R J Nelson; Eric Jones; Robert Kern; Eric Larson; C J Carey; İlhan Polat; Yu Feng; Eric W Moore; Jake VanderPlas; Denis Laxalde; Josef Perktold; Robert Cimrman; Ian Henriksen; E A Quintero; Charles R Harris; Anne M Archibald; Antônio H Ribeiro; Fabian Pedregosa; Paul van Mulbregt Journal: Nat Methods Date: 2020-02-03 Impact factor: 28.547