| Literature DB >> 36253726 |
Joyce Zhanzi Wang1,2, Jonathon Lillia3, Ashnil Kumar4, Paula Bray5,3, Jinman Kim6, Joshua Burns5,3, Tegan L Cheng5,3.
Abstract
BACKGROUND: Predicting morphological changes to anatomical structures from 3D shapes such as blood vessels or appearance of the face is a growing interest to clinicians. Machine learning (ML) has had great success driving predictions in 2D, however, methods suitable for 3D shapes are unclear and the use cases unknown. OBJECTIVE AND METHODS: This systematic review aims to identify the clinical implementation of 3D shape prediction and ML workflows. Ovid-MEDLINE, Embase, Scopus and Web of Science were searched until 28th March 2022.Entities:
Keywords: 3D body shape prediction; Artificial intelligence; Decision making; Machine learning; Neural network; Regression; Surgical planning
Mesh:
Year: 2022 PMID: 36253726 PMCID: PMC9575250 DOI: 10.1186/s12859-022-04979-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1PRISMA flowchart. The inclusion and exclusion criteria are applied to the abstract-title screening and full-text screening process. Studies from citation searching were also identified and went through screening process
General information of the included articles
| Body region | Clinical application | Machine learning categories | Workflow summary | Model performance | Comments | |
|---|---|---|---|---|---|---|
| Cheng, 2015 [ | Lower third of the face | Predict facial deformation of a patient after denture prosthesis. (outcome visualization, surgical planning) | Reinforcement learning | Register pre- and post-operative meshes Construct templates with feature points Predict changes with a neural network Simulate entire deformation | Error: − 2.021 to 2.021 mm, 95% of lower third face: < 2 mm Training/testing time: less than 10 s | Pipeline of prediction figure provided Two steps of prediction (coordinates of feature points, then the remaining area) |
| Do, 2019 [ | Abdominal aorta | predict abdominal aortic aneurysm growth. (decision support) | Supervised learning | Reconstruct 3D shapes Reference registration Generate the IS field Build dynamic IS model Predict fields and uncertainty Generate AAA shape from the predicted field | Error: mean: 10.30 (3.62) mm | Method summarized with figures |
| Knoops, 2019 [ | Face | Establish a large-scale clinical 3D morphable model. (Decision support, surgical planning, communication support) | Supervised learning | Scan patient face data Construct 3DMM Analyze characteristics of 3DMM Reduce high-dimensional manifold Classify face shapes into patient or non-patient group Predict 3D face shapes | Error: LARS: 1.1 ± 0·3 mm, RR: 1.1 ± 0·3 mm, LASSO: 1.3 ± 0.3 mm and LR: 3.0 ± 1.2 mm | The main purpose of the paper was not prediction Revealed the clinical potential of a large-scale clinical 3DMM using 3D scans (non-ionizing) and machine learning |
| Nguyen, 2020 [ | head | Predict skull shape from a given head surface. (Outcome visualization, computer vision system aid for facial rehabilitation) | Supervised learning | Segment head and skull Reconstruct 3D meshes Register head and skull mesh in pairs Train PLSR model to obtain relationships Predict skull from new 3D head shape Deform the predicted 3D skull again based on generic skull mesh | Mean Hausdorff distance: 2.09 ± 0.15 to 2.64 ± 0.26 mm training/testing time: 9 min 4 s ± 10 s | Method summarized with figures Two steps of prediction (coast forecasting, then sophisticated reconstruction) No large sample size required to work well for PLSR Predictors are large but affect PLSR less than PCA |
| Oura, 2017 [ | Radius and Ulnas | Predict whole bone shape from the partial shape. (Cost and radiation exposure reduction) | Supervised learning | Segment and generate 3D forearm shapes Identify landmarks, shape registration and 3D shape cutting Learn the relationship between whole and partial bone shapes Predict new whole shape from new partial bone shape | Error: 0.71–1.03 mm MATE and MARE: 0.48–1.76 mm and 0.99 -6.88 degrees | Method summarized with figures The model and workflow were not described in much detail |
| Rekik, 2016 [ | Infant cortical shape | Build prediction model for longitudinally developing cortical surfaces in infants. (Diagnosis support, outcome visualization) | Supervised learning* | Estimate cortical surface growth Register baseline surfaces onto a common space Estimate temporal evolution for each baseline shape. A mean atlas is built for each timepoint Predict shapes by local shape morphing and the learnt features Deform surface by moving cloud points to the nearest neighbor | Mean surface error distance (mm): Left: 3-month:0.740 ± 0·727; 6-month:0.981 ± 0.949; 9-month:1.059 ± 1·007; 12-month:1.080 ± 1.039 Right: 3-month:0.756 ± 0·739; 6-month:1.037 ± 0.976; 9-month:1.068 ± 1·003; 12-month:1.115 ± 1.050 | Prediction algorithm was specified No request of point-point surface correspondence Slight changed to their energy functionals and algorithms from 2015 |
| Rekik, 2015 [ | Infant cortical shape | Predict dynamic evolution of infant cortical shape. (Diagnosis support, outcome visualization) | Supervised learning* | Align and segment MR image Reconstruct 3D cortical shape. Convert surface shapes into current Build regression model based on the current metrics Register baseline shapes to a common space Estimate the temporal evolution trajectory, build mean atlases Deform surface based on existing clouds using two closeness metrics | Average distance errors: 3-month: 0.811 mm; 6-month: 0.953 mm; 9-month: 1.011 mm Average surface area difference (%) across all ROIs: 3-month: 7.8%; 6-month: 12.9%; 9-month: 15.4% | Prediction algorithm was specified Method summarized with figures Customized mathematical formulation for the experiments |
| Sampathkumar, 2020 [ | Anterior torso from sternal notch to umbilical notch | Generate an estimation of the post-operative breast shape for cosmetic and reconstructive surgery. (Communication support) | Supervised learning | Conduct 1,320 SPHARM coefficients of pre- and post-op shapes Obtain transformation vectors Learn the relationship of transformation vectors Predict post-op shape based on the forecasted transformation vector and pre-op shape | RMSD (pre, post) = 17.24(5.57) RMSD (pre, predict) = 13.36(2.58) RMSD (post, predict) = 19.68(4.74) | Excellent with testing on training dataset, awful result with test data It was not feasible to forecast a single generalized change for all surgery types |
| ter Horst, 2021 [ | Closed jaw | 3D virtual soft-tissue simulation after mandibular advancement Surgery. (Surgical planning, outcome visualization, communication support) | Unsupervised Learning | Register pre- and post-op meshes (CPD). Get the mandibular displacement based on a reference mesh Predict the soft-tissue displacement Apply displacements to the pre-op vertices | Lower face: MAE: 1.0 ± 0.6 mm. RMSE: 1.2 ± 0.6 mm Lower lip: MAE: 1.1 ± 0.9 mm Chin region: MAE: 1.4 ± 0.9 mm | Neural network architecture was clearly illustrated with a figure. They compared the DL results not only with ground truth, also MTM shape |
| Tanikawa, 2021 [ | Face | Predict 3D facial shape after orthognathic surgery and orthodontic treatment. (Surgical planning) | Unsupervised learning | 3D scan faces Identify 18 landmarks and standardize with a common coordinate system Perform GMM to fit meshes Train model with pre-shape (6017 points), values of cephalometric landmarks, and changes of cephalometric landmarks Predict shape changes for new patients Sum pre-shape and predicted changes to form predicted post-shape | Result: average error (S):0.94 mm, average error (E):0.69 mm Abstract: error (S): 0.89 (0.30) mm, error (E): 0.69 (0.18) mm† | Neural network architecture was clearly illustrated with a figure Accuracy was reported differently between result and abstract sections |
| Wu, 2022 [ | Head | Generate implant design for cranioplasty | Supervised learning | Derive 3D shapes from CT image Register shapes, augment and down sampling the data Train supervised learning model with flawed and intact 3D skull shape Predict intact shape from new flawed one‡ | Only assessed by generated implant§ Volumetric error rate: Cylinder: 8.11%; Irregular cylinder: 8.04% Ellipsoid: 6.60%; Irregular ellipsoid: 7.17% | Neural network architecture was clearly illustrated with a figure |
| Yuan, 2017 [ | Lower third of the face | Predict aesthetic reconstruction effects in edentulous patients. (Surgical planning, outcome visualization) | Unsupervised learning | Register edentulous and dentate meshes on the same coordinate system Construct feature template Construct soft tissue deformation Simulate entire deformation | Error (mm): Range: 1.090–0.480, mean: 0.769 ± 0.205 Statistically significant differences between participants Total run time: < 10 s | Contents were reported differently between method (PCA) and other sections (BP) No details of PCA prediction |
IS, implicit surface; 3DMM, 3D morphological model; LR, linear regression; RR, ridge regression; LASR, least-angle regression; LASSO, least absolute shrinkage and selection operator regression; PLSR, partial least squares regression; PCA, principal component analysis; SPHARM, Fourier spherical harmonics; MATE, mean absolute translational errors; MARE, mean absolute rotational errors; ROIs, range of interests; P1, pre-op shape; P2, post-op shape; E, predicted shape; CPD, coherent point drift method; MAE, mean absolute error; RMSE, root mean square error; RMSD, root mean squared distance; DL, deep learning; MTM, mass tensor modelling; BP, back propagation; GMM, landmark-based geometric morphometric method analysis
*Some elements are semi-supervised. For example, a growth model was used while learning the dynamic cortical surface growth with missing data
This paper had conflicting accuracies from result and abstract section. They also estimated two different interventions together: orthognathic surgery (S) and orthodontic treatment (E)
Further steps for generating implant shapes were eliminated since we only focused on 3D shape prediction
This paper was main doing cranial implant design, 3D shape prediction was just a part of the workflow, therefore, they only assessed error of designed implant shape, no accuracy evaluated for skull shape which was predicted by the V-net
Details of machine learning model development of the included studies
| Cheng, 2015 [ | 48 | 43/5 | 29 feature points on the lower third face | Back propagation neural network | An input layer with 29 nodes, a hidden layer with 20 nodes, and an output layer with 29 nodes |
| Do, 2019 [ | 7 age range: 54–73 (100% male) | NA* | IS field | Spatiotemporal Gaussian Process and EM filter (Kalman) | Gaussian process constructs the IS An EM filter (Kalman) estimates parameter of temporal evolution of the IS field |
| Knoops, 2019 [ | N = 151, mean age: 18·4 (2.4), age range: 14–28 (56% female) | Not clear† | Not mentioned | LR RR LARS LASSO | RR and LASSO: the alpha was set to 0·5 and 0·1 LARS, the number of non-zero coefficient was set to 1 All the other parameters were kept the default values All regression methods but LR penalized the weight of the components with a regulizer |
| Nguyen, 2020 [ | 209 age range: 34–88 (23% female) | 146/63 | Head feature points Feature distances Volumes | PLSR | Get the predictor and response variables (thickness and matrix Train the model coefficient matrix B Predict response variables based on given predictor variables using B |
| Oura, 2017 [ | 100 mean age: 45.5, age range: 16–85 (47% female) | 80/20 | Proximal 60%, Distal 60%, Distal 30% and proximal 30% | PLSR | The correlation A was learnt from the training dataset (Y (whole shape) = AX (partial shape)) |
| Rekik, 2016 [ | 12 | 11/1 | Varifold metric | Varifold-based geodesic shape regression | Longitudinal varifold-based shape regression. Fitting the deforming baseline shape into a set of target shapes by minimizing the energy functional The regression was used to link all subjects' longitudinal shapes in space and time A dynamic cloud was generated to model the temporal evolution trajectories of the baseline geometric shapes Virtual shapes were constructed for prediction. Searches of the local topography were used to estimate the geodesic evolution of the shape for a new subject |
| Rekik, 2015 [ | 17 | 14/3 | Current metric | Spatiotemporal current-based surface regression | A 4D surface growth model is trained to learn the deformation of the baseline shape in consecutive timepoints by the diffeomorphic mapping An external momentum of the change locally acts on baseline shape's Dirac delta currents, which deforms to consecutive shapes. The momenta define the surface deformation process by conjugate gradient descent algorithm minimizing the energy functional Virtual shapes were constructed for prediction. Searches of the local topography were used to estimate the geodesic evolution of the shape for a new subject |
| Sampathkumar, 2020 [ | 33 pre-op age range: 24–68, post-op age range: 27–71 | 21/41 | SPHARM 1320 (440*3) coefficients | Least square regression Random Forest regression | A random forest regression was trained to learn the non-linear relationship between the transformation vectors SPHARM coefficients of the pre-op breast with the transformation vector predicts post-op shape using least squares optimization |
| ter Horst, 2021 [ | 133 mean age: 29.5, age range: 14–65 (53% female) | 119/14 | 3129 vertices 5 nodes for mandibular displacements | Deep learning: autoencoder neural network | Six dense layers in full network Each dense block had a Leaky ReLU activation function Batch normalization momentum: 0·5 Dropout rate: 0·5 |
| Tanikawa, 2021 [ | Surgery group: 72; orthodontic group: 65 | Data separated into 11 sets: 10 training, 1 testing | 6017 points on pre-treatment shape, with values of 27 cephalometric landmarks, and changed values of 16 cephalometric landmarks | Deep learning: customized neural network | Two dense layers with ReLU activation function One dropout layer (0·3) Adam optimizer for optimization MSE for loss function |
| Wu, 2022 [ | 73 | 7154(10% for testing) | Flawed 3D cranial model with a volumetric resolution of 112*112*40 | High dimensional autoencoder augmented with skip connections (V-Net) | Twelve 3D convolutional layers: four 3D expansion layers, three max-pooling layers, three up-sampling layers, augmented with eight skip connections 8269 trainable parameters |
| Yuan, 2017 [ | 10 mean age: 73.2 (4.3), age range: 68–80 (50% female) | Not mentioned | 29 feature points on the lower third face | PCA‡ | Reduce dimensionality on a multi-dimensional variable system PCA is an extraction method based on the minimum mean square error. The completed PCA model is also the facial elastic deformation prediction model, which can predict the elastic deformation of the edentulous model feature template |
NA, not applicable, IS, implicit surface, EM, expectation maximization, LR, linear regression, RR, ridge regression, LASR, least-angle regression, LASSO, least absolute shrinkage and selection operator regression, PLSR, partial least squares regression, PCA, principal component analysis, SPHARM, Fourier spherical harmonics, LSFM, Large-scale facial model
*In this paper, each of the 7 patients had multiple CT images (e.g., some with 4). A per patient training scheme was used and the last was used as the ground truth and the rest (e.g., first 3 out of 4) as training. There was no test set because each model was personalized for a patient
†For shape prediction, the paper talks about three 3D morphological model: a global model, a bespoke pre-operative model, and a bespoke post-operative model. The global model (n = 4216) comprised all patient scans as well as healthy volunteer scans from the same age range. The bespoke pre-operative (n = 119) and post-operative (n = 127) models were made exclusively with patient scans
‡Method was documented inconsistently in abstract, results, discussion, and conclusion
Risk of bias as assessed by Quality In Prognosis Studies (QUIPS)
Fig. 2General workflow of 3D shape prediction summarized into three phases. Phase one: Data preparation including segmentation, reconstruction, and registration. Phase two: predictive model development including two phases: learning and predicting. Phase three: 3D shape prediction based on the predictive model developed and optimized in phase two