| Literature DB >> 32864421 |
Sarala Padi1, Petru Manescu1, Nicholas Schaub2, Nathan Hotaling2, Carl Simon3, Kapil Bharti2, Peter Bajcsy1.
Abstract
Predicting Retinal Pigment Epithelium (RPE) cell functions in stem cell implants using non-invasive bright field microscopy imaging is a critical task for clinical deployment of stem cell therapies. Such cell function predictions can be carried out using Artificial Intelligence (AI) based models. In this paper we used Traditional Machine Learning (TML) and Deep Learning (DL) based AI models for cell function prediction tasks. TML models depend on feature engineering and DL models perform feature engineering automatically but have higher modeling complexity. This work aims at exploring the tradeoffs between three approaches using TML and DL based models for RPE cell function prediction from microscopy images and at understanding the accuracy relationship between pixel-, cell feature-, and implant label-level accuracies of models. Among the three compared approaches to cell function prediction, the direct approach to cell function prediction from images is slightly more accurate in comparison to indirect approaches using intermediate segmentation and/or feature engineering steps. We also evaluated accuracy variations with respect to model selections (five TML models and two DL models) and model configurations (with and without transfer learning). Finally, we quantified the relationships between segmentation accuracy and the number of samples used for training a model, segmentation accuracy and cell feature error, and cell feature error and accuracy of implant labels. We concluded that for the RPE cell data set, there is a monotonic relationship between the number of training samples and image segmentation accuracy, and between segmentation accuracy and cell feature error, but there is no such a relationship between segmentation accuracy and accuracy of RPE implant labels.Entities:
Keywords: Age-related macular degeneration; Cell function prediction; Cell segmentation; Deep learning; Retinal Pigment Epithelium Cell; Trans-Epithelial Resistance; Vascular Endothelial Growth Factor
Year: 2020 PMID: 32864421 PMCID: PMC7450761
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Modeling factors considered to compare three approaches used for cell function prediction.
| Type | Factors | Definition |
|---|---|---|
| Complexity | Complexity of modeling design | Exploration of plausible DL or TML model architectures for a given problem |
| No. of modeling parameters | Number of parameters optimized during the training stage of the model | |
| Complexity of optimization | Number of independently optimized parameters with respect to DL & TML models | |
| Effort | Training data preparation | Level of effort required to create ground truth |
| Feature engineering | Effort required to engineer the suitable features | |
| Usability | Model transparency or interpretability | Degree of interpretation of the resulting model coefficients |
| Model generalizability | Degree of reusability in other domains |
Fig. 1.Data flow design of three approaches to cell function prediction. GT stands for ground truth, TML-Traditional Machine Learning.
Approach 1 implementation steps and configuration details. Abbreviations: WIPP- Web Image Processing Pipeline; RF-Random Forest regressor; SVR-Support Vector Regressor; LR-Linear Regressor; SLP-Single Layer Perceptron; MLP-Multi Layer Perceptron; RMSE-Root Mean Square Error.
| Approach 1 |
|---|
| a) Implementation: Keras neural network
library [ |
| b) Configuration: Encoder & Decoder DL
model [ |
| i) Transfer learning |
| a) Implementation: WIPP library [ |
| b) Configuration: Intensity, Texture, Shape |
| i) Extracted per segment |
| ii) Selected manually |
| a) Implementation: Weka library [ |
| b) Configuration: Regression based models |
| i) RF, SVR, LR, SLP, & MLP |
List of features extracted for RPE cell function prediction.
| Feature Name | Feature Type | Feature Name | Feature Type |
|---|---|---|---|
| Eccentricity | Spatial | Mean Intensity | Intensity |
| Extent | Spatial | Min Intensity | Intensity |
| Major Axis Length | Spatial | Max Intensity | Intensity |
| Minor Axis Length | Spatial | Standard Deviation | Intensity |
| Centroid | Spatial | Median Intensity | Intensity |
| Weighted Centroid | Spatial | Mode Intensity | Intensity |
| Area | Spatial | Skewness | Intensity |
| Perimeter | Spatial | Kurtosis | Intensity |
| Equivalent Diameter | Spatial | First Central Moment | Intensity |
| Orientation | Spatial | Contrast | Texture |
| Solidity | Spatial | Correlation | Texture |
| Bounding Box | Spatial | Energy | Texture |
| Euler Number | Spatial | Homogeneity | Texture |
| Filled Area | Spatial | Entropy | Texture |
| Convex Area | Spatial | Feret Diameter | Spatial |
| No. of Neighbors | Spatial | Border and Background | Spatial |
| Neighbor |
Approach 2 implementation steps and configuration details.
| Approach 2 |
|---|
| a) Implementation: Keras neural network
library [ |
| b) Configuration: VGG16 CNN model [ |
Approach 3 implementation steps and configuration details.
| Approach 3 |
|---|
| a) Implementation: WIPP library [ |
| b) Configuration: Intensity, Texture |
| i) Extracted per field of view (FOV) |
| ii) Selected manually |
| a) Implementation: WEKA library [ |
| b) Configuration: Regression based models |
| i) RF, SVR, LR, SLP, & MLP |
Range of values for TER, VEGF, and cell count measurements of RPE cell implants. FOV- per field of view. VEGF ratio- Measuring the VEGF secretion on basal side relative to apical side of the RPE cell monolayer (Basal side/Apical side).
| Type of measurement | Min.value | Max.value |
|---|---|---|
| TER(Ω. | 127 | 1071 |
| VEGF ratio (Ba/Ap) | 2.67 | 11.20 |
| Cell count (per FOV) | 33 | 298 |
Comparison of three approaches used for cell function prediction. For Approaches 1 and 3, best machine learning model results are reported (Random forest regressor model performance is reported).
| Approach | Error (mean) | Root Mean Squared Error (RMSE) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| TER | VEGF | Cell count | TER | VEGF | Cell count | TER | VEGF | Cell count | |
| Approach 1 | 0.17 | −0.006 | −2.34 | 37.85 | 1.29 | 27.01 | 0.5253 | 0.794 | 0.6964 |
| Approach 2 | −0.59 | −0.15 | 5.55 | 24.49 | 1.17 | 25.64 | 0.837 | 0.8442 | 0.7915 |
| Approach 3 | −0.265 | 0.097 | 1.00 | 38.48 | 0.90 | 27.31 | 0.5186 | 0.9095 | 0.6687 |
Fig. 3.Box plots showing the distribution of errors while executing each approach to cell function predictions.
Fig. 2.Mean Absolute Percentage Errors (MAPE) of three approaches for TER, VEGF, and Cell count predictions.
Performance comparison of three approaches to cell function predictions evaluated using holdout and 5-fold cross validation methods. The TML based steps used Random Forest regressor model.
| Approach | Root Mean Squared Error (RMSE) | |||||
|---|---|---|---|---|---|---|
| Holdout validation | 5-fold validation | |||||
| TER | VEGF | Cell count | TER | VEGF | Cell count | |
| Approach 1 | 37.85 | 1.29 | 27.01 | 40.63 | 1.20 | 25.97 |
| Approach 2 | 24.49 | 1.17 | 25.64 | 27.87 | 1.14 | 23.11 |
| Approach 3 | 38.48 | 0.90 | 27.31 | 38.20 | 0.97 | 26.37 |
Fig. 4.Visual comparison of segmentation results.
Fig. 6.TER, VEGF, and cell count prediction errors (ranges of TER<127,1071>, VEGF<2.67,11.20>, cell count<33,298>) with respect to difference. The number next to each plotted data point refers to the number of training images.
Segmentation accuracy comparison with and without transfer learning. DL_Seg model: Deep learning model used for RPE cell segmentation; TL: with transfer learning by adapting the VGG16 pretrained model weights.
| Model | DICE score | Cell count error | |
|---|---|---|---|
| Contour | Region | ||
| DL_Seg model | 0.5209 | 0.4913 | 0.1290 |
| DL_Seg model + TL | 0.6638 | 0.7237 | 0.0171 |
Performance comparison of TML regression models for cell function prediction using Approach 1.
| Model | Root Mean Squared Error (RMSE) | |||||
|---|---|---|---|---|---|---|
| Holdout validation | 5-fold validation | |||||
| TER | VEGF | Cell count | TER | VEGF | Cell count | |
| LR | 43.55 | 1.34 | 37.01 | 41.40 | 1.45 | 34.07 |
| SVR | 40.69 | 1.39 | 38.75 | 40.90 | 1.46 | 33.68 |
| RF | 37.85 | 1.29 | 27.01 | 40.63 | 1.20 | 25.97 |
| SLP | 58.94 | 2.00 | 39.44 | 53.41 | 1.85 | 40.96 |
| MLP | 48.85 | 1.32 | 33.00 | 48.71 | 1.20 | 30.74 |
Performance comparison of TML regression models for cell function prediction using Approach 3.
| Model | Root Mean Squared Error (RMSE) | |||||
|---|---|---|---|---|---|---|
| Holdout validation | 5-fold validation | |||||
| TER | VEGF | Cell count | TER | VEGF | Cell count | |
| LR | 46.66 | 1.18 | 40.81 | 48.02 | 1.29 | 38.65 |
| SVR | 43.98 | 1.27 | 36.52 | 48.92 | 1.29 | 35.26 |
| RF | 38.48 | 0.90 | 27.31 | 38.20 | 0.97 | 26.37 |
| SLP | 44.95 | 1.60 | 37.51 | 53.64 | 1.49 | 36.60 |
| MLP | 34.55 | 0.5707 | 34.55 | 38.50 | 0.72 | 33.54 |
Qualitative comparison of inference times for three approaches.Times are measured in milliseconds (ms), minutes (min).
| Approach | Test time | Approximate time | Speed |
|---|---|---|---|
| 1 | DL_Seg + FE + TML | ms + min + min | low |
| 2 | DL_Reg | ms | high |
| 3 | FE + TML | min + min | low |
Qualitative tradeoffs of the three approaches applied to RPE cell prediction problem. The labels “low”, “medium” and “high” are qualitative values and are assigned based on comparative assessments with respect to ideal values.
| Factors | Approach 1 | Approach 2 | Approach 3 | Ideal |
|---|---|---|---|---|
| Complexity of modeling design | high (2) | low (0) | medium | low (0) |
| No. of modeling parameters | medium (1) | high (2) | low (0) | low (0) |
| Complexity of optimization | high (2) | low (0) | medium (1) | low (0) |
| Training data preparation | high (2) | low (0) | low (0) | low (0) |
| Feature engineering | manual (2) | automatic (0) | manual (2) | automatic (0) |
| Model transparency | high (2) | low (0) | medium (1) | high (2) |
| Model generalizability | medium (1) | high (2) | low (0) | high (2) |
Fig. 5.Segmentation accuracy comparisons of five DL models used for RPE cell segmentation task with and without transfer learning. DL_Seg model: Deep learning model used for RPE cell segmentation; TL: with transfer learning by adapting the VGG16 pretrained model weights.