| Literature DB >> 34220508 |
Sangwoo Seo1, Youngmin Kim2, Hyo-Jeong Han3, Woo Chan Son3, Zhen-Yu Hong4, Insuk Sohn4, Jooyong Shim5, Changha Hwang1,2.
Abstract
Despite several improvements in the drug development pipeline over the past decade, drug failures due to unexpected adverse effects have rapidly increased at all stages of clinical trials. To improve the success rate of clinical trials, it is necessary to identify potential loser drug candidates that may fail at clinical trials. Therefore, we need to develop reliable models for predicting the outcomes of clinical trials of drug candidates, which have the potential to guide the drug discovery process. In this study, we propose an outer product-based convolutional neural network (OPCNN) model which integrates effectively chemical features of drugs and target-based features. The validation results via 10-fold cross-validations on the dataset used for a data-driven approach PrOCTOR proved that our OPCNN model performs quite well in terms of accuracy, F1-score, Matthews correlation coefficient (MCC), precision, recall, area under the curve (AUC) of the receiver operating characteristic, and area under the precision-recall curve (AUPRC). In particular, the proposed OPCNN model showed the best performance in terms of MCC, which is widely used in biomedicine as a performance metric and is a more reliable statistical measure. Through 10-fold cross-validation experiments, the accuracy of the OPCNN model is as high as 0.9758, F1 score is as high as 0.9868, the MCC reaches 0.8451, the precision is as high as 0.9889, the recall is as high as 0.9893, the AUC is as high as 0.9824, and the AUPRC is as high as 0.9979. The results proved that our OPCNN model shows significantly good prediction performance on outcomes of clinical trials and it can be quite helpful in early drug discovery.Entities:
Keywords: clinical trial; convolutional neural network; imbalance; multimodal learning; outer product
Year: 2021 PMID: 34220508 PMCID: PMC8242994 DOI: 10.3389/fphar.2021.670670
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
FIGURE 1A workflow of the proposed OPCNN classifier for predicting successes and failures of clinical trials. Given an outer product of two representative feature vectors as an input, 2D CNN is used to learn features. The architecture of OPCNN consists of three residual blocks and five fully connected (FC) layers. Each residual block has three convolution layers. (A) OPCNN classifier (B) Residual block.
FIGURE 2Graphical representation for the early, intermediate, and late fusions. (A) Early fusion (B) Intermediate fusion (C) Late fusion.
Classification results for various prediction models via a 10-fold cross-validation.
| Multimodal | Model | ACC | F1-score | MCC | Precision | Recall | AUC | AUPRC |
|---|---|---|---|---|---|---|---|---|
| SVM | Base | 0.8308*** (0.0053) | 0.9055*** (0.0031) | 0.1796*** (0.014) | 0.9253*** (0.0017) | 0.8864*** (0.0054) | 0.5622*** (0.0099) | 0.9578*** (0.0099) |
| CW | 0.8152*** (0.0075) | 0.8956*** (0.0046) | 0.1837*** (0.0149) | 0.9263*** (0.002) | 0.8669*** (0.0079) | 0.5658*** (0.0114) | 0.9574*** (0.0011) | |
| SMOTE + CW | 0.7748*** (0.0077) | 0.8687*** (0.005) | 0.1975*** (0.0157) | 0.9297*** (0.0028) | 0.8153*** (0.0085) | 0.5791*** (0.0145) | 0.9569*** (0.0015) | |
| Random | Base | 0.9149*** (0.0018) | 0.9551*** (0.0009) | 0.2018*** (0.0303) | 0.9206*** (0.0013) | 0.9924*** (0.0016) | 0.7019*** (0.0058) | 0.9532*** (0.0015) |
| CW | 0.9156*** (0.0022) | 0.9556*** (0.0011) | 0.1865*** (0.0367) | 0.9193*** (0.0012) |
| 0.7125*** (0.0055) | 0.9565*** (0.0021) | |
| SMOTE + CW | 0.8949*** (0.0026) | 0.9435*** (0.0014) | 0.2484*** (0.0156) | 0.9285*** (0.0012) | 0.9689*** (0.0023) | 0.7045*** (0.0052) | 0.9577*** (0.0015) | |
| OPCNN | Base |
|
|
| 0.9844*** (0.0050) | 0.9893*** (0.0058) |
|
|
| CW | 0.9539*** (0.0249) | 0.9743*** (0.0144) | 0.7620*** (0.0854) | 0.9866** (0.0041) | 0.9628*** (0.0282) | 0.9653*** (0.0247) | 0.9952*** (0.0045) | |
| SMOTE + CW | 0.9329*** (0.0338) | 0.9619*** (0.0201) | 0.7012*** (0.0909) | 0.9889 (0.0048) | 0.9373*** (0.0379) | 0.9583*** (0.0177) | 0.9583*** (0.0177) | |
| DMNN | Base | 0.9653*** (0.0038) | 0.9811*** (0.0021) | 0.7727*** (0.0250) | 0.9760*** (0.0037) | 0.9863*** (0.0041) | 0.9717*** (0.0061) | 0.9968** (0.0010) |
| CW | 0.9492*** (0.0075) | 0.9719*** (0.0042) | 0.7238*** (0.0368) | 0.9843*** (0.0039) | 0.9598*** (0.0065) | 0.9660*** (0.0080) | 0.9961** (0.0011) | |
| SMOTE + CW | 0.9309*** (0.0073) | 0.9612*** (0.0043) | 0.6740*** (0.0228) | 0.9871** (0.0032) | 0.9367*** (0.0089) | 0.9551*** (0.0070) | 0.9944*** (0.0011) | |
| DMNN | Base | 0.9669*** (0.0026) | 0.9819*** (0.0014) | 0.7880*** (0.0175) | 0.9798*** (0.0030) | 0.9840*** (0.0031) | 0.9748** (0.0055) | 0.9972 (0.0012) |
| CW | 0.9449*** (0.0085) | 0.9694*** (0.0048)< | 0.7111*** (0.0358) | 0.9849*** (0.0028) | 0.9544*** (0.0080) | 0.9678*** (0.0073) | 0.9964** (0.0011) | |
| SMOTE + CW | 0.9170*** (0.0090) | 0.9529*** (0.0054) | 0.6465*** (0.0265) | 0.9898 (0.0024) | 0.9187*** (0.0095) | 0.9560*** (0.0108) | 0.9936*** (0.0061) | |
| DMNN | Base | 0.9652*** (0.0038) | 0.9810*** (0.0021) | 0.7715*** (0.0261) | 0.9761*** (0.0035) | 0.9861*** (0.0031) | 0.9751*** (0.0063) | 0.9973 (0.0008) |
| CW | 0.9473*** (0.0057) | 0.9708*** (0.0032) | 0.7125*** (0.0293) | 0.9831*** (0.0031) | 0.9589*** (0.0043) | 0.9662*** (0.0071) | 0.9963** (0.0010) | |
| SMOTE + CW | 0.9345*** (0.0069) | 0.9634*** (0.0039) | 0.6786*** (0.0301) | 0.9855*** (0.0036) | 0.9422*** (0.0064) | 0.9569*** (0.0097) | 0.9943*** (0.0019) | |
| DMNN | Base | 0.9652*** (0.0053) | 0.9811*** (0.0029) | 0.7700*** (0.0365) | 0.9753*** (0.0039) | 0.9869*** (0.0031) | 0.9748** (0.0064) | 0.9971* (0.0009) |
| CW | 0.9512*** (0.0072) | 0.9731*** (0.0041) | 0.7252*** (0.0324) | 0.9822*** (0.0028) | 0.9641*** (0.0070) | 0.9663*** (0.0072) | 0.9963** (0.0009) | |
| SMOTE + CW | 0.9172*** (0.0113) | 0.9531*** (0.0067) | 0.6387*** (0.0295) | 0.9874* (0.0032) | 0.9212*** (0.0128) | 0.9535*** (0.0085) | 0.9943*** (0.0015) | |
| DMNN | Base | 0.9582*** (0.0048) | 0.9773*** (0.0026) | 0.7219*** (0.0309) | 0.9703*** (0.0028) | 0.9845*** (0.0041) | 0.9635*** (0.0091) | 0.9959*** (0.0016) |
| CW | 0.9325*** (0.0133) | 0.9625*** (0.0076) | 0.6429*** (0.0512) | 0.9768*** (0.0040) | 0.9487*** (0.0131) | 0.9454*** (0.0131) | 0.9939*** (0.0017) | |
| SMOTE + CW | 0.8958*** (0.0122) | 0.9407*** (0.0073) | 0.5616*** (0.0345) | 0.9798*** (0.0037) | 0.9046*** (0.0126) | 0.9331*** (0.0105) | 0.9919*** (0.0021) | |
| DMNN | Base | 0.9582*** (0.0041) | 0.9773*** (0.0023) | 0.7212*** (0.0261) | 0.9701*** (0.0033) | 0.9845*** (0.0041) | 0.9630*** (0.0065) | 0.9959*** (0.0008) |
| CW | 0.9315*** (0.0075) | 0.9620*** (0.0043) | 0.6335*** (0.0298) | 0.9757*** (0.0036) | 0.9487*** (0.0083) | 0.9429*** (0.0090) | 0.9934*** (0.0014) | |
| SMOTE + CW | 0.9297*** (0.0081) | 0.9607*** (0.0047) | 0.6522*** (0.0291) | 0.9820*** (0.0029) | 0.9404*** (0.0083) | 0.9462*** (0.0065) | 0.9934*** (0.0010) | |
| DMNN | Base | 0.9484*** (0.0047) | 0.9716*** (0.0026) | 0.6983*** (0.0265) | 0.9774*** (0.0033) | 0.9659*** (0.0042) | 0.9638*** (0.0079) | 0.9960** (0.0015) |
| CW | 0.9311*** (0.0084) | 0.9614*** (0.0048) | 0.6715*** (0.0338) | 0.9863*** (0.0035) | 0.9377*** (0.0078) | 0.9636*** (0.0093) | 0.9959*** (0.0014) | |
| SMOTE + CW | 0.9203*** (0.0076) | 0.9549*** (0.0045) | 0.6518*** (0.0216) | 0.9890 (0.0026) | 0.9231*** (0.0087) | 0.9632*** (0.0069) | 0.9958** (0.0012) | |
| DMNN | Base | 0.9574*** (0.0059) | 0.9769*** (0.0032) | 0.7173*** (0.0390) | 0.9701*** (0.0041) | 0.9838*** (0.0045) | 0.9621*** (0.0082) | 0.9958*** (0.0011) |
| CW | 0.9362*** (0.0122) | 0.9646*** (0.0070) | 0.6542*** (0.0463) | 0.9767*** (0.0036) | 0.9529*** (0.0126) | 0.9522*** (0.0097) | 0.9947*** (0.0013) | |
| SMOTE + CW | 0.9265*** (0.0098) | 0.9588*** (0.0057) | 0.6400*** (0.0347) | 0.9811*** (0.0033) | 0.9376*** (0.0101) | 0.9461*** (0.0083) | 0.9934*** (0.0014) | |
| DMNN | Base | 0.9652*** (0.0053) | 0.9810*** (0.0029) | 0.7774*** (0.0324) | 0.9787*** (0.0035) | 0.9834*** (0.0044) | 0.9678*** (0.0096) | 0.9964** (0.0013) |
| CW | 0.9457*** (0.0054) | 0.9699*** (0.0031) | 0.7068*** (0.0242) | 0.9831*** (0.0029) | 0.9571*** (0.0058) | 0.9632*** (0.0091) | 0.9958*** (0.0013) | |
| SMOTE + CW | 0.9286*** (0.0112)< | 0.9598*** (0.0065) | 0.6740*** (0.0334) | 0.9887 (0.0025)< | 0.9325*** (0.0122) | 0.9597*** (0.0098) | 0.9950*** (0.0016) | |
| DMNN | Base | 0.9464*** (0.0042) | 0.9706*** (0.0023) | 0.6770*** (0.0255) | 0.9736*** (0.0034) | 0.9677*** (0.0038) | 0.9550*** (0.0067) | 0.9948*** (0.0011) |
| CW | 0.9225*** (0.0073) | 0.9564*** (0.0043) | 0.6379*** (0.0225) | 0.9836*** (0.0025) | 0.9307*** (0.0082) | 0.9541*** (0.0111) | 0.9947*** (0.0018) | |
| SMOTE + CW | 0.9057*** (0.0136) | 0.9464*** (0.0081) | 0.6060*** (0.0408) | 0.9857*** (0.0035) | 0.9101*** (0.0129) | 0.9505*** (0.0109) | 0.9943*** (0.0016) | |
| DMNN | Base | 0.9432*** (0.0050) | 0.9691*** (0.0028) | 0.6276*** (0.0310) | 0.9633*** (0.0036) | 0.9750*** (0.0047) | 0.9414*** (0.0084) | 0.9934*** (0.0012) |
| CW | 0.8990*** (0.0107) | 0.9429*** (0.0062) | 0.5476*** (0.0405) | 0.9750*** (0.0049) | 0.9130*** (0.0090) | 0.9228*** (0.0153) | 0.9912*** (0.0020) | |
| SMOTE + CW | 0.9005*** (0.0054) | 0.9434*** (0.0033) | 0.5835*** (0.0181) | 0.9832*** (0.0036) | 0.9067*** (0.0070) | 0.9381*** (0.0057) | 0.9931*** (0.0009) |
FIGURE 3ROC and precision–recall curves for 10-fold cross-validation. (A) ROC curves (B) Precision–recall curves.