| Literature DB >> 35992362 |
Xue-Mei Dong1, Xudong Kong1, Xiaoping Zhang1.
Abstract
When the human brain learns multiple related or continuous tasks, it will produce knowledge sharing and transfer. Thus, fast and effective task learning can be realized. This idea leads to multi-task learning. The key of multi-task learning is to find the correlation between tasks and establish a fast and effective model based on these relationship information. This paper proposes a multi-task learning framework based on stochastic configuration neural networks. It organically combines the idea of the classical parameter sharing multi-task learning with that of constraint sharing configuration in stochastic configuration neural networks. Moreover, it provides an efficient multi-kernel function selection mechanism. The convergence of the proposed algorithm is proved theoretically. The experiment results on one simulation data set and four real life data sets verify the effectiveness of the proposed algorithm.Entities:
Keywords: knowledge sharing and transfer; multi-task learning; neural networks; stochastic configuration; supervised mechanism
Year: 2022 PMID: 35992362 PMCID: PMC9386079 DOI: 10.3389/fbioe.2022.890132
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
FIGURE 1Single-task backpropagation nets.
FIGURE 2The multi-task backpropagation net.
Parameter description.
| Algorithms | Parameters | Parameters’ Range |
|---|---|---|
| MTSL-SCRBN | RBF scale |
|
| MTEN | Regularization parameters |
|
| SVM | RBF scale |
|
| SC-III | Internal weight parameters | ( |
| DMTRL | Factorisation method parameters | { |
| MMoE | Units |
|
| AUTOMTL | Weight |
|
FIGURE 3Distribution of the simulation data set.
The results of MTSL-SCRBN, SC-III and SVM on the simulation data set.
| Task | MTSL-SCRBN | SC-III | SVM | |||
|---|---|---|---|---|---|---|
| training | test | training | test | training | test | |
| Task 1 | 0.1080 |
| 0.1735 | 0.4264 | 0.6624 | 0.8129 |
| Task 2 | 0.5320 |
| 0.8595 | 2.1120 | 3.4796 | 4.2842 |
| Task 3 | 0.9589 |
| 1.5501 | 3.7974 | 6.0838 | 7.6899 |
| Task 4 | 1.3853 |
| 2.2385 | 5.4810 | 8.8058 | 11.1654 |
| Task 5 | 1.8116 |
| 2.9267 | 7.1716 | 11.5199 | 14.6416 |
The results with the minimum test errors are marked in bold.
FIGURE 4Prediction performance of MTSL-SCRBN, SC-III and SVM on Task 1 of simulation data set.
The results of MTSL-SCRBN, MTL, MTEN on the simulation data set.
| MTSL-SCRBN | MTL | MTEN | |||
|---|---|---|---|---|---|
| training | test | training | test | training | test |
| 1.2836 |
| 6.7430 | 5.1472 | 7.8274 | 6.2219 |
| (0.1763) | (0.0929) | (0.0336) | (0.0187) | ||
The results with the minimum test errors are marked in bold.
FIGURE 5Prediction performance of MTSL-SCRBN, MTL, MTEN on Task 1 of simulation data set.
Descriptions of benchmark datasets.
| Data Set | Size | Feature Number | Task Number |
|---|---|---|---|
| Stock | 63 | 6 | 6 |
| SARCOS | 48933 | 21 | 7 |
| School | 15362 | 8 | 139 |
| Mnist | 70000 | 28*28 | 10 |
The comparison results of six MSTL algorithms on three data sets.
| Data set | Size | MTSL-SCRBN | MTL | MTEN | DMTRL | MMoE | GAMTL | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| training | test | training | test | training | test | training | test | training | test | training | test | ||
| Stock | 0.0796 |
| 0.1524 | 0.3016 | 0.1155 | 0.1397 | 0.1441 | 0.1649 | 0.0885 | 0.1536 | |||
| 10 | (0.0039) | (0.0040) | (0.0619) | (0.0667) | 0.1370 | 0.1387 | (0.0268) | (0.0089) | (0.0045) | (0.0040) | (0.0072) | (0.0018) | |
| 0.0661 |
| 0.1149 | 0.2207 | 0.0973 | 0.1008 | 0.1362 | 0.1577 | 0.0811 | 0.1342 | ||||
| 20 | (0.0050) | (0.0069) | (0.0158) | (0.0754) | 0.1313 | 0.1379 | (0.0188) | (0.0112) | (0.0091) | (0.0057) | (0.0074) | (0.0065) | |
| 0.0542 |
| 0.1055 | 0.1712 | 0.0839 | 0.0975 | 0.1282 | 0.1505 | 0.0769 | 0.1186 | ||||
| 30 | (0.0054) | (0.0078) | (0.0117) | (0.0366) | 0.1380 | 0.1339 | (0.0156) | (0.0078) | (0.0019) | (0.0010) | (0.0033) | (0.0022) | |
| Sarcos | 2.3921 |
| 5.1595 | 5.9636 | 4.8021 | 5.4138 | 3.5679 | 4.3704 | 3.6894 | 4.5503 | |||
| 700 | (0.0221) | (0.0364) | (0.0395) | (0.0895) | 4.2165 | 4.2680 | (0.1312) | (0.1128) | (0.0165) | (0.0112) | (0.0315) | (0.0489) | |
| 2.2732 |
| 4.7317 | 5.6063 | 4.2588 | 4.7137 | 2.7236 | 3.4125 | 2.9128 | 3.6810 | ||||
| 1,400 | (0.0250) | (0.0179) | (0.0342) | (0.0808) | 4.1334 | 4.2281 | (0.0610) | (0.1084) | (0.0100) | (0.0101) | (0.0147) | (0.0286) | |
| 2.1128 |
| 4.5611 | 5.0693 | 3.4181 | 4.0180 | 2.6235 | 2.9898 | 2.7503 | 3.0137 | ||||
| 2,100 | (0.0255) | (0.0392) | (0.0370) | (0.0899) | 4.1552 | 4.1818 | (0.0683) | (0.0639) | (0.0115) | (0.0123) | (0.0239) | (0.0317) | |
| School | 8.9122 |
| 11.9488 | 13.6566 | 12.0246 | 13.2983 | 11.9107 | 13.5805 | 11.7439 | 12.3742 | |||
| 100 | (0.1221) | (0.0634) | (0.1356) | (0.1284) | 12.1952 | 13.9454 | (0.1411) | (0.1155) | (0.1286) | (0.2422) | (0.1280) | (0.1560) | |
| 8.4406 |
| 11.4453 | 13.3710 | 11.7838 | 13.1285 | 11.6480 | 13.2517 | 11.3313 | 11.7067 | ||||
| 150 | (0.0736) | (0.0828) | (0.1141) | (0.1089) | 11.7394 | 13.5306 | (0.1562) | (0.1371) | (0.0907) | (0.1108) | (0.0899) | (0.0249) | |
| 7.6262 |
| 11.1923 | 12.9307 | 11.4090 | 12.9873 | 11.3628 | 12.9779 | 10.8133 | 11.3346 | ||||
| 200 | (0.1177) | (0.1067) | (0.0874) | (0.1145) | 11.3253 | 12.8915 | (0.1589) | (0.2015) | (0.0748) | (0.0612) | (0.0546) | (0.0307) | |
The results with the minimum test errors are marked in bold.
The accuracy of MTSL-SCRBN, DMTRL, MMoE, GAMTL and AUTOMTL on Mnist data set.
| Size | MTSL-SCRBN | DMTRL | MMoE | GAMTL | AUTOMTL | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| training | test | training | test | training | test | training | test | training | test | |
| 92.00% |
| 91.80.00% | 64.67% | 94.16% | 62.22% | 69.47% | 57.01% | 93.26% | 60.78% | |
| 50 | (0.0065) | (0.0067) | (0.0111) | (0.0099) | (0.0077) | (0.0086) | (0.0303) | (0.0124) | (0.0231) | (0.0212) |
| 96.15% |
| 95.90% | 70.793% | 95.92% | 68.15% | 84.62% | 68.51% | 94.56% | 70.45% | |
| 100 | (0.0067) | (0.0066) | (0.0117) | (0.0101) | (0.0084) | (0.0061) | (0.0144) | (0.0158) | (0.0128) | (0.0094) |
| 97.27% |
| 97.97% | 80.05% | 97.04% | 74.61% | 95.52% | 78.89% | 96.87% | 79.34% | |
| 150 | (0.0099) | (0.0064) | (0.0082) | (0.0097) | (0.0081) | (0.0063) | (0.0119) | (0.0087) | (0.0097) | (0.0084) |
The results with the minimum test errors are marked in bold.
Parameter description for the four models.
| Models | Parameters | Parameters’ Range |
|---|---|---|
| MTSL-SCRBN | RBF scale |
|
| MTSL-SCSGM | Internal weight parameters | ( |
| MTSL-SCTANH | Internal weight parameters | ( |
| MTSL-SCReLU | Internal weight parameters | ( |
The comparison results of different activation functions based models on two data sets.
| Data set | Size | MTSL-SCRBN | MTSL-SCSGM | MTSL-SCTANH | MTSL-SCReLU | ||||
|---|---|---|---|---|---|---|---|---|---|
| training | test | training | test | training | test | training | test | ||
| Stock | 0.0796 |
| 0.0819 | 0.1566 | 0.0814 | 0.1586 | 0.0853 | 0.1424 | |
| 10 | (0.0039) | (0.0040) | (0.0048) | (0.0057) | (0.0036) | (0.0036) | (0.0035) | (0.0055) | |
| 0.0661 |
| 0.0766 | 0.1231 | 0.0758 | 0.1242 | 0.0759 | 0.1194 | ||
| 20 | (0.0050) | (0.0069) | (0.0032) | (0.0098) | (0.0034) | (0.0076) | (0.0028) | (0.0051) | |
| 0.0542 |
| 0.0671 | 0.1128 | 0.0665 | 0.1037 | 0.0678 | 0.1009 | ||
| 30 | (0.0054) | (0.0078) | (0.0020) | (0.0061) | (0.0021) | (0.0076) | (0.0012) | (0.0046) | |
| Sarcos | 2.3921 |
| 3.3670 | 5.1386 | 3.2588 | 4.9160 | 3.1868 | 5.5491 | |
| 700 | (0.0221) | (0.0364) | (0.0263) | (0.0868) | (0.0308) | (0.0746) | (0.0140) | (0.0824) | |
| 2.2732 |
| 3.1829 | 4.0958 | 3.0747 | 4.1529 | 2.9880 | 4.1854 | ||
| 1,400 | (0.0250) | (0.0179) | (0.0161) | (0.0713) | (0.0178) | (0.0598) | (0.0112) | (0.0767) | |
| 2.1128 |
| 2.9860 | 3.8698 | 2.8853 | 3.8420 | 2.8539 | 3.8211 | ||
| 2,100 | (0.0255) | (0.0392) | (0.0136) | (0.0764) | (0.0127) | (0.0494) | (0.0705) | (0.0506) | |
The results with the minimum test errors are marked in bold.