| Literature DB >> 36236728 |
Abstract
As the core link of the "Internet + Recycling" process, the value identification of the sorting center is a great challenge due to its small and imbalanced data set. This paper utilizes transfer fuzzy c-means to improve the value assessment accuracy of the sorting center by transferring the knowledge of customers clustering. To ensure the transfer effect, an inter-class balanced data selection method is proposed to select a balanced and more qualified subset of the source domain. Furthermore, an improved RFM (Recency, Frequency, and Monetary) model, named GFMR (Gap, Frequency, Monetary, and Repeat), has been presented to attain a more reasonable attribute description for sorting centers and consumers. The application in the field of electronic waste recycling shows the effectiveness and advantages of the proposed method.Entities:
Keywords: GFMR model; Internet + Recycling; inter-class balanced data selection; transfer clustering; value identification
Mesh:
Year: 2022 PMID: 36236728 PMCID: PMC9572044 DOI: 10.3390/s22197629
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Fitted Distributions of G of Customers.
Clustering customer by .
| Cluster | Count |
|
|
|
|---|---|---|---|---|
| High-value | 165 | 119.3455 | 304.3000 | 16,427.7000 |
| Potential-value | 1146 | 124.1798 | 90.7749 | 5467.7400 |
| Stable-value | 153,094 | 95.2453 | 1.7242 | 96.5492 |
| Low-value | 153,654 | 286.7248 | 1.4013 | 76.7217 |
Clustering customer by .
| Cluster | Count |
|
|
|
|
|---|---|---|---|---|---|
| High-value | 543 | 0.9715 | 193.9931 | 9243.0920 | 156.0792 |
| Potential-value | 57,683 | 5.5829 | 4.6739 | 263.0061 | 4.1112 |
| Stable-value | 6337 | 76.1596 | 2.5564 | 193.2928 | 1.7175 |
| Low-value | 243,496 | 364.9982 | 1.0000 | 56.2338 | 1.0000 |
True centers of target domain.
| Cluster |
|
|
|
|---|---|---|---|
| High-value | 0.9967 | 0.7258 | 0.7035 |
| Potential-value | 0.9718 | 0.1771 | 0.3338 |
| Stable-value | 0.5945 | 0.0073 | 0.0107 |
| Low-value | 0.1865 | 0.0036 | 0.0059 |
Cluster centers of .
| Cluster |
|
|
| Distance |
|---|---|---|---|---|
| High-value | 0.9922 | 0.3373 | 0.2806 | 0.5742 |
| Potential-value | 0.9847 | 0.0070 | 0.0067 | 0.3689 |
| Stable-value | 0.7912 | 0.0030 | 0.0041 | 0.1969 |
| Low-value | 0.0000 | 0.0000 | 0.0013 | 0.1866 |
Cluster centers of .
| Cluster |
|
|
| Distance |
|---|---|---|---|---|
| High-value | 0.9987 | 0.6216 | 0.4156 | 0.3062 |
| Potential-value | 0.9779 | 0.1483 | 0.2847 | 0.0572 |
| Stable-value | 0.6937 | 0.0021 | 0.0031 | 0.0997 |
| Low-value | 0.1392 | 0.0016 | 0.0035 | 0.0473 |
Figure 2Variation of distance between true centers and clustering centers.
Accuracy of transferring top 50% of by TFCM.
|
| 0 | 0.005 | 0.1 | 0.5 | 0.7 | 1 | 1.5 | 10 | 50 | 100 | Average | Max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||||
| 0 | 30.04% | 31.84% | 25.56% | 83.41% | 77.13% | 90.13% | 3.59% | 2.24% | 2.24% | 2.24% | 34.84% | 90.13% | |
| 0.005 | 29.15% | 24.66% | 25.56% | 83.41% | 77.13% | 89.24% | 3.59% | 2.24% | 2.24% | 2.24% | 33.95% | 89.24% | |
| 0.1 | 69.51% | 69.06% | 2.24% | 78.92% | 75.78% | 2.24% | 3.59% | 2.24% | 2.24% | 2.24% | 30.81% | 78.92% | |
| 0.5 | 82.96% | 82.51% | 89.69% | 34.53% | 52.47% | 2.24% | 3.59% | 2.24% | 2.24% | 2.24% | 35.47% | 89.69% | |
| 0.7 | 86.10% | 86.10% | 91.93% | 19.28% | 28.25% | 48.43% | 2.24% | 2.24% | 2.24% | 2.24% | 36.91% | 91.93% | |
| 1 | 88.79% | 88.79% | 92.83% | 13.00% | 16.14% | 25.56% | 2.24% | 2.24% | 2.24% | 2.24% | 33.41% | 92.83% | |
| 1.5 | 91.48% | 91.48% | 92.38% | 90.58% | 10.31% | 13.45% | 3.59% | 2.24% | 2.24% | 2.24% | 40.00% | 92.38% | |
| 10 | 96.41% | 96.41% | 96.41% | 95.96% | 95.07% | 95.07% | 94.17% | 2.24% | 2.24% | 2.24% | 67.62% | 96.41% | |
| 50 | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 95.96% | 96.41% | 2.24% | 2.24% | 77.53% | 96.41% | |
| 100 | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 96.41% | 97.31% | 2.24% | 87.09% | 97.31% | |
| Average | 76.73% | 76.37% | 70.94% | 69.19% | 62.51% | 55.92% | 30.90% | 21.08% | 11.75% | 2.24% | 47.76% | ||
Accuracy of transferring by TFCM.
|
| 0 | 0.005 | 0.1 | 0.5 | 0.7 | 1 | 1.5 | 10 | 50 | 100 | Average | Max | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||||
| 0 | 30.04% | 29.60% | 38.57% | 19.28% | 11.21% | 5.38% | 3.59% | 2.24% | 2.24% | 2.24% | 14.44% | 38.57% | |
| 0.005 | 28.70% | 33.18% | 37.67% | 19.28% | 11.21% | 5.38% | 3.59% | 2.24% | 2.24% | 2.24% | 14.57% | 37.67% | |
| 0.1 | 44.84% | 43.95% | 18.39% | 23.32% | 13.45% | 5.38% | 3.59% | 2.24% | 2.24% | 2.24% | 15.96% | 44.84% | |
| 0.5 | 78.92% | 79.37% | 71.75% | 39.01% | 26.91% | 2.24% | 2.24% | 3.59% | 2.24% | 2.24% | 30.85% | 79.37% | |
| 0.7 | 82.51% | 82.51% | 78.92% | 45.29% | 31.39% | 2.24% | 2.24% | 2.24% | 2.24% | 2.24% | 33.18% | 82.51% | |
| 1 | 84.75% | 85.20% | 89.69% | 56.95% | 41.26% | 18.83% | 4.48% | 2.24% | 2.24% | 2.24% | 38.79% | 89.69% | |
| 1.5 | 87.89% | 88.34% | 91.48% | 74.44% | 56.05% | 31.84% | 7.17% | 2.24% | 2.24% | 2.24% | 44.39% | 91.48% | |
| 10 | 92.83% | 92.83% | 91.93% | 92.38% | 92.38% | 91.93% | 91.03% | 3.59% | 2.24% | 2.24% | 65.34% | 92.83% | |
| 50 | 92.83% | 92.83% | 92.83% | 92.38% | 92.38% | 92.38% | 92.38% | 88.34% | 3.59% | 2.24% | 74.22% | 92.83% | |
| 100 | 92.83% | 92.83% | 92.83% | 92.38% | 91.93% | 91.93% | 91.93% | 91.48% | 3.59% | 3.59% | 74.53% | 92.83% | |
| Average | 71.61% | 72.06% | 70.40% | 55.47% | 46.82% | 34.75% | 30.22% | 20.04% | 2.51% | 2.38% | 40.63% | ||
Figure 3Comparison of different clustering approaches.