| Literature DB >> 27376298 |
Xinrong Ji1,2, Cuiqin Hou3, Yibin Hou4, Fang Gao5, Shulong Wang6.
Abstract
In wireless sensor networks, centralized learning methods have very high communication costs and energy consumption. These are caused by the need to transmit scattered training examples from various sensor nodes to the central fusion center where a classifier or a regression machine is trained. To reduce the communication cost, a distributed learning method for a kernel machine that incorporates ℓ 1 norm regularization ( ℓ 1 -regularized) is investigated, and a novel distributed learning algorithm for the ℓ 1 -regularized kernel minimum mean squared error (KMSE) machine is proposed. The proposed algorithm relies on in-network processing and a collaboration that transmits the sparse model only between single-hop neighboring nodes. This paper evaluates the proposed algorithm with respect to the prediction accuracy, the sparse rate of model, the communication cost and the number of iterations on synthetic and real datasets. The simulation results show that the proposed algorithm can obtain approximately the same prediction accuracy as that obtained by the batch learning method. Moreover, it is significantly superior in terms of the sparse rate of model and communication cost, and it can converge with fewer iterations. Finally, an experiment conducted on a wireless sensor network (WSN) test platform further shows the advantages of the proposed algorithm with respect to communication cost.Entities:
Keywords: distributed learning; kernel machines; kernel minimum mean squared error (KMSE); wireless sensor network (WSN); ℓ1 norm regularization (ℓ1-regularized)
Year: 2016 PMID: 27376298 PMCID: PMC4970071 DOI: 10.3390/s16071021
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Performance comparisons of CSVM, L1-CKMSE, L1-DKMSE, DKLS, DPSVM and NDSVM on the synthetic dataset: (a) prediction accuracy; (b) sparse rate of model; (c) communication cost; and (d) iterations.
Datasets from UCI repository.
| Datasets | Classes | Dim. Feature | Size |
|---|---|---|---|
| magic | 2 | 10 | 19,020 |
| default of credit card client | 2 | 24 | 30,000 |
| spambase | 2 | 57 | 4601 |
Values of the parameters of the algorithms on the UCI datasets. CSVM, Centralized SVM learning algorithm; L1-CKMSE, Centralized L1-regularized KMSE learning algorithm; L1-DKMSE, proposed Algorithm 1 in this paper; DKLS, Distributed Kernel Least Squares learning algorithm in [10]; DPSVM, Distributed Parallel SVM learning algorithm in [17]; NDSVM-1, Distributed SVM learning algorithm based on a nonlinear kernel in [14] with 30 shared training examples among all nodes; NDSVM-2, Distributed SVM learning algorithm based on a nonlinear kernel in [14] with 50 shared training examples among all nodes.
| Algorithms | Magic | Default of Credit Card Client | Spambase |
|---|---|---|---|
| CSVM | |||
| L1-CKMSE | |||
| L1-DKMSE | |||
| DKLS | |||
| DPSVM | |||
| NDSVM-1 | |||
| NDSVM-2 |
Figure 2Performance comparisons of CSVM, L1-CKMSE, L1-DKMSE, DKLS, DPSVM and NDSVM on the UCI datasets: (a) prediction accuracy; (b) sparse rate of model; (c) communication cost; and (d) iterations.
Communication costs of the algorithms for the synthetic dataset.
| Algorithms | Amount of Data Sent at Every Turn of Each Node (Byte) | Iterations |
|---|---|---|
| CSVM | 140 | 20 |
| L1-CKMSE | 140 | 20 |
| L1-DKMSE | 26 | 16 |
| DKLS | 312 | 6 |
| DPSVM | 67 | 9 |
| NDSVM-1 | 35 | 85 |
| NDSVM-2 | 51 | 87 |
Correspondence of voltage and battery capacity.
| 500 mA Load Method | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Voltage (V) | 8.40 | 7.94 | 7.74 | 7.58 | 7.46 | 7.36 | 7.30 | 7.24 | 7.16 | 7.02 | 6.84 | 6.00 |
| Battery capacity (%) | 100 | 90 | 80 | 70 | 60 | 50 | 40 | 30 | 20 | 10 | 5 | 0 |
Figure 3Comparison of the energy consumption on each node for the CSVM, L1-CKMSE, L1-DKMSE, DKLS, DPSVM and NDSVM algorithms on the test platform.