| Literature DB >> 31861735 |
Yi-Han Xu1,2, Jing-Wei Xie1, Yang-Gang Zhang3, Min Hua1, Wen Zhou1.
Abstract
Wireless body area networks (WBANs) have attracted great attention from both industry and academia as a promising technology for continuous monitoring of physiological signals of the human body. As the sensors in WBANs are typically battery-driven and inconvenient to recharge, an energy efficient resource allocation scheme is essential to prolong the lifetime of the networks, while guaranteeing the rigid requirements of quality of service (QoS) of the WBANs in nature. As a possible alternative solution to address the energy efficiency problem, energy harvesting (EH) technology with the capability of harvesting energy from ambient sources can potentially reduce the dependence on the battery supply. Consequently, in this paper, we investigate the resource allocation problem for EH-powered WBANs (EH-WBANs). Our goal is to maximize the energy efficiency of the EH-WBANs with the joint consideration of transmission mode, relay selection, allocated time slot, transmission power, and the energy constraint of each sensor. In view of the characteristic of the EH-WBANs, we formulate the energy efficiency problem as a discrete-time and finite-state Markov decision process (DFMDP), in which allocation strategy decisions are made by a hub that does not have complete and global network information. Owing to the complexity of the problem, we propose a modified Q-learning (QL) algorithm to obtain the optimal allocation strategy. The numerical results validate the effectiveness of the proposed scheme as well as the low computation complexity of the proposed modified Q-learning (QL) algorithm.Entities:
Keywords: energy efficient; energy harvesting; reinforcement learning; resource allocation; wireless body area networks
Mesh:
Year: 2019 PMID: 31861735 PMCID: PMC6983140 DOI: 10.3390/s20010044
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Table of notations. SINR: signal to interference plus noise ratio.
| Symbol | Definition |
|---|---|
|
| Hub |
|
| |
|
| Time slot |
|
| |
|
| Transmission mode of |
|
| |
|
| |
|
| |
|
| Data rate of the |
|
| Data rate of the |
|
| Data rate of the |
|
| SINR of |
|
| SINR of the source-relay link in |
|
| SINR of the relay-hub link in |
|
| Transmission power of the |
|
| Transmission gain between the |
|
| Transmission power of |
|
| Transmission gain between the |
|
| Transmission power of |
|
| Transmission gain between the |
|
| Noise power |
|
| Date rate of source-relay link in in cooperative transmission mode |
|
| Date rate of relay-hub link in in cooperative transmission mode |
|
| Data queue length at the |
|
| Maximum traffic queue length of body sensors |
|
| Arriving traffic packets of |
|
| Energy queue length at the |
|
| Maximum energy queue length of body sensors |
|
| Amount of energy harvested by |
|
| Date packet size |
|
| Energy packet size |
|
| Maximum transmission power of body sensors |
|
| Energy efficiency of |
Figure 1Scenario of the wireless body area network (WBAN). ECG: electrocardiogram sensor; EMG: electromyography sensor; EEG: electroencephalography sensor.
The irrelevant state mapping table.
|
| The State Space If Needs to Be Explored |
|---|---|
| {0, 0} | No |
| {0, *} | No |
| {*, 0} | No |
| {*, *} | Yes |
*: valid state; 0: invalid state.
Computation complexity comparison between the modified and classical Q-learning algorithms.
| Modified Q-Learning Algorithm | Classical Q-Learning Algorithm |
|---|---|
|
|
|
|
|
|
| - | - |
|
|
|
Simulation parameters setting.
| Parameters | Value |
|---|---|
|
| 10 m |
| Distance of each body sensor | Random distributed in (2, 5) m |
|
| (1:1:10) |
|
| 1 MHz |
|
| −94 dBm/Hz |
|
| 10 dBm |
|
| (1:1:8) packet/time slot |
|
| (1:1:8) packet/time slot |
|
| 200 |
|
| 0.5 ms |
|
| 8 bits/packet |
|
| 0.0002 J/packet |
|
| 50 packets |
|
| 50 packets |
Figure 2Influence of and on energy efficiency in direct transmission mode.
Figure 3Influence of and on energy efficiency in cooperative transmission mode.
Figure 4The optimization process for energy efficiency.
Figure 5Energy efficiency versus different numbers of body sensor. PEH: powered by energy harvesting; QoS: quality of service.
Figure 6Standard deviation of consumed energy versus different numbers of body sensors.
Figure 7Energy efficiency versus energy harvesting rate with constant
Figure 8Energy efficiency versus energy harvesting rate with constant
Figure 9Energy efficiency versus traffic arrival rate with constant
Figure 10Energy efficiency versus traffic arrival rate with constant
Figure 11Average delivery probability of WBANs versus different numbers of body sensor.