Learning and Fairness in Energy Harvesting: A Maximin Multi-Armed Bandits Approach
2020
Recent advances in wireless radio frequency (RF) energy harvesting allows sensor nodes to increase their lifespan by remotely charging their batteries. The amount of energy harvested by the nodes varies depending on their ambient environment, and proximity to the source and lifespan of the sensor network depends on the minimum amount of energy a node can harvest in the network. It is thus important to learn the least amount of energy harvested by nodes so that the source can transmit on a frequency band that maximizes this amount. We model this learning problem as a novel stochastic \textit{Maximin Multi-Armed Bandits} (Maximin MAB) problem and propose an Upper Confidence Bound (UCB) based algorithm named Maximax UCB. Maximin MAB is a generalization of standard MAB and enjoys the same performance guarantee as to the UCB1 algorithm. Experimental results validate the performance guarantees of our algorithm.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
16
References
0
Citations
NaN
KQI