z-logo
open-access-imgOpen Access
A multi-armed bandit solver method for adaptive power allocation in device-to-device communication
Author(s) -
Muhidul Islam Khan,
Muhammad Mahtab Alam,
Yannick Le Moullec
Publication year - 2018
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2018.04.155
Subject(s) - computer science , throughput , resource allocation , reinforcement learning , solver , distributed computing , reuse , interference (communication) , cellular network , spectral efficiency , power control , multi armed bandit , computer network , power (physics) , wireless , channel (broadcasting) , artificial intelligence , machine learning , telecommunications , ecology , physics , quantum mechanics , biology , programming language , regret
Device to device (D2D) communication has attracted enormous attention for future cellular networks which helps to increase the cellular capacity, improve the user throughput, and extend the battery lifetime of user equipments (UEs) by reusing the spectrum resources. However, D2D devices provide interferences in the system while reusing the resources. Proper control of interferences helps to increase the performance of the overall system. Adaptive power allocation among cellular and D2D users contributes to providing an efficient interference management system. In this paper, we propose an online power allocation method, i.e., multi-armed bandit solver for D2D communication. We explore the proposed method to improve the system throughput and D2D throughput as well. We incorporate the set of states for this learning algorithm with the appropriate number of system-defined variables, which increases the observation space and consequently improve the balance of spectrum usage. Finally, we compare our proposed work with existing distributed reinforcement learning and random allocation of resources. Simulation results depict that the proposed resource allocation method outperforms the existing works regarding overall system throughput as well as D2D throughput by efficiently controlling the interference levels.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom