A multi-armed bandit solver method for adaptive power allocation in device-to-device communication | Zendy

Muhidul Islam Khan | Zendy; Muhammad Mahtab Alam | Zendy; Yannick Le Moullec | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A multi-armed bandit solver method for adaptive power allocation in device-to-device communication

Author(s) -

Muhidul Islam Khan,

Muhammad Mahtab Alam,

Yannick Le Moullec

Publication year - 2018

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2018.04.155

Subject(s) - computer science , throughput , resource allocation , reinforcement learning , solver , distributed computing , reuse , interference (communication) , cellular network , spectral efficiency , power control , multi armed bandit , computer network , power (physics) , wireless , channel (broadcasting) , artificial intelligence , machine learning , telecommunications , ecology , physics , quantum mechanics , biology , programming language , regret

Device to device (D2D) communication has attracted enormous attention for future cellular networks which helps to increase the cellular capacity, improve the user throughput, and extend the battery lifetime of user equipments (UEs) by reusing the spectrum resources. However, D2D devices provide interferences in the system while reusing the resources. Proper control of interferences helps to increase the performance of the overall system. Adaptive power allocation among cellular and D2D users contributes to providing an efficient interference management system. In this paper, we propose an online power allocation method, i.e., multi-armed bandit solver for D2D communication. We explore the proposed method to improve the system throughput and D2D throughput as well. We incorporate the set of states for this learning algorithm with the appropriate number of system-defined variables, which increases the observation space and consequently improve the balance of spectrum usage. Finally, we compare our proposed work with existing distributed reinforcement learning and random allocation of resources. Simulation results depict that the proposed resource allocation method outperforms the existing works regarding overall system throughput as well as D2D throughput by efficiently controlling the interference levels.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research