Reinforcement Learning Spectrum Management Paradigm in Cognitive Radio using Novel State and Action Sets
Author(s) -
Zhijie Yin,
Yiming Wang,
Cheng Wu
Publication year - 2018
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2018.03.020
Subject(s) - reinforcement learning , computer science , cognitive radio , throughput , base station , spectrum management , channel (broadcasting) , markov decision process , channel allocation schemes , q learning , computer network , artificial intelligence , wireless , telecommunications , markov process , statistics , mathematics
This paper proposes a reinforcement learning(RL) model for cognitive radio(CR). By using this model, cognitive base station(CBS) can preform two-step decision of channel allocation, that is, whether to switch the channel for CR users and how to select the best channel if the CBS decides to switch, to avoid excessive channel switch and improve the throughput of the unlicensed user. Also, the performance of RL spectrum management depends highly on exploration strategy. Epsilon-greedy exploration method is used to solve the balance problem of RL decision process. Simulation results show that the implementation of the epsilon-greedy in each decision step has a remarkable effect on the system performance. The proposed method is superior to traditional RL spectrum allocation scheme in improving unlicensed users’ throughput and reducing channel switch.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom