Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning
Author(s) -
Yu Fei,
Vincent W. S. Wong,
Victor C. M. Leung
Publication year - 2005
Publication title -
mobile networks and applications
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.445
H-Index - 85
eISSN - 1572-8153
pISSN - 1383-469X
ISBN - 0-7695-2221-1
DOI - 10.1007/s11036-005-4464-2
Subject(s) - computer science , reinforcement learning , quality of service , markov decision process , computer network , provisioning , wireless network , call admission control , bandwidth (computing) , bandwidth allocation , wireless , handover , distributed computing , multimedia , markov process , artificial intelligence , telecommunications , statistics , mathematics
The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where it is possible to increase or decrease the bandwidth of individual ongoing flows. This paper studies the issues of quality of service (QoS) provisioning in such systems. In particular, call admission control and bandwidth adaptation are formulated as a constrained Markov decision problem. The rapid growth in the number of states and the difficulty in estimating state transition probabilities in practical systems make it very difficult to employ classical methods to find the optimal policy. We present a novel approach that uses a form of discounted reward reinforcement learning known as Q-learning to solve QoS provisioning for wireless adaptive multimedia. Q-learning does not require the explicit state transition model to solve the Markov decision problem; therefore more general and realistic assumptions can be applied to the underlying system model for this approach than in previous schemes. Moreover, the proposed scheme can efficiently handle the large state space and action set of the wireless adaptive multimedia QoS provisioning problem. Handoff dropping probability and average allocated bandwidth are considered as QoS constraints in our model and can be guaranteed simultaneously. Simulation results demonstrate the effectiveness of the proposed scheme in adaptive multimedia mobile communication networks.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom