
Rule-Based Reinforcement Learning on FPGA for QoS-Aware Dynamic Frequency Scaling
Author(s) -
Florian Maurer,
Michael Meidinger,
Yiming Lu,
Matthias Schlemmer,
Thomas Hallermeier,
Anmol Surhonne,
Thomas Wild,
Andreas Herkersdorf
Publication year - 2025
Publication title -
ieee transactions on very large scale integration (vlsi) systems
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.506
H-Index - 105
eISSN - 1557-9999
pISSN - 1063-8210
DOI - 10.1109/tvlsi.2025.3592300
Subject(s) - components, circuits, devices and systems , computing and processing
To improve the system performance of multiprocessor system-on-chips (MPSoCs), modern processors have several built-in hardware features, such as prefetchers, which respond to short-term variations in processor load that occurs on a submillisecond scale. However, even the latest dynamic (voltage) frequency scaling governors using reinforcement learning (RL) are implemented in software and, thus, cannot take advantage of these variations. In this work, we propose a hardware RL agent, augmented with preemptive shielding and eligibility traces, to optimize the execution of deadline-bound quality-of-service (QoS) tasks in mixed-critical environments. We demonstrate the features of our algorithm in a hardware-in-the-loop simulation by running LLVM’s single-source benchmarks on SparcV8 processors. We also present our field-programmable gate array (FPGA) implementation with optimized resource usage and timing performance achieved through quantization and approximation.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom