
Scalable and Energy-Efficient Service Orchestration in the Edge-Cloud Continuum With Multi-Objective Reinforcement Learning
Author(s) -
Nicola Di Cicco,
Gaetano Francesco Pittala,
Gianluca Davoli,
Davide Borsatti,
Walter Cerroni,
Carla Raffaelli,
Massimo Tornatore
Publication year - 2025
Publication title -
ieee transactions on network and service management
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.945
H-Index - 51
eISSN - 1932-4537
DOI - 10.1109/tnsm.2025.3574131
Subject(s) - communication, networking and broadcast technologies , computing and processing
The Edge-Cloud Continuum represents a paradigm shift in distributed computing, seamlessly integrating resources from cloud data centers to edge devices. However, orchestrating services across this heterogeneous landscape poses significant challenges, as it requires finding a delicate balance between different (and competing) objectives, including service acceptance probability, offered Quality-of-Service, and network energy consumption. To address this challenge, we propose leveraging Multi-Objective Reinforcement Learning (MORL) to approximate the full Pareto Front of service orchestration policies. In contrast to conventional solutions based on single-objective RL, a MORL approach allows a network operator to inspect all possible “optimal” trade-offs, and then decide a posteriori on the orchestration policy that best satisfies the system’s operational requirements. Specifically, we first conduct an extensive measurement study to accurately model the energy consumption of heterogeneous edge devices and servers under various workloads, alongside the resource consumption of popular cloud services. Then, we develop a set-based MORL policy for service orchestration that can adapt to arbitrary network topologies without the need for retraining. Illustrative numerical results against selected heuristics show that our MORL policy outperforms baselines by 30% on average over a broad set of objective preferences, and generalizes to network topologies up to 5x larger than training.