Scalable and Energy-Efficient Service Orchestration in the Edge-Cloud Continuum With Multi-Objective Reinforcement Learning | Zendy

Nicola Di Cicco | Zendy; Gaetano Francesco Pittala | Zendy; Gianluca Davoli | Zendy; Davide Borsatti | Zendy; Walter Cerroni | Zendy; Carla Raffaelli | Zendy; Massimo Tornatore | Zendy

Open Access

Scalable and Energy-Efficient Service Orchestration in the Edge-Cloud Continuum With Multi-Objective Reinforcement Learning

Author(s) -

Nicola Di Cicco,

Gaetano Francesco Pittala,

Gianluca Davoli,

Davide Borsatti,

Walter Cerroni,

Carla Raffaelli,

Massimo Tornatore

Publication year - 2025

Publication title -

ieee transactions on network and service management

Language(s) - English

Resource type - Magazines

SCImago Journal Rank - 0.945

H-Index - 51

eISSN - 1932-4537

DOI - 10.1109/tnsm.2025.3574131

Subject(s) - communication, networking and broadcast technologies , computing and processing

The Edge-Cloud Continuum represents a paradigm shift in distributed computing, seamlessly integrating resources from cloud data centers to edge devices. However, orchestrating services across this heterogeneous landscape poses significant challenges, as it requires finding a delicate balance between different (and competing) objectives, including service acceptance probability, offered Quality-of-Service, and network energy consumption. To address this challenge, we propose leveraging Multi-Objective Reinforcement Learning (MORL) to approximate the full Pareto Front of service orchestration policies. In contrast to conventional solutions based on single-objective RL, a MORL approach allows a network operator to inspect all possible “optimal” trade-offs, and then decide a posteriori on the orchestration policy that best satisfies the system’s operational requirements. Specifically, we first conduct an extensive measurement study to accurately model the energy consumption of heterogeneous edge devices and servers under various workloads, alongside the resource consumption of popular cloud services. Then, we develop a set-based MORL policy for service orchestration that can adapt to arbitrary network topologies without the need for retraining. Illustrative numerical results against selected heuristics show that our MORL policy outperforms baselines by 30% on average over a broad set of objective preferences, and generalizes to network topologies up to 5x larger than training.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research