z-logo
open-access-imgOpen Access
Hierarchical Multi-agent Reinforcement Learning Method Using Energy Field in Sports Games
Author(s) -
Hoshin Lee,
Junoh Kim,
Jisun Park,
Phuong Minh Chu,
Kyungeun Cho
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3613359
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
This paper proposes an energy-field-based hierarchical multi-agent reinforcement learning method (HES-COMA) for evaluating individual agent contributions and learning efficient policies in dynamic and complex multi-agent environments such as sports games. The proposed method addresses the limitations of the conventional single-layer approach by using energy fields in a global layer to learn strategic positioning, and in a local layer to determine tactical actions (e.g., shooting, stealing, and blocking) from those positions. Specifically, the method assigns an energy value to represent the relative importance of key elements in the game space (ball, opponents, teammates, basket, and shooting probability spots), and builds a dynamically changing energy field depending on the state of play (offense, defense, free scenario, etc.). Experimental results in a commercialized 3vs3 basketball game environment show that HES-COMA achieves approximately 1.5 times faster learning speed than Counterfactual Multi-Agent Policy Gradients (COMA). It also improved the success rates of steals, rebounds, and blocks by factors of 1.38, 1.87, and 2.71, respectively. Moreover, by combining global strategic positioning information with local tactical decision-making, HES-COMA’s movement patterns more closely resemble those of users and FSM-based agents in terms of spatial utilization. Consequently, HES-COMA effectively addresses contribution evaluation and data diversity issue in dynamic multi-agent sports games, thereby boosting both learning efficiency and overall performance.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom