Deep Operator Neural Network Model Predictive Control | Zendy

T.O. de Jong | Zendy; K. Shukla | Zendy; M. Lazar | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Deep Operator Neural Network Model Predictive Control

Author(s) -

T.O. de Jong,

K. Shukla,

M. Lazar

Publication year - 2025

Publication title -

ieee open journal of control systems

Language(s) - English

Resource type - Magazines

eISSN - 2694-085X

DOI - 10.1109/ojcsys.2025.3614875

Subject(s) - robotics and control systems

In this paper, we consider the design of model predictive control (MPC) algorithms based on deep operator neural networks (DeepONets) [1]. These neural networks are capable of accurately approximating real- and complex-valued solutions [2] of continuous-time nonlinear systems without relying on recurrent architectures. The DeepONet architecture is made up of two feedforward neural networks: the branch network, which encodes the input function space, and the trunk network, which represents dependencies on temporal variables or initial conditions. Utilizing the original DeepONet architecture [1] as a predictor within MPC for Multi-Input Multi-Output (MIMO) systems requires multiple branch networks, to generate multi-output predictions, one for each input. Moreover, to predict multiple time steps into the future, the network has to be evaluated multiple times. Motivated by this, we introduce a multi-step DeepONet (MS-DeepONet) architecture that computes in one-shot multi-step predictions of system outputs from multi-step input sequences, which is better suited for MPC. We prove that the MS-DeepONet is a universal approximator in terms of multi-step sequence prediction. Additionally, we develop automated hyperparameter selection strategies and implement MPC frameworks using both the standard DeepONet and the proposed MS-DeepONet architectures in PyTorch. We compare MS-DeepONet, standard DeepONet, and LSTM-based controllers on learning and predictive control tasks for the Van der Pol oscillator and the quadruple tank process. The MS-DeepONet is also evaluated on a challenging cart–pendulum system, where it successfully learns swing-up and stabilization policies. Across the examples, MS-DeepONet outperforms standard DeepONet in prediction accuracy and control performance, and achieves significantly lower computation times than Long Short-Term Memory (LSTM) based MPC. The code is publicly available on: https://github.com/todejong/Deep-Operator-Neural-Network-Model-Predictive-Control.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research