Abstract: In recent years, researchers have proposed the approximation of model predictive control (MPC) using deep neural networks (DNNs). However, a limitation arises as DNNs inherently offer one-to ...
Abstract: This paper proposes an adaptive iterative learning control (AILC) scheme for multiagent systems (MASs) to improve the containment control performance. To deal with the uncertain nonlinearity ...
Multi-step temporal-difference (TD) learning, where the update targets contain information from multiple time steps ahead, is one of the most popular forms of TD learning for linear function ...