UPDF AI

Approximate optimal cooperative decentralized control for consensus in a topological network of agents with uncertain nonlinear dynamics

R. Kamalapurkar,Huyen T. Dinh,P. Walters,W. Dixon

2013 · DOI: 10.1109/acc.2013.6580019
American Control Conference · 引用数 20

TLDR

Efforts in this paper seek to combine graph theory with adaptive dynamic programming (ADP) as a reinforcement learning (RL) framework to determine forward-in- time, real-time, approximate optimal controllers for distributed multi-agent systems with uncertain nonlinear dynamics.

摘要

Efforts in this paper seek to combine graph theory with adaptive dynamic programming (ADP) as a reinforcement learning (RL) framework to determine forward-in-time, real-time, approximate optimal controllers for distributed multi-agent systems with uncertain nonlinear dynamics. A decentralized continuous time-varying control strategy is proposed, using only local communication feedback from two-hop neighbors on a communication topology that has a spanning tree. An actor-critic-identifier architecture is proposed that employs a nonlinear state derivative estimator to estimate the unknown dynamics online and uses the estimate thus obtained for value function approximation.

参考文献
引用文献