Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 11
  • Download : 0
The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs). In our study, we adopt a distributed multi-agent framework where individual agents have access only to their own rewards, lacking insights into the rewards of other agents. Moreover, each agent has the ability to share its parameters with neighboring agents through a communication network, represented by a graph. We first introduce a novel distributed DP, inspired by the distributed optimization method of Wang and Elia. Next, a new distributed DP is introduced through a decoupling process. The convergence of the DP algorithms is proved through systems and control perspectives. The study in this paper sets the stage for new distributed temporal different learning algorithms.
Publisher
IEEE Computer Society
Issue Date
2024-06-20
Language
English
Citation

18th IEEE International Conference on Control and Automation, ICCA 2024, pp.960 - 967

DOI
10.1109/ICCA62789.2024.10591854
URI
http://hdl.handle.net/10203/322402
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0