DSpace at KOASAS: Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Conference Papers(학술회의논문)

Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 11
Download : 0

Export

Lee, Donghwan researcher / Lim, Han-Dong / Kim, Do Wan

The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs). In our study, we adopt a distributed multi-agent framework where individual agents have access only to their own rewards, lacking insights into the rewards of other agents. Moreover, each agent has the ability to share its parameters with neighboring agents through a communication network, represented by a graph. We first introduce a novel distributed DP, inspired by the distributed optimization method of Wang and Elia. Next, a new distributed DP is introduced through a decoupling process. The convergence of the DP algorithms is proved through systems and control perspectives. The study in this paper sets the stage for new distributed temporal different learning algorithms.

Publisher: IEEE Computer Society

Issue Date: 2024-06-20

Language: English

Citation: 18th IEEE International Conference on Control and Automation, ICCA 2024, pp.960 - 967

DOI: 10.1109/ICCA62789.2024.10591854

URI: http://hdl.handle.net/10203/322402

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

KOASAS

Communities & Collections