Conjoined architecture for heterogeneous multi-agent reinforcement learning (다기종 멀티에이전트 강화학습을 위한 결합 아키텍처)

DC Field: Value (Language)
dc.contributor.advisor: 김종환
dc.contributor.author: Hong, Chansol
dc.contributor.author: 홍찬솔
dc.date.accessioned: 2024-08-08T19:31:34Z
dc.date.available: 2024-08-08T19:31:34Z
dc.date.issued: 2024
dc.identifier.uri: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1100043&flag=dissertation (en_US)
dc.identifier.uri: http://hdl.handle.net/10203/322143
dc.description: Doctoral dissertation - Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering, 2024.2, [vi, 59 p.]
dc.description.abstract: In this dissertation, we propose the conjoined architecture, designed for multi-agent reinforcement learning in environments with a global state and heterogeneous agents. In such environments, semantic mismatches arise in both the input and output layers, hindering contemporary multi-agent reinforcement learning algorithms from training efficiently under centralized-training decentralized-execution settings with parameter sharing. In that regard, the conjoined architecture is able to train effectively in environments with a global state and heterogeneous agents. It is a partially parameter-sharing architecture in which heterogeneous agents are treated as a single team and trained together, with the global state as the input processed through a team network. Unlike traditional fully centralized training, the conjoined architecture factorizes the output joint action space into the individual agents' action spaces, represented with each agent's own weights and biases. We exemplify the use of the conjoined architecture by proposing two actor-critic algorithms: multi-actor conjoined-critic and conjoined-actor conjoined-critic. A conjoined critic evaluates all agents' actions as a single sample. Instead of evaluating joint-action-space values for all action combinations of the agents, the conjoined critic outputs individual Q-values for each agent to reduce the output dimension. Through a value decomposition network, the individual Q-values are summed to estimate the team Q-value, which is the optimization objective for the critic. In multi-actor conjoined-critic, individual actors are trained with value estimates from the conjoined critic while sharing their internal states with each other through a bandwidth-limited communication channel. In conjoined-actor conjoined-critic, a parameter-efficient conjoined actor is used in addition to the conjoined critic, replacing the individual actors. We evaluate the proposed algorithms in the AI Soccer environment, which uses a global state and heterogeneous agents, and compare the results with those of existing algorithms to demonstrate the conjoined architecture's effectiveness. Finally, we conduct ablation studies to investigate the effects of the components of the proposed algorithms.
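
To make the abstract's conjoined-critic idea concrete, here is a minimal sketch (PyTorch assumed; names such as ConjoinedCritic and team_q are hypothetical illustrations, not taken from the thesis) of a shared team network over the global state with per-agent output heads, whose individual Q-values are summed VDN-style into a team Q-value.

import torch
import torch.nn as nn

class ConjoinedCritic(nn.Module):
    """Sketch of a conjoined critic: shared team trunk, per-agent Q heads."""

    def __init__(self, state_dim: int, hidden_dim: int, action_dims: list):
        super().__init__()
        # Shared "team" parameters: heterogeneous agents are trained
        # together on the global state through this common network.
        self.team_net = nn.Sequential(
            nn.Linear(state_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
        )
        # Agent-specific heads: the joint action space is factorized into
        # each agent's own action space with its own weights and biases.
        self.agent_heads = nn.ModuleList(
            [nn.Linear(hidden_dim, a_dim) for a_dim in action_dims]
        )

    def forward(self, global_state):
        features = self.team_net(global_state)
        # One Q-vector per agent instead of one value per joint action.
        return [head(features) for head in self.agent_heads]

def team_q(per_agent_q, actions):
    """VDN-style decomposition: sum the chosen per-agent Q-values."""
    chosen = [q.gather(-1, a.unsqueeze(-1)).squeeze(-1)
              for q, a in zip(per_agent_q, actions)]
    return torch.stack(chosen).sum(dim=0)

In this reading, the output size grows with the sum of the agents' action-space sizes rather than their product, which is the dimension reduction the abstract attributes to the conjoined critic.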
dc.language: eng
dc.publisher: 한국과학기술원 (Korea Advanced Institute of Science and Technology)
dc.subject: 기계학습; 인공지능; 강화학습; 멀티에이전트 강화학습; 다기종 에이전트
dc.subject: Machine learning; Artificial intelligence; Reinforcement learning; Multi-agent reinforcement learning; Heterogeneous agents
dc.title: Conjoined architecture for heterogeneous multi-agent reinforcement learning
dc.title.alternative: 다기종 멀티에이전트 강화학습을 위한 결합 아키텍처
dc.type: Thesis (Ph.D.)
dc.identifier.CNRN: 325007
dc.description.department: Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering
dc.contributor.alternativeauthor: Kim, Jong-Hwan
Appears in Collection: EE-Theses_Ph.D. (Doctoral theses)
Files in This Item: There are no files associated with this item.
