DSpace at KOASAS: Multi-agent deep reinforcement learning in dynamic, adversarial environment


Multi-agent deep reinforcement learning in dynamic, adversarial environment (동적이고 대립적인 환경에서 다수 에이전트의 심층 강화 학습)

DC Field	Value	Language
dc.contributor.advisor	Kim, Jong Hwan	-
dc.contributor.advisor	김종환	-
dc.contributor.author	Hong, Chan Sol	-
dc.date.accessioned	2018-06-20T06:23:13Z	-
dc.date.available	2018-06-20T06:23:13Z	-
dc.date.issued	2017	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=718711&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/243378	-
dc.description	Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering, 2017.8, [v, 43 p.]	-
dc.description.abstract	This work examines the capability of the deep Q-network, a deep reinforcement learning algorithm, in a dynamic, multi-agent environment: an AI soccer simulation game. In the game, two teams of three differential-wheel robots compete as in real soccer, pushing an orange ball into the opposing team's goal area to outscore the opponent. At every simulation step, the simulation provides each team's controller with data, including the top-view image of the soccer field, the positions and orientations of the robots and the ball, and the scores, to be used as sources for learning and playing the game. To control the three robots of the home team, two or three deep Q-networks are trained in the AI soccer environment. One deep Q-network controls the goalkeeper robot. The other two robots are attackers and are controlled in one of two ways: either one deep Q-network controls both robots simultaneously, or two deep Q-networks control them separately. Each deep Q-network takes the top-view image of the soccer field as input and outputs the ID of the primitive action to be executed by the robot or robots it controls. The rewards are set to motivate the robots to take the roles of a goalkeeper and two attackers. Different training sessions are held to train the goalkeeper and the two attackers, first separately and then simultaneously. Evaluation of the training sessions shows that it is possible for a deep Q-network to learn to play the AI soccer game when adequate states, actions, and rewards are defined.	-
dc.language	eng	-
dc.publisher	Korea Advanced Institute of Science and Technology (한국과학기술원)	-
dc.subject	Machine Learning; Deep Reinforcement Learning; AI Soccer; Artificial Neural Network	-
dc.subject	기계 학습 (machine learning); 심층 강화 학습 (deep reinforcement learning); 인공지능 축구 (AI soccer); 인공신경망 (artificial neural network)	-
dc.title	Multi-agent deep reinforcement learning in dynamic, adversarial environment	-
dc.title.alternative	동적이고 대립적인 환경에서 다수 에이전트의 심층 강화 학습	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering	-
dc.contributor.alternativeauthor	홍찬솔	-
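
The abstract describes two ways of controlling the attackers: one deep Q-network choosing for both robots at once, or two deep Q-networks choosing independently. The following is a minimal Python sketch of how that difference might look at the action-selection level. It is not taken from the thesis. The number of primitive actions (`N_ACTIONS`), the row-major encoding of a joint action ID, the epsilon-greedy exploration, and all function names are assumptions made for illustration. The Q-values are passed in as plain lists, standing in for the output of a network that reads the top-view image. `td_target` is the standard one-step Q-learning target commonly used to train a deep Q-network.

```python
import random

# Hypothetical number of primitive actions per robot; the abstract does not state it.
N_ACTIONS = 5


def greedy(q_values):
    """Return the index of the largest Q-value (ties go to the lowest index)."""
    return max(range(len(q_values)), key=lambda i: q_values[i])


def epsilon_greedy(q_values, epsilon, rng=random):
    """Explore with probability epsilon, otherwise act greedily (assumed policy)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return greedy(q_values)


def decode_joint(index, n_actions=N_ACTIONS):
    """Split one joint action ID into (attacker 1 action, attacker 2 action)."""
    return divmod(index, n_actions)


def act_joint(q_joint, epsilon=0.0):
    """One network for both attackers: N_ACTIONS**2 outputs, a single choice."""
    return decode_joint(epsilon_greedy(q_joint, epsilon))


def act_separate(q_first, q_second, epsilon=0.0):
    """Two networks, one per attacker: each picks its own primitive action ID."""
    return epsilon_greedy(q_first, epsilon), epsilon_greedy(q_second, epsilon)


def td_target(reward, next_q_values, gamma=0.99, done=False):
    """One-step Q-learning target: r, or r + gamma * max Q(s', a') if not terminal."""
    return reward if done else reward + gamma * max(next_q_values)
```

Under these assumptions, the single-network variant has an output layer that grows with the square of the per-robot action count, while the two-network variant keeps each output layer at the per-robot size but lets each attacker choose without seeing the other's choice.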

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.


KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
