Multi-agent deep reinforcement learning in dynamic, adversarial environment동적이고 대립적인 환경에서 다수 에이전트의 심층 강화 학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 439
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Jong Hwan-
dc.contributor.advisor김종환-
dc.contributor.authorHong, Chan Sol-
dc.date.accessioned2018-06-20T06:23:13Z-
dc.date.available2018-06-20T06:23:13Z-
dc.date.issued2017-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=718711&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/243378-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2017.8,[v, 43 p. :]-
dc.description.abstractIn this work, the capability of deep Q-network, a type of deep reinforcement learning algorithm, is examined on a dynamic, multi-agent environment AI soccer simulation game. In the AI soccer simulation game, two teams of three differential-wheel robots compete as in the real soccer game, pushing the orange-colored ball into each other’s goal area to earn more score than the opponent team. The simulation game provides various data including the top-view image of the soccer field, positions and orientations of the robots and the ball, scores, etc. to each team’s controller in every simulation step to be used as the sources for learning and playing the AI soccer game. To control three robots belonging to the home team, two or three deep Q-networks are trained on the AI soccer environment. One deep Q-network is assigned to control a goalkeeper robot. The other two robots are the attackers and controlled in two ways. In one method, one deep Q-network controls two robots simultaneously. In the other method, two deep Q-networks control two robots separately. The deep Q-networks take the top-view image of the soccer field as the input and output the ID of primitive action to be executed by the robot they control. The rewards are set as to motivate the robots to take the role of a goalkeeper and two attackers. For training the deep Q-networks, different sessions are held to train the goalkeeper and two attackers separately and then simultaneously. Through evaluation of the training sessions, the possibility for the deep Q-network to learn how to play the AI soccer game when adequate state, actions, and rewards are defined is shown.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectMachine Learning▼aDeep Reinforcement Learning▼aAI Soccer▼aArtificial Neural Network-
dc.subject기계 학습▼a심층 강화 학습▼a인공지능 축구▼a인공신경망-
dc.titleMulti-agent deep reinforcement learning in dynamic, adversarial environment-
dc.title.alternative동적이고 대립적인 환경에서 다수 에이전트의 심층 강화 학습-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthor홍찬솔-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0