Distributed Online Planning for Min-Max Problems in Networked Markov Games

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 6
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorTzikas, Alexandros E.ko
dc.contributor.authorPark, Jinkyooko
dc.contributor.authorKochenderfer, Mykel J.ko
dc.contributor.authorAllen, Ross E.ko
dc.date.accessioned2024-09-30T05:00:07Z-
dc.date.available2024-09-30T05:00:07Z-
dc.date.created2024-09-30-
dc.date.issued2024-07-
dc.identifier.citationIEEE ROBOTICS AND AUTOMATION LETTERS, v.9, no.7, pp.6656 - 6663-
dc.identifier.issn2377-3766-
dc.identifier.urihttp://hdl.handle.net/10203/323332-
dc.description.abstractMin-max problems are important in multi-agent sequential decision-making because they improve the performance of the worst-performing agent in the network. However, solving the multi-agent min-max problem is challenging. We propose a modular, distributed, online planning-based algorithm that is able to approximate the solution of the min-max objective in networked Markov games, assuming that the agents communicate within a network topology and the transition and reward functions are neighborhood-dependent. This set-up is encountered in the multi-robot setting. Our method consists of two phases at every planning step. In the first phase, each agent obtains sample returns based on its local reward function, by performing online planning. Using the samples from online planning, each agent constructs a concave approximation of its underlying local return as a function of only the action of its neighborhood at the next planning step. In the second phase, the agents deploy a distributed optimization framework that converges to the optimal immediate next action for each agent, based on the function approximations of the first phase. We demonstrate our algorithm's performance through formation control simulations.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleDistributed Online Planning for Min-Max Problems in Networked Markov Games-
dc.typeArticle-
dc.identifier.wosid001246186700009-
dc.identifier.scopusid2-s2.0-85195374609-
dc.type.rimsART-
dc.citation.volume9-
dc.citation.issue7-
dc.citation.beginningpage6656-
dc.citation.endingpage6663-
dc.citation.publicationnameIEEE ROBOTICS AND AUTOMATION LETTERS-
dc.identifier.doi10.1109/LRA.2024.3409028-
dc.contributor.localauthorPark, Jinkyoo-
dc.contributor.nonIdAuthorTzikas, Alexandros E.-
dc.contributor.nonIdAuthorKochenderfer, Mykel J.-
dc.contributor.nonIdAuthorAllen, Ross E.-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorMinimax techniques-
dc.subject.keywordAuthorGames-
dc.subject.keywordAuthorPlanning-
dc.subject.keywordAuthorOptimization-
dc.subject.keywordAuthorDecision making-
dc.subject.keywordAuthorRobots-
dc.subject.keywordAuthorCosts-
dc.subject.keywordAuthorDistributed robot systems-
dc.subject.keywordAuthorMarkov games-
dc.subject.keywordAuthormin-max optimization-
dc.subject.keywordAuthornetworked robots-
dc.subject.keywordAuthorplanning under uncertainty-
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0