Primal-Dual Q-Learning Framework for LQR Design

Cited 27 times in Web of Science · Cited 15 times in Scopus
Recently, reinforcement learning (RL) has been receiving increasing attention due to successful demonstrations in which it outperforms humans on certain challenging tasks. The goal of this paper is to study a new optimization formulation of the linear quadratic regulator (LQR) problem via Lagrangian duality theory, in order to lay theoretical foundations for potentially effective RL algorithms. The new optimization problem includes the Q-function parameters, so it can be used directly to develop Q-learning algorithms, one of the most popular classes of RL algorithms. We prove relations between saddle points of the Lagrangian function and optimal solutions of the Bellman equation. As an application, we propose a model-free primal-dual Q-learning algorithm to solve the LQR problem and demonstrate its validity through examples.
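The paper's primal-dual algorithm depends on the saddle-point formulation derived in the full text, which the abstract only summarizes. As a rough, non-authoritative illustration of the model-free Q-learning ingredient it builds on, the sketch below runs least-squares Q-learning policy iteration for a discrete-time LQR problem, in the spirit of classical LQR Q-learning rather than the paper's exact primal-dual scheme. The double-integrator system, the initial gain, and all variable names are illustrative assumptions, not the paper's notation.

```python
import numpy as np

# Hypothetical double-integrator system, used only to generate data;
# the learner below never reads A or B directly (model-free).
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.0],
              [0.1]])
Qc = np.eye(2)            # state cost weight
Rc = np.array([[1.0]])    # input cost weight
n, m = 2, 1

def phi(x, u):
    """Quadratic features: upper-triangular entries of z z^T, z = [x; u],
    with off-diagonal terms doubled so theta maps back to a symmetric H."""
    z = np.concatenate([x, u])
    outer = np.outer(z, z)
    i, j = np.triu_indices(n + m)
    scale = np.where(i == j, 1.0, 2.0)
    return outer[i, j] * scale

def H_from_theta(theta):
    """Rebuild the symmetric Q-function matrix H from its parameter vector."""
    H = np.zeros((n + m, n + m))
    i, j = np.triu_indices(n + m)
    H[i, j] = theta
    return H + H.T - np.diag(np.diag(H))

K = np.array([[-1.0, -1.7]])   # an initial stabilizing gain (assumption)
rng = np.random.default_rng(0)

for it in range(10):
    # Roll out the exploratory policy u = Kx + noise and record TD rows.
    F_rows, costs = [], []
    x = rng.normal(size=n)
    for k in range(500):
        u = K @ x + 0.3 * rng.normal(size=m)
        cost = x @ Qc @ x + u @ Rc @ u
        x_next = A @ x + B @ u
        u_next = K @ x_next                    # on-policy next action
        F_rows.append(phi(x, u) - phi(x_next, u_next))
        costs.append(cost)
        x = x_next
    # The Bellman equation Q(x, u) = cost + Q(x', Kx') is linear in theta.
    theta, *_ = np.linalg.lstsq(np.array(F_rows), np.array(costs), rcond=None)
    H = H_from_theta(theta)
    Huu, Hux = H[n:, n:], H[n:, :n]
    K = -np.linalg.solve(Huu, Hux)             # greedy policy improvement

print("learned gain K:", K)
```

Because the quadratic Q-function Q(x, u) = [x; u]^T H [x; u] is linear in the entries of H, the Bellman equation reduces to a linear least-squares problem over rollout data, and the greedy gain K = -H_uu^{-1} H_ux improves the policy without ever identifying A or B.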
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Issue Date
2019-09
Language
English
Article Type
Article
Citation

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, v.64, no.9, pp. 3756-3763

ISSN
0018-9286
DOI
10.1109/TAC.2018.2884649
URI
http://hdl.handle.net/10203/272634
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
