DSpace at KOASAS: Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Chemical and Biomolecular Engineering(생명화학공학과)CBE-Conference Papers(학술회의논문)

Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system

Cited 11 time in

Cited 0 time in

Hit : 106
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Jong Woo	ko
dc.contributor.author	Park, Byung Jun	ko
dc.contributor.author	Yoo, Haeun	ko
dc.contributor.author	Lee, Jay Hyung	ko
dc.contributor.author	Lee, Jong Min	ko
dc.date.accessioned	2023-09-04T08:01:07Z	-
dc.date.available	2023-09-04T08:01:07Z	-
dc.date.created	2023-09-04	-
dc.date.issued	2018-09	-
dc.identifier.citation	Joint Meeting of the 2nd IFAC Workshop on Linear Parameter Varying Systems (LPVS) / 9th IFAC Symposium on Robust Control Design (ROCOND), pp.257 - 262	-
dc.identifier.issn	2405-8963	-
dc.identifier.uri	http://hdl.handle.net/10203/312166	-
dc.description.abstract	Reinforcement learning (RL) can be used to obtain an approximate numerical solution to the Hamilton-Jacobi-Bellman (HJB) equation. Recent advances in machine learning community enable the use of deep neural networks (DNNs) to approximate high-dimensional nonlinear functions as those that occur in RL, accurately without any domain knowledge. In the standard RL setting, both system and cost structures are unknown, and the amount of data needed to obtain an accurate approximation can be impractically large. Meanwhile, when the structures are known, they can be used to solve the HJB equation efficiently. Herein, the model based globalized dual heuristic programming (GDHP) is proposed, in which the HJB equation is separated into value, costate, and policy functions. A particular class of interest in this research is finite horizon optimal tracking control (FHOC) problem. Additional issues that arise, such as time-varying functions, terminal constraints, and delta-input formulation, are addressed in the context of FHOC. The DNN structure and training algorithm suitable for FHOC are presented. A benchmark continuous reactor example is provided to illustrate the proposed approach.	-
dc.language	English	-
dc.publisher	Elsevier BV	-
dc.title	Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system	-
dc.type	Conference	-
dc.identifier.wosid	000451094400044	-
dc.identifier.scopusid	2-s2.0-85056903034	-
dc.type.rims	CONF	-
dc.citation.beginningpage	257	-
dc.citation.endingpage	262	-
dc.citation.publicationname	Joint Meeting of the 2nd IFAC Workshop on Linear Parameter Varying Systems (LPVS) / 9th IFAC Symposium on Robust Control Design (ROCOND)	-
dc.identifier.conferencecountry	BL	-
dc.identifier.conferencelocation	Florianopolis	-
dc.identifier.doi	10.1016/j.ifacol.2018.11.115	-
dc.contributor.localauthor	Lee, Jay Hyung	-
dc.contributor.nonIdAuthor	Kim, Jong Woo	-
dc.contributor.nonIdAuthor	Park, Byung Jun	-
dc.contributor.nonIdAuthor	Lee, Jong Min	-

Appears in Collection: CBE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 11 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system

This item is cited by other documents in WoS

KOASAS

Communities & Collections