DSpace at KOASAS: Learning Stochastic Optimal Policies via Gradient Descent

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Industrial and Systems Engineering(산업및시스템공학과)IE-Journal Papers(저널논문)

Learning Stochastic Optimal Policies via Gradient Descent

Cited 0 time in webofscience

Cited 0 time in

Hit : 934
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Massaroli, Stefano	ko
dc.contributor.author	Poli, Michael	ko
dc.contributor.author	Peluchetti, Stefano	ko
dc.contributor.author	Park, Jinkyoo	ko
dc.contributor.author	Yamashita, Atsushi	ko
dc.contributor.author	Asama, Hajime	ko
dc.date.accessioned	2021-07-29T01:50:04Z	-
dc.date.available	2021-07-29T01:50:04Z	-
dc.date.created	2021-07-29	-
dc.date.created	2021-07-29	-
dc.date.created	2021-07-29	-
dc.date.issued	2022	-
dc.identifier.citation	IEEE CONTROL SYSTEMS LETTERS, v.6, pp.1094 - 1099	-
dc.identifier.issn	2475-1456	-
dc.identifier.uri	http://hdl.handle.net/10203/286888	-
dc.description.abstract	We systematically develop a learning-based treatment of stochastic optimal control (SOC), relying on direct optimization of parametric control policies. We propose a derivation of adjoint sensitivity results for stochastic differential equations through direct application of variational calculus. Then, given an objective function for a predetermined task specifying the desiderata for the controller, we optimize their parameters via iterative gradient descent methods. In doing so, we extend the range of applicability of classical SOC techniques, often requiring strict assumptions on the functional form of system and control. We verify the performance of the proposed approach on a continuous-time, finite horizon portfolio optimization with proportional transaction costs.	-
dc.language	English	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Learning Stochastic Optimal Policies via Gradient Descent	-
dc.type	Article	-
dc.identifier.scopusid	2-s2.0-85107388968	-
dc.type.rims	ART	-
dc.citation.volume	6	-
dc.citation.beginningpage	1094	-
dc.citation.endingpage	1099	-
dc.citation.publicationname	IEEE CONTROL SYSTEMS LETTERS	-
dc.identifier.doi	10.1109/LCSYS.2021.3086672	-
dc.contributor.localauthor	Park, Jinkyoo	-
dc.contributor.nonIdAuthor	Massaroli, Stefano	-
dc.contributor.nonIdAuthor	Poli, Michael	-
dc.contributor.nonIdAuthor	Peluchetti, Stefano	-
dc.contributor.nonIdAuthor	Yamashita, Atsushi	-
dc.contributor.nonIdAuthor	Asama, Hajime	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Optimal control	-
dc.subject.keywordAuthor	Indium tin oxide	-
dc.subject.keywordAuthor	Stochastic processes	-
dc.subject.keywordAuthor	Process control	-
dc.subject.keywordAuthor	Optimization	-
dc.subject.keywordAuthor	Neural networks	-
dc.subject.keywordAuthor	Noise measurement	-
dc.subject.keywordAuthor	Optimal control	-
dc.subject.keywordAuthor	stochastic processes	-
dc.subject.keywordAuthor	machine learning	-

Appears in Collection: IE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Learning Stochastic Optimal Policies via Gradient Descent

KOASAS

Communities & Collections