DSpace at KOASAS: Sequential decision making with only return and action

DSpace at KOASAS

College of Engineering(공과대학)Kim Jaechul Graduate School of AI(김재철AI대학원)AI-Theses_Master(석사논문)

Sequential decision making with only return and action보상반환값과 행동만이 주어진 상황에서의 순차적 의사결정

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 2
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	황성주	-
dc.contributor.author	Seong, Haebin	-
dc.contributor.author	성해빈	-
dc.date.accessioned	2024-07-25T19:30:48Z	-
dc.date.available	2024-07-25T19:30:48Z	-
dc.date.issued	2023	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045740&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/320552	-
dc.description	학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2023.8,[i, 17 p. :]	-
dc.description.abstract	As recent success of transformer architectures have shown superior performance in sequence modeling, several approaches have been proposed to apply transformers in various fields, including sequential decision-making and reinforcement learning, such as the prior work on Decision Transformers. However, Markov Decision Processes (MDPs), the standard problem setting in sequential decision making and reinforcement learning, require information on the transition sequence of state, action, and reward. This information is not always available in real-world problems. In this paper, we propose a new problem setting for decision making, which is a relaxation of the MDP that requires fewer conditions, thus making it easier to apply in many real-world situations, such as robotic control or experimental design. By extending the approach used in Decision Transformers, we suggest a decision making method that leverages the sequence modeling power of transformers in this new problem setting. Additionally, we propose an active learning framework that could enable goal-oriented active learning in this new problem setting, using uncertainty modeling and sequence generation.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	순차적 의사 결정▼a강화 학습▼a의사결정 트랜스포머▼a트랜스포머 구조▼a지피티 구조▼a자기주도학습▼a불확실성 모델링▼a액티브 러닝▼a실험계획법	-
dc.subject	Sequential decision making▼aReinforcement learning▼aDecision transformer▼aTransformer architecture▼aGPT architecture▼aSelf-supervised learning▼aUncertainty modeling▼aActive learning▼aExperimental design	-
dc.title	Sequential decision making with only return and action	-
dc.title.alternative	보상반환값과 행동만이 주어진 상황에서의 순차적 의사결정	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :김재철AI대학원,	-
dc.contributor.alternativeauthor	Hwang, Sung Ju	-

Appears in Collection: AI-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Sequential decision making with only return and action보상반환값과 행동만이 주어진 상황에서의 순차적 의사결정

KOASAS

Communities & Collections