DSpace at KOASAS: Reinforcement with Fading Memories

DSpace at KOASAS

RIMS Collection RIMS Journal Papers

Reinforcement with Fading Memories

Cited 1 time in

Cited 1 time in

Hit : 349
Download : 0

Export

Xu, Kuang / Yun, Se-Young researcher

We study the effect of imperfect memory on decision making in the context of a stochastic sequential action-reward problem. An agent chooses a sequence of actions, which generate discrete rewards at different rates. She is allowed to make new choices at rate β, whereas past rewards disappear from her memory at rate μ. We focus on a family of decision rules where the agent makes a new choice by randomly selecting an action with a probability approximately proportional to the amount of past rewards associated with each action in her memory. We provide closed form formulas for the agent’s steady-state choice distribution in the regime where the memory span is large (μ→0) and show that the agent’s success critically depends on how quickly she updates her choices relative to the speed of memory decay. If β≫μ, the agent almost always chooses the best action (that is, the one with the highest reward rate). Conversely, if β≪μ, the agent chooses an action with a probability roughly proportional to its reward rate.

Publisher: INFORMS

Issue Date: 2020-11

Language: English

Article Type: Article

Citation: MATHEMATICS OF OPERATIONS RESEARCH, v.45, no.4, pp.1258 - 1288

ISSN: 0364-765X

DOI: 10.1287/moor.2019.1031

URI: http://hdl.handle.net/10203/277436

Appears in Collection: AI-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 1 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Reinforcement with Fading Memories

This item is cited by other documents in WoS

KOASAS

Communities & Collections