Dynamic resource allocation during reinforcement learning accounts for ramping and phasic dopamine activity강화학습을 위한 동적 자원할당과 관련된 도파민 활동 패턴 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 219
  • Download : 0
Despite the complexity and dynamic nature of the environment, animals find a way to maximize the amount of future reward without any supervision or a complete knowledge of the environment. Accumulating evidence shows that this behavior can be explained by reinforcement learning (RL). According to the RL theory, animals learn to predict future reward through trial and error. As the learning process is constrained by the limitations of time and resources, a biological agent should deal with the tradeoff between task performance and resource consumption. This study investigated whether the performance–efficiency tradeoff is reflected in the activity of dopamine neurons, the neural substrate that is deeply involved in the RL process. The main contributions of this study are as follows. First, we found that RL with dynamic resource allocation accounts for the ramping and phasic activity of dopamine neurons. Second, we showed that dopamine activity further explains how animals resolve the bias–variance tradeoff.
Advisors
Lee, Sang Wanresearcher이상완researcher
Description
한국과학기술원 :바이오및뇌공학과,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 바이오및뇌공학과, 2020.8,[ⅳ, 64 p. :]

Keywords

dopamine▼areinforcement learning▼atemporal-difference learning model▼aresource allocation▼ahabit▼aeligibility trace▼abias-variance tradeoff▼aramping dopamine▼aknowledge▼asalience; 도파민▼a강화학습▼a시간차 학습 모델▼a자원 할당▼a습관▼a적격 흔적도▼a편향-분산 균형▼a증가하는 도파민▼a지식▼a현저함

URI
http://hdl.handle.net/10203/284304
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=924264&flag=dissertation
Appears in Collection
BiS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0