Artificial learning and evolving brain for robots : synaptic plasticity models for hebbian learning and reinforcement learning = 진화/학습이 가능한 인공 로봇 두뇌 개발 : 헤비안 및 강화 학습을 위한 시냅스 가소성 모델synaptic plasticity models for hebbian learning and reinforcement learning

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 491
  • Download : 0
The ultimate goal of this research is developing an artificial brain for robots. On the proposed hypothesis that the fundamental functions of animals’ intelligence are auto-classification and reinforcement learning, several basic synaptic plasticity models have been developed. Researchers believe that STDP (Spike Timing Dependent Plasticity) is an essential brain function for auto-classification and that dopamine system plays an important role for reinforcement learning. A rate-coded STDP (Activity-Variation-Timing Dependent Plasticity, AVTDP) has been derived from the kinetic STDP model of Senn. AVTDP is a simple and efficient model and preserves the timing dependent property. A method to interpret the plasticity mechanism has been proposed in graphical manner. The similarity between AVTDP and STDP is shown through a series of simulations. It is shown that several conditions exist for the parameters that allow the similarity to become valid. Because the formula simply consists of differentiation, multiplication, addition and subtraction, the model is suitable for implementation not only in computer programs but also in electric circuits. It is believed that the proposed model effectively capitalizes on both the rate code and the timing dependent plasticity of the spike code. A synaptic reinforcement algorithm has been developed. The algorithm determines which synapses are to be potentiated or depressed by reward signals. A simulation is conducted to demonstrate how the interaction between the synaptic eligibility and the reward signal influences synaptic plasticity. The use of the pre- and postsynaptic spike correlator (PPSC) is proposed for reinforcement learning in a spiking neural network. The PPSC is used to determine the synaptic pathway eligible for reward. It represents the synaptic eligibility that increases only if the postsynaptic spike occurred shortly after a presynaptic spike. The magnitude of the synaptic eligibility exponentially decreases as a ...
Advisors
Kwon, Dong-Sooresearcher권동수researcher
Description
한국과학기술원 : 기계공학전공,
Publisher
한국과학기술원
Issue Date
2008
Identifier
295282/325007  / 020005212
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 기계공학전공, 2008.2, [ ix, 116 p. ]

Keywords

synaptic plasticity; Spike-Timing Dependent Plasticity (STDP); dopamine system; 시냅스 가소성; 발화 시간 기반 가소성; 도파민 시스템; synaptic plasticity; Spike-Timing Dependent Plasticity (STDP); dopamine system; 시냅스 가소성; 발화 시간 기반 가소성; 도파민 시스템

URI
http://hdl.handle.net/10203/43320
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=295282&flag=dissertation
Appears in Collection
ME-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0