Deep bi-directional LSTM attention via multi order difference feature for action recognition행동인식을 위한 다차원 계차 피쳐를 이용한 집중 방식의 깊은 양방향 장단기간 순환 신경망

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 529
  • Download : 0
We consider a soft attention based model for the task of action recognition in videos. We used C3D 3d cnn network for feature extractor and multi-layered Recurrent Neural Networks (RNNs) with bi-directional Long Short-Term Memory (bi-LSTM) units which are deep both spatially and temporally as a classifier. Our model learns to focus selectively on parts of the video frames and important frames and classifies videos. The model essentially learns which parts and frames are relevant for the task and attaches higher importance to them. We evaluate the model on UCF-101 (YouTube Action)datasets and compare the result without attention mechanism. The result show that it's accuracy is 93% which acurracy is comparable to other state of the art result.
Advisors
Yoo, Chang Dongresearcher유창동researcher
Description
한국과학기술원 :미래자동차학제전공,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 미래자동차학제전공, 2018.2,[iv, 20 p. :]

Keywords

C3D CNN▼abi-directional LSTM▼aattention▼aspatio-temporal bi directional lstm attention; C3D 합성곱 네트워크▼a양방향 장단기억▼a순환 신경망▼a시공간 양방향 장단기억 순환 신경망을 이용한 집중 메카니즘

URI
http://hdl.handle.net/10203/266114
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=733797&flag=dissertation
Appears in Collection
PD-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0