Multiple Spatio-Temporal Scales Neural Network for Contextual Visual Recognition of Human Actions

Cited 7 time in webofscience Cited 0 time in scopus
  • Hit : 315
  • Download : 312
This paper introduces a novel dynamic neural network model which can recognize dynamic visual image patterns of human actions based on learning. The proposed model is characterized by its capability of extracting the spatio-temporal feature hierarchy latent in the training visual image streams. The model achieves this property by integratingtwo essential ideas: (1) multiple spatial-scales processing and(2) multiple timescales processing, which have been introduced the convolutional neural network (CNN) and the multiple timescale recurrent neural network (MTRNN), respectively. The evaluation of the model performance conducted by utilizing the Weizmann dataset showed that the proposed model outperforms other neural network models in recognition of a set of prototypical human movement patterns. Furthermore, additional evaluation testing for recognition of concatenated sequences of these prototypical movement patterns indicates that the model is endowed with a remarkable capability for contextual recognition of long-range dynamic visual patterns.
Publisher
IEEE
Issue Date
2014-10-15
Language
English
Citation

The Fourth Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics (ICDL-Epirob2014)

URI
http://hdl.handle.net/10203/222938
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 7 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0