In this paper we propose a new method for human action categorization by using an effective combination of a new 3D gradient descriptor with an optic flow descriptor, to represent spatio-temporal interest points. These points are used to represent video sequences using a bag of spatio-temporal visual words, following the successful results achieved in object and scene classification. We extensively test our approach on the standard KTH and Weizmann actions datasets, showing its validity and good performance. Experimental results outperform state-of-the-art methods, without requiring fine parameter tuning.

Recognizing Human Actions by Fusing Spatio-temporal Appearance and Motion Descriptors / Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Lorenzo, Seidenari; Serra, Giuseppe. - STAMPA. - (2009), pp. 3569-3572. ( 2009 IEEE International Conference on Image Processing, ICIP 2009 Cairo, egy 7-10 Nov. 2009) [10.1109/ICIP.2009.5414332].

Recognizing Human Actions by Fusing Spatio-temporal Appearance and Motion Descriptors

SERRA, GIUSEPPE
2009

Abstract

In this paper we propose a new method for human action categorization by using an effective combination of a new 3D gradient descriptor with an optic flow descriptor, to represent spatio-temporal interest points. These points are used to represent video sequences using a bag of spatio-temporal visual words, following the successful results achieved in object and scene classification. We extensively test our approach on the standard KTH and Weizmann actions datasets, showing its validity and good performance. Experimental results outperform state-of-the-art methods, without requiring fine parameter tuning.
2009
Inglese
2009 IEEE International Conference on Image Processing, ICIP 2009
Cairo, egy
7-10 Nov. 2009
Proc. of IEEE International Conference on Image Processing (ICIP)
3569
3572
9781424456543
IEEE Computer Society
STATI UNITI D'AMERICA
345 E 47TH ST, NEW YORK, NY 10017 USA
Internazionale
Contributo
Action recognition; spatio-temporal descriptors; bag-of-words
Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Lorenzo, Seidenari; Serra, Giuseppe
Atti di CONVEGNO::Relazione in Atti di Convegno
273
5
Recognizing Human Actions by Fusing Spatio-temporal Appearance and Motion Descriptors / Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Lorenzo, Seidenari; Serra, Giuseppe. - STAMPA. - (2009), pp. 3569-3572. ( 2009 IEEE International Conference on Image Processing, ICIP 2009 Cairo, egy 7-10 Nov. 2009) [10.1109/ICIP.2009.5414332].
none
info:eu-repo/semantics/conferenceObject
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/979904
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 32
  • ???jsp.display-item.citation.isi??? 20
social impact