Video Event Classification Using Bag of Words and String Kernels

The recognition of events in videos is a relevant and challenging task of automatic semantic video analysis. At present one of the most successful frameworks, used for object recognition tasks, is the bag-of-words (BoW) approach. However this approach does not model the temporal information of the video stream. In this paper we present a method to introduce temporal information within the BoW approach. Events are modeled as a sequence composed of histograms of visual features, computed from each frame using the traditional BoW model. The sequences are treated as strings where each histogram is considered as a character. Event classification of these sequences of variable size, depending on the length of the video clip, are performed using SVM classifiers with a string kernel that uses the Needlemann-Wunsch edit distance. Experimental results, performed on two datasets, soccer video and TRECVID 2005, demonstrate the validity of the proposed approach. © 2009 Springer Berlin Heidelberg.

Video Event Classification Using Bag of Words and String Kernels / Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Serra, Giuseppe. - STAMPA. - 5716:(2009), pp. 170-178. ( 15th International Conference on Image Analysis and Processing - ICIAP 2009, Proceedings Vietri sul Mare, ita September 8-11, 2009) [10.1007/978-3-642-04146-4_20].

Video Event Classification Using Bag of Words and String Kernels

Lamberto Ballan;Marco Bertini;Alberto Del Bimbo;SERRA, GIUSEPPE

2009

Abstract

The recognition of events in videos is a relevant and challenging task of automatic semantic video analysis. At present one of the most successful frameworks, used for object recognition tasks, is the bag-of-words (BoW) approach. However this approach does not model the temporal information of the video stream. In this paper we present a method to introduce temporal information within the BoW approach. Events are modeled as a sequence composed of histograms of visual features, computed from each frame using the traditional BoW model. The sequences are treated as strings where each histogram is considered as a character. Event classification of these sequences of variable size, depending on the length of the video clip, are performed using SVM classifiers with a string kernel that uses the Needlemann-Wunsch edit distance. Experimental results, performed on two datasets, soccer video and TRECVID 2005, demonstrate the validity of the proposed approach. © 2009 Springer Berlin Heidelberg.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2009
			
	Titolo del Convegno
	
				15th International Conference on Image Analysis and Processing - ICIAP 2009, Proceedings
			
	Luogo del Convegno
	
				Vietri sul Mare, ita
			
	Data del Convegno
	
				September 8-11, 2009
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-642-04146-4_20
			
	Codice WoS
	
				WOS:000279101900019
			
	Codice Scopus
	
				2-s2.0-76249100989
			
	Serie
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	N° del Volume
	
				5716
			
	Pagina iniziale
	
				170
			
	Pagina finale
	
				178
			
	Tutti gli autori
	
						Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Serra, Giuseppe
					
	Citazione
	
				Video Event Classification Using Bag of Words and String Kernels / Lamberto, Ballan; Marco, Bertini; Alberto Del, Bimbo; Serra, Giuseppe. - STAMPA. - 5716:(2009), pp. 170-178. ( 15th International Conference on Image Analysis and Processing - ICIAP 2009, Proceedings Vietri sul Mare, ita September 8-11, 2009) [10.1007/978-3-642-04146-4_20].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/979926

Citazioni

ND

14

10

social impact