Bridging vision and commonsense for multimodal situation recognition in pervasive systems

Pervasive services may have to rely on multimodal classification to implement situation-recognition. However, the effectiveness of current multimodal classifiers is often not satisfactory. In this paper, we describe a novel approach to multimodal classification based on integrating a vision sensor with a commonsense knowledge base. Specifically, our approach is based on extracting the individual objects perceived by a camera and classifying them individually with non-parametric algorithms; then, using a commonsense knowledge base, classifying the overall scene with high effectiveness. Such classification results can then be fused together with other sensors, again on a commonsense basis, for both improving classification accuracy and dealing with missing labels. Experimental results are presented to assess, under different configurations, the effectiveness of our vision sensor and its integration with other kinds of sensors, proving that the approach is effective and able to correctly recognize a number of situations in open-ended environments.

Bridging vision and commonsense for multimodal situation recognition in pervasive systems / Bicocchi, Nicola; Lasagni, Matteo; Zambonelli, Franco. - STAMPA. - (2012), pp. 48-56. (Intervento presentato al convegno 10th IEEE International Conference on Pervasive Computing and Communications, PerCom 2012 tenutosi a Lugano, CH nel 19 - 23 March) [10.1109/PerCom.2012.6199848].

Bridging vision and commonsense for multimodal situation recognition in pervasive systems

BICOCCHI, Nicola;LASAGNI, Matteo;ZAMBONELLI, Franco

2012

Abstract

Pervasive services may have to rely on multimodal classification to implement situation-recognition. However, the effectiveness of current multimodal classifiers is often not satisfactory. In this paper, we describe a novel approach to multimodal classification based on integrating a vision sensor with a commonsense knowledge base. Specifically, our approach is based on extracting the individual objects perceived by a camera and classifying them individually with non-parametric algorithms; then, using a commonsense knowledge base, classifying the overall scene with high effectiveness. Such classification results can then be fused together with other sensors, again on a commonsense basis, for both improving classification accuracy and dealing with missing labels. Experimental results are presented to assess, under different configurations, the effectiveness of our vision sensor and its integration with other kinds of sensors, proving that the approach is effective and able to correctly recognize a number of situations in open-ended environments.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2012
			
	Titolo del Convegno
	
				10th IEEE International Conference on Pervasive Computing and Communications, PerCom 2012
			
	Luogo del Convegno
	
				Lugano, CH
			
	Data del Convegno
	
				19 - 23 March
			
	Codice DOI
	
				https://dx.doi.org/10.1109/PerCom.2012.6199848
			
	Codice WoS
	
				WOS:000309103700007
			
	Codice Scopus
	
				2-s2.0-84861588136
			
	Pagina iniziale
	
				48
			
	Pagina finale
	
				56
			
	Tutti gli autori
	
						Bicocchi, Nicola; Lasagni, Matteo; Zambonelli, Franco
					
	Citazione
	
				Bridging vision and commonsense for multimodal situation recognition in pervasive systems / Bicocchi, Nicola; Lasagni, Matteo; Zambonelli, Franco. - STAMPA. - (2012), pp. 48-56. (Intervento presentato al  convegno 10th IEEE International Conference on Pervasive Computing and Communications, PerCom 2012 tenutosi a Lugano, CH nel 19 - 23 March) [10.1109/PerCom.2012.6199848].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/738479

Citazioni

ND

15

10

social impact