From Depth Data to Head Pose Estimation: a Siamese approach

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

From Depth Data to Head Pose Estimation: a Siamese approach / Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - 5:(2017), pp. 194-201. (Intervento presentato al convegno 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017) tenutosi a Porto, Portugal nel 27 february - 1 march, 2017) [10.5220/0006104501940201].

From Depth Data to Head Pose Estimation: a Siamese approach

Venturelli, Marco;BORGHI, GUIDO;VEZZANI, Roberto;CUCCHIARA, Rita

2017

Abstract

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
			2017
		
	Titolo del Convegno
	
			12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017)
		
	Luogo del Convegno
	
			Porto, Portugal
		
	Data del Convegno
	
			27 february - 1 march, 2017
		
	Codice DOI
	
			https://dx.doi.org/10.5220/0006104501940201
		
	Codice WoS
	
			WOS:000444905600020
		
	Codice Scopus
	
			2-s2.0-85036623561
		
	N° del Volume
	
			5
		
	Pagina iniziale
	
			194
		
	Pagina finale
	
			201
		
	Tutti gli autori
	
			Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita
		
	Citazione
	
			From Depth Data to Head Pose Estimation: a Siamese approach / Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - 5:(2017), pp. 194-201. (Intervento presentato al  convegno 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017) tenutosi a Porto, Portugal nel 27 february - 1 march, 2017) [10.5220/0006104501940201].
		
	Tipologia
	
			Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
VISAPP_2017_63.pdf Open access Tipologia: Versione pubblicata dall'editore Dimensione 1.5 MB Formato Adobe PDF Visualizza/Apri	1.5 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1118253

Citazioni

ND

14

14

social impact