From Depth Data to Head Pose Estimation: a Siamese approach

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

From Depth Data to Head Pose Estimation: a Siamese approach / Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - 5:(2017), pp. 194-201. ( 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017) Porto, Portugal 27 february - 1 march, 2017) [10.5220/0006104501940201].

From Depth Data to Head Pose Estimation: a Siamese approach

Venturelli, Marco;BORGHI, GUIDO;VEZZANI, Roberto;CUCCHIARA, Rita

2017

Abstract

The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose tip position. In contrast, we exploit a Convolutional Neural Network (CNN) to perform head pose estimation directly from depth data. We exploit a Siamese architecture and we propose a novel loss function to improve the learning of the regression network layer. The system has been tested on two public datasets, Biwi Kinect Head Pose and ICT-3DHP database. The reported results demonstrate the improvement in accuracy with respect to current state-of-the-art approaches and the real time capabilities of the overall framework.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2017
			
	Titolo del Convegno
	
				12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017)
			
	Luogo del Convegno
	
				Porto, Portugal
			
	Data del Convegno
	
				27 february - 1 march, 2017
			
	Codice DOI
	
				https://dx.doi.org/10.5220/0006104501940201
			
	Codice WoS
	
				WOS:000444905600020
			
	Codice Scopus
	
				2-s2.0-85036623561
			
	N° del Volume
	
				5
			
	Pagina iniziale
	
				194
			
	Pagina finale
	
				201
			
	Tutti gli autori
	
						Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita
					
	Citazione
	
				From Depth Data to Head Pose Estimation: a Siamese approach / Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - 5:(2017), pp. 194-201. ( 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP 2017) Porto, Portugal 27 february - 1 march, 2017) [10.5220/0006104501940201].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
VISAPP_2017_63.pdf Open access Tipologia: VOR - Versione pubblicata dall'editore Dimensione 1.5 MB Formato Adobe PDF Visualizza/Apri	1.5 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1118253

Citazioni

ND

14

16

social impact