POSEidon: Face-from-Depth for Driver Pose Estimation

Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for understanding the pose by depth. In addition, to recover the intrinsic value of face appearance for understanding head position and orientation, we propose a new Face-from-Depth approach for learning image faces from depth. Results in face reconstruction are qualitatively impressive. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Results show that our method overcomes all recent state-of-art works, running in real time at more than 30 frames per second.

POSEidon: Face-from-Depth for Driver Pose Estimation / Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita. - 2017-:(2017), pp. 5494-5503. (Intervento presentato al convegno 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 tenutosi a Honolulu, Hawaii nel July, 22-25, 2017) [10.1109/CVPR.2017.583].

POSEidon: Face-from-Depth for Driver Pose Estimation

BORGHI, GUIDO;Venturelli, Marco;VEZZANI, Roberto;CUCCHIARA, Rita

2017

Abstract

Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for understanding the pose by depth. In addition, to recover the intrinsic value of face appearance for understanding head position and orientation, we propose a new Face-from-Depth approach for learning image faces from depth. Results in face reconstruction are qualitatively impressive. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Results show that our method overcomes all recent state-of-art works, running in real time at more than 30 frames per second.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2017
			
	Titolo del Convegno
	
				30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
			
	Luogo del Convegno
	
				Honolulu, Hawaii
			
	Data del Convegno
	
				July, 22-25, 2017
			
	Codice DOI
	
				https://dx.doi.org/10.1109/CVPR.2017.583
			
	Codice WoS
	
				WOS:000418371405062
			
	Codice Scopus
	
				2-s2.0-85032477445
			
	Serie
	
				PROCEEDINGS - IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION
			
	N° del Volume
	
				2017-
			
	Pagina iniziale
	
				5494
			
	Pagina finale
	
				5503
			
	Tutti gli autori
	
						Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita
					
	Citazione
	
				POSEidon: Face-from-Depth for Driver Pose Estimation / Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita. - 2017-:(2017), pp. 5494-5503. (Intervento presentato al  convegno 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 tenutosi a Honolulu, Hawaii nel July, 22-25, 2017) [10.1109/CVPR.2017.583].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
cvpr_2017_poseidon.pdf Open access Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione Dimensione 3.76 MB Formato Adobe PDF Visualizza/Apri	3.76 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1127609

Citazioni

ND

147

110

social impact