Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for understanding the pose by depth. In addition, to recover the intrinsic value of face appearance for understanding head position and orientation, we propose a new Face-from-Depth approach for learning image faces from depth. Results in face reconstruction are qualitatively impressive. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Results show that our method overcomes all recent state-of-art works, running in real time at more than 30 frames per second.

POSEidon: Face-from-Depth for Driver Pose Estimation / Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita. - 2017-:(2017), pp. 5494-5503. (Intervento presentato al convegno 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 tenutosi a Honolulu, Hawaii nel July, 22-25, 2017) [10.1109/CVPR.2017.583].

POSEidon: Face-from-Depth for Driver Pose Estimation

BORGHI, GUIDO;VEZZANI, Roberto;CUCCHIARA, Rita
2017

Abstract

Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon, which is composed of three independent convolutional nets followed by a fusion layer, specially conceived for understanding the pose by depth. In addition, to recover the intrinsic value of face appearance for understanding head position and orientation, we propose a new Face-from-Depth approach for learning image faces from depth. Results in face reconstruction are qualitatively impressive. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Results show that our method overcomes all recent state-of-art works, running in real time at more than 30 frames per second.
2017
30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
Honolulu, Hawaii
July, 22-25, 2017
2017-
5494
5503
Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita
POSEidon: Face-from-Depth for Driver Pose Estimation / Borghi, Guido; Venturelli, Marco; Vezzani, Roberto; Cucchiara, Rita. - 2017-:(2017), pp. 5494-5503. (Intervento presentato al convegno 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 tenutosi a Honolulu, Hawaii nel July, 22-25, 2017) [10.1109/CVPR.2017.583].
File in questo prodotto:
File Dimensione Formato  
cvpr_2017_poseidon.pdf

Open access

Tipologia: Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione 3.76 MB
Formato Adobe PDF
3.76 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1127609
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 138
  • ???jsp.display-item.citation.isi??? 97
social impact