Recently, deep learning approaches have achieved promising results in various fields of computer vision. In this paper, we tackle the problem of head pose estimation through a Convolutional Neural Network (CNN). Differently from other proposals in the literature, the described system is able to work directly and based only on raw depth data. Moreover, the head pose estimation is solved as a regression problem and does not rely on visual facial features like facial landmarks. We tested our system on a well known public dataset, extit{Biwi Kinect Head Pose}, showing that our approach achieves state-of-art results and is able to meet real time performance requirements.
Recently, deep learning approaches have achieved promising results in various fields of computer vision. In this paper, we tackle the problem of head pose estimation through a Convolutional Neural Network (CNN). Differently from other proposals in the literature, the described system is able to work directly and based only on raw depth data. Moreover, the head pose estimation is solved as a regression problem and does not rely on visual facial features like facial landmarks. We tested our system on a well known public dataset, Biwi Kinect Head Pose, showing that our approach achieves state-of-art results and is able to meet real time performance requirements.
Deep Head Pose Estimation from Depth Data for In-car Automotive Applications / Venturelli, Marco; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - 10188:(2018), pp. 74-85. (Intervento presentato al convegno 2nd International Workshop on Understanding Human Activities Through 3D Sensors, UHA3DS 2016 Held in Conjunction with the 23rd International Conference on Pattern Recognition, ICPR 2016 tenutosi a Cancun (Mexico) nel Dec 4 , 2016) [10.1007/978-3-319-91863-1_6].
Deep Head Pose Estimation from Depth Data for In-car Automotive Applications
BORGHI, GUIDO;VEZZANI, Roberto;CUCCHIARA, Rita
2018
Abstract
Recently, deep learning approaches have achieved promising results in various fields of computer vision. In this paper, we tackle the problem of head pose estimation through a Convolutional Neural Network (CNN). Differently from other proposals in the literature, the described system is able to work directly and based only on raw depth data. Moreover, the head pose estimation is solved as a regression problem and does not rely on visual facial features like facial landmarks. We tested our system on a well known public dataset, Biwi Kinect Head Pose, showing that our approach achieves state-of-art results and is able to meet real time performance requirements.File | Dimensione | Formato | |
---|---|---|---|
deep-head-pose-camera-ready.pdf
Accesso riservato
Tipologia:
Versione pubblicata dall'editore
Dimensione
3.96 MB
Formato
Adobe PDF
|
3.96 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris