RefiNet: 3D Human Pose Refinement with Depth Maps

Human Pose Estimation is a fundamental task for many applications in the Computer Vision community and it has been widely investigated in the 2D domain, i.e. intensity images. Therefore, most of the available methods for this task are mainly based on 2D Convolutional Neural Networks and huge manually-annotated RGB datasets, achieving stunning results. In this paper, we propose RefiNet, a multi-stage framework that regresses an extremely-precise 3D human pose estimation from a given 2D pose and a depth map. The framework consists of three different modules, each one specialized in a particular refinement and data representation, i.e. depth patches, 3D skeleton and point clouds. Moreover, we present a new dataset, called Baracca, acquired with RGB, depth and thermal cameras and specifically created for the automotive context. Experimental results confirm the quality of the refinement procedure that largely improves the human pose estimations of off-the-shelf 2D methods.

RefiNet: 3D Human Pose Refinement with Depth Maps / D’Eusanio, Andrea; Pini, Stefano; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - (2021), pp. 2320-2327. ( 25th International Conference on Pattern Recognition, ICPR 2020 Milan 10-15 January 2021) [10.1109/ICPR48806.2021.9412451].

RefiNet: 3D Human Pose Refinement with Depth Maps

Andrea D’Eusanio;Stefano Pini;Guido Borghi;Roberto Vezzani;Rita Cucchiara

2021

Abstract

Human Pose Estimation is a fundamental task for many applications in the Computer Vision community and it has been widely investigated in the 2D domain, i.e. intensity images. Therefore, most of the available methods for this task are mainly based on 2D Convolutional Neural Networks and huge manually-annotated RGB datasets, achieving stunning results. In this paper, we propose RefiNet, a multi-stage framework that regresses an extremely-precise 3D human pose estimation from a given 2D pose and a depth map. The framework consists of three different modules, each one specialized in a particular refinement and data representation, i.e. depth patches, 3D skeleton and point clouds. Moreover, we present a new dataset, called Baracca, acquired with RGB, depth and thermal cameras and specifically created for the automotive context. Experimental results confirm the quality of the refinement procedure that largely improves the human pose estimations of off-the-shelf 2D methods.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2021
			
	Lingua/e di pubblicazione
	
				Inglese
			
	Titolo del Convegno
	
				25th International Conference on Pattern Recognition, ICPR 2020
			
	Luogo del Convegno
	
				Milan
			
	Data del Convegno
	
				10-15 January 2021
			
	Codice DOI
	
				https://dx.doi.org/10.1109/ICPR48806.2021.9412451
			
	Codice WoS
	
				WOS:000678409202056
			
	Codice Scopus
	
				2-s2.0-85110549381
			
	Serie
	
				INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION
			
	Titolo del Volume
	
				Proceedings of the 25th International Conference of Pattern Recognition
			
	Pagina iniziale
	
				2320
			
	Pagina finale
	
				2327
			
	Numero di Pagine
	
				8
			
	Codice ISBN del Volume
	
				9781728188089
			
	Nome Editore
	
				Institute of Electrical and Electronics Engineers Inc.
			
	Città Editore
	
				10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA
			
	Rilevanza del Convegno
	
				Internazionale
			
	Tutti gli autori
	
						D’Eusanio, Andrea; Pini, Stefano; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita
					
	Tipologia
	
				Atti di CONVEGNO::Relazione in Atti di Convegno
			
	Tipologia sito docente
	
				273
			
	Numero autori
	
				5
			
	Citazione
	
				RefiNet: 3D Human Pose Refinement with Depth Maps / D’Eusanio, Andrea; Pini, Stefano; Borghi, Guido; Vezzani, Roberto; Cucchiara, Rita. - (2021), pp. 2320-2327. ( 25th International Conference on Pattern Recognition, ICPR 2020 Milan 10-15 January 2021) [10.1109/ICPR48806.2021.9412451].
			
	Fulltext
	
				open
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
ICPR_2020_Human_Pose_Estimation_compressed.pdf Open access Tipologia: AO - Versione originale dell'autore proposta per la pubblicazione Dimensione 910.86 kB Formato Adobe PDF Visualizza/Apri	910.86 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1212262

Citazioni

ND

10

8

social impact