Video registration in egocentric vision under day and night illumination changes

Alletto, Stefano; Serra, Giuseppe; Cucchiara, Rita

doi:10.1016/j.cviu.2016.09.010

With the spread of wearable devices and head-mounted cameras, a wide range of applications requiring precise user localization is now possible. In this paper we propose to treat the problem of obtaining the user position with respect to a known environment as a video registration problem. Video registration, i.e., the task of aligning an input video sequence to a pre-built 3D model, relies on matching local keypoints extracted from the query sequence to a 3D point cloud. The overall registration performance is closely tied to the quality of this 2D-3D matching, and can degrade under adverse environmental conditions, such as the steep lighting changes between day and night. To effectively register an egocentric video sequence under these conditions, we propose to tackle the source of the problem: the matching process. To overcome the shortcomings of standard matching techniques, we introduce a novel embedding space that yields robust matches by jointly taking into account local descriptors, their spatial arrangement, and their temporal robustness. The proposal is evaluated on unconstrained egocentric video sequences, both in terms of matching quality and of the resulting registration performance, using different 3D models of historical landmarks. The results show that the proposed method can outperform state-of-the-art registration algorithms, in particular when dealing with the challenges of night and day sequences.

Video registration in egocentric vision under day and night illumination changes / Alletto, S., Serra, G., Cucchiara, R. - In: COMPUTER VISION AND IMAGE UNDERSTANDING. - ISSN 1077-3142. - 157:(2017), pp. 274-283. [10.1016/j.cviu.2016.09.010]

Year of publication: 2017
Date of first publication: 21 Sep 2016
Journal: COMPUTER VISION AND IMAGE UNDERSTANDING
Volume: 157
First page: 274
Last page: 283
DOI: https://dx.doi.org/10.1016/j.cviu.2016.09.010
WoS code: WOS:000398430300019
Scopus code: 2-s2.0-84992741066
Document type: Journal article
