Cross-modal retrieval has recently become a research hot-spot, thanks to the development of deep learning architectures. Such architectures generally learn a joint multi-modal embedding space in which text and images can be projected and compared. Here we investigate a different approach and reformulate the problem of cross-modal retrieval as that of learning a translation between the textual and visual domains. In particular, we propose an end-to-end trainable model that translates text into image features and vice versa, and regularizes this mapping with a cycle-consistency criterion. A preliminary experimental evaluation shows promising results with respect to standard visual-semantic models.
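The cycle-consistency idea can be illustrated with a minimal sketch: two translators map between the textual and visual feature spaces, and a round-trip reconstruction error penalizes mappings that are not mutually consistent. The linear maps, dimensions, and L2 penalty below are illustrative assumptions, not the paper's actual architecture or loss.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical embedding dimensions for the two domains.
TXT_DIM, IMG_DIM = 8, 8

# Linear "translators" standing in for the learned networks.
W_t2i = rng.standard_normal((IMG_DIM, TXT_DIM)) * 0.1
W_i2t = rng.standard_normal((TXT_DIM, IMG_DIM)) * 0.1

def t2i(t):
    # Translate a text embedding into image-feature space.
    return W_t2i @ t

def i2t(v):
    # Translate an image feature vector into text-embedding space.
    return W_i2t @ v

def cycle_consistency_loss(t, v):
    # Round-trip each embedding through both translators and
    # penalize the L2 reconstruction error in its home domain.
    loss_t = np.sum((i2t(t2i(t)) - t) ** 2)
    loss_v = np.sum((t2i(i2t(v)) - v) ** 2)
    return loss_t + loss_v

t = rng.standard_normal(TXT_DIM)
v = rng.standard_normal(IMG_DIM)
print(cycle_consistency_loss(t, v))
```

In training, this term would be added to the retrieval objective, encouraging the two translators to remain approximate inverses of each other.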
Towards Cycle-Consistent Models for Text and Image Retrieval / Cornia, Marcella; Baraldi, Lorenzo; Rezazadegan Tavakoli, Hamed; Cucchiara, Rita. - (2019). (Paper presented at the European Conference on Computer Vision (ECCV) Workshops, held in Munich, Germany, 8-14 September 2018) [10.1007/978-3-030-11018-5_58].