What was Monet seeing while painting? Translating artworks to photo-realistic images

State of the art Computer Vision techniques exploit the availability of large-scale datasets, most of which consist of images captured from the world as it is. This brings to an incompatibility between such methods and digital data from the artistic domain, on which current techniques under-perform. A possible solution is to reduce the domain shift at the pixel level, thus translating artistic images to realistic copies. In this paper, we present a model capable of translating paintings to photo-realistic images, trained without paired examples. The idea is to enforce a patch level similarity between real and generated images, aiming to reproduce photo-realistic details from a memory bank of real images. This is subsequently adopted in the context of an unpaired image-to-image translation framework, mapping each image from one distribution to a new one belonging to the other distribution. Qualitative and quantitative results are presented on Monet, Cezanne and Van Gogh paintings translation tasks, showing that our approach increases the realism of generated images with respect to the CycleGAN approach.

What was Monet seeing while painting? Translating artworks to photo-realistic images / Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita. - (2019). (Intervento presentato al convegno European Conference on Computer Vision (ECCV) Workshops tenutosi a Munich, Germany nel 8-14 September 2018) [10.1007/978-3-030-11012-3_46].

What was Monet seeing while painting? Translating artworks to photo-realistic images

TOMEI, MATTEO;Baraldi, Lorenzo;Cornia, Marcella;Cucchiara, Rita

2019

Abstract

State of the art Computer Vision techniques exploit the availability of large-scale datasets, most of which consist of images captured from the world as it is. This brings to an incompatibility between such methods and digital data from the artistic domain, on which current techniques under-perform. A possible solution is to reduce the domain shift at the pixel level, thus translating artistic images to realistic copies. In this paper, we present a model capable of translating paintings to photo-realistic images, trained without paired examples. The idea is to enforce a patch level similarity between real and generated images, aiming to reproduce photo-realistic details from a memory bank of real images. This is subsequently adopted in the context of an unpaired image-to-image translation framework, mapping each image from one distribution to a new one belonging to the other distribution. Qualitative and quantitative results are presented on Monet, Cezanne and Van Gogh paintings translation tasks, showing that our approach increases the realism of generated images with respect to the CycleGAN approach.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2019
			
	Data di prima pubblicazione
	
				2019
			
	Titolo del Convegno
	
				European Conference on Computer Vision (ECCV) Workshops
			
	Luogo del Convegno
	
				Munich, Germany
			
	Data del Convegno
	
				8-14 September 2018
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-030-11012-3_46
			
	Codice WoS
	
				WOS:000594380500046
			
	Codice Scopus
	
				2-s2.0-85061822357
			
	Serie
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	Tutti gli autori
	
						Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita
					
	Citazione
	
				What was Monet seeing while painting? Translating artworks to photo-realistic images / Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita. - (2019). (Intervento presentato al  convegno European Conference on Computer Vision (ECCV) Workshops tenutosi a Munich, Germany nel 8-14 September 2018) [10.1007/978-3-030-11012-3_46].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
2018-eccvw-art.pdf Open access Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione Dimensione 3.73 MB Formato Adobe PDF Visualizza/Apri	3.73 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1164580

Citazioni

ND

3

2

social impact