State-of-the-art Computer Vision techniques exploit the availability of large-scale datasets, most of which consist of images captured from the world as it is. This leads to an incompatibility between such methods and digital data from the artistic domain, on which current techniques under-perform. A possible solution is to reduce the domain shift at the pixel level, thus translating artistic images to realistic copies. In this paper, we present a model capable of translating paintings to photo-realistic images, trained without paired examples. The idea is to enforce a patch-level similarity between real and generated images, aiming to reproduce photo-realistic details from a memory bank of real images. This is subsequently adopted in the context of an unpaired image-to-image translation framework, mapping each image from one distribution to a new one belonging to the other distribution. Qualitative and quantitative results are presented on Monet, Cezanne and Van Gogh painting translation tasks, showing that our approach increases the realism of generated images with respect to the CycleGAN approach.
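The patch-level similarity idea described in the abstract can be illustrated with a toy sketch. The abstract does not specify the patch size, the feature space, or the similarity measure, so everything below is an assumption made for illustration: patches are raw grayscale pixels, similarity is cosine similarity, and the hypothetical `patch_similarity_loss` averages the distance from each generated patch to its closest patch in the memory bank. It is not the authors' implementation.

```python
import math


def extract_patches(image, size, stride):
    """Flatten every size x size patch of a 2D grayscale image (list of rows)."""
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            patch = [image[top + dy][left + dx]
                     for dy in range(size) for dx in range(size)]
            patches.append(patch)
    return patches


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def patch_similarity_loss(generated, memory_bank, size=2, stride=2):
    """Mean of (1 - best cosine similarity) over all patches of `generated`.

    Assumption: raw pixel patches and cosine similarity stand in for
    whatever representation and measure the paper actually uses.
    """
    patches = extract_patches(generated, size, stride)
    total = 0.0
    for patch in patches:
        # Retrieve the most similar real patch from the memory bank.
        best = max(cosine_similarity(patch, real) for real in memory_bank)
        total += 1.0 - best
    return total / len(patches)


# Toy "real" image used to fill the memory bank (values are made up).
real_image = [[0.1, 0.9, 0.2, 0.8],
              [0.9, 0.1, 0.8, 0.2],
              [0.3, 0.7, 0.4, 0.6],
              [0.7, 0.3, 0.6, 0.4]]
memory_bank = extract_patches(real_image, size=2, stride=2)

# An image built from the same patches should score (near) zero loss;
# a flat gray image has no textured detail and should score higher.
loss_same = patch_similarity_loss(real_image, memory_bank)
flat_image = [[0.5] * 4 for _ in range(4)]
loss_flat = patch_similarity_loss(flat_image, memory_bank)
```

In a training setting, a term of this kind would presumably be added to the objective of the unpaired translation framework so that generated patches are pulled toward real ones; how the terms are weighted is not stated in the abstract.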

What was Monet seeing while painting? Translating artworks to photo-realistic images / Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita. - (2019). (Paper presented at the European Conference on Computer Vision (ECCV) Workshops, held in Munich, Germany, on 8-14 September 2018) [10.1007/978-3-030-11012-3_46].

What was Monet seeing while painting? Translating artworks to photo-realistic images

Tomei, Matteo; Baraldi, Lorenzo; Cornia, Marcella; Cucchiara, Rita
2019

European Conference on Computer Vision (ECCV) Workshops
Munich, Germany
8-14 September 2018
Files in this item:
2018-eccvw-art.pdf
Open access
Type: Author's revised version accepted for publication
Size: 3.73 MB
Format: Adobe PDF

Creative Commons License
The metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while the publication files are released under the Attribution 4.0 International (CC BY 4.0) license, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11380/1164580