Handwritten Text Generation from Visual Archetypes

Pippi, V.; Cascianelli, S.; Cucchiara, R.

doi:10.1109/CVPR52729.2023.02151

Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially in the case of unseen styles and new words, and even more when these latter contain characters that are rarely encountered during training. While emulating a writer's style has been recently addressed by generative models, the generalization towards rare characters has been disregarded. In this work, we devise a Transformer-based model for Few-Shot styled handwritten text generation and focus on obtaining a robust and informative representation of both the text and the style. In particular, we propose a novel representation of the textual content as a sequence of dense vectors obtained from images of symbols written as standard GNU Unifont glyphs, which can be considered their visual archetypes. This strategy is more suitable for generating characters that, despite having been seen rarely during training, possibly share visual details with the frequently observed ones. As for the style, we obtain a robust representation of unseen writers' calligraphy by exploiting specific pre-training on a large synthetic dataset. Quantitative and qualitative results demonstrate the effectiveness of our proposal in generating words in unseen styles and with rare characters more faithfully than existing approaches relying on independent one-hot encodings of the characters.

Handwritten Text Generation from Visual Archetypes / Pippi, V.; Cascianelli, S.; Cucchiara, R.. - 2023-:(2023), pp. 22458-22467. (Intervento presentato al convegno 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 tenutosi a can nel 2023) [10.1109/CVPR52729.2023.02151].

Handwritten Text Generation from Visual Archetypes

Pippi V.;Cascianelli S.;Cucchiara R.

2023

Abstract

Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially in the case of unseen styles and new words, and even more when these latter contain characters that are rarely encountered during training. While emulating a writer's style has been recently addressed by generative models, the generalization towards rare characters has been disregarded. In this work, we devise a Transformer-based model for Few-Shot styled handwritten text generation and focus on obtaining a robust and informative representation of both the text and the style. In particular, we propose a novel representation of the textual content as a sequence of dense vectors obtained from images of symbols written as standard GNU Unifont glyphs, which can be considered their visual archetypes. This strategy is more suitable for generating characters that, despite having been seen rarely during training, possibly share visual details with the frequently observed ones. As for the style, we obtain a robust representation of unseen writers' calligraphy by exploiting specific pre-training on a large synthetic dataset. Quantitative and qualitative results demonstrate the effectiveness of our proposal in generating words in unseen styles and with rare characters more faithfully than existing approaches relying on independent one-hot encodings of the characters.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2023
			
	Titolo del Convegno
	
				2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
			
	Luogo del Convegno
	
				can
			
	Data del Convegno
	
				2023
			
	Codice DOI
	
				https://dx.doi.org/10.1109/CVPR52729.2023.02151
			
	Codice WoS
	
				WOS:001062531306076
			
	Codice Scopus
	
				2-s2.0-85173003799
			
	Serie
	
				PROCEEDINGS IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION
			
	N° del Volume
	
				2023-
			
	Pagina iniziale
	
				22458
			
	Pagina finale
	
				22467
			
	Tutti gli autori
	
						Pippi, V.; Cascianelli, S.; Cucchiara, R.
					
	Citazione
	
				Handwritten Text Generation from Visual Archetypes / Pippi, V.; Cascianelli, S.; Cucchiara, R.. - 2023-:(2023), pp. 22458-22467. (Intervento presentato al  convegno 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 tenutosi a can nel 2023) [10.1109/CVPR52729.2023.02151].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris