Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities

Baraldi, Lorenzo; Cornia, Marcella; Grana, Costantino; Cucchiara, Rita

doi:10.1109/ICPR.2018.8545064

While several approaches to bring vision and language together are emerging, none of them has yet addressed the digital humanities domain, which, nevertheless, is a rich source of visual and textual data. To foster research in this direction, we investigate the learning of visual-semantic embeddings for historical document illustrations, devising both supervised and semi-supervised approaches. We exploit the joint visual-semantic embeddings to automatically align illustrations and textual elements, thus providing an automatic annotation of the visual content of a manuscript. Experiments are performed on the Borso d'Este Holy Bible, one of the most sophisticated illuminated manuscript from the Renaissance, which we manually annotate aligning every illustration with textual commentaries written by experts. Experimental results quantify the domain shift between ordinary visual-semantic datasets and the proposed one, validate the proposed strategies, and devise future works on the same line.

Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities / Baraldi, L., Cornia, M., Grana, C., Cucchiara, R.. - (2018), pp. 1097-1102. (International Conference on Pattern Recognition Beijing, China August 20th-24th, 2018) [10.1109/ICPR.2018.8545064].

Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities

Baraldi, Lorenzo;Cornia, Marcella;Grana, Costantino;Cucchiara, Rita

2018

Abstract

While several approaches to bring vision and language together are emerging, none of them has yet addressed the digital humanities domain, which, nevertheless, is a rich source of visual and textual data. To foster research in this direction, we investigate the learning of visual-semantic embeddings for historical document illustrations, devising both supervised and semi-supervised approaches. We exploit the joint visual-semantic embeddings to automatically align illustrations and textual elements, thus providing an automatic annotation of the visual content of a manuscript. Experiments are performed on the Borso d'Este Holy Bible, one of the most sophisticated illuminated manuscript from the Renaissance, which we manually annotate aligning every illustration with textual commentaries written by experts. Experimental results quantify the domain shift between ordinary visual-semantic datasets and the proposed one, validate the proposed strategies, and devise future works on the same line.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2018
			
	Titolo del Convegno
	
				International Conference on Pattern Recognition
			
	Luogo del Convegno
	
				Beijing, China
			
	Data del Convegno
	
				August 20th-24th, 2018
			
	Codice DOI
	
				https://dx.doi.org/10.1109/ICPR.2018.8545064
			
	Codice WoS
	
				WOS:000455146801019
			
	Codice Scopus
	
				2-s2.0-85059777383
			
	Pagina iniziale
	
				1097
			
	Pagina finale
	
				1102
			
	Tutti gli autori
	
						Baraldi, Lorenzo; Cornia, Marcella; Grana, Costantino; Cucchiara, Rita
					
	Citazione
	
				Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities / Baraldi, L., Cornia, M., Grana, C., Cucchiara, R.. - (2018), pp. 1097-1102. (International Conference on Pattern Recognition Beijing, China August 20th-24th, 2018) [10.1109/ICPR.2018.8545064].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris