Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model / Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita. - In: IEEE TRANSACTIONS ON IMAGE PROCESSING. - ISSN 1057-7149. - 27:10(2018), pp. 5142-5154. [10.1109/TIP.2018.2851672]
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Cornia, Marcella; Baraldi, Lorenzo; Serra, Giuseppe; Cucchiara, Rita
2018
Abstract
Data-driven saliency has recently gained considerable attention thanks to the use of Convolutional Neural Networks for predicting gaze fixations. In this paper we go beyond standard approaches to saliency prediction, in which gaze maps are computed with a feed-forward network, and present a novel model which can predict accurate saliency maps by incorporating neural attentive mechanisms. The core of our solution is a Convolutional LSTM that focuses on the most salient regions of the input image to iteratively refine the predicted saliency map. Additionally, to tackle the center bias typical of human eye fixations, our model can learn a set of prior maps generated with Gaussian functions. We show, through an extensive evaluation, that the proposed architecture outperforms the current state of the art on public saliency prediction datasets. We further study the contribution of each key component and demonstrate their robustness in different scenarios.
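As an illustration only, the sketch below shows one plausible way to realize the "learned prior maps generated with Gaussian functions" mentioned in the abstract. It is not the authors' released code: the number of priors, the normalized coordinate grid, and the axis-aligned parameterization of each Gaussian are assumptions made here for clarity. Each prior is a 2D Gaussian whose center and spread are trainable, so the network can fit the center bias of human fixations during training.

```python
# Hypothetical sketch of learned Gaussian prior maps (not the paper's code).
# Each prior map is an axis-aligned 2D Gaussian with trainable center (mu)
# and spread (sigma) in normalized [0, 1] image coordinates.
import torch
import torch.nn as nn


class LearnedGaussianPriors(nn.Module):
    def __init__(self, num_priors: int = 16):  # num_priors is an assumption
        super().__init__()
        self.mu = nn.Parameter(torch.rand(num_priors, 2))        # centers
        self.log_sigma = nn.Parameter(torch.zeros(num_priors, 2))  # log spreads

    def forward(self, height: int, width: int) -> torch.Tensor:
        # Normalized coordinate grid of shape (H, W, 2), ordered as (x, y).
        ys = torch.linspace(0.0, 1.0, height)
        xs = torch.linspace(0.0, 1.0, width)
        grid_y, grid_x = torch.meshgrid(ys, xs, indexing="ij")
        grid = torch.stack([grid_x, grid_y], dim=-1)

        sigma = self.log_sigma.exp()                              # (P, 2)
        diff = grid.unsqueeze(0) - self.mu.view(-1, 1, 1, 2)      # (P, H, W, 2)
        # Axis-aligned Gaussian: exp(-0.5 * sum(((x - mu) / sigma)^2))
        exponent = -0.5 * (diff / sigma.view(-1, 1, 1, 2)).pow(2).sum(-1)
        return exponent.exp()                                     # (P, H, W)


if __name__ == "__main__":
    priors = LearnedGaussianPriors(num_priors=16)
    maps = priors(height=30, width=40)
    print(maps.shape)  # torch.Size([16, 30, 40])
```

In a saliency model of this kind, the resulting maps would typically be concatenated with intermediate feature maps so that later layers can weight them; that integration step is likewise an assumption here.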
File | Access | Type | Size | Format
---|---|---|---|---
manuscript.pdf | Open access | AAM - Author's version, revised and accepted for publication | 15.49 MB | Adobe PDF
VQR_08400593.pdf | Restricted access | VOR - Version published by the publisher | 4.76 MB | Adobe PDF