By and large, current visual attention models mostly rely, when considering static stimuli, on the following procedure. Given an image, a saliency map is computed, which, in turn, might serve the purpose of predicting a sequence of gaze shifts, namely a scanpath instantiating the dynamics of visual attention deployment. The temporal pattern of attention unfolding is thus confined to the scanpath generation stage, whilst salience is conceived as a static map, at best conflating a number of factors (bottom-up information, top-down, spatial biases, etc.). In this note we propose a novel sequential scheme that consists of a three-stage processing relying on a center-bias model, a context/layout model, and an object-based model, respectively. Each stage contributes, at different times, to the sequential sampling of the final scanpath. We compare the method against classic scanpath generation that exploits state-of-the-art static saliency model. Results show that accounting for the structure of the temporal unfolding leads to gaze dynamics close to human gaze behaviour.

How to look next? A data-driven approach for scanpath prediction / Boccignone, G.; Cuculo, V.; D'Amelio, A.. - 12232:(2020), pp. 131-145. (Intervento presentato al convegno World Congress on Formal Methods tenutosi a Porto nel 2019) [10.1007/978-3-030-54994-7_10].

How to look next? A data-driven approach for scanpath prediction

Cuculo V.;
2020

Abstract

By and large, current visual attention models mostly rely, when considering static stimuli, on the following procedure. Given an image, a saliency map is computed, which, in turn, might serve the purpose of predicting a sequence of gaze shifts, namely a scanpath instantiating the dynamics of visual attention deployment. The temporal pattern of attention unfolding is thus confined to the scanpath generation stage, whilst salience is conceived as a static map, at best conflating a number of factors (bottom-up information, top-down, spatial biases, etc.). In this note we propose a novel sequential scheme that consists of a three-stage processing relying on a center-bias model, a context/layout model, and an object-based model, respectively. Each stage contributes, at different times, to the sequential sampling of the final scanpath. We compare the method against classic scanpath generation that exploits state-of-the-art static saliency model. Results show that accounting for the structure of the temporal unfolding leads to gaze dynamics close to human gaze behaviour.
2020
World Congress on Formal Methods
Porto
2019
12232
131
145
Boccignone, G.; Cuculo, V.; D'Amelio, A.
How to look next? A data-driven approach for scanpath prediction / Boccignone, G.; Cuculo, V.; D'Amelio, A.. - 12232:(2020), pp. 131-145. (Intervento presentato al convegno World Congress on Formal Methods tenutosi a Porto nel 2019) [10.1007/978-3-030-54994-7_10].
File in questo prodotto:
File Dimensione Formato  
Boccignone2020_Chapter_HowToLookNextAData-DrivenAppro.pdf

Accesso riservato

Dimensione 6.72 MB
Formato Adobe PDF
6.72 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1300657
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact