The Image Captioning research field is currently compromised by the lack of transparency and awareness over the End-of-Sequence token () in the Self-Critical Sequence Training. If the token is omitted, a model can boost its performance up to +4.1 CIDEr-D using trivial sentence fragments. While this phenomenon poses an obstacle to a fair evaluation and comparison of established works, people involved in new projects are given the arduous choice between lower scores and unsatisfactory descriptions due to the competitive nature of the research. This work proposes to solve the problem by spreading awareness of the issue itself. In particular, we invite future works to share a simple and informative signature with the help of a library called SacreEOS. Code available at: https://github.com/jchenghu/sacreeos.

A Request for Clarity over the End of Sequence Token in the Self-Critical Sequence Training / Hu, Jia Cheng; Cavicchioli, R.; Capotondi, A.. - 14233:(2023), pp. 39-50. ( 22nd International Conference on Image Analysis and Processing (ICIAP 2023) Udine, ITALY SEP 11-15, 2023) [10.1007/978-3-031-43148-7_4].

A Request for Clarity over the End of Sequence Token in the Self-Critical Sequence Training

Hu J. C.;Cavicchioli R.;Capotondi A.
2023

Abstract

The Image Captioning research field is currently compromised by the lack of transparency and awareness over the End-of-Sequence token () in the Self-Critical Sequence Training. If the token is omitted, a model can boost its performance up to +4.1 CIDEr-D using trivial sentence fragments. While this phenomenon poses an obstacle to a fair evaluation and comparison of established works, people involved in new projects are given the arduous choice between lower scores and unsatisfactory descriptions due to the competitive nature of the research. This work proposes to solve the problem by spreading awareness of the issue itself. In particular, we invite future works to share a simple and informative signature with the help of a library called SacreEOS. Code available at: https://github.com/jchenghu/sacreeos.
2023
no
Inglese
22nd International Conference on Image Analysis and Processing (ICIAP 2023)
Udine, ITALY
SEP 11-15, 2023
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I
14233
39
50
9783031431470
Springer Science and Business Media Deutschland GmbH
Hu, Jia Cheng; Cavicchioli, R.; Capotondi, A.
Atti di CONVEGNO::Relazione in Atti di Convegno
273
3
A Request for Clarity over the End of Sequence Token in the Self-Critical Sequence Training / Hu, Jia Cheng; Cavicchioli, R.; Capotondi, A.. - 14233:(2023), pp. 39-50. ( 22nd International Conference on Image Analysis and Processing (ICIAP 2023) Udine, ITALY SEP 11-15, 2023) [10.1007/978-3-031-43148-7_4].
open
info:eu-repo/semantics/conferenceObject
   Monetizing car & mobility data for new Entrants, Technologies and Actors
   5GMETA
   European Commission
   Horizon 2020 Framework Programme
   957360
File in questo prodotto:
File Dimensione Formato  
2305.12254.pdf

Open access

Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione
Dimensione 615.7 kB
Formato Adobe PDF
615.7 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1320846
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 4
social impact