Thresholding Procedure via Barzilai-Borwein Rules for the Steplength Selection in Stochastic Gradient Methods

Franchini, G.; Ruggiero, V.; Trombini, I.

doi:10.1007/978-3-030-95470-3_21

A crucial aspect in designing a learning algorithm is the selection of the hyperparameters (parameters that are not trained during the learning process). In particular the effectiveness of the stochastic gradient methods strongly depends on the steplength selection. In recent papers [9, 10], Franchini et al. propose to adopt an adaptive selection rule borrowed from the full-gradient scheme known as Limited Memory Steepest Descent method [8] and appropriately tailored to the stochastic framework. This strategy is based on the computation of the eigenvalues (Ritz-like values) of a suitable matrix obtained from the gradients of the most recent iterations, and it enables to give an estimation of the local Lipschitz constant of the current gradient of the objective function, without introducing line-search techniques. The possible increase of the size of the sub-sample used to compute the stochastic gradient is driven by means of an augmented inner product test approach [3]. The whole procedure makes the tuning of the parameters less expensive than the selection of a fixed steplength, although it remains dependent on the choice of threshold values bounding the variability of the steplength sequences. The contribution of this paper is to exploit a stochastic version of the Barzilai-Borwein formulas [1] to adaptively select the endpoints range for the Ritz-like values. A numerical experimentation for some convex loss functions highlights that the proposed procedure remains stable as well as the tuning of the hyperparameters appears less expensive.

Thresholding Procedure via Barzilai-Borwein Rules for the Steplength Selection in Stochastic Gradient Methods / Franchini, G.; Ruggiero, V.; Trombini, I.. - 13164:(2022), pp. 277-282. (Intervento presentato al convegno 7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021 tenutosi a Grasmere, Lake District, England – UK nel 2021) [10.1007/978-3-030-95470-3_21].

Thresholding Procedure via Barzilai-Borwein Rules for the Steplength Selection in Stochastic Gradient Methods

Franchini G.;Ruggiero V.;Trombini I.

2022

Abstract

A crucial aspect in designing a learning algorithm is the selection of the hyperparameters (parameters that are not trained during the learning process). In particular the effectiveness of the stochastic gradient methods strongly depends on the steplength selection. In recent papers [9, 10], Franchini et al. propose to adopt an adaptive selection rule borrowed from the full-gradient scheme known as Limited Memory Steepest Descent method [8] and appropriately tailored to the stochastic framework. This strategy is based on the computation of the eigenvalues (Ritz-like values) of a suitable matrix obtained from the gradients of the most recent iterations, and it enables to give an estimation of the local Lipschitz constant of the current gradient of the objective function, without introducing line-search techniques. The possible increase of the size of the sub-sample used to compute the stochastic gradient is driven by means of an augmented inner product test approach [3]. The whole procedure makes the tuning of the parameters less expensive than the selection of a fixed steplength, although it remains dependent on the choice of threshold values bounding the variability of the steplength sequences. The contribution of this paper is to exploit a stochastic version of the Barzilai-Borwein formulas [1] to adaptively select the endpoints range for the Ritz-like values. A numerical experimentation for some convex loss functions highlights that the proposed procedure remains stable as well as the tuning of the hyperparameters appears less expensive.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2022
			
	Titolo del Convegno
	
				7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021
			
	Luogo del Convegno
	
				Grasmere, Lake District, England – UK
			
	Data del Convegno
	
				2021
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-030-95470-3_21
			
	Codice WoS
	
				WOS:000772650800021
			
	Codice Scopus
	
				2-s2.0-85125482174
			
	Serie
	
				LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
			
	N° del Volume
	
				13164
			
	Pagina iniziale
	
				277
			
	Pagina finale
	
				282
			
	Tutti gli autori
	
						Franchini, G.; Ruggiero, V.; Trombini, I.
					
	Citazione
	
				Thresholding Procedure via Barzilai-Borwein Rules for the Steplength Selection in Stochastic Gradient Methods / Franchini, G.; Ruggiero, V.; Trombini, I.. - 13164:(2022), pp. 277-282. (Intervento presentato al  convegno 7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021 tenutosi a Grasmere, Lake District, England – UK nel 2021) [10.1007/978-3-030-95470-3_21].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
LOD_short_2021.pdf Accesso riservato Tipologia: VOR - Versione pubblicata dall'editore Dimensione 711.39 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	711.39 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1278477

Citazioni

ND

1

0

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

Thresholding Procedure via Barzilai-Borwein Rules for the Steplength Selection in Stochastic Gradient Methods

Franchini G.;Ruggiero V.;Trombini I.

2022

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)