A variable metric proximal stochastic gradient method: An application to classification problems

Cascarano, P.; Franchini, G.; Kobler, E.; Porta, F.; Sebastiani, A.

doi:10.1016/j.ejco.2024.100088

Due to the continued success of machine learning and deep learning in particular, supervised classification problems are ubiquitous in numerous scientific fields. Training these models typically involves the minimization of the empirical risk over large data sets along with a possibly non-differentiable regularization. In this paper, we introduce a stochastic gradient method for the considered classification problem. To control the variance of the objective's gradients, we use an automatic sample size selection along with a variable metric to precondition the stochastic gradient directions. Further, we utilize a non -monotone line search to automatize step size selection. Convergence results are provided for both convex and non-convex objective functions. Extensive numerical experiments verify that the suggested approach performs on par with stateof-the-art methods for training both statistical models for binary classification and artificial neural networks for multi-class image classification. The code is publicly available at https:// github .com /koblererich /lisavm.

A variable metric proximal stochastic gradient method: An application to classification problems / Cascarano, P.; Franchini, G.; Kobler, E.; Porta, F.; Sebastiani, A.. - In: EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION. - ISSN 2192-4414. - 12:(2024), pp. 100088-100088. [10.1016/j.ejco.2024.100088]

A variable metric proximal stochastic gradient method: An application to classification problems

Cascarano P.;Franchini G.;Kobler E.;Porta F.;Sebastiani A.

2024

Abstract

Due to the continued success of machine learning and deep learning in particular, supervised classification problems are ubiquitous in numerous scientific fields. Training these models typically involves the minimization of the empirical risk over large data sets along with a possibly non-differentiable regularization. In this paper, we introduce a stochastic gradient method for the considered classification problem. To control the variance of the objective's gradients, we use an automatic sample size selection along with a variable metric to precondition the stochastic gradient directions. Further, we utilize a non -monotone line search to automatize step size selection. Convergence results are provided for both convex and non-convex objective functions. Extensive numerical experiments verify that the suggested approach performs on par with stateof-the-art methods for training both statistical models for binary classification and artificial neural networks for multi-class image classification. The code is publicly available at https:// github .com /koblererich /lisavm.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Rivista
	
				EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION
			
	N° del Volume
	
				12
			
	Pagina iniziale
	
				100088
			
	Pagina finale
	
				100088
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.ejco.2024.100088
			
	Codice WoS
	
				WOS:001234518400001
			
	Codice Scopus
	
				2-s2.0-85191356406
			
	Citazione
	
				A variable metric proximal stochastic gradient method: An application to classification problems / Cascarano, P.; Franchini, G.; Kobler, E.; Porta, F.; Sebastiani, A.. - In: EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION. - ISSN 2192-4414. - 12:(2024), pp. 100088-100088. [10.1016/j.ejco.2024.100088]
			
	Tutti gli autori
	
						Cascarano, P.; Franchini, G.; Kobler, E.; Porta, F.; Sebastiani, A.
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S2192440624000054-main.pdf Open access Tipologia: VOR - Versione pubblicata dall'editore Licenza: [IR] creative-commons Dimensione 1.68 MB Formato Adobe PDF Visualizza/Apri	1.68 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris