Self-Labeling the Job Shop Scheduling Problem

This work proposes a self-supervised training strategy designed for combinatorial problems. An obstacle in applying supervised paradigms to such problems is the need for costly target solutions often produced with exact solvers. Inspired by semi- and self-supervised learning, we show that generative models can be trained by sampling multiple solutions and using the best one according to the problem objective as a pseudo-label. In this way, we iteratively improve the model generation capability by relying only on its self-supervision, eliminating the need for optimality information. We validate this Self-Labeling Improvement Method (SLIM) on the Job Shop Scheduling (JSP), a complex combinatorial problem that is receiving much attention from the neural combinatorial community. We propose a generative model based on the well-known Pointer Network and train it with SLIM. Experiments on popular benchmarks demonstrate the potential of this approach as the resulting models outperform constructive heuristics and state-of-the-art learning proposals for the JSP. Lastly, we prove the robustness of SLIM to various parameters and its generality by applying it to the Traveling Salesman Problem.

Self-Labeling the Job Shop Scheduling Problem / Corsini, Andrea; Porrello, Angelo; Calderara, Simone; Dell'Amico, Mauro. - 37:(2024). ( 38th Conference on Neural Information Processing Systems, NeurIPS 2024 Vancouver, CANADA Dec. 9 -15, 2024).

Self-Labeling the Job Shop Scheduling Problem

Andrea Corsini;Angelo Porrello;Simone Calderara;Mauro Dell'Amico

2024

Abstract

This work proposes a self-supervised training strategy designed for combinatorial problems. An obstacle in applying supervised paradigms to such problems is the need for costly target solutions often produced with exact solvers. Inspired by semi- and self-supervised learning, we show that generative models can be trained by sampling multiple solutions and using the best one according to the problem objective as a pseudo-label. In this way, we iteratively improve the model generation capability by relying only on its self-supervision, eliminating the need for optimality information. We validate this Self-Labeling Improvement Method (SLIM) on the Job Shop Scheduling (JSP), a complex combinatorial problem that is receiving much attention from the neural combinatorial community. We propose a generative model based on the well-known Pointer Network and train it with SLIM. Experiments on popular benchmarks demonstrate the potential of this approach as the resulting models outperform constructive heuristics and state-of-the-art learning proposals for the JSP. Lastly, we prove the robustness of SLIM to various parameters and its generality by applying it to the Traveling Salesman Problem.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2024
			
	Titolo del Convegno
	
				38th Conference on Neural Information Processing Systems, NeurIPS 2024
			
	Luogo del Convegno
	
				Vancouver, CANADA
			
	Data del Convegno
	
				Dec. 9 -15, 2024
			
	Codice Scopus
	
				2-s2.0-105000545738
			
	Serie
	
				ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS
			
	N° del Volume
	
				37
			
	Tutti gli autori
	
						Corsini, Andrea; Porrello, Angelo; Calderara, Simone; Dell'Amico, Mauro
					
	Citazione
	
				Self-Labeling the Job Shop Scheduling Problem / Corsini, Andrea; Porrello, Angelo; Calderara, Simone; Dell'Amico, Mauro. - 37:(2024). ( 38th Conference on Neural Information Processing Systems, NeurIPS 2024 Vancouver, CANADA Dec. 9 -15, 2024).
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
paper_SelfLabelingJSP.pdf Open access Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione Licenza: [IR] unspecified-oa Dimensione 2.76 MB Formato Adobe PDF Visualizza/Apri	2.76 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1364432

Citazioni

ND

3

ND

social impact