Dark Experience for General Continual Learning: a Strong, Simple Baseline

Buzzega, Pietro; Boschini, Matteo; Porrello, Angelo; Abati, Davide; Calderara, Simone

Continual Learning has inspired a plethora of approaches and evaluation settings; however, most of them overlook the properties of a practical scenario, where the data stream cannot be shaped as a sequence of tasks and offline training is not viable. We work towards General Continual Learning (GCL), where task boundaries blur and the domain and class distributions shift either gradually or suddenly. We address it by mixing rehearsal with knowledge distillation and regularization; our simple baseline, Dark Experience Replay, matches the network's logits sampled throughout the optimization trajectory, thus promoting consistency with its past. Through an extensive analysis on both standard benchmarks and a novel GCL evaluation setting (MNIST-360), we show that such a seemingly simple baseline outperforms consolidated approaches and leverages limited resources. We further explore the generalization capabilities of our objective, showing that its regularization is beneficial beyond mere performance.
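The idea of matching logits sampled along the optimization trajectory can be illustrated with a minimal sketch. The abstract does not spell out the objective or the buffer policy, so everything below is an assumption made for illustration: the names `LogitBuffer`, `logit_matching_penalty`, `replay_loss` and the weight `alpha` are hypothetical, the memory is filled by reservoir sampling, and the mismatch is measured as a mean squared difference. It should not be read as the authors' implementation.

```python
import random


class LogitBuffer:
    """Fixed-size memory of (input, logits) pairs (hypothetical helper).

    Assumption: the memory is filled by reservoir sampling, so no task
    boundaries are needed to decide what to store.
    """

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, x, logits):
        # Store the logits the network produced when x was observed.
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append((x, list(logits)))
        else:
            # Replace a random slot with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = (x, list(logits))

    def sample(self, k):
        # Draw up to k stored pairs without replacement.
        return self.rng.sample(self.items, min(k, len(self.items)))


def logit_matching_penalty(current_logits, stored_logits):
    """Mean squared difference between current and stored logits."""
    total, count = 0.0, 0
    for cur, old in zip(current_logits, stored_logits):
        for c, o in zip(cur, old):
            total += (c - o) ** 2
            count += 1
    return total / count if count else 0.0


def replay_loss(task_loss, current_logits, stored_logits, alpha=0.5):
    """Loss on the incoming batch plus a weighted logit-matching term.

    alpha is an assumed trade-off weight, not a value from the source.
    """
    return task_loss + alpha * logit_matching_penalty(current_logits, stored_logits)
```

In a training loop, one would add each incoming example and its current logits to the buffer, sample a few stored pairs, recompute their logits with the current network, and add the penalty to the loss on the incoming batch.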

Dark Experience for General Continual Learning: a Strong, Simple Baseline / Buzzega, Pietro; Boschini, Matteo; Porrello, Angelo; Abati, Davide; Calderara, Simone. - 2020-:(2020). (34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, 6-12 December 2020).

Year of publication	2020
Conference title	34th Conference on Neural Information Processing Systems (NeurIPS 2020)
Conference location	Vancouver, Canada
Conference date	6-12 December 2020
Scopus ID	2-s2.0-85106139449
Series	ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS
Volume number	2020-
All authors	Buzzega, Pietro; Boschini, Matteo; Porrello, Angelo; Abati, Davide; Calderara, Simone
Citation	Dark Experience for General Continual Learning: a Strong, Simple Baseline / Buzzega, Pietro; Boschini, Matteo; Porrello, Angelo; Abati, Davide; Calderara, Simone. - 2020-:(2020). (34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, 6-12 December 2020).
Type	Conference proceedings paper

Files in this item:

File	Size	Format
NeurIPS-2020-dark-experience-for-general-continual-learning-a-strong-simple-baseline-Paper.pdf (Open access; Type: AAM - author's version, revised and accepted for publication; License: [IR] unspecified-oa)	3.89 MB	Adobe PDF

The metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while the publication files are released under the Attribution 4.0 International license (CC BY 4.0), unless otherwise indicated.