Continual semi-supervised learning through contrastive interpolation consistency

Continual Learning (CL) investigates how to train Deep Networks on a stream of tasks without incurring forgetting. CL settings proposed in literature assume that every incoming example is paired with ground-truth annotations. However, this clashes with many real-world applications: gathering labeled data, which is in itself tedious and expensive, becomes infeasible when data flow as a stream. This work explores Continual Semi-Supervised Learning (CSSL): here, only a small fraction of labeled input examples are shown to the learner. We assess how current CL methods (e.g.: EWC, LwF, iCaRL, ER, GDumb, DER) perform in this novel and challenging scenario, where overfitting entangles forgetting. Subsequently, we design a novel CSSL method that exploits metric learning and consistency regularization to leverage unlabeled examples while learning. We show that our proposal exhibits higher resilience to diminishing supervision and, even more surprisingly, relying only on supervision suffices to outperform SOTA methods trained under full supervision.

Continual semi-supervised learning through contrastive interpolation consistency / Boschini, Matteo; Buzzega, Pietro; Bonicelli, Lorenzo; Porrello, Angelo; Calderara, Simone. - In: PATTERN RECOGNITION LETTERS. - ISSN 0167-8655. - 162:(2022), pp. 9-14. [10.1016/j.patrec.2022.08.006]

Continual semi-supervised learning through contrastive interpolation consistency

Boschini, Matteo;Buzzega, Pietro;Bonicelli, Lorenzo;Porrello, Angelo;Calderara, Simone

2022

Abstract

Continual Learning (CL) investigates how to train Deep Networks on a stream of tasks without incurring forgetting. CL settings proposed in literature assume that every incoming example is paired with ground-truth annotations. However, this clashes with many real-world applications: gathering labeled data, which is in itself tedious and expensive, becomes infeasible when data flow as a stream. This work explores Continual Semi-Supervised Learning (CSSL): here, only a small fraction of labeled input examples are shown to the learner. We assess how current CL methods (e.g.: EWC, LwF, iCaRL, ER, GDumb, DER) perform in this novel and challenging scenario, where overfitting entangles forgetting. Subsequently, we design a novel CSSL method that exploits metric learning and consistency regularization to leverage unlabeled examples while learning. We show that our proposal exhibits higher resilience to diminishing supervision and, even more surprisingly, relying only on supervision suffices to outperform SOTA methods trained under full supervision.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2022
			
	Data di prima pubblicazione
	
				17-ago-2022
			
	Rivista
	
				PATTERN RECOGNITION LETTERS
			
	N° del Volume
	
				162
			
	Pagina iniziale
	
				9
			
	Pagina finale
	
				14
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.patrec.2022.08.006
			
	Codice WoS
	
				WOS:000863226700003
			
	Codice Scopus
	
				2-s2.0-85136701505
			
	Citazione
	
				Continual semi-supervised learning through contrastive interpolation consistency / Boschini, Matteo; Buzzega, Pietro; Bonicelli, Lorenzo; Porrello, Angelo; Calderara, Simone. - In: PATTERN RECOGNITION LETTERS. - ISSN 0167-8655. - 162:(2022), pp. 9-14. [10.1016/j.patrec.2022.08.006]
			
	Tutti gli autori
	
						Boschini, Matteo; Buzzega, Pietro; Bonicelli, Lorenzo; Porrello, Angelo; Calderara, Simone
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
2108.06552.pdf Open access Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione Dimensione 830.89 kB Formato Adobe PDF Visualizza/Apri	830.89 kB	Adobe PDF	Visualizza/Apri
1-s2.0-S0167865522002458-main.pdf Open access Tipologia: VOR - Versione pubblicata dall'editore Dimensione 1.41 MB Formato Adobe PDF Visualizza/Apri	1.41 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1285646

Citazioni

ND

18

9

social impact