
Olmedo, I. S.; Capodieci, N.; Martinez, J. L.; Marongiu, A.; Bertogna, M. Dissecting the CUDA scheduling hierarchy: A Performance and Predictability Perspective. Paper presented at the 26th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS 2020), held in aus in 2020, pp. 213-225. [10.1109/RTAS48715.2020.000-5].

Dissecting the CUDA scheduling hierarchy: A Performance and Predictability Perspective

Capodieci, N.; Marongiu, A.; Bertogna, M.
2020

Abstract

Over the last few years, the ever-increasing use of Graphics Processing Units (GPUs) in safety-related domains has opened up many research problems in the real-time community. The closed and proprietary nature of the scheduling mechanisms deployed in NVIDIA GPUs, for instance, represents a major obstacle in deriving a proper schedulability analysis for latency-sensitive applications. Existing literature addresses these issues by either (i) providing simplified models for heterogeneous CPU-GPU systems and their associated scheduling policies, or (ii) providing insights about these arbitration mechanisms obtained through reverse engineering. In this paper, we take one step further by correcting and consolidating previously published assumptions about the hierarchical scheduling policies of NVIDIA GPUs and their proprietary CUDA application programming interface. We also discuss how such mechanisms evolved with recently released GPU micro-architectures, and how such changes influence the scheduling models to be exploited by real-time system engineers.
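As a point of reference for the abstract above: CUDA stream priorities are one of the few user-visible knobs that feed into the proprietary scheduling hierarchy the paper dissects. The sketch below is a hypothetical minimal example (not taken from the paper; kernel names and launch shapes are illustrative) showing how a CUDA application requests different priorities for two streams using the standard runtime API.

```cuda
// Hypothetical sketch, not from the paper: two streams at different
// priorities. How the GPU arbitrates between them is exactly the kind of
// undocumented behavior the paper reverse-engineers.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void kernel_hi(int *out) { out[0] = 1; }  // illustrative name
__global__ void kernel_lo(int *out) { out[1] = 2; }  // illustrative name

int main() {
    int least, greatest;
    // Query the supported priority range; numerically lower means higher priority.
    cudaDeviceGetStreamPriorityRange(&least, &greatest);

    cudaStream_t hi, lo;
    cudaStreamCreateWithPriority(&hi, cudaStreamNonBlocking, greatest);
    cudaStreamCreateWithPriority(&lo, cudaStreamNonBlocking, least);

    int *buf;
    cudaMalloc(&buf, 2 * sizeof(int));

    // Work in `hi` may be dispatched ahead of (or preempt) work in `lo`;
    // the precise arbitration policy is what the paper clarifies.
    kernel_lo<<<1, 1, 0, lo>>>(buf);
    kernel_hi<<<1, 1, 0, hi>>>(buf);
    cudaDeviceSynchronize();

    printf("priority range: %d (least) .. %d (greatest)\n", least, greatest);

    cudaFree(buf);
    cudaStreamDestroy(hi);
    cudaStreamDestroy(lo);
    return 0;
}
```

Note that stream priorities are only a hint to the hardware scheduler; the paper's contribution is characterizing what the hierarchy below this API actually does.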
Year: 2020
Conference: 26th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2020
Venue: aus
Volume: 2020-
Pages: 213-225
Authors: Olmedo, I. S.; Capodieci, N.; Martinez, J. L.; Marongiu, A.; Bertogna, M.
Files for this item:
No files are associated with this item.

Creative Commons License
Metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while publication files are released under the Attribution 4.0 International (CC BY 4.0) license, unless otherwise indicated.
In case of copyright infringement, contact Iris Support.

Use this identifier to cite or link to this item: https://hdl.handle.net/11380/1208436
Citations
  • PMC: n/a
  • Scopus: 42
  • Web of Science (ISI): 33