Dissecting the CUDA scheduling hierarchy: A Performance and Predictability Perspective

Olmedo, I. S.; Capodieci, N.; Martinez, J. L.; Marongiu, A.; Bertogna, M.

doi:10.1109/RTAS48715.2020.000-5

Over the last few years, the ever-increasing use of Graphics Processing Units (GPUs) in safety-related domains has opened up many research problems in the real-time community. The closed and proprietary nature of the scheduling mechanisms deployed in NVIDIA GPUs, for instance, represents a major obstacle in deriving a proper schedulability analysis for latency-sensitive applications. Existing literature addresses these issues by either (i) providing simplified models for heterogeneous CPU-GPU systems and their associated scheduling policies, or (ii) providing insights about these arbitration mechanisms obtained through reverse engineering. In this paper, we go one step further by correcting and consolidating previously published assumptions about the hierarchical scheduling policies of NVIDIA GPUs and their proprietary CUDA application programming interface. We also discuss how such mechanisms evolved with recently released GPU micro-architectures, and how such changes influence the scheduling models to be exploited by real-time system engineers.

Dissecting the CUDA scheduling hierarchy: A Performance and Predictability Perspective / Olmedo, I. S.; Capodieci, N.; Martinez, J. L.; Marongiu, A.; Bertogna, M. - 2020-:(2020), pp. 213-225. (26th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2020, aus, 2020) [10.1109/RTAS48715.2020.000-5].

Publication year: 2020
Conference title: 26th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2020
Conference location: aus
Conference date: 2020
DOI: https://dx.doi.org/10.1109/RTAS48715.2020.000-5
WoS ID: WOS:000713963100016
Scopus ID: 2-s2.0-85086769585
Volume number: 2020-
First page: 213
Last page: 225
All authors: Olmedo, I. S.; Capodieci, N.; Martinez, J. L.; Marongiu, A.; Bertogna, M.
Type: Conference proceedings paper

Files in this item:

File	Size	Format
549900a210.pdf (Restricted access; Type: VOR - Publisher's published version; License: [IR] publisher-specific-oa)	2.5 MB	Adobe PDF
