Deadline-Based Scheduling for GPU with Preemption Support

Modern automotive-grade embedded computing platforms feature high-performance Graphics Processing Units (GPUs) to support the massively parallel processing power needed for next-generation autonomous driving applications (e.g., Deep Neural Network (DNN) inference, sensor fusion, path planning, etc). As these workload-intensive activities are pushed to higher criticality levels, there is a stronger need for more predictable scheduling algorithms that are able to guarantee predictability without overly sacrificing GPU utilization. Unfortunately, the real-rime literature on GPU scheduling mostly considered limited (or null) preemption capabilities, while previous efforts in broader domains were often based on programming models and APIs that were not designed to support the real-rime requirements of recurring workloads. In this paper, we present the design of a prototype real-time scheduler for GPU activities on an embedded System on a Chip (SoC) featuring a cutting edge GPU architecture by NVIDIA adopted in the autonomous driving domain. The scheduler runs as a software partition on top of the NVIDIA hypervisor, and it leverages latest generation architectural features, such as pixel-level preemption and thread level preemption. Such a design allowed us to implement and test a preemptive Earliest Deadline First (EDF) scheduler for GPU tasks providing bandwidth isolations by means of a Constant Bandwidth Server (CBS). Our work involved investigating alternative programming models for compute APIs, allowing us to characterize CPU-to-GPU command submission with more detailed scheduling information. A detailed experimental characterization is presented to show the significant schedulability improvement of recurring real-time GPU tasks.

Deadline-Based Scheduling for GPU with Preemption Support / Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.. - 2018-:(2019), pp. 119-130. (Intervento presentato al convegno 39th IEEE Real-Time Systems Symposium, RTSS 2018 tenutosi a usa nel 2018) [10.1109/RTSS.2018.00021].

Deadline-Based Scheduling for GPU with Preemption Support

Capodieci N.;Cavicchioli R.;Bertogna M.;Paramakuru A.

2019

Abstract

Modern automotive-grade embedded computing platforms feature high-performance Graphics Processing Units (GPUs) to support the massively parallel processing power needed for next-generation autonomous driving applications (e.g., Deep Neural Network (DNN) inference, sensor fusion, path planning, etc). As these workload-intensive activities are pushed to higher criticality levels, there is a stronger need for more predictable scheduling algorithms that are able to guarantee predictability without overly sacrificing GPU utilization. Unfortunately, the real-rime literature on GPU scheduling mostly considered limited (or null) preemption capabilities, while previous efforts in broader domains were often based on programming models and APIs that were not designed to support the real-rime requirements of recurring workloads. In this paper, we present the design of a prototype real-time scheduler for GPU activities on an embedded System on a Chip (SoC) featuring a cutting edge GPU architecture by NVIDIA adopted in the autonomous driving domain. The scheduler runs as a software partition on top of the NVIDIA hypervisor, and it leverages latest generation architectural features, such as pixel-level preemption and thread level preemption. Such a design allowed us to implement and test a preemptive Earliest Deadline First (EDF) scheduler for GPU tasks providing bandwidth isolations by means of a Constant Bandwidth Server (CBS). Our work involved investigating alternative programming models for compute APIs, allowing us to characterize CPU-to-GPU command submission with more detailed scheduling information. A detailed experimental characterization is presented to show the significant schedulability improvement of recurring real-time GPU tasks.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2019
			
	Data di prima pubblicazione
	
				2018
			
	Titolo del Convegno
	
				39th IEEE Real-Time Systems Symposium, RTSS 2018
			
	Luogo del Convegno
	
				usa
			
	Data del Convegno
	
				2018
			
	Codice DOI
	
				https://dx.doi.org/10.1109/RTSS.2018.00021
			
	Codice WoS
	
				WOS:000459855300011
			
	Codice Scopus
	
				2-s2.0-85061535999
			
	N° del Volume
	
				2018-
			
	Pagina iniziale
	
				119
			
	Pagina finale
	
				130
			
	Tutti gli autori
	
						Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.
					
	Citazione
	
				Deadline-Based Scheduling for GPU with Preemption Support / Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.. - 2018-:(2019), pp. 119-130. (Intervento presentato al  convegno 39th IEEE Real-Time Systems Symposium, RTSS 2018 tenutosi a usa nel 2018) [10.1109/RTSS.2018.00021].
			
	Tipologia
	
				Relazione in Atti di Convegno

File in questo prodotto:

File	Dimensione	Formato
VOR_Deadline-based Scheduling for GPU.pdf Accesso riservato Tipologia: VOR - Versione pubblicata dall'editore Dimensione 403.35 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	403.35 kB	Adobe PDF	Visualizza/Apri Richiedi una copia
RTSS18.pdf Open access Tipologia: AAM - Versione dell'autore revisionata e accettata per la pubblicazione Dimensione 428.06 kB Formato Adobe PDF Visualizza/Apri	428.06 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1182256

Citazioni

ND

70

54

social impact