Modern automotive-grade embedded computing platforms feature high-performance Graphics Processing Units (GPUs) to support the massively parallel processing power needed for next-generation autonomous driving applications (e.g., Deep Neural Network (DNN) inference, sensor fusion, path planning, etc). As these workload-intensive activities are pushed to higher criticality levels, there is a stronger need for more predictable scheduling algorithms that are able to guarantee predictability without overly sacrificing GPU utilization. Unfortunately, the real-rime literature on GPU scheduling mostly considered limited (or null) preemption capabilities, while previous efforts in broader domains were often based on programming models and APIs that were not designed to support the real-rime requirements of recurring workloads. In this paper, we present the design of a prototype real-time scheduler for GPU activities on an embedded System on a Chip (SoC) featuring a cutting edge GPU architecture by NVIDIA adopted in the autonomous driving domain. The scheduler runs as a software partition on top of the NVIDIA hypervisor, and it leverages latest generation architectural features, such as pixel-level preemption and thread level preemption. Such a design allowed us to implement and test a preemptive Earliest Deadline First (EDF) scheduler for GPU tasks providing bandwidth isolations by means of a Constant Bandwidth Server (CBS). Our work involved investigating alternative programming models for compute APIs, allowing us to characterize CPU-to-GPU command submission with more detailed scheduling information. A detailed experimental characterization is presented to show the significant schedulability improvement of recurring real-time GPU tasks.

Deadline-Based Scheduling for GPU with Preemption Support / Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.. - 2018-:(2019), pp. 119-130. ((Intervento presentato al convegno 39th IEEE Real-Time Systems Symposium, RTSS 2018 tenutosi a usa nel 2018 [10.1109/RTSS.2018.00021].

Deadline-Based Scheduling for GPU with Preemption Support

Capodieci N.;Cavicchioli R.;Bertogna M.;
2019

Abstract

Modern automotive-grade embedded computing platforms feature high-performance Graphics Processing Units (GPUs) to support the massively parallel processing power needed for next-generation autonomous driving applications (e.g., Deep Neural Network (DNN) inference, sensor fusion, path planning, etc). As these workload-intensive activities are pushed to higher criticality levels, there is a stronger need for more predictable scheduling algorithms that are able to guarantee predictability without overly sacrificing GPU utilization. Unfortunately, the real-rime literature on GPU scheduling mostly considered limited (or null) preemption capabilities, while previous efforts in broader domains were often based on programming models and APIs that were not designed to support the real-rime requirements of recurring workloads. In this paper, we present the design of a prototype real-time scheduler for GPU activities on an embedded System on a Chip (SoC) featuring a cutting edge GPU architecture by NVIDIA adopted in the autonomous driving domain. The scheduler runs as a software partition on top of the NVIDIA hypervisor, and it leverages latest generation architectural features, such as pixel-level preemption and thread level preemption. Such a design allowed us to implement and test a preemptive Earliest Deadline First (EDF) scheduler for GPU tasks providing bandwidth isolations by means of a Constant Bandwidth Server (CBS). Our work involved investigating alternative programming models for compute APIs, allowing us to characterize CPU-to-GPU command submission with more detailed scheduling information. A detailed experimental characterization is presented to show the significant schedulability improvement of recurring real-time GPU tasks.
2018
39th IEEE Real-Time Systems Symposium, RTSS 2018
usa
2018
2018-
119
130
Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.
Deadline-Based Scheduling for GPU with Preemption Support / Capodieci, N.; Cavicchioli, R.; Bertogna, M.; Paramakuru, A.. - 2018-:(2019), pp. 119-130. ((Intervento presentato al convegno 39th IEEE Real-Time Systems Symposium, RTSS 2018 tenutosi a usa nel 2018 [10.1109/RTSS.2018.00021].
File in questo prodotto:
File Dimensione Formato  
VOR_Deadline-based Scheduling for GPU.pdf

non disponibili

Tipologia: Versione dell'editore (versione pubblicata)
Dimensione 403.35 kB
Formato Adobe PDF
403.35 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
RTSS18.pdf

accesso aperto

Tipologia: Post-print dell'autore (bozza post referaggio)
Dimensione 428.06 kB
Formato Adobe PDF
428.06 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11380/1182256
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 42
  • ???jsp.display-item.citation.isi??? 24
social impact