HePREM: Enabling predictable GPU execution on heterogeneous SoC

Forsberg, Bjorn; Benini, Luca; Marongiu, Andrea

doi:10.23919/DATE.2018.8342066

Heterogeneous systems-on-a-chip are increasingly embracing shared memory designs, in which a single DRAM is used for both the main CPU and an integrated GPU. This architectural paradigm reduces the overheads associated with data movements and simplifies programmability. However, the deployment of real-time workloads on such architectures is troublesome, as memory contention significantly increases execution time of tasks and the pessimism in worst-case execution time (WCET) estimates. The Predictable Execution Model (PREM) separates memory and computation phases in real-time codes, then arbitrates memory phases from different tasks such that only one core at a time can access the DRAM. This paper revisits the original PREM proposal in the context of heterogeneous SoCs, proposing a compiler-based approach to make GPU codes PREM-compliant. Starting from high-level specifications of computation offloading, suitable program regions are selected and separated into memory and compute phases. Our experimental results show that the proposed technique is able to reduce the sensitivity of GPU kernels to memory interference to near zero, and achieves up to a 20× reduction in the measured WCET.
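The phase separation described in the abstract can be illustrated with a minimal sketch. It is not the paper's compiler transformation and does not run on a GPU: the function `prem_scale`, the tile size `TILE`, and the lock `dram_lock` are hypothetical names introduced here, a Python list stands in for the local memory, and a `threading.Lock` stands in for the arbiter that admits one memory phase at a time.

```python
from threading import Lock

TILE = 4            # hypothetical tile size, standing in for local-memory capacity
dram_lock = Lock()  # hypothetical arbiter: only one memory phase may hold it


def prem_scale(data, factor, tile=TILE, trace=None):
    """Scale `data` in place, one tile at a time, in PREM-style phases."""
    for start in range(0, len(data), tile):
        # Memory phase: copy one tile from shared memory into a local buffer.
        with dram_lock:
            local = data[start:start + tile]
            if trace is not None:
                trace.append("load")

        # Compute phase: works on the local buffer only, never on `data`.
        local = [x * factor for x in local]
        if trace is not None:
            trace.append("compute")

        # Memory phase: write the results back under the same arbitration.
        with dram_lock:
            data[start:start + len(local)] = local
            if trace is not None:
                trace.append("store")
    return data
```

Passing a list as `trace` records the order of phases, which makes it easy to check that every shared-memory access sits inside an arbitrated phase and that the compute phase never touches shared data.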

HePREM: Enabling predictable GPU execution on heterogeneous SoC / Forsberg, Bjorn; Benini, Luca; Marongiu, Andrea. - Print. - 2018-:(2018), pp. 539-544. (2018 Design, Automation and Test in Europe Conference and Exhibition, DATE 2018, International Congress Center Dresden, Germany, 2018) [10.23919/DATE.2018.8342066].

Publication year: 2018
Conference title: 2018 Design, Automation and Test in Europe Conference and Exhibition, DATE 2018
Conference venue: International Congress Center Dresden, Germany
Conference date: 2018
DOI: https://dx.doi.org/10.23919/DATE.2018.8342066
WoS code: WOS:000435148800097
Scopus code: 2-s2.0-85048987303
Series: PROCEEDINGS - DESIGN, AUTOMATION, AND TEST IN EUROPE CONFERENCE AND EXHIBITION
Volume no.: 2018-
First page: 539
Last page: 544
All authors: Forsberg, Bjorn; Benini, Luca; Marongiu, Andrea
Type: Conference proceedings paper

Files in this record:

File	Access	Type	Size	Format
2018-03-DATE-HePREM-CameraReady-IEEE-Compatible.pdf	Open access	AAM - Author's version, revised and accepted for publication	5.66 MB	Adobe PDF


The metadata in IRIS UNIMORE are released under the Creative Commons CC0 1.0 Universal license, while the publication files are released under the Attribution 4.0 International (CC BY 4.0) license, unless otherwise indicated.