In this paper we address the issue of efficient doall workload distribution on a embedded 3D MPSoC. 3D stacking technology enables low latency and high bandwidth access to multiple, large memory banks in close spatial proximity. In our implementation one silicon layer contains multiple processors, whereas one or more DRAM layers on top host a NUMA memory subsystem. To obtain high locality and balanced workload we consider a two-step approach. First, a compiler pass analyzes memory references in a loop and schedules each iteration to the processor owning the most frequently accessed data. Second, if locality-aware loop parallelization has generated unbalanced workload we allow idle processors to execute part of the remaining work from neighbors by implementing runtime support for work stealing.

Vertical stealing: robust, locality-aware do-all workload distribution for 3D MPSoCs / Marongiu, A.; Burgio, P.; Benini, L.. - (2010), pp. 207-216. (Intervento presentato al convegno CASES '10, the 2010 international conference on Compilers, architectures and synthesis for embedded systems tenutosi a Scottsdale, AZ (USA) nel Oct 24-29, 2010) [10.1145/1878921.1878952].

Vertical stealing: robust, locality-aware do-all workload distribution for 3D MPSoCs

A. MARONGIU;P. BURGIO;
2010

Abstract

In this paper we address the issue of efficient doall workload distribution on a embedded 3D MPSoC. 3D stacking technology enables low latency and high bandwidth access to multiple, large memory banks in close spatial proximity. In our implementation one silicon layer contains multiple processors, whereas one or more DRAM layers on top host a NUMA memory subsystem. To obtain high locality and balanced workload we consider a two-step approach. First, a compiler pass analyzes memory references in a loop and schedules each iteration to the processor owning the most frequently accessed data. Second, if locality-aware loop parallelization has generated unbalanced workload we allow idle processors to execute part of the remaining work from neighbors by implementing runtime support for work stealing.
2010
CASES '10, the 2010 international conference on Compilers, architectures and synthesis for embedded systems
Scottsdale, AZ (USA)
Oct 24-29, 2010
207
216
Marongiu, A.; Burgio, P.; Benini, L.
Vertical stealing: robust, locality-aware do-all workload distribution for 3D MPSoCs / Marongiu, A.; Burgio, P.; Benini, L.. - (2010), pp. 207-216. (Intervento presentato al convegno CASES '10, the 2010 international conference on Compilers, architectures and synthesis for embedded systems tenutosi a Scottsdale, AZ (USA) nel Oct 24-29, 2010) [10.1145/1878921.1878952].
File in questo prodotto:
File Dimensione Formato  
Vertical stealing_robust, locality-aware do-all workload distribution for 3D MPSoCs.pdf

Accesso riservato

Dimensione 1.48 MB
Formato Adobe PDF
1.48 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1171910
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
social impact