Modern embedded MPSoC designs increasingly couple hardware accelerators to processing cores to trade between energy efficiency and platform specialization. To assist effective design of such systems there is the need on one hand for clear methodologies to streamline accelerator definition and instantiation, on the other for architectural templates and runtime techniques that minimize processors-to-accelerator communication costs. In this paper we present an architecture featuring tightly-coupled processors and accelerators, with zerocopy communication. Efficient programming is supported by an extended OpenMP programming model, where custom directives allow to specialize code regions for execution on parallel cores, accelerators, or a mix of the two. Our integrated approach enables fast yet accurate exploration of accelerator-based HW and SW architectures.

OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters / Burgio, Paolo; Marongiu, Andrea; D., Heller; C., Chavet; P., Coussy; Benini, Luca. - ELETTRONICO. - (2012), pp. 751-758. (Intervento presentato al convegno 15th Euromicro Conference on Digital Systems Design tenutosi a Cesme, Izmir, Turkey nel September 5th - 8th, 2012) [10.1109/DSD.2012.97].

OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters

BURGIO, PAOLO;MARONGIU, ANDREA;
2012

Abstract

Modern embedded MPSoC designs increasingly couple hardware accelerators to processing cores to trade between energy efficiency and platform specialization. To assist effective design of such systems there is the need on one hand for clear methodologies to streamline accelerator definition and instantiation, on the other for architectural templates and runtime techniques that minimize processors-to-accelerator communication costs. In this paper we present an architecture featuring tightly-coupled processors and accelerators, with zerocopy communication. Efficient programming is supported by an extended OpenMP programming model, where custom directives allow to specialize code regions for execution on parallel cores, accelerators, or a mix of the two. Our integrated approach enables fast yet accurate exploration of accelerator-based HW and SW architectures.
2012
15th Euromicro Conference on Digital Systems Design
Cesme, Izmir, Turkey
September 5th - 8th, 2012
751
758
Burgio, Paolo; Marongiu, Andrea; D., Heller; C., Chavet; P., Coussy; Benini, Luca
OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters / Burgio, Paolo; Marongiu, Andrea; D., Heller; C., Chavet; P., Coussy; Benini, Luca. - ELETTRONICO. - (2012), pp. 751-758. (Intervento presentato al convegno 15th Euromicro Conference on Digital Systems Design tenutosi a Cesme, Izmir, Turkey nel September 5th - 8th, 2012) [10.1109/DSD.2012.97].
File in questo prodotto:
File Dimensione Formato  
OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters.pdf

Accesso riservato

Dimensione 477.81 kB
Formato Adobe PDF
477.81 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1171883
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 15
  • ???jsp.display-item.citation.isi??? ND
social impact