OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters / Burgio, Paolo; Marongiu, Andrea; Heller, D.; Chavet, C.; Coussy, P.; Benini, Luca. - Electronic. - (2012), pp. 751-758. (Paper presented at the 15th Euromicro Conference on Digital Systems Design, held in Cesme, Izmir, Turkey, September 5th - 8th, 2012) [10.1109/DSD.2012.97].
OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters
BURGIO, PAOLO; MARONGIU, ANDREA
2012
Abstract
Modern embedded MPSoC designs increasingly couple hardware accelerators to processing cores to trade off energy efficiency against platform specialization. Designing such systems effectively requires, on the one hand, clear methodologies to streamline accelerator definition and instantiation and, on the other, architectural templates and runtime techniques that minimize processor-to-accelerator communication costs. In this paper we present an architecture featuring tightly coupled processors and accelerators with zero-copy communication. Efficient programming is supported by an extended OpenMP programming model, in which custom directives allow code regions to be specialized for execution on parallel cores, accelerators, or a mix of the two. Our integrated approach enables fast yet accurate exploration of accelerator-based HW and SW architectures.