Cluster-based architectures are increasingly being adopted to design embedded many-cores. These platforms can deliver very high peak performance within a contained power envelope, provided that programmers can make effective use the available parallel cores. This is becoming an extremely difficult task, as embedded applications are growing in complexity and exhibit irregular and dynamic parallelism. The OpenMP tasking extensions represent a powerful abstraction to capture this form of parallelism. However, efficiently supporting it on cluster-based embedded SoCs is not easy, because the fine-grained parallel workload present in embedded applications can not tolerate high memory and run-time overheads. In this paper we present our design of the runtime support layer to OpenMP tasking for an embedded shared memory cluster, identifying key aspects to achieving performance and discussing important architectural support to removing major bottlenecks.

Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters / Burgio, P., Tagliavini, G., Marongiu, A., Benini, L.. - STAMPA. - (2013), pp. 1504-1509. (16th Design, Automation and Test in Europe Conference and Exhibition, DATE 2013 Grenoble, fra 18-22 March 2013) [10.7873/DATE.2013.306].

Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters

Paolo Burgio;Andrea Marongiu;
2013

Abstract

Cluster-based architectures are increasingly being adopted to design embedded many-cores. These platforms can deliver very high peak performance within a contained power envelope, provided that programmers can make effective use the available parallel cores. This is becoming an extremely difficult task, as embedded applications are growing in complexity and exhibit irregular and dynamic parallelism. The OpenMP tasking extensions represent a powerful abstraction to capture this form of parallelism. However, efficiently supporting it on cluster-based embedded SoCs is not easy, because the fine-grained parallel workload present in embedded applications can not tolerate high memory and run-time overheads. In this paper we present our design of the runtime support layer to OpenMP tasking for an embedded shared memory cluster, identifying key aspects to achieving performance and discussing important architectural support to removing major bottlenecks.
2013
Inglese
16th Design, Automation and Test in Europe Conference and Exhibition, DATE 2013
Grenoble, fra
18-22 March 2013
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013
1504
1509
6
9781467350716
2013 IEEE Conference Proceedings
STATI UNITI D'AMERICA
Cluster-based architectures; OpenMP tasking; shared memory cluster
Burgio, Paolo; Tagliavini, Giuseppe; Marongiu, Andrea; Benini, Luca
Atti di CONVEGNO::Relazione in Atti di Convegno
273
4
Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters / Burgio, P., Tagliavini, G., Marongiu, A., Benini, L.. - STAMPA. - (2013), pp. 1504-1509. (16th Design, Automation and Test in Europe Conference and Exhibition, DATE 2013 Grenoble, fra 18-22 March 2013) [10.7873/DATE.2013.306].
reserved
info:eu-repo/semantics/conferenceObject
File in questo prodotto:
File Dimensione Formato  
Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters.pdf

Accesso riservato

Dimensione 660.03 kB
Formato Adobe PDF
660.03 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1171849
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 12
social impact