Modern computer vision and image processing embedded systems exploit hardware acceleration inside scalable parallel architectures, such as tightly-coupled clusters, to achieve stringent performance and energy efficiency targets. Architectural heterogeneity typically makes software development cumbersome, thus shared memory processor-to-accelerator communication is typically preferred to simplify code offioading to HW IPs for critical computational kernels. However, tightly coupling a large number of accelerators and processors in a shared memory cluster is a challenging task, since the complexity of the resulting system quickly becomes too large. We tackle these issues by proposing a template of heterogeneous shared memory cluster which scales to a large number of accelerators, achieving up to 40% better performance/area/watt than simply designing larger main interconnects to accommodate several HW IPs. In addition, following a trend towards standardization of acceleration capabilities of future embedded systems, we develop a programming model which simplifies application development for heterogeneous clusters.

Architecture and programming model support for efficient heterogeneous computing on tigthly-coupled shared-memory clusters / Burgio, P.; Marongiu, A.; Danilo, R.; Coussy, P.; Benini, L.. - STAMPA. - (2013), pp. 22-29. (Intervento presentato al convegno Design and Architectures for Signal and Image Processing (DASIP) tenutosi a Cagliari, Italy nel 8-10 Oct. 2013).

Architecture and programming model support for efficient heterogeneous computing on tigthly-coupled shared-memory clusters

Burgio P.;Marongiu A.;
2013

Abstract

Modern computer vision and image processing embedded systems exploit hardware acceleration inside scalable parallel architectures, such as tightly-coupled clusters, to achieve stringent performance and energy efficiency targets. Architectural heterogeneity typically makes software development cumbersome, thus shared memory processor-to-accelerator communication is typically preferred to simplify code offioading to HW IPs for critical computational kernels. However, tightly coupling a large number of accelerators and processors in a shared memory cluster is a challenging task, since the complexity of the resulting system quickly becomes too large. We tackle these issues by proposing a template of heterogeneous shared memory cluster which scales to a large number of accelerators, achieving up to 40% better performance/area/watt than simply designing larger main interconnects to accommodate several HW IPs. In addition, following a trend towards standardization of acceleration capabilities of future embedded systems, we develop a programming model which simplifies application development for heterogeneous clusters.
2013
Design and Architectures for Signal and Image Processing (DASIP)
Cagliari, Italy
8-10 Oct. 2013
22
29
Burgio, P.; Marongiu, A.; Danilo, R.; Coussy, P.; Benini, L.
Architecture and programming model support for efficient heterogeneous computing on tigthly-coupled shared-memory clusters / Burgio, P.; Marongiu, A.; Danilo, R.; Coussy, P.; Benini, L.. - STAMPA. - (2013), pp. 22-29. (Intervento presentato al convegno Design and Architectures for Signal and Image Processing (DASIP) tenutosi a Cagliari, Italy nel 8-10 Oct. 2013).
File in questo prodotto:
File Dimensione Formato  
Architecture and programming model support for efficient heterogeneous computing on tigthly-coupled shared-memory clusters.pdf

Accesso riservato

Dimensione 1.32 MB
Formato Adobe PDF
1.32 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1171879
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact