Optimizing memory bandwidth exploitation for OpenVX applications on embedded many-core accelerators