Controlling NUMA effects in embedded manycore applications with lightweight nested parallelism support