Hi,
I'm currently experiencing an error after invoking a large number of the same OpenCL kernels. The OpenCL kernel itself work fine when run on its own. However I need the host program to launch this kernel repeatedly, one after the other.
When I run the host program with a low number of iterations eg 10 (there is 1 kernel invocation per iteration) things work fine. Similarly when I increase the iteration count to 10,000 everything works.
However at 100,000 iterations the program crashes with an "Out of host memory" error. I believe it crashes at around the 20,000th kernel invocation.
I've removed everything from my host program that may growing (in terms of memory footprint), however I still get the same problem.
Does the Altera OpenCL save the kernel or buffer object or similar during its execution. I'm just trying to understand what might be causing this problem.
A possible work around I can think off is to add the iteration loop to the kernel (fpga side) itself however I'd rather not have to do this. Therefore I'd like to understand more about why this problem is occurring before I go down this route.
I'd be grateful for any assistance solving this problem.
Many thanks
I'm currently experiencing an error after invoking a large number of the same OpenCL kernels. The OpenCL kernel itself work fine when run on its own. However I need the host program to launch this kernel repeatedly, one after the other.
When I run the host program with a low number of iterations eg 10 (there is 1 kernel invocation per iteration) things work fine. Similarly when I increase the iteration count to 10,000 everything works.
However at 100,000 iterations the program crashes with an "Out of host memory" error. I believe it crashes at around the 20,000th kernel invocation.
I've removed everything from my host program that may growing (in terms of memory footprint), however I still get the same problem.
Does the Altera OpenCL save the kernel or buffer object or similar during its execution. I'm just trying to understand what might be causing this problem.
A possible work around I can think off is to add the iteration loop to the kernel (fpga side) itself however I'd rather not have to do this. Therefore I'd like to understand more about why this problem is occurring before I go down this route.
I'd be grateful for any assistance solving this problem.
Many thanks