Notes on work mostly hosted at: my personal GitHub and GitHub of APC LLC

all | popular | tags | rss

Improving CUDA profiler output of the MPI-CUDA program

Consider we need to profile the following MPI-CUDA program on GPU cluster. The most obvious way to profile this code on console-only cluster would be to invoke th...

Continue Reading »

One non-obvious reason of 'Illegal instruction' in GPU code

If cuda-gdb throws Program received signal CUDA_EXCEPTION_4, Warp Illegal Instruction. for the following code line:

Continue Reading »
« Newer Posts Page 4 of 4