News and working notes

Calling CUDA device function from OpenACC Fortran kernel

OpenACC is known to be a fast method of developing quite efficient GPU-enabled applications. It is also possible to mix CUDA kernels and libraries with OpenACC ke...

International Summer Supercomputing Academy at the Lomonosov Moscow State University

Applied Parallel Computing LLC gave a talk at the International Summer Supercomputing Academy, Lomonosov Moscow State University, Moscow.

Jetson K1: bandwidthTest

The chart shows the bandwidths of memory transfers on Jetson K1. For the baseline we also added GTX680M’s host-device and device-host (...

Jetson K1: from unboxing straight to CUDA in 5 steps

We finally got the most wanted Jetson K1 board in the house! In this post we show how to turn a freshly unboxed tiny board into a fully functional CUDA development nod...

Training on High Performance Computing on GPUs at Perm State University

Applied Parallel Computing LLC delivered a GPU computing training session at the “High Performance Computing on GPUs” conference, Perm State University, Russia.

How to break Ubuntu 13.04/14.04 with vanilla CUDA driver and unbreak it back

After installing the CUDA driver from the NVIDIA website, the Ubuntu 13.04/14.04 window manager decorations (Unity, via Compiz) may stop working properly on Optimus machines ...

Software Engineering, CUDA

CSCS-USI Summer School 2014

Applied Parallel Computing LLC will participate in CSCS-USI Summer School, to be held from 30 June to 10 July at Hotel Serpiano (Ticino, Switzerland). We will giv...

Improving CUDA profiler output of the MPI-CUDA program

Suppose we need to profile the following MPI-CUDA program on a GPU cluster. The most obvious way to profile this code on a console-only cluster would be to invoke th...

Software Engineering, CUDA, MPI

One non-obvious reason of 'Illegal instruction' in GPU code

If cuda-gdb throws Program received signal CUDA_EXCEPTION_4, Warp Illegal Instruction. for the following code line:

Software Engineering, CUDA

HPC Advisory Council Switzerland

Together with our colleagues from CSCS we presented introductory talks on GPU and CUDA on the first day of HPC Advisory Council Switzerland.
