Description: The code mainly implements the single-thread, multithreaded method of calculating π, runs on the cluster, compares the time. Platform: |
Size: 1024 |
Author:巩晨星 |
Hits:
Description: In parallel computing, two methods of matrix multiplication, MPI and CUDA, have been tested and output with results Platform: |
Size: 1024 |
Author:张艾可
|
Hits:
Description: Skeletons and solutions for hands-on CUDA codes, they are listed as the followings: cudaMallocAndMemcpy myFirstKernel reverseArray_singleblock reverseArray_multiblock reverseArray_multiblock_fast Platform: |
Size: 2434048 |
Author:p-yang
|
Hits:
Description: Copy between host and device -- start with the cudaMallocAndMemcpy template. The first part allocates memory for the indexes d_a and d_b on the device. The second part: copy the h_a on the host to the d_a on the device. Platform: |
Size: 6144 |
Author:p-yang
|
Hits: