Search results: cuda matrix
Description: This MEX file performs 2D bilinear interpolation on an NVIDIA graphics chipset. To compile and run it, you need the NVIDIA CUDA Toolkit (http://www.nvidia.com/object/cuda_get.html) and an NVIDIA graphics card of reasonably modern vintage.
BUILDING INSTRUCTIONS: Set the 'MATLAB' (and, if necessary, 'MEX') variables in the Makefile to appropriate values, then run 'make' at a prompt to create the MEX binary (mex/mexmac/mexmaci/dll?).
This code uses the GPU's built-in bilinear texture interpolation and is very fast. For reasonably sized operations (say, taking a 50x50 matrix up to 1000x1000), the CUDA-based code is 5-10x faster than interp2 with linear interpolation (as tested on a 2.4GHz C2D MBP with a GeForce 8600M GT).
With very (VERY) large matrices, however, it can crash your computer completely or give bizarre results. Be careful!
Platform: |
Size: 37881 |
Author: whitewalter |
Hits:
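For reference, the bilinear interpolation that the entry above hands to the GPU's texture hardware can be sketched on the CPU. This is a minimal pure-Python sketch, not the package's code. The function name bilinear_resize is hypothetical, and the coordinate convention (output corners mapped onto input corners) is an assumption; texture hardware may sample at texel centers instead, which shifts the results slightly.

```python
def bilinear_resize(src, out_rows, out_cols):
    """Resample a 2D list of floats to out_rows x out_cols (CPU reference).

    Assumption: output corners coincide with input corners.
    """
    in_rows, in_cols = len(src), len(src[0])
    out = []
    for i in range(out_rows):
        # Position of output row i on the input grid.
        y = i * (in_rows - 1) / (out_rows - 1) if out_rows > 1 else 0.0
        y0 = min(int(y), in_rows - 1)
        y1 = min(y0 + 1, in_rows - 1)  # clamp at the last row
        fy = y - y0
        row = []
        for j in range(out_cols):
            # Position of output column j on the input grid.
            x = j * (in_cols - 1) / (out_cols - 1) if out_cols > 1 else 0.0
            x0 = min(int(x), in_cols - 1)
            x1 = min(x0 + 1, in_cols - 1)  # clamp at the last column
            fx = x - x0
            # Blend horizontally on the two neighbouring rows, then vertically.
            top = src[y0][x0] * (1 - fx) + src[y0][x1] * fx
            bottom = src[y1][x0] * (1 - fx) + src[y1][x1] * fx
            row.append(top * (1 - fy) + bottom * fy)
        out.append(row)
    return out


if __name__ == "__main__":
    small = [[0.0, 2.0], [4.0, 6.0]]
    for line in bilinear_resize(small, 3, 3):
        print(line)
```

A loop like this does four reads and three blends per output element, which is the work the texture unit performs in hardware; a CPU reference of this kind is mainly useful for checking GPU output on small inputs.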
Description: A matrix multiplication example developed on the CUDA platform for the latest generation of general-purpose GPU, the GeForce 8800. Performance is much higher than that of the CPU version. The executable is included.
Platform: |
Size: 1572864 |
Author: 白洪涛 |
Hits:
Description: Matrix multiplication implemented with the general-purpose GPU computing API (CUDA).
Platform: |
Size: 15360 |
Author: lry |
Hits:
Description: Matrix operations in the CUDA development environment, taken from the CUDA SDK. The matrix operations clearly show the advantages of parallel programming and are a good reference for learning CUDA programming.
Platform: |
Size: 1459200 |
Author: 陈思远 |
Hits:
Description: A parallel matrix transpose developed on the CUDA platform, taken from the CUDA SDK. Small matrix transpose with CUDA (example for a 16x16 matrix).
Platform: |
Size: 9216 |
Author: 陈思远 |
Hits:
Description: Faster matrix-vector multiplication with CUDA on the GeForce 8800 GTX.
Platform: |
Size: 1024 |
Author: bertrand |
Hits:
Description: CUDA-based high-dimensional matrix operations 3, a computer science graduation project.
Platform: |
Size: 409600 |
Author: 范吴斌 |
Hits:
Description: CUDA matrix multiplication. Three optimization schemes are included: loop unrolling, data prefetching, and thread granularity.
Platform: |
Size: 15360 |
Author: mghs |
Hits:
Description: A CUDA program for parallel matrix multiplication.
Platform: |
Size: 1589248 |
Author: teena |
Hits:
Description: Fast CUDA matrix multiplication using shared memory.
Platform: |
Size: 749568 |
Author: Nikolay |
Hits:
Description: A program for matrix multiplication with CUDA.
Platform: |
Size: 10240 |
Author: Elena |
Hits:
Description: An example of matrix multiplication on a CUDA GPU.
Platform: |
Size: 2048 |
Author: aviad |
Hits:
Description: A CUDA matrix operation program that multiplies two matrices on CUDA.
Platform: |
Size: 8192 |
Author: test |
Hits:
Description: 23 examples, including matrix multiplication, transpose, fast Fourier transform, and multi-GPU programming.
Platform: |
Size: 4912128 |
Author: weidong |
Hits:
Description: Source code for CUDA matrix multiplication. (Note: requires the CUDA platform.)
Platform: |
Size: 801792 |
Author: zhangzinan |
Hits:
Description: Two methods of matrix multiplication in parallel computing, MPI and CUDA. Both have been tested and print their results.
Platform: |
Size: 1024 |
Author: 张艾可 |
Hits:
Description: Matrix multiplication on a CUDA GPU.
Platform: |
Size: 12288 |
Author: mahmoudk |
Hits:
Description: Sparse matrix-vector multiplication (SpMV), y = Ax, is widely used in many scientific computing programs. The data matrix A is sparse; the input vector x and the output vector y are dense.
Platform: |
Size: 21504 |
Author: 甄文明 |
Hits:
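The SpMV operation y = Ax described in the entry above can be sketched as follows. This is a minimal CPU reference in pure Python, not the code from the download. It assumes A is stored in compressed sparse row (CSR) form, which the listing does not specify; the function name spmv_csr and the array names values, col_idx, and row_ptr are illustrative.

```python
def spmv_csr(values, col_idx, row_ptr, x):
    """Compute y = A x for a sparse A in CSR form and a dense x.

    values  : nonzero entries of A, row by row
    col_idx : column index of each entry in values
    row_ptr : row r occupies values[row_ptr[r]:row_ptr[r + 1]]
    """
    num_rows = len(row_ptr) - 1
    y = [0.0] * num_rows
    for row in range(num_rows):
        acc = 0.0
        # Only the stored nonzeros of this row contribute to y[row].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


if __name__ == "__main__":
    # A = [[1, 0, 2],
    #      [0, 0, 3],
    #      [4, 5, 0]]
    values = [1.0, 2.0, 3.0, 4.0, 5.0]
    col_idx = [0, 2, 2, 0, 1]
    row_ptr = [0, 2, 3, 5]
    print(spmv_csr(values, col_idx, row_ptr, [1.0, 2.0, 3.0]))
```

Each output row depends only on its own slice of values, so the rows are independent; a GPU version would typically assign one thread (or a group of threads) per row, though how the listed program parallelizes the work is not stated.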
Description: GPU programming with CUDA that computes the inverse of a given matrix.
Platform: |
Size: 5841920 |
Author: 呆呆呆张 |
Hits:
Description: Hands-on sample programs to accompany high-performance GPU programming, including code for matrix multiplication, atomic operations, heat conduction, multiple GPUs, and multiple streams.
Platform: |
Size: 305152 |
Author: zjujie |
Hits: