Friday, 25 March 2016

TILED Matrix Multiplication Using Shared Memory in CUDA

Tiled Matrix Multiplication using Shared Memory in CUDA
Tiled Matrix Multiplication in CUDA
 Today, I am going to discuss Matrix Multiplication in CUDA. In CUDA, number of memories are present. As we have already discussed about the same in previous post "What is CUDA". Matrix Multiplication is very basic but a crucial algorithm in the field of Engineering & Computer Science. I assumed that one who is reading this post knows how to perform Matrix Multiplication in at least one programming language. (C, C++, Python, etc).