SPM_GPU

Project for Programming Massively Parallel Processors

Include the following modules before compiling on octopus:
module load cuda
module load gcc/10.1.0

Progress:
GPU0: Done
GPU1: Done (not Efficient)
GPU2: DONE -- Optimized <<=== most efficient
GPU3: Done
GPU4: TODO

Our Results for matrix 3 (126 ms)
![Results.png](https://github.com/chriskhalil/SPM_GPU/blob/28f30831a0deb00317e76b385c3ea65616f6ff1d/Results.png)

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
Results.png		Results.png
Utility.cu		Utility.cu
Utility.cuh		Utility.cuh
common.h		common.h
instructions		instructions
kernel0.cu		kernel0.cu
kernel1.cu		kernel1.cu
kernel2.cu		kernel2.cu
kernel3.cu		kernel3.cu
kernel4.cu		kernel4.cu
main.cu		main.cu
matrix.cu		matrix.cu
matrix.h		matrix.h
timer.h		timer.h

Provide feedback