Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores, via the WMMA API and MMA PTX instructions.
Updated Sep 8, 2024 · Cuda
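For context, the WMMA API exposes Tensor Cores through warp-level fragment types. The following is a minimal illustrative sketch (not the repository's optimized kernels): one warp computes a single 16x16 tile of C = A * B, with A, B, and C assumed to be row-major half-precision matrices of size 16x16.

```cuda
#include <mma.h>
#include <cuda_fp16.h>
using namespace nvcuda;

// One warp computes a single 16x16x16 HGEMM tile on Tensor Cores via WMMA.
// Launch with one warp, e.g. wmma_hgemm_16x16x16<<<1, 32>>>(dA, dB, dC);
__global__ void wmma_hgemm_16x16x16(const half *A, const half *B, half *C) {
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, half> c_frag;

    wmma::fill_fragment(c_frag, __float2half(0.0f)); // zero the accumulator
    wmma::load_matrix_sync(a_frag, A, 16);           // leading dimension = 16
    wmma::load_matrix_sync(b_frag, B, 16);
    wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);  // C += A * B on Tensor Cores
    wmma::store_matrix_sync(C, c_frag, 16, wmma::mem_row_major);
}
```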
Multiple GEMM operators are constructed with CUTLASS to support LLM inference.
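As a rough sketch of what a CUTLASS-based operator looks like, the snippet below instantiates a device-level Tensor Core HGEMM with CUTLASS's `cutlass::gemm::device::Gemm` template. The layouts, the SM80 target, and the names `M`, `N`, `K`, `d_A`, `d_B`, `d_C`, `alpha`, `beta` are illustrative assumptions, not the repository's actual operator set.

```cuda
#include <cutlass/gemm/device/gemm.h>

// fp16 inputs/outputs, fp32 accumulation, Tensor Core math on SM80.
using Gemm = cutlass::gemm::device::Gemm<
    cutlass::half_t, cutlass::layout::RowMajor,     // A: M x K
    cutlass::half_t, cutlass::layout::ColumnMajor,  // B: K x N
    cutlass::half_t, cutlass::layout::RowMajor,     // C: M x N
    float,                                          // accumulator
    cutlass::arch::OpClassTensorOp,                 // use Tensor Cores
    cutlass::arch::Sm80>;

cutlass::Status run_hgemm(int M, int N, int K,
                          cutlass::half_t const *d_A, cutlass::half_t const *d_B,
                          cutlass::half_t *d_C, float alpha, float beta) {
    Gemm gemm_op;
    return gemm_op({{M, N, K},
                    {d_A, K},       // leading dimension of row-major A
                    {d_B, K},       // leading dimension of column-major B
                    {d_C, N},       // C source
                    {d_C, N},       // D destination (written in place)
                    {alpha, beta}}); // D = alpha * A*B + beta * C
}
```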
Several common methods of matrix multiplication are implemented on the CPU and on NVIDIA GPUs using C++11 and CUDA.
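A minimal sketch of the naive GPU baseline such comparisons usually start from: one thread per output element of C = A * B, all matrices row-major single precision. Kernel and variable names are illustrative, not taken from the repository.

```cuda
// Naive GEMM: each thread computes one element of C (M x N) = A (M x K) * B (K x N).
__global__ void sgemm_naive(int M, int N, int K,
                            const float *A, const float *B, float *C) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < M && col < N) {
        float acc = 0.0f;
        for (int k = 0; k < K; ++k)
            acc += A[row * K + k] * B[k * N + col]; // dot product of row and column
        C[row * N + col] = acc;
    }
}
```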
Uses Tensor Cores to compute back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instructions.
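The core building block of such kernels is the warp-level `mma.sync` PTX instruction. Below is a minimal device-side wrapper around the fp16 m16n8k16 variant (requires SM80 or newer); it assumes the A, B, and C fragments have already been loaded into registers (e.g. via `ldmatrix`), and register counts follow the PTX ISA (A: 4 x .b32, B: 2 x .b32, C/D: 2 x .b32). This is an illustrative sketch, not the repository's fused back-to-back kernel.

```cuda
#include <cstdint>

// D = A * B + C for one m16n8k16 tile, fp16 inputs and fp16 accumulation.
__device__ void mma_m16n8k16_fp16(uint32_t *d, const uint32_t *a,
                                  const uint32_t *b, const uint32_t *c) {
    asm volatile(
        "mma.sync.aligned.m16n8k16.row.col.f16.f16.f16.f16 "
        "{%0, %1}, {%2, %3, %4, %5}, {%6, %7}, {%8, %9};\n"
        : "=r"(d[0]), "=r"(d[1])
        : "r"(a[0]), "r"(a[1]), "r"(a[2]), "r"(a[3]),
          "r"(b[0]), "r"(b[1]),
          "r"(c[0]), "r"(c[1]));
}
```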
A C library for matrix calculations.