#

reduced-precision

Here is 1 public repository matching this topic...

KernelTuner / kernel_float

CUDA/HIP header-only library for writing vectorized and low-precision (16 bit, 8 bit) GPU kernels

performance cpp gpu cuda kernel-tuner hip vectorization floating-point half-precision mixed-precision low-precision bfloat16 header-only-library reduced-precision

Updated Apr 11, 2025
C++

Improve this page

Add a description, image, and links to the reduced-precision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reduced-precision topic, visit your repo's landing page and select "manage topics."