Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      0000Updated Feb 14, 2025Feb 14, 2025
    • CSS
      MIT License
      0000Updated Feb 9, 2025Feb 9, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.8k000Updated Jan 21, 2025Jan 21, 2025
    • For benchmarking the Roller
      C++
      MIT License
      1000Updated Dec 22, 2024Dec 22, 2024
    • Creative Commons Zero v1.0 Universal
      0000Updated Dec 19, 2024Dec 19, 2024
    • Python
      0000Updated Dec 15, 2024Dec 15, 2024
    • Open deep learning compiler stack for cpu, gpu and specialized accelerators
      Python
      Apache License 2.0
      3.5k000Updated Dec 11, 2024Dec 11, 2024
    • This repository contains the figures, tables and source code in the ICS'24 paper: "Accelerated Auto-Tuning of GPU Kernels for Tensor Computations".
      Python
      1820Updated Dec 5, 2024Dec 5, 2024
    • ics24tvm

      Public
      This repository contains the source code in the ICS'24 paper: "Accelerated Auto-Tuning of GPU Kernels for Tensor Computations".
      Python
      Apache License 2.0
      1010Updated Dec 5, 2024Dec 5, 2024
    • STeF

      Public
      C++
      0000Updated Nov 14, 2024Nov 14, 2024
    • CoNST

      Public
      C++
      0400Updated Aug 20, 2024Aug 20, 2024
    • tvm

      Public
      Open deep learning compiler stack for cpu, gpu and specialized accelerators
      Python
      Apache License 2.0
      3.5k200Updated Jul 6, 2024Jul 6, 2024
    • tvm-auto

      Public
      Focus on autoTVM
      Python
      Apache License 2.0
      3.5k000Updated Mar 3, 2024Mar 3, 2024
    • perf-char

      Public
      Performance characterization of auto-tuning data.
      Python
      0000Updated Feb 26, 2024Feb 26, 2024
    • ytopt

      Public
      ytopt: machine-learning-based search methods for autotuning
      Python
      BSD 2-Clause "Simplified" License
      17000Updated Aug 28, 2023Aug 28, 2023
    • All Benchmarks in single place which are ran using HPTD.
      Cuda
      0100Updated Jul 21, 2023Jul 21, 2023
    • GNN-RDM

      Public
      Python
      Other
      17000Updated Jul 14, 2023Jul 14, 2023
    • A retargetable MLIR-based machine learning compiler and runtime toolkit.
      C++
      Apache License 2.0
      658000Updated Jun 29, 2023Jun 29, 2023
    • A fork of LLVM to carry temporary patches for the IREE project
      Other
      13000Updated Jun 29, 2023Jun 29, 2023
    • C++
      BSD 4-Clause "Original" or "Old" License
      0300Updated May 14, 2023May 14, 2023
    • iree

      Public
      👻
      C++
      Apache License 2.0
      658000Updated Mar 1, 2023Mar 1, 2023
    • C++
      Other
      1001Updated Nov 20, 2022Nov 20, 2022
    • AE-PACT

      Public
      Cuda
      0000Updated Aug 15, 2022Aug 15, 2022
    • Useful tutorials and recipes for developers doing low-level work with the Graphcore IPU
      C++
      MIT License
      10000Updated Jul 7, 2022Jul 7, 2022
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      C++
      Other
      23k100Updated Mar 22, 2022Mar 22, 2022
    • Slides & posters of published papers
      BSD 4-Clause "Original" or "Old" License
      1000Updated Mar 16, 2022Mar 16, 2022
    • TLCBench

      Public
      Benchmark scripts for TVM
      Python
      29000Updated Mar 15, 2022Mar 15, 2022
    • Tensordot

      Public
      Code generator for tensor contraction
      Python
      MIT License
      7000Updated Feb 23, 2022Feb 23, 2022
    • GPU Implementation of Decomposed CNN by Tensor Networks
      Cuda
      GNU Lesser General Public License v3.0
      3000Updated Feb 12, 2022Feb 12, 2022
    • nalgebra

      Public
      Linear algebra library for Rust.
      Rust
      Apache License 2.0
      492000Updated Jan 27, 2022Jan 27, 2022