    Repositories list

    • State-of-the-Art Text Embeddings
      Python
      Apache License 2.0
      2.5k000 Updated Dec 13, 2024
    • Python
      0120 Updated Dec 12, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      27k000 Updated Dec 5, 2024
    • AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)
      Python
      Apache License 2.0
      72176 Updated Dec 4, 2024
    • llama.cpp
      Public
      Ampere optimized llama.cpp
      Python
      0841 Updated Nov 7, 2024
    • 0000 Updated Oct 16, 2024
    • Llama3-8B scale out scripts
      Python
      0000 Updated Sep 7, 2024
    • Shell
      1110 Updated Aug 28, 2024
    • Scripts to reproduce AI results on AmpereOne platform.
      Jupyter Notebook
      0100 Updated Aug 19, 2024
    • Fork of tensorflow serving for ARM64 build
      C++
      Apache License 2.0
      2200 Updated Jul 31, 2024
    • local-rag
      Public
      Python
      1000 Updated May 22, 2024
    • Integrating Ampere's high performance LLM inference with popular application building frameworks in the industry
      Python
      Apache License 2.0
      1030 Updated May 22, 2024
    • Shell
      Apache License 2.0
      1000 Updated May 16, 2024
    • whisper
      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Python
      MIT License
      8.8k001 Updated Apr 25, 2024
    • Python
      MIT License
      0101 Updated Apr 16, 2024
    • AutoGPTQ
      Public
      An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
      Python
      MIT License
      491110 Updated Mar 13, 2024
    • Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low latency across Computer Vision, Object Detection, Natural Language Processing and Generative AI models.
      Jupyter Notebook
      Other
      7000 Updated Mar 11, 2024
    • Python
      3520 Updated Mar 5, 2024
    • LlamaIndex is a data framework for your LLM applications
      Python
      MIT License
      5.4k000 Updated Mar 4, 2024
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      27k000 Updated Dec 16, 2023
    • Stable Diffusion web UI
      Python
      GNU Affero General Public License v3.0
      27k000 Updated Nov 24, 2023
    • High-Resolution Image Synthesis with Latent Diffusion Models
      Python
      MIT License
      5.1k000 Updated Nov 24, 2023
    • images
      Public
      0000 Updated Jun 20, 2023
    • Shell
      Apache License 2.0
      2300 Updated May 24, 2023
    • Shell
      0000 Updated Apr 26, 2023
    • Python
      12000 Updated Mar 13, 2023
    • Paddle
      Public
      Fork of PaddlePaddle framework
      C++
      Apache License 2.0
      0000 Updated Mar 2, 2023
    • server
      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k000 Updated Mar 2, 2023
    • The Triton backend for TensorRT.
      C++
      BSD 3-Clause "New" or "Revised" License
      30000 Updated Mar 2, 2023
    • oneDNN
      Public
      Fork of oneDNN
      C++
      Apache License 2.0
      0010 Updated Mar 1, 2023