nndeploy

AI Infra (Model inference and deployment)

Introduction

nndeploy is an end-to-end model inference and deployment framework. It aims to give users a powerful, easy-to-use, high-performance inference and deployment experience that is compatible with mainstream frameworks.

Contact Us

  • nndeploy is still under development. If you are passionate about open source and enjoy tinkering, you are welcome to join us, whether you want to learn or have ideas for improving the project.

  • WeChat: titian5566 (please introduce yourself briefly when adding this contact to join the AI inference and deployment discussion group)

Repositories

  • nndeploy Public

    nndeploy is an end-to-end model inference and deployment framework. It aims to give users a powerful, easy-to-use, high-performance inference and deployment experience that is compatible with mainstream frameworks.

    C++ 686 Apache-2.0 101 7 0 Updated Feb 7, 2025
  • .github Public
    0 0 0 0 Updated Feb 5, 2025
  • safetensors-cpp Public Forked from syoyo/safetensors-cpp

    Header-only safetensors loader and saver in C++

    C++ 0 MIT 11 0 0 Updated Nov 19, 2024
  • onnx-llm Public Forked from wangzhaode/onnx-llm

    LLM deployment project based on ONNX.

    C++ 0 Apache-2.0 7 0 0 Updated Oct 9, 2024
  • tokenizers-cpp Public Forked from mlc-ai/tokenizers-cpp

    Universal cross-platform tokenizers binding to HF and sentencepiece

    C++ 1 Apache-2.0 70 0 0 Updated Jun 3, 2024
  • Awesome-LLM-Inference Public Forked from DefTruth/Awesome-LLM-Inference

    💻 A small collection of awesome LLM inference resources [Papers|Blogs|Docs] with code, covering TensorRT-LLM, streaming-llm, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

    2 GPL-3.0 233 0 0 Updated Dec 3, 2023
  • onnx-simplifier Public Forked from daquexian/onnx-simplifier

    Simplify your ONNX model

    Python 1 Apache-2.0 394 0 0 Updated Apr 27, 2022
