text-generation-inference

Here are 11 public repositories matching this topic...

huggingface / optimum-benchmark

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

benchmark pytorch openvino onnxruntime text-generation-inference neural-compressor tensorrt-llm

Updated May 14, 2025
Python

InftyAI / llmaz

Star

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

kubernetes inference huggingface llm modelscope llamacpp vllm text-generation-inference ollama sglang inference-platform

Updated May 19, 2025
Go

aws-samples / amazon-sagemaker-llama2-response-streaming-recipes

Star

Amazon SageMaker Llama 2 Inference via Response Streaming

sagemaker sagemaker-endpoint response-streaming large-language-models text-generation-inference llama2 large-model-inference

Updated Jun 28, 2024
Jupyter Notebook

magichub-opensource / CLAM-Conversational-Language-AI-from-MagicData

Star

This repo introduces MagicData-CLAM, a Chinese SFT dataset, and provides to the community two relevant models that we finetuned. Contact business@magicdatatech.com for more information.

chinese-llm text-generation-inference llama2

Updated Aug 3, 2023
Python

yjg30737 / pyqt-text-generation-inference-gui

Star

GUI version of text-generation-inference

pyqt5 text-generation pyqt huggingface text-generation-webui text-generation-inference

Updated Sep 1, 2023
Python

Akshint0407 / Nano-R1

Star

This project demonstrates the process of fine-tuning the Qwen2.5-3B-Instruct model using GRPO (Generalized Reward Policy Optimization) on the GSM8K dataset.

python transformer adapters huggingface trl safetensors text-generation-inference unsloth qwen2-5 grpo

Updated Apr 7, 2025
Jupyter Notebook

HyperBlaze456 / risu-backend-python

Star

RisuAI backend with python only. TextGen works, need more memory related updates

python backend text-generation llm text-generation-inference

Updated Jun 12, 2024
Python

dcalaprice / modal-sqlcoder

Star

Deploy the Defog sqlcoder2 llm on Modal (https://modal.com) using Hugging Face Text Generation Inference (TGI)

sql code-generation text-to-sql huggingface llm text-generation-inference sqlcoder modal-labs defog-ai

Updated Dec 12, 2023
Python

aisingapore / sealion-tgi

Star

Serve the AI Singapore SEA-LION model ⚛ with TGI

text-generation-inference

Updated Apr 15, 2025
Shell

Mikesterner87 / Nano-R1

Star

This project demonstrates the process of fine-tuning the Qwen2.5-3B-Instruct model using GRPO (Generalized Reward Policy Optimization) on the GSM8K dataset.

python build openwrt transformer adapters nanopi huggingface trl nanopi-r1s nanopi-r1 safetensors text-generation-inference unsloth grpo

Updated May 19, 2025
Jupyter Notebook

yjg30737 / windows-text-generation-inference-example

Star

Text Generation Interference example in Windows (docker, WSL is needed)

text-generation text-generation-inference

Updated Jun 26, 2023
Python

Improve this page

Add a description, image, and links to the text-generation-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-generation-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

text-generation-inference

Here are 11 public repositories matching this topic...

huggingface / optimum-benchmark

InftyAI / llmaz

aws-samples / amazon-sagemaker-llama2-response-streaming-recipes

magichub-opensource / CLAM-Conversational-Language-AI-from-MagicData

yjg30737 / pyqt-text-generation-inference-gui

Akshint0407 / Nano-R1

HyperBlaze456 / risu-backend-python

dcalaprice / modal-sqlcoder

aisingapore / sealion-tgi

Mikesterner87 / Nano-R1

yjg30737 / windows-text-generation-inference-example

Improve this page

Add this topic to your repo