Inferless
Popular repositories Loading
-
triton-co-pilot
triton-co-pilot PublicGenerate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments
-
whisper-large-v3
whisper-large-v3 Public templateState‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>
-
-
Facebook-bart-cnn
Facebook-bart-cnn PublicBART model pre-trained on English language, and fine-tuned on CNN Daily Mail. It was introduced in the paper BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Trans…
Repositories
- stable-diffusion-xl-turbo Public template
A distilled and cost-effective variant of SDXL that delivers high-quality text-to-image generation with accelerated inference speed. <metadata> gpu: T4 | collections: ["Diffusers"] </metadata>
inferless/stable-diffusion-xl-turbo’s past year of commit activity - qwq-32b-preview Public template
A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/qwq-32b-preview’s past year of commit activity - whisper-large-v3-turbo Public template
A turbocharged variant of Whisper large‑v3 for English speech recognition, optimized for lower latency. <metadata> gpu: T4 | collections: ["HF Transformers","Complex Outputs"] </metadata>
inferless/whisper-large-v3-turbo’s past year of commit activity - mistral-small-24b-instruct Public template
24B instruction-tuned model, delivering context-aware, reliable responses optimized for performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/mistral-small-24b-instruct’s past year of commit activity - llama-3.2-3b-instruct Public template
3B compact instruction-tuned model generate detailed responses across a range of tasks. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/llama-3.2-3b-instruct’s past year of commit activity - qwen2.5-vl-7b-instruct Public template
Vision-Language model that integrates advanced image, video, and text understanding. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/qwen2.5-vl-7b-instruct’s past year of commit activity - mistral-7b-instruct-v0.3 Public template
7B model fine-tuned for precise instruction following and robust contextual understanding. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/mistral-7b-instruct-v0.3’s past year of commit activity - llama-2-7b-gptq Public template
A 7B conversational model fine-tuned with RLHF, deployable efficiently via vLLM for low-latency serving. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
inferless/llama-2-7b-gptq’s past year of commit activity - llama-2-7b-hf Public template
A 7B parameter model fine-tuned for dialogue, utilizing supervised learning and RLHF, supports a context length of up to 4,000 tokens. <metadata> gpu: A10 | collections: ["HF Transformers"] </metadata>
inferless/llama-2-7b-hf’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…