RESTful-LLaMa-3.1-8B-app

title	emoji	colorFrom	colorTo	sdk	pinned	license
Restful Llama3.1	🔥	green	indigo	docker	false	mit

A simple RESTful service for the Meta-Llama-3.1-8B-Instruct language model.

Pre-requisites

A CUDA-enabled GPU Space; the service runs best with 24 GB of VRAM
Access to the LLaMa-3.1 weights on Hugging Face
A new public Hugging Face Space (https://huggingface.co/docs/hub/spaces-overview), created as a blank Docker container
A Personal Access Token with read permission (https://huggingface.co/docs/hub/security-tokens); save it somewhere safe
A secret in your Space. Name: HUGGING_FACE_HUB_TOKEN, Value: your Personal Access Token
(Optional) A local .env file to store the Personal Access Token
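
The token lookup implied by the last two items can be sketched in Python as follows. This is a minimal sketch, not the code in app.py: the helper name load_hf_token and the KEY=VALUE layout of the .env file are assumptions made for illustration. It relies on Space secrets being available to the container as environment variables.

```python
import os
from pathlib import Path
from typing import Optional

TOKEN_NAME = "HUGGING_FACE_HUB_TOKEN"


def load_hf_token(env_file: str = ".env") -> Optional[str]:
    """Hypothetical helper: return the token from the environment or a .env file."""
    # In a Space, the secret is read from the process environment.
    token = os.environ.get(TOKEN_NAME)
    if token:
        return token

    # Local fallback (assumed layout): one KEY=VALUE pair per line in .env.
    path = Path(env_file)
    if not path.is_file():
        return None
    for line in path.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == TOKEN_NAME:
            # Drop optional surrounding quotes; treat an empty value as missing.
            return value.strip().strip("'\"") or None
    return None
```

Keep the .env file out of version control so the token is never pushed to the Space repository.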

Getting Started

Fork, adapt, and push this repo via SSH to your personal Hugging Face Space.

How to use

After a successful startup you will be redirected to /docs, the Swagger UI.
API calls go to the embedded URL of your Space (see https://huggingface.co/docs/hub/spaces-embed), not to its page URL. The URL changes from https://huggingface.co/spaces/nile4000/restful-llama3.1 to something like https://nile4000-restful-llama3-1.hf.space.
To interact with the model, send POST requests to https://huggingface/embedded/chat, where https://huggingface/embedded is a placeholder for that embedded URL.
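
The URL change described above can be sketched as a small Python function. The rule used here (join owner and Space name with a dash, lowercase, and replace every other run of characters with a single dash) is an assumption inferred from the one example in this README, and embedded_base_url is a hypothetical name. Check the result against the embed dialog of your own Space.

```python
import re
from urllib.parse import urlparse


def embedded_base_url(space_url: str) -> str:
    """Hypothetical helper: guess the *.hf.space URL from a Space page URL."""
    parts = urlparse(space_url).path.strip("/").split("/")
    # Expect a path of the form spaces/<owner>/<space-name>.
    if len(parts) != 3 or parts[0] != "spaces":
        raise ValueError(f"not a Space URL: {space_url}")
    owner, name = parts[1], parts[2]
    # Assumption: each run of characters outside [a-z0-9] becomes one dash.
    subdomain = re.sub(r"[^a-z0-9]+", "-", f"{owner}-{name}".lower()).strip("-")
    return f"https://{subdomain}.hf.space"


print(embedded_base_url("https://huggingface.co/spaces/nile4000/restful-llama3.1"))
# prints https://nile4000-restful-llama3-1.hf.space
```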

Here is an example with curl:

curl -X POST https://huggingface/embedded/chat -H 'Content-Type: application/json' -d '{"messages":[{"role":"system","content":"You are a helpful assistant called Llama-3. Write out your answer short and succinct!"}, {"role":"user", "content":"What is the capital of Germany?"}], "temperature": 0.6, "top_p":0.75, "max_new_tokens":256}'

Another simplified example:

curl -X POST https://huggingface/embedded/chat -H 'Content-Type: application/json' -d '{"messages":[{"role":"user", "content":"Write a short essay about Istanbul."}]}'
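
The same requests can be sent from Python with only the standard library. This is a minimal sketch: BASE_URL is a placeholder taken from the example URL above, the function names are hypothetical, and the assumption that the service answers with a JSON body is not confirmed by this README.

```python
import json
import urllib.request

# Placeholder: replace with the embedded URL of your own Space.
BASE_URL = "https://nile4000-restful-llama3-1.hf.space"


def build_chat_request(messages, base_url=BASE_URL, **sampling):
    """Build the POST request for /chat without sending it."""
    # Optional fields such as temperature, top_p, and max_new_tokens
    # are merged into the payload only when the caller passes them.
    payload = {"messages": messages, **sampling}
    return urllib.request.Request(
        url=f"{base_url}/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(messages, **sampling):
    """Send the request; assumes the service replies with JSON."""
    request = build_chat_request(messages, **sampling)
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (requires a running Space):
# chat([{"role": "user", "content": "What is the capital of Germany?"}],
#      temperature=0.6, top_p=0.75, max_new_tokens=256)
```

Separating request construction from sending makes the payload easy to inspect before any network call is made.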
