
rc v1.2.0 #174

Merged: 4 commits into main from rc-v1.2.0 on Feb 24, 2025
Conversation

@anirudTT (Contributor) commented on Feb 5, 2025

Changelog

  • remove HF token from .env in tt-studio
  • startup.sh creates HOST_PERSISTENT_STORAGE_VOLUME if it doesn't exist
  • startup.sh uses the safety flags set -euo pipefail
  • remove HF_TOKEN from app/docker-compose.yml
  • remove VLLM_LLAMA31_ENV_FILE, now redundant
  • add Llama 3.x integration using the new setup.sh and LLM code base
  • support multiple models using the same container; adds support for the MODEL_ID environment variable in tt-inference-server
  • update volume initialization for the new file-permissions strategy
  • add SetupTypes to handle different first-run and validation behaviour
  • hf_model_id is used to define model_id and model_name if provided (renames hf_model_path to hf_model_id)
  • /home/user/cache_root changed to /home/container_app_user/cache_root
  • fix get_devices_mounts, add mapping
  • use MODEL_ID, if present in the container env_vars, to map to the impl model config (see the first sketch after this list)
  • set defaults for ModelImpl
  • add configs for Llama 3.x models
  • remove HF_TOKEN from the tt-studio .env for ease of setup
  • add environment file processing (see the second sketch after this list)
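To make the model-selection items above concrete, here is a minimal Python sketch of how a MODEL_ID set in the container environment could map to an impl model config, with SetupTypes distinguishing first-run from validation behaviour and ModelImpl defaults derived from hf_model_id. The class names are taken from the changelog, but everything else here (the config table, the resolver, the example model entries) is an illustrative assumption, not the actual tt-inference-server code.

```python
# Illustrative sketch only -- the bodies of SetupTypes, ModelImpl, and the
# MODEL_CONFIGS table are hypothetical stand-ins, not the real implementation.
import os
from dataclasses import dataclass
from enum import Enum


class SetupTypes(Enum):
    FIRST_RUN = "first_run"    # first run: download weights, build caches
    VALIDATION = "validation"  # validate an existing persistent volume


@dataclass
class ModelImpl:
    hf_model_id: str
    model_id: str = ""    # defaulted from hf_model_id below
    model_name: str = ""  # defaulted from hf_model_id below
    cache_root: str = "/home/container_app_user/cache_root"

    def __post_init__(self) -> None:
        # hf_model_id defines model_id and model_name when not given explicitly
        self.model_id = self.model_id or self.hf_model_id
        self.model_name = self.model_name or self.hf_model_id.split("/")[-1]


# One container image can serve several models; MODEL_ID selects the config.
MODEL_CONFIGS = {
    impl.model_id: impl
    for impl in (
        ModelImpl(hf_model_id="meta-llama/Llama-3.1-8B-Instruct"),
        ModelImpl(hf_model_id="meta-llama/Llama-3.2-1B-Instruct"),
    )
}


def resolve_model_config(env_vars=os.environ) -> ModelImpl:
    """Use MODEL_ID from the container env_vars to pick the impl config."""
    model_id = env_vars.get("MODEL_ID")
    if model_id not in MODEL_CONFIGS:
        raise RuntimeError(f"No impl model config for MODEL_ID={model_id!r}")
    return MODEL_CONFIGS[model_id]
```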
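And for the environment-file-processing item, a minimal sketch assuming simple KEY=VALUE semantics with # comments; process_env_file is a hypothetical helper name, not the shipped parser.

```python
# Hypothetical sketch of environment-file processing: parse KEY=VALUE lines,
# skipping blanks and comments. Not the shipped parser.
from pathlib import Path


def process_env_file(path: str) -> dict[str, str]:
    env_vars: dict[str, str] = {}
    for raw_line in Path(path).read_text().splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition("=")
        if sep:  # ignore malformed lines with no '='
            env_vars[key.strip()] = value.strip().strip("'\"")
    return env_vars
```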

tstescoTT and others added 2 commits February 5, 2025 12:24
@anirudTT changed the title from Rc v1.2.0 to rc v1.2.0 on Feb 5, 2025
@anirudTT (Contributor, Author) commented:

Merge pending: cherry-picking changes from PR #189

* update readme to reflect new flow

* fix readme issues

* add Supported Models tab pointing to the tt-inference-server readme

* docs: update main readme
  - add a better quick-start guide
  - add better notes for running in development mode

* docs: re-add Mock model steps

* docs: fix links

* docs: fix vLLM docs

* Update HowToRun_vLLM_Models.md

* Update HowToRun_vLLM_Models.md

Co-authored-by: Benjamin Goel <bgoel@tenstorrent.com>
@anirudTT merged commit 4ff3029 into main on Feb 24, 2025 (2 checks passed)
@anirudTT deleted the rc-v1.2.0 branch on February 24, 2025 at 21:49

Successfully merging this pull request may close these issues.

* Update readme for vllm models
* Merge in PR to support llama models
* update HowToRun_vLLM_Models.md