-
Notifications
You must be signed in to change notification settings - Fork 102
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add HF model support inc. DS-R1-Distill, Qwen needs yarn support (#17421
) ### Problem description Existing codebase loads the meta checkpoint format but many derivative models are only available on huggingface. ### What's changed Add support for loading HuggingFace model formats, paving the way for full Qwen support (pending yarn rope implementation) and adding DeepSeek-R1-Distill-Llama-70B support. ### Checklist All passing locally. - [x] [all-post-commit](https://github.com/tenstorrent/tt-metal/actions/runs/13181023765) - [FIXED] Failing on loading the tokenizer on this pipeline only (investigating) - [x] [Single](https://github.com/tenstorrent/tt-metal/actions/runs/13142509908/job/36672984561) - [x] [Single-demos](https://github.com/tenstorrent/tt-metal/actions/runs/13180995444) - Only failing on N300 performance - Investigating - [ ] [T3K](https://github.com/tenstorrent/tt-metal/actions/runs/13142519276) - [x] [Unit](https://github.com/tenstorrent/tt-metal/actions/runs/13163296158/job/36737812258) - [x] [Model-perf](https://github.com/tenstorrent/tt-metal/actions/runs/13164376159) - [x] [Frequent-1](https://github.com/tenstorrent/tt-metal/actions/runs/13174954913) - [x] [Frequent-2](https://github.com/tenstorrent/tt-metal/actions/runs/13164380377/job/36742877847) - [x] [Demo](https://github.com/tenstorrent/tt-metal/actions/runs/13180986094) - [x] [TG](https://github.com/tenstorrent/tt-metal/actions/runs/13154035596/job/36707218743) - Pipelines have issues not related to these changes. --------- Signed-off-by: Salar Hosseini <skhorasgani@tenstorrent.com> Co-authored-by: mtairum <mtairum@tenstorrent.com> Co-authored-by: Salar Hosseini <skhorasgani@tenstorrent.com>
- Loading branch information
1 parent
558da69
commit d0b59bd
Showing
50 changed files
with
1,983 additions
and
780 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.