Issues: pytorch-labs/gpt-fast

Issues list

int4 quant broken right now?
#217 opened Dec 20, 2024 by jerryzh168
Error with meta-llama/Llama-3.2-1B
#211 opened Oct 18, 2024 by deafTim
Activation quantization support
#194 opened Aug 12, 2024 by ayyoobimani
tokenizer.model
#186 opened Jun 27, 2024 by hasakikiki
GGUF support?
#182 opened Jun 14, 2024 by yukiarimo
Missing Keys in state_dict
#172 opened May 6, 2024 by bjohn22
Tensor Parallel Inside notebook
#167 opened Apr 29, 2024 by nivibilla
mmap issue in bf16 of gpt-fast
#165 opened Apr 28, 2024 by yanbing-j
Naming: n_local_heads -> n_kv_heads
#162 opened Apr 23, 2024 by ad8e
batching/dynamic batching
#112 opened Feb 27, 2024 by nivibilla