-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: Lightning-AI/litgpt
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
What is the bound for Further information is requested
max_new_tokens
?
question
#1932
opened Feb 3, 2025 by
kspviswa
error: cannot unpack non-iterable ActionTypeHint object
bug
Something isn't working
#1931
opened Jan 31, 2025 by
pvaldivia4
Bug: Incorrect gradient accumulation steps calculation in multi-node training due to missing world size information
bug
Something isn't working
#1927
opened Jan 30, 2025 by
pratyushmaini
Merging weights after Finetuning with Adapter.
question
Further information is requested
#1921
opened Jan 26, 2025 by
chnyhz
error: ArgumentParser._parse_known_args() missing 1 required positional argument: 'intermixed'
bug
Something isn't working
#1915
opened Jan 22, 2025 by
JexPY
bump hf transformer version compatibility
bug
Something isn't working
#1913
opened Jan 20, 2025 by
t-vi
LitGPT fine-tuning Dont Use GPU
question
Further information is requested
#1911
opened Jan 20, 2025 by
strikene
Trouble to load a litpgt trained model using transformers library
question
Further information is requested
#1910
opened Jan 20, 2025 by
nxtr-admin-it
Add support for providing New feature or request
wandb
run name.
enhancement
#1909
opened Jan 17, 2025 by
JackUrb
Add support for OpenAISpec in Further information is requested
litgpt deploy
question
#1908
opened Jan 17, 2025 by
bhimrazy
setting pretraining learning rate in command line interface
question
Further information is requested
#1905
opened Jan 10, 2025 by
2533245542
OOM for training llama
question
Further information is requested
#1900
opened Jan 7, 2025 by
dkapur17
Refactoring of New feature or request
GPT.forward
when it comes to input_pos
and KV cache usage
enhancement
#1898
opened Jan 6, 2025 by
mseeger
How to make use of NVIDIA GH200 Grace Hopper Superchip
question
Further information is requested
#1892
opened Dec 27, 2024 by
TheLukaDragar
Slow download from HuggingFace Hub (capped at 10.5 MB/s)
bug
Something isn't working
#1886
opened Dec 23, 2024 by
Andrei-Aksionov
Overwrite system prompt on load/generate
enhancement
New feature or request
#1879
opened Dec 21, 2024 by
twsl
Exporting LoRA to HF format without merging
question
Further information is requested
#1878
opened Dec 20, 2024 by
M1TR
Fine-Tuning Chat Model with Domain-Specific Data for custom dataset
question
Further information is requested
#1877
opened Dec 18, 2024 by
anantgupta129
failure converting pretrained litgpt checkpoints to HF format: a reproducible example
bug
Something isn't working
#1871
opened Dec 12, 2024 by
2533245542
Are there any plans to support multimodal and reinforcement learning?
question
Further information is requested
#1869
opened Dec 11, 2024 by
dz1iang
Loading checkpoint before fabric.setup(model) gets abnormal loss when using fabric.init_module()
question
Further information is requested
#1868
opened Dec 10, 2024 by
kobenaxie
Previous Next
ProTip!
Adding no:label will show everything without a label.