-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor llama3 demo to the new generator API #16753
base: main
Are you sure you want to change the base?
Conversation
4f4c250
to
decc3fa
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed all the files except simple_text_demo.py for now since that one is still being cleaned up
943dff3
to
5dfa882
Compare
3847b2b
to
4161293
Compare
e89d7e5
to
82e3d03
Compare
Did you run nightly ttnn single card, t3k demos, tg demos? |
@tt-rkim In the process of fixing the CI issues. Will only merge after they pass 👍 |
ec155d5
to
2dc2e9b
Compare
679a4e2
to
27beb0c
Compare
This PR is now feature complete, with a bunch of new benchmark functionality and QoL updates to the demo to better measure what the customer team throws at us. Waiting for CI to pass then it's good to go. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Clean
8424b8f
to
ac0318e
Compare
Relaunching all tests after addressing @cglagovichTT comments. The branch is feature complete. No more changes unless to fix the CI pipelines. Checklist |
…ng prompts and tested all of them
… and reduce time to first token
… by reducing the size of rot_mats being passed to prefill
…n until it reaches the max. Also added support for custom input parameters
…n for 1 user or 32 users and we save that data to superset
8dc64b3
to
a9940a0
Compare
All CI are passing. Godspeed @yieldthought 🫡 |
Ticket
#16752
What's changed
Checklist