Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benchmarks for ANE Model Compilation Speed? #309

Open
BrandonWeng opened this issue Feb 25, 2025 · 1 comment
Open

Benchmarks for ANE Model Compilation Speed? #309

BrandonWeng opened this issue Feb 25, 2025 · 1 comment

Comments

@BrandonWeng
Copy link

BrandonWeng commented Feb 25, 2025

Does the team have any benchmark on how long models take to compile? I swear it used to be much faster on non-m1 devices but we're seeing compiling on cpuAndNeualEngine take a really really long time recently.

openai_whisper-large-v3-v20240930_turbo took 440s on M4 Pro, 48GB and 560s on M2 Pro, 32GB both on 15.3.1

Subsequent loads are quick, only 3-5 seconds.

Just wondering if its always been like this or this was a regression in the 15.3.1 (non-beta)

Adding some more data as they come up, hope this helps

18.993656992912292s on M2 14.6.1

@ZachNagengast
Copy link
Contributor

That is quite a while, we've also been hearing reports of issues with 15.3.1 here and there, but needs more investigation to figure out whats really going on, but data like this is very helpful 🙏

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants