Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

For anyone wants to understand the codebase #2

Open
Lurkrazy opened this issue Dec 5, 2024 · 4 comments
Open

For anyone wants to understand the codebase #2

Lurkrazy opened this issue Dec 5, 2024 · 4 comments
Labels
documentation Improvements or additions to documentation

Comments

@Lurkrazy
Copy link
Member

Lurkrazy commented Dec 5, 2024

I would recommend to check out the HPCRL/tvm.

This implementation is well-organized and entirely Python-based, making it easier to follow and understand the search strategy logic. If you're looking for additional clarity or insights into how the code works, this implementation could be helpful.

@Lurkrazy Lurkrazy pinned this issue Dec 5, 2024
@Lurkrazy Lurkrazy added the documentation Improvements or additions to documentation label Dec 5, 2024
@lqq-lqq
Copy link

lqq-lqq commented Dec 16, 2024

Why it is invalid when "up_tb < reg1 * reg2" ? The code location is following:


Looking forward to your answer.

@lqq-lqq
Copy link

lqq-lqq commented Dec 17, 2024

Sorry, I was missing a line before, I got it!

@Lurkrazy
Copy link
Member Author

Why it is invalid when "up_tb < reg1 * reg2" ? The code location is following:

Looking forward to your answer.

We mutated in the shared memory view, which means, the TB_tile = thread block size * reg1 * reg2. So up_tb < reg1 * reg2 is not valid here.

@Lurkrazy
Copy link
Member Author

Sorry, I was missing a line before, I got it!

Thank you for your interest in our work. I'd be happy to assist with any questions you have regarding the code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

2 participants