Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Should RoPE be included in embeddings? #942

Open
lockwo opened this issue Feb 4, 2025 · 2 comments
Open

Should RoPE be included in embeddings? #942

lockwo opened this issue Feb 4, 2025 · 2 comments

Comments

@lockwo
Copy link
Contributor

lockwo commented Feb 4, 2025

In the docs RoPE is only shown in the attention tab, however, since it is an embedding method (granted, probably almost entirely used with attention based methods) and is in _embedding.py I was wondering if it should at least be cross listed in the docs.

@patrick-kidger
Copy link
Owner

I'd be happy to have a cross-link if think it's worth it! :)

(Whilst we're here, I'm also conscious that our current rope implementation is kind of annoying to use, with the need for a helper function between it and the attention layer. Not sure if we could have done something better there.)

@lockwo
Copy link
Contributor Author

lockwo commented Feb 6, 2025

I'd be happy to have a cross-link if think it's worth it! :)

sure I will open one

(Whilst we're here, I'm also conscious that our current rope implementation is kind of annoying to use, with the need for a helper function between it and the attention layer. Not sure if we could have done something better there.)

I would certainly have strong opinions here, but I haven't ever really used RoPE yet (so I will save those opinions for another issue in the future).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants