Support S32/U32 indices for BWD embedding & Neuron implicit downcast #8462

rpsilva-aws · 2024-12-06T00:27:50Z

This PR extends the embedding tensor operations to accept S32 indices. This follows suit with other operations and adds flexibility, with potential performance benefits for accelerator backends. Reference for the dense embedding backward: https://github.com/pytorch/pytorch/blob/main/aten/src/ATen/native/Embedding.cpp#L117
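As a rough illustration of what a dense embedding backward does with its indices, the pure-Python sketch below scatter-adds each gradient row into the weight gradient at that row's index. It is not the ATen or XLA implementation: `embedding_dense_backward_sketch` and its parameters are names assumed for this example, and the `array` type codes `"i"` and `"q"` merely stand in for S32 and S64 index tensors.

```python
from array import array


def embedding_dense_backward_sketch(grad_output, indices, num_weights, padding_idx=-1):
    """Accumulate grad_output rows into a [num_weights, dim] weight gradient.

    grad_output: list of rows (lists of floats), one row per index.
    indices: any integer sequence; the element width (32 or 64 bit) does not
    change the result as long as every index fits in it.
    """
    dim = len(grad_output[0]) if grad_output else 0
    grad_weight = [[0.0] * dim for _ in range(num_weights)]
    for row, idx in zip(grad_output, indices):
        if idx == padding_idx:
            continue  # the padding row receives no gradient
        for j, g in enumerate(row):
            grad_weight[idx][j] += g
    return grad_weight


grad_output = [[1.0, 2.0], [0.5, 0.5], [3.0, 4.0]]
indices_s32 = array("i", [2, 0, 2])  # typically a 32-bit signed integer
indices_s64 = array("q", [2, 0, 2])  # 64-bit signed integer

result_s32 = embedding_dense_backward_sketch(grad_output, indices_s32, num_weights=4)
result_s64 = embedding_dense_backward_sketch(grad_output, indices_s64, num_weights=4)
```

Both calls produce the same gradient, which is the point of accepting narrower index types: the index width only needs to cover the number of embedding rows.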

In addition, we re-introduce the implicit downcast of S64/U64 types for Neuron, since the Neuron compiler does not support 64-bit types.
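The downcast can be pictured as a small mapping from 64-bit to 32-bit integer types that applies only to the Neuron device. The sketch below is illustrative only: the function names, the string type tags, and the `"NEURON"` device tag are assumptions made for this example, not the code in `torch_xla/csrc/dtype.cpp`. It also shows why such a downcast is lossy in principle, since a value outside the 32-bit range cannot be represented after the cast.

```python
# Hypothetical type and device tags, used only for this illustration.
DOWNCAST_64_TO_32 = {"S64": "S32", "U64": "U32"}

RANGES = {
    "S32": (-(2**31), 2**31 - 1),
    "U32": (0, 2**32 - 1),
}


def maybe_downcast_sketch(primitive_type, device_type):
    """Return the 32-bit counterpart of a 64-bit integer type on Neuron."""
    if device_type == "NEURON":
        return DOWNCAST_64_TO_32.get(primitive_type, primitive_type)
    return primitive_type  # other devices keep the type unchanged


def fits(values, primitive_type):
    """Check that every value is representable in the given 32-bit type."""
    lo, hi = RANGES[primitive_type]
    return all(lo <= v <= hi for v in values)


target = maybe_downcast_sketch("S64", "NEURON")
indices_ok = fits([0, 5, 2**31 - 1], target)
indices_overflow = fits([2**31], target)
```

Under these assumptions, index tensors are a natural fit for the downcast, because valid indices are bounded by the tensor dimensions they address.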

There is an ongoing effort to extend this to other tensor operations involving indices: pytorch/pytorch#142160. Once that is resolved, we will adapt it in XLA as well.

rpsilva-aws · 2024-12-06T00:35:51Z

FYI @miladm @ManfeiBai @tengyifei: I split the previous PR, and this one is needed for 2.6. Unfortunately, #8463 depends on PT.

tengyifei

Is it possible to add a test at all?

torch_xla/csrc/dtype.cpp

rpsilva-aws · 2024-12-06T05:55:33Z

@tengyifei Ran yapf over the test file. PTAL, thanks!

rpsilva-aws changed the title ~~Rpsilva downcast v2~~ Support S32/U32 indices for BWD embedding & Neuron implicit downcast Dec 6, 2024

rpsilva-aws marked this pull request as ready for review December 6, 2024 00:28

rpsilva-aws mentioned this pull request Dec 6, 2024

Support S32/U32 indices for BWD embedding and index ops & Neuron implicit downcast #8450

Closed

tengyifei added the tpuci label Dec 6, 2024

tengyifei requested changes Dec 6, 2024

torch_xla/csrc/dtype.cpp (outdated, resolved)

rpsilva-aws force-pushed the rpsilva_downcast_v2 branch 2 times, most recently from c2fb7ef to 95d0f0c Compare December 6, 2024 02:26

tengyifei approved these changes Dec 6, 2024

rpsilva-aws force-pushed the rpsilva_downcast_v2 branch from 95d0f0c to 75f37b0 Compare December 6, 2024 05:54

rpsilva-aws added 3 commits December 6, 2024 18:10

Align S32 embedding indices with PyTorch

b7f16e2

Implicit S64/U32 downcasting for Neuron

1454608

Separate Neuron specific tests

a210920

rpsilva-aws force-pushed the rpsilva_downcast_v2 branch from 75f37b0 to a210920 Compare December 6, 2024 18:11

tengyifei approved these changes Dec 6, 2024

tengyifei merged commit 00c0e96 into pytorch:master Dec 7, 2024
12 checks passed

rpsilva-aws deleted the rpsilva_downcast_v2 branch December 9, 2024 19:03
