[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

bjacob · 2025-02-12T03:16:16Z

On ROCm, we want to use the device library for all math functions.

This expands on #19969, which only concerned math.erf.

We only leave one category of rewrites enabled: the operand casts to f32. The ROCm device library internally performs the same for many math functions, but we leave that unchanged here for safety and because we seemed to have accuracy issues with an earlier attempt that was dropping that.

This PR needs careful accuracy testing on end-to-end workloads before merging.

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

MaheshRavishankar · 2025-02-12T03:23:50Z

If CI passes that's a good indication for e2e correctness for now I think

MaheshRavishankar · 2025-02-12T04:37:40Z

Interesting. It has compilation failures

lialan · 2025-02-13T23:46:58Z

compiler/src/iree/compiler/Codegen/Common/test/math_transform.mlir

+  // CHECK:         math.exp2
+  // CHECK:         math.expm1
+  // CHECK:         math.cbrt
+  // CHECK:         math.erf


have we figured out the numerical issue with math.erf library function?

We have (99%) figured that there was no issue with it, and the issues we ran into were caused by PolynomialApproximationPass being too coarse-grained and too convoluted, so that when we thought earlier that we were enabling/disabling math.erf approximation, we were also enabling/disabling a number of other things, unintentionally. This is what #19922 solved. Now in the present PR we are finally at a place where we have some fine-grained, well-defined levers to play with.

bjacob added 2 commits February 11, 2025 20:31

no-approx-erf-on-rocm

f617d1e

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

review-comment

95b4938

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

bjacob requested review from lialan and MaheshRavishankar February 12, 2025 03:17

no-approx-on-rocm

35e4441

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

bjacob force-pushed the no-approx-at-all-on-rocm branch from 4ae6121 to 35e4441 Compare February 12, 2025 03:18

MaheshRavishankar approved these changes Feb 12, 2025

View reviewed changes

lialan reviewed Feb 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

bjacob commented Feb 12, 2025 •

edited

Loading

MaheshRavishankar commented Feb 12, 2025

MaheshRavishankar commented Feb 12, 2025

lialan Feb 13, 2025

bjacob Feb 14, 2025

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

Are you sure you want to change the base?

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

Conversation

bjacob commented Feb 12, 2025 • edited Loading

MaheshRavishankar commented Feb 12, 2025

MaheshRavishankar commented Feb 12, 2025

lialan Feb 13, 2025

Choose a reason for hiding this comment

bjacob Feb 14, 2025

Choose a reason for hiding this comment

bjacob commented Feb 12, 2025 •

edited

Loading