Adopt changes from JNI for casting from float to decimal #10917

ttnghia · 2024-05-28T05:35:56Z

This reimplements castFloatsToDecimal to just calling the corresponding function from spark-rapids-jni, which is designed specifically for casting floating point values to decimal for Spark.

Depends on:

Implement kernel for casting float to decimal spark-rapids-jni#2078

Closes #9682, and closes #10809.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

thirtiseven · 2024-05-28T07:54:11Z

The results look really close.

The @approximate_float in integration test can even be removed under my tests.

CastOpSuite can pass with eps of 1e-13.

Related Spark UT still fails, also close:

- casting to fixed-precision decimals *** FAILED ***
  Incorrect evaluation: cast(10.03 as decimal(38,18)), actual: 10.030000000000002048, expected: 10.03 (RapidsTestsTrait.scala:350)

Performance test to cast 5000000 floats to 10 kinds of decimal types (in ms):

Type	CPU	24.08	#10909	This PR
Double	146,524	468.33	660.33	370.66
Float	82,691	412.33	480.67	275.33

ttnghia · 2024-05-28T17:16:12Z

Should that case be acceptable as the relative error is just 1.7e-16?
For 10.3, it is stored like 9.30000000000000071054 and it is very difficult to remove the trailing bonus digits if we are going to take 18 decimal digits.

ttnghia · 2024-05-29T01:03:53Z

I've just updated my JNI code in NVIDIA/spark-rapids-jni#2078, adding some utilities functions by @pmattione-nvidia and it seems to fix all of our failed tests. @thirtiseven please run tests again on your machine.

thirtiseven · 2024-05-29T03:11:21Z

It passed Spark UT, personally I think it is good enough for this issue.

If set eps = 0 in com.nvidia.spark.rapids.CastOpSuite#cast float/double to decimal, we can still see some mismatch:
Some cases in float test:

cpu: -6.963900469355742E15	gpu: -6.963900469355743E15
cpu: 4.005061702373036E19	gpu: 4.005061702373037E19
cpu: 7.1726783043136778E18	gpu: 7.1726783043136788E18
cpu: 4.9245108190671776E17	gpu: 4.9245108190671782E17
cpu: -6.6190078985307568E16	gpu: -6.6190078985307576E16
cpu: -8.286135559736182E15	gpu: -8.286135559736183E15
cpu: 9.999999933815812E18	gpu: 9.999999933815814E18
cpu: 9.2205076355788718E18	gpu: 9.2205076355788728E18
cpu: 1.00000004091847872E17	gpu: 1.00000004091847888E17

Only diffs in last few digits.

Some cases in double test:

cpu: -1.0E16	                gpu: -1.0000000000000002E16
cpu: -3.953408452257507E34	gpu: -3.9534084522575073E34
cpu: -1.0E16	                gpu: -1.0000000000000002E16
cpu: 7.1726781626632929E18	gpu: 7.172678162663294E18
cpu: 4.9245106955378758E17	gpu: 4.9245106955378765E17
cpu: 1.0E29                     gpu: 1.0000000000000001E29
cpu: -6.619008101723092E16	gpu: -6.6190081017230928E16
cpu: 9.999999999999999E29	gpu: 1.0E30
cpu: -6.200692612954865E31	gpu: -6.200692612954866E31
cpu: -8.286135303307476E15	gpu: -8.286135303307477E15
cpu: -5.75278800663036E23	gpu: -5.7527880066303604E23
cpu: -2.786658992657616E33	gpu: -2.7866589926576164E33
cpu: -4.857510460413498E34	gpu: -4.8575104604134985E34

There some cases like -1.0E16 vs -1.0000000000000002 and 9.999999999999999E29 vs 1.0E30, not sure if they can be fix easily.

New performance:

Type	CPU	24.08	#10909	This PR old	This PR new
Double	146,524	468.33	660.33	370.66	439
Float	82,691	412.33	480.67	275.33	378.33

thirtiseven · 2024-05-29T03:51:56Z

And again some thoughts about the float => string => decimal solution. In Spark/Java double is converted to string with as many, but only as many, more digits as are needed to uniquely distinguish the argument value from adjacent values of type double. doc. But the algorithm is complex and sometimes does not produce results only as many at low version jdk. So technically the best we can match now might be the only as many digits, which is implemented in jni ftos_converter.cuh. So we don't need to really convert it to string, and can get the digits in the string in int64. But it seems not to be easy to use in your approach.

ttnghia · 2024-05-29T22:06:43Z

My solution in NVIDIA/spark-rapids-jni#2078 is adopted from the ongoing work of @pmattione-nvidia. Paul is working on even a better solution, so hopefully it will be able to fix the failures above. Let's wait for it.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

# Conflicts: # tests/src/test/spark330/scala/org/apache/spark/sql/rapids/utils/RapidsTestSettings.scala

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia · 2024-07-19T23:33:22Z

Alright, since we don't have any better solution in the meantime, I've made NVIDIA/spark-rapids-jni#2078 ready for review. With that:

The reported issues [BUG] Casting FLOAT64 to DECIMAL(12,7) produces different rows from Apache Spark CPU #9682 and [BUG] cast(9.95 as decimal(3,1)), actual: 9.9, expected: 10.0 #10809 are fixed.
floatEpsilon in our CastOpSuite#testCastToDecimal has been reduced from 1e-9 to 1e-14.
When set relative error to 0, the number of failures also reduces from 25/83 down to 3/75 for float/double to decimal tests.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

This reverts commit c185206. # Conflicts: # tests/src/test/scala/com/nvidia/spark/rapids/CastOpSuite.scala

This reverts commit 93422e8.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia · 2024-07-29T17:18:59Z

build

jihoonson

LGTM.

razajafri

LGTM

ttnghia added 3 commits May 27, 2024 08:37

Debugging

6076eb1

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Add test

d406ca2

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Adopt changes from JNI

d8f7eb7

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia added bug Something isn't working SQL part of the SQL/Dataframe plugin task Work required that improves the product but is not user facing labels May 28, 2024

ttnghia self-assigned this May 28, 2024

ttnghia mentioned this pull request May 28, 2024

[BUG] cast(9.95 as decimal(3,1)), actual: 9.9, expected: 10.0 #10809

Closed

thirtiseven mentioned this pull request May 28, 2024

[WIP] Almost match Cast Floats to Decimal #10909

Closed

ttnghia changed the base branch from branch-24.06 to branch-24.08 May 29, 2024 22:07

ttnghia added 9 commits June 7, 2024 19:50

WIP

00f5ffc

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Merge branch 'branch-24.08' into float_to_decimal

9529e47

# Conflicts: # tests/src/test/spark330/scala/org/apache/spark/sql/rapids/utils/RapidsTestSettings.scala

Merge branch 'branch-24.08' into float_to_decimal

4c02c50

Enable SparkUT test

93422e8

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Reduce test threshold

1071c0c

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Add unit tests

fb3047c

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Cleanup

2162f5b

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Update python tests

d747cb7

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Update unit tests

c0083fc

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia added 4 commits July 23, 2024 13:58

cast float to decimal: print number of failures

c185206

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Change relative error

071f433

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Revert "cast float to decimal: print number of failures"

cf6945c

This reverts commit c185206. # Conflicts: # tests/src/test/scala/com/nvidia/spark/rapids/CastOpSuite.scala

Merge branch 'branch-24.08' into float_to_decimal

0496699

ttnghia marked this pull request as ready for review July 24, 2024 21:35

ttnghia marked this pull request as draft July 24, 2024 22:16

Revert "Enable SparkUT test"

b2c8f21

This reverts commit 93422e8.

ttnghia mentioned this pull request Jul 24, 2024

[BUG] cast(10.05 as decimal(38,18)), actual: 10.050000000000001000, expected: 10.05 #11250

Open

Change issue number

239df3c

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia mentioned this pull request Jul 25, 2024

Implement kernel for casting float to decimal NVIDIA/spark-rapids-jni#2078

Merged

Merge branch 'branch-24.08' into float_to_decimal

a68cbb4

ttnghia marked this pull request as ready for review July 26, 2024 23:27

jihoonson approved these changes Jul 30, 2024

View reviewed changes

razajafri approved these changes Jul 30, 2024

View reviewed changes

ttnghia merged commit c9f1ab9 into NVIDIA:branch-24.08 Jul 30, 2024
43 checks passed

ttnghia deleted the float_to_decimal branch July 30, 2024 18:13

NvTimLiu mentioned this pull request Aug 6, 2024

Update changelog for v24.08.0 release [skip ci] #11304

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adopt changes from JNI for casting from float to decimal #10917

Adopt changes from JNI for casting from float to decimal #10917

ttnghia commented May 28, 2024 •

edited

Loading

thirtiseven commented May 28, 2024 •

edited

Loading

ttnghia commented May 28, 2024 •

edited

Loading

ttnghia commented May 29, 2024

thirtiseven commented May 29, 2024 •

edited

Loading

thirtiseven commented May 29, 2024

ttnghia commented May 29, 2024

ttnghia commented Jul 19, 2024 •

edited

Loading

ttnghia commented Jul 29, 2024

jihoonson left a comment

razajafri left a comment

Adopt changes from JNI for casting from float to decimal #10917

Adopt changes from JNI for casting from float to decimal #10917

Conversation

ttnghia commented May 28, 2024 • edited Loading

thirtiseven commented May 28, 2024 • edited Loading

ttnghia commented May 28, 2024 • edited Loading

ttnghia commented May 29, 2024

thirtiseven commented May 29, 2024 • edited Loading

thirtiseven commented May 29, 2024

ttnghia commented May 29, 2024

ttnghia commented Jul 19, 2024 • edited Loading

ttnghia commented Jul 29, 2024

jihoonson left a comment

Choose a reason for hiding this comment

razajafri left a comment

Choose a reason for hiding this comment

ttnghia commented May 28, 2024 •

edited

Loading

thirtiseven commented May 28, 2024 •

edited

Loading

ttnghia commented May 28, 2024 •

edited

Loading

thirtiseven commented May 29, 2024 •

edited

Loading

ttnghia commented Jul 19, 2024 •

edited

Loading