[Iceberg] Enable affinity scheduling on file sections #24598
base: master
Conversation
LGTM! (docs)
Pull branch, local doc build, looks good. Thanks!
Mostly looks good. One minor correction on "splits not being scheduled to enough nodes": it's not necessarily that they were not scheduled to enough nodes; in general the distribution had more skew than Hive, even when the splits were scheduled to the same number of nodes. Scheduling to fewer nodes happened non-deterministically when I ran the queries multiple times. More than half the time the splits were scheduled to all nodes, but even in those cases the load was not as balanced as Hive's.
...rg/src/main/java/com/facebook/presto/iceberg/equalitydeletes/EqualityDeletesSplitSource.java
Thanks for the feedback @yingsu00 - I updated the PR description to be a bit more accurate
presto-iceberg/src/main/java/com/facebook/presto/iceberg/IcebergSplitSource.java
{
    Session maxIdentifiers = Session.builder(getSession())
            .setCatalogSessionProperty("iceberg", AFFINITY_SCHEDULING_FILE_SECTION_SIZE, "1B")
            .setCatalogSessionProperty("iceberg", TARGET_SPLIT_SIZE, "1")
Should we make TARGET_SPLIT_SIZE a DataSize instead of referring to bytes? We do this for MAX_SPLIT_SIZE. Can punt this to a new PR.
@aaneja Good catch!
@ZacBlanco Could you please address Anant's comment in this PR? I'll merge once it's done.
This was done intentionally because the Iceberg library expects this configuration in bytes, directly on the "read.target.split-size" table property. Iceberg has no notion of DataSize or unit-based shorthand for byte-typed values. To keep setting the table property and the session property consistent for the target split size, I opted not to make this a DataSize-typed property.
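The consistency argument can be seen in a small sketch (hypothetical class and parser, not Presto's actual DataSize implementation): a DataSize-typed session property would accept unit shorthand like "64MB", while Iceberg's byte-typed "read.target.split-size" table property only accepts a raw number, so keeping the session property byte-typed means both are set the same way.

```java
public class SplitSizeSketch {
    // Hypothetical unit parser, similar in spirit to a DataSize-typed
    // session property ("1B", "128kB", "64MB" -> bytes).
    static long parseDataSize(String value) {
        String v = value.trim();
        long multiplier = 1;
        if (v.endsWith("MB")) { multiplier = 1024L * 1024; v = v.substring(0, v.length() - 2); }
        else if (v.endsWith("kB")) { multiplier = 1024L; v = v.substring(0, v.length() - 2); }
        else if (v.endsWith("B")) { v = v.substring(0, v.length() - 1); }
        return Long.parseLong(v.trim()) * multiplier;
    }

    public static void main(String[] args) {
        // DataSize-style shorthand vs. the raw byte count that a byte-typed
        // table property such as Iceberg's "read.target.split-size" expects.
        System.out.println(parseDataSize("64MB"));      // 67108864
        System.out.println(Long.parseLong("67108864")); // table-property form
    }
}
```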
Description
This change moves the affinity scheduling file section size configuration from HiveClientConfig and HiveSessionProperties to HiveCommonClientConfig and HiveCommonSessionProperties so that the iceberg connector can benefit from this scheduling strategy when tables have a small number of files but a large number of splits.
Motivation and Context
On tables with a small number of large files, queries may perform poorly because the distribution of split scheduling is skewed. This is more likely to occur when there is a limited number of values being hashed to determine the preferred nodes to schedule to. By changing the identifier used for selecting the preferred nodes, we increase the probability that splits are scheduled more evenly across the cluster.
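The effect can be sketched with a toy simulation (hypothetical identifiers, file sizes, and a simple hash-mod node assignment; this is not Presto's actual scheduler logic): splitting each file into fixed-size sections multiplies the number of distinct identifiers being hashed, so splits tend to land on more nodes.

```java
import java.util.*;

public class AffinitySketch {
    // Toy model: preferred node = hash(identifier) mod nodeCount.
    // Returns how many splits each node received.
    static Map<Integer, Integer> schedule(List<String> identifiers, int nodeCount) {
        Map<Integer, Integer> loadPerNode = new HashMap<>();
        for (String id : identifiers) {
            int node = Math.floorMod(id.hashCode(), nodeCount);
            loadPerNode.merge(node, 1, Integer::sum);
        }
        return loadPerNode;
    }

    public static void main(String[] args) {
        int nodes = 10;
        long fileSize = 1L << 30;      // two hypothetical 1 GiB files
        long sectionSize = 128L << 20; // 128 MiB file sections

        List<String> files = List.of("s3://bucket/a.parquet", "s3://bucket/b.parquet");

        // Whole-file identifiers: at most 2 distinct hash targets.
        System.out.println("whole-file nodes used: " + schedule(files, nodes).size());

        // Section identifiers (path + section index): up to 16 distinct hash targets.
        List<String> sections = new ArrayList<>();
        for (String path : files) {
            for (long offset = 0; offset < fileSize; offset += sectionSize) {
                sections.add(path + "#" + (offset / sectionSize));
            }
        }
        System.out.println("sectioned nodes used: " + schedule(sections, nodes).size());
    }
}
```

With only two whole-file identifiers, at most two of the ten nodes can ever be preferred; sectioning the same files yields sixteen identifiers and a much better chance of covering the cluster.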
Impact
Test Plan
Added a unit test to verify that the number of unique identifiers changes as we scale up the file section size
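The property that test checks can be sketched as follows (hypothetical helper name; the real test lives in the Iceberg connector test suite): the number of distinct scheduling identifiers for a file is the number of sections it is divided into, ceil(fileSize / sectionSize), so scaling the section size up shrinks the identifier count.

```java
public class IdentifierCountSketch {
    // Hypothetical helper: distinct scheduling identifiers for one file
    // = number of file sections, with a minimum of one.
    static long identifierCount(long fileSizeBytes, long sectionSizeBytes) {
        return Math.max(1, (fileSizeBytes + sectionSizeBytes - 1) / sectionSizeBytes);
    }

    public static void main(String[] args) {
        long fileSize = 1024;
        System.out.println(identifierCount(fileSize, 256));  // 4 sections
        System.out.println(identifierCount(fileSize, 512));  // 2 sections
        System.out.println(identifierCount(fileSize, 2048)); // 1 section
    }
}
```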
Contributor checklist
Release Notes