When a stage is executed (in `ray_execute_bundle`), its output can be immediately reshuffled so that its downstream processing can be parallelized.
When the upstream stage writes to a GroupByKey (GBK), we must group the data by key before reshuffling it, because all data belonging to the same key must be processed on the same worker.
If the upstream stage is not performing a GBK, then we can simply reshard everything without worrying about individual keys.
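
The two cases can be sketched in plain Python, without any Ray or Beam dependency. This is only an illustration of the idea, not the runner's implementation: the helper names `reshard_evenly` and `group_then_reshard`, the `num_shards` parameter, and the round-robin placement are all assumptions made for the example.

```python
from collections import defaultdict
from typing import Any, Hashable, Iterable, List, Tuple


def reshard_evenly(elements: Iterable[Any], num_shards: int) -> List[List[Any]]:
    """Non-GBK case: spread elements round-robin, ignoring keys entirely."""
    shards: List[List[Any]] = [[] for _ in range(num_shards)]
    for i, element in enumerate(elements):
        shards[i % num_shards].append(element)
    return shards


def group_then_reshard(
    pairs: Iterable[Tuple[Hashable, Any]], num_shards: int
) -> List[List[Tuple[Hashable, List[Any]]]]:
    """GBK case: group values by key first, then distribute whole groups."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)

    # Each (key, values) group is placed in exactly one shard, so every
    # value for a given key ends up on the same (hypothetical) worker.
    shards: List[List[Tuple[Hashable, List[Any]]]] = [[] for _ in range(num_shards)]
    for i, (key, values) in enumerate(grouped.items()):
        shards[i % num_shards].append((key, values))
    return shards


plain_shards = reshard_evenly(range(5), num_shards=2)
keyed_shards = group_then_reshard([("a", 1), ("b", 2), ("a", 3)], num_shards=2)
```

In the keyed case a group is never split across shards, which is the constraint described above; the non-keyed case is free to balance purely on element count.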