
[wgpu-hal] Blas compaction #7101

Merged 3 commits into gfx-rs:trunk from compaction-hal on Feb 12, 2025
Conversation

@Vecvec (Contributor) commented Feb 10, 2025

Connections
Hal part of #6609

Description
Adds support for compacting BLASes (bottom-level acceleration structures) in wgpu-hal.

Testing
No testing, but the successful tests in #6609 tested this API.

Checklist

  • Run cargo fmt.
  • Run cargo clippy.
  • Run cargo xtask test to run tests.
  • Add change to CHANGELOG.md. See simple instructions inside file.
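To make the hal-level flow concrete, here is a minimal, simplified model of how BLAS compaction typically works: the device writes the compacted size into a query slot, the host reads it back, and the structure is copied into a smaller allocation. This is an illustrative sketch only; the struct and function names are hypothetical and not wgpu-hal's actual API (on Vulkan the real backend drives this with `vkCmdWriteAccelerationStructuresPropertiesKHR` and `vkCmdCopyAccelerationStructureKHR`).

```rust
// Hypothetical, host-side model of the BLAS compaction flow.
// All names are illustrative, not the actual wgpu-hal API.

#[derive(Debug)]
struct Blas {
    backing_size: u64, // bytes of GPU memory backing the structure
}

/// Step 1: the device writes the compacted size (a single u64, i.e. the
/// 8 bytes mentioned in the discussion below) into a query pool slot.
fn query_compacted_size(blas: &Blas) -> u64 {
    // Stand-in for the GPU query; drivers typically report a size
    // smaller than the conservatively allocated build size.
    blas.backing_size / 2
}

/// Step 2: after reading the size back on the host, allocate a new,
/// smaller BLAS and copy the old one into it in "compact" mode.
fn compact(blas: &Blas) -> Blas {
    let compacted_size = query_compacted_size(blas);
    assert!(compacted_size <= blas.backing_size);
    Blas { backing_size: compacted_size }
}

fn main() {
    let built = Blas { backing_size: 4096 };
    let compacted = compact(&built);
    println!("{} -> {}", built.backing_size, compacted.backing_size);
}
```

Note the extra synchronization this implies: the build must finish before the size query, and the query result must be read back before the compacting copy can be recorded.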

@Vecvec Vecvec requested a review from a team as a code owner February 10, 2025 19:56
@Vecvec Vecvec changed the title Initial. [wgpu-hal] Blas compaction Feb 10, 2025
@nical (Contributor) left a comment
LGTM. I suspect that it would be better to have fewer/larger query pools in the vulkan backend, but we can cross that bridge if it shows up in profiles.

@nical nical enabled auto-merge (squash) February 12, 2025 13:09
@nical nical merged commit 3a4a40a into gfx-rs:trunk Feb 12, 2025
33 checks passed
@cwfitzgerald cwfitzgerald deleted the compaction-hal branch February 12, 2025 18:29
@Vecvec (Author) commented Feb 12, 2025

> LGTM. I suspect that it would be better to have fewer/larger query pools in the vulkan backend, but we can cross that bridge if it shows up in profiles.

@JMS55 mentioned this on Matrix too, but I'd need to look at the specifics (and how expensive they are). For Mesa at least this is 8 bytes read back from the acceleration structure, so I would be surprised if this is the main bottleneck (especially since my current plan is to combine this into a build command, which is not very fast itself).

@nical (Contributor) commented Feb 12, 2025

It's not the cost of the copy as much as the driver overhead of managing a lot of tiny pools (and maybe the per-pool memory overhead). The recommendation is generally to group things into large-ish pools but to be honest I don't know how impactful it is.
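The grouping nical describes can be sketched as a simple slot allocator: instead of creating one tiny query pool per compaction query, hand out slots from shared fixed-capacity pools so driver-side object creation scales with the pool count, not the query count. The names and the capacity constant below are hypothetical, not wgpu-hal's actual Vulkan backend code.

```rust
// Sketch of the "fewer, larger query pools" idea: a set of shared pools
// of fixed capacity, created lazily. `pools_created` stands in for calls
// to something like vkCreateQueryPool. Names are illustrative only.

const POOL_CAPACITY: u32 = 64; // queries per pool (tunable)

#[derive(Default)]
struct QueryPoolSet {
    pools_created: u32, // driver objects created so far
    used_in_last: u32,  // slots handed out from the newest pool
}

impl QueryPoolSet {
    /// Returns (pool index, slot index) for one compacted-size query.
    fn allocate(&mut self) -> (u32, u32) {
        if self.pools_created == 0 || self.used_in_last == POOL_CAPACITY {
            self.pools_created += 1; // only here do we touch the driver
            self.used_in_last = 0;
        }
        let slot = self.used_in_last;
        self.used_in_last += 1;
        (self.pools_created - 1, slot)
    }
}

fn main() {
    let mut set = QueryPoolSet::default();
    // 100 BLAS compaction queries need only 2 pools instead of 100.
    let last = (0..100).map(|_| set.allocate()).last().unwrap();
    println!("pools = {}, last slot = {:?}", set.pools_created, last);
}
```

The trade-off is the per-pool memory overhead mentioned above: a mostly-empty 64-slot pool wastes slots, so the capacity would want tuning against real workloads.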

@Vecvec (Author) commented Feb 12, 2025

I think I'll add it to

// Ray tracing
// Major missing optimizations (no api surface changes needed):
// - use custom tracker to track build state
// - no forced rebuilt (build mode deduction)
// - lazy instance buffer allocation
// - maybe share scratch and instance staging buffer allocation
// - partial instance buffer uploads (api surface already designed with this in mind)
// - ([non performance] extract function in build (rust function extraction with guards is a pain))
when doing wgpu-core compaction.

@JMS55 (Collaborator) commented Feb 12, 2025

Yeah this is something we can stress test once I start the actual RT work in Bevy and have something to measure.

marcpabst pushed a commit to marcpabst/wgpu that referenced this pull request Feb 19, 2025