Metal optimizations #2062

qwerasd205 · 2024-08-07T22:24:18Z

This PR significantly reworks the Metal shaders for memory efficiency. Additionally I've adjusted the formatting of the shader code for readability, and normalized the naming conventions.

There are two main changes that affect memory efficiency.

Background cells are now a static sized buffer of cols * rows total uchar4s representing the cell colors, which are drawn with a single fullscreen shader.
I've taken steps to optimize the foreground (text) cell vertex data, namely:
- Background color (4 bytes) has been removed, since it can be fetched from the bg color buffer based on grid position.
- I demoted the glyph bearings from [2]i32 to [2]i16, it seems like a sane expectation to me that the bearings will never be more than 32K pixels in a given direction.
- I have reorganized the structs to be as compact as possible in memory, so that the size (with padding) is now 32 bytes, as opposed to the prior size which was something like 56 bytes.

Beyond this, I have added a couple other small optimizations, such as a fullscreen vertex shader that uses a single triangle, and using pixel coordinates to sample the glyph texture so that conversion to normalized coordinates isn't necessary.

Results

Experimentally I saw a ~20% reduction in total GPU memory use, and a ~80% reduction in max allocated buffer size.

TODO

Caution

There is a very real chance that this PR could have issues under macOS 13 for Intel processors, due to the known bugginess of the macOS 13 Intel Metal drivers. Comprehensive testing on an Intel mac running macOS 13 must be performed before this can be merged.

Verify correct behavior on an Intel mac under macOS 13. DO NOT MERGE WITHOUT TESTING

- Significant changes to optimize memory usage. - Adjusted formatting of the metal shader code to improve readability. - Normalized naming conventions in shader code. - Abstracted repetitive code for attribute descriptors to a helper function.

clason · 2024-08-08T17:32:23Z

@mitchellh I'm happy to test the builds on my Intel macOS 13 potato, but that machine is not set up for Zig builds. Could you enable the build workflows for this PR so I can grab the artefacts for testing?

(I'm also open for suggestions what to test specifically to exercise all the codepaths.)

mitchellh · 2024-08-08T21:38:32Z

Apparently I can't build the release from 3rd party forks :( Let me see what I can do.

mitchellh

Looks amazing, some really tiny comments. I'll try to figure out the Intel builds.

src/renderer/metal/cell.zig

mitchellh · 2024-08-08T21:45:13Z

src/renderer/metal/shaders.zig

@@ -845,6 +678,41 @@ fn initImagePipeline(device: objc.Object, library: objc.Object) !objc.Object {
    return pipeline_state;
 }

+fn autoAttribute(T: type, attrs: objc.Object) void {


Well well well aren't we fancy. ❤️

mitchellh · 2024-08-08T21:49:34Z

asciinema: bug.cast.zip

font-size = 15
font-family = JetBrains Mono
background-opacity = 0.95
background-blur-radius = 20
macos-titlebar-style = tabs
mouse-hide-while-typing = true
window-save-state = never

…ized ints

clason · 2024-08-09T07:03:13Z

That's one way, I guess :) Happy to report this build looks fine and even fixes the regression with the earlier build I reported on Discord.

qwerasd205 added 4 commits August 7, 2024 17:39

renderer: metal shaders rework

6339f9b

- Significant changes to optimize memory usage. - Adjusted formatting of the metal shader code to improve readability. - Normalized naming conventions in shader code. - Abstracted repetitive code for attribute descriptors to a helper function.

fix tests

76dc157

fix: use single triangle for metal post shader vertex

3a58b89

renderer/Metal: remove extraneous len arg from drawCellBgs

e5241cb

mitchellh requested changes Aug 8, 2024

View reviewed changes

qwerasd205 added 6 commits August 8, 2024 19:03

renderer/metal: properly support padding color = background (not extend)

d689065

renderer/metal: use memset to clear bg cell rows

732483c

comment

e4ab550

remove superfluous slicing syntax

bdbf5ad

fix: promote dimensions to usize so cell_count doesn't overflow

740dce6

fix: add Contents.bgCell to avoid accidentally indexing with unders…

f47ab3e

…ized ints

mitchellh merged commit 33d9c04 into ghostty-org:main Aug 9, 2024
17 of 19 checks passed

mitchellh deleted the metal-optimizations branch August 9, 2024 01:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metal optimizations #2062

Metal optimizations #2062

qwerasd205 commented Aug 7, 2024

clason commented Aug 8, 2024 •

edited

Loading

mitchellh commented Aug 8, 2024

mitchellh left a comment

mitchellh Aug 8, 2024

mitchellh commented Aug 8, 2024 •

edited

Loading

clason commented Aug 9, 2024

Metal optimizations #2062

Metal optimizations #2062

Conversation

qwerasd205 commented Aug 7, 2024

Results

TODO

clason commented Aug 8, 2024 • edited Loading

mitchellh commented Aug 8, 2024

mitchellh left a comment

Choose a reason for hiding this comment

mitchellh Aug 8, 2024

Choose a reason for hiding this comment

mitchellh commented Aug 8, 2024 • edited Loading

clason commented Aug 9, 2024

clason commented Aug 8, 2024 •

edited

Loading

mitchellh commented Aug 8, 2024 •

edited

Loading