reorder weights according to their precision #252

Closed
wants to merge 1 commit

Conversation

@ngc92 (Contributor) commented Apr 25, 2024

Simplify our logic by keeping weights of the same precision close together.

(If we want to go with this, we also need to update the fp32 network to match; hence, for now this is a Draft PR)
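For illustration, here is a minimal sketch (not the PR's actual diff) of what "keeping weights of the same precision close together" looks like when computing tensor offsets. `NUM_TENSORS`, `num_elements`, and `is_fp32` are hypothetical names, not llm.c identifiers:

```c
// Illustrative sketch: lay out parameter tensors so that all tensors of one
// precision form a single contiguous region of the parameter buffer.
#include <stddef.h>

#define NUM_TENSORS 16  // hypothetical count of parameter tensors

// Compute byte offsets so fp32 tensors come first, then the bf16 ones.
// Returns the total buffer size; offsets[i] is where tensor i starts.
size_t layout_params(const size_t *num_elements, const int *is_fp32,
                     size_t *offsets) {
    size_t cursor = 0;
    // first pass: fp32 tensors (4 bytes per element)
    for (int i = 0; i < NUM_TENSORS; i++) {
        if (is_fp32[i]) { offsets[i] = cursor; cursor += num_elements[i] * 4; }
    }
    // second pass: bf16 tensors (2 bytes per element)
    for (int i = 0; i < NUM_TENSORS; i++) {
        if (!is_fp32[i]) { offsets[i] = cursor; cursor += num_elements[i] * 2; }
    }
    return cursor; // total bytes for one malloc/cudaMalloc
}
```

Because the fp32 tensors come first, the 4-byte tensors stay naturally aligned and each precision can then be treated as one flat array.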

@karpathy (Owner) commented
Random thought: I was already going to update how we write mixed-precision tensors to file, and how they are read back inside the mixed-precision program. We could couple this change with that one; then we don't have to update the fp32 part of the code.

This is not just a code refactor; afaict it would also speed up the code, because our Adam kernels would run over more elements at a time?
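A hedged sketch of why contiguity helps: once all fp32 tensors form one region, the optimizer can update them with a single kernel launch over the whole region instead of one launch per tensor. The kernel below is an illustrative AdamW update, not llm.c's exact signature:

```cuda
// Illustrative AdamW kernel over one contiguous same-precision region.
__global__ void adamw_kernel(float* params, const float* grads,
                             float* m, float* v, size_t n,
                             float lr, float beta1, float beta2,
                             float eps, float weight_decay, int t) {
    size_t i = (size_t)blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float g = grads[i];
    m[i] = beta1 * m[i] + (1.0f - beta1) * g;  // first moment
    v[i] = beta2 * v[i] + (1.0f - beta2) * g * g;  // second moment
    float mhat = m[i] / (1.0f - powf(beta1, (float)t));  // bias correction
    float vhat = v[i] / (1.0f - powf(beta2, (float)t));
    params[i] -= lr * (mhat / (sqrtf(vhat) + eps) + weight_decay * params[i]);
}

// Before: a loop of NUM_TENSORS launches, one per tensor.
// After: a single launch over the contiguous fp32 region of total_fp32 elements:
//   int block = 512;
//   int grid = (int)((total_fp32 + block - 1) / block);
//   adamw_kernel<<<grid, block>>>(params, grads, m, v, total_fp32, ...);
```

Fewer launches means less launch overhead and better occupancy for the small tensors that previously got their own tiny grids.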

@karpathy (Owner) commented
Seems like we should be able to directly load the version 2 files saved from the .py file now on master! Will check tomorrow.
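For context, a rough sketch of what loading a versioned checkpoint could look like on the C side. The 256-int header with magic and version fields mirrors llm.c's file convention, but the constants and the meaning assigned to each version below are illustrative assumptions, not the repo's actual definitions:

```c
// Illustrative header check when loading a model checkpoint.
#include <stdio.h>
#include <stdlib.h>

void load_checkpoint_header(FILE *f) {
    int header[256];
    if (fread(header, sizeof(int), 256, f) != 256) { exit(1); }
    if (header[0] != 20240326) {  // assumed magic number for model files
        fprintf(stderr, "bad magic\n"); exit(1);
    }
    int version = header[1];
    if (version == 1) {
        // assumed: all weights stored as fp32
    } else if (version == 2) {
        // assumed: the newer export format referenced above
    } else {
        fprintf(stderr, "unsupported version %d\n", version); exit(1);
    }
}
```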
