fix: use correct BLOCK_SIZE in matmul function for ggml type #668

xffxff · 2023-08-30T01:49:21Z

To compute k_in_lhs_blocks, we should use T::VecDotType::BLOCK_SIZE, because lhs_b is obtained via T::VecDotType::from_float. Similarly, k_in_rhs_blocks should use T::BLOCK_SIZE, since rhs_t is a slice of T. (I'm just starting to learn quantization, so I'm not sure this is correct, but it seems intuitive to me.)
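To make the reasoning concrete, here is a minimal, self-contained sketch of the two block counts. It is not the actual matmul code: the trait BlockType, the types LhsBlock and RhsBlock, their block sizes (32 and 64), and the helpers blocks_per_row and k_in_blocks are all hypothetical, chosen only so that the two block sizes differ and the mix-up becomes visible.

```rust
/// Hypothetical stand-in for a quantized block type (an assumption for
/// illustration, not the real trait from the codebase).
trait BlockType {
    /// Number of f32 values packed into one block.
    const BLOCK_SIZE: usize;
    /// Type the f32 lhs is quantized into before the dot product.
    type VecDotType: BlockType;
}

/// Hypothetical lhs block type holding 32 values per block.
#[allow(dead_code)]
struct LhsBlock;

/// Hypothetical rhs (weight) block type holding 64 values per block.
#[allow(dead_code)]
struct RhsBlock;

impl BlockType for LhsBlock {
    const BLOCK_SIZE: usize = 32;
    type VecDotType = LhsBlock;
}

impl BlockType for RhsBlock {
    const BLOCK_SIZE: usize = 64;
    type VecDotType = LhsBlock;
}

/// Number of blocks needed to cover `k` values, rounding up.
fn blocks_per_row(k: usize, block_size: usize) -> usize {
    (k + block_size - 1) / block_size
}

/// Returns (k_in_lhs_blocks, k_in_rhs_blocks) for an rhs of type `T`.
/// The lhs is stored as `T::VecDotType`, so its count uses that block size;
/// the rhs is stored as `T`, so its count uses `T::BLOCK_SIZE`.
fn k_in_blocks<T: BlockType>(k: usize) -> (usize, usize) {
    let k_in_lhs_blocks = blocks_per_row(k, <T::VecDotType as BlockType>::BLOCK_SIZE);
    let k_in_rhs_blocks = blocks_per_row(k, T::BLOCK_SIZE);
    (k_in_lhs_blocks, k_in_rhs_blocks)
}
```

With these made-up sizes, k_in_blocks::<RhsBlock>(128) gives 4 lhs blocks and 2 rhs blocks. Swapping the two constants would reverse those counts, so each row would be indexed with the wrong stride. When both types share the same block size the two counts coincide and the mix-up has no visible effect.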

fix: use correct BLOCK_SIZE in matmul function for ggml type

34d3154
