[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel #28664

a-sidorova · 2025-01-24T12:35:42Z

Details:

The performance experiments (see the mentioned ticket please) show that N_Tail processing should be at the end of BrgemmCopyBKernel. The current PR moves tail processing from the beginning to the end in kernel

Tickets:

CVS-161315

...c/transformations/snippets/x64/pass/lowered/expressions/brgemm_copy_b_buffer_expressions.cpp

IvanNovoselov · 2025-01-30T11:43:21Z

src/plugins/intel_cpu/src/transformations/snippets/x64/op/brgemm_utils.hpp

+inline T compute_LDB(T n_block, const ov::element::Type& precision) {
+    return compute_repacked_n_dim(n_block, precision);


Why do we need 2 different functions that do the same thing?
Should we replace all compute_LDB calls with compute_repacked_n_dim then?

I left the both them just to split logic into get_LDB and get_repacked_N by sense.
But the implementation is the same.

Anyway, I replaced compite_LDB with compute_repacked_n_dim in 15dadd5

IvanNovoselov · 2025-01-30T11:52:29Z

...c/transformations/snippets/x64/pass/lowered/expressions/brgemm_copy_b_buffer_expressions.cpp

-        const auto& precision = parent_expr->get_node()->get_input_element_type(0);
-        m_allocation_size = std::max(n_blk, compute_inner_n_block(precision));
-    }
+    m_allocation_size = compute_repacked_n_dim(n_blk, precision);


It's nice that we now reuse the dynamic values handling from ov::snippets::utils::rnd_up, and don't have to replicate this logic anywhere else.

IvanNovoselov

👍

a-sidorova added this to the 2025.1 milestone Jan 24, 2025

github-actions bot added the category: CPU OpenVINO CPU plugin label Jan 24, 2025

a-sidorova force-pushed the feature/snippets/optimized_brgemm_copy_b_kernel_tail branch from 9a47d46 to f5291d8 Compare January 27, 2025 05:53

a-sidorova marked this pull request as ready for review January 28, 2025 06:25

a-sidorova requested review from a team as code owners January 28, 2025 06:25

a-sidorova assigned v-Golubev and IvanNovoselov Jan 28, 2025

a-sidorova requested review from v-Golubev and IvanNovoselov January 28, 2025 06:26

v-Golubev approved these changes Jan 28, 2025

View reviewed changes

...c/transformations/snippets/x64/pass/lowered/expressions/brgemm_copy_b_buffer_expressions.cpp Outdated Show resolved Hide resolved

a-sidorova unassigned v-Golubev Jan 29, 2025

IvanNovoselov reviewed Jan 30, 2025

View reviewed changes

a-sidorova force-pushed the feature/snippets/optimized_brgemm_copy_b_kernel_tail branch from c64f34f to 15dadd5 Compare January 31, 2025 05:09

a-sidorova requested a review from IvanNovoselov January 31, 2025 05:11

IvanNovoselov approved these changes Jan 31, 2025

View reviewed changes

a-sidorova added 3 commits February 4, 2025 11:13

[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel

b659f11

[Snippets][CPU] Created compute_repacked_n_dim helper

68be3f8

[Snippets][CPU] Applied Ivan comment

6ea5aff

a-sidorova force-pushed the feature/snippets/optimized_brgemm_copy_b_kernel_tail branch from 15dadd5 to 6ea5aff Compare February 4, 2025 07:13

IvanNovoselov added this pull request to the merge queue Feb 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel #28664

[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel #28664

a-sidorova commented Jan 24, 2025 •

edited

Loading

IvanNovoselov Jan 30, 2025

a-sidorova Jan 31, 2025

IvanNovoselov Jan 30, 2025

IvanNovoselov left a comment

		inline T compute_LDB(T n_block, const ov::element::Type& precision) {
		return compute_repacked_n_dim(n_block, precision);

[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel #28664

[Snippets][CPU] Moved N_tail processing to the end in BrgemmCopyBKernel #28664

Conversation

a-sidorova commented Jan 24, 2025 • edited Loading

Details:

Tickets:

IvanNovoselov Jan 30, 2025

Choose a reason for hiding this comment

a-sidorova Jan 31, 2025

Choose a reason for hiding this comment

IvanNovoselov Jan 30, 2025

Choose a reason for hiding this comment

IvanNovoselov left a comment

Choose a reason for hiding this comment

a-sidorova commented Jan 24, 2025 •

edited

Loading