#10548: Support tile layout for width/height-sharded concat #13744

jerrysky3 · 2024-10-11T12:32:57Z

Ticket

Problem description

Change kernel to support concat on both width and height-sharded tensors in tile layout. The kernel supports the following cases:

Height-sharded width concat in row-major/tile layout
Width-sharded height concat in row-major/tile layout

For now it's only used to concat > 2 tensors. Two tensor concatenation is currently handled by a special unrolled kernel due to performance issue with runtime args. This new kernel can be unrolled and replace it if needed in the future

What's changed

Rename and change s2s_rm_concat_multi_core to s2s_concat_multi_core to support tile layout besides the row-major layout
Rename reader_s2s_rm_tensor_concat.cpp to reader_s2s_tensor_concat.cpp

Checklist

Post commit CI passes https://github.com/tenstorrent/tt-metal/actions/runs/11396048642
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable) https://github.com/tenstorrent/tt-metal/actions/runs/11396057538
Device performance regression CI testing passes (if applicable) https://github.com/tenstorrent/tt-metal/actions/runs/11396055725, the fails are due to other changes
New/Existing tests provide coverage for changes

jerrysky3 · 2024-10-18T08:28:25Z

Device performance regression (https://github.com/tenstorrent/tt-metal/actions/runs/11396055725) is failing with:

AssertionError: Some model(s) AVG DEVICE KERNEL SAMPLES/S are faster than expected, see above for details. {'AVG DEVICE KERNEL SAMPLES/S': [('ttnn_distilbert8_distilbert-base-uncased-distilled-squad', 90.4624, 20.394), ('ttnn_functional_ttnn_vgg11_1_', 270.5544, 108.871), ('ttnn_functional_ttnn_vgg16_1_', 195.0345, 95.378)]}

However it is currently also happening on the main branch: https://github.com/tenstorrent/tt-metal/actions/runs/11399044245/job/31717674624#step:9:2739

sjameelTT

Looks good to me, nice work.

sjameelTT · 2024-10-18T16:31:40Z

Device performance regression (https://github.com/tenstorrent/tt-metal/actions/runs/11396055725) is failing with:
AssertionError: Some model(s) AVG DEVICE KERNEL SAMPLES/S are faster than expected, see above for details. {'AVG DEVICE KERNEL SAMPLES/S': [('ttnn_distilbert8_distilbert-base-uncased-distilled-squad', 90.4624, 20.394), ('ttnn_functional_ttnn_vgg11_1_', 270.5544, 108.871), ('ttnn_functional_ttnn_vgg16_1_', 195.0345, 95.378)]}
However it is currently also happening on the main branch: https://github.com/tenstorrent/tt-metal/actions/runs/11399044245/job/31717674624#step:9:2739

Yeah this was actually me, perf numbers have improved on a bunch of models but I didn't update them with my change since we fail on very big improvements too (to prevent the numbers from getting stale).

jerrysky3 · 2024-10-21T08:34:28Z

Hi @ayerofieiev-tt , this is another kernel ramp-up task that needs a code owner review and merge. Thanks!

tenstorrent#13744) Co-authored-by: Artem Yerofieiev <169092593+ayerofieiev-tt@users.noreply.github.com>

jerrysky3 mentioned this pull request Oct 11, 2024

#10548: Support tile layout for width/height-sharded concat #13514

Closed

5 tasks

jerrysky3 temporarily deployed to dev October 11, 2024 12:33 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 11, 2024 12:34 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 11, 2024 12:38 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 11, 2024 12:53 — with GitHub Actions Inactive

jerrysky3 had a problem deploying to dev October 14, 2024 02:27 — with GitHub Actions Error

jerrysky3 temporarily deployed to dev October 14, 2024 02:28 — with GitHub Actions Inactive

jerrysky3 force-pushed the jerrysky3/i-10548 branch from 6ffbbda to 19772ed Compare October 14, 2024 02:28

jerrysky3 temporarily deployed to dev October 14, 2024 02:29 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 14, 2024 02:30 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 14, 2024 02:42 — with GitHub Actions Inactive

jerrysky3 had a problem deploying to dev October 14, 2024 02:42 — with GitHub Actions Failure

jerrysky3 temporarily deployed to dev October 14, 2024 02:43 — with GitHub Actions Inactive

jerrysky3 had a problem deploying to dev October 14, 2024 02:43 — with GitHub Actions Failure

#10548: Support tile layout for width/height-sharded concat

f33f6bb

jerrysky3 temporarily deployed to dev October 18, 2024 02:06 — with GitHub Actions Inactive

jerrysky3 temporarily deployed to dev October 18, 2024 02:10 — with GitHub Actions Inactive

jerrysky3 had a problem deploying to dev October 18, 2024 02:10 — with GitHub Actions Failure

jerrysky3 had a problem deploying to dev October 18, 2024 06:12 — with GitHub Actions Failure

jerrysky3 marked this pull request as ready for review October 18, 2024 08:29

jerrysky3 requested review from ayerofieiev-tt, dmakoviichuk-tt, rfurko-tt, cfjchu, TT-BrianLiu, razorback3, dongjin-na, ntarafdar, sjameelTT, yan-zaretskiy and jaykru-tt as code owners October 18, 2024 08:29

sjameelTT approved these changes Oct 18, 2024

View reviewed changes

ayerofieiev-tt approved these changes Oct 21, 2024

View reviewed changes

ayerofieiev-tt added 2 commits October 21, 2024 15:36

Merge branch 'main' into jerrysky3/i-10548

5d83492

Merge branch 'main' into jerrysky3/i-10548

a715ba6

ayerofieiev-tt merged commit 0326577 into main Oct 22, 2024
7 checks passed

ayerofieiev-tt deleted the jerrysky3/i-10548 branch October 22, 2024 00:06

ct-clmsn pushed a commit to ct-clmsn/tt-metal that referenced this pull request Nov 12, 2024

tenstorrent#10548: Support tile layout for width/height-sharded concat (

0f60807

tenstorrent#13744) Co-authored-by: Artem Yerofieiev <169092593+ayerofieiev-tt@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#10548: Support tile layout for width/height-sharded concat #13744

#10548: Support tile layout for width/height-sharded concat #13744

jerrysky3 commented Oct 11, 2024 •

edited

Loading

jerrysky3 commented Oct 18, 2024

sjameelTT left a comment

sjameelTT commented Oct 18, 2024

jerrysky3 commented Oct 21, 2024

#10548: Support tile layout for width/height-sharded concat #13744

#10548: Support tile layout for width/height-sharded concat #13744

Conversation

jerrysky3 commented Oct 11, 2024 • edited Loading

Ticket

Problem description

What's changed

Checklist

jerrysky3 commented Oct 18, 2024

sjameelTT left a comment

Choose a reason for hiding this comment

sjameelTT commented Oct 18, 2024

jerrysky3 commented Oct 21, 2024

jerrysky3 commented Oct 11, 2024 •

edited

Loading