cpu: x64: matmul: enable treat_as_plain for weights format by xuxinzen · Pull Request #5038 · uxlfoundation/oneDNN

xuxinzen · 2026-04-16T18:55:02Z

Fixes MFDNN-14900

Using plain format when N == 1 for transpose format, which works the same before guilty commit. And the TF confirms that the patch works fine on their end.

CI tests

dzarukin · 2026-04-16T19:02:22Z

                    && memory_desc_matches_tag(
                            B_md, transposed_tensor_layout_tag);
+            const bool treat_as_plain = plain_transposed_matched && bgmmc.N == 1
+                    && !bgmmc.is_int4_weights;


I anticipate to see a comment explain the necessity of this variable and its conditions. It took several people for several days to acknowledge the issue and narrow down it to matmul specific setting behavior.

Additionally, I wonder why no recently introduced canonical call is not used.

Thirdly, it's unclear why parallel creation lead to the crash, which shouldn't happen by primitive cache design, insights are desired.

And please convert the reproducer from the tracker into a test_concurrency gtest expansion as a regression test.

I'm also wondering whether the issue has been there for long time or did we recently introduce it?

I'll add some comments and this patch is a temporary fixup. I need more time to investigate the cause and get a real solution

As for why not using is_canonical(), I tried to use it before and the issue was resolve with c++ reproducer. But the results from TF team said the segfault issue was still there.

I can reproduce this issue with rls-v3.7 when using correct weights format for transposed. brgemm matmul uses plain for transpose format when N == 1 before the guilty commit.

I'm also wondering whether the issue has been there for long time or did we recently introduce it?

It was introduced a year ago by PR #3027. Specifically this stride check creates undefined behavior when corresponding dimension is trivial.

If the stride_check is case, then the reason of not using is_canonial() that mentioned above can be explained as I still keep the code to reach the stride check. I'll continue investigating that. Thanks

I don’t fully understand this fix since we don’t know the root cause yet. You said you needed more time to find it and come up with a real solution so it’s not clear whether this is actually fixing the issue or just happens to make it disappear. I don’t think we should promote it as a fix until we understand that

cpu: x64: matmul: enable treat_as_plain for weights format

e752792

xuxinzen requested a review from a team as a code owner April 16, 2026 18:55

xuxinzen added bug A confirmed library bug platform:cpu-x64 Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64 labels Apr 16, 2026

This was referenced Apr 16, 2026

[Backport]cpu: x64: matmul: enable treat_as_plain for weights format #5039

Closed

[Backport]: cpu: x64: matmul: enable treat_as_plain for weights format #5040

Closed

dzarukin requested changes Apr 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cpu: x64: matmul: enable treat_as_plain for weights format#5038

cpu: x64: matmul: enable treat_as_plain for weights format#5038
xuxinzen wants to merge 1 commit intomainfrom
xzeng/fixup_cache_key_collision

xuxinzen commented Apr 16, 2026

Uh oh!

dzarukin Apr 16, 2026

Uh oh!

dzarukin Apr 16, 2026

Uh oh!

densamoilov Apr 16, 2026

Uh oh!

xuxinzen Apr 16, 2026

Uh oh!

xuxinzen Apr 16, 2026

Uh oh!

vpirogov Apr 16, 2026 •

edited

Loading

Uh oh!

xuxinzen Apr 16, 2026

Uh oh!

densamoilov Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

xuxinzen commented Apr 16, 2026

Uh oh!

dzarukin Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

dzarukin Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

densamoilov Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

xuxinzen Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

xuxinzen Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

vpirogov Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xuxinzen Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

densamoilov Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

vpirogov Apr 16, 2026 •

edited

Loading