[WIP] xe: gated_mlp: improve performance of ukernel-based gmlp by hidefromkgb · Pull Request #5059 · uxlfoundation/oneDNN

hidefromkgb · 2026-04-21T00:04:38Z

Partly addresses MFDNN-14598.

Perf results so far:

| GPU   | ukern, ms | ref, ms  | perf, %  |
|-------|-----------|----------|----------|
| PTL-H | 1.058240  | 1.079940 | 102.0506 |
| BMG   | 0.574106  | 0.447617 | 77.9677  |
| LNL   | 2.653071  | 2.894186 | 109.0881 |
| DG2   | 1.374104  | 0.712022 | 51.8172  |
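
The PR does not spell out how the `perf, %` column is derived, but the numbers are consistent with `ref / ukern * 100`, so values above 100 would mean the ukernel path is faster than the reference. A minimal sketch under that assumption (the function name `relative_perf` and the dictionary layout are hypothetical and not part of oneDNN):

```python
def relative_perf(ukern_ms: float, ref_ms: float) -> float:
    """Reference time as a percentage of ukernel time.

    Assumed formula, inferred from the table: ref / ukern * 100.
    """
    return ref_ms / ukern_ms * 100.0


# (ukern, ms; ref, ms) pairs copied from the table above.
timings_ms = {
    "PTL-H": (1.058240, 1.079940),
    "BMG": (0.574106, 0.447617),
    "LNL": (2.653071, 2.894186),
    "DG2": (1.374104, 0.712022),
}

perf = {gpu: relative_perf(u, r) for gpu, (u, r) in timings_ms.items()}

for gpu, pct in perf.items():
    # Above 100 %: the ukernel-based path beats the reference on this GPU.
    verdict = "ukernel faster" if pct > 100.0 else "reference faster"
    print(f"{gpu}\t{pct:.4f}\t{verdict}")
```

Read this way, the ukernel path is ahead on PTL-H and LNL and behind on BMG and DG2.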

hidefromkgb added 7 commits April 20, 2026 14:04

[WIP] xe: gated_mlp: improve performance of ukernel-based gmlp

035b91c

16,32,2,1 64x64 MATRIX CORRECT

bfe6872

EVERYTHING OK EXCEPT SLM SHAPE

def4e8d

BETTER DEBUG PRINT

128d599

DST BLOCKS CORRECT

b22de35

WORKING (L*I AT LEAST)

07a6a93

REMOVE DEBUG

30263a4

hidefromkgb requested review from a team as code owners April 21, 2026 00:04

github-actions bot added labels platform:gpu-intel (Codeowner: @oneapi-src/onednn-gpu-intel), component:tests (Codeowner: @oneapi-src/onednn-arch), and component:common Apr 21, 2026

dzarukin marked this pull request as draft April 21, 2026 01:19

TEST ALL SHAPES

34e41a9

hidefromkgb wants to merge 8 commits into main from aguskov/gated_mlp_ugemm_perf