Restructure MoE and Add MoE prepare input kernels #29

kareemshaik80 · 2025-10-30T05:18:03Z

restructure moe kernels folder
add prepare moe inputs kernels
- compute_problem_sizes
- compute_expert_offsets
- compute_expert_blockscale_offsets
- compute_arg_sorts
- ShuffleRows
- ApplyShuffleMulSum

- restructure moe kernels folder
- add prepare moe inputs kernel

Signed-off-by: kareem <[email protected]>

adityachatter

LGTM.

airMeng · 2025-11-11T03:35:56Z

src/sycl/kernels/moe/activations.cpp

Activations are not used only by MoE, so it is better to leave this file where it was.
Besides, I'd prefer to keep only customized CUTLASS code under src/sycl/kernels/ and leave pure SYCL code outside it.

airMeng · 2025-11-11T03:39:33Z

tests/test_moe_prepare_input.py

+
+@pytest.mark.parametrize("num_tokens", [5, 16, 128])
+@pytest.mark.parametrize("num_experts", [4, 8, 32])
+@pytest.mark.parametrize("top_k", [2])
+@pytest.mark.parametrize("hidden_dims", [16, 32, 64])


https://github.com/sgl-project/sglang/blob/08c805a85f2915de36d483371f901b5a8dbc6f66/python/sglang/test/test_cutlass_moe.py#L32

At least cover this scenario.

airMeng · 2025-11-11T05:56:01Z

src/sycl/kernels/moe/prepare_inputs.cpp

+    void operator()(sycl::nd_item<1> item) const {
+      int32_t tot_offset = 0;
+      int32_t tot_rounded_offset = 0;
+      expert_offsets_[0] = 0;
+      blockscale_offsets_[0] = 0;
+      for (int i = 0; i < num_experts_; ++i) {
+        atomic_buffer_[i] = tot_offset;
+        int num_tokens = problem_sizes1_[i * 3];
+        int rounded_num_tokens = (num_tokens + (block_size - 1)) / block_size * block_size;
+        tot_offset += num_tokens;
+        tot_rounded_offset += rounded_num_tokens;
+        expert_offsets_[i + 1] = tot_offset;
+        blockscale_offsets_[i + 1] = tot_rounded_offset;
+      }
+    }


Is this function purely sequential? Try to parallelize it using sycl::exclusive_scan; see https://github.com/intel/llvm/blob/4474e85c51c1c3153af9938164391d1e836cfff4/sycl/doc/extensions/removed/sycl_ext_oneapi_group_algorithms.asciidoc?plain=1#L75
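
For reference, the loop above amounts to two prefix sums over the per-expert token counts, which is why a scan fits. The following is a minimal host-side sketch in standard C++17, not the SYCL kernel from this PR. `ExpertOffsets`, `compute_offsets_with_scan` and `tokens_per_expert` are hypothetical names used only for illustration; `tokens_per_expert[i]` stands in for `problem_sizes1_[i * 3]`. Inside a kernel, the SYCL 2020 counterparts would presumably be `sycl::joint_exclusive_scan` or `sycl::exclusive_scan_over_group`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

// Hypothetical container for the three outputs written by the loop above.
struct ExpertOffsets {
  std::vector<int32_t> expert_offsets;      // num_experts + 1 entries
  std::vector<int32_t> blockscale_offsets;  // num_experts + 1 entries
  std::vector<int32_t> atomic_buffer;       // num_experts entries
};

// Assumption: tokens_per_expert[i] plays the role of problem_sizes1_[i * 3].
inline ExpertOffsets compute_offsets_with_scan(
    const std::vector<int32_t>& tokens_per_expert, int32_t block_size) {
  assert(block_size > 0);
  const std::size_t n = tokens_per_expert.size();

  ExpertOffsets out;
  out.expert_offsets.assign(n + 1, 0);
  out.blockscale_offsets.assign(n + 1, 0);
  out.atomic_buffer.assign(n, 0);

  // Round every count up to a multiple of block_size (independent per expert).
  std::vector<int32_t> rounded(n);
  std::transform(tokens_per_expert.begin(), tokens_per_expert.end(),
                 rounded.begin(), [block_size](int32_t t) {
                   return (t + (block_size - 1)) / block_size * block_size;
                 });

  // Inclusive scans written at index 1 reproduce offsets[i + 1];
  // index 0 keeps the initial 0.
  std::inclusive_scan(tokens_per_expert.begin(), tokens_per_expert.end(),
                      out.expert_offsets.begin() + 1);
  std::inclusive_scan(rounded.begin(), rounded.end(),
                      out.blockscale_offsets.begin() + 1);

  // The exclusive scan yields each expert's start offset (atomic_buffer_[i]).
  std::exclusive_scan(tokens_per_expert.begin(), tokens_per_expert.end(),
                      out.atomic_buffer.begin(), int32_t{0});
  return out;
}
```

Since each scan has a parallel formulation, the same decomposition should carry over to a work-group scan when `num_experts` fits in one work-group; whether that pays off for small expert counts would need measuring.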

Restructure MoE and add prepare inputs/meta kernel

bab489e

- restructure moe kernels folder
- add prepare moe inputs kernel

Signed-off-by: kareem <[email protected]>

kareemshaik80 changed the title ~~Restructure MoE and add prepare inputs/meta kernel~~ Restructure MoE and add prepare inputs/meta kernel [wip] Oct 30, 2025

kareemshaik80 commented Oct 30, 2025

src/sycl/kernels/moe/activations.cpp (resolved)

fix minor issues

d5e78ac

Signed-off-by: kareem <[email protected]>

kareemshaik80 changed the title ~~Restructure MoE and add prepare inputs/meta kernel [wip]~~ Restructure MoE and add routing kernel [wip] Oct 30, 2025

kareemshaik80 and others added 5 commits November 3, 2025 08:11

Add tests

651c0f6

Signed-off-by: kareem <[email protected]>

Add shuffle_rows Kernel

328d63a

Signed-off-by: kareem <[email protected]>

register shuffle_rows

efb105f

Signed-off-by: kareem <[email protected]>

Enable Build and Add apply_shuffle_mul_sum kernel

d849b61

Signed-off-by: kareem <[email protected]>

functional

f2f1577

Signed-off-by: Shaik, Kareem M <[email protected]>

kareemshaik80 changed the title ~~Restructure MoE and add routing kernel [wip]~~ Restructure MoE and Add prepare input kernels Nov 10, 2025

kareemshaik80 changed the title ~~Restructure MoE and Add prepare input kernels~~ Restructure MoE and Add MoE prepare input kernels Nov 10, 2025

kareemshaik80 added 2 commits November 10, 2025 08:18

cleanup

33fe2ed

Signed-off-by: kareem <[email protected]>

cleanup1

8944fbc

Signed-off-by: kareem <[email protected]>

adityachatter approved these changes Nov 10, 2025

airMeng reviewed Nov 11, 2025

airMeng added the run-ci label Nov 11, 2025

airMeng requested changes Nov 11, 2025

Merge branch 'sgl-project:main' into main

1be44d6
