Skip to content

[ascend] optimize gated rmsnorm#324

Open
wanfengcxz wants to merge 2 commits into
DeepLink-org:mainfrom
wanfengcxz:wq/optim_gated_rmsnorm
Open

[ascend] optimize gated rmsnorm#324
wanfengcxz wants to merge 2 commits into
DeepLink-org:mainfrom
wanfengcxz:wq/optim_gated_rmsnorm

Conversation

@wanfengcxz
Copy link
Copy Markdown
Collaborator

decoding stage, batch_size=128
old code(triton kernel):
20260420-160531
optimize code:
20260420-160540

@wanfengcxz wanfengcxz requested a review from jinminxi104 as a code owner April 20, 2026 08:08
@jinminxi104
Copy link
Copy Markdown
Collaborator

This PR is on hold for RL precision adjustments.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants