add batch_norm op with test and benchmark #559

yanghailong-git · 2025-02-07T13:37:00Z

Summary

Implemented a 2D batch normalization Triton operator, successfully ran the corresponding tests and benchmarks, and visualized the performance tests for speed and memory.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

the visualization of performance:

yundai424 · 2025-02-11T06:51:35Z

looks like from the benchmark result triton impl is slower than HF original one? 👀

yanghailong-git · 2025-02-12T02:41:23Z

looks like from the benchmark result triton impl is slower than HF original one? 👀

It seems so. The memory usage is about the same, but the speed is a bit slower. Do you have any optimization or improvement methods?

add batch_norm op with test and benchmark

32507fb

Merge branch 'main' into develop

671e0bc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add batch_norm op with test and benchmark #559

add batch_norm op with test and benchmark #559

Uh oh!

yanghailong-git commented Feb 7, 2025

Uh oh!

yundai424 commented Feb 11, 2025

Uh oh!

yanghailong-git commented Feb 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add batch_norm op with test and benchmark #559

Are you sure you want to change the base?

add batch_norm op with test and benchmark #559

Uh oh!

Conversation

yanghailong-git commented Feb 7, 2025

Summary

Testing Done

Uh oh!

yundai424 commented Feb 11, 2025

Uh oh!

yanghailong-git commented Feb 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants