Add attention kernels optimized for arm's i8mm instruction. #942

Open
copybara-service[bot] wants to merge 1 commit into dev from test_938467018
@copybara-service

Add attention kernels optimized for arm's i8mm instruction.
They give about 8x higher throughput than the previous i8 implementation.

PiperOrigin-RevId: 938467018
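
For context, Arm's i8mm extension adds 8-bit integer matrix multiply-accumulate instructions such as SMMLA. The sketch below is not the code from this change. It is a scalar Python emulation of what a single signed SMMLA computes, and the names `smmla` and `wrap_int32` are hypothetical helpers introduced only for illustration. How the kernels in this PR tile the attention computation around such instructions is not shown.

```python
def wrap_int32(x):
    """Wrap an integer to the signed 32-bit range (two's complement)."""
    return ((x + 2**31) % 2**32) - 2**31


def smmla(acc, a, b):
    """Emulate one signed 8-bit matrix multiply-accumulate step.

    acc: 4 int32 values, a row-major 2x2 accumulator.
    a:   16 int8 values, a row-major 2x8 matrix.
    b:   16 int8 values, a row-major 2x8 matrix (used transposed).
    Returns acc + a @ b^T as a new list of 4 int32 values.
    """
    assert len(acc) == 4 and len(a) == 16 and len(b) == 16
    assert all(-128 <= v <= 127 for v in a + b), "inputs must fit in int8"
    out = list(acc)
    for i in range(2):          # row of a
        for j in range(2):      # row of b (column of b^T)
            dot = sum(a[8 * i + k] * b[8 * j + k] for k in range(8))
            out[2 * i + j] = wrap_int32(out[2 * i + j] + dot)
    return out


# Rows of a are all 1s and all 2s; rows of b are all 1s and all -1s.
result = smmla([0, 0, 0, 0], [1] * 8 + [2] * 8, [1] * 8 + [-1] * 8)
print(result)  # [8, -8, 16, -16]
```

Each call covers 2 x 2 x 8 = 32 multiply-accumulates, which is the amount of work one such instruction performs on a pair of 128-bit registers.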