Commit c6a959c

loislo authored and Google-ML-Automation committed
Adds jax.lax.scaled_dot for scaled dot products.
This change introduces a new `jax.lax.scaled_dot` function, which computes a dot product whose inputs can be float8 types. It produces a composite op that can be lowered to Triton, cuBLAS, or cuDNN, or rewritten as a regular dot.

Fallback: if scaled-dot is not enabled, the composite call is inlined as a sequence of ops: the float8 inputs and scales are converted to bfloat16, the scales are broadcast and multiplied elementwise with the corresponding operands, and the results are passed to `jax.lax.dot_general`.

The function includes input validation for shapes and dtypes, and supports both 2D and 3D (batched) dot products. Tests are added to cover various scenarios, including error conditions and jit compilation.

PiperOrigin-RevId: 821573195
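Reviewer note: the fallback path described in the commit message can be sketched roughly as below. This is a hypothetical reference, not the code added by this commit. The helper name `scaled_dot_fallback_sketch`, its argument order, and the scale layout (one scale per block of the contracting dimension, expanded with `jnp.repeat`) are assumptions made for illustration; the actual `jax.lax.scaled_dot` signature and scale shapes may differ.

```python
import jax
import jax.numpy as jnp


def scaled_dot_fallback_sketch(lhs, rhs, lhs_scale, rhs_scale):
  """Hypothetical 2D reference for the fallback; not the library's code.

  Assumed layout (not confirmed by the commit):
    lhs:       [M, K]            rhs:       [K, N]
    lhs_scale: [M, K // block]   rhs_scale: [K // block, N]
  """

  def dequantize(x, scale, axis):
    # Convert the float8 operand and its scales to bfloat16.
    x = x.astype(jnp.bfloat16)
    scale = scale.astype(jnp.bfloat16)
    # Expand each scale over its block along the contracting axis
    # (assumed interpretation of "the scales are broadcast").
    block = x.shape[axis] // scale.shape[axis]
    scale = jnp.repeat(scale, block, axis=axis)
    # Apply the scales elementwise.
    return x * scale

  lhs_bf16 = dequantize(lhs, lhs_scale, axis=1)
  rhs_bf16 = dequantize(rhs, rhs_scale, axis=0)
  # Contract lhs axis 1 with rhs axis 0; no batch dimensions in this sketch.
  return jax.lax.dot_general(
      lhs_bf16,
      rhs_bf16,
      dimension_numbers=(((1,), (0,)), ((), ())),
      preferred_element_type=jnp.float32,
  )


# Usage: two blocks of size 2 along K = 4.
lhs = jnp.ones((2, 4), dtype=jnp.float8_e4m3fn)
rhs = jnp.ones((4, 3), dtype=jnp.float8_e4m3fn)
lhs_scale = jnp.array([[2.0, 0.5], [1.0, 1.0]], dtype=jnp.float8_e4m3fn)
rhs_scale = jnp.ones((2, 3), dtype=jnp.float8_e4m3fn)
out = jax.jit(scaled_dot_fallback_sketch)(lhs, rhs, lhs_scale, rhs_scale)
print(out.shape, out.dtype)
```

The sketch covers only the 2D case; the batched 3D case mentioned in the commit message would add a batch dimension to `dimension_numbers`.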
1 parent f2562d4 commit c6a959c

5 files changed: +1227, -236 lines changed

jax/_src/lax/__init__.py

Lines changed: 1 addition & 0 deletions
@@ -313,3 +313,4 @@
 from jax._src.lax.ann import (
   approx_top_k_p as approx_top_k_p
 )
+from jax._src.lax.scaled_dot import scaled_dot as scaled_dot
