Commit 2b7d4a1
Basic structure of sgl_kernel.fp8_scaled_mm
* Added support for sgl_kernel.fp8_scaled_mm op
* Input in dtype fp8 e4m3 or e5m2
* Output in dtype fp32, bf16, fp8 e4m3 or fp8 e5m2
Signed-off-by: Aditya Chatterjee <[email protected]>
1 parent 1abaed2 commit 2b7d4a1
File tree
7 files changed
+814 -3 lines changed
- include
- src
- sycl
- tests