-
Notifications
You must be signed in to change notification settings - Fork 887
Pull requests: iree-org/iree
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[LinalgExt] Fix attention NaN for fully-masked rows
#24178
opened Apr 21, 2026 by
keshavvinayak01
Contributor
•
Draft
[TMTensor] Lower TMTensor::AttentionOp directly to OnlineAttentionOp
#24177
opened Apr 21, 2026 by
keshavvinayak01
Contributor
•
Draft
[DispatchCreation] Allow fusion of multi-result producers
#24169
opened Apr 20, 2026 by
keshavvinayak01
Contributor
Loading…
[DispatchCreation] Fuse a scalar reduction with parallel consumer when they share an input.
#24166
opened Apr 20, 2026 by
Abhishek-Varma
Contributor
Loading…
amdgpu: barrier command-buffer packets after inline waits
#24163
opened Apr 19, 2026 by
stellaraccident
Collaborator
•
Draft
[Codegen] Add transfer_{gather/scatter} unrolling and lowering to CPU backend
#24155
opened Apr 17, 2026 by
NoumanAmir657
Contributor
Loading…
[Tokenizer] Fix infinite loop when encountering a special token in partial segment mode
#24150
opened Apr 17, 2026 by
Finistere
Loading…
[Codegen] Refactor MMA attr filtering and root op selection into util function
#24144
opened Apr 16, 2026 by
RattataKing
Contributor
•
Draft
[Codegen][GPU] Fix f32 attention compilation failure when head_dim=128
#24138
opened Apr 16, 2026 by
keshavvinayak01
Contributor
Loading…
[Codegen] Move scatter DPS fill into workgroup forall
#24136
opened Apr 16, 2026 by
ziliangzl
Contributor
Loading…
[draft] [do not review] unrolling transfer_read transfer_write
#24134
opened Apr 15, 2026 by
amd-eochoalo
Contributor
•
Draft
Some fixes to work around a bug in cmake, and isa naming
#24131
opened Apr 15, 2026 by
AWoloszyn
Contributor
Loading…
[LinalgExt] Add TilingInterface support for TopkV2Op
#24129
opened Apr 15, 2026 by
bangtianliu
Contributor
Loading…
[Codegen][LinalgExt] Port attention rewrites to OnlineAttentionOp
#24123
opened Apr 15, 2026 by
keshavvinayak01
Contributor
•
Draft
compiler/plugins/input/TOSA: fix: TOSA arith lowering must handle apply scale introduced by linalg lowering
#24121
opened Apr 15, 2026 by
Manewing
Contributor
Loading…
[Codegen] Enable DMA by default for F16/BF16 Gemm on gfx950
#24117
opened Apr 14, 2026 by
Yu-Zhewen
Contributor
Loading…
[runtime/python] Remove forced HOST_VISIBLE from AllocateBufferCopy
#24106
opened Apr 14, 2026 by
jtuyls
Contributor
Loading…
[Codegen] Add VectorDistribute constraint generation scaffolding for matmul and conv
#24093
opened Apr 13, 2026 by
RattataKing
Contributor
Loading…
[Codegen][Heuristics] Update seeds to be closer to triton base case (preshuffling prs 3/3)
#24089
opened Apr 13, 2026 by
Muzammiluddin-Syed-ECE
Contributor
•
Draft
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.