Living tracker for all FlyDSL (ROCm/FlyDSL) issues filed and PRs opened by @jhinpan. Last updated 2026-06-17.
Tally: 11 issues, 22 PRs (= 18 distinct PR work items; 4 PRs are closed->reopened duplicates, see β»οΈ).
Recent tracker corrections since 2026-06-11: PR #639 and PR #675 are now merged; PR #670 is closed unmerged because merged PR #683 absorbed the relevant flash-attn work; PR #637 is closed unmerged; issue #614 is closed.
Live refresh 2026-06-17: PR #685 is no longer a draft β now ready for review (retitled to batch-aware dense seq_len routing β DUALWAVE_SWP vs generic; mergeable, review required, no CI checks reported yet). No new jhinpan-authored issues/PRs since the last update; all other PR/issue states below are unchanged and re-verified against GitHub.
π’ Open PRs / review-CI state
β
Merged
β Closed unmerged / inactive PRs
π Issues β status
| Issue |
State |
Addressed by |
| #655 run_benchmark mislabels layernorm bandwidth |
closed |
#654 β
|
| #653 device printf invisible in Jupyter / piped stdout |
open |
β (no PR yet) |
| #642 moe_blockscale e2e harness bug |
closed |
#643 β
|
| #627 softmax fast path dead-coded off |
closed |
#650 β
|
| #614 rmsnorm large-M small-N crash |
closed |
#639 β
|
| #612 Discussion: make autotune usable |
open |
β (discussion) |
| #587 ATT source-location granularity |
closed |
upstream #586 β
; jhinpan #593 β closed unmerged |
| #585 const-folded fp8 cast rounding |
closed |
β (working-as-intended) |
| #583 layout-algebra op verification |
open |
#648 π’ draft |
| #574 Complete BasisAttr support |
open |
#638 π‘ open; conflicts with main (CI stale), needs rebase |
| #573 Onboarding Jupyter notebook |
open |
#635 β
(1/n merged), #665 β
(2/n merged); open for remaining notebooks |
β»οΈ Superseded PR pairs (closed -> reopened, same work)
The old head repo was deleted during a fork rename, so GitHub lost the head association and these were recreated on the same branch:
- #615 -> #639 β
merged (issue #614 closed)
- #605 -> #638 π‘ open; conflicts with
main (issue #574 open)
- #584 -> #635 β
merged (issue #573 still open for remaining notebooks)
- #565 -> #637 β closed unmerged after maintainer said another PR already fixed it (issue #453 still open)
π Follow-ups to decide
Living tracker for all FlyDSL (ROCm/FlyDSL) issues filed and PRs opened by @jhinpan. Last updated 2026-06-17.
Tally: 11 issues, 22 PRs (= 18 distinct PR work items; 4 PRs are closed->reopened duplicates, see β»οΈ).
Recent tracker corrections since 2026-06-11: PR #639 and PR #675 are now merged; PR #670 is closed unmerged because merged PR #683 absorbed the relevant flash-attn work; PR #637 is closed unmerged; issue #614 is closed.
Live refresh 2026-06-17: PR #685 is no longer a draft β now ready for review (retitled to batch-aware dense
seq_lenrouting β DUALWAVE_SWP vs generic; mergeable, review required, no CI checks reported yet). No new jhinpan-authored issues/PRs since the last update; all other PR/issue states below are unchanged and re-verified against GitHub.π’ Open PRs / review-CI state
seq_lenrouting (DUALWAVE_SWP vs generic) (ready for review) β PR #685 (no issue) Β· un-drafted 2026-06-17; mergeable, review required (branch-protection blocked), no CI checks reported yetBasisAttrsupport inIntTupleBuilderβ PR #638 β issue #574 Β· GitHub marks it conflicting / dirty againstmain; last CI run is stale (mixed pass/fail across reruns, untouched since 2026-06-11) β needs a rebase before merge review can moveβ Merged
known_block_sizeon large-M small-N path β PR #639 β fixed issue #614 Β· merged 2026-06-16; #614 closed 2026-06-16/ci-dashboardβ PR #675 (no issue) Β· merged 2026-06-16β Closed unmerged / inactive PRs
num_kv_splitsplumbing and the gfx950 O-store work; CI was green before close, but the standalone branch is now conflicting / dirtyπ Issues β status
main(CI stale), needs rebaseβ»οΈ Superseded PR pairs (closed -> reopened, same work)
The old head repo was deleted during a fork rename, so GitHub lost the head association and these were recreated on the same branch:
main(issue #574 open)π Follow-ups to decide
#585 (const-fold fp8 cast)β resolved: closed working-as-intended (apply fp8/bf16 rounding on runtime values, not a compile-time constant round-trip)