Conversation


@fzi-hielscher fzi-hielscher commented Nov 9, 2025

Spill hw.array values onto the stack at their definition sites instead of their use sites to reduce the number of redundant alloca + store ops created. This should mitigate the negative effects of #9172, fixing #9211.

This PR adds the HWToLLVMArraySpillCache helper to the HWToLLVM conversion patterns. Certain operations on !hw.array-typed values need to "spill" their input arrays (i.e., copy them into a buffer, usually stack allocated) in order to access dynamically indexed elements. The cache is used to avoid spilling the array for each such user separately. Instead, operations that create an array value that needs spilling allocate a buffer that can be shared by all of the value's users. HW dialect operations that inherently allocate a buffer for their output arrays (AggregateConstantOp, ArrayInjectOp, ArraySliceOp) use the cache to associate their output value with a pointer to that buffer, so the value does not need to be spilled again. Producers of HW array values outside the HW dialect (e.g., arc.state_load) need to be handled before conversion. This is done by invoking spillNonHWOps.
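
Conceptually, the cache memoizes one stack buffer per spilled array value. Below is a minimal sketch of the idea; the struct, member, and method names are illustrative only and not the actual HWToLLVMArraySpillCache API, and the real helper places the buffer at the value's definition site rather than at an arbitrary insertion point.

```cpp
#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
#include "mlir/Transforms/DialectConversion.h"
#include "llvm/ADT/DenseMap.h"

using namespace mlir;

// Illustrative sketch: map each converted !hw.array value to the stack buffer
// holding its contents, so every dynamically indexing user reuses the same
// pointer instead of emitting its own alloca + store.
struct ArraySpillCacheSketch {
  llvm::DenseMap<Value, Value> spilledPtrs;

  Value getOrCreateSpill(ConversionPatternRewriter &rewriter, Location loc,
                         Value convertedArray) {
    // Reuse the existing buffer if this value has already been spilled.
    if (auto it = spilledPtrs.find(convertedArray); it != spilledPtrs.end())
      return it->second;
    // Otherwise spill once: allocate a stack slot of the array type and store
    // the value into it (ideally right after the value's definition).
    auto oneC = LLVM::ConstantOp::create(
        rewriter, loc, IntegerType::get(rewriter.getContext(), 32),
        rewriter.getI32IntegerAttr(1));
    Value ptr = LLVM::AllocaOp::create(
        rewriter, loc, LLVM::LLVMPointerType::get(rewriter.getContext()),
        convertedArray.getType(), oneC, /*alignment=*/4);
    LLVM::StoreOp::create(rewriter, loc, convertedArray, ptr);
    spilledPtrs[convertedArray] = ptr;
    return ptr;
  }
};
```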

Use of the cache is optional and can be disabled by passing spill-arrays-early=false to convert-hw-to-llvm or by passing a nullopt value for the cache argument of populateHWToLLVMConversionPatterns. In fact, conversion passes that do not disable allowPatternRollback must not use the cache, since a buffer, once added to the cache, cannot be removed again if a pattern is rolled back.
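
For a downstream conversion pipeline, the gating might look roughly like the sketch below. This assumes MLIR's ConversionConfig::allowPatternRollback flag; whether the cache is default-constructible, its namespace, and how exactly it is threaded into populateHWToLLVMConversionPatterns are assumptions made here for illustration.

```cpp
// Sketch only: create a spill cache only when pattern rollback is disabled,
// since a cached buffer cannot be undone if a pattern is rolled back.
mlir::ConversionConfig config;
config.allowPatternRollback = false; // prerequisite for using the cache

std::optional<circt::HWToLLVMArraySpillCache> spillCache; // namespace assumed
if (!config.allowPatternRollback)
  spillCache.emplace();

// Hand `spillCache` (or std::nullopt) to populateHWToLLVMConversionPatterns
// as its cache argument when collecting the conversion patterns.
```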

@fzi-hielscher fzi-hielscher force-pushed the hwtollvm-early-array-spill branch from 3c40229 to 1e486ed on November 12, 2025 23:56
@fzi-hielscher fzi-hielscher changed the title from [WIP][HWToLLVM][ArcToLLVM] Spill array values early to [HWToLLVM][ArcToLLVM] Spill array values early on Nov 14, 2025
@fzi-hielscher fzi-hielscher marked this pull request as ready for review November 14, 2025 11:40
@fzi-hielscher
Contributor Author

@SimonEbner, could you check if this actually fixes the compile time regression in your example?

@SimonEbner
Contributor

Thank you. I tested 120dd53 vs your head:
120dd53: 12.59s, HEAD: 1.86s

@fzi-hielscher
Contributor Author

Thanks for checking.
I suspect the remaining overhead is due to arc.state_load ops now being spilled instead of accessing the heap buffer directly. We could probably implement a heuristic that allows array accesses without spilling when we can be certain that the underlying memory remains unchanged between the arc.state_load and the hw.array_get op. But at the moment I'd rather not add that complexity if it only serves to reduce the load on LLVM's optimizations.

@SimonEbner
Contributor

Sure, I think this patch is great, more than good enough. Thanks for looking into this!

Member

@maerhart maerhart left a comment

That's great! Thanks!

Comment on lines 344 to 352
auto oneC = LLVM::ConstantOp::create(
rewriter, op->getLoc(), IntegerType::get(rewriter.getContext(), 32),
rewriter.getI32IntegerAttr(1));
arrPtr = LLVM::AllocaOp::create(
rewriter, op->getLoc(),
LLVM::LLVMPointerType::get(rewriter.getContext()),
adaptor.getInput().getType(), oneC,
/*alignment=*/4);
LLVM::StoreOp::create(rewriter, op->getLoc(), adaptor.getInput(), arrPtr);
Member

Nit: as this occurs a few times, we could factor it out into a helper function.
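
For instance, a helper along these lines could work (the name and exact signature here are just illustrative; it reuses the calls from the snippet above):

```cpp
// Sketch of the suggested helper: spill an already-converted array value into
// a fresh stack slot and return the pointer to it.
static Value spillToStack(ConversionPatternRewriter &rewriter, Location loc,
                          Value convertedArray) {
  auto oneC = LLVM::ConstantOp::create(
      rewriter, loc, IntegerType::get(rewriter.getContext(), 32),
      rewriter.getI32IntegerAttr(1));
  Value arrPtr = LLVM::AllocaOp::create(
      rewriter, loc, LLVM::LLVMPointerType::get(rewriter.getContext()),
      convertedArray.getType(), oneC,
      /*alignment=*/4);
  LLVM::StoreOp::create(rewriter, loc, convertedArray, arrPtr);
  return arrPtr;
}
```

Each call site would then reduce to something like `arrPtr = spillToStack(rewriter, op->getLoc(), adaptor.getInput());`.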

@fzi-hielscher fzi-hielscher force-pushed the hwtollvm-early-array-spill branch from 89a8174 to dceae1b on November 20, 2025 23:19
@fzi-hielscher fzi-hielscher merged commit d33142e into llvm:main Nov 21, 2025
7 checks passed