Allow AP_SAT, AP_RND for 'maximum' precision in HLS Config #1422
morunner wants to merge 2 commits into fastmachinelearning:main
Conversation
Generally, setting the maximum precision is not something we recommend using much, I don't think. It's better to either quantize the values in the training, or, if doing PTQ, explicitly set certain widths to more reasonable values in the configuration. The maximum width is not granular enough for that. One can see what width one gets without the maximum setting and modify the configuration until it is satisfactory. Also, rounding and saturation for the accumulator often make the accumulation much slower. It is better to keep it wider and, if needed, use saturation and rounding in the activation step right after it, where its cost is insignificant. This more fine-grained way of doing things is recommended instead of using the maximum precision.
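A minimal sketch of the fine-grained approach described above, assuming hls4ml's name-granularity config layout; the model and layer names here are placeholders, not taken from this PR:

```python
import hls4ml
from tensorflow.keras import layers, models

# Tiny stand-in model; the layer names below are referenced in the config.
model = models.Sequential([
    layers.Dense(16, name='dense1', input_shape=(8,)),
    layers.Activation('relu', name='relu1'),
])

# Name-granularity config exposes per-layer precision fields.
config = hls4ml.utils.config_from_keras_model(model, granularity='name')

# Keep the accumulator wide so the running sum cannot overflow ...
config['LayerName']['dense1']['Precision']['accum'] = 'ap_fixed<24,12>'

# ... and round/saturate on the activation output instead, where it is cheap.
config['LayerName']['relu1']['Precision']['result'] = 'ap_fixed<16,8,AP_RND,AP_SAT>'
```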
@morunner do you have some results (in terms of resource usage) before and after this change? @jmitrevs and @calad0i mentioned in our last dev meeting that AP_SAT may not be the most resource-friendly option for accumulators (due to the underlying implementation of the saturation operation). Most times, the recommended way is to simply increase the bit width of the variable. If that's the case from your results, we should keep this PR open as a reference (in case someone is interested in using similar functions), but not merge it.
Sorry for the late reply. I wanted to finalize the model architecture first before optimizing this. I have re-run synthesis with the Vitis backend for two copies of the same model: one with the AP_RND and AP_SAT modes set for the default, maximum, and dense-layer weight and bias precisions, and one with the default modes (AP_TRN, AP_WRAP) for those parameters. The model with rounding and saturation indeed consumes more LUTs (25% instead of 20%, on the Alveo U55C) and has slightly higher latency. But using truncation and wraparound of course comes with a decrease in accuracy. Hence, I agree with @bo3z not to merge this PR. For the time being, using rounding and saturation is still a viable approach for us, since we achieve satisfactory resource utilization and latency and meet timing closure with the current setup. But it is good to know that there is still some performance to be gained from the model by investing more time in tuning precisions.
You can always increase the bitwidth to match or get better accuracy, since the accumulator is very cheap in comparison to the other operations happening there. In most cases, one should be able to see no real performance difference between the Python model and the HLS model by allocating sufficient bitwidths in the accumulator to avoid overflows/underflows. Unless there is a very specific hardware restriction (usually not), I would suggest against the use of maximum precision.
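To make "sufficient bitwidth" concrete, a small sketch of the standard worst-case bound for a dot product of `n_in` terms; this is the textbook conservative bound, not code from this PR, and it glosses over sign-bit bookkeeping:

```python
import math

def safe_accum_bits(in_int: int, in_frac: int, w_int: int, w_frac: int, n_in: int):
    """Overflow-free accumulator size for summing n_in input*weight products.

    Each product needs the combined integer and fractional bits of its
    operands; summing n_in of them can grow the integer part by another
    ceil(log2(n_in)) bits.
    """
    int_bits = in_int + w_int + math.ceil(math.log2(n_in))
    frac_bits = in_frac + w_frac
    return int_bits + frac_bits, int_bits  # (total width, integer width)

# e.g. ap_fixed<16,8> inputs times ap_fixed<16,8> weights over 64 inputs:
print(safe_accum_bits(8, 8, 8, 8, 64))  # (38, 22), i.e. ap_fixed<38,22>
```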
Description
During the development of a GravNet model for hls4ml, I found that setting rounding and saturation for the maximum allowed precision can be beneficial for increasing accuracy. Below you can see histograms and the mean difference with one standard deviation when the maximum precision is `ap_fixed<16,8,AP_RND,AP_SAT,0>` and when rounding and saturation are not enabled (`ap_fixed<16,8>`).

In the current upstream main branch of hls4ml, rounding and saturation modes set through the 'maximum' field in the HLS config are ignored during precision inference (see e.g. here). I thus propose a single function `_apply_max_precision_constraints`, to be applied where necessary in the `infer_precision.py` module, which adheres to a small set of rules, e.g. leaving rounding and saturation modes untouched where they already differ from the defaults (meaning the user likely set them explicitly).
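For context, this is the kind of configuration the change targets; a sketch assuming a model-level Precision dict with the 'maximum' key described above (treat the exact layout as an assumption):

```python
config = {
    'Model': {
        'Precision': {
            'default': 'ap_fixed<16,6>',
            # The cap that precision inference respects; with this PR the
            # AP_RND/AP_SAT modes given here would no longer be ignored.
            'maximum': 'ap_fixed<16,8,AP_RND,AP_SAT>',
        },
        'ReuseFactor': 1,
    },
}
```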
We can of course discuss what the preferred ruleset should be here.
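A minimal sketch of what such a helper might look like, assuming hls4ml's `FixedPrecisionType` with `width`/`integer`/`rounding_mode`/`saturation_mode` attributes; this is illustrative only, not the PR's actual implementation:

```python
from hls4ml.model.types import FixedPrecisionType, RoundingMode, SaturationMode

def _apply_max_precision_constraints(inferred, maximum):
    """Clip an inferred precision to the configured maximum (hypothetical ruleset).

    Widths are capped at the maximum, and the maximum's rounding/saturation
    modes are adopted only where the inferred type still carries the defaults
    (AP_TRN/AP_WRAP), since non-default modes were likely set by the user.
    """
    width = min(inferred.width, maximum.width)
    integer = min(inferred.integer, maximum.integer)

    rounding = inferred.rounding_mode
    if rounding == RoundingMode.TRN:
        rounding = maximum.rounding_mode

    saturation = inferred.saturation_mode
    if saturation == SaturationMode.WRAP:
        saturation = maximum.saturation_mode

    return FixedPrecisionType(width=width, integer=integer, signed=inferred.signed,
                              rounding_mode=rounding, saturation_mode=saturation)
```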
No additional dependencies are required for this change.
Type of change
Tests
Pytest
Added a new pytest module, `test_max_precision.py`, which tests the newly added `_apply_max_precision_constraints` function both in isolation and within the `_infer_precision` function, using mocks.
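A hedged sketch of what such an isolated test could look like, written against the hypothetical helper sketched above rather than the PR's actual test code:

```python
from hls4ml.model.types import FixedPrecisionType, RoundingMode, SaturationMode
# from hls4ml.model.optimizer.passes.infer_precision import \
#     _apply_max_precision_constraints  # hypothetical location

def test_max_precision_caps_width_and_adopts_modes():
    # The inferred type still carries the default modes, so the maximum's
    # AP_RND/AP_SAT should be adopted and the widths capped.
    inferred = FixedPrecisionType(width=24, integer=12)
    maximum = FixedPrecisionType(width=16, integer=8,
                                 rounding_mode=RoundingMode.RND,
                                 saturation_mode=SaturationMode.SAT)

    clipped = _apply_max_precision_constraints(inferred, maximum)

    assert (clipped.width, clipped.integer) == (16, 8)
    assert clipped.rounding_mode == RoundingMode.RND
    assert clipped.saturation_mode == SaturationMode.SAT
```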
Conversion to HLS
Ran the full Jupyter notebook for GravNet Keras conversion to HLS at hls4ml-gravnet (Link) to generate the plots listed below, with the proposed change enabled and disabled. The profiling section was run with this fix applied. We currently do not provide the fully trained model open-source, since it is not finalized. @bo3z please contact me directly, also regarding the dataset, if needed.
Checklist
Ran `pre-commit` on the files I edited or added.

GravNet plots showing accuracies and bias across layers
With rounding and saturation enabled for the maximum precision.