Exponential bucketing tweaks #224

madamczyk-intel · 2025-06-13T13:05:32Z

always return bucketed value regardless of max_number of blocks
change defaults for sequence limits
bucketize context blocks
remove 'fill' argument from warmup_range_with_limit

- always return bucketed value regardless of max_number of blocks - change defaults for sequence limits - bucketize context blocks - remove 'fill' argument from warmup_range_with_limit

…keting_tweaks

Exponential bucketing tweaks

0e8d482

- always return bucketed value regardless of max_number of blocks - change defaults for sequence limits - bucketize context blocks - remove 'fill' argument from warmup_range_with_limit

madamczyk-intel requested review from afierka-intel, jikunshang, kzawora-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, tzielinski-habana and xuechendi as code owners June 13, 2025 13:05

madamczyk-intel mentioned this pull request Jun 16, 2025

Exp bucketing tweaks HabanaAI/vllm-fork#1425

Closed

madamczyk-intel added 4 commits June 17, 2025 11:52

Merge remote-tracking branch 'origin/main' into dev/madamczyk/exp_buc…

527b0e5

…keting_tweaks

Fix nasty typo

50b54b0

Flip decode block buckets

1612612

Fix max num buckets

101104b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Exponential bucketing tweaks #224

Exponential bucketing tweaks #224

Uh oh!

madamczyk-intel commented Jun 13, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Exponential bucketing tweaks #224

Are you sure you want to change the base?

Exponential bucketing tweaks #224

Uh oh!

Conversation

madamczyk-intel commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

madamczyk-intel commented Jun 13, 2025 •

edited

Loading