Open
Labels
Performance (related to improving performance), enhancement (new feature or request)
Description
Is your feature request related to a problem? Please describe.
The vLLM benchmarking CLI supports ignoring EOS and synthesizing a given output sequence length (OSL) via `--random-output-len`. Can we use this feature to generate synthetic benchmarks with a fixed OSL, so that performance can be evaluated at specific sequence lengths?
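To make the idea concrete, here is a rough sketch of what a fixed-OSL request could look like. It assumes an OpenAI-style completions endpoint that accepts an `ignore_eos` field next to `max_tokens`. The helper `build_fixed_osl_request` and the model name `my-model` are hypothetical, used only for illustration, and are not part of any existing API.

```python
import json


def build_fixed_osl_request(prompt: str, output_len: int, model: str = "my-model") -> dict:
    """Build a completion request body intended to produce exactly `output_len` tokens.

    Hypothetical helper: `model` is a placeholder name, and `ignore_eos` is
    assumed to be honored by the serving endpoint.
    """
    if output_len <= 0:
        raise ValueError("output_len must be positive")
    return {
        "model": model,
        "prompt": prompt,
        "max_tokens": output_len,  # cap generation at the target OSL
        "ignore_eos": True,        # assumption: keep generating past EOS until the cap
        "temperature": 0.0,
    }


if __name__ == "__main__":
    # Print one request body per target OSL to sweep over sequence lengths.
    for osl in (128, 1024):
        print(json.dumps(build_fixed_osl_request("Hello", osl)))
```

With EOS ignored, every request runs to the `max_tokens` cap, so the output length is the same across prompts and runs at a given OSL are comparable.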
youngeunkwon0405