What's Changed
- [Docs] Update docs for scenario and fix markdown lint errors by @CatherineSue in #69
- [Docs] Add traffic-scenario to github-pages by @CatherineSue in #70
- [FIX] Make storage_bucket is_eager by @YouNeedCryDear in #73
- [Misc] Remove tolerance in `_check_discrepancy` by @CatherineSue in #77
- [Report] Add option for milliseconds by @TJ5 in #74
- [Docs] Remove local.txt from repository by @Juno13340 in #80
- [Docs] Add PR template and remove duplicate PR section from contributing guide by @Juno13340 in #79
- [CI] add GHCR publish to release by @Juno13340 in #76
- [Feature] add count-based warmup/cooldown ratio by @dbdydgur2244 in #82
- [CI] Fix CI workflow failure by @CatherineSue in #84
- [CI] Remove `--no-reinstall` as it is not supported in uv by @CatherineSue in #85
- [UI] Fix Minimal Dashboard by @TJ5 in #89
- [Metrics] Correct TTFT calculation for reasoning models by @Juno13340 in #87
- [Core] Extend timeout of oci genai client and improve error handling in distributed workers by @key4ng in #88
- [Frontend] Rename --time-unit to --metrics-time-unit by @TJ5 in #94
- [Misc] Update PR template by @CatherineSue in #97
- [Misc] Update PR template label sections by @CatherineSue in #98
- [Feature] Add LlamaCPP as server engine option by @rgerganov in #93
- [Bugfix] Fix OpenAI authentication header formatting by @WillemMomma in #99
- [MISC] Support PR auto labeler by @key4ng in #106
- [misc] Add issue template by @key4ng in #107
- improve naming and format in ci by @key4ng in #111
- [Docs] Update documentation to be more readable by @TJ5 in #96
- [Docs] Fix github pages yml by @TJ5 in #117
- Downsize labels from 10+ to 6 by @CatherineSue in #118
- Add --log-dir option and improve ENABLE_UI support by @fuhengwu2021 in #115
- [misc] Update .gitignore and remove local files after tests by @CatherineSue in #119
- fix(openai): strip backend-unsupported params (e.g. ignore_eos) before send by @OpheliaLjh in #116
- Revert "fix(openai): strip backend-unsupported params (e.g. ignore_eos) before send" by @CatherineSue in #123
- [Docs] Small updates to Development Guide Section by @TJ5 in #121
- [feature] Add TogetherAI backend support by @CatherineSue in #129
- Fix/OpenAI ignore eos, improved code performance by @OpheliaLjh in #124
- [Bug] Filter TPOT/inference speed by output latency by @ankrovv in #122
- [MISC] Improve token counting with usage data in OCI GenAI user by @paxiaatucsdedu in #126
- [release] Update version to 0.0.3 by @CatherineSue in #133
- Update release.yml by @CatherineSue in #134
- Update release.yml to fix workflow errors by @CatherineSue in #135
New Contributors
- @Juno13340 made their first contribution in #80
- @dbdydgur2244 made their first contribution in #82
- @rgerganov made their first contribution in #93
- @WillemMomma made their first contribution in #99
- @fuhengwu2021 made their first contribution in #115
- @OpheliaLjh made their first contribution in #116
- @ankrovv made their first contribution in #122
- @paxiaatucsdedu made their first contribution in #126
Full Changelog: v0.0.2...v0.0.3