Skip to content

v0.0.3

Latest

Choose a tag to compare

@CatherineSue CatherineSue released this 10 Dec 01:46
· 3 commits to main since this release
d0780ee

What's Changed

  • [Docs] Update docs for scenario and fix markdown lint errors by @CatherineSue in #69
  • [Docs] Add traffic-scenario to github-pages by @CatherineSue in #70
  • [FIX]Make storage_bucket is_eager by @YouNeedCryDear in #73
  • [Misc] Remove tolerance in _check_discrepancy by @CatherineSue in #77
  • [Report] Add option for milliseconds by @TJ5 in #74
  • [Docs] Remove local.txt from repository by @Juno13340 in #80
  • [Docs] Add PR template and remove duplicate PR section from contributing guide by @Juno13340 in #79
  • [CI] add GHCR publish to release by @Juno13340 in #76
  • [Feature] add count-based warmup/cooldown ratio by @dbdydgur2244 in #82
  • [CI] Fix CI workflow failure by @CatherineSue in #84
  • [CI] Remove --no-reinstall as it is not supported in uv by @CatherineSue in #85
  • [UI] Fix Minimal Dashboard by @TJ5 in #89
  • [Metrics] Correct TTFT calculation for reasoning models by @Juno13340 in #87
  • [Core] Extend timeout of oci genai client and improve error handling in distributed workers by @key4ng in #88
  • [Frontend] Rename --time-unit to --metrics-time-unit by @TJ5 in #94
  • [Misc] Update PR template by @CatherineSue in #97
  • [Misc] Update PR template label sections by @CatherineSue in #98
  • [Feature] Add LlamaCPP as server engine option by @rgerganov in #93
  • [Bugfix] Fix OpenAI authentication header formatting by @WillemMomma in #99
  • [MISC] Support PR auto labeler by @key4ng in #106
  • [misc] Add issue template by @key4ng in #107
  • improve naming and format in ci by @key4ng in #111
  • [Docs] Update documentation to be more readable by @TJ5 in #96
  • [Docs] Fix github pages yml by @TJ5 in #117
  • Downsize labels to 10+ to 6 by @CatherineSue in #118
  • Add --log-dir option and improve ENABLE_UI support by @fuhengwu2021 in #115
  • [misc] Update .gitignore and remove local files after tests by @CatherineSue in #119
  • fix(openai): strip backend-unsupported params (e.g. ignore_eos) before send by @OpheliaLjh in #116
  • Revert "fix(openai): strip backend-unsupported params (e.g. ignore_eos) before send" by @CatherineSue in #123
  • [Docs] Small updates to Development Guide Section by @TJ5 in #121
  • [feature] Add TogetherAI backend support by @CatherineSue in #129
  • Fix/OpenAI ignore eos, improved code performance by @OpheliaLjh in #124
  • [Bug] Filter TPOT/inference speed by output latency by @ankrovv in #122
  • [MISC] Improve token counting with usage data in OCI GenAI user by @paxiaatucsdedu in #126
  • [release] Update version to 0.0.3 by @CatherineSue in #133

New Contributors

Full Changelog: v0.0.2...v0.0.3

What's Changed

New Contributors

Full Changelog: v0.0.2...v0.0.3