v0.10.2
What's Changed
Rate Limiting Support
- Add rate limiting error handling for Bedrock and OpenAI clients
- Introduce `isRateLimited` function to identify rate limiting errors based on specific error codes and HTTP status
- Add rate limiting middleware with cluster support for coordinated throttling
- Enhanced `Complete` and `Stream` methods to wrap rate-limited errors with consistent messaging
- Added `model.ErrRateLimited` for better error management
Other Improvements
- Documentation updates for runtime
- Various internal improvements
Full Changelog: v0.10.1...v0.10.2