0.21.0
github-actions released this 18 Dec 16:16
Features
- Add customViewState field to conversation api (#599)
- Cut off request/response body if its size exceeds the limit #612 (#613)
- Get the Project name from JWT token claims (#605)
- Implement Randomized Weighted Balancer #589 (#596)
- Implement fallback strategy in tier balancer #588 (#593)
- Return aggregated error code if max attempts limit is exceeded #587 (#597)
- Return retry-after header when the rate limit is exceeded #615 (#617)
- Add week/month settings for token rate limiting (#619)
- Include query params when rewritePath is enabled (#598)
Fixes
- Application/Model listing API returns wrong value for max_retry_attempts #616 (#618)
- Core sets parent_deployment to a wrong value #530 (#604)
- GfLogStore writes corrupted prompt log record #607 (#608)
- ORT fails with 'No locally installed toolchains match and toolchain download repositories have not been configured.' (#595)
- Prompt logs are not saved for RouteController #611 (#614)
- Rate limit is not updated in RouteController #600 (#601)
- RouteController doesn't mark upstream as failed on an unsuccessful HTTP status code #602 (#603)
- Execution path must contain deployment's name only (#610)