0.21.0

github-actions released this 18 Dec 16:16
· 8 commits to development since this release

Features

  • Add customViewState field to conversation api (#599)
  • Cut off request/response body if its size exceeds the limit #612 (#613)
  • Get the Project name from JWT token claims (#605)
  • Implement Randomized Weighted Balancer #589 (#596)
  • Implement fallback strategy in tier balancer #588 (#593)
  • Return aggregated error code if max attempts limit is exceeded #587 (#597)
  • Return Retry-After header if the rate limit is exceeded #615 (#617)
  • add week/month settings for token rate limiting (#619)
  • include query params when rewritePath is enabled (#598)
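
For readers unfamiliar with the idea behind a randomized weighted balancer (#596), the following is a minimal sketch of weighted random selection in general, not the implementation shipped in this release. The function name `pick_weighted`, the `(name, weight)` tuple layout, and the rule of skipping non-positive weights are all assumptions made for illustration.

```python
import random
from typing import Optional, Sequence, Tuple


def pick_weighted(
    upstreams: Sequence[Tuple[str, int]],
    rng: Optional[random.Random] = None,
) -> str:
    """Pick one upstream name with probability proportional to its weight.

    Hypothetical helper: the names and the data layout are illustrative only.
    """
    rng = rng or random.Random()
    # Assumption: upstreams with a non-positive weight never receive traffic.
    candidates = [(name, weight) for name, weight in upstreams if weight > 0]
    if not candidates:
        raise ValueError("no upstream with a positive weight")

    total = sum(weight for _, weight in candidates)
    threshold = rng.random() * total  # uniform in [0, total)

    # Walk the cumulative weights until the threshold falls inside a bucket.
    cumulative = 0
    for name, weight in candidates:
        cumulative += weight
        if threshold < cumulative:
            return name
    return candidates[-1][0]  # guard against floating-point edge cases
```

With weights 3 and 1, the first upstream should receive roughly three quarters of the picks over many calls. Passing a seeded `random.Random` makes the selection reproducible in tests.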

Fixes

  • Application/Model listing API returns wrong value for max_retry_attempts #616 (#618)
  • Core sets parent_deployment to a wrong value #530 (#604)
  • GfLogStore writes corrupted prompt log record #607 (#608)
  • ORT fails with 'No locally installed toolchains match and toolchain download repositories have not been configured.' (#595)
  • Prompt logs are not saved for RouteController #611 (#614)
  • Rate limit is not updated in RouteController #600 (#601)
  • RouteController doesn't mark upstream as failed when it returns an unsuccessful HTTP status code #602 (#603)
  • execution path must contain deployment's name only (#610)

Other

  • Improve logging of LockService (#606)
  • change layout of console appender (#609)
  • update README.md (#620)