Skip to content

chore: release 0.8.8#691

Merged
Defilan merged 1 commit into
mainfrom
release-please--branches--main
Jun 17, 2026
Merged

chore: release 0.8.8#691
Defilan merged 1 commit into
mainfrom
release-please--branches--main

Conversation

@github-actions

@github-actions github-actions Bot commented Jun 14, 2026

Copy link
Copy Markdown
Contributor

🚀 Release ${version}

0.8.8 (2026-06-17)

Features

  • AMD/Vulkan runtime image selection (hardware.gpu.runtime) (#727) (1a4544f)
  • crd: make GPU resource name configurable to support AMD/Vulkan/Intel scheduling (#709) (c88becf)
  • gateway: active HTTP health checks on the ModelRouter BTP for fast backend ejection (#662) (#704) (ba99060)
  • gateway: event-driven route-level ejection of unhealthy backends (#662) (#706) (815f2bf)
  • gateway: gateway-scoped audit access log + fail-loud auditLog in Gateway mode (2c) (#703) (b874b5e)
  • gateway: header-only data-classification routing + fail-closed sensitive guard (2e-core) (#707) (0249665)
  • gateway: InferenceService Envoy AI Gateway exposure (MVP) (#692) (3b095dc)
  • gateway: ModelRouter dataPlane Gateway mode with cross-tier failover (2a) (#693) (2842634)
  • gateway: ModelRouter JWT authentication via SecurityPolicy (2d-core) (#695) (73a2ea9)
  • gateway: ModelRouter per-team model allowlists via SecurityPolicy authorization (2d.2) (#702) (94428b4)
  • gateway: ModelRouter token budgets and 429 enforcement (2b) (#694) (627e85a)
  • metal-agent: withdraw endpoint when runtime is unhealthy (#662) (#705) (5ed9395)
  • selfupdate: bound download size + GC old agent versions (#690) (5205a62)
  • webhook: ModelRouter validating webhook for apply-time honest-boundary rejection (#708) (13d9321)

Bug Fixes

  • cache: restore shared model cache as the default (perService becomes opt-in) (#732) (44ab7dc)
  • per-node model cache so GPU on a second node can schedule (#728) (#729) (79bccce)

Documentation

  • DGX Spark (GB10) on MicroK8s setup guide (#717) (bf7d7a7)
  • fix DGX Spark guide for ARM64 (GPU operator + GB10 image) (#718) (45a4237)
  • proposal for owned AMD/Vulkan runtime image and build pipeline (#726) (3a1a150)

This PR was generated with Release Please. See documentation.

@github-actions github-actions Bot requested a review from Defilan as a code owner June 14, 2026 06:24
@github-actions github-actions Bot force-pushed the release-please--branches--main branch 17 times, most recently from 7a8efa0 to 06fdde4 Compare June 17, 2026 06:10
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@github-actions github-actions Bot force-pushed the release-please--branches--main branch from 06fdde4 to 3efe237 Compare June 17, 2026 07:33
@codecov

codecov Bot commented Jun 17, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@Defilan Defilan merged commit b6add4a into main Jun 17, 2026
24 checks passed
@Defilan Defilan deleted the release-please--branches--main branch June 17, 2026 07:48
@github-actions

Copy link
Copy Markdown
Contributor Author

🤖 Created releases:

🌻

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant