Seamless AI-as-a-Service across Kubernetes Clusters With Envoy AI Gate... Karol Szwaj, Kubermatic

CNCF
AI summary

This session demonstrates how to build a centralized AI-as-a-Service platform across multiple Kubernetes clusters using Envoy AI Gateway and kube-bind. It targets platform engineers and DevOps teams struggling with fragmented API keys, inconsistent model access, and shadow AI across their organization. The architecture enables remote teams to consume AI services from a central GPU-enabled hub without complex service mesh overhead.