Evolving KServe: The Unified Model Inference Platform for Both Predictive and... F. Spolti & J. Lee

Name: Evolving KServe: The Unified Model Inference Platform for Both Predictive and... F. Spolti & J. Lee
Uploaded: 2026-04-30T17:13:10.000Z
Channel: CNCF

Kserve Model serving Kubernetes Machine learning Mlops Llm inference Generative ai Cncf Envoy ai gateway Cloud native Ml infrastructure

CNCF April 30, 2026

AI summary

This CNCF talk explores KServe's evolution from a predictive AI serving platform to a unified solution for generative AI workloads. The session covers production challenges for LLMs including inference efficiency, distributed execution, and cost optimization, while introducing new features like the llm-d CRD for LLM serving and disaggregated inference architectures. Ideal for ML platform engineers and cloud architects building model serving infrastructure on Kubernetes.

Beyond VLLM: Distributed LLM Inferencing With Llm-d on Kubernetes - Ravindra Patil, Red Hat

What's new for AI on GKE: Training, serving, and agents

Opening Remarks - Erica Hughberg & Rohit Agrawal

Keynote: From Inference to Agents: Where Open Source AI Is Headed - Panel (ASL)

Orchestrating ML/AI workloads with TPUs on GKE

Rab Ne Bana Di Jodi - Why Platforms & AI Need Each Other - Ram Iyengar, Cloud Foundry Foundation

CNCF TOC Public meeting 7 July 2026

KubeEdge DeepDive: Extending Kubernetes To the Edge With Real-World Industry Use Case - Ronak Raj

Longhorn: What's New & What's Next for Cloud Native Persistent Storage - Divya Mohan, SUSE

Sponsored Demo: Cloud Native AI: Model Management with Harbor & Velero - Dhruv Tyagi, Broadcom

Closing | Kate Stanley and Paolo Patierno, IBM

Keynote | Kate Stanley and Paolo Patierno, IBM