Meetup · February 28, 2026 · Jaipur, India

Stop the GPU Madness! Making LLM Inference Actually Efficient on K8s

AWS User Group Jaipur

LLM · Kubernetes · GPU · Inference · AWS

Abstract

A meetup talk for AWS User Group Jaipur (main auditorium, RIC Jaipur) on running LLM inference workloads on Kubernetes without burning through GPU budgets.

Resources

Event page on awsugjaipur.in

More Talks

Conference
Help! My LLM is a Resource Hog: How We Tamed Inference with Kubernetes and Open Source Muscle
KubeCon + CloudNativeCon North America 2025 · Atlanta, USA
Meetup
Accelerating CI Pipelines: Rapid Kubernetes Testing with vCluster
Cloud Native & AI Day — Beyond ChatBots · Bengaluru, India
Meetup
Multitenancy in the Kubernetes Era
Cloud Native Taiwan User Group Meetup · Taipei, Taiwan
Meetup
Multitenancy in the Kubernetes Era
Incident Management & Cloud Native Meetup · Bengaluru, India
