GPU Scheduling in Kubernetes: Start Before the Scheduler
Most teams think GPU scheduling starts with the scheduler. It starts with demand modeling. By the time Volcano, Kueue, or KEDA enters the conversation, the expensive mistake has usually already been made. The cluster was provisioned against a theoretical peak that rarely materializes. The demand curve was never drawn. The concurrency profile was assumed rather…
