Kubernetes 1.34 made Dynamic Resource Allocation (DRA) generally available, replacing the blunt `nvidia.com/gpu: 1` device plugin model with structured, attribute-based GPU requests. DRA introduces four core objects: ResourceSlice (describes available hardware), DeviceClass (groups devices), ResourceClaimTemplate (per-pod GPU
Table of contents
DRA in three minutesThe demo workloadDeploying with DRA with precise GPU requirementsSharing the GPUWhat CAST AI addsFrom one GPU to seven MIG partitionsSort: