Kubernetes 1.34 made Dynamic Resource Allocation (DRA) generally available, replacing the blunt `nvidia.com/gpu: 1` device plugin model with structured, attribute-based GPU requests. DRA introduces four core objects: ResourceSlice (describes available hardware), DeviceClass (groups devices), ResourceClaimTemplate (per-pod GPU

10m read timeFrom cast.ai
Post cover image
Table of contents
DRA in three minutesThe demo workloadDeploying with DRA with precise GPU requirementsSharing the GPUWhat CAST AI addsFrom one GPU to seven MIG partitions

Sort: