Dynamic Resource Allocation (DRA) is a new GA feature in Kubernetes v1.34 that addresses GPU scheduling limitations like resource fragmentation, topology blindness, and cost explosion. DRA introduces a Kubernetes-native approach for requesting, allocating, and sharing hardware resources through concepts like DeviceClass, ResourceSlice, ResourceClaim, and ResourceClaimTemplate. The feature eliminates manual coordination between cluster admins and workload operators by allowing declarative device requests that Kubernetes automatically schedules and assigns to pods.
Table of contents
How is DRA different?DRA TerminologyTypical Workflow for DRAResourceClaims vs ResourceClaimTemplateExample YAML ManifestsConclusionSort: