版本:下一个
Scheduling Policies
HAMi supports multiple GPU scheduling policies to handle complex workload scenarios. A pod can select a scheduling policy using pod annotations.
Available Policies
| Policy | Scope | Effect |
|---|---|---|
binpack | Node | Tries to allocate tasks to the same GPU node as much as possible |
spread | Node | Tries to allocate tasks to different GPU nodes as much as possible |
numa-first | GPU | For multi-GPU allocations, prefers GPUs on the same NUMA node |
Default Policy
The default node scheduling policy is binpack and the default GPU scheduling policy is spread. These can be changed globally via Helm:
helm install hami hami-charts/hami \
--set scheduler.defaultSchedulerPolicy.nodeSchedulerPolicy=binpack \
--set scheduler.defaultSchedulerPolicy.gpuSchedulerPolicy=spread
Per-Pod Policy via Annotations
Individual pods can override the default by specifying a scheduling policy in .metadata.annotations.