yarn 动态重新分配剩余的计算能力
yarn reallocate remaining compute power dynamically
我想知道如何为 yarn 配置动态队列:
假设有 2 个队列
- A(高性能,集群的 70%)
- B(正常,其余 = 集群的 30%)
我注意到即使其他 70% 是空的,B 项也只会粘在它们分配的资源上。我如何重新分配这些资源(如果没有 A 作业)以更快地完成 B 作业?
容量调度程序文档https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/CapacityScheduler.html对队列之间的弹性和资源抢占提出了一些观点。
Elasticity - Free resources can be allocated to any queue beyond its capacity. When there is demand for these resources from queues running below capacity at a future point in time, as tasks scheduled on these resources complete, they will be assigned to applications on queues running below the capacity (preemption is also supported). This ensures that resources are available in a predictable and elastic manner to queues, thus preventing artificial silos of resources in the cluster which helps utilization.
它还指定了有关resources/containers的队列弹性和抢占的配置参数,例如:
yarn.scheduler.capacity.[queue-path].capacity - Queue capacity in percentage (%) as a float (e.g. 12.5). The sum of capacities for all queues, at each level, must be equal to 100. Applications in the queue may consume more resources than the queue’s capacity if there are free resources, providing elasticity.
关于抢占
The CapacityScheduler supports preemption of container from the queues whose resource usage is more than their guaranteed capacity.
该页面上列出了更多参数,您应该查看这些参数以进行良好的配置。
我想知道如何为 yarn 配置动态队列: 假设有 2 个队列
- A(高性能,集群的 70%)
- B(正常,其余 = 集群的 30%)
我注意到即使其他 70% 是空的,B 项也只会粘在它们分配的资源上。我如何重新分配这些资源(如果没有 A 作业)以更快地完成 B 作业?
容量调度程序文档https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/CapacityScheduler.html对队列之间的弹性和资源抢占提出了一些观点。
Elasticity - Free resources can be allocated to any queue beyond its capacity. When there is demand for these resources from queues running below capacity at a future point in time, as tasks scheduled on these resources complete, they will be assigned to applications on queues running below the capacity (preemption is also supported). This ensures that resources are available in a predictable and elastic manner to queues, thus preventing artificial silos of resources in the cluster which helps utilization.
它还指定了有关resources/containers的队列弹性和抢占的配置参数,例如:
yarn.scheduler.capacity.[queue-path].capacity - Queue capacity in percentage (%) as a float (e.g. 12.5). The sum of capacities for all queues, at each level, must be equal to 100. Applications in the queue may consume more resources than the queue’s capacity if there are free resources, providing elasticity.
关于抢占
The CapacityScheduler supports preemption of container from the queues whose resource usage is more than their guaranteed capacity.
该页面上列出了更多参数,您应该查看这些参数以进行良好的配置。