Comment: Address comments from @billnitzberg

...

Parameter Name | Default Value | Description
enabled | false | When set to true, the hook registers a limit that restricts the amount of huge page memory processes may access. When set to false, no limit is registered.
exclude_hosts | [ ] | Specifies the list of hosts for which the use of this subsystem should be disabled.
exclude_vntypes | [ ] | Specifies the list of vnode types for which the use of this subsystem should be disabled. This applies to the builtin vntype attribute assigned to a node.
default | 0MB | The amount of huge page memory assigned to the cgroup when the job does not request hpmem.
reserve_percent | 0 | The percentage of available huge page memory that is not to be assigned to jobs. This alters the amount of hpmem that MoM reports to the server. This value is added to reserve_amount to obtain the total amount reserved.
reserve_amount | 0MB | The amount of available huge page memory that is not to be assigned to jobs. This alters the amount of hpmem that MoM reports to the server. This value is added to reserve_percent to obtain the total amount reserved.
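
The way reserve_percent and reserve_amount combine can be illustrated with a short worked example. This is a minimal sketch, not the hook's code: the function name, the use of megabytes, the truncation to whole megabytes, and the clamp at zero are all assumptions made for illustration. The same arithmetic applies to the mem and vmem reservations described for the other subsystems.

```python
def reported_memory_mb(total_mb, reserve_percent=0, reserve_amount_mb=0):
    """Return the amount MoM would report after subtracting the reservation.

    Hypothetical helper: the percentage-based reservation and the fixed
    reservation are added together, as the parameter descriptions state.
    """
    reserved_mb = total_mb * reserve_percent / 100 + reserve_amount_mb
    # Assumption: never report a negative amount, and truncate to whole MB.
    return max(0, int(total_mb - reserved_mb))


# 4096 MB of huge pages, 25% reserved plus a fixed 512 MB:
# 1024 MB + 512 MB = 1536 MB reserved, so 2560 MB is reported.
print(reported_memory_mb(4096, reserve_percent=25, reserve_amount_mb=512))
```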

...

Parameter Name | Default Value | Description
enabled | false | When set to true, the hook registers the physical memory limit for a job. When set to false, no limit is registered.
exclude_hosts | [ ] | Specifies the list of hosts for which the use of this subsystem should be disabled.
exclude_vntypes | [ ] | Specifies the list of vnode types for which the use of this subsystem should be disabled. This applies to the builtin vntype attribute assigned to a node.
soft_limit | false | A soft memory limit specifies the minimum amount of physical memory a job should be allocated before utilizing swap space. It adjusts the behavior of the kernel by allowing the physical memory allocation to exceed the soft limit when memory demand (a.k.a. memory pressure) is low. The cgroup is ultimately limited to the amount of virtual memory specified in the memsw subsystem. When memory pressure increases, the kernel begins to page physical memory out to swap space until the soft limit is reached. Soft memory limits allow processes to take advantage of physical memory when it is available, but may lead to longer run times when memory pressure is high. When this parameter is set to true, soft memory limits are used. When set to false, hard memory limits are used, which prevent the processes from ever exceeding their specified mem limit.
default | 0MB | The amount of physical memory available to a cgroup when no mem limit has been specified.
reserve_percent | 0 | The percentage of available physical memory that is not to be assigned to jobs. This alters the amount of mem that MoM reports to the server. This value is added to reserve_amount to obtain the total amount reserved.
reserve_amount | 0MB | The amount of available physical memory that is not to be assigned to jobs. This alters the amount of mem that MoM reports to the server. This value is added to reserve_percent to obtain the total amount reserved.
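
The difference between soft and hard limits can be sketched as a choice of which cgroup control file receives the job's mem value. The file names below are the standard cgroup v1 memory controller interfaces. Whether the hook writes exactly these files, and in this way, is an assumption, and the function name is hypothetical.

```python
def memory_limit_files(mem_bytes, vmem_bytes=None, soft_limit=False):
    """Map cgroup v1 control files to the values a job's limits imply.

    Illustrative only: with soft_limit the mem value becomes a soft limit,
    otherwise it becomes a hard limit. The memsw limit, when given, caps
    physical memory plus swap in both cases.
    """
    files = {}
    if soft_limit:
        # The kernel reclaims down toward this value only under memory pressure.
        files["memory.soft_limit_in_bytes"] = mem_bytes
    else:
        # Processes can never exceed this amount of physical memory.
        files["memory.limit_in_bytes"] = mem_bytes
    if vmem_bytes is not None:
        files["memory.memsw.limit_in_bytes"] = vmem_bytes
    return files


GIB = 1024 ** 3
print(memory_limit_files(2 * GIB, vmem_bytes=4 * GIB, soft_limit=True))
```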

...

Parameter Name | Default Value | Description
enabled | false | When set to true, the hook registers the virtual memory limit for a job. When set to false, no limit is registered.
exclude_hosts | [ ] | Specifies the list of hosts for which the use of this subsystem should be disabled.
exclude_vntypes | [ ] | Specifies the list of vnode types for which the use of this subsystem should be disabled. This applies to the builtin vntype attribute assigned to a node.
default | 0MB | The amount of virtual memory available to a cgroup when no vmem limit has been specified.
reserve_percent | 0 | The percentage of available virtual memory that is not to be assigned to jobs. This alters the amount of vmem that MoM reports to the server. This value is added to reserve_amount to obtain the total amount reserved.
reserve_amount | 0MB | The amount of available virtual memory that is not to be assigned to jobs. This alters the amount of vmem that MoM reports to the server. This value is added to reserve_percent to obtain the total amount reserved.
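
The enabled, exclude_hosts, and exclude_vntypes parameters together decide whether a subsystem is used on a given node. A minimal sketch of that decision follows. The function name and the representation of the subsystem settings as a Python dict are assumptions for illustration, not the hook's actual interface.

```python
def subsystem_active(cfg, hostname, vntype=None):
    """Return True if the subsystem should be used on this node.

    cfg is a hypothetical dict holding one subsystem's parameters,
    with the defaults from the table applied when a key is absent.
    """
    if not cfg.get("enabled", False):
        return False
    if hostname in cfg.get("exclude_hosts", []):
        return False
    # vntype is the builtin vntype attribute assigned to the node, if any.
    if vntype is not None and vntype in cfg.get("exclude_vntypes", []):
        return False
    return True


memsw = {"enabled": True, "exclude_hosts": ["login01"], "exclude_vntypes": ["no_cgroups"]}
print(subsystem_active(memsw, "node17"))
print(subsystem_active(memsw, "login01"))
```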

...

  • Visibility: Public
  • Change Control: Experimental
  • Synopsis: Integer resource used to request the number of Intel MIC devices required per chunk

  • Details: Users may submit jobs requesting nmics=<#> in order for the scheduler to assign nodes with a sufficient number of Intel MIC devices. The nmics resource must be added to the resources line in the scheduler configuration file for this resource to be considered for scheduling.

Interface 3: ngpus

  • Visibility: Public
  • Change Control: Experimental
  • Synopsis: Integer resource used to request the number of NVIDIA GPUs required per chunk

  • Details: Users may submit jobs requesting ngpus=<#> in order for the scheduler to assign nodes with a sufficient number of NVIDIA GPU devices. The ngpus resource must be added to the resources line in the scheduler configuration file for this resource to be considered for scheduling.

Interface 4: vmem

  • Visibility: Public
  • Change Control: Experimental
  • Synopsis: Size resource used to request the amount of virtual memory required per chunk

  • Details: Users may submit jobs requesting vmem=<size> in order for the scheduler to assign nodes with a sufficient amount of virtual memory. The vmem resource must be added to the resources line in the scheduler configuration file for this resource to be considered for scheduling. MoM must be restarted when the memsw subsystem is enabled before it will report a value for vmem.
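
As an illustration of how these per-chunk resources appear in a job request, the following sketch assembles a select statement of the kind passed to qsub with the -l option. The helper name and the example values are hypothetical. The resources themselves are only honored by the scheduler once they are listed on the resources line, as described above.

```python
def select_chunk(count=1, **resources):
    """Build a select statement from per-chunk resource requests.

    Hypothetical helper: each keyword argument becomes name=value,
    joined with ':' after the chunk count.
    """
    parts = [str(count)] + [f"{name}={value}" for name, value in resources.items()]
    return "select=" + ":".join(parts)


# Two chunks, each with 4 CPUs, 1 GPU and 16 GB of virtual memory.
print(select_chunk(2, ncpus=4, ngpus=1, vmem="16gb"))
# prints: select=2:ncpus=4:ngpus=1:vmem=16gb
```

The resulting string would be used as, for example, `qsub -l select=2:ncpus=4:ngpus=1:vmem=16gb job.sh`, where job.sh is a placeholder script name.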

...

  • Visibility: Public
  • Change Control: Experimental
  • Synopsis: Size resource used to request the amount of huge page memory required per chunk

  • Details: Users may submit jobs requesting hpmem=<size> in order for the scheduler to assign nodes with a sufficient amount of huge page memory. The hpmem resource must be added to the resources line in the scheduler configuration file for this resource to be considered for scheduling. MoM must be restarted when the hugetlb subsystem is enabled before it will report a value for hpmem.

...