Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

PBS in its current form releases all the consumable resources requested by the job when it is suspended. In most cases suspended job holds on to the memory it would have consumed and just releases ncpus, in some cases admin might have configured an alternate suspend signal which would make the job release a few resources (like licenses) upon suspension. Therefore, it would be better if PBS has a way for admins to specify what all resources can be released from a job upon suspension.

Link to forum discussion.

Interface 1: New scheduler attribute to specify which resources can be released.

  • Visibility: Public
  • Change Control: Stable
  • Details:
    • A new scheduler attribute “res_released_on_susp” can be used to specify a list of comma separated list of resource names that can be release when a job is suspended.
    • This scheduler attribute is of type “array_string” and can only be set by a manager.
    • If a manager is trying to set the attribute with a resource which is non existent then following error will be thrown on the console by qmgr command - 

...

Interface 2: New Job attribute “resources_released”

  • Visibility: Public
  • Change Control: Stable
  • Details:

A new job attribute “resources_released” is added.

...

Interface 3: New Job attribute “resource_released_list”

  • Visibility: Public
  • Change Control: Stable
  • Details:

A new job attribute “resource_released_list” is added.

...

Interface 4: New server log message

  • Visibility: Public
  • Change Control: Stable
  • Details:
    • If server is unable to populate “resources_released” job attribute while suspending a job then it will log following log message at LOG_INFO log level.

...

Interface 5: New scheduler log message

  • Visibility: Public
  • Change Control: Stable
  • Details:
    • If scheduler is unable to populate “resources_released” job attribute while suspending a job then it will log following log message at LOG_INFO log level.

...


Interface 6: New error message while deleting a custom resource

  • Visibility: Public
  • Change Control: Stable
  • Details:
    • If an admin tries to delete a custom resource that is part of the res_released_on_susp scheduler attribute then qmgr command will fail with “resource busy” error code.

...