Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Overview:
PBS and ALPS can sometimes get out of sync. The purpose of the synchronization hook is to check to see if the information
that PBS has is out of sync with what ALPS is reporting. When the hook detects that PBS and ALPS are out of sync, the hook 
will restart HUP the Mom.


Interface 1: PBS hook PBS_alps_inventory_check

  • Visibility: Public
  • Change Control: Experimental
  • Details: 
    • This is a periodic hook that runs on the execution host.
    • The Hook is not enabled by default. It runs as the Administrator and executes every 300 seconds.
    • The timeout for the Hook is 90 seconds.
        

Interface 2: Mom log entry: ALPS Inventory Check: apstat command cannot be found at <path>

...

Interface 10: Mom log entry: ALPS Inventory Check: Compute node <name> <list of nodes> defined in ALPS, but not in PBS: <name><list of nodes>

  • Visibility: PBS Private
  • Change Control: Experimental
  • Details: 
    • Recorded when PBS and ALPS are out of sync i.e. ALPS has information that PBS does not have.
    • Log level: PBSEVENT_ADMIN.

...

Interface 11: Mom log entry: ALPS Inventory Check: Compute node <name> <list of nodes> defined in PBS, but not in ALPS: <name><list of nodes>

  • Visibility: PBS Private
  • Change Control: Experimental
  • Details: 
    • Recorded when PBS and ALPS are out of sync i.e. PBS reports nodes that ALPS does not.
    • Log level: PBSEVENT_ADMIN.

...