PP-1298

MoM on Windows does not run the exechost_startup hook every time it restarts

    Details

    • Type: Bug
    • Status: Open
    • Priority: High
    • Resolution: Unresolved
    • Affects versions: 19.1.1
    • Fix versions: None
    • Components: Hooks, MoM
    • Labels:
      None
    • Severity:
      4-Critical
    • Sprint:
    • Story Points:
      1

      Description

      Note to the developer of this ticket: please modify pbspro/.appveyor.yml once this is fixed.

      Don't be confused by pbs_version=18.1.0 in the log below. It shows 18.1.0 because the OSS master pbspro.spec file has 18.1.0 as pbs_version, and these logs are from an AppVeyor CI run for OSS master.

      cat C:\PROGRA~2\PBS\home\mom_priv\hooks\simple_hook.HK
      hook_name=simple_hook
      event=queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach,exechost_periodic,exechost_startup
      freq=35
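
The control file above can be checked from a script to confirm that exechost_startup is among the configured events. This is a minimal sketch, assuming the .HK file holds only simple key=value lines as in this dump; parse_hook_file and hook_events are hypothetical helper names, not part of PBS Pro.

```python
def parse_hook_file(text):
    """Parse key=value lines of a .HK hook control file into a dict."""
    attrs = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or "=" not in line:
            continue  # skip blank or malformed lines
        key, _, value = line.partition("=")
        attrs[key.strip()] = value.strip()
    return attrs


def hook_events(attrs):
    """Return the comma-separated 'event' attribute as a list."""
    return [e for e in attrs.get("event", "").split(",") if e]


# Contents copied from the simple_hook.HK dump in this ticket.
HK_TEXT = """hook_name=simple_hook
event=queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach,exechost_periodic,exechost_startup
freq=35
"""

attrs = parse_hook_file(HK_TEXT)
events = hook_events(attrs)
print("exechost_startup configured:", "exechost_startup" in events)
```

Here the values stay as strings (freq is "35"), since the sketch only needs to test membership in the event list.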

      08/25/2018 16:23:58;0002;pbs_mom;Svr;pbs_mom;Is down
      08/25/2018 16:23:58;0002;pbs_mom;Svr;Log;Log closed
      08/25/2018 16:24:00;0002;pbs_mom;Svr;Log;Log opened
      08/25/2018 16:24:00;0002;pbs_mom;Svr;pbs_mom;pbs_version=18.1.0
      08/25/2018 16:24:00;0002;pbs_mom;Svr;pbs_mom;pbs_build=mach=WIN32:security=:configure_args=
      08/25/2018 16:24:00;0002;pbs_mom;Svr;pbs_mom;hostname=N/A;pbs_leaf_name=N/A;pbs_mom_node_name=N/A
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;ipv4/ipv6 interface {6E42B798-E581-4EE9-A19D-68AD45B58191}: APPVYR-WIN
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;ipv4/ipv6 interface {6E42B798-E581-4EE9-A19D-68AD45B58191}: APPVYR-WIN
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;ipv4/ipv6 interface {1F777394-0B42-11E3-80AD-806E6F6E6963}: APPVYR-WIN
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;ipv4/ipv6 interface {1F777394-0B42-11E3-80AD-806E6F6E6963}: APPVYR-WIN
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;ipv4/ipv6 interface {AFF10B87-28DE-471D-8CBA-E15E3E23751A}: fe80::5efe:10.0.0.4%13
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home for read access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/spool for read/write access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/undelivered for read/write access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;created file C:/PROGRA~2/PBS/home/pbs_environment
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing file C:/PROGRA~2/PBS/home/pbs_environment: full access to admin and read to Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/auxiliary for read access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/checkpoint for access to service account and administrators
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_priv for read access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;created file C:/PROGRA~2/PBS/home/mom_priv/config
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_priv/config for admin-only access
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_logs for read access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_priv/jobs for read access by Everyone
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_priv/hooks for access to service account and administrators
      08/25/2018 16:24:01;0086;pbs_mom;Fil;;securing C:/PROGRA~2/PBS/home/mom_priv/hooks/tmp for access to service account and administrators
      08/25/2018 16:24:01;0100;pbs_mom;Svr;parse_config;file config
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;Adding IP address 10.0.0.4 as authorized
      08/25/2018 16:24:01;0002;pbs_mom;n/a;read_config;max_check_poll = 120, min_check_poll = 10
      08/25/2018 16:24:01;0d80;pbs_mom;TPP;pbs_mom(Main Thread);TPP set to use reserved port authentication
      08/25/2018 16:24:01;0c06;pbs_mom;TPP;pbs_mom(Main Thread);TPP leaf node names = 10.0.0.4:15003,10.0.0.4:15003,127.0.0.1:15003
      08/25/2018 16:24:01;0d80;pbs_mom;TPP;pbs_mom(Main Thread);Initializing TPP transport Layer
      08/25/2018 16:24:01;0d80;pbs_mom;TPP;pbs_mom(Main Thread);TPP initialization done
      08/25/2018 16:24:01;0c06;pbs_mom;TPP;pbs_mom(Thread 0);Thread ready
      08/25/2018 16:24:01;0c06;pbs_mom;TPP;pbs_mom(Main Thread);Single pbs_comm configured, TPP Fault tolerant mode disabled
      08/25/2018 16:24:01;0d80;pbs_mom;TPP;pbs_mom(Main Thread);Connecting to pbs_comm APPVYR-WIN:17001
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;Adding IP address 127.0.0.1 as authorized
      08/25/2018 16:24:01;0c06;pbs_mom;TPP;pbs_mom(Thread 0);Registering address 10.0.0.4:15003 to pbs_comm
      08/25/2018 16:24:01;0c06;pbs_mom;TPP;pbs_mom(Thread 0);Connected to pbs_comm APPVYR-WIN:17001
      08/25/2018 16:24:01;0002;pbs_mom;Svr;set_checkpoint_path;Using default checkpoint path.
      08/25/2018 16:24:01;0002;pbs_mom;Svr;set_checkpoint_path;Setting checkpoint path to C:/PROGRA~2/PBS/home/checkpoint/
      08/25/2018 16:24:01;0086;pbs_mom;Svr;pbs_mom;Found hook simple_hook type=site
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;ALLHOOKS hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_begin hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_prologue hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_launch hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_epilogue hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_preterm hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_end hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0080;pbs_mom;Hook;print_hook;execjob_attach hook[0] = {simple_hook, order=1, type=0, enabled=1 user=0, debug=(0) fail_action=(1), event=(queuejob,modifyjob,resvsub,movejob,runjob,periodic,resv_end,execjob_begin,execjob_prologue,execjob_epilogue,execjob_end,execjob_preterm,execjob_launch,execjob_attach), alarm=30, freq=120}
      08/25/2018 16:24:01;0001;pbs_mom;Svr;pbs_mom;No error (0) in main, Created window station=PBSProWS
      08/25/2018 16:24:01;0001;pbs_mom;Svr;pbs_mom;No error (0) in main, Created desktop default in window station=PBSProWS
      08/25/2018 16:24:01;0006;pbs_mom;Fil;pbs_mom;Version 18.1.0, started, initialization type = 0
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;Mom pid = 2196 ready, using ports Server:15001 MOM:15002 RM:15003
      08/25/2018 16:24:01;0002;pbs_mom;n/a;initialize;pcpus=2, OS reports 2 cpu(s)
      08/25/2018 16:24:01;0d80;pbs_mom;TPP;pbs_mom(Main Thread);net restore handler called
      08/25/2018 16:24:01;0002;pbs_mom;Svr;pbs_mom;Restart sent to server at APPVYR-WIN:15001
      08/25/2018 16:24:02;0002;pbs_mom;Svr;pbs_mom;Hello from server at 10.0.0.4:15001
      08/25/2018 16:24:02;0100;pbs_mom;Req;;Type 85 request received from appveyor@10.0.0.4:15001, sock=1
      08/25/2018 16:24:02;0080;pbs_mom;Hook;simple_hook.HK;copy hook-related file request received
      08/25/2018 16:24:02;0006;pbs_python;Hook;pbs_python;hook event type is exechost_periodic
      08/25/2018 16:24:02;0086;pbs_mom;Fil;;found pid_handles[0]=1548 to have exited
      08/25/2018 16:24:02;0086;pbs_mom;Fil;;status=0
      08/25/2018 16:25:08;0100;pbs_mom;Req;;Type 1 request received from appveyor@10.0.0.4:15001, sock=1
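
One way to check a MoM log for this symptom is to group the logged hook event types by restart. The sketch below rests on assumptions drawn only from this excerpt: each record has six semicolon-separated fields, a "Log opened" record marks a new MoM start, and pbs_python logs "hook event type is <name>" when it runs a hook. events_per_start is a hypothetical helper name.

```python
import re

EVENT_RE = re.compile(r"hook event type is (\w+)")


def events_per_start(log_lines):
    """Return one list of hook event types per MoM start found in the log."""
    starts = []
    for line in log_lines:
        # timestamp;code;daemon;type;object;message
        fields = line.strip().split(";", 5)
        if len(fields) < 6:
            continue  # not a complete log record
        message = fields[5]
        if message == "Log opened":
            starts.append([])  # assumed marker of a new start
            continue
        match = EVENT_RE.search(message)
        if match and starts:
            starts[-1].append(match.group(1))
    return starts


# A few records copied from the log in this ticket.
SAMPLE = [
    "08/25/2018 16:23:58;0002;pbs_mom;Svr;Log;Log closed",
    "08/25/2018 16:24:00;0002;pbs_mom;Svr;Log;Log opened",
    "08/25/2018 16:24:02;0080;pbs_mom;Hook;simple_hook.HK;copy hook-related file request received",
    "08/25/2018 16:24:02;0006;pbs_python;Hook;pbs_python;hook event type is exechost_periodic",
    "08/25/2018 16:24:02;0086;pbs_mom;Fil;;status=0",
]

starts = events_per_start(SAMPLE)
# Indexes of starts in which no exechost_startup hook was logged.
missing = [i for i, evs in enumerate(starts) if "exechost_startup" not in evs]
print("starts without exechost_startup:", missing)
```

Run against the records above, the only start shows an exechost_periodic event and no exechost_startup event, which matches the behavior reported in this ticket.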

            People

            • Assignee:
              Unassigned
            • Reporter:
              Hiren Vadalia (hirenvadalia)
            • Votes:
              0
            • Watchers:
              9
