monitord does load backed up .out and .err files if kickstart is not used

XMLWordPrintable

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major
    • master, 4.2
    • Affects Version/s: master
    • Component/s: Monitord
    • None

      By default, pegasus-monitord is geared to expect backups of .out and .err files only for kickstart enabled jobs.

      However, for 4.2 the planner has started associated the pegasus-exitcode postscript for auxillary jobs that don;t get launched via kickstart

      For example, in case of worker package staging the create dir/stagein and untar jobs are not launched via kickstart.

      Because of this change, the data from the .out and .err files of these auxillary jobs was not loaded into the stampede database

      In monitord.log one sees.
      vahi@obelix:/data/scratch/vahi/work/creamce/test/creamce-sharedfs/work/vahi/pegasus/blackdiamond/run0007$ more monitord.log.000
      2013-01-24 13:23:59,373:job.py:read_stdout_stderr_files:410: WARNING: unable to read output file: /data/scratch/vahi/work/creamce/test/creamce-sharedfs/work/vahi/pegasus/blackdiamond/run0007/create_dir_blackdiamond_0_creamce.out, continuing...
      2013-01-24 13:23:59,374:job.py:read_stdout_stderr_files:425: WARNING: unable to read error file: /data/scratch/vahi/work/creamce/test/creamce-sharedfs/work/vahi/pegasus/blackdiamond/run0007/create_dir_blackdiamond_0_creamce.err, continuing...
      2013-01-24 13:24:15,423:job.py:read_stdout_stderr_files:410: WARNING: unable to read output file: /data/scratch/vahi/work/creamce/test/creamce-sharedfs/work/vahi/pegasus/blackdiamond/run0007/create_dir_blackdiamond_0_creamce.out, continuing...
      2013-01-24 13:24:15,423:job.py:read_stdout_stderr_files:425: WARNING: unable to read error file: /data/scratch/vahi/work/creamce/test/creamce-sharedfs/work/vahi/pegasus/blackdiamond/run0007/create_dir_blackdiamond_0_creamce.err, continuing...
      va

            Assignee:
            Karan Vahi
            Reporter:
            Karan Vahi
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: