Uploaded image for project: 'Pegasus'
  1. Pegasus
  2. PM-1085

-p 0 options for condor_dagman sub dax jobs result in dagman ( 8.2.8) dying

XMLWordPrintable

    • Type: Icon: Bug Bug
    • Resolution: Fixed
    • Priority: Icon: Major Major
    • 4.7.0, 4.6.1
    • Affects Version/s: master, 4.6.0
    • Component/s: Pegasus Planner
    • None

      tried running another workflow with the new version of 4.6.1dev, and now I'm having a new problem. My first job runs on Titan ok. It's used as input to plan a nested workflow, but when I try to run that I get weird condor problems:

      04/20/16 15:23:30 Running POST script of Node create_dir_AWP_SGT_s431.dax_0_titan...
      04/20/16 15:23:30 ERROR "Assertion ERROR on (mysin.Length() > 0)" at line 6470 in file /slots/07/dir_14430/userdir/src/condor_daemon_core.V6/daemon_core.cpp

      It keeps aborting and restarting but always has that same error. (/home/scec-02/cybershk/runs/s431_SGT_dax/run_4666/dags/cybershk/pegasus/CyberShake_SGT_s431.dax/20160420T151457-0700/AWP_SGT_s431.000/AWP_SGT_s431.dag.dagman.out) I ran condor_restart earlier today; maybe that was it? Anyway, I thought I'd ask if you'd seen this before, and make sure that it's not somehow related to something you've fixed.

            Assignee:
            vahi Karan Vahi
            Reporter:
            scottcal Scott Callaghan
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: