Uploaded image for project: 'Pegasus'
  1. Pegasus
  2. PM-1085

-p 0 options for condor_dagman sub dax jobs result in dagman ( 8.2.8) dying

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: master, 4.6.0
    • Fix Version/s: 4.7.0, 4.6.1
    • Component/s: pegasus-plan
    • Labels:
      None

      Description

       tried running another workflow with the new version of 4.6.1dev, and now I'm having a new problem. My first job runs on Titan ok. It's used as input to plan a nested workflow, but when I try to run that I get weird condor problems:

      04/20/16 15:23:30 Running POST script of Node create_dir_AWP_SGT_s431.dax_0_titan...
      04/20/16 15:23:30 ERROR "Assertion ERROR on (mysin.Length() > 0)" at line 6470 in file /slots/07/dir_14430/userdir/src/condor_daemon_core.V6/daemon_core.cpp

      It keeps aborting and restarting but always has that same error. (/home/scec-02/cybershk/runs/s431_SGT_dax/run_4666/dags/cybershk/pegasus/CyberShake_SGT_s431.dax/20160420T151457-0700/AWP_SGT_s431.000/AWP_SGT_s431.dag.dagman.out) I ran condor_restart earlier today; maybe that was it? Anyway, I thought I'd ask if you'd seen this before, and make sure that it's not somehow related to something you've fixed.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                vahi Karan Vahi
                Reporter:
                scottcal Scott Callaghan
              • Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: