Pegasus / PM-591 Support for pegasus-mpi-cluster in Pegasus / PM-639

Task query fails for PMC workflows when retries happen and PMC restarts from the remote rescue file


      PMC has a feature whereby, on a retry, it recovers from a remote rescue log file.

      Because of this, if retries happen in a workflow, the retry runs only the tasks that failed or were not run in the first attempt. This is different from pegasus-cluster, where on a Condor job retry the whole cluster is retried. As a result, the task query for PMC workflows in the retry case returns incomplete results, because the tasks that already succeeded in the earlier attempt are missing from the latest attempt.
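
The effect described above can be illustrated with a minimal Python sketch. It is not the Pegasus query code or its database schema: the `TaskRecord` shape, the function names, and the sample task IDs are all hypothetical, chosen only to show why a query that looks at the latest attempt alone misses tasks once PMC resumes from a rescue file.

```python
from typing import Dict, Iterable, NamedTuple


class TaskRecord(NamedTuple):
    # Hypothetical record shape for illustration, not the Pegasus schema.
    task_id: str
    attempt: int   # job attempt (retry number) that produced this record
    exitcode: int  # 0 means the task succeeded


def latest_attempt_only(records: Iterable[TaskRecord]) -> Dict[str, TaskRecord]:
    """Return only the tasks recorded in the most recent attempt.

    This is adequate when a retry reruns the whole cluster, but it is
    incomplete when the retry resumes from a rescue file.
    """
    records = list(records)
    if not records:
        return {}
    last = max(r.attempt for r in records)
    return {r.task_id: r for r in records if r.attempt == last}


def merged_across_attempts(records: Iterable[TaskRecord]) -> Dict[str, TaskRecord]:
    """Keep the newest record for each task across all attempts."""
    merged: Dict[str, TaskRecord] = {}
    for record in sorted(records, key=lambda r: r.attempt):
        merged[record.task_id] = record  # later attempts overwrite earlier ones
    return merged


# Attempt 1: A succeeds, B fails, C is never run.
# Attempt 2 (resumed from the rescue file): only B and C are run.
records = [
    TaskRecord("A", 1, 0),
    TaskRecord("B", 1, 1),
    TaskRecord("B", 2, 0),
    TaskRecord("C", 2, 0),
]

incomplete = latest_attempt_only(records)
complete = merged_across_attempts(records)
```

In this sketch `incomplete` contains only B and C, while `complete` contains A, B, and C, with B taken from the second attempt. Whether the real fix should merge at query time or when the records are loaded is a separate design decision that this example does not settle.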

            Assignee:
            gmehta Gaurang Mehta (Inactive)
            Reporter:
            gmehta Gaurang Mehta (Inactive)
            Watchers:
            3
