Pegasus / PM-591 Support for pegasus-mpi-cluster in Pegasus / PM-639

Task query fails for PMC workflows when retries happen and PMC restarts from the remote rescue file

    Details

      Description

      PMC (pegasus-mpi-cluster) can recover on a retry by reading a remote rescue log file.

      Because of this, if retries happen in a workflow, the retry attempt runs only the tasks that failed or were not run in the first attempt. This differs from pegasus-cluster, where the whole cluster is rerun when the Condor job is retried. As a result, the task query for PMC workflows returns an incomplete set of jobs when retries occur.
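
The effect described above can be illustrated with a minimal Python sketch. It is not Pegasus code: the record layout (`attempt`, `task`, `exitcode`) and both function names are hypothetical, and it assumes the failing query only looks at the tasks recorded for the latest attempt. Under that assumption, a query that works for pegasus-cluster (where every retry reruns all tasks) misses the tasks that PMC skipped because they had already succeeded.

```python
# Hypothetical per-task records: one entry for each task that actually ran
# in a given attempt of the clustered job. Not the real Pegasus schema.
records = [
    {"attempt": 1, "task": "t1", "exitcode": 0},  # succeeded, skipped on retry
    {"attempt": 1, "task": "t2", "exitcode": 1},  # failed, rerun on retry
    {"attempt": 2, "task": "t2", "exitcode": 0},
    {"attempt": 2, "task": "t3", "exitcode": 0},  # never ran in attempt 1
]


def tasks_for_latest_attempt(records):
    """Assumed behaviour of the failing query: use only the last attempt."""
    latest = max(r["attempt"] for r in records)
    return {r["task"]: r["exitcode"] for r in records if r["attempt"] == latest}


def tasks_across_attempts(records):
    """Merge all attempts, keeping the most recent result for each task."""
    merged = {}
    for r in sorted(records, key=lambda r: r["attempt"]):
        merged[r["task"]] = r["exitcode"]  # later attempts overwrite earlier ones
    return merged


latest_only = tasks_for_latest_attempt(records)
all_attempts = tasks_across_attempts(records)

# Tasks that the latest-attempt view loses for a PMC-style retry.
missing = sorted(set(all_attempts) - set(latest_only))
print("latest attempt only:", sorted(latest_only))
print("across attempts:    ", sorted(all_attempts))
print("missing:            ", missing)
```

Merging across attempts is only one possible way to obtain the complete task list; the sketch does not claim this is how the issue was resolved.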

            People

            Assignee: Gaurang Mehta (gmehta, inactive)
            Reporter: Gaurang Mehta (gmehta, inactive)