Uploaded image for project: 'Pegasus'
  1. Pegasus
  2. PM-1385

Prioritize transfers bases on dependencies


    • Type: Icon: Improvement Improvement
    • Resolution: Fixed
    • Priority: Icon: Minor Minor
    • master, 5.0.0, 4.9.4
    • Affects Version/s: master, 5.0.0, 4.9.3
    • Component/s: None
    • None

      It is common to see bag-of-tasks workflows, where the jobs have a few common inputs (executable, common db, container, ...) and then one or more unique input per job. With the current refiner, we get a bunch of stage_in jobs with the same priority. I think it would be nice to increase the priority on the jobs with common files (which can probably be detected by just looking at the fan-out of the stage in job). That means common inputs will be staged first, and give the workflow a better chance to start jobs when only a part of the unique inputs have been staged, which means better overlap of input transfers and compute jobs.

            vahi Karan Vahi
            rynge Mats Rynge
            2 Start watching this issue