fast start mode for monitord

This issue belongs to an archived project. You can view it, but you can't modify it. Learn more

XMLWordPrintable

    • Type: New Feature
    • Resolution: Fixed
    • Priority: Major
    • master, 4.6.0, 4.5.1
    • Affects Version/s: master, 4.5.0
    • Component/s: Monitord
    • None

      By default, when monitord starts up tracking a live dagman.out file, it sleeps intermittently, waiting for new lines to be logged in the dagman.out file.

      This behavior, however causes monitord to lag considerably

      • when starting for large workflows
      • when monitord gets restarted due to some failure by pegasus-dagman, or we submit a rescue dag.

      For new LIGO ahope worfklows ( there are about 190K jobs in a single DAX), this creates a problem. And one way to do this is alleviate this is for monitord to not sleep intermittently till it catches up with the dagman.out file.

            Assignee:
            Karan Vahi
            Reporter:
            Duncan Brown
            Archiver:
            Rajiv Mayani

              Created:
              Updated:
              Resolved:
              Archived: