Uploaded image for project: 'Pegasus'
  1. Pegasus
  2. PM-446

pegasus dagman restarting monitord in replay mode

XMLWordPrintable

    • Type: Icon: Improvement Improvement
    • Resolution: Fixed
    • Priority: Icon: Major Major
    • master, 4.0
    • Affects Version/s: master, 4.0
    • Component/s: Monitord
    • None

      In case monitord dies during a worklfow run, pegasus dagman restarts it automatically in the replay mode.
      However, right now in the replay mode monitord disables notifications.

      As per our discussions during the last 2 days, we should implement a --recover option in pegasus-monitord. This option will "recover" the db by cleaning current workflow data, and repopulating from the beginning. In addition, pegasus-monitord will try to figure out how far it processed the dagman.out file previously and will omit notification till that point.

            Assignee:
            fabio Fabio Silva
            Reporter:
            vahi Karan Vahi
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: