pegasus dagman restarting monitord in replay mode

    • Type: Improvement
    • Resolution: Fixed
    • Priority: Major
    • Affects Version/s: master, 4.0
    • Component/s: Monitord

      If monitord dies during a workflow run, pegasus dagman restarts it automatically in replay mode.
      However, monitord currently disables notifications when it runs in replay mode.

      As per our discussions over the last 2 days, we should implement a --recover option in pegasus-monitord. This option will "recover" the database by deleting the current workflow's data and repopulating it from the beginning. In addition, pegasus-monitord will try to determine how far it previously processed the dagman.out file and will suppress notifications up to that point.
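
      The proposed --recover behaviour could be sketched roughly as follows. This is only an illustration, not the pegasus-monitord implementation: the RecoveringMonitor class, the dict standing in for the database, the notify callback, and the assumption that the previous progress is available as a saved line number are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class RecoveringMonitor:
    """Hypothetical stand-in for monitord's replay loop (not the real code)."""

    wf_uuid: str
    # Assumption: the previous run's progress in dagman.out is known as a
    # 1-based line number. How monitord actually determines it is not specified.
    last_processed_line: int
    notify: Callable[[str], None]
    # Assumption: a dict of workflow id -> stored events stands in for the db.
    db: Dict[str, List[str]] = field(default_factory=dict)

    def recover(self, dagman_out_lines: Iterable[str]) -> None:
        # Step 1: clean the data stored for the current workflow only.
        self.db[self.wf_uuid] = []
        # Step 2: repopulate from the beginning of dagman.out.
        for lineno, line in enumerate(dagman_out_lines, start=1):
            event = line.rstrip("\n")
            if not event:
                continue
            self.db[self.wf_uuid].append(event)
            # Step 3: suppress notifications for lines the previous run had
            # already processed; notify only for lines past that point.
            if lineno > self.last_processed_line:
                self.notify(event)


# Usage: the previous run had processed the first two lines before it died.
sent: List[str] = []
monitor = RecoveringMonitor(
    wf_uuid="wf-1",
    last_processed_line=2,
    notify=sent.append,
    db={"wf-1": ["stale event"], "wf-2": ["other workflow event"]},
)
monitor.recover(["job A submitted\n", "job A finished\n", "job B submitted\n"])
```

      In this sketch the database ends up with all three events for the recovered workflow, data for other workflows is left alone, and only the event past the saved position triggers a notification.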

            Assignee:
            Fabio Silva
            Reporter:
            Karan Vahi
            Archiver:
            Rajiv Mayani
