Hi all,
I have been searching for an existing solution that applies a backoff algorithm when a job needs to be re-executed because a 'recovery' or 'failover' is encountered.
I have a task that can fail in normal operation because the resources it depends on may be offline for a few hours. I don't want every retry to fire immediately; I want to use a simple backoff algorithm.
If there is no prescribed solution, I'll share what I write.
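To make the idea concrete, here is a rough sketch of the kind of backoff I have in mind. It is not tied to any scheduler API. The function name `backoff_delay`, the 60-second base, the 4-hour cap, and the jitter option are all placeholder assumptions for illustration, not an existing solution.

```python
import random


def backoff_delay(attempt, base=60.0, cap=4 * 3600.0, jitter=0.0, rng=random):
    """Return the seconds to wait before retry number `attempt` (0-based).

    All defaults are illustrative assumptions: 60 s base, 4 h cap, no jitter.
    """
    if attempt < 0:
        raise ValueError("attempt must be >= 0")
    # Double the wait on each attempt; clamp the exponent so the
    # multiplication stays small even for very large attempt counts.
    delay = base * (2 ** min(attempt, 30))
    # Never wait longer than the cap.
    delay = min(cap, delay)
    # Optional jitter: shave off up to `jitter` (a fraction, 0..1) of the
    # delay so that several nodes do not all retry at the same moment.
    if jitter:
        delay -= delay * jitter * rng.random()
    return delay


if __name__ == "__main__":
    for attempt in range(10):
        print(attempt, backoff_delay(attempt))
```

The re-executed job would be rescheduled at "now + backoff_delay(attempt)". This assumes the attempt count can be stored with the job so that it survives a recovery or failover, which I have not verified.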
Broc