Search before asking
Description
Currently, the Master and Worker modules are missing some key metrics that would be really useful for troubleshooting and monitoring. I noticed that a few metrics classes were already defined but weren't actually being used in the code.
I've added the following metrics:
- Master: Tracking workflow and task state changes (success, failure, kill, retry, etc.), as well as a timer for how long it takes to generate a workflow instance.
- Worker: Added a timer to measure the actual execution time of a task on the worker side.
- Timeout Monitoring: Also added a counter to track when task timeout events occur.
Are you willing to submit a PR?
Code of Conduct
Search before asking
Description
Currently, the Master and Worker modules are missing some key metrics that would be really useful for troubleshooting and monitoring. I noticed that a few metrics classes were already defined but weren't actually being used in the code.
I've added the following metrics:
Are you willing to submit a PR?
Code of Conduct