Has anyone encountered this?
I have four (similar) site reliability guardians, each guardian has 18 objectives, all based on DQL queries analyzing traces.
All four guardians are executed periodically by a single workflow (in parallel):
The workflow often fails because it is hitting an API rate limit (apparently too many DQL queries are made in a short time).
This is a problem because the Site Reliability Guardian becomes unreliable 🙂
I thought of creating one workflow per guardian, but that is associated with additional cost (every workflow costs!) and it would duplicate the actions that I'd like to take after the SRG validation steps.
So maybe that API rate limit should be increased for workflows?
kR
Certified Dynatrace Master, Dynatrace Partner - 360Performance.net