i have an host with Oneagent installed. Then, i stop Oneagent process from host console, so in Dynatrace i see this host as unmonitored (i don't shut down any other process or whole host).
So, in this case Dynatrace doesn't alert me. I have already switched on all default host alert options.
My question is, how can i have an alert if an host become unmonitored, like in image?
Solved! Go to Solution.
First lets ensure that your alert profile has the correct settings. It should look something like this at the very least:
Additionally, you will need to ensure that the host in question falls into the alert profile rules. For example, if you have a Management Zone Scope set for that Alert Profile.
Also take a look at the host settings and ensure you have the following set as a bare minimum.
Lastly, there is a similar set up for processes/services running on a host where you can set, within the settings, to be alerted if a process is turned of or unmonitored as well.
Unfortunately it's a blindspot as the Host availability alerting only covers if the Host is gracefully shut down not if the OneAgent is gracefully shutdown.
One of my customers had issues with their teams turning off OneAgents during maintenance but then oftentimes forgetting to turn them back on. I created an ActiveGate Extension that can be used for a few different use cases:
You can find the extension on GitHub and you are welcome to try this out. I must mention that of course it's not part of the official Dynatrace solution and therefore cannot be diagnosed by Dynatrace Support. As well, any modifications or customisation you may require will have to be done on your own.
Unfortunatly this project has been archived "this repository has been archived. This project needs updating as it has not been maintained in years.". However the solution seems to be tidy. and still relevant.
Starting in v 1.212 this is now possible.
You can set up a "custom event for alerting" (in "Settings / Anomaly detection") and have it alert if monitoring data is missing.
So pick a metric like "CPU usage" (which should always be there), set the threshold to "200" (which should never be reached) and set the drop-down to "Alert" "if data is missing within the above observation period."
Apparently there still isn't a proper (alert) solution for this?
Also, "Onagent process group" is no longer there?