Hi Recently we experienced issues with few servers where in one agent(version 1.205) goes down suddenly for no reason with log saying one agent stopped manually. We could not find any issues with host as well as it is up and running continiously its is only the one agent that is getting stopped. Is there any way weherin we could alert if something happens to one agent on host. Alerting of one agent goes down so tah we can atleast restart it immediately.
Solved! Go to Solution.
The Unmonitored availability state indicates that OneAgent isn't running on the host.
Thanks Babar , But we have not received any alerts for one agent getting stopped. when no of servers is more , It takes some effort to find out whats is missing in UI. we need some way to identify when onegent get stopped.
Note : In above cases we have tried enabling alert on graceful shutdown as well but this didnt work as it is dependt on host and not One agent.
you can create an alerting profile, from settings go to alerting => add alerting profile => select the management zone that you need to monitor => add event filter => select predefined then from filter problems choose Host or monitoring unavailable, also you can choose monitoring unavailable if you need to get an alert if OneAgent isn't running on the host.
Upon graceful process stop, Dynatrace will not create a problem as this will consider that you have shutdown the process. This is to reduce false alerts.
Even still if you want to get the alert, you should go inside the process and turn on its availability monitoring then whenever OneAgent process stop you will get the alert - This is deliberately turned off by default in Dynatrace.
@thasthakir No we wouldn't. We will getUnmonitored availability because the moment OneAgent goes down Dynatrace will lost their connectivity and get Unmonitored availability alert but not related to specific OneAgent.
@Sujit Thanks for your quick response.
Let me make it clear, If some user stops the one agent service or if the one agent hangs and crashes itself. we won't get an alert that one agent is down but we will still get the Unmonitored availability alert?
Can you please confirm?
@Sujit Thank you ! How ever when we test by stopping the one agents we are not getting the Unmonitored availability alert as well.
Can you please let me know if there is any specific setting for it ?
@thasthakir Can you run below command (if its linux) on the same host where you stopped user stopped oneagent...
service oneagent status
Setting -> Monitoring Overview -> Enter the host name and see whether its Monitored or Unmonitored.
I was talking about the process availability monitoring. It is possible to get the alert when OneAgent is down,
Please refer the below screenshot,
I hope this will answer the question.