cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

How to alert on usually monitored services that become unmonitored?

vlazarova
Visitor

Hi,

 

We are monitoring certain application services on Windows hosts, and we receive alerts when they (the services) are not running for some reason. However, we don't receive alerts when they become "unmonitored". How can this be set? Is it through "process availability" for the whole process group?

 

Example of the issue:

1. Service "X" is being monitored by Dynatrace

2. During Windows patching service "X" is stopped, the server is rebooted, etc. 

3. After the server reboot the service doesn't come up, because it has "Manual" as startup type

4. An hour or two after that the server crashes unexpectedly and reboots itself again. Service "X" doesn't come up again (set to "Manual")

5. Since the server crash Dynatrace shows service "X" as "unmonitored" rather than "shutdown". The service started showing as running (respectively monitored) only after it was manually started on the server.

 

So, I guess my question is if there's a way for Dynatrace to raise a problem once a usually monitored services becomes unmonitored and how that can be achieved.

 

Thanks in advance!

 

1 REPLY 1

mark_bley
Dynatrace Mentor
Dynatrace Mentor

Hi @vlazarova

 

there is a new feature you can find under Settings > OS service monitoring

Here you can define a rule specific to that service or e.g. in general all Manual Windows OS Services like follows:

 "value": {
        "enabled": true,
        "system": "WINDOWS",
        "name": "Manual Windows OS Services",
        "monitoring": true,
        "alerting": true,
        "statusConditionWindows": "$not($eq(running))",
        "detectionConditionsWindows": [
          {
            "property": "StartupType",
            "startupCondition": "$eq(manual)"
          }
        ],
        "metadata": []
      }

Or in the UI it would look like this

mark_bley_0-1661969694850.png

Feel free to add more rules e.g. for a specific service only you may adda rule for its name.

Then in terms of alerting you can play around with alerting profiles if you don't want to get notified instantly in case the service takes longer to come back

 

Best,

Mark