Alerting
Questions about alerting and problem detection in Dynatrace.
cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

Dealerting not working properly

Vijayt
Guide

Hi 

 

I have created to a metric event with  following alert model 

Violating 15 Sample 20  dealerting  20 

 

10:59Analysis timeframe start. Davis evaluates performance across timeframe (for example, 3 of 5 minutes show anomalous performance). No event reported at this time.
11:21Earliest event in analysis timeframe reported, backdated to analysis timeframe start. Problem is in processing state awaiting completion of topology analysis. Problem is raised and pushed to Dynatrace web UI and Problems API.
11:21Davis completes processing of the problem. Alerts are delivered via alerting profiles.
11:25Problem closed. Davis confirms the problem is resolved after observing normal conditions. The problem was in the open state for 11 minutes (the duration is checked against alerting profile duration criteria).


problem created at 11.21 . as per rule even if it is solved by 11.22 . as per dealerting it should close problem by 11.22+20 1mt sample = 11.44  . Is my understanding correct ?

2 .  is there any variable for metric event name to specify in metric description ? 

3 REPLIES 3

t_pawlak
Champion

Hi,
if I correctly understand your description, 

Your settings mean:

  • Violating: 15 of 20 → Davis opens a problem when, in the last 20 samples, at least 15 are violating.
  • Dealerting: 20 → Davis will close the problem only after there are ≥ 20 non-violating samples and, at the same time, the current 20-sample window has < 15 violating samples.

This is based on samples, not minutes.
If your metric is sampled every 1 minute, then after recovery you typically need about 20 minutes of normal data (plus processing time) — but:

  • If your sampling interval is shorter (e.g., every 10s), you’ll reach 20 samples much faster.
  • If violations dropped earlier than you think, the rolling 20-sample window may already satisfy “< 15 violating” before your “11:22” point, so closure can happen sooner.
  • Davis adds a bit of processing time (seconds to a minute or two) when it evaluates and closes.

So “11:22 + 20 minutes = 11:42” is only correct if you truly have 1-minute sampling and you started counting exactly at the first non-violating sample. Your observed 11:25 closure suggests either a shorter sampling interval or that the rolling 20-sample window had already accumulated enough normal samples sooner than assumed.

Here is documentation Anomaly detection configuration 

2) There isn’t a dedicated placeholder for the metric event rule’s name itself. In titles/descriptions you can reference the metric’s display name and other context (current value, threshold, unit, dimensions, entity), but not the rule name.

Can we control sample interval  ?   i dont see any option advanced model says 

//The number of one-minute samples within the evaluation window that must violate to trigger an event//

No — for metric events the sampling interval is fixed to 1-minute samples.

Featured Posts