27 Mar 2026
11:40 AM
- last edited on
30 Mar 2026
08:04 AM
by
MaciejNeumann
Recently, our Dynatrace Managed on-prem backup failed for several days. While the status tile in the Cluster Management Console (CMC) turned red, we did not receive any proactive notifications, leading to a gap in our backup history.
I am looking for the most reliable way to alert our team when a backup fails. I have the following questions:
Native Alerting: Is there a built-in 'Event' or 'Problem' type in Dynatrace Managed that can be triggered specifically by backup failures (for both Cassandra and Elasticsearch)?
Log-based Alerting: If we choose to use logs for alerting:
In which specific log files (on which nodes) is the backup status (success/failure) recorded?
What is the recommended method to ingest these local cluster logs back into Dynatrace for log-based alerting without creating a loop?
API Approach: Is it considered best practice to poll the /clusterapi/v1.0/backups/status endpoint for monitoring, and can this be integrated into a standard Dynatrace Alerting Profile?
Alternative: Are there specific 'Self-Monitoring' metrics available that we can use to create a Custom Chart and Alert for backup health?
Solved! Go to Solution.
27 Mar 2026 02:29 PM
I haven't find other source about cluster backup status than Server.0.0.log logs on cluster nodes.
30 Mar 2026 08:57 AM
I’m not aware of a native built-in Problem/Event type dedicated specifically to failed Cassandra/Elasticsearch backups in Dynatrace Managed that you can directly wire into a standard Alerting Profile. From the documentation and Community threads I could find, backup health is mainly checked through CMC status, node logs, and administrative endpoints/commands, rather than through a ready-made backup-failure problem type.
here is topis releated with backup troublehooting:
Managed Cluster backup troubleshooting - most common issues
From a practical perspective, the most reliable options today are:
Polling /clusterapi/v1.0/backups/status on a schedule, or running your own health-check script.
Log-based monitoring if you want to detect explicit backup failures from cluster logs.
Community feedback points mainly to Server.0.0.log on cluster nodes as the place where backup-related information can be found.
Here you have all endpoints:
Perform Infrastructure Health Checks in Dynatrace Managed
Base on this topisc my recommendation would be:
For logs:
Regarding self-monitoring metrics, I think dedicated backup-health metric that supports out-of-the-box alerting not exist. IMO API-based monitoring + custom alerting rather than searching for a hidden built-in metric is better option
08 Apr 2026 08:18 AM
Dear t_pawlak
Thank you for your detailed answer. I will analyze it, and hopefully, I can easily implement the solution in our organization.
Featured Posts