<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Monitor batch jobs that reoccur &gt; 60 minutes in Open Q&amp;A</title>
    <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251642#M33230</link>
    <description>&lt;P&gt;On our side, we have created an extension that looks at the events and, when the configured conditions are met, generates problems.&lt;/P&gt;&lt;P&gt;I'm pretty sure there is a Product Idea for this. It is especially important for business metrics, some of which might have timeframes as long as one day...&lt;/P&gt;</description>
    <pubDate>Thu, 25 Jul 2024 18:08:59 GMT</pubDate>
    <dc:creator>AntonioSousa</dc:creator>
    <dc:date>2024-07-25T18:08:59Z</dc:date>
    <item>
      <title>Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/176590#M19180</link>
      <description>&lt;P&gt;Any thoughts on monitoring a batch job that runs, say, every 6 hours, 24 hours, or 168 hours?&amp;nbsp; The custom event for alerting feature only allows a rolling window of up to 60 minutes.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have two options to report status.&lt;/P&gt;
&lt;P&gt;1) We build a Metrics v2 API call into our batch job that stores a status-as-a-metric every time the job runs.&lt;/P&gt;
&lt;P&gt;2) We monitor a log file for the status value.&lt;/P&gt;
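&lt;P&gt;A minimal sketch of option 1, assuming the job can make an HTTPS call when it finishes. The metric key batch.job.status, the job dimension, and both helper names are hypothetical. The endpoint path /api/v2/metrics/ingest, the plain-text line format, and the Api-Token header follow the Metrics v2 ingest API as commonly documented, so verify them against your own environment before relying on this.&lt;/P&gt;

```python
import urllib.request


def build_metric_line(job_name: str, status: int, metric_key: str = "batch.job.status") -> str:
    """Format one line of the ingest line protocol: key,dimension="value" payload.

    The metric key and the "job" dimension are illustrative names, not fixed ones.
    """
    return f'{metric_key},job="{job_name}" {status}'


def build_ingest_request(base_url: str, api_token: str, line: str) -> urllib.request.Request:
    """Prepare (but do not send) the POST request that would ingest the line."""
    return urllib.request.Request(
        url=f"{base_url}/api/v2/metrics/ingest",
        data=line.encode("utf-8"),
        headers={
            "Authorization": f"Api-Token {api_token}",
            "Content-Type": "text/plain; charset=utf-8",
        },
        method="POST",
    )


# Example: report status 1 (success) for a hypothetical job name.
line = build_metric_line("nightly_export", 1)
request = build_ingest_request("https://example.invalid", "TOKEN", line)
```

&lt;P&gt;Sending the request would then be a call to urllib.request.urlopen(request) at the end of the job. A job that crashes before reaching that call reports nothing at all.&lt;/P&gt;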
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Issue: in both cases, if a catastrophic failure occurs, no status will be reported.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The alert needs to trigger if the metric stops reporting for 6, 12, or 168 hours, or if a metric value of 0 (or less than 1) is reported within that same window.&lt;/P&gt;
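&lt;P&gt;The two alert conditions can be written down as a small check. This is only a sketch of the desired logic, not something Dynatrace provides: the function name should_alert and the (timestamp, value) sample format are assumptions, and the check would have to run on a schedule outside the batch job itself.&lt;/P&gt;

```python
from datetime import datetime, timedelta, timezone


def should_alert(samples, now, window_hours):
    """Return a reason string if the job looks unhealthy, otherwise None.

    samples: list of (timestamp, value) tuples reported by the batch job.
    window_hours: length of the look-back window, e.g. 6, 12 or 168.
    """
    window_start = now - timedelta(hours=window_hours)
    # Keep only the values reported inside the look-back window.
    recent = [value for ts, value in samples if ts >= window_start]
    if not recent:
        # Condition 1: the metric stopped reporting for the whole window.
        return "no status reported in window"
    if not all(value >= 1 for value in recent):
        # Condition 2: at least one value of 0 (or anything below 1) was reported.
        return "failure status reported in window"
    return None


# Example: the last report is 7 hours old, so a 6-hour window sees no data.
now = datetime(2021, 11, 29, 12, 0, tzinfo=timezone.utc)
reason = should_alert([(now - timedelta(hours=7), 1)], now, 6)
```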
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have no way of obtaining a heartbeat or status between executions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thoughts?&lt;/P&gt;</description>
      <pubDate>Mon, 29 Nov 2021 10:17:58 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/176590#M19180</guid>
      <dc:creator>ct_27</dc:creator>
      <dc:date>2021-11-29T10:17:58Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/176602#M19181</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I've been trying to solve these kinds of use cases for a very long time (with AppMon and Dynatrace). A very common example is periodic CronJobs on the Hybris commerce platform.&lt;/P&gt;&lt;P&gt;In the end I found a working solution, although it involves quite a bit of "external" logic. The general process for me is the following:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Find a way to report the status of the job execution: the job start and the job end.&lt;BR /&gt;That is, do not try to track the job itself as a PurePath; only track the status changes and use a request attribute to get the status. This can be tricky and really depends on the executing code.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2021-11-29 at 10.42.54.png" style="width: 999px;"&gt;&lt;img src="https://community.dynatrace.com/t5/image/serverpage/image-id/3563i364C6D2398527D36/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2021-11-29 at 10.42.54.png" alt="Screen Shot 2021-11-29 at 10.42.54.png" /&gt;&lt;/span&gt;&lt;/LI&gt;&lt;LI&gt;Find a way to track the result of a job or the failure condition. 
In my case the possible results are FAILURE, ERROR, OK, and SUCCESS.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screen Shot 2021-11-29 at 10.46.48.png" style="width: 999px;"&gt;&lt;img src="https://community.dynatrace.com/t5/image/serverpage/image-id/3564i37CE953300899A4C/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screen Shot 2021-11-29 at 10.46.48.png" alt="Screen Shot 2021-11-29 at 10.46.48.png" /&gt;&lt;/span&gt;&lt;/LI&gt;&lt;LI&gt;Define custom error rules (on key requests) that treat requests as errors or successes based on the result.&lt;/LI&gt;&lt;LI&gt;Define custom metrics for successful executions of these jobs, split by job name.&lt;/LI&gt;&lt;LI&gt;Define custom error events, but with static thresholds. As you said, the rolling window (see the &lt;A href="https://www.dynatrace.com/support/help/dynatrace-api/environment-api/metric-v2/metric-selector/#rollup-transformation-" target="_self"&gt;new rollup metric function&lt;/A&gt;) doesn't allow for very long timeframes, but it helps in many cases.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;For more detailed evaluation I'm using my &lt;A href="https://www.dynatrace.com/hub/detail/timeseries-streamer/?offered=partners" target="_self"&gt;timeseries streamer&lt;/A&gt; to get the above data out of Dynatrace and into a time series database (InfluxDB in my case). There I can use the full logic of Flux queries to track things like:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Job execution time: the calculated time between different status events. In Dynatrace these would be two independent PurePaths without any connection; in InfluxDB I can just search for two consecutive events of the same job with different statuses (start, finished) and calculate the duration.&lt;/LI&gt;&lt;LI&gt;Job current status: same as above, calculate whether a job has a started event but no finished event. If the finished event is missing, the job is still running. 
This also allows defining events: if a job runs too long, you could fire a webhook from your query and ingest an error into Dynatrace &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/LI&gt;&lt;LI&gt;Perform any logic to identify whether a job ran in a given period (that is, whether any status time series has been written). If not, send an event to Dynatrace that data is missing =&amp;gt; alert.&lt;/LI&gt;&lt;LI&gt;Visualization bonus: once the data is out of Dynatrace, one can also use some advanced visualization of the data, e.g. a swimlane representation of the individual job executions, frequency, and duration like this. This helped me and the monitoring teams massively to understand which jobs are running when and where. Combine this with some coloring for job results and you have a good way of showing successes/failures.&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="r_weber_0-1638180401244.png" style="width: 400px;"&gt;&lt;img src="https://community.dynatrace.com/t5/image/serverpage/image-id/3565i562A0473C99DAF3B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="r_weber_0-1638180401244.png" alt="r_weber_0-1638180401244.png" /&gt;&lt;/span&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My bottom line: it is (still) tricky to trigger events in Dynatrace based on low-frequency or missing metrics. For that purpose I get data out of Dynatrace into a system where I can perform advanced data manipulation and logic for alerting or visualization.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;kr,&lt;/P&gt;&lt;P&gt;Reinhard&lt;/P&gt;</description>
      <pubDate>Mon, 29 Nov 2021 10:09:45 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/176602#M19181</guid>
      <dc:creator>r_weber</dc:creator>
      <dc:date>2021-11-29T10:09:45Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251589#M33220</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;The rollup transformation could be the solution; at first glance, a Workflow is not necessarily needed.&amp;nbsp;&lt;BR /&gt;Please see my post:&lt;BR /&gt;&lt;A href="https://community.dynatrace.com/t5/Dynatrace-tips/Hack-of-the-week-expand-Metric-Event-duration-simply-with-rollup/m-p/251588#M1317" target="_blank"&gt;[Hack of the week] expand Metric Event duration simply with rollup - Dynatrace Community&lt;/A&gt;&lt;BR /&gt;Hope it helps.&lt;BR /&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 09:44:45 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251589#M33220</guid>
      <dc:creator>uros_djukic1</dc:creator>
      <dc:date>2024-07-25T09:44:45Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251602#M33222</link>
      <description>&lt;P&gt;Nowadays we solve this with Business Events and flexible analysis of those, combined with Site Reliability Guardians (SRGs).&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 10:46:22 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251602#M33222</guid>
      <dc:creator>r_weber</dc:creator>
      <dc:date>2024-07-25T10:46:22Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251610#M33223</link>
      <description>&lt;P&gt;Yes indeed. For additional context, this rollup workaround works on both platforms, SaaS and Managed.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 11:27:15 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251610#M33223</guid>
      <dc:creator>uros_djukic1</dc:creator>
      <dc:date>2024-07-25T11:27:15Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251642#M33230</link>
      <description>&lt;P&gt;On our side, we have created an extension that looks at the events and, when the configured conditions are met, generates problems.&lt;/P&gt;&lt;P&gt;I'm pretty sure there is a Product Idea for this. It is especially important for business metrics, some of which might have timeframes as long as one day...&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 18:08:59 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251642#M33230</guid>
      <dc:creator>AntonioSousa</dc:creator>
      <dc:date>2024-07-25T18:08:59Z</dc:date>
    </item>
    <item>
      <title>Re: Monitor batch jobs that reoccur &gt; 60 minutes</title>
      <link>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251645#M33231</link>
      <description>&lt;P&gt;We gave up waiting for Dynatrace and deployed a local instance of&amp;nbsp;&lt;A href="https://healthchecks.io/" target="_blank"&gt;https://healthchecks.io/&lt;/A&gt;.&amp;nbsp; Cronitor is another good option.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Jul 2024 18:34:57 GMT</pubDate>
      <guid>https://community.dynatrace.com/t5/Open-Q-A/Monitor-batch-jobs-that-reoccur-gt-60-minutes/m-p/251645#M33231</guid>
      <dc:creator>ct_27</dc:creator>
      <dc:date>2024-07-25T18:34:57Z</dc:date>
    </item>
  </channel>
</rss>

