We have recently experienced a job trigger in Talend which was stuck in the 'misfired' state and never got an alert for it. We usually have email alerts if the job fails, but since it never finished, the subsequent runs were never triggered.
Is it possible to set something up where there is an alert if a job did not run over a period of time? (Not if it failed, just if it never finished) Possibly with custom script or code?
I have seen something like this done at another customer where they configured their job-completion step to write a file to specific directory. Then they had a cron job which checked the timestamp on that file, and if it was greater than X from currentTime() then used the API to trigger an alert in AppMon "Job failed to run". Hope this gives you some ideas to work with.