Aside from using job events/email actions to alert on job failures, what other tools are being used to monitor jobs and their critical path flows?
If you are using a tool outside of the Tidal console, what is the tool or tools you are using and what benefits are you getting from each of them?
We only use job event emails and Nagios-triggered shell scripts that query the Tidal DB directly for the specific things we want to check. I think you can use SNMP traps with Tidal; we just don't do it.
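For anyone who wants to try the Nagios approach, here is a rough sketch of what such a check could look like, written in Python rather than shell. The thresholds and the names `evaluate_failed_jobs` and `fetch_failed_count` are made up for illustration, and the database query is deliberately left as a placeholder because the Tidal schema is site-specific and not described in this thread. The only part taken from Nagios itself is the standard plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def evaluate_failed_jobs(failed_count, warn=1, crit=5):
    """Map a failed-job count to a Nagios (exit_code, message) pair.

    The warn/crit thresholds are illustrative defaults, not Tidal settings.
    """
    if failed_count is None or failed_count < 0:
        return UNKNOWN, "UNKNOWN - could not determine failed job count"
    perfdata = f"failed={failed_count}"
    if failed_count >= crit:
        return CRITICAL, f"CRITICAL - {failed_count} failed jobs | {perfdata}"
    if failed_count >= warn:
        return WARNING, f"WARNING - {failed_count} failed jobs | {perfdata}"
    return OK, f"OK - no failed jobs | {perfdata}"


def fetch_failed_count():
    """Hypothetical placeholder: replace with your own query against the
    scheduler database. No real table or column names are assumed here."""
    raise NotImplementedError("site-specific query goes here")


def main():
    """Run the check and return the exit code Nagios expects."""
    try:
        count = fetch_failed_count()
    except Exception:
        # Any lookup failure is reported as UNKNOWN rather than OK.
        count = None
    code, message = evaluate_failed_jobs(count)
    print(message)
    return code
```

To use it as a plugin, finish the script with `import sys; sys.exit(main())` so Nagios sees the exit code. Keeping the threshold logic separate from the query makes it easy to test without a database connection.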
We use an external event handler (CorreLog) to handle complex actions. From TES we create a job event that sends a message to CorreLog, including all the parameters necessary to trigger the actions.
Many of these actions are sacmd commands, SMS (text messages) to operators, etc.
This way we can achieve a much higher level of automation than we can with TES events alone.
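As a rough illustration of the idea (not the actual setup described above), a job event could call a small script that packs the job details into a single message and forwards it to the event handler. The sketch below assumes the handler accepts syslog over UDP, which is an assumption on my part; the `TES_EVENT` prefix, the field names, and both function names are hypothetical.

```python
import socket


def build_event_message(job_name, status, **params):
    """Build a key="value" message carrying the parameters the external
    handler needs. The TES_EVENT prefix and field names are hypothetical."""
    fields = {"job": job_name, "status": status, **params}
    parts = []
    for key, value in fields.items():
        # Replace double quotes so each value stays inside its own quotes.
        text = str(value).replace('"', "'")
        parts.append(f'{key}="{text}"')
    return "TES_EVENT " + " ".join(parts)


def send_syslog_udp(message, host, port=514, facility=16, severity=3):
    """Send the message as a minimal syslog-style UDP datagram.

    Assumes the handler listens for syslog on UDP; facility 16 (local0)
    and severity 3 (error) are illustrative defaults.
    """
    pri = facility * 8 + severity  # syslog priority value
    payload = f"<{pri}>{message}".encode("utf-8")
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))
```

For example, `build_event_message("nightly_load", "Error", action="sms", operator="oncall")` returns `TES_EVENT job="nightly_load" status="Error" action="sms" operator="oncall"`, which the handler could then match on to decide which action to run.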