Hope someone can help me with this one.
I have a Nagios installation which monitors approximately 40 servers.
These servers all have between 1 and 32 service checks running on them.
The bulk of these service checks have been implemented using SNMP (Corporate requirement) and occassionally the SNMP Daemon on a server will stop responding and need restarting.
I have setup service dependencies for all these services to a service called ‘SNMP’ on each server in the hopes that if the daemon dies I get one alert telling me it’s dead instead of 20 alerts complaining about every service on the box.
Unfortunately as far as I can see Service Dependencies use the current known state for the SNMP Service.
This would work if the SNMP service got checked before the others but that isn’t guaranteed.
I therefore can get notifications for the other services up until SNMP is checked at its scheduled interval at which point the notifications stop until the SNMP Daemon is restarted.
What I want to know is if there is some way to make Nagios test SNMP automatically on the failure of one of it’s dependent services.
Any and all ideas appreciated!