I just installed nagios 3.0.5, and nagios_plugins 1.4.13 (upgraded from 2.0.12 actually) on my CentOS 5.2 box. Everything was configured with default options. Initially, nagios works fine, but after a random amount of time (as low as 10 minutes, as much as 2 hours), nagios reports everything going down all at once, and the nagios.log contains lines like:
[1226082957] Warning: Return code of 127 for check of host ‘shelby’ was out of bounds. Make sure the plugin you’re trying to run actually exists.
[1226082957] HOST ALERT: shelby;DOWN;SOFT;5;(Return code of 127 is out of bounds - plugin may be missing)
[1226082957] Warning: Return code of 127 for check of service ‘FTP’ on host ‘hrbncalibersvr’ was out of bounds. Make sure the plugin you’re trying to run actually exists.
[1226082957] Warning: Return code of 127 for check of service ‘LDAP’ on host ‘kobe’ was out of bounds. Make sure the plugin you’re trying to run actually exists.
if I run “service nagios restart” it finds all the plugins again and marks everything up again. Then later on (maybe 2 hours later) the problem reappears. Any ideas?
I tried compiling nagios with builtin-perl and without builtin-perl, and the same problem happens. Meanwhile, I’ve backed out down to v2.12, so I don’t keep uinundating my whole dept with false alerts.