Nagios 3.0.5 forgets where plugins are after a while


#1

I just installed nagios 3.0.5, and nagios_plugins 1.4.13 (upgraded from 2.0.12 actually) on my CentOS 5.2 box. Everything was configured with default options. Initially, nagios works fine, but after a random amount of time (as low as 10 minutes, as much as 2 hours), nagios reports everything going down all at once, and the nagios.log contains lines like:

[1226082957] Warning: Return code of 127 for check of host ‘shelby’ was out of bounds. Make sure the plugin you’re trying to run actually exists.
[1226082957] HOST ALERT: shelby;DOWN;SOFT;5;(Return code of 127 is out of bounds - plugin may be missing)
[1226082957] Warning: Return code of 127 for check of service ‘FTP’ on host ‘hrbncalibersvr’ was out of bounds. Make sure the plugin you’re trying to run actually exists.
[1226082957] Warning: Return code of 127 for check of service ‘LDAP’ on host ‘kobe’ was out of bounds. Make sure the plugin you’re trying to run actually exists.

if I run “service nagios restart” it finds all the plugins again and marks everything up again. Then later on (maybe 2 hours later) the problem reappears. Any ideas?

I tried compiling nagios with builtin-perl and without builtin-perl, and the same problem happens. Meanwhile, I’ve backed out down to v2.12, so I don’t keep uinundating my whole dept with false alerts.


#2

I think I may have fixed it already. The file /usr/local/nagios/etc/resource.cfg was installed owner/group root/root perms 640, so user nagios couldn’t read the file whenever “service nagios reload” was run.

I changed the file to group nagcmd and it’s working fine now.

3.0.5 is SO MUCH FASTER than 2.0.12 was. Thanks guys!