When I try and get a report on the service ‘cpu_load’ I see that 94% of the time, the state is ‘indeterminate.’ What is causing this, and what can be done to resolve it? I read through the postings here and also looked at the nagios documentation.
TIA,
Russell L. Labay
Texas Department of Transportation
System Analyst
Most likely it’s due to not having any long term data.
Check your nagios.cfg file for:
retain_state_information=
What is yours set to?
Also important are:
retention_update_interval=
use_retained_program_state=
state_retention_file=
log_file=
retain_state_information=1
retention_update_interval=60
use_retained_program_state=0 (I just changed this to 1 today)
state_retention_file=/var/log/nagios/status.sav
log_file=/var/log/nagios/nagios.log
This is the nagios included with the Novell’s SLES/OES 9 Distribution.
Here are some things from the log file:
Error: Could not create external command file
’/var/spool/nagios/nagios.cmd’ as named pipe: (17) -> File exists. If this file already exists and you are sure that another copy of nagios is not running, you should delete this file.
[1130448100] Bailing out due to errors encountered while trying to open the external command file for reading… (PID=17854).
(These messages repeated four times)
So I stopped nagios. Looked in the /var/spool/nagios/ directory and saw that nagios.cmd had a timestamp of Jul 2nd. Deleted that file, and started nagios up again.
It’s reporting that the server is in the ‘Ok’ state now.