forums.meulie.net

Nagios monitor server 'cpu_load'

Hi,

When I run a report on the ‘cpu_load’ service, I see that the state is ‘indeterminate’ 94% of the time. What is causing this, and what can be done to resolve it? I have read through the postings here and also looked at the Nagios documentation.

TIA,
Russell L. Labay
Texas Department of Transportation
System Analyst

Are you trying to monitor the cpu_load on a remote host?

No, Nagios is running locally on this server.

What happens if you run the command from the command line? Any errors?
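
A minimal sketch of running a check by hand and reading its result, assuming the usual Nagios plugin convention that exit codes 0 to 3 mean OK, WARNING, CRITICAL and UNKNOWN. The helper name `run_check`, the plugin path and the thresholds in the usage comment are assumptions, not taken from this thread; use whatever command your ‘cpu_load’ service definition actually runs.

```python
import subprocess

# Exit codes defined by the Nagios plugin convention.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}

def run_check(argv):
    """Run a plugin command and return (state_name, first_line_of_output)."""
    proc = subprocess.run(argv, capture_output=True, text=True)
    # Any exit code outside 0..3 is treated as UNKNOWN here.
    state = STATES.get(proc.returncode, "UNKNOWN")
    lines = (proc.stdout or proc.stderr).strip().splitlines()
    return state, (lines[0] if lines else "")

# Hypothetical usage; the path and thresholds are assumptions:
# print(run_check(["/usr/lib/nagios/plugins/check_load",
#                  "-w", "5,4,3", "-c", "10,6,4"]))
```

If the command prints a sensible line and returns one of the four states when run this way, the check itself is probably fine and the problem lies elsewhere.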

Luca

Most likely it’s due to not having any long term data.
Check your nagios.cfg file for:
retain_state_information=
What is yours set to?
Also important are:
retention_update_interval=
use_retained_program_state=
state_retention_file=
log_file=
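
A quick way to pull just those directives out of the main config file, as a rough sketch. It assumes plain `key=value` lines with `#` comments; the function name `read_settings` and the path in the usage comment are made up for illustration, so adjust them for your install.

```python
RETENTION_KEYS = (
    "retain_state_information",
    "retention_update_interval",
    "use_retained_program_state",
    "state_retention_file",
    "log_file",
)

def read_settings(cfg_text, keys=RETENTION_KEYS):
    """Return {key: value} for the requested directives found in cfg_text."""
    found = {}
    for raw in cfg_text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key.strip() in keys:
            found[key.strip()] = value.strip()
    return found

# Hypothetical usage; the config path is an assumption:
# with open("/etc/nagios/nagios.cfg") as fh:
#     for k, v in read_settings(fh.read()).items():
#         print(f"{k}={v}")
```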

[Edited Thu Dec 01 2005, 10:13AM]

Thanks Jakkedup,

Here is the info in nagios.cfg:

retain_state_information=1
retention_update_interval=60
use_retained_program_state=0 (I just changed this to 1 today)
state_retention_file=/var/log/nagios/status.sav
log_file=/var/log/nagios/nagios.log

This is the Nagios included with Novell’s SLES/OES 9 distribution.

Here are some things from the log file:
Error: Could not create external command file '/var/spool/nagios/nagios.cmd' as named pipe: (17) -> File exists. If this file already exists and you are sure that another copy of nagios is not running, you should delete this file.
[1130448100] Bailing out due to errors encountered while trying to open the external command file for reading… (PID=17854).

(These messages repeated four times)

So I stopped Nagios, looked in the /var/spool/nagios/ directory, and saw that nagios.cmd had a timestamp of Jul 2nd. I deleted that file and started Nagios up again.

It’s reporting that the server is in the ‘Ok’ state now.
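
For anyone hitting the same thing, the check for a leftover command file can be sketched roughly like this. It only tests whether the path is a named pipe with an old timestamp; it does not check whether Nagios is running, so stop Nagios first. The function name `is_stale_fifo` and the one-day threshold are assumptions chosen for illustration.

```python
import os
import stat
import time

def is_stale_fifo(path, max_age_seconds=24 * 3600):
    """True if path is a named pipe whose mtime is older than max_age_seconds."""
    try:
        st = os.stat(path)
    except FileNotFoundError:
        return False
    if not stat.S_ISFIFO(st.st_mode):
        return False
    return (time.time() - st.st_mtime) > max_age_seconds

# Usage: only remove the pipe after confirming Nagios is stopped.
# cmd_file = "/var/spool/nagios/nagios.cmd"
# if is_stale_fifo(cmd_file):
#     os.remove(cmd_file)
```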

Thanks for the help,
Russell

Cool, way to stick with it and resolve it.