Nagios monitor server 'cpu_load'


#1

Hi,

When I try to get a report on the service ‘cpu_load’, I see that the state is ‘indeterminate’ 94% of the time. What is causing this, and what can be done to resolve it? I have read through the postings here and also looked at the Nagios documentation.

TIA,
Russell L. Labay
Texas Department of Transportation
System Analyst


#2

Are you trying to monitor the cpu_load on a remote host?


#3

No, Nagios is running locally on this server.


#4

What happens if you run the command from the command line? Any errors?
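
For anyone wanting to try this, here is a minimal sketch of running a check by hand and reading its result. It relies on the standard plugin convention that the exit code carries the state (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN). The helper name `run_check` is made up for illustration, and the plugin path and thresholds in the usage note are assumptions that will differ per install.

```python
import subprocess
import sys

# Standard Nagios plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_check(argv):
    """Run a plugin command; return (state name, first line of its output)."""
    try:
        result = subprocess.run(argv, capture_output=True, text=True)
    except OSError as exc:
        # The plugin is missing or not executable.
        return "UNKNOWN", str(exc)
    output = (result.stdout or result.stderr).strip()
    lines = output.splitlines()
    first_line = lines[0] if lines else ""
    # Any exit code outside 0-3 is treated as UNKNOWN here.
    return STATES.get(result.returncode, "UNKNOWN"), first_line


if __name__ == "__main__":
    if len(sys.argv) > 1:
        print(run_check(sys.argv[1:]))
```

Saved under a hypothetical name such as run_check.py, it could be called as `python run_check.py /usr/lib/nagios/plugins/check_load -w 5,4,3 -c 10,6,4`, with the path adjusted to wherever the plugins live on your system. Running it as the nagios user is closer to what the daemon itself sees.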

Luca


#5

Most likely it’s due to not having any long-term data.
Check your nagios.cfg file for:
retain_state_information=
What is yours set to?
Also important are:
retention_update_interval=
use_retained_program_state=
state_retention_file=
log_file=
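
A quick way to pull just those directives out of the file is sketched below. It assumes the plain `key=value` layout of nagios.cfg with `#` comment lines; the function name `read_settings` and the default path /etc/nagios/nagios.cfg are assumptions, not something taken from this thread.

```python
import sys

# The directives listed above.
WANTED = (
    "retain_state_information",
    "retention_update_interval",
    "use_retained_program_state",
    "state_retention_file",
    "log_file",
)


def read_settings(text, wanted=WANTED):
    """Return {directive: value} for the wanted directives found in text."""
    found = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and anything that is not key=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        key = key.strip()
        if key in wanted:
            found[key] = value.strip()
    return found


if __name__ == "__main__":
    # Assumed default location; pass the real path as the first argument.
    path = sys.argv[1] if len(sys.argv) > 1 else "/etc/nagios/nagios.cfg"
    with open(path) as handle:
        settings = read_settings(handle.read())
    for name in WANTED:
        print(f"{name}={settings.get(name, '(not set)')}")
```

Any directive that prints as "(not set)" is missing from the file or commented out.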

[Edited Thu Dec 01 2005, 10:13AM]


#6

Thanks Jakkedup,

Here is the info in nagios.cfg:

retain_state_information=1
retention_update_interval=60
use_retained_program_state=0 (I just changed this to 1 today)
state_retention_file=/var/log/nagios/status.sav
log_file=/var/log/nagios/nagios.log

This is the Nagios included with Novell’s SLES/OES 9 distribution.

Here are some things from the log file:
Error: Could not create external command file '/var/spool/nagios/nagios.cmd' as named pipe: (17) -> File exists. If this file already exists and you are sure that another copy of nagios is not running, you should delete this file.
[1130448100] Bailing out due to errors encountered while trying to open the external command file for reading… (PID=17854).

(These messages repeated four times)

So I stopped Nagios, looked in the /var/spool/nagios/ directory, and saw that nagios.cmd had a timestamp of Jul 2nd. I deleted that file and started Nagios up again.

It’s reporting that the server is in the ‘Ok’ state now.
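
For reference, the "(17) -> File exists" in the log is the usual errno you get when a named pipe is created at a path that is already taken. The sketch below reproduces that in a temporary directory and shows one way to check whether a leftover file is a pipe and how old it is. The helper names are made up for illustration, and it assumes a Unix-like system where `os.mkfifo` is available.

```python
import errno
import os
import stat
import tempfile
import time


def inspect_command_file(path):
    """Report whether path exists, is a named pipe, and its age in days."""
    try:
        info = os.stat(path)
    except FileNotFoundError:
        return {"exists": False, "is_fifo": False, "age_days": None}
    age_days = (time.time() - info.st_mtime) / 86400
    return {
        "exists": True,
        "is_fifo": stat.S_ISFIFO(info.st_mode),
        "age_days": age_days,
    }


def try_create_pipe(path):
    """Try to create a named pipe; return 0 on success, else the errno."""
    try:
        os.mkfifo(path)
    except OSError as exc:
        return exc.errno
    return 0


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        demo = os.path.join(tmp, "nagios.cmd")  # stand-in for the real file
        print(try_create_pipe(demo))                  # first attempt succeeds
        print(try_create_pipe(demo) == errno.EEXIST)  # second one hits "File exists"
        print(inspect_command_file(demo))
```

As the log message itself says, only delete the real file after confirming that no other copy of Nagios is running.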

Thanks for the help,
Russell


#7

Cool, way to stick with it and resolve it.