Hi
I am managing a central server with 10+ remote servers using nsca. I wanted to enable freshenss checking on the central server to identify services that have not reported within a reasonable amount of time. Checks are scheduled on the remote servers for every 5 minutes.
Here is my service:
define service{
name rem-service
active_checks_enabled 0
passive_checks_enabled 1
parallelize_check 1
obsess_over_service 0
check_freshness 1
freshness_threshold 60
notifications_enabled 1
event_handler_enabled 1
flap_detection_enabled 0
process_perf_data 1
retain_status_information 1
retain_nonstatus_information 1
check_period 24x7
max_check_attempts 3
normal_check_interval 5
retry_check_interval 1
contact_groups staff
notification_interval 120
notification_period 24x7
notification_options w,u,c,r
is_volatile 0
register 0
}
Service definition
define service{
use rem-service ; Name of service template to use
host_name REM-host1,REM-host2,REM-host3
service_description PING
** check_command Service-Check-Freshness**
}
here is my check command
define command{
command_name Service-Check-Freshness
command_line $USER1$/checkfreshness.sh
}
**here is my check command script: **
#!/bin/sh
/bin/echo “CRITICAL: Results of service check was not reported!”
exit 2
here is my nagios.cfg entries
check_service_freshness=1
service_freshness_check_interval=60
The gui shows the following entries
Last Check Time: 02-02-2006 23:16:36
Status Data Age: 0d 1h 36m 34s
I am not receiving any notification for my service that has not reported for over 1 hour. I removed the test checks in the remote servers to ensure that the checks would not be sent from NSCA. Can anyone identify why I am not getting notifications of the stale service checks?
Edited ]