Hi guys,
I had this odd problem recently where a single service for one of my hosts insisted on being checked locally by my central nagios server (distributed setup, running nagios 3.0). This resulted in the service flapping a lot, as the remote server would submit a correct OK alert, then the local server would perform the check which would fail.
I eventually fixed it by just going into the web interface and clicking on ‘disable active checks for this service’, but I dont understand why it started happening for this one service (obviously tried restarting nagios etc). If it starts happening to more of my services this could become a problem. If anyone has any ideas it would be much appreciated
define service {
use generic-service
host_name CTX05
service_description DISKSPACE_M
check_command check_nt_disk!M!80!90
register 1
contact_groups cg2
notification_period 24x7-rebootsat3
active_checks_enabled 0
passive_checks_enabled 1
}
define service{
name generic-service
active_checks_enabled 0
passive_checks_enabled 1
parallelize_check 1
obsess_over_service 1
check_freshness 0
notifications_enabled 1
event_handler_enabled 0
flap_detection_enabled 0
failure_prediction_enabled 1
process_perf_data 1
retain_status_information 1
retain_nonstatus_information 1
is_volatile 0
check_period 24x7
max_check_attempts 3
normal_check_interval 3
retry_check_interval 1
contact_groups cg1
notification_options w,c,r
notification_interval 0
notification_period 24x7
register 0
}
for comparison, this is the config for a service which was/is working as expected:
define service {
use generic-service
host_name CTX05
service_description DISKSPACE_N
check_command check_nt_disk!N!80!90
register 1
contact_groups cg2
notification_period 24x7-rebootsat3
active_checks_enabled 0
passive_checks_enabled 1
}