Near as I can tell, Nagios logs a failed service check with “Service check did not exit properly” when it can’t execute the service check plugin. Kind of a poorly worded log message…
I keep getting this failure at random, about once a day, with check_snmp_if, and I can’t figure out why.
I first thought it might be a timeout, but Nagios logs those as timeouts.
I then thought it was a process limit, but I’m so under limits it hurts.
So I got cute and wrote a wrapper that logs when it starts, plus any output from check_snmp_if or the exact error if it can’t execute it. Welp, it happened again, and nothing in my logs indicates what went wrong; all I see are valid runs.
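A wrapper of that kind could look roughly like the Python sketch below. It is an illustration only, not the actual script: the log path, the function names (`log`, `run_and_log`, `main`), and the choice to record the PID, parent PID, and exit status are all assumptions. The one convention it relies on is the standard plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
#!/usr/bin/env python3
"""Hypothetical logging wrapper around a Nagios plugin (a sketch only)."""
import os
import subprocess
import sys
import time

LOG_PATH = "/tmp/check_wrapper.log"  # assumed path; must be writable by the nagios user
UNKNOWN = 3  # plugin exit-code convention: 0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN


def log(log_path, message):
    # Append one timestamped line, tagged with our PID so overlapping runs
    # can be told apart in the log.
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    with open(log_path, "a") as fh:
        fh.write(f"{stamp} pid={os.getpid()} {message}\n")


def run_and_log(cmd, log_path=LOG_PATH):
    # Log the start first, so a run that dies later still leaves a trace.
    log(log_path, f"start ppid={os.getppid()} cmd={cmd!r}")
    try:
        proc = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # fold stderr into the captured output
            text=True,
            errors="replace",  # do not crash on non-UTF-8 plugin output
        )
    except OSError as exc:
        # The plugin could not be executed at all (missing file, permissions, ...).
        log(log_path, f"exec failed: {exc!r}")
        print(f"UNKNOWN: could not execute {cmd[0]}: {exc}")
        return UNKNOWN
    # On POSIX a negative return code means the child was killed by that signal.
    log(log_path, f"exit={proc.returncode} output={proc.stdout!r}")
    sys.stdout.write(proc.stdout)  # pass the plugin output through to Nagios
    return proc.returncode if proc.returncode >= 0 else UNKNOWN


def main(argv):
    if not argv:
        print("UNKNOWN: usage: wrapper PLUGIN [ARGS...]")
        return UNKNOWN
    return run_and_log(argv)
```

To use it as a script, end the file with `sys.exit(main(sys.argv[1:]))` and pass the plugin path and its arguments on the command line. Note that a sketch like this can only record what happens once the wrapper itself is running; if the failure occurs before the wrapper starts, the log will show nothing for that run.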
Anyone have further diagnostic suggestions?