I need some major assistance with a problem I am unsure of how to solve.
I notice that when I use check_ping to monitor a service in service.cfg, nagios will often times tell me something is down when it is not. A false positive. I know it’s not down due to I can ping it, and the host never goes down, which is also being pinged.
I had tried to change these over to polling (snmp) instead of pinging. Thinking maybe it wanted an actual service to monitor, and I’ve only seen this problem with ping, and nothing else that I monitor.
Well, now I see the same problem with polling using check_snmp.
I have enough bandwidth, its not pegging the box or the core network it sits on. I also have tried a new box with a diff eithernet card. I needed a faster better box anyway.
Also this is database drivin using postgres.
To make this even more fun. We’re monitoring 853 hosts, and counting.
Also could I eliminate the service.cfg and just use hosts.cfg without nagios warning me that ‘there is no service associated with this host’ ?
Does anyone have any insight on this one?
Thank you for any advice.