Hello All -
I’ve been using the ‘check_smtp’ command to monitor our mail servers which has worked great. However, with increased volume on the mail servers, I somtimes get “false positives” if you will with this default check.
Let me explain, the ‘check_smtp’ query just tries to communicate with the smtp port to see if it is accessible. If it doesn’t get a response in 10 seconds, it will timeout and therefore generate a CRITICAL alert. That is fine and well except now that our volume has increased, sometimes, on occasion, the port will be really busy and timeout a few times throughout the day.
My question is, can I configure the ‘check_smtp’ script to generate to Nagios the output from the command as opposed to this:
CRITICAL - Socket timeout after 10 seconds
What I would like, is for Nagios to let me know if their are “Too many connections” on a check and then if it happens three times in succession, then it may report as being Critical.
I know that the scripts can be altered, but will Nagios allow for the output to be sent to the interface or will it only report the type as ‘UNKNOWN’,‘OK’,‘CRITICAL’,‘WARNING’?
If anyone can think of a better resolution for this, I am open for suggestions. Thanks in advance.