Snmp monitoring of drive usage

notrob · May 27, 2005, 12:22am

Hi all,

I’ve been tasked with the job of implementing disk checking via snmp in nagios. I’ve created a perl script with net::snmp to pull the data but i’ve run into a problem. when doing ( hrStorageSize / hrStorageUsed ) * 100 I consistently get back data which is off by ~ 5%. I can only assume this is due to journaling. Multiplying the data by 1.1 or 1.07 seems to generate data that is closer to what I see in df.

My question I suppose is…am I using the wrong MIB or am I forced to run a local client on these machines to get accurate reporting?

Thanks,
Robert

luca · May 27, 2005, 7:24am

don’t know the net::snmp stuff.
Here i simply use a script which makes a df-k on the partition i want and returns the percentage value on a private MIB. maybe it’s easier (problem is you need to modify snmpd.conf on each machine you want to check)

Luca

PS: Could the 5% be the root-reserved space?
Edited Thu May 26 2005, 11:24PM ]

notrob · May 27, 2005, 5:53pm

Here’s the problem:

600+ servers.

Granted a big chunk of them are all the same, but there are still over a hundred servers with individual disk monitoring needs. I really do not want to push a configs to every server.

I’m not sure if it’s a reserve of some kind, though it could be. this problem effects solaris as well as linux. Though NFS mounts do not suffer from this problem.

luca · May 31, 2005, 8:11am

just letting some ideas at the moment…
could it be simply the 1024/1000 problem?
have you considered the root reserved spaces?

Luca