Author: Zachary LaCelle
License: GPLv3
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.
This is a python script which will check the health, capacity, fragmentation, and other things for use with Nagios monitoring.
It provides perfdata feedback as well.
- Ubuntu 14.04 LTS, ZFS v5
- Ubuntu 16.04 LTS, ZFS v5
- Ubuntu 18.04 LTS, ZFS v5
- Ubuntu 20.04 LTS, ZFS v5
- CentOS 7, ZFS v5
- OmniOS r151030
This assumes you've set up a separate local configuration file include directory in your nagios.cfg to store all of your local configs. One example would be:
cfg_dir=/usr/local/nagios/etc/objects/conf.d
This also assumes you've installed check_zfs.py to your nagios/libexec directory. One way would be to softlink it to your directory where you've checked out the Git repository.
Below is an example set of command definitions which allow various levels of fidelity in zpool querying:
#Commands to check zpool status
define command {
command_name check_zpool_full
command_line $USER1$/check_zfs.py --capacity $ARG2$ $ARG3$ --fragmentation $ARG4$ $ARG5$ $ARG1$
}
define command {
command_name check_zpool_capacity
command_line $USER1$/check_zfs.py --capacity $ARG2$ $ARG3$ $ARG1$
}
define command {
command_name check_zpool
command_line $USER1$/check_zfs.py $ARG1$
}
Below is an example service definition using the above commands which will check the pool named "storage" for both capacity (warn at 70%, critical at 85%) and fragmentation (warn at 30%, critical at 40%):
define service {
use local-service
host_name localhost
service_description ZPOOL STORAGE
check_command check_zpool_full!storage!70!85!30!40
notifications_enabled 1
}
Below is an example NRPE configuration which will accept the single argument "check_zpool" and run the commands:
TODO
On systems with SELinux in enforcing mode nrpe is not granted the required permissions by SELinux, for that you can compile a policy module, then a policy package that can then be installed.
A sample can be used as follows:
- Build a policy module: checkmodule -M -m -o check_zfs_py.mod contrib/SELinux/check_zfs_py.te
- Build a policy package: semodule_package -o check_zfs_py.pp -m check_zfs_py.mod
- Load the policy package: semodule -i check_zfs_py.pp
If you want to unload it: semodule -i check_zfs_py