zabbix监控docker容器以及使用资源利用率
环境需求:由于公司使用docker程序在线上环境中,防止容器以但挂掉我们这边可以随时监控到容器的情况,以及容器的资源占用情况;
公司领导决定使用zabbix监控docker
可参考资料https://www.cnblogs.com/feigerlan/p/8193600.html
1、首先在安装zabbixserver端;这里不做介绍
2、安装zabbix客户端,然后安装模块pip install docker-py后面使用 client方法
3、完成之后在zabbix客户端安装docker服务,创建一个放脚本的文件夹
4、mkdir /etc/zabbix/script/
http://s1.运维网.com/images/20180702/1530498117903406.png
5、接下来创建三个脚本,两个是python写的,一个是shell写的
http://s1.运维网.com/images/20180702/1530498182810481.png
完成之后将zabbix的客户端配置文件修改一下
http://s1.运维网.com/images/20180702/1530498193641021.pngUserParameter=docker_discovery
[*], python /etc/zabbix/script/docker_discovery.py $1
UserParameter=docker_stats
[*],sudo /usr/bin/python/etc/zabbix/script/docker_monitor.py$1 $2
UserParameter=docker_process
[*], /bin/bash/etc/zabbix/script/docker_processmonitor.sh $1$2$3
http://s1.运维网.com/images/20180702/1530498214452588.png
重启zabbix客户端
6、在server端测试一下啊看看脚本是否能够用
zabbix_get -s 172.16.2.115 -k docker_discovery
http://s1.运维网.com/images/20180702/1530498250679474.png
查看其中一个容器的使用资源情况:
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
zabbix_get -s 172.16.2.115 -k docker_stats
http://s1.运维网.com/images/20180702/1530498268420727.png
7、给权限,在客户端上操作:
http://s1.运维网.com/images/20180702/1530498282540041.png
chown zabbix.zabbix /etc/zabbix/script/*
chmod 777 /etc/zabbix/script/docker_*
echo "zabbix ALL=(root) NOPASSWD:/bin/docker,/usr/bin/python,/etc/zabbix/script/docker_monitor.py,/etc/zabbix/script/docker_discovery.py,/etc/zabbix/script/docker_processmonitor.sh">>/etc/sudoers
8、接下来在zabbixweb上面配置模板板
http://s1.运维网.com/images/20180702/1530498323693581.pnghttp://s1.运维网.com/images/20180702/1530498337376554.png
http://s1.运维网.com/images/20180702/1530498350996277.pnghttp://s1.运维网.com/images/20180702/1530498367116209.png
http://s1.运维网.com/images/20180702/1530498428593696.png
备注:下面的都要添加
http://s1.运维网.com/images/20180702/1530498477442227.png
http://s1.运维网.com/images/20180702/1530498489862606.png
http://s1.运维网.com/images/20180702/1530498503902455.png
http://s1.运维网.com/images/20180702/1530498533902939.png
先创建了cpu、内存、和进出流量监控
http://s1.运维网.com/images/20180702/1530498557117994.png
指定图形
http://s1.运维网.com/images/20180702/1530498589800772.png
http://s1.运维网.com/images/20180702/1530498601652135.pnghttp://s1.运维网.com/images/20180702/1530498616411808.pnghttp://s1.运维网.com/images/20180702/1530498639294043.png
在定义下cpu的使用情况
和上面的一样
就改改名称就行:
http://s1.运维网.com/images/20180702/1530498657602695.png
查看下效果
http://s1.运维网.com/images/20180702/1530498692863887.png
http://s1.运维网.com/images/20180702/1530498714491760.png
页:
[1]