ifuleyou posted on 2019-1-16 13:45:13

Configuring Nagios on Red Hat 5.2

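  Configuring Nagios on Red Hat 5.2
  Installing system components
  Installing apache and gcc
  Configuring apache
  Installing the Nagios components
  Installing nagios
  Installing nagios-plugins
  Installing nrpe
  Configuring monitored hosts
  Configuring a monitored Linux host (client)
  Configuring a monitored switch (client)
  Configuring a monitored Windows host (client)
  Configuring email notification
  Installing system components
  Required packages: apache (httpd), gcc, glibc, glibc-common, gd-devel, openssl, openssl-devel (the openssl installation is not shown below)
  net-snmp-libs net-snmp-devel net-snmp net-snmp-utils
  Installing apache and gcc
  (Resolving rpm dependencies by hand is tedious; with yum configured it is much easier. For details see the yum installation notes.)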
  # yum install httpd
  ……
  Downloading Packages:
  (1/4): apr-util-1.2.7-7.e 100% |=========================|76 kB    00:00
  (2/4): apr-1.2.7-11.i386. 100% |=========================| 123 kB    00:00
  (3/4): httpd-2.2.3-11.el5 100% |=========================| 1.1 MB    00:01
  (4/4): postgresql-libs-8. 100% |=========================| 196 kB    00:00

  warning: rpmts_HdrFromFdno: Header V3 DSA signature: NOKEY, key ID 37017186
  Importing GPG key 0x37017186 "Red Hat, Inc. (release key)" from http://10.155.2.75/Server/RPM-GPG-KEY-redhat-release
  Is this ok [y/N]: y
  Running rpm_check_debug
  Running Transaction Test
  Finished Transaction Test
  Transaction Test Succeeded
  Running Transaction
  Installing: apr                        #########################
  Installing: postgresql-libs            #########################
  Installing: apr-util                     #########################
  Installing: httpd                        #########################
  ……
  # yum install gcc
  ……
  Downloading Packages:
  (1/5): libgomp-4.1.2-42.e 100% |=========================|82 kB    00:00
  (2/5): glibc-headers-2.5- 100% |=========================| 610 kB    00:00
  (3/5): glibc-devel-2.5-24 100% |=========================| 2.0 MB    00:02
  (4/5): gcc-4.1.2-42.el5.i 100% |=========================| 5.2 MB    00:07
  (5/5): kernel-headers-2.6 100% |=========================| 843 kB    00:01
  Running rpm_check_debug
  Running Transaction Test
  Finished Transaction Test
  Transaction Test Succeeded
  Running Transaction
  Installing: libgomp                      #########################
  Installing: kernel-headers               #########################
  Installing: glibc-headers                #########################
  Installing: glibc-devel                  #########################
  Installing: gcc                        #########################
  Installed: gcc.i386 0:4.1.2-42.el5
  Dependency Installed: glibc-devel.i386 0:2.5-24 glibc-headers.i386 0:2.5-24 kernel-headers.i386 0:2.6.18-92.el5 libgomp.i386 0:4.1.2-42.el5
  Complete!
  # yum install glibc glibc-common gd-devel
  ……
  Running Transaction
  Installing: zlib-devel                   ####################### [ 1/12]
  Installing: freetype-devel               ####################### [ 2/12]
  Installing: fontconfig-devel             ####################### [ 3/12]
  Installing: libpng-devel               ####################### [ 4/12]
  Installing: libXau-devel               ####################### [ 5/12]
  Installing: libjpeg-devel                ####################### [ 6/12]
  Installing: xorg-x11-proto-devel         ####################### [ 7/12]
  Installing: libX11-devel               ####################### [ 8/12]
  Installing: libXpm-devel               ####################### [ 9/12]
  Installing: libXdmcp-devel               ####################### [10/12]
  Installing: mesa-libGL-devel             ####################### [11/12]
  Installing: gd-devel                     ####################### [12/12]
  Installed: gd-devel.i386 0:2.0.33-9.4.el5_1.1
  Dependency Installed: fontconfig-devel.i386 0:2.4.1-7.el5 freetype-devel.i386 0:2.2.1-19.el5 libX11-devel.i386 0:1.0.3-9.el5 libXau-devel.i386 0:1.0.1-3.1 libXdmcp-devel.i386 0:1.0.1-2.1 libXpm-devel.i386 0:3.5.5-3 libjpeg-devel.i386 0:6b-37 libpng-devel.i386 2:1.2.10-7.1.el5_0.1 mesa-libGL-devel.i386 0:6.5.1-7.5.el5 xorg-x11-proto-devel.i386 0:7.1-9.fc6 zlib-devel.i386 0:1.2.3-3
  Complete!
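  Before moving on, it can help to confirm that every package from the list at the top of this section is really installed. The following is only a minimal sketch and not part of the original procedure: missing_packages is a hypothetical helper that reads installed package names on standard input (one per line) and prints those of its arguments that are absent.

```shell
# missing_packages PKG... : reads installed package names on stdin (one per
# line) and prints each PKG that is not in that list.
# (Hypothetical helper, not part of Nagios, rpm or yum.)
missing_packages() {
    installed=$(cat)
    for pkg in "$@"; do
        # -x matches whole lines, -F disables regex, -q suppresses output
        printf '%s\n' "$installed" | grep -qxF -- "$pkg" || printf '%s\n' "$pkg"
    done
}

# Usage on the monitoring server:
#   rpm -qa --qf '%{NAME}\n' | missing_packages httpd gcc glibc glibc-common \
#       gd-devel openssl openssl-devel net-snmp net-snmp-utils
```

  An empty result means nothing is missing; any name printed can be installed with yum install as shown above.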
  Configuring apache
  Note: add the apache user to the nagcmd group so that commands issued through the web interface have sufficient permissions on Nagios.
  # useradd nagios && passwd nagios   //chinahr123$
  Changing password for user nagios.
  New UNIX password:
  BAD PASSWORD: it is based on a dictionary word
  Retype new UNIX password:
  Sorry, passwords do not match.
  New UNIX password:
  Retype new UNIX password:
  passwd: all authentication tokens updated successfully.
  # groupadd nagcmd
  # usermod -a -G nagcmd nagios
  # usermod -a -G nagcmd apache
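  To double-check the group membership, the output of id -Gn can be inspected. A minimal sketch follows; has_group is a hypothetical helper name and only compares strings, so it makes no changes to the system.

```shell
# has_group GROUP NAME... : succeeds if GROUP is among the NAMEs.
# (Hypothetical helper for illustration.)
has_group() {
    wanted=$1
    shift
    for g in "$@"; do
        [ "$g" = "$wanted" ] && return 0
    done
    return 1
}

# Usage (id -Gn prints the names of all groups a user belongs to):
#   has_group nagcmd $(id -Gn nagios) && echo "nagios is in nagcmd"
#   has_group nagcmd $(id -Gn apache) && echo "apache is in nagcmd"
```

  Note that usermod -G without -a replaces the user's supplementary groups instead of appending to them, so it is worth checking that no other membership was lost.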
  Installing the Nagios components
  Installing nagios
  http://cdnetworks-kr-2.dl.sourceforge.net/project/nagios/nagios-3.x/nagios-3.2.1/nagios-3.2.1.tar.gz
  http://cdnetworks-kr-2.dl.sourceforge.net/project/nagiosplug/nagiosplug/1.4.15/nagios-plugins-1.4.15.tar.gz
  # tar zxvf nagios-3.2.1.tar.gz
  # cd nagios-3.2.1
  # ./configure --with-command-group=nagcmd --prefix=/usr/local/nagios
  # make all
  # make install
  # make install-init
  /usr/bin/install -c -m 755 -d -o root -g root /etc/rc.d/init.d
  /usr/bin/install -c -m 755 -o root -g root daemon-init /etc/rc.d/init.d/nagios
  # make install-config
  ……
  *** Config files installed ***
  Remember, these are *SAMPLE* config files.You'll need to read
  the documentation for more information on how to actually define
  services, hosts, etc. to fit your particular needs.
  # make install-commandmode
  /usr/bin/install -c -m 775 -o nagios -g nagcmd -d /usr/local/nagios/var/rw
  chmod g+s /usr/local/nagios/var/rw
  *** External command directory configured ***
  Verify that the installation under /usr/local/nagios created the directories bin, etc, sbin, share and var.
  # cd /usr/local/nagios
  # ls
  bin  etc  sbin  share  var
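  The same check can be scripted. This is a small sketch under the assumption that the prefix is /usr/local/nagios as above; missing_dirs is a hypothetical helper name.

```shell
# missing_dirs PREFIX NAME... : prints every NAME that is not a directory
# under PREFIX. (Hypothetical helper for illustration.)
missing_dirs() {
    prefix=$1
    shift
    for d in "$@"; do
        [ -d "$prefix/$d" ] || printf '%s\n' "$d"
    done
}

# Usage:
#   missing_dirs /usr/local/nagios bin etc sbin share var
```

  No output means all five directories exist.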
  Installing nagios-plugins
  # tar zxvf nagios-plugins-1.4.15.tar.gz
  # cd nagios-plugins-1.4.15
  # ./configure --with-nagios-user=nagios --with-nagios-group=nagios --prefix=/usr/local/nagios
  # make && make install
  Verify the installation by checking that the following files were created
  # ls /usr/local/nagios/libexec
  check_apt      check_disk_smb  check_ide_smart     check_mrtg      check_nwstat  check_sensors  check_users
  check_breeze   check_dns       check_ifoperstatus  check_mrtgtraf  check_oracle  check_smtp     check_wave
  check_by_ssh   check_dummy     check_ifstatus      check_nagios    check_overcr  check_ssh      negate
  check_clamd    check_file_age  check_imap          check_nntp      check_ping    check_swap     urlize
  check_cluster  check_flexlm    check_ircd          check_nt        check_pop     check_tcp      utils.pm
  check_dhcp     check_ftp       check_load          check_ntp       check_procs   check_time     utils.sh
  check_dig      check_http      check_log           check_ntp_peer  check_real    check_udp
  check_disk     check_icmp      check_mailq         check_ntp_time  check_rpc     check_ups
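  Each plugin can be run by hand, and Nagios derives the state from the plugin's exit status (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN). The sketch below only translates that number into a name; state_name is a hypothetical helper, and any other code is reported as UNKNOWN here as a simplification.

```shell
# state_name CODE : maps a plugin exit code to the Nagios state name.
# (Hypothetical helper for illustration.)
state_name() {
    case "$1" in
        0) echo OK ;;
        1) echo WARNING ;;
        2) echo CRITICAL ;;
        *) echo UNKNOWN ;;
    esac
}

# Usage: run a plugin by hand, then translate its exit status.
#   /usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20
#   state_name $?
```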
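  Set up the login for the Nagios web interface. The same command can also be used later to change the user's password (omit -c then, because -c creates the file anew):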
  # /usr/bin/htpasswd -c /usr/local/nagios/etc/htpasswd nagiosadmin
  New password:
  Re-type new password:
  Adding password for user nagiosadmin
  # service httpd start
  Starting httpd:                                          
  Edit httpd.conf, append the following at the end, save it, and restart apache with service httpd restart
  ScriptAlias /nagios/cgi-bin /usr/local/nagios/sbin
  <Directory "/usr/local/nagios/sbin">
  Options ExecCGI
  AllowOverride None
  Order allow,deny
  Allow from all
  AuthName "Nagios Access"
  AuthType Basic
  AuthUserFile /usr/local/nagios/etc/htpasswd
  Require valid-user
  </Directory>
  Alias /nagios /usr/local/nagios/share
  <Directory "/usr/local/nagios/share">
  Options None
  AllowOverride None
  Order allow,deny
  Allow from all
  AuthName "Nagios Access"
  AuthType Basic
  AuthUserFile /usr/local/nagios/etc/htpasswd
  Require valid-user
  </Directory>
  Start the nagios service at boot
  # chkconfig --add nagios
  # chkconfig nagios on
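  Verify the Nagios configuration. This command will be used often from now on; 0 warnings and 0 errors mean the configuration is valid (failures are discussed later). The nagios service can then be started.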
  # /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
  ……
  Checking misc settings...
  Total Warnings: 0
  Total Errors:   0
  # service nagios start
  Starting nagios: done.
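  Since the verify step is repeated after every change, it can be convenient to gate the restart on its result. The following is a minimal sketch, assuming the summary lines look like the output shown above; config_ok is a hypothetical helper name.

```shell
# config_ok : reads the output of "nagios -v" on stdin, prints the two
# summary lines, and succeeds only if it finds "Total Errors: 0".
# (Hypothetical helper; warnings are shown but do not cause failure.)
config_ok() {
    report=$(cat)
    printf '%s\n' "$report" | grep -E '^ *Total (Warnings|Errors):'
    printf '%s\n' "$report" | grep -Eq '^ *Total Errors: +0 *$'
}

# Usage:
#   /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg | config_ok \
#       && service nagios restart
```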
  Installing nrpe
  http://sourceforge.net/projects/nagios/files/nrpe-2.x/nrpe-2.8b1/nrpe-2.8b1.tar.gz/download
  Before installing nrpe, gcc, openssl and openssl-devel must be installed (especially on monitored Linux hosts, i.e. the clients). After installation, nrpe is started with /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
  # tar -zxvf nrpe-2.8b1.tar.gz
  # cd nrpe-2.8b1
  # ./configure
  # make all
  # make install-plugin
  # make install-daemon
  # make install-daemon-config
  /usr/bin/install -c -m 775 -o nagios -g nagios -d /usr/local/nagios/etc
  /usr/bin/install -c -m 644 -o nagios -g nagios sample-config/nrpe.cfg /usr/local/nagios/etc
  # /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
  Verify nrpe with the following commands; port 5666 should now be listening
  # /usr/local/nagios/libexec/check_nrpe -H localhost
  NRPE v2.8b1
  #
  # netstat -atulnp | grep 'nrpe'
  tcp      0      0 0.0.0.0:5666                0.0.0.0:*   LISTEN      24823/nrpe
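  The port check can also be scripted. This is a sketch that assumes the Linux netstat column layout shown above (local address in column 4, state in column 6); listening_on is a hypothetical helper name.

```shell
# listening_on PORT : reads netstat output on stdin and succeeds if a TCP
# socket in LISTEN state is bound to PORT.
# (Hypothetical helper; assumes the Linux netstat column layout.)
listening_on() {
    awk -v port="$1" '
        NF >= 6 && $6 == "LISTEN" {
            n = split($4, a, ":")      # last ":"-separated part is the port
            if (a[n] == port) found = 1
        }
        END { exit (found ? 0 : 1) }'
}

# Usage:
#   netstat -tln | listening_on 5666 && echo "nrpe port is open"
```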
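  The nrpe setup on the server (the control side), which is used to monitor the server itself, is now complete. Next comes the Nagios monitoring configuration on the server. Edit nagios.cfg and remove the leading "#" from the entries below. Each entry points to a configuration file in the objects directory; the table further down explains them. host.cfg and service.cfg do not exist by default but can be created by hand. In fact any of these cfg files can be created by hand, as long as a matching entry is added to nagios.cfg. For now only the server itself is monitored, which uses just commands.cfg and localhost.cfg, and nothing in them needs to be changed.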
  # vi /usr/local/nagios/etc/nagios.cfg
  cfg_file=/usr/local/nagios/etc/objects/commands.cfg            ;
  cfg_file=/usr/local/nagios/etc/objects/contacts.cfg
  cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg
  cfg_file=/usr/local/nagios/etc/objects/templates.cfg
  cfg_file=/usr/local/nagios/etc/objects/localhost.cfg
  cfg_file=/usr/local/nagios/etc/objects/windows.cfg
  cfg_file=/usr/local/nagios/etc/objects/switch.cfg
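  Object                 Keyword      Description
  Monitoring commands    command      The command Nagios runs to perform a given check; these are user-defined
  Contacts               contact      Who receives alerts, normally the system administrator
  Monitoring periods     timeperiod   24x7, Monday to Friday only, or any other custom period
  Monitored switches     switch       e.g. whether the host is alive, whether port 80 is open, disk usage, or custom services
  Monitored host         localhost    The monitoring server itself
  Monitored hosts        host         The servers to be monitored, which may include the monitoring server itself
  Monitored services     service      e.g. whether the host is alive, whether port 80 is open, disk usage, or custom services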
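  After saving nagios.cfg, restart the Nagios service with service nagios restart. If that fails, run /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg, which reports where the problem is. Troubleshooting is covered later; since only the server itself is monitored at this point, there should be no problems.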
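  A common cause of a failed restart is a cfg_file entry that points to a file which does not exist yet. The sketch below lists the active entries of nagios.cfg and reports the missing ones; cfg_files and missing_cfg_files are hypothetical helper names, and the sed pattern assumes one cfg_file= entry per line.

```shell
# cfg_files FILE : prints the path of every active (uncommented) cfg_file=
# entry in a nagios.cfg-style file. (Hypothetical helper.)
cfg_files() {
    sed -n 's/^[[:space:]]*cfg_file=\([^;[:space:]]*\).*$/\1/p' "$1"
}

# missing_cfg_files FILE : prints the referenced files that do not exist.
missing_cfg_files() {
    cfg_files "$1" | while read -r f; do
        [ -f "$f" ] || printf '%s\n' "$f"
    done
}

# Usage:
#   missing_cfg_files /usr/local/nagios/etc/nagios.cfg
```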
  In IE, open http://IP/nagios and log in as nagiosadmin with the password set earlier (Figures 1 and 2)
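  The web interface can also be probed from the command line before opening a browser. This is only a sketch and assumes curl is installed, which is not part of the steps in this post; web_verdict is a hypothetical helper that explains the HTTP status code.

```shell
# web_verdict CODE : explains the HTTP status returned for /nagios/.
# (Hypothetical helper for illustration.)
web_verdict() {
    case "$1" in
        200) echo "page served (credentials accepted)" ;;
        401) echo "authentication required (Basic auth is active)" ;;
        403) echo "forbidden (check the Directory block and file permissions)" ;;
        404) echo "not found (check the Alias lines)" ;;
        *)   echo "unexpected status $1" ;;
    esac
}

# Usage, assuming curl is available:
#   web_verdict "$(curl -s -o /dev/null -w '%{http_code}' http://localhost/nagios/)"
```

  Without credentials a correctly protected installation is expected to answer 401.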
  Configuring monitored hosts
  Configuring a monitored Linux host (client)
  gcc, openssl and openssl-devel must be installed first.
  First add a user
  # useradd nagios
  # passwd nagios                  //123456
  Changing password for user nagios.
  New UNIX password:
  BAD PASSWORD: it is too simplistic/systematic
  Retype new UNIX password:
  passwd: all authentication tokens updated successfully.
  Install nagios-plugins-1.4.15.tar.gz
  # tar -zxvf nagios-plugins-1.4.15.tar.gz
  # cd nagios-plugins-1.4.15
  # ./configure --prefix=/usr/local/nagios
  # make
  # make install
  # chown nagios.nagios /usr/local/nagios
  # chown -R nagios.nagios /usr/local/nagios/libexec
  Install nrpe
  # tar -zxvf nrpe-2.8b1.tar.gz
  # cd nrpe-2.8b1
  # ./configure
  # make all
  # make install-plugin
  # make install-daemon
  # make install-daemon-config
  Edit nrpe.cfg
  # vi /usr/local/nagios/etc/nrpe.cfg
  Find the following line and add the IP address of the Nagios server
  allowed_hosts=127.0.0.1,10.155.2.65
  After saving, start nrpe with the first command below and add it to the boot sequence with the second
  # /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
  # echo '/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d &> /dev/null' >> /etc/rc.local
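  Whether a given server address is really listed in allowed_hosts can be checked with a few lines of shell. This is a minimal sketch; host_allowed is a hypothetical helper that does an exact match on the comma-separated entries and does not understand network ranges.

```shell
# host_allowed IP FILE : succeeds if IP is listed on the allowed_hosts= line
# of an nrpe.cfg-style FILE (exact match on comma-separated entries).
# (Hypothetical helper for illustration.)
host_allowed() {
    list=$(sed -n 's/^[[:space:]]*allowed_hosts=//p' "$2" | tr -d ' \t')
    case ",$list," in
        *",$1,"*) return 0 ;;
        *)        return 1 ;;
    esac
}

# Usage:
#   host_allowed 10.155.2.65 /usr/local/nagios/etc/nrpe.cfg || echo "add the server IP"
```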
  Verify
  # /usr/local/nagios/libexec/check_nrpe -H localhost
  NRPE v2.8b1
  # netstat -atulnp | grep 'nrpe'
  tcp      0      0 0.0.0.0:5666                0.0.0.0:*   LISTEN      24823/nrpe
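  If this fails, see Problem 1 under "Problems and solutions" at the end of the article
  The client is now configured, and the checks could be defined in commands.cfg on the Nagios server. Here is another approach: define the commands in nrpe.cfg on the client, and the Nagios server calls them by name through nrpe
  # vi /usr/local/nagios/etc/nrpe.cfg
  command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
  command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20
  command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
  command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200
  command[check_swap]=/usr/local/nagios/libexec/check_swap -w 20% -c 10%
  The next entry was added by hand
  command[check_ping81]=/usr/local/nagios/libexec/check_ping -H 10.155.0.1 -w 100.0,20% -c 500.0,60%
  command[check_hda1]=/usr/local/nagios/libexec/check_disk -w 20 -c 10 -p /dev/hda1
  # Format: command[name]=command definition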
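  Because the server refers to these entries purely by name, it is useful to list the names that a client's nrpe.cfg actually defines. A minimal sketch, assuming one command[NAME]= entry per line; nrpe_commands is a hypothetical helper name.

```shell
# nrpe_commands FILE : prints the NAME of every active command[NAME]= entry
# in an nrpe.cfg-style file. (Hypothetical helper for illustration.)
nrpe_commands() {
    sed -n 's/^[[:space:]]*command\[\([^]]*\)]=.*$/\1/p' "$1"
}

# Usage:
#   nrpe_commands /usr/local/nagios/etc/nrpe.cfg
```

  Any name used after check_nrpe! in a service definition on the server has to appear in this list.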
  For the exact usage of each plugin, see its help output
  # ls
  bin  etc  include  libexec  share
  # pwd
  /usr/local/nagios
  # libexec/check_ping -h
  check_ping v1.4.15 (nagios-plugins 1.4.15)
  Copyright (c) 1999 Ethan Galstad
  Copyright (c) 2000-2007 Nagios Plugin Development Team
  
  Use ping to check connection statistics for a remote host.
  Usage:
  check_ping -H <host_address> -w <wrta>,<wpl>% -c <crta>,<cpl>%
  [-p packets] [-t timeout] [-4|-6]
  Options:
  -h, --help
  Print detailed help screen
  -V, --version
  Print version information
  -4, --use-ipv4
  Use IPv4 connection
  -6, --use-ipv6
  Use IPv6 connection
  -H, --hostname=HOST
  host to ping
  -w, --warning=THRESHOLD
  warning threshold pair
  -c, --critical=THRESHOLD
  critical threshold pair
  -p, --packets=INTEGER
  number of ICMP ECHO packets to send (Default: 5)
  -L, --link
  show HTML in the plugin output (obsoleted by urlize)
  -t, --timeout=INTEGER
  Seconds before connection times out (default: 10)
  After changing nrpe.cfg, nrpe must be restarted (kill the process, then start it again); otherwise the changes do not take effect
  # ps aux|grep nrpe
  nagios    3327  0.0  0.0   4880   924 ?      Ss   Sep02   0:03 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
  root     22293  0.0  0.0   3908   648 pts/0    R+   11:07   0:00 grep nrpe
  # kill 3327
  # /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
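  Looking up the PID by eye can be avoided. The sketch below extracts the PID of the nrpe daemon from ps aux output (PID in column 2, command in column 11); nrpe_pids is a hypothetical helper name.

```shell
# nrpe_pids : reads "ps aux" output on stdin and prints the PID of every
# nrpe daemon, skipping the grep process itself.
# (Hypothetical helper; assumes the usual ps aux column layout.)
nrpe_pids() {
    awk '/bin\/nrpe / && $11 !~ /grep/ { print $2 }'
}

# Usage (restart so that changes to nrpe.cfg take effect):
#   kill $(ps aux | nrpe_pids)
#   /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
```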
  On the server
  Test the connection to the client; the following output means it works
  # /usr/local/nagios/libexec/check_nrpe -H 10.155.2.81
  NRPE v2.8b1
  The following output means it does not work; see Problem 2 under "Problems and solutions"
  # /usr/local/nagios/libexec/check_nrpe -H 10.155.2.81
  Connection refused by host
  Edit nagios.cfg and add one entry
  # vi /usr/local/nagios/etc/nagios.cfg
  cfg_file=/usr/local/nagios/etc/objects/mylinux.cfg
  cfg_file=/usr/local/nagios/etc/objects/commands.cfg    ; already enabled earlier
  Edit commands.cfg and add the check_nrpe command
  # vi /usr/local/nagios/etc/objects/commands.cfg
  # Append the following at the end of the file
  # check nrpe
  define command{
  command_name check_nrpe
  command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
  }
  mylinux.cfg does not exist in the directory and has to be created
  # touch /usr/local/nagios/etc/objects/mylinux.cfg
  #--------------------------------------------- host definition --------------------------------------------------------
  define host{
  use         linux-server
  host_name   10.155.2.81-cacti
  alias       mylinux
  address   10.155.2.81
  }
  #-------------------------------- service definitions, i.e. the items to monitor ------------------------------------------
  define service{
  use   generic-service
  host_name       10.155.2.81-cacti
  service_description Swap Usage
  check_command       check_nrpe!check_swap
  }
  define service{
  use               generic-service
  host_name         10.155.2.81-cacti
  service_description Current Load
  check_command       check_nrpe!check_load
  }
  define service{
  use               generic-service
  host_name         10.155.2.81-cacti
  service_description Partition Usage
  check_command       check_nrpe!check_hda1
  }
  define service{
  use               generic-service
  host_name         10.155.2.81-cacti
  service_description Current Users
  check_command       check_nrpe!check_users
  }
  define service{
  use               generic-service
  host_name         10.155.2.81-cacti
  service_description Total Processes
  check_command       check_nrpe!check_total_procs
  }
  define service{
  use               generic-service
  host_name         10.155.2.81-cacti
  service_description PING
  check_command       check_nrpe!check_ping81
  }
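  The six service blocks above differ only in their description and command name, so they could also be generated. This is a sketch of that idea and not how the file above was written; nrpe_service is a hypothetical helper name.

```shell
# nrpe_service HOST DESCRIPTION COMMAND : prints one service definition that
# runs COMMAND on HOST through check_nrpe. (Hypothetical helper.)
nrpe_service() {
    printf 'define service{\n'
    printf '    use                 generic-service\n'
    printf '    host_name           %s\n' "$1"
    printf '    service_description %s\n' "$2"
    printf '    check_command       check_nrpe!%s\n' "$3"
    printf '}\n'
}

# Usage:
#   nrpe_service 10.155.2.81-cacti "Swap Usage" check_swap >> mylinux.cfg
#   nrpe_service 10.155.2.81-cacti "Current Load" check_load >> mylinux.cfg
```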
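  Restart Nagios and view the result in IE (Figure 3)
  Configuring a monitored switch (client)
  switch.cfg is a sample template for monitoring a switch. I kept it unchanged and edited a copy named switch31.cfg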
  # vi /usr/local/nagios/etc/nagios.cfg
  # Definitions for monitoring a router/switch
  #cfg_file=/usr/local/nagios/etc/objects/switch.cfg
  cfg_file=/usr/local/nagios/etc/objects/switch31.cfg
  Almost nothing needs changing, only host_name and address. The last service (MRTG) is commented out because MRTG is not used on my network
  # vi switch31.cfg
  define host{
  use             generic-switch          ; Inherit default values from a template
  host_name       g13a-dell5424-31      ; The name we're giving to this switch
  alias         Linksys SRW224P Switch; A longer name associated with the switch
  address         10.155.0.31             ; IP address of the switch
  hostgroups      switches                ; Host groups this switch is associated with
  }
  define hostgroup{
  hostgroup_name  switches              ; The name of the hostgroup
  alias         Network Switches      ; Long name of the group
  }
  define service{
  use            generic-service ; Inherit values from a template
  host_name      g13a-dell5424-31; The name of the host the service is associated with
  service_description   PING            ; The service description
  check_command         check_ping!200.0,20%!600.0,60%; The command used to monitor the service
  normal_check_interval   5               ; Check the service every 5 minutes under normal conditions
  retry_check_interval    1               ; Re-check the service every minute until its final/hard state is determined
  }
  # Monitor uptime via SNMP
  define service{
  use                     generic-service ; Inherit values from a template
  host_name               g13a-dell5424-31
  service_description   Uptime
  check_command         check_snmp!-C chrswitch -o sysUpTime.0
  }
  # Monitor Port 1 status via SNMP
  define service{
  use                     generic-service ; Inherit values from a template
  host_name               g13a-dell5424-31
  service_description   Port 1 Link Status
  check_command         check_snmp!-C chrswitch -o ifOperStatus.1 -r 1 -m RFC1213-MIB
  }
  # Monitor bandwidth via MRTG logs
  #define service{
  #       use                     generic-service ; Inherit values from a template
  #       host_name               linksys-srw224p
  #       service_description   Port 1 Bandwidth Usage
  #       check_command         check_local_mrtgtraf!/var/lib/mrtg/192.168.1.253_1.log!AVG!1000000,1000000!5000000,5000000!10
  #       }
  The switch configuration is now complete. For the items shown in red, see Problem 3 (Figure 4)
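  The copy-and-edit step can be done in one pass with sed. The sketch below assumes the sample switch.cfg still carries the host name linksys-srw224p and the address 192.168.1.253 that appear in the commented-out MRTG block above; retarget_switch is a hypothetical helper name, and it expects names without "/" or other sed special characters.

```shell
# retarget_switch OLD_NAME NEW_NAME OLD_IP NEW_IP : copies a switch config
# from stdin to stdout, replacing the host name and the address.
# (Hypothetical helper for illustration.)
retarget_switch() {
    old_ip=$(printf '%s' "$3" | sed 's/\./\\./g')   # escape dots for sed
    sed -e "s/$1/$2/g" -e "s/$old_ip/$4/g"
}

# Usage:
#   cd /usr/local/nagios/etc/objects
#   retarget_switch linksys-srw224p g13a-dell5424-31 192.168.1.253 10.155.0.31 \
#       < switch.cfg > switch31.cfg
```

  The SNMP community string and the MRTG block still have to be reviewed by hand afterwards.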
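  Configuring a monitored Windows host (client)
  Monitoring Windows requires installing the nsclient++ agent on the Windows machine: http://nsclient.org/nscp/downloads
  Download it and unpack it on drive C: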
  C:\NSClient>nsclient++ /install
  Service NSClientpp installed...
  l NSClient++.cpp(227) Service installed!
  Edit nsc.ini with Notepad and remove the comment character ";" from every module except CheckWMI.dll and RemoteConfiguration.dll
  [modules]
  FileLogger.dll
  CheckSystem.dll
  CheckDisk.dll
  NSClientListener.dll
  NRPEListener.dll
  SysTray.dll
  CheckEventLog.dll
  CheckHelpers.dll
  ;CheckWMI.dll
  CheckExternalScripts.dll
  NSCAAgent.dll
  LUAScript.dll
  ;RemoteConfiguration.dll
  NRPEClient.dll
  CheckTaskSched.dll
  [Settings]
  allowed_hosts=10.155.2.65/32
  [NSClient]
  port=12489
  C:\NSClient>NSClient++ -start
  Starting NSClientpp
  C:\NSClient>netstat -an | more
  Active Connections
  Proto  Local Address          Foreign Address      State
  TCP    0.0.0.0:5666         0.0.0.0:0            LISTENING
  TCP    0.0.0.0:12489          0.0.0.0:0            LISTENING
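  If the ini file is prepared on a machine with a POSIX shell (for example on the Nagios server before copying it to the Windows host, which is an assumption and not what this post does), the module lines can be uncommented with sed instead of Notepad. enable_modules is a hypothetical helper name.

```shell
# enable_modules : reads an nsc.ini on stdin and uncomments every
# ";Name.dll" module line except CheckWMI.dll and RemoteConfiguration.dll.
# (Hypothetical helper for illustration.)
enable_modules() {
    sed -e '/^;CheckWMI\.dll/b' \
        -e '/^;RemoteConfiguration\.dll/b' \
        -e 's/^;\([A-Za-z]*\.dll\)/\1/'
}

# Usage:
#   enable_modules < nsc.ini.orig > nsc.ini
```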
  On the server
  # vi /usr/local/nagios/etc/nagios.cfg
  # Definitions for monitoring a Windows machine
  cfg_file=/usr/local/nagios/etc/objects/windows.cfg
  # vi /usr/local/nagios/etc/objects/windows.cfg
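  windows.cfg needs almost no changes; just set the host name and the IP address (Figure 5)
  Email notification
  Email alerts could be sent through a configured sendmail, but I know nothing about sendmail, so the mail command is used here. First check nagios.cfg; the entry below was already enabled earlier, so just take a look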
  # vi /usr/local/nagios/etc/nagios.cfg
  cfg_file=/usr/local/nagios/etc/objects/contacts.cfg
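  Note that intervals, periods and similar settings made here apply to all of Nagios, so every client follows them. To override them for an individual item, set them in the corresponding cfg file, as described shortly. There is little else to say about this file: set contact_name and email, and make sure contactgroup_name is spelled identically everywhere it is referenced. members accepts several contacts separated by ","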
  # vi contacts.cfg
  define contact{
  contact_name                  user1
  alias                           Nagios Admin
  service_notification_period   24x7
  host_notification_period      24x7
  service_notification_options    w,u,c,r
  host_notification_options       d,r
  service_notification_commands   notify-by-email
  host_notification_commands      host-notify-by-email
  email                           xxxx@163.com
  pager      13800138000
  }
  define contactgroup{
  contactgroup_name      admins
  alias                   Nagios Administrators
  members               user1
  }
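  notify-by-email and host-notify-by-email are defined in commands.cfg (see Problem 5)
  The following shows the same settings made for a single service; it needs little explanation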
  define service{
  use generic-service ; Name of service template to use
  host_name test_nrpe
  service_description apache
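  is_volatile 0                     ; not a volatile service
  check_period 24x7               ; checked around the clock (24x7)
  max_check_attempts 1            ; maximum number of check attempts
  normal_check_interval 1         ; normal check interval: 1 minute
  retry_check_interval 1            ; retry interval
  contact_groups admins             ; contact group
  notification_options w,u,c,r      ; notify on these four events: w,u,c,r
  notification_interval 960         ; interval between repeated notifications
  notification_period 24x7          ; time period during which notifications are sent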
  check_command check_http!100.0,20%!500.0,60%
  }
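  Parameter notes
  The time period during which service problems are notified; it is one of the periods defined in timeperiods.cfg.
  service_notification_period    24x7
  The time period during which host problems are notified; it is one of the periods defined in timeperiods.cfg.
  host_notification_period       24x7
  Contacts are notified when a service enters one of four states: w = warning, u = unknown, c = critical, r = recovery (back to normal)
  service_notification_options   w,u,c,r
  Contacts are notified when a host enters one of three states: d = down, u = unreachable, r = recovery (back to normal)
  host_notification_options       d,u,r
  The command used to notify about service problems, notify-service-by-email, is defined in commands.cfg and sends an email to the contact. In Nagios 2.x the name may differ, so check commands.cfg. SMS notification can also be configured here, provided you have a script that sends the messages and add the command that calls it to commands.cfg.
  service_notification_commands  notify-service-by-email
  Likewise, host problems are reported to the contact by email
  host_notification_commands   notify-host-by-email
  The contact's email address
  email         yaozhan189@163.com
  The contact's mobile number; this requires SMS notification support, which is not enabled here
  pager      13800138000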
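  As a quick reminder of what the option letters stand for, the sketch below expands a service_notification_options value into words. Only the four letters described above are expanded; explain_service_opts is a hypothetical helper name.

```shell
# explain_service_opts LIST : expands a service_notification_options value
# such as "w,u,c,r" into words, one per line.
# (Hypothetical helper; covers only the four letters discussed here.)
explain_service_opts() {
    printf '%s\n' "$1" | tr ',' '\n' | while read -r o; do
        case "$o" in
            w) echo warning ;;
            u) echo unknown ;;
            c) echo critical ;;
            r) echo recovery ;;
            *) echo "other: $o" ;;
        esac
    done
}

# Usage:
#   explain_service_opts w,u,c,r
```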
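  Problems and solutions
  1.
  # /usr/local/nagios/libexec/check_nrpe -H localhost
  Connection refused by host
  If this appears even on the local machine, nrpe is not running; start it with the start command
  2.
  # /usr/local/nagios/libexec/check_nrpe -H 10.155.2.81
  Connection refused by host
  The client's nrpe.cfg may be missing the server IP address in allowed_hosts, e.g. allowed_hosts=127.0.0.1,10.155.2.65
  The client's firewall may be blocking the connection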
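  The two cases above can be folded into a small helper that turns the check_nrpe output into a hint. This is a sketch that only restates the advice given in Problems 1 and 2; nrpe_hint is a hypothetical name.

```shell
# nrpe_hint MESSAGE : suggests what to check for a check_nrpe result.
# (Hypothetical helper; hints are the ones listed in Problems 1 and 2.)
nrpe_hint() {
    case "$1" in
        "NRPE v"*)
            echo "ok: daemon reachable" ;;
        *"Connection refused"*)
            echo "start nrpe on the target; check allowed_hosts and the firewall (TCP 5666)" ;;
        *)
            echo "no hint for: $1" ;;
    esac
}

# Usage:
#   nrpe_hint "$(/usr/local/nagios/libexec/check_nrpe -H 10.155.2.81 2>&1)"
```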
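  3. The Nagios server has no check_snmp command because net-snmp and net-snmp-utils were not installed before Nagios was installed. Some posts suggest installing them and then reinstalling nagios-plugins-1.4.15; that did not work for me. (Figure 6)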
  # libexec/check_snmp -h
  -bash: libexec/check_snmp: No such file or directory
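  A quick way to see where things stand is to check both for the plugin and for the snmpget tool from net-snmp-utils. The sketch below assumes, as the posts mentioned above suggest, that the plugin is only built when snmpget is found at configure time; snmp_plugin_status is a hypothetical helper name and its advice is a suggestion, not a confirmed fix.

```shell
# snmp_plugin_status LIBEXEC_DIR : reports whether check_snmp exists in
# LIBEXEC_DIR and, if not, whether snmpget is available on the PATH.
# (Hypothetical helper for illustration.)
snmp_plugin_status() {
    if [ -x "$1/check_snmp" ]; then
        echo "check_snmp present"
    elif command -v snmpget >/dev/null 2>&1; then
        echo "check_snmp missing, snmpget found: try re-running configure and make install for nagios-plugins"
    else
        echo "check_snmp missing, snmpget not found: install net-snmp-utils first"
    fi
}

# Usage:
#   snmp_plugin_status /usr/local/nagios/libexec
```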
  4. Some articles recommend installing the following packages before installing Nagios.
  yum -y install gcc gcc-c++ autoconf libjpeg libjpeg-devel libpng libpng-devel freetype freetype-devel libxml2 libxml2-devel zlib zlib-devel glibc glibc-devel glib2 glib2-devel bzip2 bzip2-devel ncurses ncurses-devel curl curl-devel e2fsprogs e2fsprogs-devel krb5 krb5-devel libidn libidn-devel openssl openssl-devel openldap openldap-devel nss_ldap openldap-clients openldap-servers perl gd gd-devel jpeg jpeg-devel libpng libpng-devel net-snmp zlib freetype libart_lgpl cairo-devel pango-devel lrzsz*
  This article comes from a CSDN blog; please cite the source when reposting: http://blog.csdn.net/wonitazansa1/archive/2010/09/06/5866086.aspx
