设为首页 收藏本站
查看: 1234|回复: 0

[经验分享] RHEL5.4 Heartbeat安装(第二部份 测试与监控)

[复制链接]

尚未签到

发表于 2019-1-6 15:37:47 | 显示全部楼层 |阅读模式
9) 启动服务 监视服务

HA1HA2heartheat 服务启动

/etc/init.d/heartbeat start

监视服务:

首先在HA1上查看 messages

#cat /var/log/messages

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: Version 2 support: false

Sep 19 15:56:37 HA1 heartbeat: [26814]: WARN: Logging daemon is disabled --enabling logging daemon is recommended

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: **************************

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: Configuration validated. Starting heartbeat 3.0.2

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: heartbeat: version 3.0.2

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: Heartbeat generation: 1284708296

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_SignalHandler: Added signal handler for signal 17

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: Local status now set to: 'up'

Sep 19 15:56:41 HA1 heartbeat: [26815]: info: Link ha1:eth1 up.

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Link ha2:eth1 up.

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Status update for node ha2: status up

Sep 19 15:56:47 HA1 harc[26822]: info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Comm_now_up(): updating status to active

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Local status now set to: 'active'

Sep 19 15:56:48 HA1 heartbeat: [26815]: info: Status update for node ha2: status active

Sep 19 15:56:48 HA1 harc[26842]: info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: remote resource transition completed.

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: remote resource transition completed.

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: Initial resource acquisition complete (T_RESOURCES(us))

Sep 19 15:57:04 HA1 IPaddr[26898]: INFO:  Resource is stopped

Sep 19 15:57:04 HA1 heartbeat: [26862]: info: Local Resource acquisition completed.

Sep 19 15:57:04 HA1 harc[26941]: info: Running /usr/etc/ha.d//rc.d/ip-request-resp ip-request-resp

Sep 19 15:57:04 HA1 ip-request-resp[26941]: received ip-request-resp IPaddr::172.16.6.66/21/eth0 OK yes

Sep 19 15:57:04 HA1 ResourceManager[26964]: info: Acquiring resource group: ha1 IPaddr::172.16.6.66/21/eth0 test

Sep 19 15:57:05 HA1 IPaddr[26992]: INFO:  Resource is stopped

Sep 19 15:57:05 HA1 ResourceManager[26964]: info: Running /etc/ha.d/resource.d/IPaddr 172.16.6.66/21/eth0 start

Sep 19 15:57:05 HA1 IPaddr[27077]: INFO: Using calculated netmask for 172.16.6.66: 255.255.248.0

Sep 19 15:57:05 HA1 IPaddr[27077]: INFO: eval ifconfig eth0:0 172.16.6.66 netmask 255.255.248.0 broadcast 172.16.7.255

Sep 19 15:57:05 HA1 IPaddr[27051]: INFO:  Success

Sep 19 15:57:05 HA1 logger: /etc/ha.d/resource.d/test called with status

Sep 19 15:57:05 HA1 ResourceManager[26964]: info: Running /etc/ha.d/resource.d/test  start

Sep 19 15:57:05 HA1 logger: /etc/ha.d/resource.d/test called with start



可以看到HA1HA2都启动起来了  我们的test脚本也运行了。我们的 172.16.6.66IP也启来了。

然后再去 ha-log里面看一下


[root@HA1 ~]# cat /var/log/ha-log

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: Version 2 support: false

Sep 19 15:56:37 HA1 heartbeat: [26814]: WARN: Logging daemon is disabled --enabling logging daemon is recommended

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: **************************

Sep 19 15:56:37 HA1 heartbeat: [26814]: info: Configuration validated. Starting heartbeat 3.0.2

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: heartbeat: version 3.0.2

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: Heartbeat generation: 1284708296

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: G_main_add_SignalHandler: Added signal handler for signal 17

Sep 19 15:56:37 HA1 heartbeat: [26815]: info: Local status now set to: 'up'

Sep 19 15:56:41 HA1 heartbeat: [26815]: info: Link ha1:eth1 up.

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Link ha2:eth1 up.

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Status update for node ha2: status up

harc[26822]:    2010/09/19_15:56:47 info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Comm_now_up(): updating status to active

Sep 19 15:56:47 HA1 heartbeat: [26815]: info: Local status now set to: 'active'

Sep 19 15:56:48 HA1 heartbeat: [26815]: info: Status update for node ha2: status active

harc[26842]:    2010/09/19_15:56:48 info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: remote resource transition completed.

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: remote resource transition completed.

Sep 19 15:57:04 HA1 heartbeat: [26815]: info: Initial resource acquisition complete (T_RESOURCES(us))

IPaddr[26898]:         2010/09/19_15:57:04 INFO:  Resource is stopped

Sep 19 15:57:04 HA1 heartbeat: [26862]: info: Local Resource acquisition completed.

harc[26941]:    2010/09/19_15:57:04 info: Running /usr/etc/ha.d//rc.d/ip-request-resp ip-request-resp

ip-request-resp[26941]:  2010/09/19_15:57:04 received ip-request-resp IPaddr::172.16.6.66/21/eth0 OK yes

ResourceManager[26964]:     2010/09/19_15:57:04 info: Acquiring resource group: ha1 IPaddr::172.16.6.66/21/eth0 test

IPaddr[26992]:         2010/09/19_15:57:05 INFO:  Resource is stopped

ResourceManager[26964]:     2010/09/19_15:57:05 info: Running /etc/ha.d/resource.d/IPaddr 172.16.6.66/21/eth0 start

IPaddr[27077]:         2010/09/19_15:57:05 INFO: Using calculated netmask for 172.16.6.66: 255.255.248.0

IPaddr[27077]:         2010/09/19_15:57:05 INFO: eval ifconfig eth0:0 172.16.6.66 netmask 255.255.248.0 broadcast 172.16.7.255

IPaddr[27051]:         2010/09/19_15:57:05 INFO:  Success

ResourceManager[26964]:     2010/09/19_15:57:05 info: Running /etc/ha.d/resource.d/test  start


内容和messages里面的差不多。


HA2里面的日志

Sep 19 23:57:24 HA2 heartbeat: [14041]: info: Version 2 support: false

Sep 19 23:57:24 HA2 heartbeat: [14041]: WARN: Logging daemon is disabled --enabling logging daemon is recommended

Sep 19 23:57:24 HA2 heartbeat: [14041]: info: **************************

Sep 19 23:57:24 HA2 heartbeat: [14041]: info: Configuration validated. Starting heartbeat 3.0.2

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: heartbeat: version 3.0.2

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: Heartbeat generation: 1284893027

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: G_main_add_TriggerHandler: Added signal manual handler

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: G_main_add_SignalHandler: Added signal handler for signal 17

Sep 19 23:57:24 HA2 heartbeat: [14042]: info: Local status now set to: 'up'

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Link ha1:eth1 up.

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Status update for node ha1: status up

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Link ha2:eth1 up.

Sep 19 23:57:26 HA2 harc[14049]: info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Comm_now_up(): updating status to active

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Local status now set to: 'active'

Sep 19 23:57:26 HA2 heartbeat: [14042]: info: Status update for node ha1: status active

Sep 19 23:57:26 HA2 harc[14067]: info: Running /usr/etc/ha.d//rc.d/status status

Sep 19 23:57:42 HA2 heartbeat: [14042]: info: local resource transition completed.

Sep 19 23:57:42 HA2 heartbeat: [14042]: info: Initial resource acquisition complete (T_RESOURCES(us))

Sep 19 23:57:42 HA2 heartbeat: [14086]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys ha2] to acquire.

Sep 19 23:57:43 HA2 heartbeat: [14042]: info: remote resource transition completed.



会看到 info: No local resources [/usr/share/heartbeat/ResourceManager listkeys ha2] to acquire.

说明没有任何本地资源,该机器将作为备份服务器并闲置,它只监听来自主服务器的心跳直到主服务器失败为止。


# tcpdump -i eth1 -n -p udp port 694

可以查看到 eth1过来的心跳广播。如下所示:

23:06:49.576155 IP 10.0.0.1.40661 > 10.0.0.255.ha-cluster: UDP, length 174

23:06:49.734999 IP 10.0.0.2.50487 > 10.0.0.255.ha-cluster: UDP, length 174

23:06:50.324281 IP 10.0.0.1.40661 > 10.0.0.255.ha-cluster: UDP, length 167

23:06:50.324283 IP 10.0.0.1.40661 > 10.0.0.255.ha-cluster: UDP, length 174

23:06:50.486151 IP 10.0.0.2.50487 > 10.0.0.255.ha-cluster: UDP, length 174


10) 模拟故障

我们现在把主服务器的电源直接拔掉 模拟宕机 再找一台机器一直ping 172.16.6.66


我们会在ha2ha-log里面看到如下信息

Sep 20 00:03:47 HA2 heartbeat: [14042]: WARN: node ha1: is dead

Sep 20 00:03:47 HA2 heartbeat: [14042]: WARN: No STONITH device configured.

Sep 20 00:03:47 HA2 heartbeat: [14042]: WARN: Shared disks are not protected.

Sep 20 00:03:47 HA2 heartbeat: [14042]: info: Resources being acquired from ha1.

Sep 20 00:03:47 HA2 heartbeat: [14042]: info: Link ha1:eth1 dead.

harc[14105]:    2010/09/20_00:03:48 info: Running /usr/etc/ha.d//rc.d/status status

Sep 20 00:03:48 HA2 heartbeat: [14106]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys ha2] to acquire.

mach_down[14135]:        2010/09/20_00:03:48 info: Taking over resource group IPaddr::172.16.6.66/21/eth0

ResourceManager[14162]:     2010/09/20_00:03:48 info: Acquiring resource group: ha1 IPaddr::172.16.6.66/21/eth0 test

IPaddr[14190]:         2010/09/20_00:03:48 INFO:  Resource is stopped

ResourceManager[14162]:     2010/09/20_00:03:48 info: Running /etc/ha.d/resource.d/IPaddr 172.16.6.66/21/eth0 start

IPaddr[14275]:         2010/09/20_00:03:48 INFO: Using calculated netmask for 172.16.6.66: 255.255.248.0

IPaddr[14275]:         2010/09/20_00:03:48 INFO: eval ifconfig eth0:0 172.16.6.66 netmask 255.255.248.0 broadcast 172.16.7.255

IPaddr[14249]:         2010/09/20_00:03:48 INFO:  Success

ResourceManager[14162]:     2010/09/20_00:03:48 info: Running /etc/ha.d/resource.d/test  start

mach_down[14135]:        2010/09/20_00:03:48 info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired

mach_down[14135]:        2010/09/20_00:03:49 info: mach_down takeover complete for node ha1.

Sep 20 00:03:49 HA2 heartbeat: [14042]: info: mach_down takeover complete.



我们会看到

node ha1: is dead

说明 ha1以经宕机  

资源脚本首先用status参数调用 然后用start参数启动test 脚本。以完成故障转移。

也可以从messages 里面看到

logger: /etc/ha.d/resource.d/test called with start

说明我们的test脚本以经在HA2中运行了。

再看一下IP地址

HA2eht0:0以经有了 172.16.6.66


一旦完成故障转移,则备份服务器会再次监视主服务器的心跳 如果主服务器启动则会再将服务转移回主服务器。


试验成功。

  如果不需要 主服务器恢复后自动获得主权限 要在ha.cf中 加入一条
auto_failback on




运维网声明 1、欢迎大家加入本站运维交流群:群②:261659950 群⑤:202807635 群⑦870801961 群⑧679858003
2、本站所有主题由该帖子作者发表,该帖子作者与运维网享有帖子相关版权
3、所有作品的著作权均归原作者享有,请您和我们一样尊重他人的著作权等合法权益。如果您对作品感到满意,请购买正版
4、禁止制作、复制、发布和传播具有反动、淫秽、色情、暴力、凶杀等内容的信息,一经发现立即删除。若您因此触犯法律,一切后果自负,我们对此不承担任何责任
5、所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其内容的准确性、可靠性、正当性、安全性、合法性等负责,亦不承担任何法律责任
6、所有作品仅供您个人学习、研究或欣赏,不得用于商业或者其他用途,否则,一切后果均由您自己承担,我们对此不承担任何法律责任
7、如涉及侵犯版权等问题,请您及时通知我们,我们将立即采取措施予以解决
8、联系人Email:admin@iyunv.com 网址:www.yunweiku.com

所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其承担任何法律责任,如涉及侵犯版权等问题,请您及时通知我们,我们将立即处理,联系人Email:kefu@iyunv.com,QQ:1061981298 本贴地址:https://www.yunweiku.com/thread-660046-1-1.html 上篇帖子: HA集群之Heartbeat浅析及heartbeat+haresource 下篇帖子: heartbeat v2基于haresources实现HA Web
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

扫码加入运维网微信交流群X

扫码加入运维网微信交流群

扫描二维码加入运维网微信交流群,最新一手资源尽在官方微信交流群!快快加入我们吧...

扫描微信二维码查看详情

客服E-mail:kefu@iyunv.com 客服QQ:1061981298


QQ群⑦:运维网交流群⑦ QQ群⑧:运维网交流群⑧ k8s群:运维网kubernetes交流群


提醒:禁止发布任何违反国家法律、法规的言论与图片等内容;本站内容均来自个人观点与网络等信息,非本站认同之观点.


本站大部分资源是网友从网上搜集分享而来,其版权均归原作者及其网站所有,我们尊重他人的合法权益,如有内容侵犯您的合法权益,请及时与我们联系进行核实删除!



合作伙伴: 青云cloud

快速回复 返回顶部 返回列表