ewrfdsfwqe 发表于 2014-11-19 11:21:04

ansible+heartbeatV2构建高可用群集

主机的ip地址分配



hostnameIP address role
master192.168.1.250/24ansible
node2.mictiger.com192.168.1.120/24heartbeat,httpd
node3.mictiger.com192.168.1.130/24heartbeat,httpd
1)修改hostname
在node2和node3的/etc/hosts加入如下信息

1
2
192.168.1.120   node2.mictiger.comnode2
192.168.1.130   node3.mictiger.comnode3




2)建立master到node2,node3的节点互信
生成密钥对

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
# ssh-keygen -t rsa -P ''    生成一对密钥
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa):
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
dc:a5:7d:fc:ff:42:8c:8a:55:38:73:87:35:9a:f5:6a root@CentOS
The key's randomart image is:
+--[ RSA 2048]----+
|               |
|            +|
|         ..* o |
|       . .++*.. .|
|      S o=.+o. |
|          . ..E. |
|         o . o.|
|      . .   . .|
|               .+|
+-----------------+




将生成的公钥传到目标主机

1
2
3
# ssh-copy-id -i .ssh/id_rsa.pub root@192.168.1.120
# ssh-copy-id -i .ssh/id_rsa.pub root@192.168.1.130
# ssh-copy-id -i .ssh/id_rsa.pub root@192.168.1.150




在master上验证是否与node2,node3建立互信成功

1
2
3
4
# ssh 192.168.1.130 'date';ssh 192.168.1.120 'date';date
Tue Nov 18 22:39:55 CST 2014
Tue Nov 18 22:40:03 CST 2014
Tue Nov 18 22:40:04 CST 2014




3)安装配置ansible

1
2
3
4
5
6
7
8
9
10
11
# yum install ansible
# vim /etc/ansible/hosts   只需要配置这文件配置要管理的主机

192.168.1.150
192.168.1.120
192.168.1.130

192.168.1.120
192.168.1.130

192.168.1.150




验证ansible是否配置成功

1
2
3
4
5
6
7
8
9
# ansible all -a 'date'
192.168.1.120 | success | rc=0 >>
Tue Nov 18 13:40:21 CST 2014

192.168.1.150 | success | rc=0 >>
Tue Nov 18 13:40:21 CST 2014

192.168.1.130 | success | rc=0 >>
Tue Nov 18 13:40:21 CST 2014




4)利用ansible更新时间,高可用群集必须配置一样的时间

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
# ansible web -a 'ntpdate202.118.1.81'
192.168.1.130 | success | rc=0 >>
18 Nov 22:50:54 ntpdate: step time server 202.118.1.81 offset 1.531700 sec

192.168.1.120 | success | rc=0 >>
18 Nov 22:50:54 ntpdate: adjust time server 202.118.1.81 offset 0.313219 sec
将时间更新写入cron
# ansible all -m cron -a 'name="sync time" minute="*/5" job="/usr/sbin/ntpdate 202.118.1.81 &> /dev/null"'
192.168.1.130 | success >> {
    "changed": true,
    "jobs": [
      "sync time"
    ]
}

192.168.1.120 | success >> {
    "changed": true,
    "jobs": [
      "sync time"
    ]
}

192.168.1.150 | success >> {
    "changed": true,
    "jobs": [
      "sync time"
    ]
}




5) 安装并配置httpd程序

1
2
3
4
5
6
7
8
9
10
11
12
13
14
# ansible web -m yum -a 'name=httpd state=present'
# ansible web -a 'service httpd start'
192.168.1.130 | success | rc=0 >>
Starting httpd: httpd: apr_sockaddr_info_get() failed for CentOS
httpd: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1 for ServerName

192.168.1.120 | success | rc=0 >>
Starting httpd: httpd: Could not reliably determine the server's fully qualified domain name, using localhost.localdomain for ServerName
# touch index.html
# echo heartbeatV2 > index.html
# scp index.html root@192.168.1.130:/var/www/html/
index.html                                                                                                                   100%   16   0.0KB/s   00:00
# scp index.html root@192.168.1.120:/var/www/html/
index.html                                                                                                                   100%   16   0.0KB/s   00:00




验证httpd是否安装成功

1
2
3
4
5
6
7
8
9
10
11
12
13
# ansible master -a 'curl192.168.1.{120,130}'
192.168.1.150 | success | rc=0 >>
--_curl_--192.168.1.120
heartbeatV2
--_curl_--192.168.1.130
heartbeatV2
: 192.168.1.120 -->
% Total    % Received % XferdAverage Speed   Time    Time   TimeCurrent
                                 DloadUpload   Total   Spent    LeftSpeed
0    16    0    16    0   0   5072      0 --:--:-- --:--:-- --:--:--8000

: 192.168.1.130 -->
0    16    0    16    0   010349      0 --:--:-- --:--:-- --:--:-- 10349




将httpd程序stop并不让它开机自动启动

1
2
3
4
5
6
7
8
9
10
11
12
# ansible web -a 'service httpd stop'
192.168.1.120 | success | rc=0 >>
Stopping httpd:

192.168.1.130 | success | rc=0 >>
Stopping httpd:

# ansible web -a 'chkconfig httpd off'
192.168.1.120 | success | rc=0 >>


192.168.1.130 | success | rc=0 >>




6)安装heatbeatV2需要处理依赖关系所以没有用ansible

1
2
3
4
5
6
#rpm -ivh heartbeat-2.1.4-12.el6.x86_64.rpm heartbeat-gui-2.1.4-12.el6.x86_64.rpm heartbeat-stonith-2.1.4-12.el6.x86_64.rpm heartbeat-pils-2.1.4-12.el6.x86_64.rpm
Preparing...                ###########################################
   1:heartbeat-pils         ########################################### [ 25%]
   2:heartbeat-stonith      ########################################### [ 50%]
   3:heartbeat            ########################################### [ 75%]
   4:heartbeat-gui          ###########################################




7)配置heartbeat

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
# cp /usr/share/doc/heartbeat-2.1.4/{authkeys,ha.cf} /etc/ha.d/
配置authkeys
# openssl rand -hex 16    生成16的随机数作为heartbeat传递心跳信息的验证
701782e0e6872f444edd0bbb726871fb
auth 2
#1 crc
#2 sha1 HI!
#3 md5 Hello!
2 sha1 701782e0e6872f444edd0bbb726871f
# chmod 600 authkeys    修改authkeys的权限不让其他人访问
配置heartbeat的主配置文件ha.cf大概配置如下信息(根据个人需求而定)
logfile /var/log/ha-log            日志文件位置
mcast eth0 225.10.10.10 694 1 0    使用组播地址通告和端口
auto_failback on                   自动添加down了又重新上线的节点到群集
node    node2.mictiger.com         配置节点信息
node    node3.mictiger.com

ping 192.168.1.1                   网关地址
配置haresources添加如下信息
node2.mictiger.com       192.168.1.250/24/eth0 httpd    主节点和VIP绑定的网卡,管理的资源
复制node2 的配置信息到node3
# scp -r /etc/ha.d/* root@192.168.1.130:/etc/ha.d/




8)启动heartbeat

1
2
3
4
5
6
7
8
# ansible web -a 'service heartbeat start'
192.168.1.130 | success | rc=0 >>
Starting High-Availability services:
Done.2014/11/18_23:36:19 INFO:Resource is stopped

192.168.1.120 | success | rc=0 >>
Starting High-Availability services:
Done.2014/11/18_23:36:19 INFO:Resource is stopped




查看heartbeat运行在哪个节点上
# ansible web -a 'ifconfig eth0:0'

192.168.1.130 | success | rc=0 >>
eth0:0    Link encap:EthernetHWaddr 00:0C:29:8F:A0:EF
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1

192.168.1.120 | success | rc=0 >>
eth0:0    Link encap:EthernetHWaddr 00:0C:29:57:5C:BB
          inet addr:192.168.1.250Bcast:192.168.1.255Mask:255.255.255.0
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1
# curl 192.168.1.250
heartbeatV2
关闭node2验证是否能够访问验证高可用性
# service heartbeat stop
Stopping High-Availability services:
Done.


页: [1]
查看完整版本: ansible+heartbeatV2构建高可用群集