3、虚拟机共享磁盘最好需分配好大小,这是经验之谈,虽然找不到什么理论根据,但是实践表明预分配好的磁盘出现坏块的几率大大减小,很多莫名其妙的问题就是坏块引起的。但是预分配好磁盘可能会在一开始占用空间。
4、虚拟机不要设置网关,这个问题困扰了很久,也将近耗时1天,之前我设置了和公有网卡同一网段的网关,设置好,安装crs之后,古怪的事情发生了:
a、节点间vip经常莫名掉线
b、vip地址经常跑错节点,即1节点启动2节点的vip,但是1节点不能启动自身vip地址
c、vip地址不能被nodeapp带动
这三个怪现象困扰了我好久,期间尝试了crs重装升级,网卡从hostonly模式转成bridge模式,更换网关,但是故障依旧。crsd.log错误日志也仅仅显示
引用
2011-06-24 13:33:51.682: [ CRSRES][570047408]0Attempting to start `ora.racsvr1.vip` on member `racsvr1`
2011-06-24 13:34:04.374: [ CRSAPP][570047408]0StartResource error for ora.racsvr1.vip error code = 1
2011-06-24 13:34:07.884: [ CRSRES][570047408]0Start of `ora.racsvr1.vip` on member `racsvr1` failed.
使劲浑身解数,问题还得不到解决,比较郁闷,此时突然想起,单独尝试启动vip,错误不一样了
引用
[oracle@racsvr1 oracle]$ crs_start ora.racsvr1.vip
Attempting to start `ora.racsvr1.vip` on member `racsvr1`
Start of `ora.racsvr1.vip` on member `racsvr1` failed.
CRS-1006: No more members to consider
CRS-0215: Could not start resource 'ora.racsvr1.vip'.
马上查metalink,根据 metalink 文档 CRS-0215: Could not start resource 'ora..vip' [ID 356535.1]
修改$ORA_CRS_HOME/bin/racgvip将参数FAIL_WHEN_DEFAULTGW_NOT_FOUND=0,此参数的意思vip启动时检查默认网关不存在时不进行报错。
设置好参数之后,继续尝试启动,这次采用nodeapp带动vip,继续报错。这次报错和metalink 文档Bug 5076555: VIP CRASHING FAIL_WHEN_DEFAULTGW_NOT_FOUND=0 SEEMS TO BE IGNORED很类似,但是没有解决方案。这里需要鄙视一下Oracle,Oracle既然已经将其归为bug了,竟然无动于衷,解决方案也不给一个。
引用
[oracle@racsvr1 oracle]$ srvctl start nodeapps -n racsvr1
racsvr1:ora.racsvr1.vip:ping to 10.20.30.99 via eth0 failed, rc = 1 (host=racsvr1)
racsvr1:ora.racsvr1.vip:ping to 10.20.30.99 via eth0 failed, rc = 1 (host=racsvr1)
racsvr1:ora.racsvr1.vip:Interface eth0 checked failed (host=racsvr1)
racsvr1:ora.racsvr1.vip:Invalid parameters, or failed to bring up VIP (host=racsvr1)
CRS-1006: No more members to consider
CRS-0215: Could not start resource 'ora.racsvr1.vip'.
racsvr1:ora.racsvr1.vip:ping to 10.20.30.99 via eth0 failed, rc = 1 (host=racsvr1)
racsvr1:ora.racsvr1.vip:ping to 10.20.30.99 via eth0 failed, rc = 1 (host=racsvr1)
racsvr1:ora.racsvr1.vip:Interface eth0 checked failed (host=racsvr1)
racsvr1:ora.racsvr1.vip:Invalid parameters, or failed to bring up VIP (host=racsvr1)
CRS-1006: No more members to consider
CRS-0215: Could not start resource 'ora.racsvr1.LISTENER_RACSVR1.lsnr'.
其实上面提示已经很明确,ping to 10.20.30.99 via eth0 failed,但是我们ping网关完全没问题
引用
[oracle@racsvr1 oracle]$ ping 10.20.30.99
PING 10.20.30.99 (10.20.30.99) 56(84) bytes of data.
64 bytes from 10.20.30.99: icmp_seq=0 ttl=64 time=0.028 ms
64 bytes from 10.20.30.99: icmp_seq=1 ttl=64 time=0.028 ms
64 bytes from 10.20.30.99: icmp_seq=2 ttl=64 time=0.028 ms
64 bytes from 10.20.30.99: icmp_seq=3 ttl=64 time=0.026 ms
再次启动vip时,终于启动成功了!鼓掌!!!
引用
[oracle@racsvr1 oracle]$ crs_start ora.racsvr1.vip
Attempting to start `ora.racsvr1.vip` on member `racsvr1`
Start of `ora.racsvr1.vip` on member `racsvr1` succeeded.