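I. Fault Symptoms
Monitoring reported a device fault: the failed SVC node stopped serving external I/O. The root cause had not yet been confirmed.
II. Problem Confirmation
1. Log in to the management GUI and check the error messages
It was confirmed that, after a long period of operation under heavy read/write I/O, the node's internal disk had failed.
2. Confirm the failed node details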
io_grp1 lvgsvc4
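SVC version: 6.4.1.6
Machine type/model: 2145-CF8
Failed drive: PN 42D0673
3. Confirm the physical location of the device
Log in to the device from the command line and confirm the information:
Once confirmed, prepare the replacement drive and replace it according to the procedure (keeping spare drives on hand is recommended in case of further damage).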
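The command-line check above can be sketched as follows. This is a minimal illustration, not the procedure actually used: it assumes the node list was captured as comma-delimited text (for example from `svcinfo lsnode -delim ,`, which the original note does not show), and the sample text is hypothetical, with a reduced set of columns and invented names for the healthy nodes.

```python
import csv
import io

# Hypothetical sample: a reduced subset of columns, NOT real output from this cluster.
# Only lvgsvc4 / io_grp1 come from the note; the other node names are invented.
SAMPLE = """id,name,status,IO_group_name
1,lvgsvc1,online,io_grp0
2,lvgsvc2,online,io_grp0
3,lvgsvc3,online,io_grp1
4,lvgsvc4,offline,io_grp1
"""


def parse_delimited(text, delim=","):
    """Parse header-plus-rows CLI output into a list of dicts keyed by column name."""
    reader = csv.DictReader(io.StringIO(text.strip()), delimiter=delim)
    return [dict(row) for row in reader]


def nodes_not_online(rows):
    """Return every node whose status column is anything other than 'online'."""
    return [row for row in rows if row.get("status") != "online"]


failed = nodes_not_online(parse_delimited(SAMPLE))
for node in failed:
    print(node["name"], node["IO_group_name"], node["status"])
```

A comma is used as the delimiter here because some fields in real node listings can themselves contain colons.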
III. Replacement Procedure
1. Make sure that the node cover is in place and fully closed.
Enter maintenance mode: power off the node.
2. Touch the static-protective package that contains the drive to any unpainted metal surface on the node; then, remove the drive from the package and place it on a static-protective surface.
3. Make sure that the disk-drive handle is in the open (unlocked) position.
4. Align the drive assembly with the guide rails in drive bay 4 for SAN Volume Controller 2145-CF8 nodes.
5. Gently push the drive assembly into the bay until the drive stops.
6. Install the service controller.
7. Make sure that all cables, adapters, and other components are installed and seated correctly and that you have not left loose tools or parts inside the node. Make sure that all internal cables are correctly routed. If you disconnected the Fibre Channel and Ethernet cables, make sure that each cable is reconnected to the same port from which it was removed.
8. Turn on the node. When you turn on the node, use the node rescue procedure to install the SAN Volume Controller software on the new disk.
Completing the node rescue when the node boots
1) Turn off the node.
2) Press and hold the left and right buttons on the front panel.
3) Press the power button.
4) Continue to hold the left and right buttons until the node-rescue-request symbol is displayed on the front panel.
Results
Figure 1. Node rescue display
The node rescue request symbol displays on the front panel display until the node starts to boot from the service controller. If the node rescue request symbol displays for more than two minutes, go to the hardware boot MAP to resolve the problem. When the node rescue starts, the service display shows the progress or failure of the node rescue operation.
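9. Add the node back into the cluster.
10. Log in to the management console and close the event
Run the fix procedure.
After confirming that the node drive replacement is complete, click Confirm; the device alert is then cleared.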
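IV. Fault Summary
When replacing the local drive of an SVC node, pay attention to the following three points:
1. Confirm which node the failed drive belongs to and where it is located;
2. Know how to start node rescue mode;
3. Watch the communication links during recovery. If the process stalls for more than 2 minutes, check the communication links, including the FC links and the device's internal links.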
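The two-minute rule in point 3 amounts to polling with a deadline. The sketch below is only an illustration of that idea: `wait_for` and `FakeClock` are hypothetical helpers, and `check` stands in for whatever progress signal is available (on the real node this is read from the front panel by the operator, not by a script).

```python
import time


def wait_for(check, timeout_s=120.0, interval_s=5.0,
             clock=time.monotonic, sleep=time.sleep):
    """Poll check() until it returns True or timeout_s has elapsed.

    Returns True if check() succeeded, False if the deadline passed first.
    """
    deadline = clock() + timeout_s
    while True:
        if check():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


class FakeClock:
    """Simulated clock so the example runs instantly instead of waiting 2 minutes."""

    def __init__(self):
        self.now = 0.0

    def time(self):
        return self.now

    def sleep(self, seconds):
        self.now += seconds


# Simulate a rescue that never makes progress: the check always fails.
fake = FakeClock()
stalled = not wait_for(lambda: False, clock=fake.time, sleep=fake.sleep)
print("check FC and internal links" if stalled else "rescue progressing")
```

Passing the clock and sleep functions as parameters keeps the timeout logic testable without real waiting; with the defaults it uses the monotonic system clock.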