Check and bring up an NSD disk in GPFS on AIX

We check the state of GPFS and bring the NSD disk back up.

  • We get an error reported by errpt

errpt
9C6C05FA 0517031118 P H mmfs DISK FAILURE
---------------------------------------------------------------------------
LABEL: MMFS_DISKFAIL
IDENTIFIER: 9C6C05FA
Date/Time: Thu May 17 03:11:59 MSK 2018
Sequence Number: 3058
Machine Id: 00A68GGD4B00
Node Id: MY_HOSTNAME_ONE
Class: H
Type: PERM
WPAR: Global
Resource Name: mmfs
Resource Class: NONE
Resource Type: NONE
Location:
Description
DISK FAILURE
Probable Causes
STORAGE SUBSYSTEM
DISK
Failure Causes
STORAGE SUBSYSTEM
DISK
Recommended Actions
CHECK POWER
RUN DIAGNOSTICS AGAINST THE FAILING DEVICE
Detail Data
EVENT CODE
16566394
VOLUME
my_file_system
RETURN CODE
5
PHYSICAL VOLUME
nsd3_host_one
---------------------------------------------------------------------------
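The failing NSD name is the value under PHYSICAL VOLUME in the detail data. A minimal sketch for pulling it out of the detailed report, assuming the label/value layout shown above; the function name `failing_nsd` is hypothetical and not part of AIX or GPFS:

```shell
# Hypothetical helper: reads detailed errpt output on stdin and prints
# the value on the line that follows the "PHYSICAL VOLUME" label.
failing_nsd() {
    awk '
        found { print $1; found = 0 }                    # line after the label holds the NSD name
        $1 == "PHYSICAL" && $2 == "VOLUME" { found = 1 }  # remember that the label was seen
    '
}
```

It could be fed with the detailed report for this error identifier, for example `errpt -a -j 9C6C05FA | failing_nsd`.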

  • Check the state of GPFS

mmlsmount all
File system my_file_system is mounted on 6 nodes.

mmgetstate -aLs
Node number  Node name          Quorum  Nodes up  Total nodes  GPFS state  Remarks
------------------------------------------------------------------------------------
1            MY_HOSTNAME_ONE    1*      6         6            active      quorum node
2            MY_HOSTNAME_TWO    1*      6         6            active      quorum node
3            MY_HOSTNAME_THREE  1*      6         6            active      quorum node
4            MY_HOSTNAME_FOUR   1*      6         6            active      quorum node
5            MY_HOSTNAME_FIVE   1*      6         6            active      quorum node
6            MY_HOSTNAME_SIX    1*      6         6            active      quorum node
Summary information
---------------------
Number of nodes defined in the cluster: 6
Number of local nodes active in the cluster: 6
Number of remote nodes joined in this cluster: 0
Number of quorum nodes defined in the cluster: 6
Number of quorum nodes active in the cluster: 6
Quorum = 1*, Quorum achieved
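On a larger cluster it can help to print only the nodes that are not active. A minimal sketch, assuming the column order of the `mmgetstate -aLs` output above (node number first, GPFS state sixth); the function name `inactive_nodes` is hypothetical:

```shell
# Hypothetical helper: reads `mmgetstate -aLs` output on stdin and prints
# "node-name state" for every node whose GPFS state is not "active".
inactive_nodes() {
    # Only rows that start with a node number are node rows;
    # header and summary lines are skipped.
    awk '$1 ~ /^[0-9]+$/ && $6 != "active" { print $2, $6 }'
}
```

For example, `mmgetstate -aLs | inactive_nodes` prints nothing when every node is active, as in the output above.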

mmlsdisk /dev/my_file_system -L
disk            driver   sector     failure holds    holds                                    storage
name            type       size       group metadata data  status        availability disk id pool         remarks
--------------- -------- ------ ----------- -------- ----- ------------- ------------ ------- ------------ ---------
nsd1            nsd         512           1 yes      yes   ready         up                 1 system
nsd2            nsd         512           2 yes      yes   ready         up                 2 system
nsd3_host_two   nsd         512           3 no       no    ready         down               3 system
nsd3_host_three nsd         512           3 no       no    ready         up                 4 system
nsd3_host_one   nsd         512           3 no       no    ready         down               5 system
nsd3_host_four  nsd         512           3 no       no    ready         up                 6 system
Number of quorum disks: 3
Read quorum value: 2
Write quorum value: 2

mmlsdisk my_file_system -e
disk            driver   sector     failure holds    holds                            storage
name            type       size       group metadata data  status        availability pool
--------------- -------- ------ ----------- -------- ----- ------------- ------------ ------------
nsd3_host_two   nsd         512           3 no       no    ready         down         system
nsd3_host_one   nsd         512           3 no       no    ready         down         system

We can see that two NSD disks are in the down state.
However, these disks hold neither metadata nor data. They are needed only for quorum, which was not lost.
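The same check can be done with a filter instead of reading the table by eye. A minimal sketch, assuming the column order shown above (driver type second, holds metadata fifth, holds data sixth, availability eighth) and the `nsd` driver type; the function name `down_disks` is hypothetical:

```shell
# Hypothetical helper: reads `mmlsdisk <fs> -L` or `mmlsdisk <fs> -e` output
# on stdin and prints "name holds-metadata holds-data" for each down disk.
down_disks() {
    # $2 == "nsd" skips the header and summary lines.
    awk '$2 == "nsd" && $8 == "down" { print $1, $5, $6 }'
}
```

For example, `mmlsdisk my_file_system -L | down_disks`. A line ending in `no no` means the disk carries neither metadata nor data.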

  • Find the device names of the disks

mmlsnsd -X
Disk name        NSD volume ID     Device         Devtype  Node name         Remarks
---------------------------------------------------------------------------------------------------
nsd1             0A140A285228702A  /dev/hdisk105  hdisk    MY_HOSTNAME_ONE
nsd1             0A140A285228702A  /dev/hdisk105  hdisk    MY_HOSTNAME_SIX
nsd1             0A140A285228702A  /dev/hdisk105  hdisk    MY_HOSTNAME_FOUR
nsd1             0A140A285228702A  /dev/hdisk105  hdisk    MY_HOSTNAME_ONE
nsd1             0A140A285228702A  /dev/hdisk105  hdisk    MY_HOSTNAME_FIVE
nsd2             0A140A285228702B  /dev/hdisk106  hdisk    MY_HOSTNAME_ONE
nsd2             0A140A285228702B  /dev/hdisk106  hdisk    MY_HOSTNAME_SIX
nsd2             0A140A285228702B  /dev/hdisk106  hdisk    MY_HOSTNAME_FOUR
nsd2             0A140A285228702B  /dev/hdisk106  hdisk    MY_HOSTNAME_ONE
nsd2             0A140A285228702B  /dev/hdisk106  hdisk    MY_HOSTNAME_FIVE
nsd3_host_one    0A140A29543BCDA2  /dev/hdisk107  hdisk    MY_HOSTNAME_ONE   server node
nsd3_host_four   0A140A2E543BCDA5  /dev/hdisk107  hdisk    MY_HOSTNAME_FIVE  server node
nsd3_host_two    0A140A28542BFCFE  /dev/hdisk107  hdisk    MY_HOSTNAME_ONE   server node
nsd3_host_three  0A140A2D542BFD00  /dev/hdisk107  hdisk    MY_HOSTNAME_FOUR  server node
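To look up a single NSD without scanning the whole list, the output can be filtered by disk name. A minimal sketch, assuming the column order shown above (disk name first, device third, node name fifth); the function name `nsd_device` is hypothetical:

```shell
# Hypothetical helper: reads `mmlsnsd -X` output on stdin and prints
# "device node-name" for every row that matches the given NSD name.
# $1 = NSD name, e.g. nsd3_host_one
nsd_device() {
    awk -v nsd="$1" '$1 == nsd { print $3, $5 }'
}
```

For example, `mmlsnsd -X | nsd_device nsd3_host_one` prints the device and the node that serves it.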

  • Check that the disk is available in the system

lsdev | grep 107
hdisk107 Available Virtual SCSI Disk Drive

  • Check that the disk can be read

dd if=/dev/hdisk107 of=/dev/null bs=1m count=10
10+0 records in
10+0 records out
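When the read test is scripted, the exit status of dd is more reliable than its printed record counts. A minimal sketch of such a wrapper; the function name `can_read` is hypothetical, and `bs=1024k` is used in place of `bs=1m` only because the `k` suffix is the portable one:

```shell
# Hypothetical helper: tries to read 10 blocks of 1 MiB from the given
# path and discards them. Returns 0 if dd reported no error.
# Note: it only detects open and I/O errors; it does not verify that
# the full 10 MiB were actually available.
# $1 = device or file path, e.g. /dev/hdisk107
can_read() {
    dd if="$1" of=/dev/null bs=1024k count=10 >/dev/null 2>&1
}
```

For example: `can_read /dev/hdisk107 && echo "hdisk107 is readable"`.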

  • Change the state of the disk

mmchdisk my_file_system start -d nsd3_host_one
mmnsddiscover: Attempting to rediscover the disks. This may take a while ...
mmnsddiscover: Finished.
MY_HOSTNAME_ONE: Rediscovered nsd server access to nsd3_host_one.
Scanning file system metadata, phase 1 ...
Scan completed successfully.
Scanning file system metadata, phase 2 ...
Scan completed successfully.
Scanning file system metadata, phase 3 ...
Scan completed successfully.
Scanning file system metadata, phase 4 ...
Scan completed successfully.
Scanning user file metadata ...
100.00 % complete on Thu May 17 13:30:02 2018
Scan completed successfully.

  • Check the changes

mmlsdisk /dev/my_file_system -L
disk            driver   sector     failure holds    holds                                    storage
name            type       size       group metadata data  status        availability disk id pool         remarks
--------------- -------- ------ ----------- -------- ----- ------------- ------------ ------- ------------ ---------
nsd1            nsd         512           1 yes      yes   ready         up                 1 system       desc
nsd2            nsd         512           2 yes      yes   ready         up                 2 system       desc
nsd3_host_two   nsd         512           3 no       no    ready         down               3 system
nsd3_host_three nsd         512           3 no       no    ready         up                 4 system       desc
nsd3_host_one   nsd         512           3 no       no    ready         up                 5 system
nsd3_host_four  nsd         512           3 no       no    ready         up                 6 system
Number of quorum disks: 3
Read quorum value: 2
Write quorum value: 2

  • Repeat the same checks for the second disk
  • Change the state of the second disk

mmchdisk my_file_system start -d nsd3_host_two
mmnsddiscover: Attempting to rediscover the disks. This may take a while ...
mmnsddiscover: Finished.
MY_HOSTNAME_ONE: Rediscovered nsd server access to nsd3_host_two.
GPFS: 6027-589 Scanning file system metadata, phase 1 ...
GPFS: 6027-552 Scan completed successfully.
GPFS: 6027-589 Scanning file system metadata, phase 2 ...
GPFS: 6027-552 Scan completed successfully.
GPFS: 6027-589 Scanning file system metadata, phase 3 ...
GPFS: 6027-552 Scan completed successfully.
GPFS: 6027-589 Scanning file system metadata, phase 4 ...
GPFS: 6027-552 Scan completed successfully.
GPFS: 6027-565 Scanning user file metadata ...
100.00 % complete on Thu May 17 13:39:13 2018
GPFS: 6027-552 Scan completed successfully.
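Both disks were started one at a time here. Since `mmchdisk` accepts a semicolon-separated list of disk names in `-d`, several down disks could probably be started in one call, which runs the metadata scan once. A minimal sketch of building that list; the function name `disk_list` is hypothetical:

```shell
# Hypothetical helper: reads disk names on stdin, one per line,
# and joins them into a single "name1;name2;..." string.
disk_list() {
    paste -s -d ';' -
}
```

Under the assumption that availability is the eighth column of the `mmlsdisk -e` output, a combined call might look like `mmchdisk my_file_system start -d "$(mmlsdisk my_file_system -e | awk '$8 == "down" { print $1 }' | disk_list)"`. Confirm on a test file system before relying on it.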

  • Check the changes

mmlsdisk my_file_system -e
GPFS: 6027-623 All disks up and ready
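For monitoring, this final check can be turned into an exit status. A minimal sketch, assuming the message text shown above; the function name `all_disks_up` is hypothetical:

```shell
# Hypothetical helper: reads `mmlsdisk <fs> -e` output on stdin and
# returns 0 only if GPFS reports that all disks are up and ready.
all_disks_up() {
    grep -q 'All disks up and ready'
}
```

For example: `mmlsdisk my_file_system -e | all_disks_up || echo "some disks still need attention"`.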
