Hello,
In our environment of Ceph Cluster(version 15.2.7) we are trying to use NFS HA Mode.Facing certain issues in the same as below:
"Active/Passive HA NFS Cluster"
When we are using Active/Passive HA Config for NFS Server using Corosync/Pacemekar:
1. configuration is done and we are able to perform fail-over, but when an active node is tested with power-off two scenarios are observed:
1.1 : I/O operations gets stuck until the node is powered on although the handover from active to other standby node happens immediately once the node is powered-off. All the existing requests are stuck.
And you're sure they're not just stuck for the duration of NFS-GRACE? (Default = 90sec)
Do you see the entries in the ganesha logs for entering and leaving NFS-GRACE?
Are you using the appropriate fencing agent(s) for your hardware?
--
Kaleb