Hi Kaleb,
Setting up of the grace-period is not working in case of active/active setup. we have tried setting up Grace_Period = 20; but it does not impact the problem.
Setup:
3 Node Ceph Cluster,
Node1: ceph(mon/mds/rgw/osd)
Node2: ceph(mon/mds/rgw/osd)+NFS(Active)
Node3: Ceph(only Mon)+NFS(Active)
Issues is :
I/O operations is not resuming after NFS Node is powered off.
if Node 2(refer above) is powered off, it remains stuck unless powered-on.
if Node3is powered off,it takes around 5 min to resume the I/O.
---
Yes i see the ganesha.log for entering and leaving NFS-GRACE?
Next thing to check is whether your switch gear is allowing gratuitous ARP forwarding. It's usually on by default but some admins turn it off if they don't know what it's for.
--
Kaleb