In our system test setups with NFSv4.1 mounts, we keep hitting what appears to be a RHEL
client bug: under some unknown condition, the client fails to send RECLAIM_COMPLETE after
client ID expiry and recovery. Ganesha then keeps returning NFS4ERR_GRACE indefinitely, the
client times out on all of its OPENs, and system testing halts.
We upgraded to RHEL 7.7, but the issue persists. I searched the Red Hat bug tracker but did
not see any outstanding issues about this. Is anyone else here facing this issue? Do we
know whether this is really a RHEL client issue?
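
For anyone not familiar with the rule involved: as I read RFC 5661 (section 18.51), a client
that establishes a new client ID must send RECLAIM_COMPLETE before its first non-reclaim
locking operation, otherwise the server answers NFS4ERR_GRACE. Below is a minimal sketch of
that check. It is not ganesha code; ClientRecord, handle_open and handle_reclaim_complete
are hypothetical names, and the server-wide grace period is deliberately left out.

```python
from dataclasses import dataclass

NFS4_OK = "NFS4_OK"
NFS4ERR_GRACE = "NFS4ERR_GRACE"


@dataclass
class ClientRecord:
    """Hypothetical per-client-ID state; not an actual ganesha structure."""
    reclaim_complete: bool = False


def handle_reclaim_complete(client: ClientRecord) -> str:
    # The client declares that it has finished reclaiming state.
    client.reclaim_complete = True
    return NFS4_OK


def handle_open(client: ClientRecord, is_reclaim_claim: bool) -> str:
    # A non-reclaim OPEN before RECLAIM_COMPLETE is refused with NFS4ERR_GRACE.
    # Assumption: reclaim-type OPENs are simply allowed here; a real server
    # would also check whether it is still inside its own grace period.
    if not is_reclaim_claim and not client.reclaim_complete:
        return NFS4ERR_GRACE
    return NFS4_OK
```

In this model, a client that never calls handle_reclaim_complete gets NFS4ERR_GRACE on every
regular OPEN, which matches what we see in the log below.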
<ganesha.log>
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 1223 :nfs4_op_open :STATE :Entering NFS
v4 OPEN handler -----------------------------
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 1283 :nfs4_op_open :STATE :OPEN Client id
= 5e983c8e00000026
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 507 :create_nfs4_owner :STATE :Previously
known owner {STATE_OPEN_OWNER_NFSV4 0x7fe1475ff610: clientid={0x7fe177711ab0
ClientID={Epoch=0x5e983c8e Counter=0x00000026} CONFIRMED Client={0x7fe1777119e0
name=(45:Linux NFSv4.1 w1h34v25-c0017.vsanstfsad.local) refcount=1} t_delta=0
reservations=2 refcount=4} owner=(24:0x6f70656e2069643a0000002c00000000000000c4d78a2b49)
confirmed=1 seqid=0 refcount=2} is being reused
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 1322 :nfs4_op_open :NFS4
:open4_validate_claim failed
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 1409 :nfs4_op_open :STATE :failed with
status NFS4ERR_GRACE
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 983 :nfs4_Compound :NFS4 :Status of
OP_OPEN in position 2 = NFS4ERR_GRACE, op response size is 4 total response size is 92
2020-04-18T16:28:00Z : epoch 5e983c8e : w1hs3e1512.vsanstfsad.local :
ganesha.nfsd-164[::ffff:172.30.63.199] [svc_495] 1112 :nfs4_Compound :NFS4 :End status =
NFS4ERR_GRACE lastindex = 2
<ganesha.log>
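
To see which clients are stuck, a small script along these lines can count the failed OPENs
per client in a ganesha.log. This is only a sketch: count_grace_opens is a hypothetical
helper, the regular expression is derived from the excerpt above rather than from the
ganesha log format definition, and it assumes the first bracketed field after
ganesha.nfsd-<pid> is the client address.

```python
import re
from collections import Counter

# Matches the "failed with status NFS4ERR_GRACE" message from nfs4_op_open,
# as it appears in the excerpt above (assumed layout, not a documented format).
GRACE_OPEN_RE = re.compile(
    r"ganesha\.nfsd-\d+\[([^\]]+)\] \[[^\]]+\] \d+ "
    r":nfs4_op_open :STATE :failed with status NFS4ERR_GRACE"
)


def count_grace_opens(log_text: str) -> dict:
    """Return {client_address: number of OPENs that failed with NFS4ERR_GRACE}."""
    # Collapse all whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    return dict(Counter(GRACE_OPEN_RE.findall(flat)))
```

Typical use would be count_grace_opens(open("ganesha.log").read()), with the path adjusted
to wherever your log lives.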
We discussed this before in this community and concluded that ganesha behaves as per the
RFC, so we didn't change anything:
https://lists.nfs-ganesha.org/archives/list/devel@lists.nfs-ganesha.org/m...