Hi, comments inline.
On 3/12/20 1:57 AM, Michael Bisig wrote:
Hi all,
I am moving this issue/question from ceph-users to the nfs-ganesha devel list, as requested
by Daniel Gryniewicz. (Thanks for pointing me to that list.)
I might have a configuration issue, or at least a non-optimally working Ganesha cluster,
and I hope you can help me. :) I am not sure whether my problem is by design, a bug, or
just a configuration issue.
Anyway, thanks in advance for your help and time!
Specs:
Ceph v14.2.8
Ganesha v3.0 in a Docker container based on an Ubuntu 18.04 image
Config: Please find my configuration attached. (Other values, such as GracePeriod or
LeaseTime, are left at their defaults.)
Setup:
Two running Ganesha daemons, which I registered in the grace db (with the rados_cluster
recovery backend). The db lives in the cephfsmetadata pool in a separate namespace. I
added the two nodes to the db using:
ganesha-rados-cluster add a
ganesha-rados-cluster add b
(both against the correct pool and namespace in Ceph)
Both daemons can read from and write to the db, which is fine. They also clean up rec-XX
files after a restart (i.e., delete them once they are outdated). I can mount the exported
NFS path through both daemons. So far so good!
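For reference, this is roughly how I check the grace db state, using the ganesha-rados-grace
tool that ships with Ganesha (which I assume is what the add commands above correspond to;
the namespace below is a placeholder for the one in my config):
  # dump the grace db epoch and the per-node flags (N = need grace, E = enforcing)
  ganesha-rados-grace --pool cephfsmetadata --ns <my-namespace> dump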
Problem:
When I turn off one daemon (e.g. b), i.e. stop its container, the shutdown works
smoothly and the db finally shows:
a E
b NE
I assume that all clients connected to b are stale. But I observe that all clients
connected to a are stale as well (or at least most operations hang), meaning I can neither
read from nor write to the mounted filesystem. I can still ls the mountpoint, so it is not
completely broken. This cluster state is never cleaned up; waiting for 5 minutes did not
change the behavior on ganesha a. I would expect that, at least after some period, the
clients connected to daemon a could read/write as usual. The db entries do not change
either.
So, as I said on the other list, new opens/reads/writes/locks will be
blocked on the entire cluster for the duration of the grace period.
This makes testing read/write somewhat problematic. You have to have a
process that has a file already open, and is waiting to read or write,
and then trigger the failure.
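A crude way to set that up, just as a sketch (the mount path is a placeholder for wherever
the export is mounted on your client):
  # open one file on the NFS mount and keep appending to it; the redirection on the
  # loop keeps the file open the whole time, so if grace works as intended the writer
  # should only stall during the grace period and then resume
  while true; do date; sleep 1; done >> /mnt/nfs/grace-test.log
Then stop daemon b and watch whether the loop resumes once the grace period on a is lifted.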
Long-running processes that operate on a single set of files should continue
uninterrupted; this means things like database servers. However, even for long-running
processes, opening new files will fail; this means things like web servers.
The grace *should* be lifted after a grace period (default 90 seconds),
regardless of success or failure. The code looks like it does this, but
something is obviously wrong here.
If daemon b crashes (instead of being shut down), the clients connected to daemon a can
still read/write and are not affected by the crash of b, so this is fine for the crash
case. This is probably because daemon b cannot set the NEED flag in the db.
After a while, the running daemon a shows a heartbeat warning, which is certainly expected
and a very handy message to let you know that something in the cluster is shaky.
This tells me that grace isn't working when there's a crash, or
read/write would be blocked. Obviously Ganesha can't trigger grace in
this case, since it's crashed. Something else needs to monitor the
state of the Ganesha servers, and trigger grace when one of them
crashes. In a traditional HA setup, this is pacemaker/corosync or
ctdb. In a containerized environment, this should probably be
kubernetes. My guess is that nothing's doing this for you, and
therefore you aren't protected from a failure.
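As a very rough sketch of what such a monitor would do (pool and namespace are placeholders,
this assumes the ganesha-rados-grace tool from Ganesha 3.x, and a real setup would use
pacemaker/kubernetes liveness checks rather than pgrep):
  # watch the local ganesha.nfsd on node b; if it dies, request a new grace period
  # with b listed as needing recovery, so the surviving node enforces grace and b's
  # clients can reclaim their state once it comes back
  while pgrep -x ganesha.nfsd >/dev/null; do
      sleep 5
  done
  ganesha-rados-grace --pool cephfsmetadata --ns <my-namespace> start b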
Expectation:
I would expect that a proper shutdown of one daemon does not affect the clients connected
to the still-running ganesha a.
Logs are very clean:
# Situation where I stopped daemon b
11/03/2020 15:46:06 : epoch 5e68d0c3 : a : ganesha.nfsd-1[reaper] nfs_lift_grace_locked
:STATE :EVENT :NFS Server Now NOT IN GRACE
11/03/2020 15:46:31 : epoch 5e68d0c3 : a : ganesha.nfsd-1[reaper] nfs_start_grace :STATE
:EVENT :NFS Server Now IN GRACE, duration 90
--> and here it hangs (no GRACE lift appears, even after waiting 5-10 minutes, which is
not nice in an active-active environment)
Once I start the daemon again, everything works like a charm! And the logs show only ONE
additional line (compared to above):
11/03/2020 15:46:06 : epoch 5e68d0c3 : a : ganesha.nfsd-1[reaper] nfs_lift_grace_locked
:STATE :EVENT :NFS Server Now NOT IN GRACE
11/03/2020 15:46:31 : epoch 5e68d0c3 : a : ganesha.nfsd-1[reaper] nfs_start_grace :STATE
:EVENT :NFS Server Now IN GRACE, duration 90
11/03/2020 15:54:53 : epoch 5e68d0c3 : a : ganesha.nfsd-1[reaper] nfs_lift_grace_locked
:STATE :EVENT :NFS Server Now NOT IN GRACE
I do not have any more informative logs with warnings or errors (using log level
FULL_DEBUG); everything seems to work just fine!
Any explanation would help me understand the situation.
Can you post a full log of the case where grace was never exited?
Ideally from both servers (the one that was restarted, and the one that
stayed up).
Daniel