[Support] ERR 20: Auth Rejected Credentials (client should begin new session)
by TomK
Hey All,
I have an external NFS cluster serviced by a VIP. The clients run
autofs configured via IPA to provide NFS home directories to client.
However, running into an issue on one of the clients and wondering if
anyone seen this message from a tcpdump of a simple mount session that's
preventing the mount:
psql02: mount nfs-c01:/n /m
Yields this message
ERR 20: Auth Rejected Credentials (client should begin new session)
and the mount attempt never exits and never mounts /m . nfs-c01 is a
VIP that's serviced by HAproxy / keepalived. nfs-c01 however has a
record in IPA Server, both forward and a reverse one. Using one of the
underlying hosts that services nfs-c01 works and mounts succeeds for
them. All VM hosts are clones of the same template.
I have autofs running as part of this IPA client setup and applied the
following fix as well:
https://access.redhat.com/solutions/3261981
/m is a test mount folder I'm using on this client to troubleshoot the
autofs mounting issue. So autofs is also running on the same hosts
where I'm trying this mount from.
Trying to trace the exact source of this error and not quite sure where
to look further.
idmipa01/02 are the IPA servers. (192.168.0.44/45 respectively)
psql01/02 are the problem VM's. (192.168.0.108/124 )
nfs01/02 are the NFS hosts. (192.168.0.131/119 )
nfs-c01 192.168.0.80
This works fine on the other two VM hosts without any issue but I just
can't find any difference comparing all the configs and so looking for
suggestions to bounce off of.
--
Cheers,
Tom K.
-------------------------------------------------------------------------------------
Living on earth is expensive, but it includes a free trip around the sun.
Apr 15 23:29:54 psql02 kernel: INFO: task mount.nfs:1443 blocked for
more than 120 seconds.
Apr 15 23:29:54 psql02 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 15 23:29:54 psql02 kernel: mount.nfs D ffff880135ed8000 0
1443 1442 0x00000080
Apr 15 23:29:54 psql02 kernel: Call Trace:
Apr 15 23:29:54 psql02 kernel: [<ffffffff816ac7c9>]
schedule_preempt_disabled+0x29/0x70
Apr 15 23:29:54 psql02 kernel: [<ffffffff816aa5f7>]
__mutex_lock_slowpath+0xc7/0x1d0
Apr 15 23:29:54 psql02 kernel: [<ffffffff816a9a0f>] mutex_lock+0x1f/0x2f
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05ddd58>]
nfs4_discover_server_trunking+0x48/0x2e0 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05e6906>]
nfs4_init_client+0x126/0x300 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffff811e17d3>] ?
kmem_cache_alloc+0x193/0x1e0
Apr 15 23:29:54 psql02 kernel: [<ffffffffc0562526>] ?
__fscache_acquire_cookie+0x66/0x180 [fscache]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc0562526>] ?
__fscache_acquire_cookie+0x66/0x180 [fscache]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc0341b61>] ?
__rpc_init_priority_wait_queue+0x81/0xc0 [sunrpc]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc057c6d6>]
nfs_get_client+0x2c6/0x3e0 [nfs]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05e5de8>]
nfs4_set_client+0x98/0x130 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05e75de>]
nfs4_create_server+0x13e/0x3b0 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05de7ce>]
nfs4_remote_mount+0x2e/0x60 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffff81207549>] mount_fs+0x39/0x1b0
Apr 15 23:29:54 psql02 kernel: [<ffffffff811a7f25>] ?
__alloc_percpu+0x15/0x20
Apr 15 23:29:54 psql02 kernel: [<ffffffff81224177>]
vfs_kern_mount+0x67/0x110
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05de6f6>]
nfs_do_root_mount+0x86/0xc0 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05deaf4>]
nfs4_try_mount+0x44/0xc0 [nfsv4]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc057d627>] ?
get_nfs_version+0x27/0x90 [nfs]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc0589a95>]
nfs_fs_mount+0x4c5/0xd90 [nfs]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc058a9c0>] ?
nfs_clone_super+0x140/0x140 [nfs]
Apr 15 23:29:54 psql02 kernel: [<ffffffffc05888c0>] ?
param_set_portnr+0x70/0x70 [nfs]
Apr 15 23:29:54 psql02 kernel: [<ffffffff81207549>] mount_fs+0x39/0x1b0
Apr 15 23:29:54 psql02 kernel: [<ffffffff811a7f25>] ?
__alloc_percpu+0x15/0x20
Apr 15 23:29:54 psql02 kernel: [<ffffffff81224177>]
vfs_kern_mount+0x67/0x110
Apr 15 23:29:54 psql02 kernel: [<ffffffff81226683>] do_mount+0x233/0xaf0
Apr 15 23:29:54 psql02 kernel: [<ffffffff812272c6>] SyS_mount+0x96/0xf0
Apr 15 23:29:54 psql02 kernel: [<ffffffff816b89fd>]
system_call_fastpath+0x16/0x1b
Apr 15 23:29:54 psql02 kernel: [<ffffffff816b889d>] ?
system_call_after_swapgs+0xca/0x214
Message from syslogd@psql01 at Apr 17 03:08:31 ...
kernel:NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s!
[mount.nfs:1606]
Linux psql02.nix.my.dom 3.10.0-693.21.1.el7.x86_64 #1 SMP Wed Mar 7
19:03:37 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
[root@nfs02 log]# tcpdump -i eth0|grep -v "192.168.0.76"|grep -v
NLB|grep -v nfs01|grep -v netbios|grep -v NBT
tcpdump: verbose output suppressed, use -v or -vv for full protocol decode
listening on eth0, link-type EN10MB (Ethernet), capture size 262144 bytes
01:23:42.183158 IP nfs02.my.dom.xyz.60807 > idmipa01.my.dom.xyz.domain:
23358+ PTR? 76.0.168.192.in-addr.arpa. (43)
01:23:42.183884 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.60807:
23358 NXDomain* 0/1/0 (110)
01:23:42.184090 IP nfs02.my.dom.xyz.59911 > idmipa01.my.dom.xyz.domain:
29059+ PTR? 119.0.168.192.in-addr.arpa. (44)
01:23:42.184601 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.59911:
29059* 1/2/2 PTR nfs02.my.dom.xyz. (153)
01:23:42.184827 IP nfs02.my.dom.xyz.49329 > idmipa01.my.dom.xyz.domain:
50753+ PTR? 44.0.168.192.in-addr.arpa. (43)
01:23:42.185122 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.49329:
50753* 1/2/2 PTR idmipa01.my.dom.xyz. (146)
01:23:42.250263 IP nfs02.my.dom.xyz.49035 > idmipa01.my.dom.xyz.domain:
17264+ PTR? 255.0.168.192.in-addr.arpa. (44)
01:23:42.250983 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.49035:
17264 NXDomain* 0/1/0 (111)
01:23:42.257700 IP nfs02.my.dom.xyz.51938 > idmipa01.my.dom.xyz.domain:
51451+ PTR? 131.0.168.192.in-addr.arpa. (44)
01:23:42.360669 IP nfs02.my.dom.xyz.46447 > idmipa01.my.dom.xyz.domain:
12552+ PTR? 224.0.168.192.in-addr.arpa. (44)
01:23:42.361247 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.46447:
12552 NXDomain* 0/1/0 (111)
01:23:42.361434 IP nfs02.my.dom.xyz.37305 > idmipa01.my.dom.xyz.domain:
34850+ PTR? 223.0.168.192.in-addr.arpa. (44)
01:23:42.361766 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.37305:
34850 NXDomain* 0/1/0 (111)
01:23:42.420742 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:42.421026 IP nfs02.my.dom.xyz.58510 > idmipa01.my.dom.xyz.domain:
7249+ PTR? 18.0.0.224.in-addr.arpa. (41)
01:23:42.421745 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.58510:
7249 1/13/0 PTR vrrp.mcast.net. (277)
01:23:42.751583 IP nfs02.my.dom.xyz.43409 > idmipa01.my.dom.xyz.domain:
29327+ PTR? 222.0.168.192.in-addr.arpa. (44)
01:23:42.752250 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.43409:
29327 NXDomain* 0/1/0 (111)
01:23:43.421723 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:44.422648 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:45.423492 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:46.424492 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:47.188951 ARP, Request who-has nfs02.my.dom.xyz tell
idmipa01.my.dom.xyz, length 46
01:23:47.188966 ARP, Reply nfs02.my.dom.xyz is-at 00:50:56:86:2d:21 (oui
Unknown), length 28
01:23:47.248948 ARP, Request who-has 192.168.0.103 tell 192.168.0.222,
length 46
01:23:47.249297 IP nfs02.my.dom.xyz.50518 > idmipa01.my.dom.xyz.domain:
46693+ PTR? 103.0.168.192.in-addr.arpa. (44)
01:23:47.250150 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.50518:
46693 NXDomain* 0/1/0 (111)
01:23:47.425440 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:48.426303 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:49.427153 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:50.428133 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:51.428574 IP psql02.my.dom.xyz.885 > nfs-c01.my.dom.xyz.nfs: Flags
[S], seq 1812770089, win 29200, options [mss 1460,sackOK,TS val
167449689 ecr 0,nop,wscale 7], length 0
01:23:51.428634 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.885: Flags
[S.], seq 2612074554, ack 1812770090, win 28960, options [mss
1460,sackOK,TS val 172963836 ecr 167449689,nop,wscale 7], length 0
01:23:51.428787 IP psql02.my.dom.xyz.885 > nfs-c01.my.dom.xyz.nfs: Flags
[.], ack 1, win 229, options [nop,nop,TS val 167449689 ecr 172963836],
length 0
01:23:51.428838 IP psql02.my.dom.xyz.885 > nfs-c01.my.dom.xyz.nfs: Flags
[P.], seq 1:45, ack 1, win 229, options [nop,nop,TS val 167449689 ecr
172963836], length 44: NFS request xid 2544096005 40 null
01:23:51.428859 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.885: Flags
[.], ack 45, win 227, options [nop,nop,TS val 172963836 ecr 167449689],
length 0
01:23:51.429003 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:51.429079 IP nfs02.my.dom.xyz.52213 > idmipa01.my.dom.xyz.domain:
17880+ PTR? 80.0.168.192.in-addr.arpa. (43)
01:23:51.429514 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.52213:
17880* 1/2/2 PTR nfs-c01.my.dom.xyz. (154)
01:23:51.429748 IP nfs02.my.dom.xyz.58966 > idmipa01.my.dom.xyz.domain:
50455+ PTR? 124.0.168.192.in-addr.arpa. (44)
01:23:51.430092 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.58966:
50455* 1/2/2 PTR psql02.my.dom.xyz. (154)
01:23:51.430129 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.885: Flags
[P.], seq 1:29, ack 45, win 227, options [nop,nop,TS val 172963838 ecr
167449689], length 28: NFS reply xid 2544096005 reply ok 24 null
01:23:51.430247 IP psql02.my.dom.xyz.885 > nfs-c01.my.dom.xyz.nfs: Flags
[.], ack 29, win 229, options [nop,nop,TS val 167449690 ecr 172963838],
length 0
01:23:51.433079 IP psql02.my.dom.xyz.40999 > nfs-c01.my.dom.xyz.nfs:
Flags [S], seq 3882292069, win 29200, options [mss 1460,sackOK,TS val
167449693 ecr 0,nop,wscale 7], length 0
01:23:51.433124 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.40999:
Flags [S.], seq 876416044, ack 3882292070, win 28960, options [mss
1460,sackOK,TS val 172963841 ecr 167449693,nop,wscale 7], length 0
01:23:51.433214 IP psql02.my.dom.xyz.40999 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 1, win 229, options [nop,nop,TS val 167449693 ecr
172963841], length 0
01:23:51.435147 IP psql02.my.dom.xyz.40999 > nfs-c01.my.dom.xyz.nfs:
Flags [P.], seq 1:693, ack 1, win 229, options [nop,nop,TS val 167449695
ecr 172963841], length 692: NFS request xid 3844890745 688 null
01:23:51.435184 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.40999:
Flags [.], ack 693, win 238, options [nop,nop,TS val 172963843 ecr
167449695], length 0
01:23:51.436257 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.40999:
Flags [P.], seq 1:25, ack 693, win 238, options [nop,nop,TS val
172963844 ecr 167449695], length 24: NFS reply xid 3844890745 reply ERR
20: Auth Rejected Credentials (client should begin new session)
01:23:51.436374 IP psql02.my.dom.xyz.40999 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 25, win 229, options [nop,nop,TS val 167449697 ecr
172963844], length 0
01:23:51.483369 IP nfs02.my.dom.xyz.53527 > idmipa01.my.dom.xyz.domain:
62714+ PTR? 105.0.168.192.in-addr.arpa. (44)
01:23:51.484027 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.53527:
62714 NXDomain* 0/1/0 (111)
01:23:51.487612 IP nfs02.my.dom.xyz.41147 > idmipa01.my.dom.xyz.domain:
4106+ PTR? 100.0.168.192.in-addr.arpa. (44)
01:23:51.487992 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.41147:
4106 NXDomain* 0/1/0 (111)
01:23:52.429933 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:53.430801 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:54.432246 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:55.433173 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:55.453824 IP ovirt01.my.dom.xyz.843 > nfs-c01.my.dom.xyz.nfs:
Flags [P.], seq 2034285336:2034285468, ack 3501594280, win 614, options
[nop,nop,TS val 272589184 ecr 172927798], length 132: NFS request xid
2543449196 128 getattr fh 0,1/53
01:23:55.454235 IP nfs02.my.dom.xyz.34495 > idmipa01.my.dom.xyz.domain:
10353+ PTR? 145.0.168.192.in-addr.arpa. (44)
01:23:55.454456 IP nfs-c01.my.dom.xyz.nfs > ovirt01.my.dom.xyz.843:
Flags [P.], seq 1:85, ack 132, win 788, options [nop,nop,TS val
172967862 ecr 272589184], length 84: NFS reply xid 2543449196 reply ok
80 getattr NON 1 ids 0/50331648 sz 1518458202
01:23:55.454669 IP ovirt01.my.dom.xyz.843 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 85, win 614, options [nop,nop,TS val 272589185 ecr
172967862], length 0
01:23:55.455038 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.34495:
10353* 1/2/2 PTR ovirt01.my.dom.xyz. (155)
01:23:56.434163 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:56.443495 ARP, Request who-has nfs-c01.my.dom.xyz tell
psql02.my.dom.xyz, length 46
01:23:56.443577 ARP, Reply nfs-c01.my.dom.xyz is-at 00:50:56:86:2d:21
(oui Unknown), length 28
01:23:56.541140 IP nfs02.my.dom.xyz.51079 > idmipa01.my.dom.xyz.domain:
58928+ PTR? 14.0.168.192.in-addr.arpa. (43)
01:23:56.541904 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.51079:
58928 NXDomain* 0/1/0 (110)
01:23:57.435087 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:58.436052 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:23:59.437014 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:00.437885 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:00.456019 ARP, Request who-has ovirt01.my.dom.xyz tell
nfs02.my.dom.xyz, length 28
01:24:00.456439 ARP, Reply ovirt01.my.dom.xyz is-at 00:50:56:86:f7:7e
(oui Unknown), length 46
01:24:00.461740 ARP, Request who-has nfs-c01.my.dom.xyz tell
ovirt01.my.dom.xyz, length 46
01:24:00.461754 ARP, Reply nfs-c01.my.dom.xyz is-at 00:50:56:86:2d:21
(oui Unknown), length 28
01:24:01.233709 ARP, Request who-has 192.168.0.222 tell 192.168.0.221,
length 46
01:24:01.234102 IP nfs02.my.dom.xyz.56105 > idmipa01.my.dom.xyz.domain:
38431+ PTR? 221.0.168.192.in-addr.arpa. (44)
01:24:01.234620 ARP, Request who-has 192.168.0.221 tell 192.168.0.222,
length 46
01:24:01.234749 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.56105:
38431 NXDomain* 0/1/0 (111)
01:24:01.438897 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:02.439939 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:03.440864 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:03.552006 ARP, Request who-has 192.168.0.1 tell 192.168.0.105,
length 46
01:24:03.552336 IP nfs02.my.dom.xyz.46470 > idmipa01.my.dom.xyz.domain:
51908+ PTR? 1.0.168.192.in-addr.arpa. (42)
01:24:03.552821 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.46470:
51908 NXDomain* 0/1/0 (109)
01:24:03.563789 ARP, Request who-has 192.168.0.1 tell 192.168.0.100,
length 46
01:24:04.441825 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:04.947786 ARP, Request who-has idmipa01.my.dom.xyz tell
192.168.0.220, length 46
01:24:04.948254 IP nfs02.my.dom.xyz.57360 > idmipa01.my.dom.xyz.domain:
43921+ PTR? 220.0.168.192.in-addr.arpa. (44)
01:24:04.949368 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.57360:
43921 NXDomain* 0/1/0 (111)
01:24:05.442850 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:05.545549 IP 192.168.0.236.connendp > nfs-c01.my.dom.xyz.nfs:
Flags [P.], seq 3632955218:3632955354, ack 762815828, win 229, options
[nop,nop,TS val 797511168 ecr 172937890], length 136: NFS request xid
351180291 132 getattr fh 0,1/53
01:24:05.545893 IP nfs02.my.dom.xyz.44899 > idmipa01.my.dom.xyz.domain:
42739+ PTR? 236.0.168.192.in-addr.arpa. (44)
01:24:05.546508 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.44899:
42739 NXDomain* 0/1/0 (111)
01:24:05.546662 IP nfs-c01.my.dom.xyz.nfs > 192.168.0.236.connendp:
Flags [P.], seq 1:85, ack 136, win 361, options [nop,nop,TS val
172977954 ecr 797511168], length 84: NFS reply xid 351180291 reply ok 80
getattr NON 1 ids 0/83886080 sz 260232538
01:24:05.546890 IP 192.168.0.236.connendp > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 85, win 229, options [nop,nop,TS val 797511169 ecr
172977954], length 0
01:24:06.443781 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:07.444765 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:08.445813 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:08.568058 ARP, Request who-has idmipa01.my.dom.xyz tell
nfs02.my.dom.xyz, length 28
01:24:08.568511 ARP, Reply idmipa01.my.dom.xyz is-at 00:50:56:86:0d:fa
(oui Unknown), length 46
01:24:08.898274 ARP, Request who-has 192.168.0.14 tell 192.168.0.2,
length 46
01:24:08.898657 IP nfs02.my.dom.xyz.54390 > idmipa01.my.dom.xyz.domain:
42138+ PTR? 2.0.168.192.in-addr.arpa. (42)
01:24:08.899280 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.54390:
42138 NXDomain* 0/1/0 (109)
01:24:08.899574 ARP, Request who-has 192.168.0.14 tell 192.168.0.222,
length 46
01:24:09.446886 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:10.447830 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:10.552020 ARP, Request who-has 192.168.0.236 tell
nfs02.my.dom.xyz, length 28
01:24:10.552560 ARP, Reply 192.168.0.236 is-at 00:50:56:86:d7:4c (oui
Unknown), length 46
01:24:10.553359 ARP, Request who-has nfs-c01.my.dom.xyz tell
192.168.0.236, length 46
01:24:10.553369 ARP, Reply nfs-c01.my.dom.xyz is-at 00:50:56:86:2d:21
(oui Unknown), length 28
01:24:11.448844 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:12.449409 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:12.832432 ARP, Request who-has idmipa02.my.dom.xyz tell
192.168.0.1, length 46
01:24:12.832806 IP nfs02.my.dom.xyz.33863 > idmipa01.my.dom.xyz.domain:
37466+ PTR? 45.0.168.192.in-addr.arpa. (43)
01:24:12.833413 IP idmipa01.my.dom.xyz.domain > nfs02.my.dom.xyz.33863:
37466* 1/2/2 PTR idmipa02.my.dom.xyz. (146)
01:24:13.450394 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:14.040800 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.802: Flags
[F.], seq 2201716585, ack 1255679435, win 227, options [nop,nop,TS val
172986448 ecr 167412300], length 0
01:24:14.041146 IP psql02.my.dom.xyz.802 > nfs-c01.my.dom.xyz.nfs: Flags
[F.], seq 1, ack 1, win 229, options [nop,nop,TS val 167472301 ecr
172986448], length 0
01:24:14.041250 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.802: Flags
[.], ack 2, win 227, options [nop,nop,TS val 172986449 ecr 167472301],
length 0
01:24:14.046310 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.58177:
Flags [F.], seq 4209302181, ack 3983186871, win 238, options [nop,nop,TS
val 172986454 ecr 167412306], length 0
01:24:14.086690 IP psql02.my.dom.xyz.58177 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 1, win 229, options [nop,nop,TS val 167472347 ecr
172986454], length 0
01:24:14.451465 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:15.452474 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:16.453415 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:17.454417 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:18.455350 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:19.456342 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:20.457423 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:21.079896 ARP, Request who-has 192.168.0.223 tell 192.168.0.220,
length 46
01:24:21.080332 ARP, Request who-has 192.168.0.220 tell 192.168.0.223,
length 46
01:24:21.458358 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:22.459358 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:22.628588 ARP, Reply nfs02.my.dom.xyz is-at 00:50:56:86:2d:21 (oui
Unknown), length 28
01:24:23.460347 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:24.461359 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:25.462168 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:26.463188 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:27.464196 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:28.465215 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:29.466269 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:30.467227 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:31.468185 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:32.469246 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:33.470258 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:34.471274 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:35.472280 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:35.517800 IP ovirt01.my.dom.xyz.843 > nfs-c01.my.dom.xyz.nfs:
Flags [P.], seq 132:264, ack 85, win 614, options [nop,nop,TS val
272629248 ecr 172967862], length 132: NFS request xid 2560226412 128
getattr fh 0,1/53
01:24:35.518462 IP nfs-c01.my.dom.xyz.nfs > ovirt01.my.dom.xyz.843:
Flags [P.], seq 85:169, ack 264, win 796, options [nop,nop,TS val
173007926 ecr 272629248], length 84: NFS reply xid 2560226412 reply ok
80 getattr NON 1 ids 0/50331648 sz 1518458202
01:24:35.518691 IP ovirt01.my.dom.xyz.843 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 169, win 614, options [nop,nop,TS val 272629249 ecr
173007926], length 0
01:24:36.473289 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:37.474260 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:38.475265 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:39.476199 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:39.786771 ARP, Request who-has idmipa02.my.dom.xyz tell
192.168.0.222, length 46
01:24:40.477213 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:40.520055 ARP, Request who-has ovirt01.my.dom.xyz tell
nfs02.my.dom.xyz, length 28
01:24:40.520485 ARP, Reply ovirt01.my.dom.xyz is-at 00:50:56:86:f7:7e
(oui Unknown), length 46
01:24:40.525642 ARP, Request who-has nfs-c01.my.dom.xyz tell
ovirt01.my.dom.xyz, length 46
01:24:40.525653 ARP, Reply nfs-c01.my.dom.xyz is-at 00:50:56:86:2d:21
(oui Unknown), length 28
01:24:41.478228 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:42.478738 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:43.479744 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:44.480774 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:45.481793 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:45.609470 IP 192.168.0.236.connendp > nfs-c01.my.dom.xyz.nfs:
Flags [P.], seq 136:272, ack 85, win 229, options [nop,nop,TS val
797551232 ecr 172977954], length 136: NFS request xid 367957507 132
getattr fh 0,1/53
01:24:45.610283 IP nfs-c01.my.dom.xyz.nfs > 192.168.0.236.connendp:
Flags [P.], seq 85:169, ack 272, win 369, options [nop,nop,TS val
173018018 ecr 797551232], length 84: NFS reply xid 367957507 reply ok 80
getattr NON 1 ids 0/83886080 sz 260232538
01:24:45.610540 IP 192.168.0.236.connendp > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 169, win 229, options [nop,nop,TS val 797551233 ecr
173018018], length 0
01:24:46.482737 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:47.483693 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:48.484672 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:49.485644 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:50.486707 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:50.615938 ARP, Request who-has 192.168.0.236 tell
nfs02.my.dom.xyz, length 28
01:24:50.616486 ARP, Reply 192.168.0.236 is-at 00:50:56:86:d7:4c (oui
Unknown), length 46
01:24:50.617258 ARP, Request who-has nfs-c01.my.dom.xyz tell
192.168.0.236, length 46
01:24:50.617273 ARP, Reply nfs-c01.my.dom.xyz is-at 00:50:56:86:2d:21
(oui Unknown), length 28
01:24:51.432203 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.885: Flags
[F.], seq 29, ack 45, win 227, options [nop,nop,TS val 173023840 ecr
167449690], length 0
01:24:51.432585 IP psql02.my.dom.xyz.885 > nfs-c01.my.dom.xyz.nfs: Flags
[F.], seq 45, ack 30, win 229, options [nop,nop,TS val 167509693 ecr
173023840], length 0
01:24:51.432617 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.885: Flags
[.], ack 46, win 227, options [nop,nop,TS val 173023840 ecr 167509693],
length 0
01:24:51.437650 IP nfs-c01.my.dom.xyz.nfs > psql02.my.dom.xyz.40999:
Flags [F.], seq 25, ack 693, win 238, options [nop,nop,TS val 173023845
ecr 167449697], length 0
01:24:51.477546 IP psql02.my.dom.xyz.40999 > nfs-c01.my.dom.xyz.nfs:
Flags [.], ack 26, win 229, options [nop,nop,TS val 167509738 ecr
173023845], length 0
01:24:51.487729 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:52.488796 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:53.489834 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:54.490898 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
01:24:55.492024 IP nfs02.my.dom.xyz > vrrp.mcast.net: VRRPv2,
Advertisement, vrid 51, prio 104, authtype none, intvl 1s, length 20
^C1161 packets captured
1162 packets received by filter
0 packets dropped by kernel
[root@nfs02 log]#
6 years, 4 months
Fwd: Ganesha 2.5, crash /segfault while executing nlm4_Unlock
by Sachin Punadikar
---------- Forwarded message ----------
From: Sachin Punadikar <punadikar.sachin(a)gmail.com>
Date: Tue, Jun 26, 2018 at 3:57 PM
Subject: Ganesha 2.5, crash /segfault while executing nlm4_Unlock
To: nfs-ganesha-devel <nfs-ganesha-devel(a)lists.sourceforge.net>
Hi All,
Recently a crash was reported by customer for Ganesha 2.5.
(gdb) where
#0 0x00007f475872900b in pthread_rwlock_wrlock () from
/lib64/libpthread.so.0
#1 0x000000000041eac9 in fsal_obj_handle_fini (obj=0x7f4378028028) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL/commonlib.c:192
#2 0x000000000053180f in mdcache_lru_clean (entry=0x7f4378027ff0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_lru.c:589
#3 0x0000000000536587 in _mdcache_lru_unref (entry=0x7f4378027ff0,
flags=0, func=0x5a9380 <__func__.23209> "cih_remove_checked", line=406)
at /usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_lru.c:1921
#4 0x0000000000543e91 in cih_remove_checked (entry=0x7f4378027ff0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_hash.h:406
#5 0x0000000000544b26 in mdc_clean_entry (entry=0x7f4378027ff0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_helpers.c:235
#6 0x000000000053181e in mdcache_lru_clean (entry=0x7f4378027ff0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_lru.c:592
#7 0x0000000000536587 in _mdcache_lru_unref (entry=0x7f4378027ff0,
flags=0, func=0x5a70af <__func__.23112> "mdcache_put", line=190)
at /usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_lru.c:1921
#8 0x0000000000539666 in mdcache_put (entry=0x7f4378027ff0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_lru.h:190
#9 0x000000000053f062 in mdcache_put_ref (obj_hdl=0x7f4378028028) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/FSAL
/Stackable_FSALs/FSAL_MDCACHE/mdcache_handle.c:1709
#10 0x000000000049bf0f in nlm4_Unlock (args=0x7f4294165830,
req=0x7f4294165028, res=0x7f43f001e0e0)
at /usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/Prot
ocols/NLM/nlm_Unlock.c:128
#11 0x000000000044c719 in nfs_rpc_execute (reqdata=0x7f4294165000) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/Main
NFSD/nfs_worker_thread.c:1290
#12 0x000000000044cf23 in worker_run (ctx=0x3c200e0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/Main
NFSD/nfs_worker_thread.c:1562
#13 0x000000000050a3e7 in fridgethr_start_routine (arg=0x3c200e0) at
/usr/src/debug/nfs-ganesha-2.5.3-ibm013.00-0.1.1-Source/supp
ort/fridgethr.c:550
#14 0x00007f4758725dc5 in start_thread () from /lib64/libpthread.so.0
#15 0x00007f4757de673d in clone () from /lib64/libc.so.6
A closer look at the backtrace indicates that there was cyclic flow of
execution as below:
nlm4_Unlock -> mdcache_put_ref -> mdcache_put -> _mdcache_lru_unref ->
mdcache_lru_clean -> fsal_obj_handle_fini and then mdc_clean_entry ->
cih_remove_checked -> (purposely coping next flow on below line)
-> _mdcache_lru_unref -> mdcache_lru_clean -> fsal_obj_handle_fini
(currently crashing here)
Do we see any code issue here ? Any hints on how to RCA this issue ?
Thanks in advance.
--
with regards,
Sachin Punadikar
--
with regards,
Sachin Punadikar
6 years, 5 months