From: Marcin Mirosław on
Hello!
I have repetitious (in Oops using kernel from 2.6.32 to 2.6.34.
On host called "wzor" i start up nfs server, nfs is serving data from
separate partition. When a few clients connect to this server using udp,
start copying filef from and to this server, i run:
umount /data/nfs ; /etc/init.d/nfs stop
(sometimes better works this: umount /data/nfs ; /etc/init.d/nfs stop ;
umount /data/nfs )

In 3 times per 10 it triggers Oops.
I couldn't trigger Oops using kernel 2.6.35_rc3-r8.
Clients are connecting using this parameters:
mount.nfs 192.168.138.79:/data/nfs /mnt/test/ -v -o
rw,proto=udp,rsize=8192,wsize=8192,intr,nfsvers=3,nolock,bg,soft

I don't know which commit fixes this in most recent kernel, is it
possible to fix problem in earlier kernels?
Regards,
Marcin

../ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux wzor 2.6.34-gentoo-r1 #7 SMP Thu Jul 8 14:46:09 CEST 2010 x86_64
Intel(R) Xeon(R) CPU E5420 @ 2.50GHz GenuineIntel GNU/Linux

Gnu C 4.4.3
Gnu make 3.81
binutils 2.20.1.20100303
util-linux 2.17.2
mount support
module-init-tools 3.5
e2fsprogs 1.41.11
reiserfsprogs 3.6.21
xfsprogs 3.0.3
Linux C Library 2.11.2
Dynamic linker (ldd) 2.11.2
Procps 3.2.8
Net-tools 1.60
Kbd 1.15
Sh-utils 8.4
Modules Loaded nfsd lockd sunrpc exportfs bnx2 iTCO_wdt

From console:
Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: Oops: 0000 [#1] SMP

Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: last sysfs file: /sys/devices/virtual/block/md0/dev

Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: Stack:

Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: Call Trace:

Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: Code: 41 5f c9 c3 49 8b bd 00 01 00 00 41 bf c3 ff ff ff e8
81 06 00 00 4d 8b b5 00 01 00 00 49 8b 86 80 02 00 00 48 8b 80 70 04 00
00 <48> 8b 50 10 48 85 d2 0f 84 43 04 00 00 48 8d ba b0 00 00 00 48

Message from syslogd(a)wzor at Thu Jul 8 15:20:31 2010 ...
wzor kernel: CR2: 0000000000000010
Killed


From dmesg:
Jul 8 15:20:30 wzor mountd[4890]: Caught signal 15, un-registering and
exiting.
Jul 8 15:20:30 wzor kernel: nfsd: last server has exited, flushing
export cache
Jul 8 15:20:30 wzor kernel: nfsd: last server has exited, flushing
export cache
Jul 8 15:20:31 wzor kernel: BUG: unable to handle kernel NULL pointer
dereference at 0000000000000010
Jul 8 15:20:31 wzor kernel: IP: [<ffffffff81129fb4>]
reiserfs_for_each_xattr+0x94/0x560
Jul 8 15:20:31 wzor kernel: PGD 679885067 PUD 679522067 PMD 0
Jul 8 15:20:31 wzor kernel: Oops: 0000 [#1] SMP
Jul 8 15:20:31 wzor kernel: last sysfs file:
/sys/devices/virtual/block/md0/dev
Jul 8 15:20:31 wzor kernel: CPU 7
Jul 8 15:20:31 wzor kernel: Modules linked in: nfsd lockd sunrpc
exportfs bnx2 iTCO_wdt
Jul 8 15:20:31 wzor kernel:
Jul 8 15:20:31 wzor kernel: Pid: 4950, comm: umount Not tainted
2.6.34-gentoo-r1 #7 0TT740/PowerEdge 1950
Jul 8 15:20:31 wzor kernel: RIP: 0010:[<ffffffff81129fb4>]
[<ffffffff81129fb4>] reiserfs_for_each_xattr+0x94/0x560
Jul 8 15:20:31 wzor kernel: RSP: 0018:ffff88067ebb5bf8 EFLAGS: 00010202
Jul 8 15:20:31 wzor kernel: RAX: 0000000000000000 RBX: ffffffff81129da0
RCX: 0000000000000000
Jul 8 15:20:31 wzor kernel: RDX: 0000000000000000 RSI: ffffffff81129da0
RDI: ffff88067e9ef828
Jul 8 15:20:31 wzor kernel: RBP: ffff88067ebb5ce8 R08: 000000000006bdc4
R09: dead000000200200
Jul 8 15:20:31 wzor kernel: R10: dead000000100100 R11: dead000000200200
R12: 0000000000000000
Jul 8 15:20:31 wzor kernel: R13: ffff88065ba3fcc8 R14: ffff880678cb7c00
R15: 00000000ffffffc3
Jul 8 15:20:31 wzor kernel: FS: 00007f4fe7313740(0000)
GS:ffff8800019c0000(0000) knlGS:0000000000000000
Jul 8 15:20:31 wzor kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Jul 8 15:20:31 wzor kernel: CR2: 0000000000000010 CR3: 000000067b139000
CR4: 00000000000006e0
Jul 8 15:20:31 wzor kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Jul 8 15:20:31 wzor kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0
DR7: 0000000000000400
Jul 8 15:20:31 wzor kernel: Process umount (pid: 4950, threadinfo
ffff88067ebb4000, task ffff88067e8c3380)
Jul 8 15:20:31 wzor kernel: Stack:
Jul 8 15:20:31 wzor kernel: ffff88067ebb5c58 ffff88067ebb5c28
ffff88067ebb5cf8 ffff88067e92e006
Jul 8 15:20:31 wzor kernel: <0> 0000000f00000000 000000000000000f
0000000000000000 0000000000000000
Jul 8 15:20:31 wzor kernel: <0> 0000000000000000 0000000000000000
0000000000000000 0000000000000000
Jul 8 15:20:31 wzor kernel: Call Trace:
Jul 8 15:20:31 wzor kernel: [<ffffffff8102a29b>] ?
__dequeue_entity+0x2b/0x50
Jul 8 15:20:31 wzor kernel: [<ffffffff8112a502>]
reiserfs_delete_xattrs+0x22/0x60
Jul 8 15:20:31 wzor kernel: [<ffffffff8110fb04>]
reiserfs_delete_inode+0x94/0x110
Jul 8 15:20:31 wzor kernel: [<ffffffff810bfcd0>]
generic_delete_inode+0x80/0x120
Jul 8 15:20:31 wzor kernel: [<ffffffff810bfdbd>]
generic_drop_inode+0x4d/0x70
Jul 8 15:20:31 wzor kernel: [<ffffffff810beb6d>] iput+0x5d/0x70
Jul 8 15:20:31 wzor kernel: [<ffffffff810bd101>]
shrink_dcache_for_umount_subtree+0x1d1/0x270
Jul 8 15:20:31 wzor kernel: [<ffffffff810bd1e6>]
shrink_dcache_for_umount+0x46/0x50
Jul 8 15:20:31 wzor kernel: [<ffffffff810accca>]
generic_shutdown_super+0x1a/0x100
Jul 8 15:20:31 wzor kernel: [<ffffffff810acddc>]
kill_block_super+0x2c/0x50
Jul 8 15:20:31 wzor kernel: [<ffffffff811148f3>]
reiserfs_kill_sb+0x93/0xa0
Jul 8 15:20:31 wzor kernel: [<ffffffff810ad320>]
deactivate_super+0x50/0x70
Jul 8 15:20:31 wzor kernel: [<ffffffff810c349e>]
mntput_no_expire+0x9e/0xe0
Jul 8 15:20:31 wzor kernel: [<ffffffff810c3846>] sys_umount+0x76/0x370
Jul 8 15:20:31 wzor kernel: [<ffffffff810023ab>]
system_call_fastpath+0x16/0x1b
Jul 8 15:20:31 wzor kernel: Code: 41 5f c9 c3 49 8b bd 00 01 00 00 41
bf c3 ff ff ff e8 81 06 00 00 4d 8b b5 00 01 00 00 49 8b 86 80 02 00 00
48 8b 80 70 04 00 00 <48> 8b 50 10 48 85 d2 0f 84 43 04 00 00 48 8d ba
b0 00 00 00 48
Jul 8 15:20:31 wzor kernel: RIP [<ffffffff81129fb4>]
reiserfs_for_each_xattr+0x94/0x560
Jul 8 15:20:31 wzor kernel: RSP <ffff88067ebb5bf8>
Jul 8 15:20:31 wzor kernel: CR2: 0000000000000010
Jul 8 15:20:31 wzor kernel: ---[ end trace 8142c08791238025 ]---

../decodecode:

Jul 8 15:20:31 wzor kernel: Code: 41 5f c9 c3 49 8b bd 00 01 00 00 41 bf
c3 ff ff ff e8 81 06 00 00 4d 8b b5 00 01 00 00 49 8b 86 80 02 00 00 48
8b 80 70 04 00 00 <48> 8b 50 10 48 85 d2 0f 84 43 04 00 00 48 8d ba b0
00 00 00 48
All code
========
0: 41 5f pop %r15
2: c9 leaveq
3: c3 retq
4: 49 8b bd 00 01 00 00 mov 0x100(%r13),%rdi
b: 41 bf c3 ff ff ff mov $0xffffffc3,%r15d
11: e8 81 06 00 00 callq 0x697
16: 4d 8b b5 00 01 00 00 mov 0x100(%r13),%r14
1d: 49 8b 86 80 02 00 00 mov 0x280(%r14),%rax
24: 48 8b 80 70 04 00 00 mov 0x470(%rax),%rax
2b:* 48 8b 50 10 mov 0x10(%rax),%rdx <-- trapping
instruction
2f: 48 85 d2 test %rdx,%rdx
32: 0f 84 43 04 00 00 je 0x47b
38: 48 8d ba b0 00 00 00 lea 0xb0(%rdx),%rdi
3f: 48 rex.W

Code starting with the faulting instruction
===========================================
0: 48 8b 50 10 mov 0x10(%rax),%rdx
4: 48 85 d2 test %rdx,%rdx
7: 0f 84 43 04 00 00 je 0x450
d: 48 8d ba b0 00 00 00 lea 0xb0(%rdx),%rdi
14: 48 rex.W


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/