| 	
Prev: [PATCH -V14 07/11] x86: Add new syscalls for x86_32 Next: pm_qos: only schedule work when in interrupt context 	
		 From: Chris Vine on 15 Jun 2010 13:20 Booting up kernel 2.6.35-rc3 on my netbook, with the kernel compiled for an Atom N270 processor, hangs when starting nfsd. If I then end the hang by rebooting with Ctrl-Alt-Delete, I get a kernel bug reported. The only one of these that has been logged is as follows, as on other boots the bug seems to be reported elsewhere but is not committed to syslog, or no bug is reported at all (it just hangs): .... Jun 15 16:06:18 laptop kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Jun 15 16:07:18 laptop kernel: svc: failed to register lockdv3 RPC service (errno 110). Jun 15 16:07:18 laptop kernel: lockd_up: makesock failed, error=-110 [Ctrl-Alt-Delete here] Jun 15 16:08:21 laptop kernel: lockd_down: no users! task=(null) Jun 15 16:08:21 laptop kernel: ------------[ cut here ]------------ Jun 15 16:08:21 laptop kernel: kernel BUG at fs/lockd/svc.c:347! Jun 15 16:08:21 laptop kernel: invalid opcode: 0000 [#1] SMP Jun 15 16:08:21 laptop kernel: last sysfs file: /sys/module/x_tables/initstate Jun 15 16:08:21 laptop kernel: Modules linked in: nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs xt_recent xt_tcpudp nf_conntrack_ipv4 nf_defrag_i pv4 iptable_filter ip_tables xt_helper xt_conntrack xt_state x_tables nf_conntrack_irc nf_conntrack_ftp nf_conntrack pcmcia pcmcia_core cpufreq_on demand speedstep_lib acpi_cpufreq freq_table mperf parport_pc parport fuse snd_hda_codec_realtek i915 snd_hda_intel drm_kms_helper snd_hda_codec d rm snd_hwdep tg3 uvcvideo i2c_algo_bit snd_pcm videodev rtc_cmos joydev snd_timer rtc_core intel_agp snd soundcore btusb usbhid bluetooth snd_page _alloc video led_class agpgart processor sg rtc_lib output thermal libphy psmouse i2c_i801 rfkill evdev serio_raw thermal_sys ac v4l1_compat batte ry wmi button hwmon [last unloaded: pcmcia_core] Jun 15 16:08:21 laptop kernel: Jun 15 16:08:21 laptop kernel: Pid: 1852, comm: rpc.nfsd Not tainted 2.6.35-rc3 #1 MoutCook/20021,2959 Jun 15 16:08:21 laptop kernel: EIP: 0060:[<f90ef780>] EFLAGS: 00010286 CPU: 1 Jun 15 16:08:21 laptop kernel: EIP is at lockd_down+0x70/0x90 [lockd] Jun 15 16:08:21 laptop kernel: EAX: 00000028 EBX: f6a9c400 ECX: f69e5ec4 EDX: f90f6be4 Jun 15 16:08:21 laptop kernel: ESI: f70e7920 EDI: f70e7928 EBP: f9143a60 ESP: f69e5ec0 Jun 15 16:08:21 laptop kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Jun 15 16:08:21 laptop kernel: Process rpc.nfsd (pid: 1852, ti=f69e4000 task=f7184a20 task.ti=f69e4000) Jun 15 16:08:21 laptop kernel: Stack: Jun 15 16:08:21 laptop kernel: f90f6be4 00000000 f9142995 f70e7900 f70e7900 00000801 f909960b f90ef8f1 Jun 15 16:08:21 laptop kernel: <0> f90f6c74 ffffff92 00000008 00000801 ffffff92 f914279f 00000801 00000000 Jun 15 16:08:21 laptop kernel: <0> f697c004 f69e5f22 f9143440 f91434eb f71df400 f6c027f8 00000101 00000000 Jun 15 16:08:21 laptop kernel: Call Trace: Jun 15 16:08:21 laptop kernel: [<f9142995>] ? nfsd_last_thread+0x25/0x60 [nfsd] Jun 15 16:08:21 laptop kernel: [<f909960b>] ? svc_destroy+0x4b/0x130 [sunrpc] Jun 15 16:08:21 laptop kernel: [<f90ef8f1>] ? lockd_up+0x31/0x1c0 [lockd] Jun 15 16:08:21 laptop kernel: [<f914279f>] ? nfsd_svc+0xcf/0x160 [nfsd] Jun 15 16:08:21 laptop kernel: [<f9143440>] ? write_threads+0x0/0xc0 [nfsd] Jun 15 16:08:21 laptop kernel: [<f91434eb>] ? write_threads+0xab/0xc0 [nfsd] Jun 15 16:08:21 laptop kernel: [<c101fb50>] ? do_page_fault+0x0/0x360 Jun 15 16:08:21 laptop kernel: [<c11460e1>] ? _copy_from_user+0x31/0x80 Jun 15 16:08:21 laptop kernel: [<c10b395f>] ? simple_transaction_get+0x8f/0xa0 Jun 15 16:08:21 laptop kernel: [<f9143ab9>] ? nfsctl_transaction_write+0x59/0x70 [nfsd] Jun 15 16:08:21 laptop kernel: [<c1099a20>] ? vfs_write+0xa0/0x160 Jun 15 16:08:21 laptop kernel: [<c1099ba1>] ? sys_write+0x41/0x70 Jun 15 16:08:21 laptop kernel: [<c12eefc5>] ? syscall_call+0x7/0xb Jun 15 16:08:21 laptop kernel: Code: 05 3c 8e 0f f9 00 00 00 00 b8 c8 7c 0f f9 83 c4 08 e9 15 e3 1f c8 a1 38 8e 0f f9 c7 04 24 e4 6b 0f f9 89 44 24 04 e8 3f d4 1f c8 <0f> 0b eb fe c7 04 24 08 6c 0f f9 e8 2f d4 1f c8 0f 0b eb fe 8d Jun 15 16:08:21 laptop kernel: EIP: [<f90ef780>] lockd_down+0x70/0x90 [lockd] SS:ESP 0068:f69e5ec0 Jun 15 16:08:21 laptop kernel: ---[ end trace 8ca67da05153a656 ]--- This particular trace is with NFS v4 compiled in, but I get a similar hang with only v2/3 compiled in. Chris -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ 	
		 From: J. Bruce Fields on 16 Jun 2010 11:40 Jeff, is this one of the problems you saw? --b. On Tue, Jun 15, 2010 at 05:50:34PM +0100, Chris Vine wrote: > Booting up kernel 2.6.35-rc3 on my netbook, with the kernel compiled for > an Atom N270 processor, hangs when starting nfsd. If I then end the hang > by rebooting with Ctrl-Alt-Delete, I get a kernel bug reported. The only > one of these that has been logged is as follows, as on other boots the bug > seems to be reported elsewhere but is not committed to syslog, or no bug > is reported at all (it just hangs): > > ... > Jun 15 16:06:18 laptop kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Jun 15 16:07:18 laptop kernel: svc: failed to register lockdv3 RPC service (errno 110). > Jun 15 16:07:18 laptop kernel: lockd_up: makesock failed, error=-110 > [Ctrl-Alt-Delete here] > Jun 15 16:08:21 laptop kernel: lockd_down: no users! task=(null) > Jun 15 16:08:21 laptop kernel: ------------[ cut here ]------------ > Jun 15 16:08:21 laptop kernel: kernel BUG at fs/lockd/svc.c:347! > Jun 15 16:08:21 laptop kernel: invalid opcode: 0000 [#1] SMP > Jun 15 16:08:21 laptop kernel: last sysfs file: /sys/module/x_tables/initstate > Jun 15 16:08:21 laptop kernel: Modules linked in: nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs xt_recent xt_tcpudp nf_conntrack_ipv4 nf_defrag_i > pv4 iptable_filter ip_tables xt_helper xt_conntrack xt_state x_tables nf_conntrack_irc nf_conntrack_ftp nf_conntrack pcmcia pcmcia_core cpufreq_on > demand speedstep_lib acpi_cpufreq freq_table mperf parport_pc parport fuse snd_hda_codec_realtek i915 snd_hda_intel drm_kms_helper snd_hda_codec d > rm snd_hwdep tg3 uvcvideo i2c_algo_bit snd_pcm videodev rtc_cmos joydev snd_timer rtc_core intel_agp snd soundcore btusb usbhid bluetooth snd_page > _alloc video led_class agpgart processor sg rtc_lib output thermal libphy psmouse i2c_i801 rfkill evdev serio_raw thermal_sys ac v4l1_compat batte > ry wmi button hwmon [last unloaded: pcmcia_core] > Jun 15 16:08:21 laptop kernel: > Jun 15 16:08:21 laptop kernel: Pid: 1852, comm: rpc.nfsd Not tainted 2.6.35-rc3 #1 MoutCook/20021,2959 > Jun 15 16:08:21 laptop kernel: EIP: 0060:[<f90ef780>] EFLAGS: 00010286 CPU: 1 > Jun 15 16:08:21 laptop kernel: EIP is at lockd_down+0x70/0x90 [lockd] > Jun 15 16:08:21 laptop kernel: EAX: 00000028 EBX: f6a9c400 ECX: f69e5ec4 EDX: f90f6be4 > Jun 15 16:08:21 laptop kernel: ESI: f70e7920 EDI: f70e7928 EBP: f9143a60 ESP: f69e5ec0 > Jun 15 16:08:21 laptop kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 > Jun 15 16:08:21 laptop kernel: Process rpc.nfsd (pid: 1852, ti=f69e4000 task=f7184a20 task.ti=f69e4000) > Jun 15 16:08:21 laptop kernel: Stack: > Jun 15 16:08:21 laptop kernel: f90f6be4 00000000 f9142995 f70e7900 f70e7900 00000801 f909960b f90ef8f1 > Jun 15 16:08:21 laptop kernel: <0> f90f6c74 ffffff92 00000008 00000801 ffffff92 f914279f 00000801 00000000 > Jun 15 16:08:21 laptop kernel: <0> f697c004 f69e5f22 f9143440 f91434eb f71df400 f6c027f8 00000101 00000000 > Jun 15 16:08:21 laptop kernel: Call Trace: > Jun 15 16:08:21 laptop kernel: [<f9142995>] ? nfsd_last_thread+0x25/0x60 [nfsd] > Jun 15 16:08:21 laptop kernel: [<f909960b>] ? svc_destroy+0x4b/0x130 [sunrpc] > Jun 15 16:08:21 laptop kernel: [<f90ef8f1>] ? lockd_up+0x31/0x1c0 [lockd] > Jun 15 16:08:21 laptop kernel: [<f914279f>] ? nfsd_svc+0xcf/0x160 [nfsd] > Jun 15 16:08:21 laptop kernel: [<f9143440>] ? write_threads+0x0/0xc0 [nfsd] > Jun 15 16:08:21 laptop kernel: [<f91434eb>] ? write_threads+0xab/0xc0 [nfsd] > Jun 15 16:08:21 laptop kernel: [<c101fb50>] ? do_page_fault+0x0/0x360 > Jun 15 16:08:21 laptop kernel: [<c11460e1>] ? _copy_from_user+0x31/0x80 > Jun 15 16:08:21 laptop kernel: [<c10b395f>] ? simple_transaction_get+0x8f/0xa0 > Jun 15 16:08:21 laptop kernel: [<f9143ab9>] ? nfsctl_transaction_write+0x59/0x70 [nfsd] > Jun 15 16:08:21 laptop kernel: [<c1099a20>] ? vfs_write+0xa0/0x160 > Jun 15 16:08:21 laptop kernel: [<c1099ba1>] ? sys_write+0x41/0x70 > Jun 15 16:08:21 laptop kernel: [<c12eefc5>] ? syscall_call+0x7/0xb > Jun 15 16:08:21 laptop kernel: Code: 05 3c 8e 0f f9 00 00 00 00 b8 c8 7c 0f f9 83 c4 08 e9 15 e3 1f c8 a1 38 8e 0f f9 c7 04 24 e4 6b 0f f9 89 44 24 04 e8 3f d4 1f c8 <0f> 0b eb fe c7 04 24 08 6c 0f f9 e8 2f d4 1f c8 0f 0b eb fe 8d > Jun 15 16:08:21 laptop kernel: EIP: [<f90ef780>] lockd_down+0x70/0x90 [lockd] SS:ESP 0068:f69e5ec0 > Jun 15 16:08:21 laptop kernel: ---[ end trace 8ca67da05153a656 ]--- > > This particular trace is with NFS v4 compiled in, but I get a similar hang > with only v2/3 compiled in. > > Chris > > > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo(a)vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ 	
		 From: Andreas Schwab on 16 Jun 2010 12:00 Chris Vine <chris(a)cvine.freeserve.co.uk> writes: > Booting up kernel 2.6.35-rc3 on my netbook, with the kernel compiled for > an Atom N270 processor, hangs when starting nfsd. See <https://bugzilla.kernel.org/show_bug.cgi?id=16188>. Andreas. -- Andreas Schwab, schwab(a)redhat.com GPG Key fingerprint = D4E8 DBE3 3813 BB5D FA84 5EC7 45C6 250E 6F00 984E "And now for something completely different." -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ 	
		 From: Jeff Layton on 16 Jun 2010 13:00 On Wed, 16 Jun 2010 11:36:03 -0400 "J. Bruce Fields" <bfields(a)fieldses.org> wrote: > Jeff, is this one of the problems you saw? > > --b. > > On Tue, Jun 15, 2010 at 05:50:34PM +0100, Chris Vine wrote: > > Booting up kernel 2.6.35-rc3 on my netbook, with the kernel compiled for > > an Atom N270 processor, hangs when starting nfsd. If I then end the hang > > by rebooting with Ctrl-Alt-Delete, I get a kernel bug reported. The only > > one of these that has been logged is as follows, as on other boots the bug > > seems to be reported elsewhere but is not committed to syslog, or no bug > > is reported at all (it just hangs): > > No, I don't think we ever saw any oopses from this, but I think I can see what happened here: rpc.nfsd was unable to hand any socket fd's off to the kernel due to being unable to start lockd. Regardless though, it tried to start threads anyway, and called into nfsd_init_socks. It then started a udp socket, and tried to call lockd_up again. That failed, and it returned error. Now sv_permsocks is non-empty but the socket there doesn't hold a lockd reference. The right fix is probably to tear down the socket when lockd_up fails in nfsd_init_socks. I suspect that Chris may be using an older version of rpc.nfsd though that might behave a little differently than the one I was using, and that might account for why he hit this and we didn't. Chris, what version of nfs-utils do you have installed on this box? > > ... > > Jun 15 16:06:18 laptop kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > > Jun 15 16:07:18 laptop kernel: svc: failed to register lockdv3 RPC service (errno 110). > > Jun 15 16:07:18 laptop kernel: lockd_up: makesock failed, error=-110 > > [Ctrl-Alt-Delete here] > > Jun 15 16:08:21 laptop kernel: lockd_down: no users! task=(null) > > Jun 15 16:08:21 laptop kernel: ------------[ cut here ]------------ > > Jun 15 16:08:21 laptop kernel: kernel BUG at fs/lockd/svc.c:347! > > Jun 15 16:08:21 laptop kernel: invalid opcode: 0000 [#1] SMP > > Jun 15 16:08:21 laptop kernel: last sysfs file: /sys/module/x_tables/initstate > > Jun 15 16:08:21 laptop kernel: Modules linked in: nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs xt_recent xt_tcpudp nf_conntrack_ipv4 nf_defrag_i > > pv4 iptable_filter ip_tables xt_helper xt_conntrack xt_state x_tables nf_conntrack_irc nf_conntrack_ftp nf_conntrack pcmcia pcmcia_core cpufreq_on > > demand speedstep_lib acpi_cpufreq freq_table mperf parport_pc parport fuse snd_hda_codec_realtek i915 snd_hda_intel drm_kms_helper snd_hda_codec d > > rm snd_hwdep tg3 uvcvideo i2c_algo_bit snd_pcm videodev rtc_cmos joydev snd_timer rtc_core intel_agp snd soundcore btusb usbhid bluetooth snd_page > > _alloc video led_class agpgart processor sg rtc_lib output thermal libphy psmouse i2c_i801 rfkill evdev serio_raw thermal_sys ac v4l1_compat batte > > ry wmi button hwmon [last unloaded: pcmcia_core] > > Jun 15 16:08:21 laptop kernel: > > Jun 15 16:08:21 laptop kernel: Pid: 1852, comm: rpc.nfsd Not tainted 2.6.35-rc3 #1 MoutCook/20021,2959 > > Jun 15 16:08:21 laptop kernel: EIP: 0060:[<f90ef780>] EFLAGS: 00010286 CPU: 1 > > Jun 15 16:08:21 laptop kernel: EIP is at lockd_down+0x70/0x90 [lockd] > > Jun 15 16:08:21 laptop kernel: EAX: 00000028 EBX: f6a9c400 ECX: f69e5ec4 EDX: f90f6be4 > > Jun 15 16:08:21 laptop kernel: ESI: f70e7920 EDI: f70e7928 EBP: f9143a60 ESP: f69e5ec0 > > Jun 15 16:08:21 laptop kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 > > Jun 15 16:08:21 laptop kernel: Process rpc.nfsd (pid: 1852, ti=f69e4000 task=f7184a20 task.ti=f69e4000) > > Jun 15 16:08:21 laptop kernel: Stack: > > Jun 15 16:08:21 laptop kernel: f90f6be4 00000000 f9142995 f70e7900 f70e7900 00000801 f909960b f90ef8f1 > > Jun 15 16:08:21 laptop kernel: <0> f90f6c74 ffffff92 00000008 00000801 ffffff92 f914279f 00000801 00000000 > > Jun 15 16:08:21 laptop kernel: <0> f697c004 f69e5f22 f9143440 f91434eb f71df400 f6c027f8 00000101 00000000 > > Jun 15 16:08:21 laptop kernel: Call Trace: > > Jun 15 16:08:21 laptop kernel: [<f9142995>] ? nfsd_last_thread+0x25/0x60 [nfsd] > > Jun 15 16:08:21 laptop kernel: [<f909960b>] ? svc_destroy+0x4b/0x130 [sunrpc] > > Jun 15 16:08:21 laptop kernel: [<f90ef8f1>] ? lockd_up+0x31/0x1c0 [lockd] > > Jun 15 16:08:21 laptop kernel: [<f914279f>] ? nfsd_svc+0xcf/0x160 [nfsd] > > Jun 15 16:08:21 laptop kernel: [<f9143440>] ? write_threads+0x0/0xc0 [nfsd] > > Jun 15 16:08:21 laptop kernel: [<f91434eb>] ? write_threads+0xab/0xc0 [nfsd] > > Jun 15 16:08:21 laptop kernel: [<c101fb50>] ? do_page_fault+0x0/0x360 > > Jun 15 16:08:21 laptop kernel: [<c11460e1>] ? _copy_from_user+0x31/0x80 > > Jun 15 16:08:21 laptop kernel: [<c10b395f>] ? simple_transaction_get+0x8f/0xa0 > > Jun 15 16:08:21 laptop kernel: [<f9143ab9>] ? nfsctl_transaction_write+0x59/0x70 [nfsd] > > Jun 15 16:08:21 laptop kernel: [<c1099a20>] ? vfs_write+0xa0/0x160 > > Jun 15 16:08:21 laptop kernel: [<c1099ba1>] ? sys_write+0x41/0x70 > > Jun 15 16:08:21 laptop kernel: [<c12eefc5>] ? syscall_call+0x7/0xb > > Jun 15 16:08:21 laptop kernel: Code: 05 3c 8e 0f f9 00 00 00 00 b8 c8 7c 0f f9 83 c4 08 e9 15 e3 1f c8 a1 38 8e 0f f9 c7 04 24 e4 6b 0f f9 89 44 24 04 e8 3f d4 1f c8 <0f> 0b eb fe c7 04 24 08 6c 0f f9 e8 2f d4 1f c8 0f 0b eb fe 8d > > Jun 15 16:08:21 laptop kernel: EIP: [<f90ef780>] lockd_down+0x70/0x90 [lockd] SS:ESP 0068:f69e5ec0 > > Jun 15 16:08:21 laptop kernel: ---[ end trace 8ca67da05153a656 ]--- > > > > This particular trace is with NFS v4 compiled in, but I get a similar hang > > with only v2/3 compiled in. > > > > Chris > > > > > > -- > > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > > the body of a message to majordomo(a)vger.kernel.org > > More majordomo info at http://vger.kernel.org/majordomo-info.html > > Please read the FAQ at http://www.tux.org/lkml/ -- Jeff Layton <jlayton(a)redhat.com> -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ 	
		 From: Chris Vine on 16 Jun 2010 17:10 On Wed, 16 Jun 2010 12:35:32 -0400 Jeff Layton <jlayton(a)redhat.com> wrote: [snip] > No, I don't think we ever saw any oopses from this, but I think I can > see what happened here: > > rpc.nfsd was unable to hand any socket fd's off to the kernel due to > being unable to start lockd. Regardless though, it tried to start > threads anyway, and called into nfsd_init_socks. It then started a udp > socket, and tried to call lockd_up again. That failed, and it > returned error. Now sv_permsocks is non-empty but the socket there > doesn't hold a lockd reference. > > The right fix is probably to tear down the socket when lockd_up fails > in nfsd_init_socks. > > I suspect that Chris may be using an older version of rpc.nfsd though > that might behave a little differently than the one I was using, and > that might account for why he hit this and we didn't. > > Chris, what version of nfs-utils do you have installed on this box? [snip] It's the stock nfs-utils-1.2.2 which comes with slackware 13.1, which seems to be the latest (stable) release. Chris -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ 
		  | 
Next
 | 
Last
 Pages: 1 2 Prev: [PATCH -V14 07/11] x86: Add new syscalls for x86_32 Next: pm_qos: only schedule work when in interrupt context |