Prev: [PATCHv5 07/16] AFFS: fix race condition in marking SB dirty
Next: [PATCH 6/7] Add scripts/coccinelle/err_cast.cocci
From: Sachin Sant on 6 Jun 2010 11:10 While executing LTP Controller tests(memcg regression) on a POWER6 box came across this following OOPS. Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a child Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB Memory cgroup out of memory: kill process 9140 (memcg_test_1) score 3 or a child Killed process 9140 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB Unable to handle kernel paging request for data at address 0x720072007200720 Faulting instruction address: 0xc00000000015b778 Oops: Kernel access of bad area, sig: 11 [#2] SMP NR_CPUS=1024 NUMA pSeries last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000 REGS: c000000009812ff0 TRAP: 0300 Tainted: G D (2.6.35-rc2-autotest) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424 XER: 00000001 DAR: 0720072007200720, DSISR: 0000000040000000 TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0 GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000 GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870 GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000 GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000 GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027 GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138 GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000 GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720 NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c Call Trace: [c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c (unreliable) [c000000009813310] [c00000000019bafc] .alloc_buffer_head+0x2c/0x78 [c000000009813390] [c00000000019c99c] .alloc_page_buffers+0x60/0x114 [c000000009813450] [c00000000019ca78] .create_empty_buffers+0x28/0x140 [c0000000098134e0] [c00000000019f2ec] .__block_prepare_write+0xe4/0x4f0 [c000000009813610] [c00000000019f94c] .block_write_begin_newtrunc+0xa8/0x120 [c0000000098136d0] [c00000000019fea0] .block_write_begin+0x34/0x8c [c000000009813770] [c00000000022b458] .ext3_write_begin+0x13c/0x298 [c000000009813880] [c000000000117500] .generic_file_buffered_write+0x13c/0x320 [c0000000098139b0] [c000000000119c80] .__generic_file_aio_write+0x378/0x3dc [c000000009813ab0] [c000000000119d68] .generic_file_aio_write+0x84/0xfc [c000000009813b60] [c00000000016e460] .do_sync_write+0xac/0x10c [c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc [c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0 [c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40 Instruction dump: 38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14 7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378 ---[ end trace f24cb0cb5729d2bb ]--- And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...) was good. Thanks -Sachin -- --------------------------------- Sachin Sant IBM Linux Technology Center India Systems and Technology Labs Bangalore, India --------------------------------- -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Al Viro on 6 Jun 2010 11:50 On Sun, Jun 06, 2010 at 08:36:54PM +0530, Sachin Sant wrote: > And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...) > was good. That's very odd, since ; git diff --stat 6c5de280b6..v2.6.35-rc2 Makefile | 2 +- drivers/gpu/drm/i915/intel_display.c | 9 +++++++ fs/ext4/inode.c | 40 +++++++++++++++++++-------------- fs/ext4/move_extent.c | 3 ++ 4 files changed, 36 insertions(+), 18 deletions(-) ; and nothing of that looks like good candidates... -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Markus Trippelsdorf on 6 Jun 2010 12:20 >> And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...) >> was good. > >That's very odd, since >; git diff --stat 6c5de280b6..v2.6.35-rc2 > Makefile | 2 +- > drivers/gpu/drm/i915/intel_display.c | 9 +++++++ > fs/ext4/inode.c | 40 +++++++++++++++++++-------------- > fs/ext4/move_extent.c | 3 ++ > 4 files changed, 36 insertions(+), 18 deletions(-) > >and nothing of that looks like good candidates... I may have the same problem on my machine. (See also the thread: ext4 2.6.35-rc2 regression (ext4: Make sure the MOVE_EXT ioctl...)) general protection fault: 0000 [#1] SMP last sysfs file: /sys/devices/pci0000:00/0000:00:11.0/host2/target2:0:0/2:0:0:0/block/sdb/size CPU 2 Pid: 1683, comm: iptables-restor Not tainted 2.6.35-rc2-00033-gcc1f375 #46 M4A78T-E/System Product Name RIP: 0010:[<ffffffff810cc6e6>] [<ffffffff810cc6e6>] kmem_cache_alloc+0x59/0xda RSP: 0018:ffff88011c993d78 EFLAGS: 00010002 RAX: 0000000000000000 RBX: 0720072007200720 RCX: ffffffff810bd4c9 RDX: 00007f076cee3000 RSI: 00000000000000d0 RDI: ffff88011fc01800 RBP: ffff88011c993db8 R08: ffff880001b13f48 R09: 0000000000000000 R10: ffff88011d387c00 R11: ffff88011c983930 R12: ffff88011fc01800 R13: 0000000000000202 R14: 00000000000000d0 R15: 00000000000000d0 FS: 00007f076dc43700(0000) GS:ffff880001b00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b CR2: 00007f8595d364f8 CR3: 000000011b8b0000 CR4: 00000000000006e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Process iptables-restor (pid: 1683, threadinfo ffff88011c992000, task ffff88011ec09610) Stack: ffff88011d387c10 ffff88011c983930 ffff88011c993d98 000000000000fffa <0> ffff88011d387bd0 00007f076cee3000 ffff88011f77ea40 0000000000000000 <0> ffff88011c993e08 ffffffff810bd4c9 ffff88011b8f5cc0 ffffffff810bd639 Call Trace: [<ffffffff810bd4c9>] __split_vma+0x33/0x18d [<ffffffff810bd639>] ? vma_merge+0x16/0x1fc [<ffffffff810bdc01>] split_vma+0x23/0x28 [<ffffffff810bf572>] mprotect_fixup+0x146/0x54c [<ffffffff810befff>] ? do_mmap_pgoff+0x2a4/0x2fe [<ffffffff810bfaf0>] sys_mprotect+0x178/0x1f4 [<ffffffff8102b93b>] system_call_fastpath+0x16/0x1b Code: 65 4c 8b 04 25 88 d4 00 00 48 8b 07 49 01 c0 49 8b 18 48 85 db 75 10 83 ca ff 44 89 f6 e8 58 fa ff ff 48 89 c3 eb 0b 48 63 47 18 <48> 8b 04 03 49 89 00 41 55 9d 48 85 db 74 15 41 81 e6 00 80 00 RIP [<ffffffff810cc6e6>] kmem_cache_alloc+0x59/0xda RSP <ffff88011c993d78> ---[ end trace e2fb1ccd3cb9dd77 ]--- -- Markus -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: Maciej Rutecki on 10 Jun 2010 16:10 I created a Bugzilla entry at https://bugzilla.kernel.org/show_bug.cgi?id=16178 for your bug report, please add your address to the CC list in there, thanks! On niedziela, 6 czerwca 2010 o 17:06:54 Sachin Sant wrote: > While executing LTP Controller tests(memcg regression) on > a POWER6 box came across this following OOPS. > > Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a > child Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, > file-rss:1088kB Memory cgroup out of memory: kill process 9140 > (memcg_test_1) score 3 or a child Killed process 9140 (memcg_test_1) > vsz:3456kB, anon-rss:448kB, file-rss:1088kB Unable to handle kernel paging > request for data at address 0x720072007200720 Faulting instruction > address: 0xc00000000015b778 > Oops: Kernel access of bad area, sig: 11 [#2] > SMP NR_CPUS=1024 NUMA pSeries > last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map > Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom > sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: > c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000 > REGS: c000000009812ff0 TRAP: 0300 Tainted: G D > (2.6.35-rc2-autotest) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424 > XER: 00000001 > DAR: 0720072007200720, DSISR: 0000000040000000 > TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0 > GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000 > GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870 > GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000 > GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000 > GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027 > GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138 > GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000 > GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720 > NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c > LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c > Call Trace: > [c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c > (unreliable) [c000000009813310] [c00000000019bafc] > .alloc_buffer_head+0x2c/0x78 [c000000009813390] [c00000000019c99c] > .alloc_page_buffers+0x60/0x114 [c000000009813450] [c00000000019ca78] > .create_empty_buffers+0x28/0x140 [c0000000098134e0] [c00000000019f2ec] > .__block_prepare_write+0xe4/0x4f0 [c000000009813610] [c00000000019f94c] > .block_write_begin_newtrunc+0xa8/0x120 [c0000000098136d0] > [c00000000019fea0] .block_write_begin+0x34/0x8c [c000000009813770] > [c00000000022b458] .ext3_write_begin+0x13c/0x298 [c000000009813880] > [c000000000117500] .generic_file_buffered_write+0x13c/0x320 > [c0000000098139b0] [c000000000119c80] > .__generic_file_aio_write+0x378/0x3dc [c000000009813ab0] > [c000000000119d68] .generic_file_aio_write+0x84/0xfc [c000000009813b60] > [c00000000016e460] .do_sync_write+0xac/0x10c > [c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc > [c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0 > [c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40 > Instruction dump: > 38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14 > 7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378 > ---[ end trace f24cb0cb5729d2bb ]--- > > And few more of these. Previous snapshot release > 2.6.35-rc1-git5(6c5de280b6...) was good. > > Thanks > -Sachin > -- Maciej Rutecki http://www.maciek.unixy.pl -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
From: KAMEZAWA Hiroyuki on 10 Jun 2010 21:40
On Thu, 10 Jun 2010 22:00:57 +0200 Maciej Rutecki <maciej.rutecki(a)gmail.com> wrote: > I created a Bugzilla entry at > https://bugzilla.kernel.org/show_bug.cgi?id=16178 > for your bug report, please add your address to the CC list in there, thanks! > Hmm... It seems a panic in SLUB or SLAB. Is .config available ? -Kame > On niedziela, 6 czerwca 2010 o 17:06:54 Sachin Sant wrote: > > While executing LTP Controller tests(memcg regression) on > > a POWER6 box came across this following OOPS. > > > > Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a > > child Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, > > file-rss:1088kB Memory cgroup out of memory: kill process 9140 > > (memcg_test_1) score 3 or a child Killed process 9140 (memcg_test_1) > > vsz:3456kB, anon-rss:448kB, file-rss:1088kB Unable to handle kernel paging > > request for data at address 0x720072007200720 Faulting instruction > > address: 0xc00000000015b778 > > Oops: Kernel access of bad area, sig: 11 [#2] > > SMP NR_CPUS=1024 NUMA pSeries > > last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map > > Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom > > sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: > > c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000 > > REGS: c000000009812ff0 TRAP: 0300 Tainted: G D > > (2.6.35-rc2-autotest) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424 > > XER: 00000001 > > DAR: 0720072007200720, DSISR: 0000000040000000 > > TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0 > > GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000 > > GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870 > > GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000 > > GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000 > > GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027 > > GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138 > > GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000 > > GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720 > > NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c > > LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c > > Call Trace: > > [c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c > > (unreliable) [c000000009813310] [c00000000019bafc] > > .alloc_buffer_head+0x2c/0x78 [c000000009813390] [c00000000019c99c] > > .alloc_page_buffers+0x60/0x114 [c000000009813450] [c00000000019ca78] > > .create_empty_buffers+0x28/0x140 [c0000000098134e0] [c00000000019f2ec] > > .__block_prepare_write+0xe4/0x4f0 [c000000009813610] [c00000000019f94c] > > .block_write_begin_newtrunc+0xa8/0x120 [c0000000098136d0] > > [c00000000019fea0] .block_write_begin+0x34/0x8c [c000000009813770] > > [c00000000022b458] .ext3_write_begin+0x13c/0x298 [c000000009813880] > > [c000000000117500] .generic_file_buffered_write+0x13c/0x320 > > [c0000000098139b0] [c000000000119c80] > > .__generic_file_aio_write+0x378/0x3dc [c000000009813ab0] > > [c000000000119d68] .generic_file_aio_write+0x84/0xfc [c000000009813b60] > > [c00000000016e460] .do_sync_write+0xac/0x10c > > [c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc > > [c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0 > > [c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40 > > Instruction dump: > > 38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14 > > 7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378 > > ---[ end trace f24cb0cb5729d2bb ]--- > > > > And few more of these. Previous snapshot release > > 2.6.35-rc1-git5(6c5de280b6...) was good. > > > > Thanks > > -Sachin > > > > -- > Maciej Rutecki > http://www.maciek.unixy.pl > > -- > To unsubscribe, send a message with 'unsubscribe linux-mm' in > the body to majordomo(a)kvack.org. For more info on Linux MM, > see: http://www.linux-mm.org/ . > Don't email: <a href=mailto:"dont(a)kvack.org"> email(a)kvack.org </a> > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo(a)vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ |