From: Davide Ferrari on 7 Apr 2010 06:00

Hi,

I've just installed some new webservers (Apache + mod_php5) on Debian Lenny, and on some machines I see this message appearing from time to time in the kernel log:

[ 840.319551] INFO: task apache2:7601 blocked for more than 120 seconds.
[ 840.336259] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 840.353598] apache2 D 0000000000000000 0 7601 6041 0x00000000
[ 840.353602] ffffffff814451f0 0000000000000082 0000000000000000 0000000000000000
[ 840.353605] 0000000000000000 000000000000f8a0 ffff8802126bffd8 00000000000155c0
[ 840.353608] 00000000000155c0 ffff88021158b880 ffff88021158bb78 00000000126bfe24
[ 840.353611] Call Trace:
[ 840.353620] [<ffffffff8110bb35>] ? __getblk+0x26/0x29a
[ 840.353623] [<ffffffff8110c57f>] ? sync_buffer+0x0/0x40
[ 840.353629] [<ffffffff812edf7d>] ? io_schedule+0x73/0xb7
[ 840.353631] [<ffffffff8110c5ba>] ? sync_buffer+0x3b/0x40
[ 840.353633] [<ffffffff812ee380>] ? __wait_on_bit_lock+0x3f/0x84
[ 840.353636] [<ffffffff8110c57f>] ? sync_buffer+0x0/0x40
[ 840.353638] [<ffffffff812ee430>] ? out_of_line_wait_on_bit_lock+0x6b/0x77
[ 840.353642] [<ffffffff81064a64>] ? wake_bit_function+0x0/0x23
[ 840.353652] [<ffffffffa0104657>] ? journal_invalidatepage+0x8e/0x282 [jbd]
[ 840.353657] [<ffffffff810bbd4a>] ? truncate_inode_page+0x45/0x84
[ 840.353660] [<ffffffff810bbe33>] ? truncate_inode_pages_range+0xaa/0x2b0
[ 840.353666] [<ffffffffa01262d6>] ? __ext3_journal_dirty_metadata+0x1e/0x46 [ext3]
[ 840.353669] [<ffffffff812ee7b5>] ? mutex_lock+0xd/0x31
[ 840.353674] [<ffffffff8111513a>] ? fsnotify_clear_marks_by_inode+0x20/0xcd
[ 840.353679] [<ffffffffa011ccd0>] ? ext3_delete_inode+0x0/0xd0 [ext3]
[ 840.353683] [<ffffffffa011cce7>] ? ext3_delete_inode+0x17/0xd0 [ext3]
[ 840.353687] [<ffffffffa011ccd0>] ? ext3_delete_inode+0x0/0xd0 [ext3]
[ 840.353692] [<ffffffff810fee24>] ? generic_delete_inode+0xdc/0x168
[ 840.353695] [<ffffffff810f7c4a>] ? do_unlinkat+0xe2/0x134
[ 840.353700] [<ffffffff81010b42>] ? system_call_fastpath+0x16/0x1b

Moreover, this server reaches incredibly high load values (> 150), because almost every Apache process is waiting on I/O. It happens with both the Lenny (backports) kernels, 2.6.30 and 2.6.32.

I've seen a similar problem here http://lkml.org/lkml/2009/1/11/70 and the patch is already applied in my kernel.

Whichever elevator I use (CFQ, deadline), the CPUs are almost always in high iowait (70-80%). The machine has 8 cores with Hyper-Threading and a single SATA disk. It doesn't generate much disk activity (about 200 MB of logs per day, which is no big deal).

Any ideas? Do you need more info?

Thanks in advance

PS: Please CC me when replying, because I'm not subscribed to the LKML.

--
Davide Ferrari
Atrapalo.com System Administrator
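
The hung-task reports in a log like the one above can be pulled out mechanically, which helps when comparing several machines. The following is only a minimal sketch in Python, assuming the message format shown in this report (`INFO: task <name>:<pid> blocked for more than <N> seconds.`); the names `HungTask` and `parse_hung_tasks` are hypothetical and not part of any existing tool.

```python
import re
from typing import List, NamedTuple


class HungTask(NamedTuple):
    """One hung-task report (hypothetical record type for this sketch)."""
    uptime: float       # seconds since boot, taken from the "[ ... ]" prefix
    comm: str           # task name, e.g. "apache2"
    pid: int
    blocked_secs: int   # the timeout threshold quoted in the message


# Assumed format, as seen in the report above:
# [ 840.319551] INFO: task apache2:7601 blocked for more than 120 seconds.
_HUNG_RE = re.compile(
    r"\[\s*(\d+\.\d+)\]\s+INFO: task (\S.*?):(\d+) "
    r"blocked for more than (\d+) seconds"
)


def parse_hung_tasks(log_text: str) -> List[HungTask]:
    """Return every hung-task report found in the given kernel log text."""
    return [
        HungTask(float(m.group(1)), m.group(2), int(m.group(3)), int(m.group(4)))
        for m in _HUNG_RE.finditer(log_text)
    ]
```

Feeding it the saved output of `dmesg` gives one record per report, so the affected PIDs and timestamps can be listed or counted per host.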